Image Title

Search Results for Norris:

Jack Norris, MapR - Spark Summit East 2016 #SparkSummit #theCUBE


 

>>From New York expecting the signal to nine. It's the cube covering sparks summit east brought to you by spark summit. Now your hosts, Dave Volante and George Gilbert >>Right here in Midtown at the Hilton hotel. This has sparked somebody and this is the cube. The cube goes out to the events. We extract the signal from the noise. Jack Norris is here. He's the CMO of Mapbox, long time cube, alum jackets. It's great to see you again. Hey, if you've been here since the beginning of this whole big data >>Meme and it might've started here, I don't know. I think we've yeah, >>I think you're right. I mean, it really did start it. I think in this building, it was our first big data show at the original, you know, uh, uh, Hadoop world. And, uh, and you guys, like I say, I've been there from the start. Uh, you were kind of impatient early on. You said, you know, we're just going to go build solutions and, uh, and ignore the noise and you built a really nice, nice business. Um, you guys have been growing, you're growing your Salesforce and, uh, and things are good and all of a sudden, boom, the spark thing comes in. So we're seeing the evolution. I remember saying to George and the early days of a dupe, we were geeking out talking to all the bits and bytes and then it turned into a business discussion. It's like we're back to the hardcore bits and bites. So give us the update from Matt bar's point of view, where are we in the whole big data space? >>Well, I think, um, I think it has transitioned. I mean, uh, if you look at the typical large fortune company, the web to Datto's, it's really, how do we best leverage our data and how do we leverage our data in that we can, we can make decisions much faster, right? That high-frequency decision-making process. Um, and typically that involves taking production data and analytics and joining them together so that you're actually impacting business as it happens and to do that effectively requires, um, innovations. So the exciting thing about spark is taking and, uh, and having a distributed compute engine, it's much easier to develop and, uh, in much faster. >>So in the remember the early days we'd be at these shows and the big question was, you know, can you take the humans out of the equation? It's like, no, no humans are the last mile. Um, is that, is that changing or would we still need that human interaction or, >>Um, humans are important part of the process, but increasingly if you can adjust and make, you know, small algorithmic decisions, um, and, and make those decisions at that kind of moment of truth, you got big impact, and I'll give you a few examples. So, um, ad platforms, you know, Rubicon project over a hundred billion ad auctions a day, you know, humans, part of that process in terms of setting that up and reviewing the process, but each, you know, each supply and demand decision, there is an automated decision optimizing that has a huge impact on the bottom line, um, fraud, uh, you know, credit card swiping that transaction and deciding is this fraudulent or not avoiding false positives, et cetera, a big leveraged item. So we're seeing things like that across manufacturing, across retail healthcare. And, um, it isn't about asking bigger questions or doing reports and looking back at, you know, what happened last week. It's more, how can I have an infrastructure in place that allows this organization to be agile? Because it's not the companies with the most data that's going to win. It's the companies that are the most agile and making intelligent. >>So it's so much data. Humans can ingest it any faster. I mean, we just, we can't keep up. So the world needs data scientists that needs trained developers. You've got some news I want to talk about on the training side, but even that we can only throw so many bodies at the problem. So it's really software. That's going to allow us to scale it. Software's hard. Software takes time. So we've seen a lot of the spend in the analytics, big data world on, on services. And obviously you guys and others have been working hard to shift it towards software. I want to come back to that training issue. We heard this morning about, uh, Databricks launched a move. They trained 20,000 people. That's a lot, but still long way to go. You guys are putting some investment into training. Talk about that news. Yeah. >>Yeah. Um, well it starts at the underlying software. If you can do things in the platform to make it much easier and do things that are hard to surround with services, like, uh, data protection, right? If you've lost data, it doesn't matter how many people you throw at it, you can't recover it. Right. So that's kind of the starting point you're gonna get fired. >>The, the, uh, the approach we've taken is, is to take, uh, a software product approach to the training as well. So we rolled out on demand training. So it's free, it's on demand. You work at your own pace. It's got different modules, there's some training associated with that, or some hands-on labs, if you will. Um, we launched that last January. So it's basically coming up the year anniversary. We recently celebrated, we trained 50,000 people, uh, on, on Hadoop and big data. Um, today we're announcing expansion on spark classes. We've got full curriculum around spark, including a certification. So you can get sparked certification through this, this map, our on demand training. Okay. >>Gotcha. You said something really, really intriguing that I want to dive into a little bit is where we were talking about the small decisions that can be made really, really fast for that a human in the loop human might have to train them, but it at runtime now where you said, it's not about asking bigger questions, it's finding faster answers, um, what had to change in your platform or in the underlying technology to make that possible. >>You know, um, there's a lot that into it. It's typically a series of functions, uh, a kind of breadth that needs to be brought to the problem as well as squeezing out latencies. So instead of, um, the traditional approach, which is different applications and different analytic techniques dictate a separate silo, a separate, you know, scheme of data. And you've got those all around the organization and data kind of travels, and you get an answer at the end of some period of time. Uh, it's converging that altogether into a single platform, squeezing out those latencies so that you can have an informed action at the speed of business, if you will. And, >>Um, let's say spark never came along. Would that be possible? >>Yes. Yes. Would you, how would you, so if you look at kind of the different architectures that are out there, there's typically deep analytics in terms of, you know, let's go look at the trends, you know, the last seven years, what happened. And then look, let's look at, um, doing actions on a streaming set, say for instance, storm, and then let's do a real time database operations. So you could do that with, with HBase or map RDB and all of that together. What spark has really done is made that whole development process just much easier and much more streamlined. And that's where a lot of the excitements happen. >>So you mentioned earlier, um, to, to use cases, ad tech and fraud detection. Um, and I want to ask you about those in the state of those. So ad tech obviously has come a long way, but it's still got a ways to go. I mean, you look at, I mean, who's making money on ads. Obviously Google will make tons of money. Everybody else is sorta chasing them Facebook making money. It's probably cause they didn't let Google in. Okay. So how will spark affect sort of that business? Uh, and, and what's map, R's sort of role in evolving that, you know, to the next level. >>So, so, um, there's, there's different kind of compute and the types of things you can do, um, on the data. I think increasingly we're seeing the kind of streaming analytics and making those decisions as the data arrives, right. And then there's the whole ecosystem in terms of how do you coordinate those flows of data? It's not just a simple, here's the origin, here's the destination. There's typically a complex data flow. Um, that's where we've kind of focused on map our streams, this huge publish and subscribe infrastructure so that you can get real-time data to the appropriate location and then do the right operations, a lot of that involved with spark, but not exclusively. >>Okay. And then on fraud detection, um, obviously come a long way. Sampling could have died. Yes. And now, but now we're getting too many false positives. You get the call and, you know, I mean, I get a lot of calls because we can buy so much equipment, but, um, but now what about the next level? What are you guys doing to take fraud detection to the next level? So that when I get on the plane in Boston and I land in London, it knows, um, is that a database problem? Is it an integration problem, a systems problem, and how, what role you guys play in solving that? >>Well, there's, there's, um, you know, there's, there's a lot of details and techniques that probably go, um, beyond, you know, what, what we'll share publicly or what are our customers talk about publicly? I think in general, it's the more data that you can apply to a problem. The more context, the better off you are, that's the way I kind of summarize it so that instead of a sampling or instead of a boy, that's a strange purchase over there, it's understanding, well, this is Dave Valenti and this is the full body of, of, uh, expenditures he's done, then the types of things and here's who he frequently purchases from. And here's kind of a transaction trend started in San Francisco, went to New York, et cetera. So in context it would make more sense. So >>Part of that is more data. And the other part of that is just better algorithms and better, better learnings and applying that on a continuous basis. How are your customers dealing with that, that constraint? I mean, if they got a, a hundred dollars to spend, yeah. They can only spend so much on, on each of those gathering more data, cleaning the data, they spent so much time getting it ready versus making their machine learning algorithms or whatever the other techniques to do. What are you seeing there as sort of best practice? It was probably varies. I'm sure, but give us some color on it. >>Um, I'll actually go back to Google and Google a letter last round, um, you know, excellent, excellent insights coming from Google. They wrote a paper called the unreasonable effectiveness of data and in it, they basically squarely addressed that problem. And given the choice to invest in either the complex model and algorithm or put more data at it, putting more data, had a huge impact. And, um, you know, my simple explanation is if you're sampling the data, you have to have a model that tries to recreate reality. If you're looking at all of the data, then the anomalies can, can pop up and be more apparent. And, um, the more context you can bring, the more data from other sources. So you get around, you know, a better picture of what's happening, the better off you are. And so that requires scale. It requires speed and requires different techniques that can be brought to bear, right? The database operation, here's a streaming operation, here's a deep, you know, file machine learning algorithm. >>So there's a lot of vendors in the sort of big data ecosystem are coming at spark from different angles and, um, are, are trying to add value to it and sort of bathe themselves in sort of the halo. Yep. Now you guys took some time upfront to build a converged platform so that you weren't trying to wrap your arms around 37 different projects. Can you tell us how having perhaps not anticipated spark how this converts platform allows you to add more value to it than other approaches? >>So, so we simplify, if you look at the Hadoop ecosystem, it's basically separated into the components for compute and management on top of the data layer, right? The Hadoop distributed file system. So how do you scale data? How do you protect it? It's very simply what's going on. Spark really does a great job at that top layer. Doesn't do anything about defining the underlying storage layer in the Hadoop community that underlying storage layer is a batch system. So you're trying to do, you know, micro batch kind of streaming operations on top of batch oriented data. What we addressed was to take that whole data layer, make it real time, make it random. Read-write converge enterprise storage together with Hadoop support and spark support on a single platform. And that's basically >>With the difference and to make an enterprise great. You guys were really the first to lead the lecture. You were, everybody started talking about attic price straight after you were kind of delivering it. So you've had a lead there. Do you feel like you still have a lead there, or is that the kind of thing where you sort of hit the top of the S-curve and start innovating elsewhere? >>NC state did a study, uh, just this past year, a recent study identified that only 25% of data corruption issues are identified and properly handled by the Hadoop distributed file system. 42% of those are silent. So there's a huge gap in terms of quote unquote enterprise grade features and what we think. >>Yes, silent data corruption has been a problem for decades now. And you're saying it's no different in the duke ecosystem, especially as, as mainstream businesses start to, uh, to adopt this what's happening in the valley. Uh, we're seeing, you know, in the wall street journal every day you read about down rounds, flat rounds, people can't get B rounds. Uh, you guys are funded, you know, you're growing, you're talking about investments, you know, what do you see? Do you, do you feel like you're achieving escape velocity? Um, maybe give us sort of an update on, uh, the state of the business. >>Yeah. I, I think the state of the business is best represented by the customers, right? And the customers kind of vote, right. They vote in terms of, you know, how well is this technology driving their business? So we've got a recent study, um, that kind of shows the, the returns that customers, um, are getting, uh, we've got a 1% chance, a 99% retention rate with our customers. We've got, uh, an expansion rate. That's, that's unbelievable. We've got multi-million dollar customers in, uh, in seven of the top verticals and nine out of the top $10 million customers. So we're seeing significant investments and more importantly, significant returns on the part of customers where they're not just doing a single application on the platform, but multiple >>Applications, Jack Norris map are always focused. Always a pleasure having you on the cube. Thanks very much for coming on. Appreciate it. Keep right there, buddy. We'll be back with our next guest is the cube we're live from spark somebody's right back. Okay.

Published Date : Feb 17 2016

SUMMARY :

covering sparks summit east brought to you by spark summit. It's great to see you again. I think we've yeah, You said, you know, we're just going to go build solutions and, if you look at the typical large fortune company, So in the remember the early days we'd be at these shows and the big question was, you know, and reviewing the process, but each, you know, each supply and demand decision, And obviously you guys and others have been working hard to shift it towards software. If you can do things in the platform to make it much easier and do things that are hard to surround So you can get sparked certification through really fast for that a human in the loop human might have to train them, but it at runtime around the organization and data kind of travels, and you get an answer at the end of some period Would that be possible? let's go look at the trends, you know, the last seven years, what happened. So you mentioned earlier, um, to, to use cases, ad tech and fraud detection. so that you can get real-time data to the appropriate location and then do the right operations, You get the call and, you know, I mean, I get a lot of calls because we can buy so much equipment, but, The more context, the better off you are, that's the way I kind of summarize What are you seeing there as sort of best practice? um, you know, my simple explanation is if you're sampling the data, this converts platform allows you to add more value to it than other approaches? So how do you scale data? You were, everybody started talking about attic price straight after you were kind of delivering it. and properly handled by the Hadoop distributed file system. you know, in the wall street journal every day you read about down rounds, flat rounds, people can't get B rounds. They vote in terms of, you know, Always a pleasure having you on the cube.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave ValentiPERSON

0.99+

Jack NorrisPERSON

0.99+

Dave VolantePERSON

0.99+

New YorkLOCATION

0.99+

LondonLOCATION

0.99+

GeorgePERSON

0.99+

San FranciscoLOCATION

0.99+

BostonLOCATION

0.99+

George GilbertPERSON

0.99+

99%QUANTITY

0.99+

GoogleORGANIZATION

0.99+

42%QUANTITY

0.99+

FacebookORGANIZATION

0.99+

DatabricksORGANIZATION

0.99+

50,000 peopleQUANTITY

0.99+

nineQUANTITY

0.99+

20,000 peopleQUANTITY

0.99+

last weekDATE

0.99+

DattoORGANIZATION

0.99+

last JanuaryDATE

0.99+

$10 millionQUANTITY

0.98+

sevenQUANTITY

0.98+

eachQUANTITY

0.98+

firstQUANTITY

0.98+

MapboxORGANIZATION

0.98+

todayDATE

0.97+

1%QUANTITY

0.97+

HadoopTITLE

0.97+

MattPERSON

0.96+

single platformQUANTITY

0.96+

NCORGANIZATION

0.95+

this morningDATE

0.95+

single applicationQUANTITY

0.94+

25%QUANTITY

0.94+

MidtownLOCATION

0.93+

first bigQUANTITY

0.92+

RubiconORGANIZATION

0.92+

37 different projectsQUANTITY

0.92+

last seven yearsDATE

0.89+

over a hundred billion ad auctions a dayQUANTITY

0.88+

this past yearDATE

0.86+

sparkORGANIZATION

0.85+

multi-million dollarQUANTITY

0.84+

decadesQUANTITY

0.83+

a hundred dollarsQUANTITY

0.79+

data corruptionQUANTITY

0.7+

HBaseTITLE

0.67+

HiltonORGANIZATION

0.67+

RDBTITLE

0.64+

SparkORGANIZATION

0.57+

MapRORGANIZATION

0.57+

mapTITLE

0.57+

SalesforceORGANIZATION

0.53+

2016EVENT

0.51+

- Spark SummitEVENT

0.46+

EastLOCATION

0.42+

Jack Norris - Hadoop on the Hudson - theCUBE


 

>>Live from New York city. It's cute. here's your host? Jeff Frick. >>Hi, Jeff Frick here with the Q we're on the ground at the USS Intrepid at the Hadoop on the Hudson party put on by Matt BARR. It's uh, I think it's the party of the night tonight here in big data week, New York city with strata cough, a dupe world, big data NYC. So Jack a great >>Venue. Yeah, it's excellent. Here. >>The place is filled. I'm just struck by the technology. There's a Gemini capsule over there, about 50 years old. It's about the size of a Volkswagen, I think would be much bigger. And to think that those guys went up into space with probably less technology than is on your four year old flip phone. Amazing. Yeah. >>Not, not much data at all. No. If >>You look at it, just kind of get that bounce on the gravity thing, which I never quite understood. So talk about you guys had some big news today. Once you give us a rundown on some of the announcements, >>We had two big announcements. One was incorporating the map RDB and our community edition that came out. We also reported results from our customers where the majority of customers reported less than a 12 month payback, uh, 65% of five X or greater return and 40%, 10 X or greater. And that included a subset of those customers that had experienced with other distributions. So kind of a Testament to when you get serious about Hadoop, you get serious with Mapbox >>And when they're getting those return on investments, we're always trying to explore where's the big, the big ROI, because it's really in value that's released for the customer. It's not necessarily because it's a cheaper way to do it, >>Right? So, so there are some costs that 63% was cost reduction that was driving it about 41% were top-line revenue projects. And about 23% were related to risk reduction and risk mitigation. And if you add those up, it's greater than a hundred percent because of many customers that are doing multiple applications. >>Great. So you've been coming to Hadoop world for longer than you would admit to me before we came on camera and, and the baseball playoffs are going on right now. I mean, we like to talk in sports analogy. So kind of where are we in, in kind of what inning are we in this adoption of big data and the duke specifically >>Early, early innings. Um, but, uh, what we've seen is the bases are loaded and we're up >>And it's it. And it seems to be we're way past now the POC stage. Now we're really getting in there for that. >>And the, the customer announcement, we did kind of shows how people are hitting it out of the park with Hadoop. And a lot of that is by impacting the operations, impacting the business as it happens. And that's coupling analytics plus this higher arrival rate data from a variety of sources and making adjustments so that you can impact revenue as businesses happening. You can mitigate risk as it's happening. It's not just reporting, looking back >>Function. Right, right. It's being able to react in real time, which is defined by, in time to do something about it. Right. Exactly. All right. Well, thanks for hosting a great party, Jack Norris. Here we are on the ground, uh, at the USS Intrepid at the Hadoop on the Hudson. Uh, uh, if you take a nice picture, tweet that in. I think they got some prizes. Hadoop Hudson is a hashtag Jeff Frick on the ground. You're watching the cube. Thanks. Big ship.

Published Date : Oct 22 2014

SUMMARY :

It's cute. It's uh, I think it's the party of the night tonight here And to think that those guys went up into space with probably less technology than is on your four Not, not much data at all. You look at it, just kind of get that bounce on the gravity thing, which I never quite understood. So kind of a Testament to when you get serious about Hadoop, And when they're getting those return on investments, we're always trying to explore where's the big, And if you add those up, it's greater than a hundred percent because of many customers that are doing multiple applications. So kind of where are we in, Um, but, uh, what we've seen is the bases are loaded and we're up And it seems to be we're way past now the POC stage. And a lot of that is by impacting the operations, It's being able to react in real time, which is defined by,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeff FrickPERSON

0.99+

40%QUANTITY

0.99+

Jack NorrisPERSON

0.99+

Matt BARRPERSON

0.99+

65%QUANTITY

0.99+

63%QUANTITY

0.99+

OneQUANTITY

0.99+

10 XQUANTITY

0.99+

New York cityLOCATION

0.99+

NYCLOCATION

0.99+

todayDATE

0.99+

greater than a hundred percentQUANTITY

0.99+

about 23%QUANTITY

0.99+

VolkswagenORGANIZATION

0.98+

two big announcementsQUANTITY

0.98+

JackPERSON

0.98+

about 41%QUANTITY

0.98+

five XQUANTITY

0.98+

about 50 years oldQUANTITY

0.94+

MapboxORGANIZATION

0.93+

HadoopTITLE

0.93+

tonightDATE

0.91+

less than a 12 monthQUANTITY

0.91+

HudsonLOCATION

0.87+

HadoopLOCATION

0.86+

four year oldQUANTITY

0.83+

Hadoop onLOCATION

0.78+

USS IntrepidORGANIZATION

0.76+

map RDBTITLE

0.68+

Hadoop HudsonTITLE

0.68+

GeminiCOMMERCIAL_ITEM

0.53+

someQUANTITY

0.5+

Hadoop on theTITLE

0.5+

Jack Norris - Hadoop Summit 2014 - theCUBE - #HadoopSummit


 

>>The queue at Hadoop summit, 2014 is brought to you by anchor sponsor Hortonworks. We do, I do. And headline sponsor when disco we make Hadoop invincible >>Okay. Welcome back. Everyone live here in Silicon valley in San Jose. This is a dupe summit. This is Silicon angle and Wiki bonds. The cube is our flagship program. We go out to the events and extract the signal to noise. I'm John barrier, the founder SiliconANGLE joins my cohost, Jeff Kelly, top big data analyst in the, in the community. Our next guest, Jack Norris, COO of map R security enterprise. That's the buzz of the show and it was the buzz of OpenStack summit. Another open source show. And here this year, you're just seeing move after, move at the moon, talking about a couple of critical issues. Enterprise grade Hadoop, Hortonworks announced a big acquisition when all in, as they said, and now cloud era follows suit with their news. Today, I, you sitting back saying, they're catching up to you guys. I mean, how do you look at that? I mean, cause you guys have that's the security stuff nailed down. So what Dan, >>You feel about that now? I think I'm, if you look at the kind of Hadoop market, it's definitely moving from a test experimental phase into a production phase. We've got tremendous customers across verticals that are doing some really interesting production use cases. And we recognized very early on that to really meet the needs of customers required some architectural innovation. So combining the open source ecosystem packages with some innovations underneath to really deliver high availability, data protection, disaster recovery features, security is part of that. But if you can't predict the PR protect the data, if you can't have multitenancy and separate workflows across the cluster, then it doesn't matter how secure it is. You know, you need those. >>I got to ask you a direct question since we're here at Hadoop summit, because we get this question all the time. Silicon lucky bond is so successful, but I just don't understand your business model without plates were free content and they have some underwriters. So you guys have been very successful yet. People aren't looking at map are as good at the quiet leader, like you doing your business, you're making money. Jeff. He had some numbers with us that in the Hindu community, about 20% are paying subscriptions. That's unlike your business model. So explain to the folks out there, the business model and specifically the traction because you have >>Customers. Yeah. Oh no, we've got, we've got over 500 paying customers. We've got at least $1 million customer in seven different verticals. So we've got breadth and depth and our business model is simple. We're an enterprise software company. That's looking at how to provide the best of open source as well as innovations underneath >>The most open distribution of Hadoop. But you add that value separately to that, right? So you're, it's not so much that you're proprietary at all. Right. Okay. >>You clarify that. Right. So if you look at, at this exciting ecosystem, Hadoop is fairly early in its life cycle. If it's a commoditization phase like Linux or, or relational database with my SQL open source, kind of equates the whole technology here at the beginning of this life cycle, early stages of the life cycle. There's some architectural innovations that are really required. If you look at Hadoop, it's an append only file system relying on Linux. And that really limits the types of operations. That types of use cases that you can do. What map ours done is provide some deep architectural innovations, provide complete read-write file systems to integrate data protection with snapshots and mirroring, et cetera. So there's a whole host of capabilities that make it easy to integrate enterprise secure and, and scale much better. Do you think, >>I feel like you were maybe a little early to the market in the sense that we heard Merv Adrian and his keynote this morning. Talk about, you know, it's about 10 years when you start to get these questions about security and governance and we're about nine years into Hadoop. Do you feel like maybe you guys were a little early and now you're at a tipping point, whereas these more, as more and more deployments get ready to go to production, this is going to be an area that's going to become increasingly important. >>I think, I think our timing has been spectacular because we, we kind of came out at a time when there was some customers that were really serious about Hadoop. We were able to work closely with them and prove our technology. And now as the market is just ramping, we're here with all of those features that they need. And what's a, what's an issue. Is that an incremental improvement to provide those kind of key features is not really possible if the underlying architecture isn't there and it's hard to provide, you know, online real-time capabilities in a underlying platform that's append only. So the, the HDFS layer written in Java, relying on the Linux file system is kind of the, the weak underbelly, if you will, of, of the ecosystem. There's a lot of, a lot of important developments happening yarn on top of it, a lot of really kind of exciting things. So we're actively participating in including Apache drill and on top of a complete read-write file system and integrated Hindu database. It just makes it all come to life. >>Yeah. I mean, those things on top are critical, but you know, it's, it's the underlying infrastructure that, you know, we asked, we keep on community about that. And what's the, what are the things that are really holding you back from Paducah and production and the, and the biggest challenge is they cited worth high availability, backup, and recovery and maintaining performance at scale. Those are the top three and that's kind of where Matt BARR has been focused, you know, since day one. >>So if you look at a major retailer, 2000 nodes and map bar 50 unique applications running on a single cluster on 10,000 jobs a day running on top of that, if you look at the Rubicon project, they recently went public a hundred million add actions, a hundred billion ad auctions a day. And on top of that platform, beats music that just got acquired for $3 billion. Basically it's the underlying map, our engine that allowed them to scale and personalize that music service. So there's a, there's a lot of proof points in terms of how quickly we scale the enterprise grade features that we provide and kind of the blending of deep predictive analytics in a batch environment with online capabilities. >>So I got to ask you about your go to market. I'll see Cloudera and Hortonworks have different business models. Just talk about that, but Cloudera got the massive funding. So you get this question all the time. What do you, how do you counter that army and the arms race? I think >>I just wrote an article in Forbes and he says cash is not a strategy. And I think that was, that was an excellent, excellent article. And he goes in and, you know, in this fast growing market, you know, an amount of money isn't necessarily translate to architectural innovations or speeding the development of that. This is a fairly fragmented ecosystem in terms of the stack that runs on top of it. There's no single application or single vendor that kind of drives value. So an acquisition strategy is >>So your field Salesforce has direct or indirect, both mixable. How do you handle the, because Cloudera has got feet on the street and every squirrel will find it, not if they're parked there, parking sales reps and SCS and all the enterprise accounts, you know, they're going to get the, squirrel's going to find a nut once in awhile. Yeah. And they're going to actually try to engage the clients. So, you know, I guess it is a strategy if they're deploying sales and marketing, right? So >>The beauty about that, and in fact, we're all in this together in terms of sharing an API and driving an ecosystem, it's not a fragmented market. You can start with one distribution and move to another, without recompiling or without doing any sort of changes. So it's a fairly open community. If this were a vendor lock-in or, you know, then spending money on brand, et cetera, would, would be important. Our focus is on the, so the sales execution of direct sales, yes, we have direct sales. We also have partners and it depends on the geographies as to what that percentage is. >>And John Schroeder on with the HP at fifth big data NYC has updated the HP relationship. >>Oh, excellent. In fact, we just launched our application gallery app gallery, make it very easy for administrators and developers and analysts to get access and understand what's available in the ecosystem. That's available directly on our website. And one of the featured applications there today is an integration with the map, our sandbox and HP Vertica. So you can get early access, try it and get the best of kind of enterprise grade SQL first, >>First Hadoop app store, basically. Yeah. If you want to call it that way. Right. So like >>Sure. Available, we launched with close to 30, 30 with, you know, a whole wave kind of following that. >>So talk a little bit about, you know, speaking of verdict and kind of the sequel on Hadoop. So, you know, there's a lot of talk about that. Some confusion about the different methods for applying SQL on predicts or map art takes an open approach. I know you'll support things like Impala from, from a competitor Cloudera, talk about that approach from a map arts perspective. >>So I guess our, our, our perspective is kind of unbiased open source. We don't try to pick and choose and dictate what's the right open source based on either our participation or some community involvement. And the reality is with multiple applications being run on the platform, there are different use cases that make difference, you know, make different sense. So whether it's a hive solution or, you know, drill drills available, or HP Vertica people have the choice. And it's part of, of a broad range of capabilities that you want to be able to run on the platform for your workflows, whether it's SQL access or a MapReduce or a spark framework shark, et cetera. >>So, yeah, I mean there is because there's so many different there's spark there's, you know, you can run HP Vertica, you've got Impala, you've got hive. And the stinger initiative is, is that whole kind of SQL on Hadoop ecosystem, still working itself out. Are we going to have this many options in a year or two years from now? Or are they complimentary and potentially, you know, each has its has its role. >>I think the major differences is kind of how it deals with the new data formats. Can it deal with self-describing data? Sources can leverage, Jason file does require a centralized metadata, and those are some of the perspectives and advantages say the Apache drill has to expand the data sets that are possible enabled data exploration without dependency on a, on an it administrator to define that, that metadata. >>So another, maybe not always as exciting, but taking workloads from existing systems, moving them to Hadoop is one of the ways that a lot of people get started with, to do whether associated transformation workloads or there's something in that vein. So I know you've announced a partnership with Syncsort and that's one of the things that they focus on is really making it as easy as possible to meet those. We'll talk a little bit about that partnership, why that makes sense for you and, and >>When your customer, I think it's a great proof point because we announced that partnership around mainframe offload, we have flipped comScore and experience in that, in that press release. And if you look at a workload on a mainframe going to duke, that that seems like that's a, that's really an oxymoron, but by having the capabilities that map R has and making that a system of record with that full high availability and that data protection, we're actually an option to offload from mainframe offload, from sand processing and provide a really cost effective, scalable alternative. And we've got customers that had, had tried to offload from the mainframe multiple times in the past, on successfully and have done it successfully with Mapbox. >>So talk a little bit more about kind of the broader partnership strategy. I mean, we're, we're here at Hadoop summit. Of course, Hortonworks talks a lot about their partnerships and kind of their reseller arrangements. Fedor. I seem to take a little bit more of a direct approach what's map R's approach to kind of partnering and, and as that relates to kind of resell arrangements and things like, >>I think the app gallery is probably a great proof point there. The strategy is, is an ecosystem approach. It's having a collection of tools and applications and management facilities as well as applications on top. So it's a very open strategy. We focus on making sure that we have open API APIs at that application layer, that it's very easy to get data in and out. And part of that architecture by presenting standard file system format, by allowing non Java applications to run directly on our platform to support standard database connections, ODBC, and JDBC, to provide database functionality. In addition to kind of this deep predictive analytics really it's about supporting the broadest set of applications on top of a single platform. What we're seeing in this kind of this, this modern architecture is data gravity matters. And the more processing you can do on a single platform, the better off you are, the more agile, the more competitive, right? >>So in terms of, so you're partnering with people like SAS, for example, to kind of bring some of the, some of the analytic capabilities into the platform. Can you kind of tell us a little bit about any >>Companies like SAS and revolution analytics and Skytree, and I mean, just a whole host of, of companies on the analytics side, as well as on the tools and visualization, et cetera. Yeah. >>Well, I mean, I, I bring up SAS because I think they, they get the fact that the, the whole data gravity situation is they've got it. They've got to go to where the data is and not have the data come to them. So, you know, I give them credit for kind of acknowledging that, that kind of big data truth ism, that it's >>All going to the data, not bringing the data >>To the computer. Jack talk about the success you had with the customers had some pretty impressive numbers talking about 500 customers, Merv agent. The garden was on with us earlier, essentially reiterating not mentioning that bar. He was just saying what you guys are doing is right where the puck is going. And some think the puck is not even there at the same rink, some other vendors. So I gotta give you props on that. So what I want you to talk about the success you have in specifically around where you're winning and where you're successful, you guys have struggled with, >>I need to improve on, yeah, there's a, there's a whole class of applications that I think Hadoop is enabling, which is about operations in analytics. It's taking this, this higher arrival rate machine generated data and doing analytics as it happens and then impacting the business. So whether it's fraud detection or recommendation engines, or, you know, supply chain applications using sensor data, it's happening very, very quickly. So a system that can tolerate and accept streaming data sources, it has real-time operations. That is 24 by seven and highly available is, is what really moves the needle. And that's the examples I used with, you know, add a Rubicon project and, you know, cable TV, >>The very outcome. What's the primary outcomes your clients want with your product? Is it stability? And the platform has enabled development. Is there a specific, is there an outcome that's consistent across all your wins? >>Well, the big picture, some of them are focused on revenues. Like how do we optimize revenue either? It's a new data source or it's a new application or it's existing application. We're exploding the dataset. Some of it's reducing costs. So they want to do things like a mainframe offload or data warehouse offload. And then there's some that are focused on risk mitigation. And if there's anything that they have in common it's, as they moved from kind of test and looked at production, it's the key capabilities that they have in enterprise systems today that they want to make sure they're in Hindu. So it's not, it's not anything new. It's just like, Hey, we've got SLS and I've got data protection policies, and I've got a disaster recovery procedure. And why can't I expect the same level of capabilities in Hindu that I have today in those other systems. >>It's a final question. Where are you guys heading this year? What's your key objectives. Obviously, you're getting these announcements as flurry of announcements, good success state of the company. How many employees were you guys at? Give us a quick update on the numbers. >>So, you know, we just reported this incredible momentum where we've tripled core growth year over year, we've added a tremendous amount of customers. We're over 500 now. So we're basically sticking to our knitting, focusing on the customers, elevating the proof points here. Some of the most significant customers we have in the telco and financial services and healthcare and, and retail area are, you know, view this as a strategic weapon view, this is a huge competitive advantage, and it's helping them impact their business. That's really spring our success. We've, you know, we're, we're growing at an incredible clip here and it's just, it's a great time to have made those calls and those investments early on and kind of reaping the benefits. >>It's. Now I've always said, when we, since the first Hadoop summit, when Hortonworks came out of Yahoo and this whole community kind of burst open, you had to duke world. Now Riley runs at it's a whole different vibe of itself. This was look at the developer vibe. So I got to ask you, and we would have been a big fan. I mean, everyone has enough beachhead to be successful, not about map arbors Hortonworks or cloud air. And this is why I always kind of smile when everyone goes, oh, Cloudera or Hortonworks. I mean, they're two different animals at this point. It would do different things. If you guys were over here, everyone has their quote, swim lanes or beachhead is not a lot of super competition. Do you think, or is it going to be this way for awhile? What's your fork at some? At what point do you see more competition? 10 years out? I mean, Merv was talking a 10 year horizon for innovation. >>I think that the more people learn and understand about Hadoop, the more they'll appreciate these kind of set of capabilities that matter in production and post-production, and it'll migrate earlier. And as we, you know, focus on more developer tools like our sandbox, so people can easily get experienced and understand kind of what map are, is. I think we'll start to see a lot more understanding and momentum. >>Awesome. Jack Norris here, inside the cube CMO, Matt BARR, a very successful enterprise grade, a duke player, a leader in the space. Thanks for coming on. We really appreciate it. Right back after the short break you're live in Silicon valley, I had dupe December, 2014, the right back.

Published Date : Jun 4 2014

SUMMARY :

The queue at Hadoop summit, 2014 is brought to you by anchor sponsor I mean, cause you guys have that's the security stuff nailed down. I think I'm, if you look at the kind of Hadoop market, I got to ask you a direct question since we're here at Hadoop summit, because we get this question all the time. That's looking at how to provide the best of open source But you add that value separately to So if you look at, at this exciting ecosystem, Talk about, you know, it's about 10 years when you start to get these questions about security and governance and we're about isn't there and it's hard to provide, you know, online real-time And what's the, what are the things that are really holding you back from Paducah So if you look at a major retailer, 2000 nodes and map bar 50 So I got to ask you about your go to market. you know, in this fast growing market, you know, an amount of money isn't necessarily all the enterprise accounts, you know, they're going to get the, squirrel's going to find a nut once in awhile. We also have partners and it depends on the geographies as to what that percentage So you can get early If you want to call it that way. a whole wave kind of following that. So talk a little bit about, you know, speaking of verdict and kind of the sequel on Hadoop. And it's part of, of a broad range of capabilities that you want So, yeah, I mean there is because there's so many different there's spark there's, you know, you can run HP Vertica, of the perspectives and advantages say the Apache drill has to expand the data sets why that makes sense for you and, and And if you look at a workload on a mainframe going to duke, So talk a little bit more about kind of the broader partnership strategy. And the more processing you can do on a single platform, the better off you are, Can you kind and I mean, just a whole host of, of companies on the analytics side, as well as on the tools So, you know, I give them credit for kind of acknowledging that, that kind of big data truth So what I want you to talk about the success you have in specifically around where you're winning and you know, add a Rubicon project and, you know, cable TV, And the platform has enabled development. the key capabilities that they have in enterprise systems today that they want to make sure they're in Hindu. Where are you guys heading this year? So, you know, we just reported this incredible momentum where we've tripled core and this whole community kind of burst open, you had to duke world. And as we, you know, focus on more developer tools like our sandbox, a duke player, a leader in the space.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeff KellyPERSON

0.99+

Jack NorrisPERSON

0.99+

John SchroederPERSON

0.99+

HPORGANIZATION

0.99+

JeffPERSON

0.99+

$3 billionQUANTITY

0.99+

December, 2014DATE

0.99+

JasonPERSON

0.99+

Matt BARRPERSON

0.99+

10,000 jobsQUANTITY

0.99+

TodayDATE

0.99+

10 yearQUANTITY

0.99+

SyncsortORGANIZATION

0.99+

DanPERSON

0.99+

Silicon valleyLOCATION

0.99+

John barrierPERSON

0.99+

JavaTITLE

0.99+

YahooORGANIZATION

0.99+

10 yearsQUANTITY

0.99+

24QUANTITY

0.99+

HadoopTITLE

0.99+

ClouderaORGANIZATION

0.99+

HortonworksORGANIZATION

0.99+

this yearDATE

0.99+

JackPERSON

0.99+

fifthQUANTITY

0.99+

LinuxTITLE

0.99+

SkytreeORGANIZATION

0.99+

eachQUANTITY

0.99+

bothQUANTITY

0.99+

todayDATE

0.98+

oneQUANTITY

0.98+

MervPERSON

0.98+

about 10 yearsQUANTITY

0.98+

San JoseLOCATION

0.98+

HadoopEVENT

0.98+

about 20%QUANTITY

0.97+

sevenQUANTITY

0.97+

over 500QUANTITY

0.97+

a yearQUANTITY

0.97+

about 500 customersQUANTITY

0.97+

SQLTITLE

0.97+

seven different verticalsQUANTITY

0.97+

two yearsQUANTITY

0.97+

single platformQUANTITY

0.96+

2014DATE

0.96+

ApacheORGANIZATION

0.96+

HadoopLOCATION

0.95+

SiliconANGLEORGANIZATION

0.94+

comScoreORGANIZATION

0.94+

single vendorQUANTITY

0.94+

day oneQUANTITY

0.94+

SalesforceORGANIZATION

0.93+

about nine yearsQUANTITY

0.93+

Hadoop Summit 2014EVENT

0.93+

MervORGANIZATION

0.93+

two different animalsQUANTITY

0.92+

single applicationQUANTITY

0.92+

top threeQUANTITY

0.89+

SASORGANIZATION

0.89+

RileyPERSON

0.88+

FirstQUANTITY

0.87+

ForbesTITLE

0.87+

single clusterQUANTITY

0.87+

MapboxORGANIZATION

0.87+

map RORGANIZATION

0.86+

mapORGANIZATION

0.86+

Jack Norris - BigDataNYC 2013 - theCUBE - #BigDataNYC


 

>>I from Midtown Manhattan, the cute quiet coverage of big data NYC Civicon angled, Wiki bonds production made possible by Hortonworks. We do hairdo and lamb disco and new made invincible. And now your hosts, John furrier and Volante >>Hi buddy. We're back. This is Dave Volante with Jeff Kelly with Wiki bond. And this is the cube Silicon angle's continuous production. We're here at big data NYC right across the street from the Hilton where strata comp and a dupe world is going on. We've got a multi-time cube guest, Jack Norris, the CMO of map bars here, Jack. Welcome back to the cube first. So by the way, thank you so much for the support. As you know, we're across the street here at the Warwick hotel map, our, you guys have always been so generous supporting the cube. We can't thank you enough for that. So really appreciate it. Thank you. So we were able to listen to your keynote yesterday. It was, we, we, we weren't broadcasting, you know, head to head yesterday and had an opportunity to hear your keynote. So, first of all, how did that go? I want to ask you some questions about it. >>It, it was a really well-received and I think people were kind of clamoring to try to separate the myths from, from reality on, on Hadoop, >>We had three myths that you talked about, you know, one related to the distraction. I'd like to get into some of those. So what was the, the first myth was around the, the, the, the district distribution battle. So take us through that. >>So, you know, th the impression that it's a knock-down drag-out competitive battle across Hadoop distributions was the first myth. And the reality is that all of the distribution share the same open source Apache code. And this is one of the first markets that's really, really created, or the first open-source technologies it's really created a market. I mean, look, what's happened here with this whole, this whole big data and Hadoop, but given that early stage, there's the requirement to really combine that open source code with additional innovations to meet customer needs. And so what you see is you see those aggregators that are taken open source, you see others that are taking the open source, and then adding maybe management utility, couple of, of, you know, different applications on top. And then our approach at map R is we're taking the open source with those management innovations, doing some development, the open source community with things like Apache drill, and then really focusing on the underlying architecture, the data platform and providing innovations at that layer. So >>Actually sort of the three major destroys that we talk about all the time. You know, you guys, Hortonworks and Hadoop, you guys have been consistent the whole time as has Hortonworks, right? Cloud era basically put out a post recently saying, Hey, kind of going in a different direction, sort of what I call the tapped out of the Hadoop distro, you know, piece of it. But so there's a lot of discussion around it. You're putting forth the, Hey, it's not an internet seen war, but does it matter is my question? >>Well, I think if you take a step back, the Hadoop ecosystem is incredibly strong growing very, very quickly, fastest growing big data technology, one of the top 10 technologies overall. And I think it's because we are sharing the same API. It is possible for customers to learn on one, develop and move seamlessly to another. And, you know, in the keynote, I talked about the difference between the no SQL market, which is, you know, there is no consensus there and, and customers have to figure out not only what's the right word workload, but what's the technology that's actually going to have some staying power, right? >>That's a powerful comment. Amazon turn the data center and into an API, or you as the duke community is essentially turning data, access into an API. And that is a very powerful and leverageable concept. Okay. Your second myth was around the whole, no SQL yes. Piece of it. You help you put up a slide. I thought I read Jeff Kelly's reports. And I thought, I thought I knew them all, but there were a couple in there that I didn't recognize as you probably knew them all, but so take us through myth. Number two >>Too. I'm sure we missed some >>There wasn't room on the slide for anymore. >>The, yeah, it's basically about the consensus. There is no real consensus. There's no common API. There's no ability to move applications seamlessly across no SQL solutions. If you look at one no SQL solution, and that's, HBase a big inherent advantage because it's integrated with Hindu, you know, this whole trend is about compute and data together. So if you've got a no sequel solution, that's on that same, you know, massive data store, you know, big leg up. And, and then we got into the, well, if you've got HBase, it's included in all the distributions and all the distribution share the same open source, then obviously it must run the same across all distributions. And there, we shared some pretty interesting data to show the difference. When you, when you do architectural differences and innovations underneath that you can dramatically change the performance of, of not only MapReduce, but of no SQL. Yes. >>Okay. So not all no SQL is created equally. Not all HBase is created equally as essentially what you're saying there. Now the third piece was to dupe is enterprise ready, right? Yeah. So you guys were first to say, well, we have a Hadoop platform that's enterprise ready way ahead on that. Got criticized a lot for going down that path shrugged and said, okay, we'll just keep doing business with customers. And you've been again, very clear and consistent on that. So talk about the third myth >>And that's, you know, is, is Hadoop ready for prime time? And I think the way to combat that myth is by customer examples and showing the tremendous success that customers are enjoying with Hadoop. And, you know, we, we don't have time on the cube here to go through all of them, but, you know, I like to point out 90 billion auctions a day with Rubicon, they've surpassed Google in terms of ad reach. They're doing that on Mapbox 1.7 trillion events a month with comScore that's on, on map bar. You look in, in traditional enterprise, you know, a single retailer with over 2000 nodes of Hadoop. I mean, it's a key part of their merchandising and retail operations, and combining all sorts of, of data feeds and all sorts of use cases there, financial services over a thousand nodes of risk medication, personalized offers streamlining their operations. I mean, it's, it's dramatic. And then, you know, we shared some of the more, more interesting ones, esoteric ones like garbage and whiskey and weather prediction. >>There was consider these, we even as diverse and eclectic as they are, they consider these mission critical application. >>Oh, absolutely. No it it's. And I think that's the difference because what we're talking about is not Hadoop as this cash, right? This temporary processing, where we can do, you know, some interesting batch analytics and then take that and put that someplace else. And yes, there are applications like that, but companies soon realized that if I'm going to use this as a key part of my operations, and it's about data on compute, then I want a consistent permanent store. I want a system of record. So all of the SLS and high availability and data protection features that they expect in their enterprise applications should be present in Hadoop, right? That's where we focus. Let's run down a couple of those. >>What are some of the key capabilities that you need in an enterprise enterprise grade platform? That map bar is >>Well, let's, let's take, let's take business continuity cause that's important if you're really going to trust data there. And you know, one of the big drivers as you expand data is how much am I going to spend on it? And if you look at a large investment bank, $270 million of their budget, not total, but incremental to address the additional capacity, there's a big emphasis for let's look at a better way to do that. So instead of spending $15,000 a terabyte, if you can spend a few hundred dollars a terabyte, that's a huge, huge advantage. And that's the focus of Hindu, but to do that, well, then the features that are in this enterprise storage have to be present. And we're talking about, you know, mirroring and not a copy table function, but replication, that's how that's how organizations do it, right. If you're going to recovery and recovery, you know, you can't back up a petabyte of information through a copy function, right? You have to do a snapshot and the snapshots have to be consistent, right. And, and we're not saying anything that, you know, an enterprise administrator doesn't know, there is some confusion when you're more on the developer side as to what these features are and the difference between a fuzzy snapshot and a point in time, consistent snaps. >>Got it. So let's talk a little bit about the, the enterprise data hub, this, this concept that Michael Wilson with clutter introduced yesterday. Tell us a little bit about your take on, on, on Mike's I guess, definition and, and essentially I think trying to name the category of kind of what Hadoop can do and what, and where it sits in the architecture. Did you agree with his, his, >>Yeah. I mean, if you look at, at that description, it's about I'm taking important data and I'm putting it in a dupe and I'm combining a lot of different data sources and it's been referred to as a data lake and a data reservoir and a data ocean. I mean, we've heard a lot of terms. We worked with an outside consultant that was originally an architect at Terre data. It's been about eight months, almost a year ago now where he defined it and enterprise data hub. And it's it's, he went through kind of the list of requirements. And once you move from a transitory to a permanent store, then that becomes an enterprise data hub. And an enterprise data hub can be used to select and process information, maybe it's ETL and serve some downstream applications. It can also be useful to do analysis directly on it, to, you know, to serve different business functions. But the system requirements that he established for that I think are absolutely true. And it's, you have to have the full data protection. You have to have the full disaster recovery. You have to have the full high availability because this is going to be important data serving the organization. If it's data that you can lose, if it's data that you, you don't really care about having highly available, then it's a very narrow use case that that data hub serves. >>So you're saying the enterprise data hub isn't ready for prime time. >>No, I'm saying that there, there are requirements. And we have companies today that have deployed an enterprise data hub and they are quite successful with it. And, you know, the quotes are the ETL functions that they're doing on that hub are 10 times faster and it's 10 times cheaper than what they're seeing. >>Soundbite, Dave, >>I agree, but it's nuanced. Right. And so, you know, the customers cause a lot of vendors, right? They're all saying the same thing to the customers, right? So you've got your messaging that you've, you know, you've proven out over the last several years and then the entire market starts to use the same terminology. So it is, this is why I, like, I think this, what is, what are those >>Things? We're in a little bit of this, this kind of marketing fog here in the relative early stages. I think the best response there is customer proof points. And I think some education in the very beginning, you know, when they're in development and test, it's really important to understand, you know, what is Hadoop and what can I use it for and what data source am I going to leverage? I think the features that we're talking about really start to show up as you deploy in production. And as you expand its use in production and there we've enjoyed tremendous success, >>But he would argue that you have a lead in this space. I wouldn't, I don't think you would either the space being robustness enterprise ready, mission criticality is your lead increasing, decreasing staying the same. >>What's your sense? Well, it's hard cause there's no, you know, th th there's no external service that's out there, you know, interviewing every customer and, and giving numbers. I do know that we passed 500 paying customers. I do know that we've got significant deployments and you can measure those in terms of number of nodes, you know, in the thousands of nodes, you can measure those in terms of use cases. So we've got, you know, one company they've passed 20 different use cases on the same cluster. I think that's an interesting proof point. We're scaling in terms of the number of, of people in an organization that are trained in leveraging the data in map are again in the, in the thousands. So, you know, I think this market is so big and so dynamic that this isn't about, you know, one company success at the expense of everyone. Else's zero sum game. I think, you know, we're all here kind of raising this, this boat and focusing on this paradigm shift, but when it comes to production success, that's our focus. And I think that's where we've, we've proven that >>One thing I'm really want to get your opinion on, you know, as, as to do matures and some of the innovations you guys are doing and, and making the platform, you know, basically a multi application platform, you can do more things with Hadoop. And we've been talking about this on the cube, is that as that happens, you're going to start you as an industry. You're going to start bumping up against the EDW vendors and some of the other database vendors in the traditional world. And you're now you're doing some of the things that those, those tools can do now, you know, two years ago, it was very much just, this is all very complimentary Hadoop and your EDW. There's no overlap. We're gonna all play nice. But increasingly we're seeing that there is an overlap. How do you view that? Is that, and what is your relationship with those, with those EDW vendors and, and what are you hearing from customers when you go into a customer? Okay. >>So, I mean, there's a, there's a lot in that question. I think the F the first comment though, is don't look at Hadoop through this single data warehouse lens. And if you look at, at trying to use Hadoop to completely replace an enterprise data warehouse where there's, here's a few decades of experience, there, there are many organizations that have a lot of activities that are based in that data warehouse. And that's where we're seeing a data warehouse offload that is complimentary, but it gives organizations this lever to say, well, I'm going to control the fill rate, and I'm going to take some of the data that's no longer, you know, really active and put that on Hadoop and really change my ability to manage the costs in a data warehouse environment. The other thing that's interesting is that the types of applications that duper doing, I think are creating a new class it's about operations and analytics, kind of combined together, taking high arrival rate data and making very quick micro changes to optimize whether that's fraud detection or recommendation engines, or taking sensor data and predictive analytics for, for maintenance, et cetera. There is just a tremendous number of, of applications. In some cases, leveraging a new data source in some cases, doing new applications, but it's just opening things up. And, and I think organizations are moving to be very data-driven and Hadoop is at the center of that. >>And you control the field, right? That's another really good soundbites. And, and these that, you mentioned this high arrival rate data, this fraud detection, predictive analytics, maintenance, these are things that you're doing today with >>Navarre right? Yeah, >>Absolutely. Great. All right, Jack. Well, listen, always a pleasure. Thanks very much for coming by. Great to see you again. All right. Keep it right there about Uber, right back with our next guest. This is the cube we're live from the big apple.

Published Date : Oct 30 2013

SUMMARY :

I from Midtown Manhattan, the cute quiet coverage of big data NYC So by the way, thank you so much for the We had three myths that you talked about, you know, one related to the distraction. So, you know, th the impression that it's a knock-down drag-out sort of what I call the tapped out of the Hadoop distro, you know, piece of it. And, you know, in the keynote, I talked about the difference between the no SQL market, And I thought, I thought I knew them all, but there were a couple in there that I didn't recognize as you probably knew them all, that's on that same, you know, massive data store, you know, big leg up. So you guys were first to say, And that's, you know, is, is Hadoop ready for prime time? where we can do, you know, some interesting batch analytics and then take that and put that someplace else. And you know, one of the big drivers as you expand Did you agree with his, his, to, you know, to serve different business functions. And, you know, the quotes are the ETL functions that they're doing on that hub are 10 And so, you know, the customers cause a lot of you know, when they're in development and test, it's really important to understand, you know, I wouldn't, I don't think you would either the space being robustness enterprise so dynamic that this isn't about, you know, one company success at the expense those tools can do now, you know, two years ago, it was very much just, this is all very complimentary Hadoop and your EDW. And if you look at, at trying to use Hadoop to completely replace an enterprise data warehouse And you control the field, right? Great to see you again.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeff KellyPERSON

0.99+

Michael WilsonPERSON

0.99+

10 timesQUANTITY

0.99+

JackPERSON

0.99+

Jack NorrisPERSON

0.99+

10 timesQUANTITY

0.99+

AmazonORGANIZATION

0.99+

$270 millionQUANTITY

0.99+

MikePERSON

0.99+

yesterdayDATE

0.99+

Dave VolantePERSON

0.99+

HortonworksORGANIZATION

0.99+

third pieceQUANTITY

0.99+

DavePERSON

0.99+

HadoopTITLE

0.99+

Midtown ManhattanLOCATION

0.99+

UberORGANIZATION

0.99+

VolantePERSON

0.99+

thousandsQUANTITY

0.99+

firstQUANTITY

0.99+

20 different use casesQUANTITY

0.99+

GoogleORGANIZATION

0.99+

secondQUANTITY

0.99+

John furrierPERSON

0.98+

NYCLOCATION

0.98+

two years agoDATE

0.98+

HadoopORGANIZATION

0.98+

first commentQUANTITY

0.98+

RubiconORGANIZATION

0.98+

SQLTITLE

0.97+

Terre dataORGANIZATION

0.97+

OneQUANTITY

0.97+

1.7 trillion eventsQUANTITY

0.97+

thirdQUANTITY

0.97+

todayDATE

0.97+

oneQUANTITY

0.96+

singleQUANTITY

0.96+

a year agoDATE

0.95+

one companyQUANTITY

0.94+

HBaseTITLE

0.94+

NavarrePERSON

0.93+

EDWORGANIZATION

0.92+

over 2000 nodesQUANTITY

0.91+

big appleORGANIZATION

0.91+

first marketsQUANTITY

0.9+

nodesQUANTITY

0.89+

about eight monthsQUANTITY

0.88+

2013DATE

0.88+

SoundbiteORGANIZATION

0.87+

three mythsQUANTITY

0.87+

HinduORGANIZATION

0.87+

first open-sourceQUANTITY

0.86+

Wiki bondORGANIZATION

0.85+

BigDataNYCEVENT

0.85+

$15,000 a terabyteQUANTITY

0.85+

three majorQUANTITY

0.82+

90 billion auctions a dayQUANTITY

0.81+

500 paying customersQUANTITY

0.79+

comScoreORGANIZATION

0.79+

map RORGANIZATION

0.78+

over a thousand nodesQUANTITY

0.77+

HiltonLOCATION

0.77+

few hundred dollars a terabyteQUANTITY

0.76+

Number twoQUANTITY

0.76+

10 technologiesQUANTITY

0.74+

Jack Norris - Hadoop Summit 2013 - theCUBE - #HadoopSummit


 

>>Ash it's, you know, what will that mean to my investment? And the announcement fusion IO is that, you know, we're 25 times faster on read intensive HBase applications. The combination. So as organizations are deploying Hadoop, and they're looking at technology changes coming down the pike, they can rest assured that they'll be able to take advantage of those in a much more aggressive fashion with map R than, than other distribution. >>Jack, how I got to ask you, we were talking last night at the Hadoop summit, kind of the kickoff party and, you know, everyone was there. All the top execs were there and all the developers, you know, we were in the queue. I think, I think that either Dave or myself coined the term, the big three of big data, you guys ROMs cloud Cloudera map R and Hortonworks, really at the, at the beginning of the key players early on and Charles from Cloudera was just recently on. And, and he's like, oh no, this, this enterprise grade stuff has been kicked around. It's been there from the beginning. You guys have been there from the beginning and Matt BARR has never, ever waffled on your, on your messaging. You've always been very clear. Hey, we're going to take a dupe open source a dupe and turn it into an enterprise grade product. Right. So that's clear, right? That's, that's, that's a great, that's a great, so what's your take on this because now enterprise grade is kind of there, I guess, the buzz around getting the, like the folks that have crossed the chasm implemented. So what can you comment on that about one enterprise grade, the reality of it, certainly from your perspective, you haven't been any but others. And then those folks that are now rolling it out for the first time, what can you share with them around? What does it mean to be enterprise grade? >>So enterprise grade is more about the customer experience than, than a marketing claim. And, you know, by enterprise grade, what we're talking about are some of the capabilities and features that they've grown to expect in their, their other enterprise applications. So, you know, the ability to meet full S SLA is full ha recovery from multiple failures, rolling upgrades, data protection was consistent snapshots business continuity with mirroring the ability to share a cluster across multiple groups and have, you know, volumes. I mean, there's a, there's a host of features that fall under the umbrella enterprise grade. And when you move from no support for any of those features to support to a few of them, I don't think that's going to, to ha it's more like moving to low availability. And, and there's just a lot of differences in terms of when we say enterprise grade with those features mean versus w what we view as kind of an incomplete story. So >>What do you, what do you mean by low availability? Well, I mean, it's tongue in cheek. It's nice. It's a good term. It's really saying, you know, just available when you sometimes is that what you mean? Is this not true availability? I mean, availability is 99.9%. Right? >>Right. So if you've got a, an ha solution that can't recover from multiple failures, that's downtime. If you've got an HBase application that's running online and you have data that goes down and it takes 10 to 30 minutes to have the region servers recover it from another place in the distribution, that's downtime. If you have snapshots that aren't consistent across the cluster, that doesn't provide data protection, there's no point in time recovery for, for a cluster. So, you know, there's a lot of details underneath that, but what it, what it amounts to is, do you have interruptions? Do you have downtime? Do you have the potential for losing data? And our answer is you need a series of features that are hardened and proven to deliver that. >>What about recoverability? You mentioned that you guys have done a lot of work in that area with snapshotting, that's kind of being kicked around, are our folks addressing, what are the comp what's your competition doing in those areas of recoverability just mentioned availability. Okay, got that. Recoverability security, compliance, and usability. Those are the areas that seem to be the hot focus areas what's going on in the energy. How would you give them the grade, the letter grade, if you will, candidly, compared to what you guys offer? Well, the, >>The first of all, it's take recoverability. You know, one of the tenants is you have a point in time recovery, the ability to restore to a previous point that's consistent across the cluster. And right now there's, there's no point in time recovery for, for HDFS, for the files. And there's no point in time recovery for HBase tables. So there's snapshot support. It's being talked about in the open source community with respect to snapshots, but it's being referred to in the JIRAs as fuzzy snapshots and really compared to copy table. >>So, Jack, I want to turn the conversation to the, kind of the topic we've talked about before kind of the open versus a proprietary that, that whole debate we've, we've, we've heard about that. We talked about that before here on the cube. So just kind of reiterate for us your take. I mean, we, we hear perhaps because of the show we're at, there's a lot of talk about the open source nature of Hadoop and some of the purists, as you might call them are saying, it's gotta be open a hundred percent Patrick compatible, et cetera. And then there's others that are taking a different approach, explain your approach and why you think that's the key way to make, to really spur adoption of a dupe and make it >>W w we're we're a part of the community we're, we've got, you know, commitment going on. We've, you know, pioneered and pushed a patchy drill, but we have done innovations as well. And I think that those innovations are really required to support and extend the, the whole ecosystem. So canonical distributes RN, three D distribution. We've got, you know, all our, our packages are, are available on get hub and, and open source. So it's not, it's not a binary debate. And I think the, the point being that there's companies that have jumped ahead and now that Peloton is, is, you know, pedaling faster and, and we'll, we'll catch up. We'll streamline. I think the difference is we rearchitected. So we're basically in a race car and, you know, are, are racing ahead with, with enterprise grade features that are required. And there's a lot of work that still needs to be done, needs to be accomplished before that full rearchitecture is, is in place. >>Well, I mean, I think for me, the proof is really in the pudding when you, when it comes to talk about customers that are doing real things and real production, grade mission, critical applications that they're running. And to me that shows the successor or relative success of a given approach. So I know you guys are working with companies like ancestry.com, live nation and Quicken loans. Maybe you could, could you walk us through a couple of those scenarios? Let's take ancestry.com. Obviously they've got a huge amount of data based on the kind of geological information, where do you guys do >>With them? Yeah, so they've got, I mean, they've got the world's largest family genealogy services available on the web. So there's a massive amount of data that they make accessible and, and, you know, ability for, for analysis. And then they've rolled out new features and new applications. One of which is to ship a kit out, have people spit in a tube, returned back and they do DNA matching and reveal additional details. So really some really fabulous leading edge things that are being done with, with the use of, of Hadoop. >>Interesting. So talk about when you went to, to work with them, what were some of their key requirements? Was it around, it was more around the enterprise enterprise, grade security and uptime kind of equation, or was it more around some of the analytics? What, what, what's the kind of the killer use case for them? >>It's kind of, you know, it's, it's hard with a specific company or even, you know, to generalize across companies. Cause they're really three main areas in terms of ease of use and administration dependability, which includes the full ha and then, and then performance. And in some cases, it's, it's just one of those that kind of drives it. And it's used to justify, in other cases, it's kind of a collection. The ease of use is being able to use a cluster, not only as Hadoop, but to access it and treat it like enterprise storage. So it's a complete POSIX compliance file system underneath that allows the, the mounting and access and updates and using it in dynamic read-write. So what that means from an application level, it's, it's faster, it's much easier to administer and it's much easier and reliable for developers to, to utilize. >>I got to ask you about the marketing question cause I see, you know, map our, you guys have done a good job of marketing. Certainly we want to be thankful to you guys is supporting the cube in the past and you guys have been great supporters of our mission, but now the ecosystem's evolving a lot more competition. Claudia mentioned those eight companies they're tracking in quote Hadoop, and certainly Jeff and I, and, and SiliconANGLE by look at there's a lot more because Hadoop washing has been going on now for the term Hadoop watching me and jumping in and doing Hadoop, slapping that onto an existing solution. It's not been happening full, full, full bore for a year. At least what's the next for you guys to break above the noise? Obviously the communities are very active projects are coming online. You guys have your mission in the enterprise. What's the strategy for you guys going forward is more of the same and anything new even share. >>Yeah, I, I, I think as far as breaking above the noise, it will be our customers, their success and their use cases that really put the spotlight on what the differences are in terms of, of, you know, using a big data platform. And I think what, what companies will start to realize is I'd rather analogy between supply chain and the big, the big revolution in supply chain was focusing on inventory at each stage in the supply chain. And how do you reduce that inventory level and how do you speed the, the flow of goods and the agility of a company for competitive advantage. And I think we're going to view data the same way. So companies instead of raw data that they're copying and moving across different silos, if they're able to process data in place and send small results sets, they're going to be faster, more agile and more competitive. >>And that puts the spotlight on what data platform is out there that can support a broad set of applications and it can have the broadest set of functionality. So, you know, what we're delivering is a mission grade, you know, enterprise grade mission, critical support platform that supports MapReduce and does that high performance provides NFS POSIX access. So you can use it like a file system integrates, you know, enterprise grade, no SQL applications. So now you can do, you know, high-speed consistent performance, real time operations in addition to batch streaming, integrated search, et cetera. So it's, it's really exciting to provide that platform and have organizations transform what they're doing. >>How's the feedback on with Ted Dunning? I haven't seen a lot of buzz on the Twittersphere is getting positive feedback here. He's a, a tech athlete. He's a guru, he's an expert. He's got his hands in all the pies. He's a scientist type. What's he up to? What's his, what's his role within Mapa and he's obviously playing in the open-source community. What's he up to these days, >>Chief application architect, he's on the leading edge of my house. So machine learning, so, you know, sharing insights there, he was speaking at the storm meetup two nights ago and sharing how you can integrate long running batch, predictive analytics with real-time streaming and how the use of snapshots really that, that easy and possible. He travels the world and is helping organizations understand how they can take some very complex, long running processes and really simplify and shorten those >>Chance to meet him in New York city had last had duke world at a, at a, a party and great guy, fantastic geek, and certainly is doing a great work and shout out to Ted. Congratulations, continue up that support. How's everyone else doing? How's John and Treevis doing how's the team at map are we're pedaling as best as you can growing >>Really quickly. No, we're just shifting gears. Would it be on pedaling >>Engine? >>Yeah. Give us an update on the company in terms of how the growth and kind of where you guys are moving that. >>Yeah. We're, we're expanding worldwide, you know, just this, you know, last few months we've opened up offices and in London and Munich and Paris, we're expanding in Asia, Japan and Korea. So w our, our sales and services and engineering, and basically across the whole company continues to expand rapidly. Some really great, interesting partnerships and, and a lot of growth Natalie's we add customers, but it's, it's nice to see customers that continue to really grow their use of map are within their organization, both in terms of amount of data that they're analyzing and the number of applications that they're bringing to bear on the platform. >>Well, that a little bit, because I think, you know, one of the, one of the trends we do see is when a company brings in big data, big data platform, and they might start experiment experimenting with it, build an application. And then maybe in the, maybe in the marketing department, then the sales guys see it and they say, well, maybe we can do something with that. How is that typically the kind of the experience you're seeing and how do you support companies that want to start expanding beyond those initial use cases to support other departments, potentially even other physical locations around the world? How do you, how do you kind of, >>That's been the beauty of that is if you have a platform that can support those new applications. So if you know, mission critical workloads are not an issue, if you support volumes so that you can logically separate makes it much easier, which we have. So one of our customers Zions bank, they brought in Matt BARR to do fraud detection. And pretty soon the fact that they were able to collect all of that data, they had other departments coming to them and saying, Hey, we'd like to use that to do analysis on because we're not getting that data from our existing system. >>Yeah. They come in and you're sitting on a goldmine, there are use cases. And you also mentioned kind of, as you're expanding internationally, what's your take on the international market for big data to do specifically is, is the U S kind of a leaps and bounds ahead of the rest of the world in terms of adoption of the technology. What are you seeing out there in terms of where, where the rest of the, >>I wouldn't say leaps and bounds, and I think internationally, they're able to maybe skip some of the experimental steps. So we're seeing, we're seeing deployment of class financial services and telecom, and it's, it's fairly broad recruit technologies there. The largest provider of recruiting services, indeed.com is one of their subsidiaries they're doing a lot with, with Hadoop and map are specifically, so it's, it's, it's been, it's been expanding rapidly. Fantastic. >>I also, you know, when you think about Europe, what's going on with Google and some of the, the privacy concerns even here, or I should say, is there, are there different regulatory environments you've got to navigate when you're talking about data and how you use data when you're starting to expand to other, other locales? >>Yeah. There's typically by vertical, there's different, different requirements, HIPAA and healthcare, and basal to, and financial services. And so all of those, and it, it, it basically, it's the same theme of when you're bringing Hadoop into an organization and into a data center, the same sorts of concerns and requirements and privacy that you're applying in other areas will be applied on Hindu. >>I'm now kind of turning back to the technology. You mentioned Apache drill. I'd love to get an update on kind of where, where that stands. You know, it's put, then put that into context for people. We hear a lot about the SQL and Hadoop question here, where does drill fit into that, into that equation? >>Well, the, the, you know, there's a lot of different approaches to provide SQL access. A lot of that is driven by how do you, how do you leverage some of the talent and organization that, you know, speak SQL? So there's developments with respect to hive, you know, there's other projects out there. Apache drill is an open source project, getting a lot of community involvement. And the design center there is pretty interesting. It started from the beginning as an open source project. And two main differences. One was in looking at supporting SQL it's, let's do full ANSI SQL. So it's full 2003 ANSI, sequel, not a SQL like, and that'll support the greatest number of applications and, you know, avoid a lot of support and, and issues. And the second design center is let's support a broad set of data sources. So nested sources like Jason scheme on discovery, and basically fitting it into an enterprise environment, which sometimes is kinda messy and can get messy as acquisitions happen, et cetera. So it's complimentary, it's about, you know, enabling interactive, low latency queries. >>Jack, I want to give you the final word. We are out of time. Thanks for coming on the cube. Really preached. Great to see you again, keep alumni, but final word. And we'll end the segment here on the cube is your quick thoughts on what's happening here at Hadoop world. What is this show about? Share with the audience? What's the vibe, the summary quick soundbite on Hadoop. >>I think I'll go back to how we started. It's not, if you used to do putz, how you use to do and, you know, look at not only the first application, but what it's going to look like in multiple applications and pay attention to what enterprise grade means. >>Okay. They were secure. We got a more coverage coming, Jack Norris with map R I'll say one of the big three original, big three, still on the, on the list in our mind, and the market's mind with a unique approach to Hadoop and the mid-June great. This is the cube I'm Jennifer with Jeff Kelly. We'll be right back after this short break, >>Let's settle the PR program out there and fighting gap tech news right there. Plenty of the attack was that providing a new gadget. Let's talk about the latest game name, but just the.

Published Date : Jun 27 2013

SUMMARY :

IO is that, you know, we're 25 times faster on read intensive HBase applications. All the top execs were there and all the developers, you know, So, you know, the ability to meet full S SLA is full ha It's really saying, you know, just available when So, you know, there's a lot of details compared to what you guys offer? You know, one of the tenants is you have a point of Hadoop and some of the purists, as you might call them are saying, it's gotta be open a hundred percent that Peloton is, is, you know, pedaling faster and, and we'll, we'll catch up. So I know you guys are working with companies like ancestry.com, live nation and Quicken that they make accessible and, and, you know, ability for, So talk about when you went to, to work with them, what were some of their key requirements? It's kind of, you know, it's, it's hard with a specific company or even, I got to ask you about the marketing question cause I see, you know, map our, you guys have done a good job of marketing. And how do you reduce that inventory level and how do you speed the, you know, what we're delivering is a mission grade, you know, enterprise grade mission, How's the feedback on with Ted Dunning? so, you know, sharing insights there, he was speaking at the storm meetup How's John and Treevis doing how's the team at map are we're pedaling as best as you can No, we're just shifting gears. and basically across the whole company continues to expand rapidly. Well, that a little bit, because I think, you know, one of the, one of the trends we do see is when a company brings in big data, That's been the beauty of that is if you have a platform that can support those And you also mentioned kind of, they're able to maybe skip some of the experimental steps. and it, it, it basically, it's the same theme of when you're bringing Hadoop into We hear a lot about the SQL and Hadoop question support the greatest number of applications and, you know, avoid a lot of support and, Great to see you again, you know, look at not only the first application, but what it's going to look like in multiple This is the cube I'm Jennifer with Jeff Kelly. Plenty of the attack was that providing a new gadget.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
TedPERSON

0.99+

LondonLOCATION

0.99+

ClaudiaPERSON

0.99+

Jeff KellyPERSON

0.99+

AsiaLOCATION

0.99+

Ted DunningPERSON

0.99+

Jack NorrisPERSON

0.99+

DavePERSON

0.99+

JohnPERSON

0.99+

JackPERSON

0.99+

10QUANTITY

0.99+

ParisLOCATION

0.99+

KoreaLOCATION

0.99+

Matt BARRPERSON

0.99+

MunichLOCATION

0.99+

New YorkLOCATION

0.99+

99.9%QUANTITY

0.99+

JenniferPERSON

0.99+

TreevisPERSON

0.99+

25 timesQUANTITY

0.99+

JapanLOCATION

0.99+

GoogleORGANIZATION

0.99+

bothQUANTITY

0.99+

oneQUANTITY

0.99+

JeffPERSON

0.99+

eight companiesQUANTITY

0.99+

first timeQUANTITY

0.99+

mid-JuneDATE

0.99+

CharlesPERSON

0.98+

EuropeLOCATION

0.98+

30 minutesQUANTITY

0.98+

OneQUANTITY

0.98+

first applicationQUANTITY

0.98+

AshPERSON

0.98+

two nights agoDATE

0.98+

HortonworksORGANIZATION

0.98+

each stageQUANTITY

0.97+

SQLTITLE

0.97+

SiliconANGLEORGANIZATION

0.97+

NataliePERSON

0.97+

ancestry.comORGANIZATION

0.96+

HadoopTITLE

0.96+

PatrickPERSON

0.96+

last nightDATE

0.95+

JasonPERSON

0.95+

2003DATE

0.95+

HadoopEVENT

0.94+

ApacheORGANIZATION

0.94+

HadoopPERSON

0.93+

indeed.comORGANIZATION

0.93+

hundred percentQUANTITY

0.92+

HBaseTITLE

0.92+

Hadoop Summit 2013EVENT

0.92+

Quicken loansORGANIZATION

0.92+

two main differencesQUANTITY

0.89+

HIPAATITLE

0.89+

#HadoopSummitEVENT

0.89+

S SLATITLE

0.89+

HadoopORGANIZATION

0.88+

ClouderaORGANIZATION

0.85+

map RTITLE

0.85+

a yearQUANTITY

0.83+

Zions bankORGANIZATION

0.83+

PelotonLOCATION

0.78+

NFSTITLE

0.78+

MapReduceTITLE

0.77+

Cloudera map RORGANIZATION

0.75+

liveORGANIZATION

0.74+

second design centerQUANTITY

0.73+

HinduORGANIZATION

0.7+

theCUBEORGANIZATION

0.7+

three main areasQUANTITY

0.68+

one enterprise gradeQUANTITY

0.65+

Jack Norris | Strata Data Conference 2013


 

>>Okay. We're back here inside the cube, our flagship program about the events and extract the signal from the noise. This is strata conference. O'Reilly media is a big data event. We're talking about Hadoop analytics, data platforms, and big is come into the enterprise from the front door. As we heard them yesterday. I'm John Frey with Dave Volante, wiki.org. And we're here with Jack Norris, our cube alumni, and a favorite guest here. You're a in charge executive at map. Our, you guys are leading the charge with this use of a dupe. Welcome back to the cube. Thank you. Okay, so what's, let's chat about what's going on. What's your take on all the big news out here for the distributions. I'll the big power moose. You guys have a relationship with EMC. Okay. Exclusive relationship with those guys. Intel's got a distribution Horton versus with Microsoft, a lot of things going on. So this is your wheelhouse. So what's your take on the Hadoop action here? >>Well, I think there's an article in Forbes where I think they, they said it best. This is showing that map bars had the right strategy all along. And what we're seeing is, is basically there's a fairly low bar to taking a patchy Hadoop and providing a distribution. And so we're seeing a lot of new entrance in the market and there's, there's a lot of options. If you want to try Hadoop and experiment and get started. And then there's production class Hadoop, which includes enterprise data protection, snapshots mirrors, ability to integrate. And that's basically map R so start and test and dev with, with a lot of options and then move into production, class >>Mapbox. So break it down for the folks out there who are tipping the toe in the water and hearing all the noise. Cause it's right now, the noise level is very high, right? With the, with the recent announcements. But you guys have been doing business obviously for many years in this area. So when people say, Hey, I want to get a Hadoop distribution with enterprise. What, what should they be looking for? Okay. Because it's not that easy to kind of swing through the noise. So could you share with the folks out there, what, what to look for in like the, the table stakes, the check boxes? Cause there's a lot of claims. There's a lot of noise is this. And that is a lot of different options. Some teams have more committers or no committers than others, so that's all noise, but let's what are the key things that customers need to know? So I think there's, miling, >>There's three areas. All right. One is kind of how it integrates into your enterprise. And with Hadoop, you have the Hadoop distributed file system API. That's how you interact. Well, if you're able to also use standard tools that can use standard file and database access, it makes it much, much easier. So map ours unique and supporting NFS and making that happen. That's a, that's a big difference. The second is on dependability and there's high availability capabilities and then there's data protection. So I'll focus on snapshots as an example, you've got data replicated and Hindu. That's great. But if you have a user error, an application error, that's replicated just as quickly. So having the ability to recover and double-edged in time. Yeah. So if I can say, Hey, I made a mistake. Can I go back two minutes earlier with snapshots that makes it possible map ours, unique and snapshot support. And then finally, there's there's disaster recovery mirroring where you can go across clusters, mirror, what's going on across the land and being able to recover in the case of a disaster where you lose a whole cluster or use a whole >>Section and that's not available in >>Other, those aren't available either. That's >>NFS, >>Snapshots has been on the JIRA list for over five years. >>Yeah. Okay. So I wonder >>If I could find that and then there's third. Cause I said three and almost said two, the third is performance and scale and, but >>That'd be for >>Integration, dependability and speed. >>Okay. So dependability Jr's part of the VR snapshots. MDR. Okay. So let's talk about the performance because you guys had asked a Google's a big partner of you guys. So we should, we just had them on the cube strata. So you have to have a record setting. Do you have a record setting? EMC take that. Well, you work with DMC. So let me talk about the performance real quick. Then we'll talk about some of the EMC conversations, but performance, you have a variety of diverse performance benchmarks, Google you have within the enterprise. Can you talk about those? >>So, so what we announced this week was the minute sort world record. So minutes or runs across technologies is just, how can you, you know, how much data can you sort in 60 seconds? And if you look back at, at the previous record that was done in the labs with Microsoft with special purpose software, and they did 1.4 terabytes Hadoop hasn't been used since 2009, it's been several years because it's got features in there that work against performance. Things like checkpointing and logging because it assumes you've got long running MapReduce jobs. So we set the record with our distribution of Hadoop. So we have kind of one hand tied behind our back, given that technology. Secondly, we sent it in the cloud, which is the other hand tied behind our back because it's a, it's a virtualized environment. So we set the record with just with your legs And a 1.5 terabytes in 60 seconds. Very proud of that. >>Well, that's interesting because we've been doing a lot of labs testing, Dave and I and our teams on cost. Right. So, yeah. And it's an interesting benchmark because you always don't look at the nuance, the cost to compare a cloud performance versus bare metal. Most people don't factor into setup, cost of deployment. Exactly. So can you just quickly talk about that and how significant of an order of magnitude of your customer? >>So the, the previous Hadoop record took 3,400 servers about 27,000 cores, 13, 13,000, almost 14,000 discs and did 600 gigs, actually a little less than that at 5 78. And on Google, we did it with 2020 100 virtual instances, 8,000 cores did 1.5 terabytes >>And costs. You spin up the Google versus >>Basically if you look at that and you assume conservatively 4,000 per server, it's $13.8 million worth of hardware previously. And the cost to do that run on Google was $20 and 33 cents. >>Well, you got to discount. I mean, come on a partner mean it really costs that much. I mean, they that's what they would charge for it. Actually >>We are map artist's case on that minute. If you look at the Asheville charges to be 1200, >>Okay. It's not six millions, so millions to thousands. Yep. Okay. That's impressive. We'll have to go look at the numbers. Like we're going to look at GreenPlum's numbers in the next couple of weeks when talking about the Google relationship and men were that the up way with that was that >>Very excited about it. We're actually deployed throughout the cloud. We've got multiple partners Google's in limited preview. So we've got a number of customers kind of, you know, testing that and, and doing some really interesting things. >>So we monitor the data center market. I'll see with our proprietary tool that you know about the viewfinder and crowd spots and thing is that the data center verticals interesting, right? If you look at the sentiment analysis of what the conversation is on, on just the Twitter data, it's Facebook, apple, these companies. And when we dig into the numbers, it's not so much the companies, it's the fact that their data center operations are significantly being looked at as the leading indicator for where CEO's are going. So I want to ask you in your conversations with your customers, what are the conversations around moving to the cloud and where are they on that transition? Because we hear, yeah, one of the cloud for all the benefits you were mentioning, but Google and Facebook, these are the gold standards as, as architecture necessarily a cut and paste architecture, but they see the benefits that they're doing. So what are your conversations with your enterprise customers around the cloud cloud architecture and what other features besides replication and disaster recovery, are they, are they looking at >>Well, it's basically work, workload driven and dataset driven. So data that's already in the cloud are kind of a natural first step is, well, why don't I do the analysis there as well? So things like Google earth and digital advertising data, that's real interesting candidates for that also periodic workload. So if they have workloads that need to spin up and spin down, the, the cloud works, works really well for that. And in some cases it's driven by their own environments. They've got data centers that are approaching capacity and they need to kind of do offloads and then looking at the, at the cloud because it's easy to get up running quickly and uses an alternative. >>I want to do come back to one of your three sort of value props here, particularly the dependability piece and specifically the snapshot. So somebody asked me one time, how do you know a couple of years ago, how do you back up a petabyte as he could do this thing? And then his answer was, well, you don't know. So I want to, I want to ask you how your customers are protecting and, and, and, and what you guys are bringing to the table. >>So snapshots is not a bolt on feature. It's basically a low level feature based on the underlying data architecture. So when we architected that from the beginning, snapshots was, was a, was a core feature. And if you use a technique called redirect on, right, you're not copying the data, right? So you can do efficient, you can do a petabyte snapshot, you know, basically almost instantaneously because you're tracking the pointers of the latest blocks that have been written. So if, if the data change rate is, is basically, data's not changing, you can snapshot every minute and not have any additional storage overhead. >>Right. Okay. And, and so you can set that. So you, you map, map, our technologies will allow them to set that, dial that up, dial it down and switches. >>So we support logical volumes. So you can set policies at that volume and you can say, well, this volume is critical data. And then I can set policies. Well, critical data is every minute. And then I can change what the definition of critical data is. Maybe it's every five minutes, et cetera. So you can set up these different policies at volumes and have snapshots happen independently for each. >>Can you do that by workload or dataset or by application or whatever I get essentially provided as a service, as opposed to kind of a one size fits all approach. >>Exactly. And that, that also corresponds to user access, administrative privileges, you know, other features and policies within the, within the cluster. >>How about the, you know, this whole trend toward bringing SQL into, into Hadoop. What's, what's your take on that? And what's your angle? >>So interactive, SQL's an important aspect because you've got so many people trained in the organization and, and leverage, you know, sequel, but it's one of many use cases that needs to run across a big data platform. So there's a range of big data analytics, batch analytics, interactive capabilities with sequel, database operations, no sequel search streaming, all those are kind of functions that need to run across a platform. So it's a piece, but it's not the big driver, because what we've seen is that there's higher rival rate of machine generated data and machine generated response to respond to those for digital advertising, for recommendation engines for fraud detection can really move the needle for an organization, have huge swings and profitability >>And the ball down the field big time. Yeah. And >>Having an interactive piece with a kind of a human element involved, it doesn't really scale and work on a 24 by seven basis. >>Jack final question, we're over now by a minute. But when I ask a one party question, obviously, very competitive landscape right now in terms of competitiveness, the stakes are higher because the demand in the market market opportunities is massive. What's map ours business strategy going forward, no change in direction. Is it going to be same old, same old. You guys have any new things going down and you see the marketplace. >>We've got a huge lead when it comes to kind of mission critical enterprise grade features. And our focus is one platform. So the ability to support enterprise Hadoop, enterprise HBase and provide those full capabilities for ease of use for dependability, for performance. And, you know, we've seen a lot of companies test on one distribution and switch to map are and will continue to help that in the future. >>Well, we, we will, we will say we've been covering this big data space now going on four years now, Dave and I, and we've watched all the players pivot a few times. You guys have not, you guys have been true to your mission from day one and that we know where you stand. No one, everyone knows where you stand enterprise grade. It's a good strategy. I think everyone's putting that on their label now. So enterprise grade Washington, we call it a congratulations map art and said the cube. We'll be right back with our next guest here on day three wall-to-wall coverage at O'Reilly media. When do our news, our next from 12 to one, we'll be right back after this short break.

Published Date : Mar 4 2013

SUMMARY :

So what's your take on the Hadoop If you want to try Hadoop So could you share with the folks out there, what, what to look for in like the, the table stakes, And with Hadoop, you have the Hadoop That's If I could find that and then there's third. So let's talk about the performance because you And if you look back at, at the previous record that was done in the labs with So can you just quickly talk about that and how significant And on Google, we did it with 2020 100 virtual instances, And costs. And the cost to do that run on Google was $20 Well, you got to discount. If you look at the Asheville charges to be 1200, We'll have to go look at the numbers. So we've got a number of customers kind of, you know, testing that and, So I want to ask you in your conversations with your customers, So data that's already in the cloud are kind of a natural first step is, well, So I want to, I want to ask you how your customers are protecting and, and, So you can do efficient, you can do a petabyte snapshot, So you, you map, So you can set policies at that volume and you can say, Can you do that by workload or dataset or by application or whatever I get essentially provided as a service, you know, other features and policies within the, within the cluster. How about the, you know, this whole trend toward bringing SQL into, into Hadoop. you know, sequel, but it's one of many use cases that needs to run And the ball down the field big time. Having an interactive piece with a kind of a human element involved, and you see the marketplace. So the ability to support enterprise Hadoop, You guys have not, you guys have been true to your mission from day

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VolantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

$20QUANTITY

0.99+

Jack NorrisPERSON

0.99+

John FreyPERSON

0.99+

appleORGANIZATION

0.99+

$13.8 millionQUANTITY

0.99+

DavePERSON

0.99+

600 gigsQUANTITY

0.99+

GoogleORGANIZATION

0.99+

60 secondsQUANTITY

0.99+

1.5 terabytesQUANTITY

0.99+

33 centsQUANTITY

0.99+

FacebookORGANIZATION

0.99+

3,400 serversQUANTITY

0.99+

six millionsQUANTITY

0.99+

8,000 coresQUANTITY

0.99+

EMCORGANIZATION

0.99+

O'ReillyORGANIZATION

0.99+

1200QUANTITY

0.99+

thirdQUANTITY

0.99+

thousandsQUANTITY

0.99+

AshevilleLOCATION

0.99+

millionsQUANTITY

0.99+

twoQUANTITY

0.99+

TwitterORGANIZATION

0.99+

2009DATE

0.99+

1.4 terabytesQUANTITY

0.99+

SQLTITLE

0.99+

threeQUANTITY

0.99+

yesterdayDATE

0.99+

24QUANTITY

0.99+

this weekDATE

0.99+

four yearsQUANTITY

0.99+

one partyQUANTITY

0.99+

over five yearsQUANTITY

0.99+

three areasQUANTITY

0.99+

HadoopTITLE

0.99+

OneQUANTITY

0.98+

2020DATE

0.98+

oneQUANTITY

0.98+

100 virtual instancesQUANTITY

0.97+

secondQUANTITY

0.97+

one platformQUANTITY

0.97+

first stepQUANTITY

0.97+

JackPERSON

0.97+

one timeQUANTITY

0.97+

SecondlyQUANTITY

0.95+

about 27,000 coresQUANTITY

0.94+

HBaseTITLE

0.93+

13, 13,000QUANTITY

0.93+

GreenPlumORGANIZATION

0.92+

day threeQUANTITY

0.92+

DMCORGANIZATION

0.91+

IntelORGANIZATION

0.9+

a minuteQUANTITY

0.9+

day oneQUANTITY

0.89+

Strata Data ConferenceEVENT

0.89+

4,000 per serverQUANTITY

0.89+

14,000 discsQUANTITY

0.87+

five minutesQUANTITY

0.85+

WashingtonLOCATION

0.84+

one distributionQUANTITY

0.83+

wiki.orgOTHER

0.83+

sevenQUANTITY

0.83+

couple of years agoDATE

0.83+

5 78QUANTITY

0.82+

eachQUANTITY

0.81+

JrPERSON

0.79+

12QUANTITY

0.77+

Jack Norris | Strata-Hadoop World 2012


 

>>Okay. We're back here, live in New York city for big data week. This is siliconangle.tvs, exclusive coverage of Hadoop world strata plus Hadoop world big event, a big data week. And we just wrote a blog post on siliconangle.com calling this the south by Southwest for data geeks and, and, um, it's my prediction that this is going to turn into a, quite the geek Fest. Uh, obviously the crowd here is enormous packed and an amazing event. And, uh, we're excited. This is siliconangle.com. I'm the founder John ferry. I'm joined by cohost update >>Volante of Wiki bond.org, where people go for free research and peers collaborate to solve problems. And we're here with Jack Norris. Who's the vice president of market marketing at map are a company that we've been tracking for quite some time. Jack, welcome back to the cube. Thank you, Dave. I'm going to hand it to you. You know, we met quite a while ago now. It was well over a year ago and we were pushing at you guys and saying, well, you know, open source and nice look, we're solving problems for customers. We got the right model. We think, you know, this is, this is our strategy. We're sticking to it. Watch what happens. And like I said, I have to hand it to you. You guys are really have some great traction in the market and you're doing what you said. And so congratulations on that. I know you've got a lot more work to do, but >>Yeah, and actually the, the topic of openness is when it's, it's pretty interesting. Um, and, uh, you know, if you look at the different options out there, all of them are combining open source with some proprietary. Uh, now in the case of some distributions, it's very small, like an ODBC driver with a proprietary, um, driver. Um, but I think it represents that that any solution combining to make it more open is, is important. So what we've done is make innovations, but what we've made those innovations we've opened up and provided API. It's like NFS for standard access, like rest, like, uh, ODBC drivers, et cetera. >>So, so it's a spectrum. I mean, actually we were at Oracle open world a few weeks ago and you listen to Larry Ellison, talk about the Oracle public cloud mix of actually a very strong case that it's open. You can move data, it's all Java. So it's all about standards. Yeah. And, uh, yeah, it from an opposite, but it was really all about the business value. That's, that's what the bottom line is. So, uh, we had your CEO, John Schroeder on yesterday. Uh, John and I both were very impressed with, um, essentially what he described as your philosophy of we, we not as a product when we have, we have customers when we announce that product and, um, you know, that's impressive, >>Is that what he was also given some good feedback that startup entrepreneurs out there who are obviously a lot of action going on with the startup community. And he's basically said the same thing, get customers. Yeah. And that's it, that's all and use your tech, but don't be so locked into the tech, get the cutters, understand the needs and then deliver that. So you guys have done great. And, uh, I want to talk about the, the show here. Okay. Because, uh, you guys are, um, have a big booth and big presence here at the show. What, what did you guys are learning? I'll say how's the positioning, how's the new news hitting. Give us a quick update. So, >>Uh, a lot of news, uh, first started, uh, on Tuesday where we announced the M seven edition. And, uh, yeah, I brought a demo here for me, uh, for you all. Uh, because the, the big thing about M seven is what we don't have. So, uh, w we're not demoing Regents servers, we're not demoing compactions, uh, we're not demoing a lot of, uh, manual administration, uh, administrative tasks. So what that really means is that we took this stack. And if you look at HBase HBase today has about half of dupe users, uh, adopting HBase. So it's a lot of momentum in the market, uh, and, you know, use for everything from real-time analytics to kind of lightweight LTP processing. But it's an infrastructure that sits on top of a JVM that stores it's data in the Hadoop distributed file system that sits on a JVM that stores its data in a Linux file system that writes to disk. >>And so a lot of the complexity is that stack. And so as an administrator, you have to worry about how data gets permit, uh, uh, you know, kind of basically written across that. And you've got region servers to keep up, uh, when you're doing kind of rights, you have things called compactions, which increased response time. So it's, uh, it's a complex environment and we've spent quite a bit of time in, in collapsing that infrastructure and with the M seven edition, you've got files and tables together in the same layer writing directly to disc. So there's no region servers, uh, there's no compactions to deal with. There's no pre splitting of tables and trying to do manual merges. It just makes it much, much simpler. >>Let's talk about some of your customers in terms of, um, the profile of these guys are, uh, I'm assuming and correct me if I'm wrong, that you're not selling to the tire kickers. You're selling to the guys who actually have some experience with, with a dupe and have run into some of the limitations and you come in and say, Hey, we can solve some of those problems. Is that, is that, is that right? Can you talk about that a little bit >>Characterization? I think part of it is when you're in the evaluation process and when you first hear about Hadoop, it's kind of like the Gartner hype curve, right. And, uh, you know, this stuff, it does everything. And of course you got data protection, cause you've got things replicated across the cluster. And, uh, of course you've got scalability because you can just add nodes and so forth. Well, once you start using it, you realize that yes, I've got data replicated across the cluster, but if I accidentally delete something or if I've got some corruption that's replicated across the cluster too. So things like snapshots are really important. So you can return to, you know, what was it, five minutes before, uh, you know, performance where you can get the most out of your hardware, um, you know, ease of administration where I can cut this up into, into logical volumes and, and have policies at that whole level instead of at an individual file. >>So there's a, there's a bunch of features that really resonate with users after they've had some experience. And those tend to be our, um, you know, our, our kind of key customers. There's a, there's another phase two, which is when you're testing Hadoop, you're looking at, what's possible with this platform. What, what type of analytics can I do when you go into production? Now, all of a sudden you're looking at how does this fit in with my SLS? How does this fit in with my data protection, uh, policies, you know, how do I integrate with my different data sources? And can I leverage existing code? You know, we had one customer, um, you know, a large kind of a systems integrator for the federal government. They have a million lines of code that they were told to rewrite, to run with other distributions that they could use just out of the box with Matt BARR. >>So, um, let's talk about some of those customers. Can you name some names and get >>Sure. So, um, actually I'll, I'll, I'll talk with, uh, we had a keynote today and, uh, we had this beautiful customer video. They've had to cut because of times it's running in our booth and it's screaming on our website. And I think we've got to, uh, actually some of the bumper here, we kind of inserted. So, um, but I want to shout out to those because they ended up in the cutting room floor running it here. Yeah. So one was Rubicon project and, um, they're, they're an interesting company. They're a real-time advertising platform at auction network. They recently passed a Google in terms of number one ad reach as mentioned by comScore, uh, and a lot of press on that. Um, I particularly liked the headline that mentioned those three companies because it was measured by comScore and comScore's customer to map our customer. And Google's a key partner. >>And, uh, yesterday we announced a world record for the Hadoop pterosaur running on, running on Google. So, um, M seven for Rubicon, it allows them to address and replace different point solutions that were running alongside of Hadoop. And, uh, you know, it simplifies their, their potentially simplifies their architecture because now they have more things done with a single platform, increases performance, simplifies administration. Um, another customer is ancestry.com who, uh, you know, maybe you've seen their ads or heard, uh, some of their radio shots. Um, they're they do a tremendous amount of, of data processing to help family services and genealogy and figure out, you know, family backgrounds. One of the things they do is, is DNA testing. Uh, so for an internet service to do that, advanced technology is pretty impressive. And, uh, you know, you send them it's $99, I believe, and they'll send you a DNA kit spit in the tube, you send it back and then they process that and match and give you insights into your family background. So for them simplifying HBase meant additional performance, so they could do matches faster and really simplified administration. Uh, so, you know, and, and Melinda Graham's words, uh, you know, it's simpler because they're just not there. Those, those components >>Jack, I want to ask you about enterprise grade had duped because, um, um, and then, uh, Ted Dunning, because he was, he was mentioned by Tim SDS on his keynote speech. So, so you have some rockstars stars in the company. I was in his management team. We had your CEO when we've interviewed MC Sri vis and Google IO, and we were on a panel together. So as to know your team solid team, uh, so let's talk about, uh, Ted in a minute, but I want to ask you about the enterprise grade Hadoop conversation. What does that mean now? I mean, obviously you guys were very successful at first. Again, we were skeptics at first, but now your traction and your performance has proven this is a market for that kind of platform. What does that mean now in this, uh, at this event today, as this is evolving as Hadoop ecosystem is not just Hadoop anymore. It's other things. Yeah, >>There's, there's, there's three dimensions to enterprise grade. Um, the first is, is ease of use and ease of use from an administrator standpoint, how easy does it integrate into an existing environment? How easy does it, does it fit into my, my it policies? You know, do you run in a lights out data center? Does the Hadoop distribution fit into that? So that's, that's one whole dimension. Um, a key to that is, is, you know, complete NFS support. So it functions like, uh, you know, like standard storage. Uh, a second dimension is undependability reliability. So it's not just, you know, do you have a checkbox ha feature it's do you have automated stateful fail over? Do you have self healing? Can you handle multiple, uh, failures and, and, you know, automated recovery. So, you know, in a lights out data center, can you actually go there once a week? Uh, and then just, you know, replace drives. And a great example of that is one of our customers had a test cluster with, with Matt BARR. It was a POC went on and did other things. They had a power field, they came back a week later and the cluster was up and running and they hadn't done any manual tasks there. And they were, they were just blown away to the recovery process for the other distributions, a long laundry list of, >>So I've got to ask you, I got to ask you this, the third >>One, what's the third one, third one is performance and performance is, is, you know, kind of Ross' speed. It's also, how do you leverage the infrastructure? Can you take advantage of, of the network infrastructure, multiple Knicks? Can you take advantage of heterogeneous hardware? Can you mix and match for different workloads? And it's really about sharing a cluster for different use cases and, and different users. And there's a lot of features there. It's not just raw >>The existing it infrastructure policies that whole, the whole, what happens when something goes wrong. Can you automate that? And then, >>And it's easy to be dependable, fast, and speed the same thing, making HBase, uh, easy, dependable, fast with themselves. >>So the talk of the show right now, he had the keynote this morning is that map. Our marketing has dropped the big data term and going with data Kozum. Is that true? Is that true? So, Joe, Hellerstein just had a tweet, Joe, um, famous, uh, Cal Berkeley professor, computer science professor now is CEO of a startup. Um, what's the industry trifecta they're doing, and he had a good couple of epic tweets this week. So shout out to Joe Hellerstein, but Joel Hellison's tweet that says map our marketing has decided to drop the term big data and go with data Kozum with a shout out to George Gilder. So I'm kind of like middle intellectual kind of humor. So w w w what's what's your response to that? Is it true? What's happening? What is your, the embargo, the VP of marketing? >>Well, if you look at the big data term, I think, you know, there's a lot of big data washing going on where, um, you know, architectures that have been out there for 30 years or, you know, all about big data. Uh, so I think there's a, uh, there's the need for a more descriptive term. Um, the, the purpose of data Kozum was not to try to coin something or try to, you know, change a big data label. It was just to get people to take a step back and think, and to realize that we are in a massive paradigm shift. And, you know, with a shout out to George Gilder, acknowledging, you know, he recognized what the impact of, of making available compute, uh, meant he recognized with Telekom what bandwidth would mean. And if you look at the combination of we've got all this, this, uh, compute efficiency and bandwidth, now data them is, is basically taking those resources and unleashing it and changing the way we do things. >>And, um, I think, I think one of the ways to look at that is the new things that will be possible. And there's been a lot of focus on, you know, SQL interfaces on top of, of Hadoop, which are important. But I think some of the more interesting use cases are taking this machine J generated data that's being produced very, very rapidly and having automated operational analytics that can respond in a very fast time to change how you do business, either, how you're communicating with customers, um, how you're responding to two different, uh, uh, risk factors in the environment for fraud, et cetera, or, uh, just increasing and improving, um, uh, your response time to kind of cost events. We met earlier called >>Actionable insight. Then he said, assigning intent, you be able to respond. It's interesting that you talk about that George Gilder, cause we like to kind of riff and get into the concept abstract concepts, but he also was very big in supply side economics. And so if you look at the business value conversation, one of things we pointed out, uh, yesterday and this morning, so opening, um, review was, you know, the, the top conversations, insight and analytics, you know, as a killer app right now, the app market has not developed. And that's why we like companies like continuity and what you guys are doing under the hood is being worked on right at many levels, performance units of those three things, but analytics is a no brainer insight, but the other one's business value. So when you look at that kind of data, Kozum, I can see where you're going with that. >>Um, and that's kind of what people want, because it's not so much like I'm Republican because he's Republican George Gilder and he bought American spectator. Everyone knows that. So, so obviously he's a Republican, but politics aside, the business side of what big data is implementing is massive. Now that I guess that's a Republican concept. Um, but not really. I mean, businesses is, is, uh, all parties. So relative to data caused them. I mean, no one talks about e-business anymore. We talking to IBM at the IBM conference and they were saying, Hey, that was a great marketing campaign, but no one says, Hey, uh, you and eat business today. So we think that big data is going to have the same effect, which is, Hey, are you, do you have big data? No, it's just assumed. Yeah. So that's what you're basically trying to establish that it's not just about big. >>Yeah. Let me give you one small example, um, from a business value standpoint and, uh, Ted Dunning, you mentioned Ted earlier, chief application architect, um, and one of the coauthors of, of, uh, the book hoot, which deals with machine learning, uh, he dealt with one of our large financial services, uh, companies, and, uh, you know, one of the techniques on Hadoop is, is clustering, uh, you know, K nearest neighbors, uh, you know, different algorithms. And they looked at a particular process and they sped up that process by 30,000 times. So there's a blog post, uh, that's on our website. You can find out additional information on that. And I, >>There's one >>Point on this one point, but I think, you know, to your point about business value and you know, what does data Kozum really mean? That's an incredible speed up, uh, in terms of, of performance and it changes how companies can react in real time. It changes how they can do pattern recognition. And Google did a really interesting paper called the unreasonable effectiveness of data. And in there they say simple algorithms on big data, on massive amounts of data, beat a complex model every time. And so I think what we'll see is a movement away from data sampling and trying to do an 80 20 to looking at all your data and identifying where are the exceptions that we want to increase because there, you know, revenue exceptions or that we want to address because it's a cost or a fraud. >>Well, that's what I, I would give a shout out to, uh, to the guys that digital reasoning Tim asked he's plugged, uh, Ted. It was idolized him in terms of his work. Obviously his work is awesome, but two, he brought up this concept of understanding gap and he showed an interesting chart in his keynote, which was the date explosion, you know, it's up and, you know, straight up, right. It's massive amount of data, 64% unstructured by his calculation. Then he showed out a flat line called attention. So as data's been exploding over time, going up attention mean user attention is flat with some uptick maybe, but so users and humans, they can't expand their mind fast enough. So machine learning technologies have to bridge that gap. That's analytics, that's insight. >>Yeah. There's a big conversation now going on about more data, better models, people trying to squint through some of the comments that Google made and say, all right, does that mean we just throw out >>The models and data trumps algorithms, data >>Trumps algorithms, but the question I have is do you think, and your customer is talking about, okay, well now they have more data. Can I actually develop better algorithms that are simpler? And is it a virtuous cycle? >>Yeah, it's I, I think, I mean, uh, there are there's, there are a lot of debate here, a lot of information, but I think one of the, one of the interesting things is given that compute cycles, given the, you know, kind of that compute efficiency that we have and given the bandwidth, you can take a model and then iterate very quickly on it and kind of arrive at, at insight. And in the past, it was just that amount of data in that amount of time to process. Okay. That could take you 40 days to get to the point where you can do now in hours. Right. >>Right. So, I mean, the great example is fraud detection, right? So we used the sample six months later, Hey, your credit card might've been hacked. And now it's, you know, you got a phone call, you know, or you can't use your credit card or whatever it is. And so, uh, but there's still a lot of use cases where, you know, whether is an example where modeling and better modeling would be very helpful. Uh, excellent. So, um, so Dana custom, are you planning other marketing initiatives around that? Or is this sort of tongue in cheek fun? Throw it out there. A little red meat into the chum in the waters is, >>You know, what really motivated us was, um, you know, the cubes here talking, you know, for the whole day, what could we possibly do to help give them a topic of conversation? >>Okay. Data cosmos. Now of course, we found that on our proprietary HBase tools, Jack Norris, thanks for coming in. We appreciate your support. You guys have been great. We've been following you and continue to follow. You've been a great support of the cube. Want to thank you personally, while we're here. Uh, Matt BARR has been generous underwriter supportive of our great independent editorial. We want to recognize you guys, thanks for your support. And we continue to look forward to watching you guys grow and kick ass. So thanks for all your support. And we'll be right back with our next guest after this short break. >>Thank you. >>10 years ago, the video news business believed the internet was a fat. The science is settled. We all know the internet is here to stay bubbles and busts come and go. But the industry deserves a news team that goes the distance coming up on social angle are some interesting new metrics for measuring the worth of a customer on the web. What zinc every morning, we're on the air to bring you the most up-to-date information on the tech industry with scrutiny on releases of the day and news of industry-wide trends. We're here daily with breaking analysis, from the best minds in the business. Join me, Kristin Filetti daily at the news desk on Silicon angle TV, your reference point for tech innovation 18 months.

Published Date : Oct 25 2012

SUMMARY :

And, uh, we're excited. We think, you know, this is, this is our strategy. Um, and, uh, you know, if you look at the different options out there, we not as a product when we have, we have customers when we announce that product and, um, you know, Because, uh, you guys are, um, have a big booth and big presence here at the show. uh, and, you know, use for everything from real-time analytics to you know, kind of basically written across that. Can you talk about that a little bit And, uh, you know, this stuff, it does everything. And those tend to be our, um, you know, Can you name some names and get uh, we had this beautiful customer video. uh, you know, you send them it's $99, I believe, and they'll send you a DNA so let's talk about, uh, Ted in a minute, but I want to ask you about the enterprise grade Hadoop conversation. So it functions like, uh, you know, like standard storage. is, you know, kind of Ross' speed. Can you automate that? And it's easy to be dependable, fast, and speed the same thing, making HBase, So the talk of the show right now, he had the keynote this morning is that map. there's a lot of big data washing going on where, um, you know, architectures that have been out there for you know, SQL interfaces on top of, of Hadoop, which are important. uh, yesterday and this morning, so opening, um, review was, you know, but no one says, Hey, uh, you and eat business today. uh, you know, K nearest neighbors, uh, you know, different algorithms. Point on this one point, but I think, you know, to your point about business value and you which was the date explosion, you know, it's up and, you know, straight up, right. that Google made and say, all right, does that mean we just throw out Trumps algorithms, but the question I have is do you think, and your customer is talking about, okay, well now they have more data. cycles, given the, you know, kind of that compute efficiency that we have and given And now it's, you know, you got a phone call, you know, We want to recognize you guys, thanks for your support. We all know the internet is here to stay bubbles and busts come and go.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Joe HellersteinPERSON

0.99+

George GilderPERSON

0.99+

Ted DunningPERSON

0.99+

Kristin FilettiPERSON

0.99+

Joel HellisonPERSON

0.99+

John SchroederPERSON

0.99+

JoePERSON

0.99+

JackPERSON

0.99+

Larry EllisonPERSON

0.99+

Jack NorrisPERSON

0.99+

JohnPERSON

0.99+

40 daysQUANTITY

0.99+

Melinda GrahamPERSON

0.99+

64%QUANTITY

0.99+

$99QUANTITY

0.99+

comScoreORGANIZATION

0.99+

TimPERSON

0.99+

DavePERSON

0.99+

TuesdayDATE

0.99+

Matt BARRPERSON

0.99+

HellersteinPERSON

0.99+

GoogleORGANIZATION

0.99+

George GilderPERSON

0.99+

TedPERSON

0.99+

John ferryPERSON

0.99+

30 yearsQUANTITY

0.99+

30,000 timesQUANTITY

0.99+

todayDATE

0.99+

IBMORGANIZATION

0.99+

a week laterDATE

0.99+

yesterdayDATE

0.99+

twoQUANTITY

0.99+

three companiesQUANTITY

0.99+

DanaPERSON

0.99+

Tim SDSPERSON

0.99+

one pointQUANTITY

0.99+

JavaTITLE

0.99+

firstQUANTITY

0.99+

six months laterDATE

0.99+

oneQUANTITY

0.99+

OracleORGANIZATION

0.99+

one customerQUANTITY

0.99+

LinuxTITLE

0.98+

once a weekQUANTITY

0.98+

18 monthsQUANTITY

0.98+

RubiconORGANIZATION

0.98+

HBaseTITLE

0.98+

KozumPERSON

0.98+

GartnerORGANIZATION

0.98+

this morningDATE

0.97+

TelekomORGANIZATION

0.97+

this weekDATE

0.97+

10 years agoDATE

0.97+

second dimensionQUANTITY

0.97+

bothQUANTITY

0.97+

KozumORGANIZATION

0.95+

third oneQUANTITY

0.95+

OneQUANTITY

0.94+

three thingsQUANTITY

0.94+

a year agoDATE

0.94+

HadoopTITLE

0.93+

siliconangle.comOTHER

0.93+

KnicksORGANIZATION

0.93+

RegentsORGANIZATION

0.92+

Jack Norris | Hadoop Summit 2012


 

>>Okay. We're back live in Silicon valley and San Jose, California for the continuous coverage of siliconangle.tv and have duke world 2012. This is ground zero for the alpha geeks in big data. Uh, just the tech elite. We call them tech athletes and, uh, we're excited to cover it on the ground. Extract the signal from the noise here. This is the cube, our flagship telecast. I'm joining my co-host Jeff Kelly from Wiki bond.org, the best analyst in the business. Jeff, welcome back for another segment. End of the day, day one loving every minute. Okay. We're here with our guest. Jack Norris is a cm of map bar Jack. Welcome back to the cube. You've been on a few times. Um, so you guys have some news. Yes. So let's get right to the news. So you guys are a player in the business, so share with your news, the folks. Excellent jump right in. >>So, uh, two big announcements today, we announced that Amazon is integrating map bar as part of their Lastic MapReduce service and both edition or, or free edition. M three is available as well as M five directly with Amazon, Amazon in the cloud. >>So what's the value proposition. Why would a customer say, all right, I want to do this in the cloud manpower, an Amazon cloud rather than doing it on premise. >>Okay. So let's start with, I mean, there's a lot of value propositions, all balled up into one here. Uh, first of all, in the cloud, it allows them to spin up very quickly. Within a couple minutes, you can get, uh, you know, hundreds of nodes available. Um, and, uh, and depending on where you're processing the data, if you've got a lot of data in the cloud already makes a lot of sense to do the Hadoop processing directly there. So that's, that's one area. A second is you might have an on-premise cloud deployment and need to have a disaster recovery. So map R provides point in time, snapshots, uh, as well as, as a white area replication. So you can use mirroring having Amazon available as a target is a huge advantage. And then there's also a third application area where you can do processing of the data in the cloud and then synchronize those results to an on-premise. So basically process where the data is combined the results into a cluster on premise. So you >>Don't have to move the raw data. Uh, >>On-premise actually, it's all about let's do the processing on the data. Well, you know, the whole, >>The value proposition and big data in general is let's not move, move data as little as possible. Yep. Uh, you know, so you bring the computation to the data, if you can. Uh, so what are your take on this event? I mean, we've got, uh, this is a, you know, the 4th of June summit, uh, you know, Hortonworks is now fully taken over the show and talk about what you see out here in terms of, uh, the other vendors that play. And, uh, just to kind of the attendees, the vibe you're seeing, >>Uh, it's a lot of excitement. I think a big difference between last year, which seemed to be very developer focused. We're seeing a lot of, a lot of presentations by customers. A lot of information was shared by our customers today. It was fun to see that, uh, comScore's shared, uh, shared their success. Boeing gap map is, uh, it was great for us. >>Fantastic. We look at Amazon, Amazon, first of all, is the gold standard for public cloud. Right? They've knocked it out of the park. Everyone knows Amazon. Um, but they've been criticized on the big data front because of the cycle times involve on. Um, and some developers and mean for web service spending up and down. No problem. Um, and we're seeing businesses like Netflix run on Amazon. So Amazon is not a stranger to running scale for cloud, but Hadoop has kind of been a klugey thing for Amazon. So I think, you know, talk about why Amazon and you guys is a good fit out to the market. The market reach is great. So you guys know and have a huge addressable market. Are you guys helping solve some of that complexity with the, uh, with the MapReduce side? What's, >>What's the core, I guess the first comment first response would be, I think every customer should have that type of Kluge. Uh, uh, they could have the success that Amazon has in Hadoop. They have a huge number of, of, uh, of Hadoop deployments have been very, very successful. I think, >>I mean, you know what I mean by it's natural, it's, cloogy everywhere right now. That's the problem. But Amazon has huge scale, um, and had not a natural fit. There >>Is not a natural fit >>For the data for the data component. And, uh, uh, the HBase for example, >>Component. So where were Amazons, you know, made it very frictionless is the ability to spin up Hadoop to do the analysis. The gap that was missing is some of the, the ha capabilities. The data protection features the disaster recovery, and, you know, we're map are now it gives options to those customers. You know, if they want those kinds of enterprise enterprise grade features, now they have an option within EMR. It can select a M five and, and get moving if they want a performance. And in NFS, they've got the M three options. >>Well, congratulations. I think it's a great deal for you guys and for Amazon customers. My question for you is, as you guys explore the enterprise ready equation, which has been a big topic this week, um, what does that mean to you guys? Cause it means different things to different people depends on where, how high up to OLTB do you go? Right? I mean, we're how far from batch to real time transactional, um, levels you go, I mean, low bash, no problem. But as you start to get more near real time, it's going to be a little bit different gray in this house used security HDFS. Yeah. >>Yeah. So, so duke represents the strategic platform, right? Deploying that in an organization, um, you know, moving from kind of an experimental kind of lab based to production environment creates a different set of feature requirements. How available is it? How easy is it to integrate, right? How do I kind of protect that information and how do I share it? So when we say enterprise grade, we mean you can have SLA, she can put the data there and, and be confident that the data will remain there, that you can have a point in time recovery for an application error or user mistake. Uh, you can have a disaster recovery features in place. And then the integration is about not recreating the wheel to get access to the information. So Hadoop is very powerful, but it requires interacting through an HDFS API. If you can leverage it like through map bar with NFS standard file based access standard ODBC access, open it up. >>So I can use a standard file browser applications to see and manipulate the data really opens up the use cases. And then finally, what we announced in two dot oh, was multitenancy features. So as you share that information, all of a sudden the SLA is of different groups and well, these guys need it immediately. And if you've got some low grade batch jobs are going to impact that. So you want the ability to protect, to isolate, to secure information, and basically have virtual clusters within a cluster. And those features are important to cloud, but they're also important to on-premise >>So great for the hybrid cloud environments out there. I mean, the multitenancy cracking the code on that. Exactly huge. I mean, that is basically, I mean, right now most enterprises are like private cloud because it's like, they're basically extension of their data center and you're seeing a lot more activity in the hybrid cloud as a gateway to the public cloud. So, >>And, and, you know, frankly, people are kind of struggling with in an experimental with Apache Hadoop and the other distributions, the policies are either at the individual file level or the whole cluster. And it all almost forced the creation of separate physical clusters, which kind of goes against the whole Hadoop concept. So the ability to manage it, a logical layer have separate volumes where you can apply policies to apply that applies to all the content underneath really kind of makes it much, much easier for administrators to kind of deal with these multiple use cases. >>Amazon, Amazon has always been one of those cases for the enterprise where it's been one of those and they've, this has been talked about for years, put the credit card down, go play on Amazon, but then bring it back into the it group for certification. And so I think this is a nice product for you guys to bring that comfort. You know, we're very >>Excited the enterprise saying, Hey, >>Come play in Amazon. It's Bulletproof enterprise. Ready? So congratulations. >>I wonder, can we talk, uh, talk use cases. So what are you seeing in terms of, uh, evolving use cases as, as, uh, duke continues to become more enterprise grade, uh, depending on your definition, uh, but how is that impacting what you're seeing in terms of, even if it's just, uh, you know, the, the, um, the mindset even people think now, okay, now it's enterprise grade, well, maybe, you know, in, in, depending on who you talk to, it's been that way for a bit, but what kind of, uh, use cases are you seeing develop now that it's kind of starting to gain acceptance? It's like, okay, we can trust our data is going to be there, et cetera. >>So th there's a huge range of use cases that, uh, different by industry, different by kind of dataset that's being used against everything from really a deep store where you can do analytics on it. So you're selecting the content to something that's very, very analytic machine learning intensive, where you're doing sophisticated clustering algorithms, uh, et cetera, um, where we've seen kind of an expansion of use cases are around real-time streaming and you get streaming data sets that are kind of entering into the cloud. And, um, some of the more mission, critical data moving beyond just maybe click stream data or things that if you happen to drop a few, you know, not a big deal, right. Versus the kind of trust the business type of content. >>Talk a little bit about the streaming, uh, aspects, uh, because of course, you know, we think of duke, we think of a batch system in terms of streaming data into Hadoop. You know, that's, that's a different, uh, that's something we don't, we haven't heard a lot about. So how do you guys approach that? >>So, uh, one of the artifacts of, of HDFS, which is a, is a distributed file system that scores in the underlying Linux file system, it's append only. So as an administrator, you decide, how frequently do I close the file item? I going to do that an hourly basis on it every eight hours, because you have to close the file for other applications to see the data that's been written. Right? So one of the innovations that, uh, that we pursued was to rewrite that create this dynamic read-write layer. So you can continue to write data in any application is seeing the latest data that's written. So you can Mount the cluster as if it's storage and just continue to write data. There really opens up what's, uh, what's possible companies like Informatica, they're all from a messaging product integrates directly in with, with Matt BARR and provides. >>So what kind of advantage does that provide to the end user? What w w translate that into real business value? Why, why is that important? >>Well, so one example is comScore, comScore handles 30 billion, uh, objects a day, uh, as they go out and try to measure the use of, of the web and being able to continually write and stream that information and scale and handle that in a real time and do analytics and turn around data faster, has tremendous business value to them. If they're stuck in a batch environment where the load times lengthen to the point where all of a sudden they can't keep up and they're actually reporting on, you know, old news. And I think the analogy is forecasting rain a day after it's wet. Isn't exactly valuable. >>Yeah. So you guys, obviously a great deal of the enterprise ready for Amazon, big story, big coup for the company. What's next for you. I want to ask that and make sure you get that out there on your agenda for the next year, but then I want you to take a step back a year, maybe a year and a half ago. Look back at how much has changed in this landscape. Um, share your perspective because the market has gone through an evolution where there's been a market opportunity, and then everyone goes, oh my God, it's bigger than we actually thought. I mean, Jeff, Kelly's a groundbreaking report about the $50 billion market is now being talked about as too low. So big data has absolutely opened up to a huge, and it's changed some of the tactics around strategies. So your strategy, Hortonworks strategy, even cloud era. So, and it's still evolving. So what's changed for the folks out there from a year and a half ago, a year ago to today, and then look out for the next 12 months. What's on your agenda. >>Well, if, if you look back, I think we've been fairly consistent. Um, uh, I'm, I'm not going to take credit for the vision of our CEO and CTO. Uh, but they recognized early on that Hadoop was, uh, was a strategic platform and to be a strategic platform that applied to the broadest number of use cases and organizations required some, some areas, uh, of innovation and particularly the how it, how it scaled, how it was managed, how you stored and protected the information needed a rearchitecture. And I think that, you know, architecture matters when you're going through a paradigm shift, having the right one in place creates this, this ability, you know, to speed innovation. And I think that's, if there's anything that's changed, I think it's the speed of innovation has even increased in the Hadoop community. I think it's, it's created a focus on these enterprise grade features on how do we store this valuable information and, and continue to explore. >>And I think one of the observations I'll make is that on that note is that it really focuses everyone to be just mind your own business and get the products out. You know what I'm saying? We've seen everyone, the product focus be the number one conversation. >>What we've seen is customers, you know, start and they expand rapidly. Some of that student data growth, but a lot of it is student more and more applications are being delivered and, and, uh, and, and the values kind of extracted from the hoop platform and success breeds success. Well, >>Congratulations for all your success, great win with Amazon web services and make that a little bit more easier, more robust, and more, more features for them and you, uh, more revenue for part of our, um, and I want to personally thank you for your support to the cube. Uh, we've expanded with a new studio B software for extra extra interviews, um, and wanna expand the conversation, thanks to your generous support. You can bring the independent coverage out to the market and, um, great community, thanks for helping us out. And we appreciate it. So thank you. Okay. Jack Dorsey with Matt bar, we'll be right back to wrap up day one with that. Jeff and I will give our analysis right at the short break.

Published Date : Jun 14 2012

SUMMARY :

So you guys are a player in the business, so share with your news, Amazon in the cloud. So what's the value proposition. And then there's also a third application area where you can do processing of the data in Don't have to move the raw data. Well, you know, the whole, uh, you know, Hortonworks is now fully taken over the show and talk about what you see out here in terms of, uh, it was great for us. So I think, you know, talk about why Amazon and you guys is a good fit out What's the core, I guess the first comment first response would be, I think every customer I mean, you know what I mean by it's natural, it's, cloogy everywhere right now. For the data for the data component. the disaster recovery, and, you know, we're map are now it gives options to those customers. I think it's a great deal for you guys and for Amazon customers. that the data will remain there, that you can have a point in time recovery for an application error or user mistake. So as you share that information, So great for the hybrid cloud environments out there. So the ability to manage it, And so I think this is a nice product for you guys to So congratulations. So what are you seeing in terms of, uh, evolving use cases as, really a deep store where you can do analytics on it. Talk a little bit about the streaming, uh, aspects, uh, because of course, you know, we think of duke, I going to do that an hourly basis on it every eight hours, because you have to close the file for other applications actually reporting on, you know, old news. I want to ask that and make sure you get that And I think that, you know, architecture matters when you're going through a paradigm shift, And I think one of the observations I'll make is that on that note is that it really focuses everyone to be What we've seen is customers, you know, start and they expand rapidly. You can bring the independent coverage out to the market and, um, great community,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeff KellyPERSON

0.99+

JeffPERSON

0.99+

AmazonORGANIZATION

0.99+

Jack NorrisPERSON

0.99+

Jack DorseyPERSON

0.99+

NetflixORGANIZATION

0.99+

$50 billionQUANTITY

0.99+

Silicon valleyLOCATION

0.99+

30 billionQUANTITY

0.99+

todayDATE

0.99+

InformaticaORGANIZATION

0.99+

a year agoDATE

0.99+

next yearDATE

0.99+

comScoreORGANIZATION

0.99+

a year and a half agoDATE

0.99+

KellyPERSON

0.99+

last yearDATE

0.99+

AmazonsORGANIZATION

0.99+

LinuxTITLE

0.99+

Matt BARRPERSON

0.99+

San Jose, CaliforniaLOCATION

0.99+

one exampleQUANTITY

0.98+

one areaQUANTITY

0.97+

third applicationQUANTITY

0.97+

MattPERSON

0.97+

oneQUANTITY

0.97+

HadoopTITLE

0.97+

this weekDATE

0.96+

2012DATE

0.95+

hundreds of nodesQUANTITY

0.94+

HortonworksORGANIZATION

0.94+

JackPERSON

0.93+

both editionQUANTITY

0.93+

a dayQUANTITY

0.93+

two big announcementsQUANTITY

0.92+

secondQUANTITY

0.9+

next 12 monthsDATE

0.88+

day oneQUANTITY

0.86+

two dotQUANTITY

0.85+

M threeOTHER

0.85+

M threeTITLE

0.84+

MapReduceORGANIZATION

0.82+

Hadoop Summit 2012EVENT

0.79+

first responseQUANTITY

0.79+

every eight hoursQUANTITY

0.78+

SLATITLE

0.77+

JuneDATE

0.77+

first commentQUANTITY

0.77+

Lastic MapReduceTITLE

0.69+

M fiveOTHER

0.69+

BoeingORGANIZATION

0.68+

M fiveTITLE

0.67+

siliconangle.tvOTHER

0.67+

ground zeroQUANTITY

0.67+

Wiki bond.orgORGANIZATION

0.62+

ApacheORGANIZATION

0.61+

4th ofEVENT

0.6+

Jack Norris - Strata Conference 2012 - theCUBE


 

>>Hi everybody. We're back. This is Dave Volante from Wiki bond.org. We're live at strata in Santa Clara, California. This is Silicon angle TVs, continuous coverage of the strata conference. So Riley media or Raleigh media is a great partner of ours. And thanks to them for allowing us to be here. We've been going all week cause it's day three for us. I'm here with Jeff Kelly Wiki bonds that lead big data analysts. And we're here with Jack Norris. Who's the VP of marketing at Matt bar Jack. Welcome to the cube. Thank you, Dave. Thanks very much for coming on. And you know, we've been going all week. You guys are a great sponsor of ours. Thank you for the support. We really appreciate it. How's the show going for you? >>Great. A lot of attention, a lot of focus, a lot of discussion about Hadoop and big data. >>Yeah. So you guys getting a lot of traffic. I mean, it says I hear this 2,500 people here up from 1400 last year. So that's >>Yeah, we've had like five, six people deep in the, in the booth. So I think there's a lot of, a lot of interests. There's interesting. >>You know, when we were here last year, when you looked at the, the infrastructure and the competitive landscape, there wasn't a lot going on and just a very short time, that's completely changed. And you guys have had your hand in that. So, so that's good. Competition is a good thing, right? And, and obviously customers want choice, but so we want to talk about that a little bit. We want to talk about map bar, the kind of problems you're solving. So why don't we start there? What is map are all about? And you've got your own distribution of, of, of enterprise Hadoop. You make it Hadoop enterprise ready? Let's start there. >>Okay. Yeah, I mean, we invested heavily in creating a alternative distribution one that took the best of the open source community with the best of the map, our innovations, and really it's, it's about making Hadoop more applicable, broader use cases, more mission, critical support, you know, being able to sit in and work in a lights out data center environment. >>Okay. So what was the problem that you set out to solve? Why, why do, why do we need another distribution of Hadoop? Let me ask it that way. Get nice and close to. >>So there, there are some just big issues with, with the duke. >>One of those issues, let's talk about that. There's >>Some ease of use issues. There's some deep dependability issues. There's some, some performance. So, you know, let's take those in order right now. If you look at some of the distributions, Apache Hadoop, great technology, but it requires a programmer, right? To get access to the data it's through the Hadoop API, you can't really see the data. So there's a lot of focus of, you know, what do I do once the data's in there opening that up, providing a full file based access, right? So I can look at it and treat it like enterprise storage, see the data, use my standard tools, standard commands, you know, drag and drop from a file browser. You can do that with Matt bar. You can't do that with other districts >>Talking about mountain HDFS as a NFS correct >>Example. Correct. And then, and then just the underlying storage services. The fact that it's append only instead of full random read-write, you know, causes some, some issues. So, you know, that's some of the, the ease of use features. There's a whole lot. We could discuss there. Big picture for reliability. Dependability is there's a single point of failure, multiple single points of failure within Hadoop. So you risk data loss. So people have looked at Hadoop. Traditionally is, is batch oriented. Scratchpad right. We were out to solve that, right? We want to make sure that you can use it for mission critical data, that you don't have a risk of a data loss that you've got full high availability. You've got the full data protection in terms of snapshots and mirroring that you would expect with the enterprise products. >>It gets back to when you guys were, you know, thinking about doing this. I'm not even sure you were at the company at the time, but you, your DNA was there and you're familiar with it. So you guys saw this big data movement. You saw this at duke moon and you said, okay, this is cool. It's going to be big. And it's gonna take a long time for the community to fix all these problems. We can fix them. Now let's go do that. Is that the general discussion? Yeah. >>You know, I think, I think the what's different about this. This is the first open source package. The first open source project that's created a market. If you look at the other open source, you know, Linux, my SQL, et cetera, it was really late in the life cycle of a product. Everyone knew what the features were. It was about, you know, giving an alternative choice, better Unix. Your, your, the focus is on innovation and our founders, you know, have deep enterprise background or CTO was at Google and charge of big table, understands MapReduce at scale, spent time as chief software architect at Spinnaker, which was kind of the fastest clustered Nazanin on the planet. So recognize that the underlying layers of Hadoop needed some rearchitecture and needed some deep investment and to do that effectively and do that quickly required a whole lot of focus. And we thought that was the best way to go to market. >>Talk about the early validation from customers. Obviously you guys didn't just do this in a vacuum, I presume. So you went out and talked to some customers. Yeah. >>What sorts of conversations with customers, why we're in stealth mode? We're probably the loudest stealth >>As you were nodding. And I mean, what were they telling you at the time? Yeah, please go do this. >>The, what we address weren't secrets. I there've been gyrus for open for four or five years on, on these issues. >>Yeah. But at the same time, Jack, you've got this, you got this purist community out there that says, I don't want to, I don't want to rip out HDFS. You know, I want it to be pure. What'd you, what'd you say to those guys, you just say, okay, thank you. We, we understand you're not a prospect. >>And I think, I think that, you know, duke has a huge amount of momentum. And I think a lot of that momentum is that there isn't any risks to adopting Hadoop, right? It's not like the fractured no SQL market where there's 122 different entrance, which one's going to win. Hadoop's got the ecosystem. So when you say pure, it's about the API APIs, it's about making sure that if I create a MapReduce job, it's going to run an Apache. It's going to run a map bar. It's going to run on the other distributions. That's where I think that the heat and the focus is now to do that. You also have to have innovation occurring up and down the stack that that provides choice and alternatives for. >>So when I'm talking about purists, I don't, I agree with you the whole lock-in thing, which is the elephant in the room here. People will worry about lock-in >>Pun intended. >>No, no, but good one good catch. But so, but you're basically saying, Hey, where we're no more locked in than cloud era. Right. I mean, they've got their own >>Actually. I think we're less because it's so easy to get data in and out with our NFS. That there's probably less so, >>So, and I'm gonna come back to that. But so for instance, many, when I, when I say peers, I mean some users in ISV, some guys we've had on here, we had an Abby Mehta from Triceda on the other day, for instance, he's one who said, I just don't have time to mess with that stuff and figure out all that API integration. I mean, there are people out there that just don't want to go that route. Okay. But, but you're saying I'm, I'm inferring this plenty who do right. >>And the, and by the API route, I want to make sure I understand what you're saying. You >>Talked about, Hey, it's all about the API integration. It's not >>About, it's not the, it it's about the API APIs being consistent, a hundred percent compatible. Right. So if I, you know, write a program, that's, that's going after HDFS and the HDFS API, I want to make sure that that'll run on other distributions. Right. >>And that's your promise. Yeah. Okay. All right. So now where I was going with this was th again, there are some peers to say, oh, I just don't want to mess with all that. Now let's talk about what that means to mess with all that. So comScore was a big, high profile case study for you guys. They, they were cloud era customer. They basically, in my understanding is a couple of days migrated from Cloudera to Mapbox. And the impetus was, let's talk about that. Why'd they do that >>Performance data protection, ease of use >>License fee issues. There was some license issues there as well, right? The, the, your, your maintenance pricing was more attractive. Is that true? Or >>I read more mainly about price performance and reliability, and, you know, they tested our stuff at work real well in a test environment, they put it in production environment. Didn't actually tell all their users, they had one guys debug the software for half a day because something was wrong. It finished so quickly. >>So, so it took him a couple of days to migrate and then boom, >>Boom. And they've, they handle about 30 billion objects a day. So there, you know, the use of that really high performance support for, for streaming data flows, you know, they're talking about, they're doing forecasts and insights into web behavior, and, you know, they w the earlier they can do that, the better off they are. So >>Greg, >>So talk about the implications of, of your approach in terms of the customer base. So I'm, I'm imagining that your customers are more, perhaps advanced than a lot of your typical Hadoop users who are just getting started tinkering with Hadoop. Is it fair to say, you know, your customers know what they want and they want performance and they want it now. And they're a little more advanced than perhaps some of the typical early adopters. >>We've got people to go to our website and download the free version. And some of them are just starting off and getting used to Hadoop, but we did specifically target those very experienced Hadoop users that, you know, we're kind of, you know, stubbing their toes on, on the issues. And so they're very receptive to the message of we've made it faster. We've made it more reliable, you know, we've, we've added a lot of ease of use to the, to the Hindu. >>So I found this, let me interrupt, go back to what I was saying before is I found this comment that I found online from Mike Brown comScore. Skipio I presume you mean, he said comScore's map our direct access NFS feature, which exposes a duke distributed file system data as NFS files can then be easily mounted, modified, or overwritten. So that's a data access simplification. You also said we could capitalize on the purchase of map bar with an annual maintenance charge versus a yearly cost per node. NFS allowed our enterprise systems to easily access the data in the cluster. So does that make sense to you that, that enterprise of that annual maintenance charge versus yearly cost per node? I didn't get that. >>Oh, I think he's talking about some, some organizations prefer to do a perpetual license versus a subscription model that's >>Oh, okay. So the traditional way of licensing software >>And that, that you have to do it basically reinforces the fact that we've really invested in have kind of a, a product, you know, orientation rather than just services on top of, of some opensource. >>Okay. So you go in, you license it and then yeah. Perpetual license. >>Then you can also start with the free edition that does all the performance NFS support kick the tires >>Before you buy it. Sorry. Sorry, Jeff. Sorry to interrupt. No, no problem >>At all. So another topic, a lot of interest is security making a dupe enterprise ready. One of the pillars, there is security, making sure access controls, for instance, making sure let's talk about how you guys approach that and maybe how you differentiate from some of the other vendors out there, or the other >>Full Kerberos support. We Lincoln to enterprise standards for access eldap, et cetera. We leveraged the Linux, Pam security, and we also provide volume control. So, you know, right now in Hindu in Apache to dupe other distributions, you put policies at the file level or the entire cluster. And we see many organizations having separate physical clusters because of that limitation, right? And we'd provide volume. So you can define a volume. And in that volume control, access control, administrative privileges data protection class, and, you know, in a sense kind of segregate that content. And that provides a lot of, a lot of control and a lot more, you know, security and protection and separation of data. >>That scenario, the comScore scenario, common where somebody's moving off an existing distribution onto a map are, or, or you more going, going, seeing demand from new customers that are saying, Hey, what's this big data thing I really want to get into it. How's it shake out there >>Right now? There's this huge pent up demand for these features. And we're seeing a lot of people that have run on other distributions switched to map our >>A little bit of everything. How about, can you talk a little bit about your, your channel? You go to market strategy, maybe even some of your ecosystem and partnerships in the little time. >>Sure. So EMC is a big partner of the EMC Greenplum Mr. Edition is basically a map R you can start with any of our additions and upgrade to that. Greenplum with just a licensed key that gives us worldwide service and support. It's been a great partnership. >>We hear a lot of proof of concepts out there >>For, yeah. And then it just hit the news news today about EMC's distribution, Mr. Distribution being available with UCS Cisco's ECS gear. So now that's further expanded the, the footprint that we have about. >>Okay. So you're the EMC relationship. Anything else that you can share with us? >>We have other announcements coming out and >>Then you want to pre-announce in the queue. >>Oops. Did I let that slip >>It's alive? So be careful. And so, in terms of your, your channel strategy, you guys mostly selling direct indirect combination, >>It's it? It, it's kind of an indirect model through these, these large partners with a direct assist. >>Yeah. Okay. So you guys come in and help evangelize. Yep. Excellent. All right. Do you have anything else before we gotta got a roll here? >>Yeah, I did wonder if you could talk a little bit about, you mentioned EMC Greenplum so there's a lot of talk about the data warehouse market, the MPB data warehouses, versus a Hadoop based on that relationship. I'm assuming that Matt BARR thinks well, they're certainly complimentary. Can you just touch on that? And, you know, as opposed to some who think, well, Hadoop is going to be the platform where we go, >>Well, th th there's just, I mean, if you look at the typical organization, they're just really trying to get their, excuse me, their arms around a lot of this machine generated content, this, you know, unstructured data that just growing like wildfire. So there's a lot of Paducah specific use cases that are being rolled out. They're also kind of data lakes, data, oceans, whatever you want to call it, large pools where that information is then being extracted and loaded into data warehouses for further analysis. And I think the big pivot there is if it's well understood what the issue is, you define the schema, then there's a whole host of, of data warehouse applications out there that can be deployed. But there's many things where you don't really understand that yet having to dupe where you don't need to find a schema a is a, is a big value, >>Jack, I'm sorry. We have to go run a couple of minutes behind. Thank you very much for coming on the cube. Great story. Good luck with everything. And sounds like things are really going well and market's heating up and you're in the right place at the right time. So thank you again. Thank you to Jeff. And we'll be right back everybody to the strata conference live in Santa Clara, California, right after this word from our.

Published Date : Apr 27 2012

SUMMARY :

And you know, we've been going all week. A lot of attention, a lot of focus, a lot of discussion about Hadoop So that's So I think there's a lot of, And you guys have had your hand in that. broader use cases, more mission, critical support, you know, being able to sit in and work Let me ask it that way. So there, there are some just big issues with, One of those issues, let's talk about that. So there's a lot of focus of, you know, what do I do once the data's in So you risk data loss. It gets back to when you guys were, you know, thinking about doing this. It was about, you know, giving an alternative choice, better Unix. So you went out and talked to some customers. And I mean, what were they telling you at the time? I there've been gyrus for open for four or five You know, I want it to be And I think, I think that, you know, duke has a huge amount of momentum. So when I'm talking about purists, I don't, I agree with you the whole lock-in thing, I mean, they've got their own I think we're less because it's so easy to get data in and out with our NFS. So, and I'm gonna come back to that. And the, and by the API route, I want to make sure I understand what you're saying. Talked about, Hey, it's all about the API integration. So if I, you know, write a program, that's, that's going after for you guys. Is that true? and, you know, they tested our stuff at work real well in a test environment, they put it in production environment. you know, the use of that really high performance support for, to say, you know, your customers know what they want and they want performance and they want it now. experienced Hadoop users that, you know, we're kind of, you know, So does that make sense to you that, So the traditional way of licensing software And that, that you have to do it basically reinforces the fact that we've really invested in have kind Before you buy it. for instance, making sure let's talk about how you guys approach that and maybe how you differentiate from a lot of control and a lot more, you know, security and protection and separation of data. off an existing distribution onto a map are, or, or you more going, And we're seeing a lot of people that have run on other distributions switched to map our How about, can you talk a little bit about your, your channel? Mr. Edition is basically a map R you can start with any of our additions So now that's further Anything else that you can share with us? you guys mostly selling direct indirect combination, It, it's kind of an indirect model through these, these large partners with Do you have anything else before And, you know, as opposed to some who think, excuse me, their arms around a lot of this machine generated content, this, you know, So thank you again.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

JeffPERSON

0.99+

Jack NorrisPERSON

0.99+

fiveQUANTITY

0.99+

Dave VolantePERSON

0.99+

JackPERSON

0.99+

EMCORGANIZATION

0.99+

last yearDATE

0.99+

Matt BARRPERSON

0.99+

fourQUANTITY

0.99+

UCSORGANIZATION

0.99+

2,500 peopleQUANTITY

0.99+

Santa Clara, CaliforniaLOCATION

0.99+

GregPERSON

0.99+

GoogleORGANIZATION

0.99+

Mike BrownPERSON

0.99+

half a dayQUANTITY

0.99+

SpinnakerORGANIZATION

0.99+

HadoopTITLE

0.99+

comScoreORGANIZATION

0.99+

five yearsQUANTITY

0.99+

RileyORGANIZATION

0.98+

EMC GreenplumORGANIZATION

0.98+

Abby MehtaPERSON

0.98+

LinuxTITLE

0.97+

strata conferenceEVENT

0.97+

SQLTITLE

0.97+

OneQUANTITY

0.97+

one guysQUANTITY

0.97+

todayDATE

0.97+

RaleighORGANIZATION

0.97+

122 different entranceQUANTITY

0.97+

six peopleQUANTITY

0.97+

SkipioPERSON

0.96+

Jeff KellyPERSON

0.95+

single pointQUANTITY

0.95+

about 30 billion objects a dayQUANTITY

0.94+

Strata Conference 2012EVENT

0.93+

ECSORGANIZATION

0.93+

hundred percentQUANTITY

0.91+

TricedaORGANIZATION

0.9+

ApacheTITLE

0.9+

firsQUANTITY

0.9+

PaducahLOCATION

0.89+

GreenplumORGANIZATION

0.89+

single pointsQUANTITY

0.88+

day threeQUANTITY

0.88+

NFSTITLE

0.87+

Wiki bond.orgOTHER

0.87+

1400QUANTITY

0.85+

UnixTITLE

0.85+

Wiki bondsORGANIZATION

0.84+

Silicon angleORGANIZATION

0.83+

MapboxORGANIZATION

0.78+

ApacheORGANIZATION

0.76+

MapReduceORGANIZATION

0.75+

KerberosORGANIZATION

0.75+

first openQUANTITY

0.74+

PamTITLE

0.73+

Matt barORGANIZATION

0.73+

NazaninORGANIZATION

0.61+

ClouderaTITLE

0.59+

moonLOCATION

0.58+

CiscoORGANIZATION

0.54+

oneQUANTITY

0.53+

daysQUANTITY

0.52+

MapReduceTITLE

0.47+

Daniel Newman, Futurum Research | AnsibleFest 2022


 

>>Hey guys. Welcome back to the Cubes coverage of Ansible Fast 2022. This is day two of our wall to wall coverage. Lisa Martin here with John Ferer. John, we're seeing this world where companies are saying if we can't automate it, we need to, The automation market is transforming. There's been a lot of buzz about that. A lot of technical chops here at Ansible Fest. >>Yeah, I mean, we've got a great guest here coming on Cuba alumni, Dean Newman, future room. He travels every event he's got. He's got his nose to the grindstone ear to the ground. Great analysis. I mean, we're gonna get into why it's important. How does Ansible fit into the big picture? It's really gonna be a great segment. The >>Board do it well, John just did my job for me about, I'll introduce him again. Daniel Newman, one of our alumni is Back Principal Analyst at Future and Research. Great to have you back on the cube. >>Yeah, it's good to join you. Excited to be back in Chicago. I don't know if you guys knew this, but for 40 years, this was my hometown. Now I don't necessarily brag about that anymore. I'm, I live in Austin now. I'm a proud Texan, but I did grow up here actually out in the west suburbs. I got off the plane, I felt the cold air, and I almost turned around and said, Does this thing go back? Yeah. Cause I'm, I've, I've grown thin skin. It did not take me long. I, I like the warm, Come on, >>I'm the saying, I'm from California and I got off the plane Monday. I went, Whoa, I need a coat. And I was in Miami a week ago and it was 85. >>Oh goodness. >>Crazy. So you just flew in. Talk about what's going on, your take on, on Ansible. We've talked a lot with the community, with partners, with customers, a lot of momentum. The flywheel of the community is going around and round and round. What are some of your perspectives that you see? >>Yeah, absolutely. Well, let's you know, I'm gonna take a quick step back. We're entering an era where companies are gonna have to figure out how to do more with less. Okay? We've got exponential data growth, we've got more architectural complexity than ever before. Companies are trying to discern how to deal with many different environments. And just at a macro level, Red Hat is one of the companies that is almost certainly gonna be part of this multi-cloud hybrid cloud era. So that should initially give a lot of confidence to the buying group that are looking at how to automate their environments. You're automating workflows, but really with, with Ansible, we're focused on automating it, automating the network. So as companies are kind of dig out, we're entering this recessionary period, Okay, we're gonna call it what it is. The first thing that they're gonna look at is how do we tech our way out of it? >>I had a wonderful one-on-one conversation with ServiceNow ceo, Bill McDermott, and we saw ServiceNow was in focus this morning in the initial opening session. This is the integration, right? Ansible integrating with ServiceNow. What we need to see is infrastructure automation, layers and applications working in concert to basically enable enterprises to be up and running all the time. Let's first fix the problems that are most common. Let's, let's automate 'em, let's script them. And then at some point, let's have them self resolving, which we saw at the end with Project Wisdom. So as I see it, automation is that layer that enterprises, boards, technologists, all can agree upon are basically here's something that can make our business more efficient, more profitable, and it's gonna deal with this short term downturn in a way that tech is actually gonna be the answer. Just like Bill and I said, let's tech our way out of it. >>If you look at the Red Hat being bought by ibm, you see Project Wisdom Project, not a product, it's a project. Project Wisdom is the confluence of research and practitioners kind of coming together with ai. So bringing AI power to the Ansible is interesting. Red Hat, Linux, Rel OpenShift, I mean, Red Hat's kind of position, isn't it? Kind of be in that right spot where a puck might be coming maybe. I mean, what do you think? >>Yeah, as analysts, we're really good at predicting the, the recent past. It's a joke I always like to make, but Red Hat's been building toward the future. I think for some time. Project Wisdom, first of all, I was very encouraged with it. One of the things that many people in the market probably have commented on is how close is IBM in Red Hat? Now, again, it's a $34 billion acquisition that was made, but boy, the cultures of these two companies couldn't be more different. And of course, Red Hat kind of carries this, this sort of middle ground layer where they provide a lot of value in services to companies that maybe don't use IBM at, at, for the public cloud especially. This was a great indication of how you can take the power of IBM's research, which of course has some of the world's most prolific data scientists, engineers, building things for the future. >>You know, you see things like yesterday they launched a, you know, an AI solution. You know, they're building chips, semiconductors, and technologies that are gonna power the future. They're building quantum. Long story short, they have these really brilliant technologists here that could be adding value to Red Hat. And I don't know that the, the world has fully been able to appreciate that. So when, when they got on stage and they kind of say, Here's how IBM is gonna help power the next generation, I was immediately very encouraged by the fact that the two companies are starting to show signs of how they can collaborate to offer value to their customers. Because of course, as John kind of started off with, his question is, they've kind of been where the puck is going. Open source, Linux hybrid cloud, This is the future. In the future. Every company's multi-cloud. And I said in a one-on-one meeting this morning, every company is going to probably have workloads on every cloud, especially large enterprises. >>Yeah. And I think that the secret's gonna be how do you make that evolve? And one of the things that's coming out of the industry over the years, and looking back as historians, we would say, gotta have standards. Well, with cloud, now people standards might slow things down. So you're gonna start to figure out how does the community and the developers are thinking it'll be the canary in the coal mine. And I'd love to get your reaction on that, because we got Cuban next week. You're seeing people kind of align and try to win the developers, which, you know, I always laugh cuz like, you don't wanna win, you want, you want them on your team, but you don't wanna win them. It's like a, it's like, so developers will decide, >>Well, I, I think what's happening is there are multiple forces that are driving product adoption. And John, getting the developers to support the utilization and adoption of any sort of stack goes a long way. We've seen how sticky it can be, how sticky it is with many of the public cloud pro providers, how sticky it is with certain applications. And it's gonna be sticky here in these interim layers like open source automation. And Red Hat does have a very compelling developer ecosystem. I mean, if you sat in the keynote this morning, I said, you know, if you're not a developer, some of this stuff would've been fairly difficult to understand. But as a developer you saw them laughing at jokes because, you know, what was it the whole part about, you know, it didn't actually, the ping wasn't a success, right? And everybody started laughing and you know, I, I was sitting next to someone who wasn't technical and, and you know, she kinda goes, What, what was so funny? >>I'm like, well, he said it worked. Do you see that? It said zero data trans or whatever that was. So, but if I may just really quickly, one, one other thing I did wanna say about Project Wisdom, John, that the low code and no code to the full stack developer is a continuum that every technology company is gonna have to think deeply about as we go to the future. Because the people that tend to know the process that needs to be automated tend to not be able to code it. And so we've seen every automation company on the planet sort of figuring out and how to address this low code, no code environment. I think the power of this partnership between IBM Research and Red Hat is that they have an incredibly deep bench of capabilities to do things like, like self-training. Okay, you've got so much data, such significant size models and accuracy is a problem, but we need systems that can self teach. They need to be able self-teach, self learn, self-heal so that we can actually get to the crux of what automation is supposed to do for us. And that's supposed to take the mundane out and enable those humans that know how to code to work on the really difficult and hard stuff because the automation's not gonna replace any of that stuff anytime soon. >>So where do you think looking at, at the partnership and the evolution of it between IBM research and Red Hat, and you're saying, you know, they're, they're, they're finally getting this synergy together. How is it gonna affect the future of automation and how is it poised to give them a competitive advantage in the market? >>Yeah, I think the future or the, the competitive space is that, that is, is ecosystems and integration. So yesterday you heard, you know, Red Hat Ansible focusing on a partnership with aws. You know, this week I was at Oracle Cloud world and they're talking about running their database in aws. And, and so I'm kind of going around to get to the answer to your question, but I think collaboration is sort of the future of growth and innovation. You need multiple companies working towards the same goal to put gobs of resources, that's the technical term, gobs of resources towards doing really hard things. And so Ansible has been very successful in automating and securing and focusing on very certain specific workloads that need to be automated, but we need more and there's gonna be more data created. The proliferation, especially the edge. So you saw all this stuff about Rockwell, How do you really automate the edge at scale? You need large models that are able to look and consume a ton of data that are gonna be continuously learning, and then eventually they're gonna be able to deliver value to these companies at scale. IBM plus Red Hat have really great resources to drive this kind of automation. Having said that, I see those partnerships with aws, with Microsoft, with ibm, with ServiceNow. It's not one player coming to the table. It's a lot of players. They >>Gotta be Switzerland. I mean they have the Switzerland. I mean, but the thing about the Amazon deal is like that marketplace integration essentially puts Ansible once a client's in on, on marketplace and you get the central on the same bill. I mean, that's gonna be a money maker for Ansible. I >>Couldn't agree more, John. I think being part of these public cloud marketplaces is gonna be so critical and having Ansible land and of course AWS largest public cloud by volume, largest marketplace today. And my opinion is that partnership will be extensible to the other public clouds over time. That just makes sense. And so you start, you know, I think we've learned this, John, you've done enough of these interviews that, you know, you start with the biggest, with the highest distribution and probability rates, which in this case right now is aws, but it'll land on in Azure, it'll land in Google and it'll continue to, to grow. And that kind of adoption, streamlining make it consumption more consumable. That's >>Always, I think, Red Hat and Ansible, you nailed it on that whole point about multicloud, because what happens then is why would I want to alienate a marketplace audience to use my product when it could span multiple environments, right? So you saw, you heard that Stephanie yesterday talk about they, they didn't say multiple clouds, multiple environments. And I think that is where I think I see this layer coming in because some companies just have to work on all clouds. That's the way it has to be. Why wouldn't you? >>Yeah. Well every, every company will probably end up with some workloads in every cloud. I just think that is the fate. Whether it's how we consume our SaaS, which a lot of people don't think about, but it always tends to be running on another hyperscale public cloud. Most companies tend to be consuming some workloads from every cloud. It's not always direct. So they might have a single control plane that they tend to lead the way with, but that is only gonna continue to change. And every public cloud company seems to be working on figuring out what their niche is. What is the one thing that sort of drives whether, you know, it is, you know, traditional, we know the commoditization of traditional storage network compute. So now you're seeing things like ai, things like automation, things like the edge collaboration tools, software being put into the, to the forefront because it's a different consumption model, it's a different margin and economic model. And then of course it gives competitive advantages. And we've seen that, you know, I came back from Google Cloud next and at Google Cloud next, you know, you can see they're leaning into the data AI cloud. I mean, that is their focus, like data ai. This is how we get people to come in and start using Google, who in most cases, they're probably using AWS or Microsoft today. >>It's a great specialty cloud right there. That's a big use case. I can run data on Google and run something on aws. >>And then of course you've got all kinds of, and this is a little off topic, but you got sovereignty, compliance, regulatory that tends to drive different clouds over, you know, global clouds like Tencent and Alibaba. You know, if your workloads are in China, >>Well, this comes back down at least to the whole complexity issue. I mean, it has to get complex before it gets easier. And I think that's what we're seeing companies opportunities like Ansible to be like, Okay, tame, tame the complexity. >>Yeah. Yeah, I totally agree with you. I mean, look, when I was watching the demonstrations today, my take is there's so many kind of simple, repeatable and mundane tasks in everyday life that enterprises need to, to automate. Do that first, you know? Then the second thing is working on how do you create self-healing, self-teaching, self-learning, You know, and, and I realize I'm a little broken of a broken record at this, but these are those first things to fix. You know, I know we want to jump to the future where we automate every task and we have multi-term conversational AI that is booking our calendars and driving our cars for us. But in the first place, we just need to say, Hey, the network's down. Like, let's make sure that we can quickly get access back to that network again. Let's make sure that we're able to reach our different zones and locations. Let's make sure that robotic arm is continually doing the thing it's supposed to be doing on the schedule that it's been committed to. That's first. And then we can get to some of these really intensive deep metaverse state of automation that we talk about. Self-learning, data replication, synthetic data. I'm just gonna throw terms around. So I sound super smart. >>In your customer conversations though, from an looking at the automation journey, are you finding most of them, or some percentage is, is wanting to go directly into those really complex projects rather than starting with the basics? >>I don't know that you're, you're finding that the customers want to do that? I think it's the architecture that often ends up being a problem is we as, as the vendor side, will tend to talk about the most complex problems that they're able to solve before companies have really started solving the, the immediate problems that are before them. You know, it's, we talk about, you know, the metaphor of the cloud is a great one, but we talk about the cloud, like it's ubiquitous. Yeah. But less than 30% of our workloads are in the public cloud. Automation is still in very early days and in many industries it's fairly nascent. And doing things like self-healing networks is still something that hasn't even been able to be deployed on an enterprise-wide basis, let alone at the industrial layer. Maybe at the company's on manufacturing PLAs or in oil fields. Like these are places that have difficult to reach infrastructure that needs to be running all the time. We need to build systems and leverage the power of automation to keep that stuff up and running. That's, that's just business value, which by the way is what makes the world go running. Yeah. Awesome. >>A lot of customers and users are struggling to find what's the value in automating certain process, What's the ROI in it? How do you help them get there so that they understand how to start, but truly to make it a journey that is a success. >>ROI tends to be a little bit nebulous. It's one of those things I think a lot of analysts do. Things like TCO analysis Yeah. Is an ROI analysis. I think the businesses actually tend to know what the ROI is gonna be because they can basically look at something like, you know, when you have an msa, here's the downtime, right? Business can typically tell you, you know, I guarantee you Amazon could say, Look for every second of downtime, this is how much commerce it costs us. Yeah. A company can generally say, if it was, you know, we had the energy, the windmills company, like they could say every minute that windmill isn't running, we're creating, you know, X amount less energy. So there's a, there's a time value proposition that companies can determine. Now the question is, is about the deployment. You know, we, I've seen it more nascent, like cybersecurity can tend to be nascent. >>Like what does a breach cost us? Well there's, you know, specific costs of actually getting the breach cured or paying for the cybersecurity services. And then there's the actual, you know, ephemeral costs of brand damage and of risks and customer, you know, negative customer sentiment that potentially comes out of it. With automation, I think it's actually pretty well understood. They can look at, hey, if we can do this many more cycles, if we can keep our uptime at this rate, if we can reduce specific workforce, and I'm always very careful about this because I don't believe automation is about replacement or displacement, but I do think it is about up-leveling and it is about helping people work on things that are complex problems that machines can't solve. I mean, said that if you don't need to put as many bodies on something that can be immediately returned to the organization's bottom line, or those resources can be used for something more innovative. So all those things are pretty well understood. Getting the automation to full deployment at scale, though, I think what often, it's not that roi, it's the timeline that gets misunderstood. Like all it projects, they tend to take longer. And even when things are made really easy, like with what Project Wisdom is trying to do, semantically enable through low code, no code and the ability to get more accuracy, it just never tends to happen quite as fast. So, but that's not an automation problem, That's just the crux of it. >>Okay. What are some of the, the next things on your plate? You're quite a, a busy guy. We, you, you were at Google, you were at Oracle, you're here today. What are some of the next things that we can expect from Daniel Newman? >>Oh boy, I moved Really, I do move really quickly and thank you for that. Well, I'm very excited. I'm taking a couple of work personal days. I don't know if you're a fan, but F1 is this weekend. I'm the US Grand Prix. Oh, you're gonna Austin. So I will be, I live in Austin. Oh. So I will be in Austin. I will be at the Grand Prix. It is work because it, you know, I'm going with a number of our clients that have, have sponsorships there. So I'll be spending time figuring out how the data that comes off of these really fun cars is meaningfully gonna change the world. I'll actually be talking to Splunk CEO at the, at the race on Saturday morning. But yeah, I got a lot of great things. I got a, a conversation coming up with the CEO of Twilio next week. We got a huge week of earnings ahead and so I do a lot of work on that. So I'll be on Bloomberg next week with Emily Chang talking about Microsoft and Google. Love talking to Emily, but just as much love being here on, on the queue with you >>Guys. Well we like to hear that. Who you're rooting for F one's your favorite driver. I, >>I, I like Lando. Do you? I'm Norris. I know it's not necessarily a fan favorite, but I'm a bit of a McLaren guy. I mean obviously I have clients with Oracle and Red Bull with Ball Common Ferrari. I've got Cly Splunk and so I have clients in all. So I'm cheering for all of 'em. And on Sunday I'm actually gonna be in the Williams Paddock. So I don't, I don't know if that's gonna gimme me a chance to really root for anything, but I'm always, always a big fan of the underdog. So maybe Latifi. >>There you go. And the data that comes off the how many central unbeliev, the car, it's crazy's. Such a scientific sport. Believable. >>We could have Christian, I was with Christian Horner yesterday, the team principal from Reside. Oh yeah, yeah. He was at the Oracle event and we did a q and a with him and with the CMO of, it's so much fun. F1 has been unbelievable to watch the momentum and what a great, you know, transitional conversation to to, to CX and automation of experiences for fans as the fan has grown by hundreds of percent. But just to circle back full way, I was very encouraged with what I saw today. Red Hat, Ansible, IBM Strong partnership. I like what they're doing in their expanded ecosystem. And automation, by the way, is gonna be one of the most robust investment areas over the next few years, even as other parts of tech continue to struggle that in cyber security. >>You heard it here. First guys, investment in automation and cyber security straight from two analysts. I got to sit between. For our guests and John Furrier, I'm Lisa Martin, you're watching The Cube Live from Chicago, Ansible Fest 22. John and I will be back after a short break. SO'S stick around.

Published Date : Oct 19 2022

SUMMARY :

Welcome back to the Cubes coverage of Ansible Fast 2022. He's got his nose to the grindstone ear to the ground. Great to have you back on the cube. I got off the plane, I felt the cold air, and I almost turned around and said, Does this thing go back? And I was in Miami a week ago and it was 85. The flywheel of the community is going around and round So that should initially give a lot of confidence to the buying group that in concert to basically enable enterprises to be up and running all the time. I mean, what do you think? One of the things that many people in the market And I don't know that the, the world has fully been able to appreciate that. And I'd love to get your reaction on that, because we got Cuban next week. And John, getting the developers to support the utilization Because the people that tend to know the process that needs to be the future of automation and how is it poised to give them a competitive advantage in the market? You need large models that are able to look and consume a ton of data that are gonna be continuously I mean, but the thing about the Amazon deal is like that marketplace integration And so you start, And I think that is where I think I see this What is the one thing that sort of drives whether, you know, it is, you know, I can run data on Google regulatory that tends to drive different clouds over, you know, global clouds like Tencent and Alibaba. I mean, it has to get complex before is continually doing the thing it's supposed to be doing on the schedule that it's been committed to. leverage the power of automation to keep that stuff up and running. how to start, but truly to make it a journey that is a success. to know what the ROI is gonna be because they can basically look at something like, you know, I mean, said that if you don't need to put as many bodies on something that What are some of the next things that we can Love talking to Emily, but just as much love being here on, on the queue with you Who you're rooting for F one's your favorite driver. And on Sunday I'm actually gonna be in the Williams Paddock. And the data that comes off the how many central unbeliev, the car, And automation, by the way, is gonna be one of the most robust investment areas over the next few years, I got to sit between.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Lisa MartinPERSON

0.99+

Daniel NewmanPERSON

0.99+

MicrosoftORGANIZATION

0.99+

CaliforniaLOCATION

0.99+

JohnPERSON

0.99+

AlibabaORGANIZATION

0.99+

ChicagoLOCATION

0.99+

Dean NewmanPERSON

0.99+

Emily ChangPERSON

0.99+

John FurrierPERSON

0.99+

AustinLOCATION

0.99+

AmazonORGANIZATION

0.99+

John FererPERSON

0.99+

IBMORGANIZATION

0.99+

EmilyPERSON

0.99+

MiamiLOCATION

0.99+

TencentORGANIZATION

0.99+

ChinaLOCATION

0.99+

OracleORGANIZATION

0.99+

MondayDATE

0.99+

GoogleORGANIZATION

0.99+

AWSORGANIZATION

0.99+

AnsibleORGANIZATION

0.99+

Red HatORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

TwilioORGANIZATION

0.99+

next weekDATE

0.99+

firstQUANTITY

0.99+

yesterdayDATE

0.99+

SundayDATE

0.99+

Saturday morningDATE

0.99+

Futurum ResearchORGANIZATION

0.99+

two companiesQUANTITY

0.99+

BillPERSON

0.99+

Red BullORGANIZATION

0.99+

StephaniePERSON

0.99+

less than 30%QUANTITY

0.99+

85QUANTITY

0.99+

ibmORGANIZATION

0.99+

todayDATE

0.99+

The Cube LiveTITLE

0.99+

two analystsQUANTITY

0.99+

IBM ResearchORGANIZATION

0.99+

McLarenORGANIZATION

0.99+

Bill McDermottPERSON

0.99+

oneQUANTITY

0.99+

Christian HornerPERSON

0.98+

this weekDATE

0.98+

one playerQUANTITY

0.98+

Williams PaddockLOCATION

0.98+

RockwellORGANIZATION

0.98+

Grand PrixEVENT

0.98+

Gil Shneorson, Dell | Dell Technologies World 2022


 

>>The cube presents. Dell technologies world brought to you by Dell. >>Welcome to Las Vegas. Lisa Martin, with Dave Volante. The cube is live at Dell technologies world 2022. Dave, hope you say live, live <laugh>. We are live. We are in person. We are three-D. We are also here on the first day of our coverage with an eight time, right? Eight time cube alum, GA Norris joins us the senior vice president of edge portfolio solutions at Dell technologies. Welcome back our friend. >>Thank you. It's great to be here in this forum with live people, you know, and 3d, >>Isn't it. We're amazing. We're not, we're not via a screen. This is actually real. So Gill a a lot, a lot of buzz, great attendance at this first event, since 20, lot's been going on since then, we're talking a lot about edge. It's not new, but there's a lot changing what's going on there. >>Well, you know, edge has been around for a while. Um, actually since, you know, the beginning of time people were doing, you know, compute and, and applications, they in the, um, in the physical space where data it, but more and more, um, data is based on sensors in cameras and machine vision. And if you wanna make real time decisions, there's a few reasons why you can't just send everything back to a data center or a cloud. Maybe you don't have the right latency, maybe, um, you it's too costly. Maybe you don't have the right end with maybe you have security challenges, maybe have compliance challenges. So the world's moving more and more resources towards where the data is created and to make real time decisions and to generate new business values, things are changing and they're becoming much more, um, um, involved than before, much more. Um, so basically that that's, what's changing. You know, we need to deal with distributed architectures much more than we needed before. >>I think one of the things we've learned in the last very dynamic two years is that access to realtime data is no longer a nice to have it's table stakes for whether we're talking about retail, healthcare, et cetera. So that the, the realtime data access is critical for everybody to these days. >>Right? And it, it could be a real time decision, or it could even be data collection either way. You need to place some device, some comput next to the source. And then, you know, you have a lot of them and you just multiply by multiple use cases and you be, you basically, you have a very complex problem to solve. And if you ask me what's new is that complexity is big coming more and more, um, critical to solve >>Critical. >>Oh, go ahead, please. >>I was just gonna say, talk to me about some of the, from a, from a complexity resolution perspective, what are some of the things that Dell is doing to help organizations as they spread out to the edge more to meet that consumer demand, but reduce that complexity from an infrastructure standpoint. >>So we focus on simplifying. I think that's what people need right now. So there are two things we do. We, we optimize our products, um, whether they need regularization or different temperature envelopes or, uh, management capability, remote management capability, and we create solutions. And so we develop, um, solutions that look at specific, um, outcomes and we size it and we create deployment guides. Um, we do everything we can, um, to simplify the, uh, the edge uses for our customers. >>You know, you guys is talking about, it's not new. I, and I know you do a lot in retail. I think of like the NCR cash register as the, the original edge, you know, but there's other use cases. Uh there's you Gil, you and I have talked about AI inferencing in, in real time, there was a question today in the analyst forum, uh, I think it went to Jeff or nobody wanted to take it. No, maybe it was Michael, but the metaverse, but that there's edge space is the edge industrial I OT. So how do you, I mean, the Tam is enormous. How do you think about the use cases? Are there ones that, that aren't necessarily sort of horizontal for you that you don't go after, like EVs and TA the cars? Or how are you thinking about >>It? Depends. I agree that the, uh, edge business is very verticalized. Um, at the same time, there are very, uh, there is, there are themes that emerge across every industry. Um, so we're trying to solve things horizontally being Dell, we need to solve for, um, repeatability and scale, but we do package, you know, vertical solutions on top of them because that's what people need. Um, so for example, you know, you said, um, NCR being the, uh, the original edge. If I asked you today, name how many applications are, are running in a retail store to enable your experience? You'd say, well, there's self checkout. Maybe there is a, um, fraud detection, >>Let's say a handful >>It's handful. The fact is it's not, it's about 30 different applications, 30 that are running. So you have, you know, digital labels and you have, you know, a curbside delivery and you have inventory management and you have crowd management and you have safety and security. And what happens today is that every one of those solar is purchased separately and deployed separately and connected to the network separately and secured separately. Hence you see the problem, right? And so I know what we do, and we create a solution. For example, we see, okay, infrastructure, what can we consolidate onto an infrastructure that could scale over time? And then we look at it in the context of a solution. So, you know, the solution we're announcing, or we announced last week does just that on the left side, it looks at a consolidated infrastructure based on VxRail and VMware stack. So you can run multiple applications on the right side, it working with a company called deep north for Inso analytics and actually people that, um, and the show they can go and see this in action, um, in our, um, you know, fake retail store, uh, back at the edge booth. Um, but the point is those elements of siloed applications and the need to consolidate their true for every industry. And that's what we're trying to solve for. >>I was just wondering, you said they're true for every industry. Every industry is facing the same challenges there. What, what makes retail so prime for transformation right now? >>That's a great question. So, you know, using my example from before, if you are faced with this set, have a shopper that buys online and they now are coming back to the stores and they need to, they want the same experience. They want the stuff that they search for. They want it available to them. Um, and in fact, we research that 80% of people say, if they have a bad experience will not come back to a retail store. So you've got all of those use cases that you need to put to, you've got this savvy shopping that comes in, you've got heightened labor costs. You've got a supply chain problem in most of those markets, labor >>Shortages as >>Well. It's a perfect storm. And you wanna give an experience, right? So CIOs are looking at this and they go, how do I do all of that? Um, and they, they, as I said before, the key management, the key problem is management of all of those things is why they can innovate faster. And so retail is in this perfect storm where they need to innovate and they want to innovate. And now they're looking for options and we're here to help them. >>You know, a lot of times we talk about the in industrial IOT, we talk about the it and the OT schism. Is there a similar sort of dissonance between it, your peeps, Dell's traditional market, and what's happening, you know, at the near edge, the retail infrastructure sort of different requirements. How are you thinking about that and managing that >>About, um, 50% of edge projects today are, are somehow involving it. Um, usually every project will involve it for networking and security, so they have to manage it either way. And today there's a lot of what we used to call shadow it. When we talked about cloud, this has happens at the edge as well. Now this happened for a good reason because the expertise are the OT people expertise on the, the specific use case. It's true for manufacturing. It's also for true for, for retail. Um, our traditional audience is the it audience and, and we will never be able to merger two worlds unless it was better able to service the OT buyers. And even in the show, I I've had multiple conversations today. We, with people to talk about the divide, how to bring it together, it will come together when it can deliver a better service to the OT, um, constituents. And that's definitely a job for Dell, right? This is what we do. If we enable our it buyer to do a better job in servicing the OT crowd or their business crowd in retail, um, more innovation will happen, you know, across the, those different dimensions. So I'm happy you asked that because that's actually part of the mission we're taking on. >>Where is one of the things I think about when you, you talk about that consumer experience and we're very demanding as consumers. We wanna ha as you described, we wanna have the same experience we expect to have that regardless of where we are. And if that doesn't happen, you, you mentioned that number of 80% of people's survey said, if I have a bad experience with a merchant, I'm out, I'm going somewhere else. Right. So where is the rest of the Csuite in the conversation? I can think of, um, a COO the chief marketing officer from brand value, brand reputation perspective. Are you talking with those folks as well to help make the connective so reality? >>Um, I, I, I don't know that we're having those conversation with those business owners. We we're a, um, a system, an infrastructure company. So, you know, we get involved once they understand, you know, what they want to do. We just look at it in. And so if you solve it one way, it's gonna be one outcome. Maybe there is a better way to look at it. Maybe there's an architecture, maybe there's a more, you know, thoughtful way to think about, you know, the problems before they happen. And, um, but the fact that they're all looking shows you, that their business owners are very, very concerned with, with this reality, their >>Key stakeholders. Can >>We come back to your announcement? Can you, can we unpack that a little bit, uh, for those who might not be familiar with it? What, what, what is it called again? And give us a peel, the onion a little bit Gil. Yeah. >>So, so we call it a Dell technologies validated design. Um, it is essentially reference architecture. Um, we take a use case, we size it. So we, you know, we, um, we save customers, the effort of, of testing and sizing. We document the deployment step by step. We just make it simpler. And as says, before we look for consolidation, so we took a VXL, which is our leading ACI product based on VMware technology with a VMware application management stack with Tansu. Um, and then we, we, we look at that as the infrastructure, and then we test it with a company called deep north and deep north, um, are, um, store analytics. So through machine vision, they can tell you where people are queuing up. If there is somebody in the store that needs help and nobody's approaching, if there is a water spill and somebody might, you know, slip and hurt themselves, if a fridge is open and something may get spot. >>And so all of those things together through machine vision and realtime decisions can have this much better experience. So we put all of this together, we created a design and now it's out there in the market for our partners to use for our customers to use. Um, this is an extension of our manufacturing solutions, where we did the same thing. We partner with a company called PTC. I know of obviously in a company called Litmos, um, to create, um, industrial and the leading solution. So this whole word of solutioning is supposed to look at the infrastructure and a use case and bring them together and document in a way that simplifies things for >>Customers. Do you ever see that becoming a Aku at some point in time or, >>Um, personal, if you ask me? I don't think so. And the reason is there's still a lot of variability in those and skewing, but that's a very formal, you know, internal discussion. Yeah. Um, the point is we are, we want people to buy as much of it as they need to, and, and we really want to help them if Aku could help them, we will get there, but we need to see repeatability before creating skews. >>Can you give us an example of a, of a retail or a manufacturing customer that's using this Dell validated design, this DVD, and that really has reduced or eliminated that complexity that was there before. >>So this solution is new. I mean, it's brand new, we just announced it. So, no, but, um, I don't know what names I can call out, cuz referenceability is probably examples though about generic, but I will tell you that most of the large retailers in the us are based in their stores on Dell technologies. Um, a lot of the trail is in, in those stores and you're talking about thousands of locations with remote management. Um, what we're doing here is we're taking it to the next step by looking at new use cases that they have not been implementing before and saying, look, same infrastructure is valid. You know, scalable is it's scalable. And here are the new use cases with machine vision and other things that here is how you do that. But we're seeing a lot of success in retail in the last few years. >>So what should we expect looking forward, you know, any gaps that customers are asking for trying to fill? What, what two to three years out, what should we expect? >>Um, I think we're gonna stay very true to our simplification message. We want to help people simplify. So if it's simplifying, um, maintenance, if it's simplifying management, if it's simplifying through solutioning, you're gonna see us more and more and more, um, investing in simplification of edge. Um, and that's through our own IP, through our partnerships. Um, there, there is a lot more coming if, if I may say it myself, but, but it's, it's a little too early to, uh, to talk about it. >>So for those folks that are here at the show that get to see it and play with it and touch it and feel it, what would you say some of the biggest impacts are that this technology can deliver tomorrow? >>Well, first of all, it's enabling to do what they want. See, we don't have to go and, and tell people, oh, you probably really need to move things through the edge. They know they need to do it. Our job is to tell them how to do it in a secure way, in a simplified way. So that's, that's a nice thing about this, this market it's happening, whether we want it or not. Um, people in this show can go see some things in action. They can see the solution in action. They can see the manufacturing solution in action and even more so. And I forgot to say part of our announcement was a set of solution centers in Limerick island and in Singapore, that was just open. And soon enough in Austin, Texas saw that, and we will have people come in and have the full experience of IOT OT and edge device devices in action. So AR and VR, I T IEN technology and scanning technology. So they could be, um, thinking about the art of the possible, right? Thinking about this immersive experience that will help them invent with us. And so we're expecting a lot of innovation to come out of those conversations for us and for them. >>So doing a lot of testing before deployment and really gleaning that testing >>Before deployment solution architecture, just ideation, if they're not there yet. So, and I've just been to Singapore in one of those, um, they asked me to, um, pretend I was a, um, retail ski enter in a distribution center and I didn't do so well, but I was still impressed with the technology. So, >>Well, eight time Q alumni. Now you have a career to fall back on if you need to. Exactly. >><laugh> >>GA it's been great to have you. Thank you so much for coming back, talking to us about what's new on day one of Dell technologies world 22. Thank >>You for having me again, >>Our pleasure for Dave Volante. I'm Lisa Martin, coming to you live from the Venetian in Las Vegas at Dell technologies world 2022. This is day one of our coverage stick around Dave and I will be right back with our next guest.

Published Date : May 3 2022

SUMMARY :

Dell technologies world brought to you by Dell. Dave, hope you say live, live <laugh>. It's great to be here in this forum with live people, you know, and 3d, a lot of buzz, great attendance at this first event, since 20, lot's been going on since then, have the right latency, maybe, um, you it's too costly. So that the, the realtime data access is critical for everybody to these days. you know, you have a lot of them and you just multiply by multiple use cases and you be, out to the edge more to meet that consumer demand, but reduce that complexity from an infrastructure standpoint. And so we develop, um, solutions that look at specific, um, outcomes and we size it and I think of like the NCR cash register as the, the original edge, you know, you know, you said, um, NCR being the, uh, the original edge. um, in our, um, you know, fake retail store, uh, back at the edge booth. I was just wondering, you said they're true for every industry. So, you know, using my example from before, if you are faced with And you wanna give an experience, right? you know, at the near edge, the retail infrastructure sort of different requirements. more innovation will happen, you know, across the, those different dimensions. We wanna ha as you described, we wanna have the same experience we expect to have that regardless And so if you solve it one way, it's gonna be one outcome. Can We come back to your announcement? So we, you know, So we put all of this together, we created a design Do you ever see that becoming a Aku at some point in time or, a lot of variability in those and skewing, but that's a very formal, you know, Can you give us an example of a, of a retail or a manufacturing customer that's using this Dell validated but I will tell you that most of the large retailers in the us are based in their stores So if it's simplifying, um, maintenance, and tell people, oh, you probably really need to move things through the edge. and I've just been to Singapore in one of those, um, they asked me to, um, pretend I was Now you have a career to fall back on if you need to. Thank you so much for coming back, talking to us about what's new on day one of Dell technologies I'm Lisa Martin, coming to you live from the Venetian

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Lisa MartinPERSON

0.99+

SingaporeLOCATION

0.99+

Dave VolantePERSON

0.99+

Gil ShneorsonPERSON

0.99+

JeffPERSON

0.99+

DavePERSON

0.99+

50%QUANTITY

0.99+

DellORGANIZATION

0.99+

twoQUANTITY

0.99+

Las VegasLOCATION

0.99+

80%QUANTITY

0.99+

last weekDATE

0.99+

three yearsQUANTITY

0.99+

MichaelPERSON

0.99+

todayDATE

0.99+

two thingsQUANTITY

0.99+

GilPERSON

0.99+

LitmosORGANIZATION

0.99+

ACIORGANIZATION

0.99+

Austin, TexasLOCATION

0.99+

oneQUANTITY

0.99+

eight timeQUANTITY

0.99+

30QUANTITY

0.99+

PTCORGANIZATION

0.98+

VenetianLOCATION

0.98+

Limerick islandLOCATION

0.98+

first dayQUANTITY

0.98+

thousandsQUANTITY

0.97+

VxRailTITLE

0.97+

tomorrowDATE

0.97+

Dell technologiesORGANIZATION

0.96+

TansuORGANIZATION

0.96+

two worldsQUANTITY

0.95+

Dell technologies world 2022EVENT

0.94+

Inso analyticsORGANIZATION

0.93+

2022DATE

0.93+

two yearsQUANTITY

0.92+

first eventQUANTITY

0.91+

Eight time cubeQUANTITY

0.91+

AkuORGANIZATION

0.9+

about 30 different applicationsQUANTITY

0.89+

day oneQUANTITY

0.88+

VMwareTITLE

0.88+

Technologies World 2022EVENT

0.87+

technologies world 22EVENT

0.84+

one wayQUANTITY

0.83+

VMwareORGANIZATION

0.83+

VXLORGANIZATION

0.81+

yearsQUANTITY

0.8+

deep northORGANIZATION

0.79+

one outcomeQUANTITY

0.77+

GAPERSON

0.74+

20QUANTITY

0.74+

GillPERSON

0.72+

NorrisPERSON

0.58+

edgeORGANIZATION

0.56+

GALOCATION

0.49+

NCRORGANIZATION

0.34+

Zak Brown, McLaren Racing | Splunk .conf1


 

>>Hello, and welcome back to the cubes coverage of splunk.com here in the virtual studios in Silicon valley broadcasting around the world's a virtual event. Um, John four-year host of the queue. We've got a great guest, Zach brown, chief executive officer of McLaren racing, really looking forward to this interview, Zach, welcome to the queue. Well, thanks for coming on. Thanks for having me. So we have a huge fan base in the tech community. A lot of geeks love the neurons. They love the tech behind the sport. Uh, and Netflix is driving to survive. Series has absolutely catapulted the popularity of F1 in the tech community. So congratulations on all the success in that program and on, and then on the >>Thank you very much, it's been a, it's been a good run. We've won our first race in a while, but we still have a ways to go to get in that, uh, world championship that, uh, >>So for the techies out there and the folks in our audience that aren't familiar with, the specifics of the racing team and the dynamics, take a minute to explain what you guys do. >>Uh, so McLaren racing, uh, which has a variety of, uh, racing teams, uh, a formula one team in indie car team and extremely team and an e-sports team. Uh, we're the second most successful form of the one team in the history of sport. Now 183 wins 182, uh, when I joined 20 world championships and, uh, we're, we're close to a thousand people to, to run a couple of racing cars and, uh, currently third in the championship, uh, with Lando Norris and, uh, Daniel, Ricardo. >>So talk about the, um, the, the dynamics of the spore. Obviously data is big part of it. Uh, we see the, a lot of the coverage. You can see anything can happen overnight. It's very quick. Um, technology has been being, uh, playing a big role in sport. What's your vision on how that's evolving? Are you happy with where things are, uh, and where do you see it going? >>Yeah, it does some interesting stats. So, um, the car that qualifies first at the beginning of the year, if you didn't touch, it would be last by the end of the year. So that's the pace of a development of a, of a formula one car. We change a, uh, and develop a new part on the car every 14 minutes, 365 days, days a year. Um, and technology plays a huge role. Uh, it's, it's probably the most technical, um, evolved sport in the world. Uh, both safety data, uh, the innovation it's it's awesome. And what a lot of people don't know is a lot of what we develop in a formula. One car ends up in other parts of the world, whether it was a ventilators that we helped develop for the UK government, uh, to working with our, uh, various partners or safety and innovation in the automotive industry. >>You know, I love it. I always loved the IOT internet of things, story around cars, because sensors or instrumentation is a big part of it. Um, and it all comes together. So it's pretty, it's not simple. No, give it feel, give it a taste a little bit about what's it. How complicated is it, how you guys pay attention to the details? What's important. Take us through some of the, some of the inside the ropes around the IOT of the sensors and all the data. >>Yeah. So we have over 300 sensors on our race car. We collect the one and a half terabytes of data. Every race weekend, we have a thousand people, um, and the strong majority of those are working around data and technology, as opposed to physically touching the car out of those thousand people, you probably only have about 60 or 70. They're actually touch the race card at a race weekend. We've been doing connected cars for about 25 years. So that's kind of a new thing here to, to most people, but we've been communicating back and forth with our race car for, for decades all around the world. And what a lot of people don't realize is it all starts in our mission control back in our factory in Woking, England. So wherever we are around the world, the racing team actually starts in England. >>So I want to ask you about the personalities on the team. How big is the staff? What's the makeup of the personnel has to get the drivers. They're critical. They're a very dynamic personalities. We'll come to the side question on that later, but what's the staff look like on when you guys put this together. So you get, you get race day and you got back office support. >>What's the team look like? Yeah. So you've got about a thousand people that, that make up the collective team. You'll have about a hundred in marketing. Uh, you'll have about a hundred in finance, HR, and then you kind of get to the, the racing team. If you'd like 800 people, you have about a hundred people traveling to each race, uh, about 50 people back at the factory, working with data and communications that are grand Prix weekend. And then everybody else is designing manufacturing, production laminating. So we run 24, 7 shifts, uh, three shifts, uh, in certain parts. Uh, we develop, uh, 85% of the car changes of what's allowed to be changed start of the year to the, the end of the year. So the development is, is unbelievable. >>I know you're here in the U S for the U S grand Prix in Austin. Um, coming up, I'm just curious how cars get transported. >>Uh, w when we're traveling around the world, uh, they, they travel on 7 47 and are flown around the world. And then when we're in Europe, we have about 18 trucks that were communing around when we're kind of in the European part of the circuit is usually in the middle of the year. But when we're going to Australia or Singapore, Bahrain, those are, those are on planes form of the one actually does that. They give us an allocation of, of space, and then we have to write a check if we need more space than where >>Yeah. We're allowed. Yeah. And that brings up the security question, because honestly, there's a lot of fans, a lot of people are into it. Also, this potentially security risks. Have you guys thought about that obviously like physical moving the supply chain around from event event, but also technology risk. Um, how do you guys think about security? >>Yeah, it's, it's critically important. We've had, uh, fortunately we've not had any breach of our technology. We have had a breach in the late nineties of our radio communications and, uh, it was in Australia, Mika Hakkinen and a fan, uh, who I think was probably having some fun and were able to break into our radio channel and actually asked Mika to pit. He pitted team wasn't ready. And fortunately, we will run in one, two, but we actually had to reverse the drivers. So security is >>Critically important, probably Katie Scrivener, and they all look, I just hack the radio, was talking to the driver. That is a funny story, but it could be serious. I mean, now you have all kinds of >>The stuff going on and, and, you know, there's a lot of money at stake, you know, so, you know, we're fortunate in this particular instance, it didn't hurt us cause we were running one, two, so we could reverse the drivers and the right guide one. Um, but you know, that could decide, uh, a world championship and you have, you know, tens of millions of dollars online, but even besides the economics, we want to win races. >>You know, what's funny is that you guys have a lot of serious on the line stakes with these races, but you're known for having a lot of fun, the team team dynamic. I have to ask you, when you finish on the podium one and two, there's a Shui with the drivers. How'd that go down. It was pretty, pretty a big spectacle online and >>Yeah, it was, it was good, fun. That's something, obviously Daniel Ricardo is kind of developed as his thing when he, uh, when he wins. And, uh, when we were, uh, before we went on the podium, he said to me, you're going to do the shoe. Yes, of course. In the car show you got to do, we have to like a bunch of 12 year old kids, uh, on the podium, but that's where we're just big kids going, motor racing and >>The end of the day. Well, I gotta say you guys come across really strong as a team, and I love the fun and, you know, competitive side. So congratulations on that, I think is good on the competitive side, take me through the advantage, driving the advantage with data, because that's really the theme here at.com, which is Splunk, which they're a big partner, as well as your other sponsors. Data's big, you know, and it's striving an advantage. Where do you see that coming from? Take us through where you guys see the advantages. Yes. >>So, you know, everything we do is, is precision and, you know, every second, every 10th counts and, um, you know, you can get all this data in, but what do you do with this data? And the humans can, uh, real, uh, react as quickly as is, you know, people like Splunk who can help us, uh, not only collect data, but help us understand data. And, um, you know, typically there's one pit stop, which can be the difference between winning and losing. Um, you have all these different scenarios playing out with weather with tire wear competition. And so, you know, we live by data. We didn't, uh, when, in, in Russia, when we, uh, could have, and it was because we got a bit emotionally caught up in the excitement of trying to win the race instead of staying disciplined and focused on, on data. And so it's a very data-driven sport when I'm on the pit wall, there's a thing called racer instinct, which is my 30 years in the sport. And, uh, your experience and your kind of your gut to make decisions. And every time our team makes a decision that I'm sitting there going, I'm not sure that was the right decision. They're staring at data. I'm not, I'm trusting my 30 years of experience. They'd beat me nine out of 10. >>Yeah. I mean, you know, this is a huge topic too, in the industry, explainable AI is one of the hottest trends in computer science where there's so much algorithms involved. The gut instinct is now coming back. What algorithms are available, knowing when to deploy what algorithms or what data to pay attention to is a huge new gut factor. Yep. Can you explain how the young drivers and the experience folks in the industry are dealing with this new instinct full data-driven? >>Yeah. That's, you know, that's what we have 50 people back at the factory doing, and they're looking at all sorts of information coming in, and then they're taking that information and they're feeding it to our head of strategy. Who's then feeding it to our racing director. Who's getting all these data points in from tire to performance, to reliability, and then the human data from both drivers coming through their engineers. And then he gets all that information in. He has to process it immediately and make decisions, but it's, it's a data-driven sport. >>I saw Lando walking around, got a selfie with them. It's great. Everyone's loving it on Twitter. My family, like get an autograph, the future of the sport. He's a young young driver. So that instincts coming in the future sport comes up all the time. The tires are a big discussion point, but also you've got a lot of presets going on, a lot of data, a lot of going on and you see the future where there's remote, you know, kind of video game you're in the pit wall and you can make decisions and deploy on behalf of the drivers. Is that something that >>Well, that technology is there and we used to do that, but now it's been outlawed because there's a real push to make sure the drivers are driving the car. So that technology is here. It has been deployed in the past. We could do it, but we're trying to find as a sport, the balance between, you know, letting the driver do it. So he, or she might make a mistake and a little bit of excitement to it. So, um, we now there are certain protocols on what we communicate. Um, we can't, um, everything has to be driver fed into the car. So we can now you'll hear all sorts of codes that we're talking through, which there are, um, about 300 different adjustments the driver can make on the steering wheel, which is unbelievable. And so that's us seeing information, getting data in coming to conclusions that we're giving him or her information that we think will help make the car >>A lot of new dimensions for drivers to think about when they're being successful with the gut, that the track data everything's kind of coming together. >>Yeah. It's amazing. Um, when you listen to these drivers on the radio, you forget that they're going 200 plus miles an hour. Cause they sound quite relaxed in this very, you know, open and easy communication of here's what I'm feeling with. Again, we're talking all these codes and then we all, because we can hear each other, there's a lot of trickery that goes on. So for a driver to be going to turn a miles an hour, taking this information and then know what code we're talking, are we kind of throwing a code out there to put the competition off is pretty amazing that they can take this all in. >>You know, I wish I was younger again, like we're old school and the younger generation, I was having a few conversations with a lot of the young audience. They wanted me to ask you, when are you guys going to metaverse the tracks? When can I get involved and participate and maybe even make the team, or how do I become more active, engaged with the McLaren racing team? >>And that technology is almost, we're actually, um, that's in development. So I, I think it won't be long before, you know, Sunday you can log on, uh, and, and race Lando around Monaco and be in the race. So that, that technology is around the corner. >>That's the shadow thing to developing. I see that. E-sports just quick. I know you've got to go on, but last minute we have here, e-sports, what's the future of e-sports with the team, >>But e-sports been great for the sport. You know, it's gone from, you know, when I was growing up, it was video games and now it's real simulation. And, uh, so we've held, I think we're going four years into it. Now we were the first team to really develop any sports platform and we've had competitors go on to help us with our simulation. So it's, it's real racially developed the race car before it goes on the racetrack it's in simulation. And that's where e-sports, >>And this is the new advantage. This is a new normal, this is where you guys see the data driving. The >>Definitely. And I think the other thing it is, you know, somewhat stick and ball sports, you can play in school. And motor racing has historically been partying, which can cost hundreds of thousands of dollars. Now with e-sports you have a less expensive platform to let young men and women around the world, but a steering wheel in their hand and go motor racing. So I think it's also going to kind of bring that younger generation of fan and >>There's so much collective intelligence, potentially competitive advantage data. Again, data coming up final word to end the segment, Splunk, big partner on the data side, obviously helping you guys financially, as well as you do need some sponsorship support to make the team run. Um, what's the relationship with Splunk? Take a minute to talk about the plug. >>It's been a, it's been great, you know, they're, they're two big contributors. We need a lot of money to run the racing team. So they're a great partner in that respect, but more importantly, they're helping us with our whole data journey, making smarter, quicker decisions. So their contribution to being part of the race team. And, uh, we used our technology. Um, it has been great. And I think, um, you know, if I look at our technology partners, uh, we have many that all contribute to making a >>Yeah. I mean, it really is nice. It's data inaction, it's teamwork, it's competitive, it's fun. That's kind of a good, good, >>I think fun is the center of everything that we do. It's the center of everything spunk does. Cause I think if you have fun, people enjoy going to working a little bit harder. We're seven days a week. And uh, you know, a lot of teammates you've got to work well together. So I think if you're having fun, you enjoy what you're doing and it doesn't feel like work. >>Congratulations on climbing up in the rankings and everything on your team. Two great drivers. Thanks for coming on the cube. We appreciate it. Thank you. All right. We're here. The key. We like to have fun here and get all the action on the tech side. Honestly, F1 is technology enabled data, driving the advantage and driving to is a great Netflix series. Check it out. McLaren's featured heavily in there and got a great team. Zach brown Siegel. Thanks for coming on. Appreciate it. I'm sure for your host. Thank you for watching.

Published Date : Oct 19 2021

SUMMARY :

So congratulations on all the success in that program and on, and then on the Thank you very much, it's been a, it's been a good run. take a minute to explain what you guys do. Uh, so McLaren racing, uh, which has a variety of, uh, racing teams, Are you happy with where things are, uh, and where do you see it going? So that's the pace of a development of a, how you guys pay attention to the details? as opposed to physically touching the car out of those thousand people, you probably only have about 60 or 70. So you get, you get race day and you got HR, and then you kind of get to the, the racing team. I know you're here in the U S for the U S grand Prix in Austin. of the year. how do you guys think about security? We have had a breach in the late nineties of our radio communications and, I mean, now you have all kinds of Um, but you know, that could decide, uh, a world championship and you have, you know, tens of millions of dollars online, You know, what's funny is that you guys have a lot of serious on the line stakes with these races, In the car show you got to do, we have to like a bunch Take us through where you guys see the advantages. uh, real, uh, react as quickly as is, you know, people like Splunk who can help us, experience folks in the industry are dealing with this new instinct full data-driven? of information coming in, and then they're taking that information and they're feeding it to our head of strategy. a lot of going on and you see the future where there's remote, you know, kind of video game you're in the pit wall and the balance between, you know, letting the driver do it. A lot of new dimensions for drivers to think about when they're being successful with the gut, that the track data everything's Um, when you listen to these drivers on the radio, you forget that they're going 200 plus When can I get involved and participate and maybe even make the team, or how do I become more active, So I, I think it won't be long before, you know, That's the shadow thing to developing. So it's, it's real racially developed the race car before it goes on the racetrack it's in simulation. This is a new normal, this is where you guys see the data driving. Now with e-sports you have a less expensive platform to let young to end the segment, Splunk, big partner on the data side, obviously helping you guys financially, And I think, um, you know, if I look at our technology partners, That's kind of a good, good, And uh, you know, a lot of teammates you've got to work well together. Honestly, F1 is technology enabled data, driving the advantage and driving to is

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Zak BrownPERSON

0.99+

MikaPERSON

0.99+

Mika HakkinenPERSON

0.99+

Daniel RicardoPERSON

0.99+

AustraliaLOCATION

0.99+

Zach brownPERSON

0.99+

EuropeLOCATION

0.99+

SingaporeLOCATION

0.99+

Katie ScrivenerPERSON

0.99+

ZachPERSON

0.99+

DanielPERSON

0.99+

365 daysQUANTITY

0.99+

Lando NorrisPERSON

0.99+

RussiaLOCATION

0.99+

30 yearsQUANTITY

0.99+

24QUANTITY

0.99+

EnglandLOCATION

0.99+

Zach brown SiegelPERSON

0.99+

Silicon valleyLOCATION

0.99+

800 peopleQUANTITY

0.99+

85%QUANTITY

0.99+

nineQUANTITY

0.99+

NetflixORGANIZATION

0.99+

thirdQUANTITY

0.99+

10QUANTITY

0.99+

SundayDATE

0.99+

twoQUANTITY

0.99+

SplunkORGANIZATION

0.99+

secondQUANTITY

0.99+

each raceQUANTITY

0.99+

four-yearQUANTITY

0.99+

thousand peopleQUANTITY

0.99+

both driversQUANTITY

0.99+

50 peopleQUANTITY

0.99+

BahrainLOCATION

0.99+

RicardoPERSON

0.99+

McLarenORGANIZATION

0.99+

first raceQUANTITY

0.99+

McLaren RacingORGANIZATION

0.99+

firstQUANTITY

0.99+

three shiftsQUANTITY

0.99+

Woking, EnglandLOCATION

0.99+

U SLOCATION

0.99+

oneQUANTITY

0.99+

LandoPERSON

0.99+

MonacoLOCATION

0.99+

AustinLOCATION

0.98+

One carQUANTITY

0.98+

20 world championshipsQUANTITY

0.98+

about 25 yearsQUANTITY

0.98+

JohnPERSON

0.98+

over 300 sensorsQUANTITY

0.98+

U S grand PrixEVENT

0.98+

200 plus miles an hourQUANTITY

0.98+

one teamQUANTITY

0.98+

bothQUANTITY

0.98+

EuropeanLOCATION

0.98+

one pit stopQUANTITY

0.97+

first teamQUANTITY

0.97+

four yearsQUANTITY

0.97+

hundreds of thousands of dollarsQUANTITY

0.97+

late ninetiesDATE

0.97+

grand PrixEVENT

0.97+

Two great driversQUANTITY

0.96+

a miles an hourQUANTITY

0.96+

70QUANTITY

0.96+

seven days a weekQUANTITY

0.96+

183QUANTITY

0.96+

7 shiftsQUANTITY

0.95+

at.comORGANIZATION

0.95+

about 50 peopleQUANTITY

0.94+

about a hundred peopleQUANTITY

0.94+

two big contributorsQUANTITY

0.94+

tens of millions of dollarsQUANTITY

0.94+

one and a half terabytes of dataQUANTITY

0.93+

about 60QUANTITY

0.92+

about 18 trucksQUANTITY

0.92+

12 year oldQUANTITY

0.9+

about a thousand peopleQUANTITY

0.9+

one carQUANTITY

0.89+

decadesQUANTITY

0.87+

UK governmentORGANIZATION

0.86+

about 300 differentQUANTITY

0.86+

about a hundredQUANTITY

0.86+

182QUANTITY

0.86+

Every race weekendQUANTITY

0.83+

James Hodge


 

>> Well, hello everybody, John Walls here on theCUBE and continuing our coverage. So splunk.com for 21, you know, we talk about big data these days, you realize the importance of speed, right? We all get that, but certainly Formula One Racing understands speed and big data, a really neat marriage there. And with us to talk about that is James Hodge, who was the global vice president and chief strategy officer international at Splunk. James, good to see it today. Thanks for joining us here on theCUBE. >> Thank you, John. Thank you for having me and yeah, the speed of McLaren. Like I'm, I'm all for it today. >> Absolutely. And I find it interesting too, that, that you were telling me before we started the interview that you've been in Splunk going on nine years now. And you remember being at splunk.com, you know, back in the past other years and watching theCUBE and here you are! you made it. >> I know, I think it's incredible. I love watching you guys every single year and kind of the talk that guests. And then more importantly, like it reminds me of conf for every time we see theCUBE, no matter where you are, it reminds me of like this magical week there's dot com for us. >> Well, excellent. I'm glad that we could be a part of it at once again and glad you're a part of it here on theCUBE. Let's talk about McLaren now and the partnership, obviously on the racing side and the e-sports side, which is certainly growing in popularity and in demand. So just first off characterize for our audience, that relationship between Splunk and McLaren. >> Well, so we started the relationship almost two years ago. And for us it was McLaren as a brand. If you think about where they were, they recently, I think it's September a Monza. They got a victory P1 and P2. It was over 3200 days since their last victory. So that's a long time to wait. I think of that. There's 3000 days of continual business transformation, trying to get them back up to the grid. And what we found was that ethos, the drive to digital the, the way they're completely changing things, bringing in kind of fluid dynamics, getting people behind the common purpose that really seem to fit the Splunk culture, what we're trying to do and putting data at the heart of things. So kind of Formula One and McLaren, it felt a really natural place to be. And we haven't really looked back since we started at that partnership. It's been a really exciting last kind of 18 months, two years. >> Well, talk a little bit about, about the application here a little bit in terms of data cars, the, the Formula One cars, the F1 cars, they've got hundreds of sensors on them. They're getting, you know, hundreds of thousands or a hundred thousand data points almost instantly, right? I mean, there's this constant processing. So what are those inputs basically? And then how has McLaren putting them to use, and then ultimately, how is Splunk delivering on that from McLaren? >> So I learned quite a lot, you know, I'm, I'm, I been a childhood Formula One fan, and I've learned so much more about F1 over the last kind of couple of years. So it actually starts with the car going out on the track, but anyone that works in the IT function, the car can not go out on track and less monitoring from the car actually is being received by the garage. It's seen as mission critical safety critical. So IT, when you see a car out and you see the race engineer, but that thumbs up the mechanical, the thumbs up IT, get their vote and get to put the thumbs up before the car goes out on track there around about 300 sensors on the car in practice. And there were two sites that run about 120 on race day that gets streamed on a two by two megabits per second, back to the FIA, the regulating body, and then gets streams to the, the garage where they have a 32 unit rack near two of them that have all of their it equipment take that data. They then stream it over the internet over the cloud, back to the technology center in working where 32 race engineers sit in calm conditions to be able to go and start to make decisions on when the car should pit what their strategy should be like to then relate that back to the track side. So you think about that data journey alone, that is way more complicated and what you see on TV, you know, the, the race energy on the pit wall and the driver going around at 300 kilometers an hour. When we look at what Splunk is doing is making sure that is resilient. You know, is the data coming off the car? Is it actually starting to hit the garage when it hits that rack into the garage, other than streaming that back with the right latency back to the working technology center, they're making sure that all of the support decision-making tools there are available, and that's just what we do for them on race weekend. And I'll give you one kind of the more facts about the car. So you start the beginning of the season, they launched the car. The 80% of that car will be different by the end of the season. And so they're in a continual state of development, like constantly developing to do that. So they're moving much more to things like computational fluid dynamics applications before the move to wind tunnel that relies on digital infrastructure to be able to go and accelerate that journey and be able to go make those assumptions. That's a Splunk is becoming the kind of underpinning of to making sure those mission critical applications and systems are online. And that's kind of just scratching the surface of kind of the journey with McLaren. >> Yeah. So, so what would be an example then maybe on race day, what's a stake race day of an input that comes in and then mission control, which I find fascinating, right? You've got 32 different individuals processing this input and then feeding their, their insights back. Right. And so adjustments are being made on the fly very much all data-driven what would be an example of, of an actual application of some information that came in that was quickly, you know, recorded, noted, and then acted upon that then resulted in an improved performance? >> Well, the most important one is pit stop strategy. It can be very difficult to overtake on track. So starting to look at when other teams go into the pit lane and when they come out of the, the pit lane is incredibly important because it gives you a choice. Do you stay also in your current set of tires and hope to kind of get through that team and kind of overtake them, or do you start to go into the pits and get your fresh sets of tires to try and take a different strategy? There are three people in mission control that have full authority to go and make a Pit lane call. And I think like the thing that really resonated for me from learning about McLaren, the technology is amazing, but it's the organizational constructs on how they turn data into an action is really important. People with the right knowledge and access to the data, have the authority to make a call. It's not the team principle, it's not the person on the pit wall is the person with the most amount of knowledge is authorized and kind of, it's an open kind of forum to go and make those decisions. If you see something wrong, you are just as likely to be able to put your hand up and say, something's wrong here. This is my, my decision than anyone else. And so when we think about all these organizations that are trying to transform the business, we can learn a lot from Formula One on how we delegate authority and just think of like technology and data as the beginning of that journey. It's the people in process that F1 is so well. >> We're talking a lot about racing, but of course, McLaren is also getting involved in e-sports. And so people like you like me, we can have that simulated experience to gaming. And I know that Splunk has, is migrating with McLaren in that regard. Right. You know, you're partnering up. So maybe if you could share a little bit more about that, about how you're teaming up with McLaren on the e-sports side, which I'm sure anybody watching this realizes there's a, quite a big market opportunity there right now. >> It's a huge market opportunity is we got McLaren racing has, you know, Formula One, IndyCar and now extreme E and then they have the other branch, which is e-sports so gaming. And one of the things that, you know, you look at gaming, you know, we were talking earlier about Ted Lasso and, you know, the go to the amazing game of football or soccer, depending on kind of what side of the Atlantic you're on. I can go and play something like FIFA, you know, the football game. I can be amazing at that. I have in reality, you know, in real life I have two left feet. I am never going to be good at football however, what we find with e-sports is it makes gaming and racing accessible. I can go and drive the same circuits as Lando Norris and Daniel Ricardo, and I can improve. And I can learn like use data to start to discover different ways. And it's an incredibly expanding exploding industry. And what McLaren have done is they've said, actually, we're going to make a professional racing team, an e-sports team called the McLaren Shadow team. They have this huge competition called the Logitech KeyShot challenge. And when we looked at that, we sort of lost the similarities in what we're trying to achieve. We are quite often starting to merge the physical world and the digital world with our customers. And this was an amazing opportunity to start to do that with the McLaren team. >> So you're creating this really dynamic racing experience, right? That, that, that gives people like me, or like our viewers, the opportunity to get even a better feel for, for the decision-making and the responsiveness of the cars and all that. So again, data, where does that come into play there? Now, What, what kind of inputs are you getting from me as a driver then as an amateur driver? And, and how has that then I guess, how does it express in the game or expressed in, in terms of what's ahead of me to come in a game? >> So actually there are more data points that come out of the F1 2021 Codemasters game than there are in Formula One car, you get a constant stream. So the, the game will actually stream out real telemetry. So I can actually tell your tire pressures from all of your tires. I can see the lateral G-Force longitudinal. G-Force more importantly for probably amateur drivers like you and I, we can see is the tire on asphalt, or is it maybe on graphs? We can actually look at your exact position on track, how much accelerator, you know, steering lock. So we can see everything about that. And that gets pumped out in real time, up to 60 Hertz. So a phenomenal amount of information, what we, when we started the relationship with McLaren, Formula One super excited or about to go racing. And then at Melbourne, there's that iconic moment where one of the McLaren team tested positive and they withdrew from the race. And what we found was, you know, COVID was starting and the Formula One season was put on hold. The FIA created this season and called i can't remember the exact name of it, but basically a replica e-sports gaming F1 series. We're using the game. Some of the real drivers like Lando, heavy gamer was playing in the game and they'd run that the same as race weekends. They brought celebrity drivers in there. And I think my most surreal zoom call I ever was on was with Lando Norris and Pierre Patrick Aubameyang, who was who's the arsenal football captain, who was the guest driver in the series to drive around Monaco and Randy, the head of race strategy as McLaren, trying to coach him on how to go drive the car, what we ended up with data telemetry coming from Splunk. And so Randy could look out here when he pressing the accelerator and the brake pedal. And what was really interesting was Lando was watching how he was entering corners on the video feed and intuitively kind of coming to the same conclusions as Randy. So kind of, you could see that race to intuition versus the real stats, and it was just incredible experience. And it really shows you, you know, racing, you've got that blurring of the physical and the virtual that it's going to be bigger and bigger and bigger. >> So to hear it here, as I understand what you were just saying now, the e-sports racing team actually has more data to adjust its performance and to modify its behaviors, then the real racing team does. Yep. >> Yeah, it completely does. So what we want to be able to do is turn that into action. So how do you do the right car setup? How do you go and do the right practice laps actually have really good practice driver selection. And I think we're just starting to scratch the surface of what really could be done. And the amazing part about this is now think of it more like a digital twin, what we learn on e-sports we can actually say we've learned something really interesting here, and then maybe a low, you know, if we get something wrong, it may be doesn't matter quite as much as maybe getting an analytics wrong on race weekend. >> Right. >> So we can actually start to look and improve through digital and then start to move that support. That's over to kind of race weekend analytics and supporting the team. >> If I could, you know, maybe pun intended here, shift gears a little bit before we run out of time. I mean, you're, you're involved on the business side, you know, you've got, you know, you're in the middle east Africa, right? You've got, you know, quite an international portfolio on your plate. Now let's talk about just some of the data trends there for our viewers here in the U S who maybe aren't as familiar with what's going on overseas, just in terms of, especially post COVID, you know, what, what concerns there are, or, or what direction you're trying to get your clients to, to be taking in terms of getting back to work in terms of, you know, looking at their workforce opportunities and strengths and all those kinds of things. >> I think we've seen a massive shift. I think we've seen that people it's not good enough just to be storing data its how do you go and utilize that data to go and drive your business forwards I think a couple of key terms we're going to see more and more over the next few years is operational resilience and business agility. And I'd make the assertion that operational resilience is the foundation for the business agility. And we can dive into that in a second, but what we're seeing take the Netherlands. For example, we run a survey last year and we found that 87% of the respondents had created new functions to do with data machine learning and AI, as all they're trying to do is go and get more timely data to front line staff to go. And next that the transformation, because what we've really seen through COVID is everything is possible to be digitized and we can experiment and get to market faster. And I think we've just seen in European markets, definitely in Asia Pacific is that the kind of brand loyalty is potentially waning, but what's the kind of loyalty is just to an experience, you know, take a ride hailing app. You know, I get to an airport, I try one ride hailing app. It tells me it's going to be 20 minutes before a taxi arrives. I'm going to go straight to the next app to go and stare. They can do it faster. I want the experience. I don't necessarily want the brand. And we're find that the digital experience by putting data, the forefront of that is really accelerating and actually really encouraging, you know, France, Germany are actually ahead of UK. Let's look, listen, their attitudes and adoption to data. And for our American audience and America, America is more likely, I think it's 72% more likely to have a chief innovation officer than the rest of the world. I think I'm about 64% in EMEA. So America, you are still slightly ahead of us in terms of kind of bringing some of that innovation that. >> I imagine that gap is going to be shrinking though I would think. >> It is massively shrinking. >> So before we, we, we, we are just a little tight on time, but I want to hear about operational resilience and, and just your, your thought that definition, you know, define that for me a little bit, you know, put a little more meat on that bone, if you would, and talk about why, you know, what that is in, in your thinking today and then why that is so important. >> So I think inputting in, in racing, you know, operational resilience is being able to send some response to what is happening around you with people processing technology, to be able to baseline what your processes are and the services you're providing, and be able to understand when something is not performing as it should be, what we're seeing. Things like European Union, in financial services, or at the digital operational resilience act is starting to mandate that businesses have to be operational in resilient service, monitoring fraud, cyber security, and customer experience. And what we see is really operational resilience is the amount of change that can be absorbed before opportunities become risk. So having a stable foundation of operational resilience allows me to become a more agile business because I know my foundation and people can then move and adjust quickly because I have the awareness of my environment and I have the ability to appropriately react to my environment because I've thought about becoming a resilient business with my digital infrastructure is a theme. I think we're going to see in supply chain coming very soon and across all other industries, as we realize digital is our business. Nowadays. >> What's an exciting world. Isn't it, James? That you're, that you're working in right now. >> Oh, I, I love it. You know, you said, you know, eight and an eight and a half years, nine years at Splunk, I'm still smiling. You know, it is like being at the forefront of this diesel wave and being able to help people make action from that. It's an incredible place to be. I, is liberating and yeah, I can't even begin to imagine what's, you know, the opportunities are over the next few years as the world continually evolves. >> Well, every day is a school day, right? >> It is my favorite phrase >> I knew that. >> And it is, James Hodge. Thanks for joining us on theCUBE. Glad to have you on finally, after being on the other side of the camera, it's great to have you on this side. So thanks for making that transition for us. >> Thank you, John. You bet James Hodge joining us here on the cube coverage of splunk.com 21, talking about McLaren racing team speed and Splunk.

Published Date : Oct 18 2021

SUMMARY :

So splunk.com for 21, you know, Thank you for having me and back in the past other I love watching you guys every obviously on the racing ethos, the drive to digital the, about the application here a before the move to wind tunnel that was quickly, you have the authority to make a call. And I know that Splunk has, I can go and drive the same the opportunity to get the series to drive around and to modify its behaviors, And the amazing part about this and then start to move that support. of the data trends there for the next app to go and stare. going to be shrinking though that definition, you know, the ability to appropriately What's an exciting it is like being at the it's great to have you on this side. here on the cube coverage of

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JohnPERSON

0.99+

RandyPERSON

0.99+

John WallsPERSON

0.99+

Pierre Patrick AubameyangPERSON

0.99+

LandoPERSON

0.99+

JamesPERSON

0.99+

James HodgePERSON

0.99+

McLarenORGANIZATION

0.99+

Ted LassoPERSON

0.99+

Daniel RicardoPERSON

0.99+

80%QUANTITY

0.99+

Asia PacificLOCATION

0.99+

FIAORGANIZATION

0.99+

32 unitQUANTITY

0.99+

3000 daysQUANTITY

0.99+

last yearDATE

0.99+

72%QUANTITY

0.99+

87%QUANTITY

0.99+

two sitesQUANTITY

0.99+

nine yearsQUANTITY

0.99+

twoQUANTITY

0.99+

European UnionORGANIZATION

0.99+

NetherlandsLOCATION

0.99+

Lando NorrisPERSON

0.99+

32 race engineersQUANTITY

0.99+

SplunkORGANIZATION

0.99+

two yearsQUANTITY

0.99+

three peopleQUANTITY

0.99+

MelbourneLOCATION

0.99+

hundreds of thousandsQUANTITY

0.99+

FIFATITLE

0.99+

32 different individualsQUANTITY

0.99+

20 minutesQUANTITY

0.99+

21QUANTITY

0.98+

U SLOCATION

0.98+

todayDATE

0.98+

over 3200 daysQUANTITY

0.97+

AmericaLOCATION

0.97+

two left feetQUANTITY

0.97+

asphaltTITLE

0.97+

splunk.comORGANIZATION

0.97+

east AfricaLOCATION

0.97+

about 120QUANTITY

0.96+

300 kilometers an hourQUANTITY

0.96+

two megabits per secondQUANTITY

0.96+

oneQUANTITY

0.96+

SeptemberDATE

0.95+

Formula OneEVENT

0.95+

SplunkPERSON

0.95+

COVIDPERSON

0.94+

UKLOCATION

0.94+

firstQUANTITY

0.94+

18 monthsQUANTITY

0.93+

about 64%QUANTITY

0.93+

a hundred thousand data pointsQUANTITY

0.92+

Formula One RacingORGANIZATION

0.92+

aroundQUANTITY

0.92+

splunk.comOTHER

0.92+

two years agoDATE

0.9+

eight and an eight and a half yearsQUANTITY

0.89+

up to 60 HertzQUANTITY

0.89+

hundreds of sensorsQUANTITY

0.89+

IndyCarORGANIZATION

0.89+

MonacoLOCATION

0.88+

McLaren ShadowORGANIZATION

0.87+

Steve Wooledge - HP Discover Las Vegas 2014 - theCUBE - #HPDiscover


 

>>Live from Las Vegas, Nevada. It's a queue at HP. Discover 2014 brought to you by HP. >>Welcome back, everyone live here in Las Vegas for HP. Discover 2014. This is the cube we're out. We go where the action is. We're on the ground here at HP. Discover getting all the signals, sharing them with you, extracting the signal from the noise. I'm John furrier, founder of SiliconANGLE. I joined Steve Woolwich VP of product marketing at map art technologies. Great to see you welcome to the cube. Thank you. I know you got a plane to catch up, but I really wanted to squeeze you in because you guys are a leader in the big data space. You guys are in the top three, the three big whales map are Hortonworks, Cloudera. Um, you know, part of the original big data industry, which, you know, when we did the cube, when we first started the industry, you had like 30, 34 employees, total combined with three, one company Cloudera, and then Matt are announced and then Hortonworks, you guys have been part of that. Holy Trinity of, of early pioneers. Give us the update you guys are doing very, very well. Uh, we talked to you guys at the dupe summit last week. So Jack Norris for the party, give us the update what's going on with the momentum and the traction. And then I want to talk about some of the things with the product. >>Yeah. So we've seen a tremendous uptick in sales at map. Are we tripled revenue? We announced that publicly about a month ago. So we went up 300% in sales, over Q3, I'm sorry, Q1 of 2013. And I think it's really, you know, the maturity of the market. As people move more towards production, they appreciate the enterprise features. We built into the map, our distribution for Hadoop. So, um, you know, the stats I would share is that 80% of our customers triple the size of their cluster within the first 12 months and 50% of them doubled the size of the cluster because there's the, you know, they had that first production success use case and they find other applications and start rolling out more and more. So it's been great for us. >>You know, I always joke with Jack Norris, who's the VP of marketing over there. And John Frodo is the CEO about Matt bars, humbleness. You don't have the fanfare of all the height, depressed love cloud era. Now see they had done some pretty amazing things. They've had a liquidity event, so essentially kind of an IPO, if you will, that huge ex uh, financing from Intel and they're doing great big Salesforce. Hortonworks has got their open source play. You guys got, you got your heads down as well. So talk about that. How many employees you guys have and what's going on with the product? How many, how many new, what, how many products do you guys actually, >>We have, well, we have one product. So we have the map, our distribution for Hadoop, and it's got all the open source packages directly within it, but where we really innovate is in the course. So that's where we, we spent our time early on was really innovating that data platform to give everything within the Hadoop ecosystem, more reliability, better availability, performance, security scale, >>It's open source contributions to the court. And you guys put stuff on top of that, uh, >>And how it works. Yeah. And even some projects we lead the projects like with Apache Mahal and Apache drill, which is coming into beta shortly other projects, we commit and contribute back. But, um, so we take in the distribution, we're distributing all those projects, but where we really innovate is at that data platform level. So >>HP is a big data leader officer. They bought, uh, autonomy. They have HP Vertica. You guys are here. Hey, what are you doing here? Obviously we covered the cube, uh, the announcement with, uh, with, with HP Vertica, you here for that reason, is there other biz dev other activity going on other integration opportunities? >>Yeah, a few things. So, um, obviously the HP Vertica news was big. We went into general availability that solution the first week of may. So, um, what we have is the HP Vertica database integrated directly on top of our data platform. So it's this hybrid solution where you have full SQL database directly within your Hadoop distribution. Um, so it had a couple sessions on that. We had, uh, a nice panel discussion with our friends from Cloudera and Hortonworks. So really good discussion with HP about just the ecosystem and how it's evolving. The other things we're doing with HP now is, you know, we've got reference architectures on their hardware lines. So, um, you know, people can deploy Mapbox on the hardware of HP, but then also we're talking with the, um, the autonomy group about enterprise search and looking at a similar type of integration where you could have the search integrated directly into your Hadoop distro. And we've got some joint accounts we're piloting that she goes, now, >>You guys are integrating with HP pretty significantly that deals is working well. Absolutely. What's the coolest thing that you've seen with an HP that you can share. How so I asked you in the big data landscape, everyone's Bucher, you know, hunkering down, working on their feature, but outside in the real world, big data, it's not on the top of mind of the CIO, 24 7. It's probably an item that they're dressing. What have you seen and what have you been most impressed with at HP here? >>Yeah. Say, you know, this is my first HP event like this. I think the strategy they have is really good. I think in certain areas like the cloud in particular with the helium, I think they made a lot of early investments there and place some bets. And I think that's going to pay off well for them. And that marries pretty nicely with our strategy as well in terms of, you know, we have on-premise deployments, but we're also an OEM if you will, within Amazon web services. So we have a lot of agility in the cloud if you will. And I think as those products and the partnerships with HP, evolvable, we'll be playing a lot more with them in the cloud as well. >>I see that asks you a question. I want you to share with the folks out there in your own words, what is it about map bar that they may or may not understand or might not know about? Um, a little humble brag out there and share some, share some, uh, insight of, into, into map bar for folks that don't know you guys as a company and for the folks that may have a misperception of what you guys do shit share with them, with what, what map map is all about. >>Yeah. I mean, for me, I was in this space with Aster data and kind of the whole Hadoop and MapReduce area since 2008 and pretty familiar with everybody in the space. I really looked at Matt bars, the best technology hands down, you look at the Forrester wave and they rank us as having the best technology today, as well as product roadmap. I think the misperception is people think, oh, it's proprietary and close. It's actually the opposite of that. We have an unbiased open-source approach where we'll ship in support in our distribution, in the entire Apache spark stack. We're not selective over which projects within Apache spark. We support. Um, I feel like SQL on Hadoop. We support Impala as well as hive and other SQL on to do technologies, including the ability to integrate HP Vertica directly in the system. And it's because of the openness of our platform. I'd say it's actually more open because of the standards we've integrated into the data platform to support a lot of third-party tools directly within it. So there is no locked in the storage formats are all the same. The code that runs on top of the distribution from the projects is exactly the same. So you can build a project in hive or some other system, and you can port it between any of the distributions. So there isn't a, lock-in >>The end of the day, what the customers want is they want ease of integration. They want reliability. That's right. And so what are you guys working on next? What's the big, uh, product marketing roadmap that you can share with us? >>Yeah, I think for us, because of the innovations we did in the data platform allows us to support not only more applications, but more types of operational systems. So integrating things like fraud detection and recommendation engines directly with the analytical systems to really speed up that, um, accuracy and, and, uh, in targeting and detecting risk and things like that. So I think now over time, you know, Hadoop has sort of been this batch analytic type of platform, but the ability to converge operations and analytics in one system is really going to be enabled by technology like Matt BARR. >>How many employees do you guys have now? Uh, >>I'm not sure what our CFO would. Let me say that before. You can say we're over 200 at this point >>As well. And over five, the customers which got the data, you guys do summit graduations, we covered your relationship with HP during our big data SV. That was exciting. Good to see John Schroeder, big, very impressive team. I'm impressed with map. I will always have been. You guys have Stephanie kept your knitting saved. Are you going to do, and again, leading the big data space, um, and again, not proprietary is a very key word and that's really cool. So thanks for coming on. Like you really appreciate Steve. We'll be right back. This is the cube live in Las Vegas, extracting the city from the noise with map bar here at the HP discover 2014. We'll be right back here for the short break.

Published Date : Jun 12 2014

SUMMARY :

Discover 2014 brought to you by HP. Uh, we talked to you guys at the dupe summit last week. So, um, you know, the stats You guys got, you got your heads down as well. and it's got all the open source packages directly within it, but where we really innovate is in the course. And you guys put stuff on top of that, But, um, so we take in the distribution, we're distributing all those projects, but where we really innovate is uh, the announcement with, uh, with, with HP Vertica, you here for that reason, is there other biz dev other activity So it's this hybrid solution where you have full SQL How so I asked you in the big data landscape, everyone's Bucher, So we have a lot of agility in the cloud if you will. into map bar for folks that don't know you guys as a company and for the folks that may have a misperception of what you So you can build a project in hive or some What's the big, uh, product marketing roadmap that you can So I think now over time, you know, Hadoop has sort of been this batch analytic Let me say that before. And over five, the customers which got the data, you guys do summit graduations,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
John SchroederPERSON

0.99+

Steve WoolwichPERSON

0.99+

StevePERSON

0.99+

Jack NorrisPERSON

0.99+

HPORGANIZATION

0.99+

John FrodoPERSON

0.99+

threeQUANTITY

0.99+

80%QUANTITY

0.99+

Steve WooledgePERSON

0.99+

50%QUANTITY

0.99+

John furrierPERSON

0.99+

Las VegasLOCATION

0.99+

Matt BARRPERSON

0.99+

HortonworksORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

ClouderaORGANIZATION

0.99+

StephaniePERSON

0.99+

30QUANTITY

0.99+

300%QUANTITY

0.99+

firstQUANTITY

0.99+

last weekDATE

0.99+

AsterORGANIZATION

0.99+

2008DATE

0.98+

Q1DATE

0.98+

Las Vegas, NevadaLOCATION

0.98+

one productQUANTITY

0.98+

34 employeesQUANTITY

0.98+

one systemQUANTITY

0.98+

evolvableORGANIZATION

0.98+

over fiveQUANTITY

0.97+

SQLTITLE

0.97+

three big whalesQUANTITY

0.97+

MapReduceORGANIZATION

0.96+

SiliconANGLEORGANIZATION

0.96+

first 12 monthsQUANTITY

0.95+

Apache MahalORGANIZATION

0.95+

map mapORGANIZATION

0.95+

over 200QUANTITY

0.95+

24OTHER

0.94+

todayDATE

0.94+

IntelORGANIZATION

0.92+

MattPERSON

0.92+

SalesforceORGANIZATION

0.91+

2014DATE

0.9+

ImpalaTITLE

0.9+

HadoopORGANIZATION

0.89+

HP VerticaORGANIZATION

0.89+

map barORGANIZATION

0.89+

HadoopTITLE

0.86+

one companyQUANTITY

0.85+

dupe summitEVENT

0.84+

about a month agoDATE

0.83+

BucherPERSON

0.81+

Discover 2014EVENT

0.78+

first week of mayDATE

0.77+

Apache drillORGANIZATION

0.74+

#HPDiscoverORGANIZATION

0.73+

MapboxTITLE

0.73+

2013DATE

0.72+

SQL onTITLE

0.7+

art technologiesORGANIZATION

0.63+

ApacheORGANIZATION

0.61+