Ken O'Reilly & Kyle Michael Winters, Cisco | Cisco Live EU Barcelona 2020

live from Barcelona Spain it's the cube covering Cisco live 2020s brought to you by Cisco and its ecosystem partners welcome back to Barcelona Spain everybody this is the cube the leader and live tech coverage and we're here day one for us at Cisco live Barcelona even though we did a little preview game preview yesterday my good friend kena Reilly is here he's the director of customer experience at Cisco and he's joined by Kyle winters Technical Marketing engineer for the customer experience technology and transformation group it's six to go guys great to see you thanks for coming on and you know we love talking customer experience Cisco is a it's a big company big portfolio and a lot of complexity for clients and so bring it all together and customer experience is very important can't we have it a conversation with Alastair early today and he was talking about Cisco's commitment from the top chuck Robbins on down to really improve that customer experience bring essentially a digital virtual experience to your customers and you guys obviously fit into that right absolutely so about two years ago when Chuck brought in Maria Martinez that was the first step into really pushing Cisco to focus more on successful outcomes for customers so we had already always sold that way but with the complexity of technology and how fast technology is moving accelerating value realization for customers has never been bigger especially in the security space because as we've talked before you know with everything that goes on today and the fact that the bad guys are trying to get data faster quicker and different getting the technology in play operational and production it has never been more important and we're gonna dig in with Kyle with some detail and double click into the lifecycle specifically and the different points of that journey but that's really important for any customer experience is really understanding that lifecycle that maturity model can you talk about that a little bit yeah so so with us you know we've been at it for about six years when we started as Lancope so we've got a great model and you know our approach to getting outcomes for customers is completely in line with with the strategy of our products and technologies and all security so it's really important that you align with that strategy because salespeople sell and they sell you the what we sell the how we're gonna get you and so you have to understand what it is that customers need and how that technology maps because you don't want a shelf where and you don't want products or technology sitting there waiting to be implemented because you know these days especially with the move to the cloud it's got to get up and running you know within an hour so our model has always been that way we built our model with customer first and so we are you know we are the security experts we're the trusted security adviser so when we go in and work with customers we completely know exactly those outcomes that they need and with all the sort of technologies and products that we have not only with stealthWatch but the other products that sent ulema tree to us we have in Kyle will talk about how our service is completely aligned with those outcomes and the journeys that we will take our customers on yes a faster adoption means faster time to value obviously let's focus in on stealthWatch Kenneth you came in with the stealthWatch acquisitions been very successful I mean Cisco security business grew 22% last quarter we'll talk more about the sort of umbrella but let's drill in with Kyle to stealthWatch services specifically maybe you could sort of take us through you know at a high level what what the areas are and then we can sort of follow up on yeah yes so so our customer maturity model when it comes to services there's kind of three different stages to it it starts with the visibility stage so we have services around being able to deploy an operational I stealthWatch will bring in our best practices and help customers get up to speed and using the system quickly and efficiently from there we also have services around detection capabilities so being able to use automation and integrations to further the detection capabilities of stealthWatch things like being able to classify host groups through automation from source like IP address management systems things like asset discovering classification service that helped drive segmentation efforts all of these things help improve the behavioral algorithms and processes that stealthWatch is using to detect these threats in real time and then from there we have an integration stage as well - which is all about bridging the gap between stealthWatch and the rest of not only Cisco's portfolio but the entirety of our customer security portfolio as well and some of those services include things like sim integrations being able to integrate stealthWatch with Splunk we have services such as our proxy integration service as well a lot of different types of services that we're able to help get our customers to the next stage with their stealth watch environments I got a lot of questions yeah we could get to it and you guys could take it by stage so yes the sort of visibility that's where you start that's when you do the discovery right so what what are you discovering how do you actually do that discovery so a lot of that is about making sure that we've got all the flow and telemetry that we need from the various different sources of our network coming into stealthWatch feeding into the processes and algorithms that are going on there so a lot of things is not only net flow data but getting ice integrated in there as well being able to pull that user attribution data and being able to find sources of data where we maybe can convert it into net flow if it's not already net flow and be able to ingest that data as well we also in that space typically to help set up customers with a lot of different best practices that kind of get them operationalized very quickly and things like being able to build custom reports and dashboards for them will work through them which is kind of understanding the system from a base level to more of a professional fully operational level a lot of times we come in during the stage two and customers don't even understand what's going on in their network they're seeing things that maybe they've never seen before one stealthWatch turns on a great example actually as we were at a large financial firm and we were able within 30 minutes of being on site with them through our services team we were able to identify rogue DNS servers unsecured telnet going on sequel injections suspicious SMB and that's the sage traffic this is all just within 30 minutes of us coming on there and taking a look at this stuff you don't even want to look at sometimes yeah so who's doing this can I mean is this sort of all automated you've got professionals sort of overseeing it in our society yeah so the team that we have the technology transformation team when we've talked about it before that team is kind of on the bleeding edge of helping customers and you know a lot of these services that that Kyle talked about is we are building services that customers are consuming based on their needs today and that's why the team is very flexible we build you know a lot of these integrations with those requirements in mind and then we take those and we can scale that so these are all field engineers we have developers so in in essence it is like a mini development team that goes out and works on the specific things that customers need to protect themselves okay and my understanding is there's a there's an ongoing learning with the customers and a it's a transfer of knowledge from day one right there the customer is with you on this in each of these phases and you're sort of learning as they go along and that's sort of part of the transfer of knowledge it's I would say even a tool a transfer knowledge too because we're teaching them our best practices and how to best be successful with these systems but we also learn from them what's going on what are the trends that they're seeing how can we help get them to the next stage and that's where our technology and transformation group comes and they're able to be on the cutting edge here the problems that the customers are talking about and be able to take stealthWatch to the next level okay let's dig it to the detection phase so this is where you're classifying things like host groups etc I'm interested in how that happens is that you know it used to be you'd get everybody in a room you start drawing pictures and that just doesn't scale it's too complicated today so can you auto classify stuff how does that all work and use them oh yeah genius math to do that so so traditionally the the you know the MIT's a manual effort to classify your whole group somebody who's very familiar with the network comes in and they say okay these are the DNS servers these are the web servers these are this network scanners oh oh today but the problem is that today's networks are so dynamic and fluid that what the network looks like today is not necessarily going to be the same tomorrow so there needs to be that relief from the analyst to be able to come in there needs to be that automation that they can go in each day and know that their system is going to be classified accurately and meaningfully that way the behavioral detection that is built into stealthWatch is also driven and accurate and meaningful - so we have this service so for example our host group automation service and through that we're able to pull in telemetry and data from various different sources such as IP address management systems cmdbs we can do threat feeds as well external threat feeds and we're able to drive the classification based off of the metadata that we see from these different sources so we're able to write different types of automation rules that essentially pull this data in detect the different patterns that we're seeing with that metadata and then drive that classification stealthWatch that way when you come in that next day you know that your network scanners are gonna be classified as Network scanners and your web servers are gonna be web servers etc etc so you you have that integrity of data coming in every single day yeah so a lot of different data sources data quality obviously really important I mean you'd love it if somebody had like you know a single CMDB from ServiceNow boom and pop it right in but that's not always the case we never always the case there's always a challenge and that's where kind of our services engineers come in they're able to work through these different environments and understand what the main admit what the metadata is where we need to go and how we need to classify and driving the classification from there so it does require a little bit of a human element on the front-end but once we get it worked out it can be fully automated you know there's lots of different sources and the quality of the data is not always there we've seen for example customers who have Excel spreadsheets and everything is just you're all over the place and we have to figure out a way to work with that and that's part of what our engineer success is so before we get to the integration piece can you been following this industry for for a while um security is really exciting space it's growing like crazy it's really hard I did a braking analysis piece you know a few weeks ago just talking about the fragmentation in the business you see startups coming out like crazy big valuations at the same time you see companies like Cisco with big portfolios yeah you mentioned Splunk before and they've kind of become a gold standard for for log files but very complex and you talk to security practitioners and they'll tell you our number one problem is just skillsets so get you know paint a picture of what's going on in the security world and what's in the house cisco is trying to address that so the security teams the analysts all the way up the management chain to the sea so they're under tremendous pressure their businesses are growing and so when their businesses are growing the sort of a tax base is growing and the business is growing faster than they can protect it so with the sort of increase in the economy more money more investment to build more point products so you've got a very stressed team a lot of turnover skill sets aren't great and what do we do as an industry we just give them more technology right more tools more tools complexity avalanche ok they're buried all right so we feel and we've made great strides within the security group within Cisco is we're taking the products that we have and we're integrating them under one platform so that it is in a bunch of point products and so that the that's what everybody else is doing I mean the other guys are acquiring companies then they're trying to integrate those because the customers are saying I don't need another point protocol yeah yeah it's too much so you know with us that's the way we approach it and now with the platform that's going to be launching this year the cisco threat response that we've launched you're gonna see later on in this year that we will be selling and positioned in implementing the entire platform yeah so I have a stat I came up with this and my one of my analyses it was the the worldwide economy is like 86 trillion and we spent about 0.014 percent on security so we're barely scratching the surface so this sort of tools avalanche probably isn't gonna change though integration becomes an extremely important aspect of the customer journeys and it's through that and to continue on that point you just made as well - I believe in our Cisco cybersecurity report from 2017 only fifty four six percent or fifty seven percent of actual threats are being investigated remediated so there's always that need to kind of help build bridge that gap make it easier for people to understand these threats and and mitigate and prioritize know what to go after right which part the integration exactly so we do have a lot of different integration services as well - for example I mentioned our sim integration service one thing that we can really do that's really awesome with that is we're able to deploy for example with Splunk a full-fledged stealthWatch for Splunk application that allows you to utilize stealth watches capabilities directly inside of Splunk without having to actually store an index any data inside of Splunk so all these api's are on demand inside of this app and available throughout the rest of the Splunk capabilities as well so you can extend it into other search reporting correlate that against other sets of data that you have and Splunk you can do quite a bit with it we also have other ways absolutely advantage of that is just obviously integration you're not leaving the environment plus its cost you're saving customers money a lot of a lot of customers kind of see their sim as a single pane of glass so being able to bring that stealthWatch value into that single pane is a huge win for our customers not to mention that reduction in licensing costs as well we have other ways to that we can reduce licensing costs some customers like to send their flow data into their sim for deeper analytics and long-term retention and we have a service we call it our flow adapter service and through this service we're essentially able to take buy flow off of the stealthWatch flow collectors and the buy flow is essentially when the raw net flow hits the stealthWatch flow collectors it's coming from multiple different routers and switches on the network this is gets converted into bi flow which is bi-directional deduplicated stitched together flow records so right there by sending that data into a sim or a data Lake as opposed to ronette flow we see data reduction cost anywhere from 15 to 80% depending on how the customers network is architected great any any favorite customer examples you have that you can share where ya guys have gone in you know provided these services and and it's had an outcome that got the customer excited or you found some bad guys or there's one that's one of my favorites so we have this service we call it our asset discovering classification service and I mentioned the host tree of automation service that's if you have some sort of authoritative source we can pull that information in but if a customer doesn't have that authoritative source they don't know what's on their network and a lot of times too they want to do a segmentation effort they're undergoing network segmentation but they need to understand what's on their network how these devices are communicating and that's where our asset discovery classification service comes in we're able to pull in telemetry not just from stealthWatch but other sources such as ice tetration Active Directory I Pam's again as well and we're able to essentially profile these different devices based off of the nature of their behavior so we were at a kind of a large technology company and we were essentially in this effort trying to segment their security cameras and upon segmenting their security cameras we were able to build this report where we can see the security camera and how its communicating with the other parts of the network and we noticed that there was essentially two IP addresses from inside of their network that were accessing all these different security cameras but they were not authorized to so with this service we were able to see that these different these two hosts were unauthorized actually accessing these devices that got reported up through the management chain and ultimately those two employees were no longer at that technology permanence that was discovered nice to love it alright bring us on we're here in the dev net zone sort of all about hit for structures code and software and and and and talk a little bit about the futures where you see this all going yeah so for us for Cisco security the future is really bright we've either built or acquired a portfolio that the customers really need that get absolute outcomes that customers need and through the customer experience organization certainly stealthWatch is fitting into the broader play to to get customers who have all those technologies get that operational and get them success so when we talked last summer I told you the jury was still out we would see how the journeys gonna go and the journey has started it has gotten much better since the summer and this year I think we're gonna be doing some great things for our customers just we can't get in too much of the business but stealthWatch customers are still expanding because I think we told you last time customers can never get enough stealthWatch okay the attack surface is too big right so so we we feel really good about that and the other technologies that they're building really fit into what customers need we're going to the cloud so they're gonna be able to consume cloud on-prem hybrid protect networks the campus protect their cloud infrastructure so we're really checking a lot of boxes in our group brings it all together and takes all the complexity out of that for customers just to get them the outcomes that I named us Cisco is one of my four star security companies for 2020 okay based on spending data that we share from our friends at ETR and the reason was because cisco has both a large presence in the market and but also you have spending momentum I mentioned 22% you know growth last quarter and the security business but you've also got the expertise you put your money where your mouth is you know the big portfolio which helps if you can bring it together and do these types of integrations it simplifies the customers environment and so that's a winner in my book so I named you along with some other high fliers right you know and you see some really interesting startups coming out and probably acquisition targets probably something that aren't your radar but guys thanks so much for coming on the cube thank you thank you I keep it right there everybody we'll be back with our next guest is a Dave Volante for the cubes 2 min Amanda John Faria are also in the house at Cisco live Barcelona right back

Published Date : Jan 28 2020

SUMMARY :

and so that the that's what everybody

ENTITIES

Entity	Category	Confidence
fifty seven percent	QUANTITY	0.99+
two employees	QUANTITY	0.99+
Cisco	ORGANIZATION	0.99+
86 trillion	QUANTITY	0.99+
2017	DATE	0.99+
cisco	ORGANIZATION	0.99+
2020	DATE	0.99+
Excel	TITLE	0.99+
Ken O'Reilly	PERSON	0.99+
22%	QUANTITY	0.99+
tomorrow	DATE	0.99+
Chuck	PERSON	0.99+
kena Reilly	PERSON	0.99+
15	QUANTITY	0.99+
today	DATE	0.99+
Kenneth	PERSON	0.99+
Dave Volante	PERSON	0.99+
fifty four six percent	QUANTITY	0.99+
Kyle	PERSON	0.99+
2 min	QUANTITY	0.99+
two hosts	QUANTITY	0.99+
Kyle winters	PERSON	0.99+
last quarter	DATE	0.99+
yesterday	DATE	0.99+
about 0.014 percent	QUANTITY	0.98+
MIT	ORGANIZATION	0.98+
each day	QUANTITY	0.98+
this year	DATE	0.98+
last summer	DATE	0.98+
Splunk	ORGANIZATION	0.98+
first step	QUANTITY	0.97+
Amanda John Faria	PERSON	0.97+
Maria Martinez	PERSON	0.97+
one platform	QUANTITY	0.97+
30 minutes	QUANTITY	0.96+
80%	QUANTITY	0.96+
one	QUANTITY	0.96+
Kyle	LOCATION	0.96+
two IP addresses	QUANTITY	0.95+
each	QUANTITY	0.95+
one thing	QUANTITY	0.95+
both	QUANTITY	0.94+
Barcelona Spain	LOCATION	0.94+
Cisco live	ORGANIZATION	0.93+
Kyle	ORGANIZATION	0.93+
an hour	QUANTITY	0.93+
30 minutes	QUANTITY	0.92+
single pane	QUANTITY	0.92+
six	QUANTITY	0.92+
Lancope	ORGANIZATION	0.91+
Cisco Live	ORGANIZATION	0.9+
Kyle Michael Winters	PERSON	0.9+
about six years	QUANTITY	0.9+
early today	DATE	0.89+
lot	QUANTITY	0.88+
three different stages	QUANTITY	0.88+
four star	QUANTITY	0.87+
two years ago	DATE	0.86+
Robbins	PERSON	0.83+
ETR	ORGANIZATION	0.83+
single day	QUANTITY	0.83+
day one	QUANTITY	0.83+
ServiceNow	ORGANIZATION	0.83+
a few weeks ago	DATE	0.81+
Barcelona	LOCATION	0.81+
stage two	QUANTITY	0.8+
a lot of questions	QUANTITY	0.79+
next day	DATE	0.77+

Ofer Bengal | O'Reilly Velocity Conference 2013

>>Okay. We're back live here. The velocity conference is Santa Clara live. This is the cube Silicon angle's flagship program. We go out to the events, restrict the signal from the noise. I'm John furry, the founder of Silicon angle. And our next guest is CEO of guarantee a data, his here in the cube. Welcome to the cube. Thank you. It's great to be here. We are at the velocity conference, which is really the intersection of infrastructure and application development, kind of in a holistic way, full stack new technologies. Um, so first tell us a little about your company and what you guys are doing here at velocity. >>Well, we deal with a new type of database, which took the developers community by storm. This is a no sequel in memory database called Radis where this is very, very fast. You know, it, it processes hundreds of thousand transactions per second at sub milliseconds. And this is all about performance. So velocity is the right right place to be when you deal with Radis. >>So why, why red is, first of all, is taking everyone by storm. We use it, um, great technology. Um, why, why, why is it so popular? >>Well is, has many attractive datatypes and commands, which are very useful in many, many use cases today for almost any application. So that's why, you know, developers really love it, >>The in-memory database. So we cover a lot of storage, SSDs and infrastructure. Um, SSDs had brought up, uh, with flash, a whole nother level of caching on the level for storage area networks really exploded open source scale-out. Um, but people still need the real fast, low latency data, no doubt. And that's where in memory, but developers don't need to be storage gurus to do that. So is that an area that you guys are? >>Yes, definitely. The basic idea is to provide developers what they need in terms of database needs, without all the hassle of, you know, operating those databases. So with our products, which with our product, which is called the Radice cloud right now, this product is provided as a fully managed hosting service over various clouds and platforms as a service. So with this product, the user does not need to do anything, simply send your data and forget about it. We take care of scalability, high availability, stabilizing performance, and all the ops. >>So one of the things about the web that's really challenging it's asynchronous, right? So persistence is a really big thing. How do you guys look at that channel? >>Okay. We have built a whole suite of high availability provisions for Radis. First of all, you can with a click of a button with a checkbox, you can replicate your data set within the same data center, uh, and when a node fails, and this is something which happens in the cloud almost everyday, we immediately, uh, switch your data to the, to the replica and, uh, you are up and running without any, any problem whatsoever. So this is one thing we recently last week, we announced another layer, which is multi a Z replication, which means that you can with a click of a button, replicate your data set to another data center. So if the entire data center fails, we immediately use the replica in the other data center, the backup replica. And again, you're up and running without any interruption. >>This really is a value proposition. That's as a dream scenario for developers with dealing with the cloud. I mean, because your alternative is to provision bare metal, exact load Linux systems >>Administrator. This is crazy. I mean, >>Oh, and cuing too, is another another issue. I mean, how do you know? So if I'm going to manage large volumes of data set to say that, um, my side becomes popular, my application becomes popular because, uh, someone shitted, virally, I want to have that queuing and that persistence that's really, really important. I might not have the time to provision a new server, a new database. So what you're saying is if I get this right with Reddis cloud, I can spin up in dynamically handle that those kinds of replication and persistence >>Over, you know, basic red. Is that right? Absolutely. Absolutely. You know, our native red is the open source is basically, uh, limited in scalability. You cannot grow beyond the single master server. Now the community is working for a while and something which is called Reddis cluster, which is supposed to solve all that. However, this is taking for a very long time with our Reddis cloud, you can grow your dataset from megabytes to Jigga bytes, to terabytes and even more, and all that is done in a fully automated manner without you do not need to deal with nodes, clustering, scaling, stuff like that. And while supporting all the data types and commands of Radis, which is really, really unique. >>Yeah. I mean, I got to say, you know, one of the challenges with the cloud is orchestration, right? And so that's one element. So automation has been a big problem for folks on premise on large enterprises and application developers. The other challenge has been real time. So a lot of apps need to have real time, like no JS or things of that nature. So how does a developer, I'm a developer and I'm, I want real time. I want persistence. And I want to have the flexibility to, to, to just push code and everything take care of itself. How does Reddis help me there? >>Well, red is, as I said, is the fastest data store available today, much faster than anything else. Like, you know, people talk to them about HANA SAP HANA, uh, red is, is, uh, 10 X, you know, in terms of speed, we are talking about hundreds of thousands transactions at sub-millisecond latencies. Whenever you want performance, whenever you need performance, the best database for that is rarely snow. >>Okay. So I got to ask you the question, first of all, big fan, really glad you're here in the cube. So we like, we like what you're doing, um, for the folks that don't understand what you guys are doing or are red or new to Retis. Why is it so good? Why is it so popular and what, what benefits does it provide the developer and say a business that wants to use that? >>I would say use cases, use cases, use cases whenever, whenever you, whenever you need a job management, for example, you know, signaling inside your, your, your application. So platforms such as sidekick, sidekicks, you know, et cetera, use Radice whenever you need, uh, stuff like, uh, Twitter type functionality, you know, followers, et cetera. You have a built in clone within radius for that whenever you need, uh, you know, uh, fast analytics, there is nothing better than red is caching, you know, already since replacing Memcached totally today, new apps, uh, page ranks, post ranks, you know, stuff like that. All these are great use cases for remedies. And if you, you know, in any one of those various is the best for that. Yeah. >>Well, congratulations, really like what you guys are doing. Um, and you're at the show here. What are you showing here at velocity? Again? Congratulations on your success. Well-deserved reticence is really becoming the standard. What, what are you guys doing here at velocity and what are you guys showing? >>We demonstrate, uh, first of all, the service we demonstrate the performance. You can, you know, if you have a minute drop over to our booth next door here, and we show the great performance, you know, we are showing hundred thousands of transactions, you know, with large databases in sub-millisecond latencies. This is, you know, this is real life and we are demonstrating our high availability with multi a Z replication and instant out of fail over. >>Okay, well, we are here with Ofer B gal with the system guarantee, a system data, um, Reddis cloud, great product, congratulations on your success. Thanks for coming inside the cube. This is the velocity conference. This is the kind of technology folks that velocity is about the loss of these, the intersection between a, almost a systems view of user experience, user design with cloud and infrastructure or dev ops, whatever you want to call it, we'll figure out a word for it, but it's really kind of coming together. I guess we call it velocity conference. This is the modern infrastructure that a lot of the web-scale companies or hyperscale companies are using and developers, developers who are small-scale today. We'll be, we'll be big scale. We'll use things like redness. This is what it's all about. This is the Silicon ankles flagship program. Go to youtube.com/looking angled to watch the videos go to siliconangle.com to get, to see the blog posts and coverage. We'll be right back with our next guest. After the short break.

Published Date : Jun 20 2013

SUMMARY :

This is the cube Silicon angle's flagship is the right right place to be when you deal with Radis. So why, why red is, first of all, is taking everyone by storm. you know, developers really love it, So is that an area that you guys are? you know, operating those databases. So one of the things about the web that's really challenging it's asynchronous, right? which means that you can with a click of a button, replicate your data set to another data center. I mean, because your alternative is to provision bare metal, exact load Linux systems I mean, I might not have the time to provision a new server, a new database. this is taking for a very long time with our Reddis cloud, you can grow your dataset So a lot of apps need to have real you know, in terms of speed, we are talking about hundreds of thousands transactions So we like, uh, stuff like, uh, Twitter type functionality, you know, Well, congratulations, really like what you guys are doing. This is, you know, this is real life and This is the kind of technology folks that velocity

ENTITIES

Entity	Category	Confidence
Silicon angle	ORGANIZATION	0.99+
last week	DATE	0.99+
siliconangle.com	OTHER	0.99+
Memcached	TITLE	0.99+
today	DATE	0.98+
first	QUANTITY	0.98+
John furry	PERSON	0.98+
one element	QUANTITY	0.97+
Ofer Bengal	PERSON	0.97+
one	QUANTITY	0.97+
Reddis	ORGANIZATION	0.96+
hundred thousands	QUANTITY	0.96+
First	QUANTITY	0.95+
hundreds of thousand transactions per second	QUANTITY	0.94+
Linux	TITLE	0.93+
O'Reilly Velocity Conference 2013	EVENT	0.93+
one thing	QUANTITY	0.93+
Reddis	TITLE	0.92+
HANA SAP HANA	TITLE	0.91+
Santa Clara	LOCATION	0.91+
single master server	QUANTITY	0.91+
Twitter	ORGANIZATION	0.91+
Silicon ankles	ORGANIZATION	0.89+
youtube.com/looking	OTHER	0.88+
Retis	ORGANIZATION	0.86+
transactions	QUANTITY	0.85+
Reddis cloud	TITLE	0.84+
Radice	ORGANIZATION	0.84+
red	TITLE	0.79+
Radis	ORGANIZATION	0.79+
hundreds of thousands transactions	QUANTITY	0.78+
red	ORGANIZATION	0.74+
Ofer	PERSON	0.65+
10 X	TITLE	0.63+
about	QUANTITY	0.51+

Breaking Analysis: Snowflake Summit 2022...All About Apps & Monetization

>> From theCUBE studios in Palo Alto in Boston, bringing you data driven insights from theCUBE and ETR. This is "Breaking Analysis" with Dave Vellante. >> Snowflake Summit 2022 underscored that the ecosystem excitement which was once forming around Hadoop is being reborn, escalated and coalescing around Snowflake's data cloud. What was once seen as a simpler cloud data warehouse and good marketing with the data cloud is evolving rapidly with new workloads of vertical industry focus, data applications, monetization, and more. The question is, will the promise of data be fulfilled this time around, or is it same wine, new bottle? Hello, and welcome to this week's Wikibon CUBE Insights powered by ETR. In this "Breaking Analysis," we'll talk about the event, the announcements that Snowflake made that are of greatest interest, the major themes of the show, what was hype and what was real, the competition, and some concerns that remain in many parts of the ecosystem and pockets of customers. First let's look at the overall event. It was held at Caesars Forum. Not my favorite venue, but I'll tell you it was packed. Fire Marshall Full, as we sometimes say. Nearly 10,000 people attended the event. Here's Snowflake's CMO Denise Persson on theCUBE describing how this event has evolved. >> Yeah, two, three years ago, we were about 1800 people at a Hilton in San Francisco. We had about 40 partners attending. This week we're close to 10,000 attendees here. Almost 10,000 people online as well, and over over 200 partners here on the show floor. >> Now, those numbers from 2019 remind me of the early days of Hadoop World, which was put on by Cloudera but then Cloudera handed off the event to O'Reilly as this article that we've inserted, if you bring back that slide would say. The headline it almost got it right. Hadoop World was a failure, but it didn't have to be. Snowflake has filled the void created by O'Reilly when it first killed Hadoop World, and killed the name and then killed Strata. Now, ironically, the momentum and excitement from Hadoop's early days, it probably could have stayed with Cloudera but the beginning of the end was when they gave the conference over to O'Reilly. We can't imagine Frank Slootman handing the keys to the kingdom to a third party. Serious business was done at this event. I'm talking substantive deals. Salespeople from a host sponsor and the ecosystems that support these events, they love physical. They really don't like virtual because physical belly to belly means relationship building, pipeline, and deals. And that was blatantly obvious at this show. And in fairness, all theCUBE events that we've done year but this one was more vibrant because of its attendance and the action in the ecosystem. Ecosystem is a hallmark of a cloud company, and that's what Snowflake is. We asked Frank Slootman on theCUBE, was this ecosystem evolution by design or did Snowflake just kind of stumble into it? Here's what he said. >> Well, when you are a data clouding, you have data, people want to do things with that data. They don't want just run data operations, populate dashboards, run reports. Pretty soon they want to build applications and after they build applications, they want build businesses on it. So it goes on and on and on. So it drives your development to enable more and more functionality on that data cloud. Didn't start out that way, you know, we were very, very much focused on data operations. Then it becomes application development and then it becomes, hey, we're developing whole businesses on this platform. So similar to what happened to Facebook in many ways. >> So it sounds like it was maybe a little bit of both. The Facebook analogy is interesting because Facebook is a walled garden, as is Snowflake, but when you come into that garden, you have assurances that things are going to work in a very specific way because a set of standards and protocols is being enforced by a steward, i.e. Snowflake. This means things run better inside of Snowflake than if you try to do all the integration yourself. Now, maybe over time, an open source version of that will come out but if you wait for that, you're going to be left behind. That said, Snowflake has made moves to make its platform more accommodating to open source tooling in many of its announcements this week. Now, I'm not going to do a deep dive on the announcements. Matt Sulkins from Monte Carlo wrote a decent summary of the keynotes and a number of analysts like Sanjeev Mohan, Tony Bear and others are posting some deeper analysis on these innovations, and so we'll point to those. I'll say a few things though. Unistore extends the type of data that can live in the Snowflake data cloud. It's enabled by a new feature called hybrid tables, a new table type in Snowflake. One of the big knocks against Snowflake was it couldn't handle and transaction data. Several database companies are creating this notion of a hybrid where both analytic and transactional workloads can live in the same data store. Oracle's doing this for example, with MySQL HeatWave and there are many others. We saw Mongo earlier this month add an analytics capability to its transaction system. Mongo also added sequel, which was kind of interesting. Here's what Constellation Research analyst Doug Henschen said about Snowflake's moves into transaction data. Play the clip. >> Well with Unistore, they're reaching out and trying to bring transactional data in. Hey, don't limit this to analytical information and there's other ways to do that like CDC and streaming but they're very closely tying that again to that marketplace, with the idea of bring your data over here and you can monetize it. Don't just leave it in that transactional database. So another reach to a broader play across a big community that they're building. >> And you're also seeing Snowflake expand its workload types in its unique way and through Snowpark and its stream lit acquisition, enabling Python so that native apps can be built in the data cloud and benefit from all that structure and the features that Snowflake is built in. Hence that Facebook analogy, or maybe the App Store, the Apple App Store as I propose as well. Python support also widens the aperture for machine intelligence workloads. We asked Snowflake senior VP of product, Christian Kleinerman which announcements he thought were the most impactful. And despite the who's your favorite child nature of the question, he did answer. Here's what he said. >> I think the native applications is the one that looks like, eh, I don't know about it on the surface but he has the biggest potential to change everything. That's create an entire ecosystem of solutions for within a company or across companies that I don't know that we know what's possible. >> Snowflake also announced support for Apache Iceberg, which is a new open table format standard that's emerging. So you're seeing Snowflake respond to these concerns about its lack of openness, and they're building optionality into their cloud. They also showed some cost op optimization tools both from Snowflake itself and from the ecosystem, notably Capital One which launched a software business on top of Snowflake focused on optimizing cost and eventually the rollout data management capabilities, and all kinds of features that Snowflake announced that the show around governance, cross cloud, what we call super cloud, a new security workload, and they reemphasize their ability to read non-native on-prem data into Snowflake through partnerships with Dell and Pure and a lot more. Let's hear from some of the analysts that came on theCUBE this week at Snowflake Summit to see what they said about the announcements and their takeaways from the event. This is Dave Menninger, Sanjeev Mohan, and Tony Bear, roll the clip. >> Our research shows that the majority of organizations, the majority of people do not have access to analytics. And so a couple of the things they've announced I think address those or help to address those issues very directly. So Snowpark and support for Python and other languages is a way for organizations to embed analytics into different business processes. And so I think that'll be really beneficial to try and get analytics into more people's hands. And I also think that the native applications as part of the marketplace is another way to get applications into people's hands rather than just analytical tools. Because most people in the organization are not analysts. They're doing some line of business function. They're HR managers, they're marketing people, they're sales people, they're finance people, right? They're not sitting there mucking around in the data, they're doing a job and they need analytics in that job. >> Primarily, I think it is to contract this whole notion that once you move data into Snowflake, it's a proprietary format. So I think that's how it started but it's usually beneficial to the customers, to the users because now if you have large amount of data in paket files you can leave it on S3, but then you using the Apache Iceberg table format in Snowflake, you get all the benefits of Snowflake's optimizer. So for example, you get the micro partitioning, you get the metadata. And in a single query, you can join, you can do select from a Snowflake table union and select from an iceberg table and you can do store procedure, user defined function. So I think what they've done is extremely interesting. Iceberg by itself still does not have multi-table transactional capabilities. So if I'm running a workload, I might be touching 10 different tables. So if I use Apache Iceberg in a raw format, they don't have it, but Snowflake does. So the way I see it is Snowflake is adding more and more capabilities right into the database. So for example, they've gone ahead and added security and privacy. So you can now create policies and do even cell level masking, dynamic masking, but most organizations have more than Snowflake. So what we are starting to see all around here is that there's a whole series of data catalog companies, a bunch of companies that are doing dynamic data masking, security and governance, data observability which is not a space Snowflake has gone into. So there's a whole ecosystem of companies that is mushrooming. Although, you know, so they're using the native capabilities of Snowflake but they are at a level higher. So if you have a data lake and a cloud data warehouse and you have other like relational databases, you can run these cross platform capabilities in that layer. So that way, you know, Snowflake's done a great job of enabling that ecosystem. >> I think it's like the last mile, essentially. In other words, it's like, okay, you have folks that are basically that are very comfortable with Tableau but you do have developers who don't want to have to shell out to a separate tool. And so this is where Snowflake is essentially working to address that constituency. To Sanjeev's point, and I think part of it, this kind of plays into it is what makes this different from the Hadoop era is the fact that all these capabilities, you know, a lot of vendors are taking it very seriously to put this native. Now, obviously Snowflake acquired Streamlit. So we can expect that the Streamlit capabilities are going to be native. >> I want to share a little bit about the higher level thinking at Snowflake, here's a chart from Frank Slootman's keynote. It's his version of the modern data stack, if you will. Now, Snowflake of course, was built on the public cloud. If there were no AWS, there would be no Snowflake. Now, they're all about bringing data and live data and expanding the types of data, including structured, we just heard about that, unstructured, geospatial, and the list is going to continue on and on. Eventually I think it's going to bleed into the edge if we can figure out what to do with that edge data. Executing on new workloads is a big deal. They started with data sharing and they recently added security and they've essentially created a PaaS layer. We call it a SuperPaaS layer, if you will, to attract application developers. Snowflake has a developer-focused event coming up in November and they've extended the marketplace with 1300 native apps listings. And at the top, that's the holy grail, monetization. We always talk about building data products and we saw a lot of that at this event, very, very impressive and unique. Now here's the thing. There's a lot of talk in the press, in the Wall Street and the broader community about consumption-based pricing and concerns over Snowflake's visibility and its forecast and how analytics may be discretionary. But if you're a company building apps in Snowflake and monetizing like Capital One intends to do, and you're now selling in the marketplace, that is not discretionary, unless of course your costs are greater than your revenue for that service, in which case is going to fail anyway. But the point is we're entering a new error where data apps and data products are beginning to be built and Snowflake is attempting to make the data cloud the defacto place as to where you're going to build them. In our view they're well ahead in that journey. Okay, let's talk about some of the bigger themes that we heard at the event. Bringing apps to the data instead of moving the data to the apps, this was a constant refrain and one that certainly makes sense from a physics point of view. But having a single source of data that is discoverable, sharable and governed with increasingly robust ecosystem options, it doesn't have to be moved. Sometimes it may have to be moved if you're going across regions, but that's unique and a differentiator for Snowflake in our view. I mean, I'm yet to see a data ecosystem that is as rich and growing as fast as the Snowflake ecosystem. Monetization, we talked about that, industry clouds, financial services, healthcare, retail, and media, all front and center at the event. My understanding is that Frank Slootman was a major force behind this shift, this development and go to market focus on verticals. It's really an attempt, and he talked about this in his keynote to align with the customer mission ultimately align with their objectives which not surprisingly, are increasingly monetizing with data as a differentiating ingredient. We heard a ton about data mesh, there were numerous presentations about the topic. And I'll say this, if you map the seven pillars Snowflake talks about, Benoit Dageville talked about this in his keynote, but if you map those into Zhamak Dehghani's data mesh framework and the four principles, they align better than most of the data mesh washing that I've seen. The seven pillars, all data, all workloads, global architecture, self-managed, programmable, marketplace and governance. Those are the seven pillars that he talked about in his keynote. All data, well, maybe with hybrid tables that becomes more of a reality. Global architecture means the data is globally distributed. It's not necessarily physically in one place. Self-managed is key. Self-service infrastructure is one of Zhamak's four principles. And then inherent governance. Zhamak talks about computational, what I'll call automated governance, built in. And with all the talk about monetization, that aligns with the second principle which is data as product. So while it's not a pure hit and to its credit, by the way, Snowflake doesn't use data mesh in its messaging anymore. But by the way, its customers do, several customers talked about it. Geico, JPMC, and a number of other customers and partners are using the term and using it pretty closely to the concepts put forth by Zhamak Dehghani. But back to the point, they essentially, Snowflake that is, is building a proprietary system that substantially addresses some, if not many of the goals of data mesh. Okay, back to the list, supercloud, that's our term. We saw lots of examples of clouds on top of clouds that are architected to spin multiple clouds, not just run on individual clouds as separate services. And this includes Snowflake's data cloud itself but a number of ecosystem partners that are headed in a very similar direction. Snowflake still talks about data sharing but now it uses the term collaboration in its high level messaging, which is I think smart. Data sharing is kind of a geeky term. And also this is an attempt by Snowflake to differentiate from everyone else that's saying, hey, we do data sharing too. And finally Snowflake doesn't say data marketplace anymore. It's now marketplace, accounting for its application market. Okay, let's take a quick look at the competitive landscape via this ETR X-Y graph. Vertical access remembers net score or spending momentum and the x-axis is penetration, pervasiveness in the data center. That's what ETR calls overlap. Snowflake continues to lead on the vertical axis. They guide it conservatively last quarter, remember, so I wouldn't be surprised if that lofty height, even though it's well down from its earlier levels but I wouldn't be surprised if it ticks down again a bit in the July survey, which will be in the field shortly. Databricks is a key competitor obviously at a strong spending momentum, as you can see. We didn't draw it here but we usually draw that 40% line or red line at 40%, anything above that is considered elevated. So you can see Databricks is quite elevated. But it doesn't have the market presence of Snowflake. It didn't get to IPO during the bubble and it doesn't have nearly as deep and capable go-to market machinery. Now, they're getting better and they're getting some attention in the market, nonetheless. But as a private company, you just naturally, more people are aware of Snowflake. Some analysts, Tony Bear in particular, believe Mongo and Snowflake are on a bit of a collision course long term. I actually can see his point. You know, I mean, they're both platforms, they're both about data. It's long ways off, but you can see them sort of in a similar path. They talk about kind of similar aspirations and visions even though they're quite in different markets today but they're definitely participating in similar tam. The cloud players are probably the biggest or definitely the biggest partners and probably the biggest competitors to Snowflake. And then there's always Oracle. Doesn't have the spending velocity of the others but it's got strong market presence. It owns a cloud and it knows a thing about data and it definitely is a go-to market machine. Okay, we're going to end on some of the things that we heard in the ecosystem. 'Cause look, we've heard before how particular technology, enterprise data warehouse, data hubs, MDM, data lakes, Hadoop, et cetera. We're going to solve all of our data problems and of course they didn't. And in fact, sometimes they create more problems that allow vendors to push more incremental technology to solve the problems that they created. Like tools and platforms to clean up the no schema on right nature of data lakes or data swamps. But here are some of the things that I heard firsthand from some customers and partners. First thing is, they said to me that they're having a hard time keeping up sometimes with the pace of Snowflake. It reminds me of AWS in 2014, 2015 timeframe. You remember that fire hose of announcements which causes increased complexity for customers and partners. I talked to several customers that said, well, yeah this is all well and good but I still need skilled people to understand all these tools that I'm integrated in the ecosystem, the catalogs, the machine learning observability. A number of customers said, I just can't use one governance tool, I need multiple governance tools and a lot of other technologies as well, and they're concerned that that's going to drive up their cost and their complexity. I heard other concerns from the ecosystem that it used to be sort of clear as to where they could add value you know, when Snowflake was just a better data warehouse. But to point number one, they're either concerned that they'll be left behind or they're concerned that they'll be subsumed. Look, I mean, just like we tell AWS customers and partners, you got to move fast, you got to keep innovating. If you don't, you're going to be left. Either if your customer you're going to be left behind your competitor, or if you're a partner, somebody else is going to get there or AWS is going to solve the problem for you. Okay, and there were a number of skeptical practitioners, really thoughtful and experienced data pros that suggested that they've seen this movie before. That's hence the same wine, new bottle. Well, this time around I certainly hope not given all the energy and investment that is going into this ecosystem. And the fact is Snowflake is unquestionably making it easier to put data to work. They built on AWS so you didn't have to worry about provisioning, compute and storage and networking and scaling. Snowflake is optimizing its platform to take advantage of things like Graviton so you don't have to, and they're doing some of their own optimization tools. The ecosystem is building optimization tools so that's all good. And firm belief is the less expensive it is, the more data will get brought into the data cloud. And they're building a data platform on which their ecosystem can build and run data applications, aka data products without having to worry about all the hard work that needs to get done to make data discoverable, shareable, and governed. And unlike the last 10 years, you don't have to be a keeper and integrate all the animals in the Hadoop zoo. Okay, that's it for today, thanks for watching. Thanks to my colleague, Stephanie Chan who helps research "Breaking Analysis" topics. Sometimes Alex Myerson is on production and manages the podcasts. Kristin Martin and Cheryl Knight help get the word out on social and in our newsletters, and Rob Hof is our editor in chief over at Silicon, and Hailey does some wonderful editing, thanks to all. Remember, all these episodes are available as podcasts wherever you listen. All you got to do is search Breaking Analysis Podcasts. I publish each week on wikibon.com and siliconangle.com and you can email me at David.Vellante@siliconangle.com or DM me @DVellante. If you got something interesting, I'll respond. If you don't, I'm sorry I won't. Or comment on my LinkedIn post. Please check out etr.ai for the best survey data in the enterprise tech business. This is Dave Vellante for theCUBE Insights powered by ETR. Thanks for watching, and we'll see you next time. (upbeat music)

Published Date : Jun 18 2022

SUMMARY :

bringing you data driven that the ecosystem excitement here on the show floor. and the action in the ecosystem. Didn't start out that way, you know, One of the big knocks against Snowflake the idea of bring your data of the question, he did answer. is the one that looks like, and from the ecosystem, And so a couple of the So that way, you know, from the Hadoop era is the fact the defacto place as to where

ENTITIES

Entity	Category	Confidence
Frank Slootman	PERSON	0.99+
Frank Slootman	PERSON	0.99+
Doug Henschen	PERSON	0.99+
Stephanie Chan	PERSON	0.99+
Christian Kleinerman	PERSON	0.99+
AWS	ORGANIZATION	0.99+
Dave Vellante	PERSON	0.99+
Rob Hof	PERSON	0.99+
Benoit Dageville	PERSON	0.99+
2014	DATE	0.99+
Matt Sulkins	PERSON	0.99+
JPMC	ORGANIZATION	0.99+
2019	DATE	0.99+
Cheryl Knight	PERSON	0.99+
Palo Alto	LOCATION	0.99+
Denise Persson	PERSON	0.99+
Alex Myerson	PERSON	0.99+
Tony Bear	PERSON	0.99+
Dave Menninger	PERSON	0.99+
Dell	ORGANIZATION	0.99+
July	DATE	0.99+
Geico	ORGANIZATION	0.99+
November	DATE	0.99+
Snowflake	TITLE	0.99+
40%	QUANTITY	0.99+
Oracle	ORGANIZATION	0.99+
App Store	TITLE	0.99+
Capital One	ORGANIZATION	0.99+
second principle	QUANTITY	0.99+
Sanjeev Mohan	PERSON	0.99+
Snowflake	ORGANIZATION	0.99+
1300 native apps	QUANTITY	0.99+
Tony Bear	PERSON	0.99+
David.Vellante@siliconangle.com	OTHER	0.99+
Kristin Martin	PERSON	0.99+
Mongo	ORGANIZATION	0.99+
Databricks	ORGANIZATION	0.99+
Snowflake Summit 2022	EVENT	0.99+
First	QUANTITY	0.99+
two	DATE	0.99+
Python	TITLE	0.99+
10 different tables	QUANTITY	0.99+
Facebook	ORGANIZATION	0.99+
ETR	ORGANIZATION	0.99+
both	QUANTITY	0.99+
Snowflake	EVENT	0.98+
one place	QUANTITY	0.98+
each week	QUANTITY	0.98+
O'Reilly	ORGANIZATION	0.98+
This week	DATE	0.98+
Hadoop World	EVENT	0.98+
this week	DATE	0.98+
Pure	ORGANIZATION	0.98+
about 40 partners	QUANTITY	0.98+
theCUBE	ORGANIZATION	0.98+
last quarter	DATE	0.98+
One	QUANTITY	0.98+
S3	TITLE	0.97+
Hadoop	LOCATION	0.97+
single	QUANTITY	0.97+
Caesars Forum	LOCATION	0.97+
Iceberg	TITLE	0.97+
single source	QUANTITY	0.97+
Silicon	ORGANIZATION	0.97+
Nearly 10,000 people	QUANTITY	0.97+
Apache Iceberg	ORGANIZATION	0.97+

Breaking Analysis: Data Mesh...A New Paradigm for Data Management

from the cube studios in palo alto in boston bringing you data driven insights from the cube and etr this is breaking analysis with dave vellante data mesh is a new way of thinking about how to use data to create organizational value leading edge practitioners are beginning to implement data mesh in earnest and importantly data mesh is not a single tool or a rigid reference architecture if you will rather it's an architectural and organizational model that's really designed to address the shortcomings of decades of data challenges and failures many of which we've talked about on the cube as important by the way it's a new way to think about how to leverage data at scale across an organization and across ecosystems data mesh in our view will become the defining paradigm for the next generation of data excellence hello and welcome to this week's wikibon cube insights powered by etr in this breaking analysis we welcome the founder and creator of data mesh author thought leader technologist jamaak dagani shamak thank you for joining us today good to see you hi dave it's great to be here all right real quick let's talk about what we're going to cover i'll introduce or reintroduce you to jamaac she joined us earlier this year in our cube on cloud program she's the director of emerging tech at dot works north america and a thought leader practitioner software engineer architect and a passionate advocate for decentralized technology solutions and and data architectures and jamaa since we last had you on as a guest which was less than a year ago i think you've written two books in your spare time one on data mesh and another called software architecture the hard parts both published by o'reilly so how are you you've been busy i've been busy yes um good it's been a great year it's been a busy year i'm looking forward to the end of the year and the end of these two books but it's great to be back and um speaking with you well you got to be pleased with the the momentum that data mesh has and let's just jump back to the agenda for a bit and get that out of the way we're going to set the stage by sharing some etr data our partner our data partner on the spending profile and some of the key data sectors and then we're going to review the four key principles of data mesh just it's always worthwhile to sort of set that framework we'll talk a little bit about some of the dependencies and the data flows and we're really going to dig today into principle number three and a bit around the self-service data platforms and to that end we're going to talk about some of the learnings that shamak has captured since she embarked on the datamess journey with her colleagues and her clients and we specifically want to talk about some of the successful models for building the data mesh experience and then we're going to hit on some practical advice and we'll wrap with some thought exercises maybe a little tongue-in-cheek some of the community questions that we get so the first thing i want to do we'll just get this out of the way is introduce the spending climate we use this xy chart to do this we do this all the time it shows the spending profiles and the etr data set for some of the more data related sectors of the ecr etr taxonomy they they dropped their october data last friday so i'm using the july survey here we'll get into the october survey in future weeks but about 1500 respondents i don't see a dramatic change coming in the october survey but the the y-axis is net score or spending momentum the horizontal axis is market share or presence in the data set and that red line that 40 percent anything over that we consider elevated so for the past eight quarters or so we've seen machine learning slash ai rpa containers and cloud is the four areas where cios and technology buyers have shown the highest net scores and as we've said what's so impressive for cloud is it's both pervasive and it shows high velocity from a spending standpoint and we plotted the three other data related areas database edw analytics bi and big data and storage the first two well under the red line are still elevated the storage market continues to kind of plot along and we've we've plotted the outsourced it just to balance it out for context that's an area that's not so hot right now so i just want to point out that these areas ai automation containers and cloud they're all relevant to data and they're fundamental building blocks of data architectures as are the two that are directly related to data database and analytics and of course storage so it just gives you a picture of the spending sector so i wanted to share this slide jamark uh that that we presented in that you presented in your webinar i love this it's a taxonomy put together by matt turk who's a vc and he called this the the mad landscape machine learning and ai and data and jamock the key point here is there's no lack of tooling you've you've made the the data mesh concept sort of tools agnostic it's not like we need more tools to succeed in data mesh right absolutely great i think we have plenty of tools i think what's missing is a meta architecture that defines the landscape in a way that it's in step with organizational growth and then defines that meta architecture in a way that these tools can actually interoperable and to operate and integrate really well like the the clients right now have a lot of challenges in terms of picking the right tool regardless of the technology they go down the path either they have to go in and big you know bite into a big data solution and then try to fit the other integrated solutions around it or as you see go to that menu of large list of applications and spend a lot of time trying to kind of integrate and stitch this tooling together so i'm hoping that data mesh creates that kind of meta architecture for tools to interoperate and plug in and i think our conversation today around self-subjective platform um hopefully eliminate that yeah we'll definitely circle back because that's one of the questions we get all the time from the community okay let's review the four main principles of data mesh for those who might not be familiar with it and those who are it's worth reviewing jamar allow me to introduce them and then we can discuss a bit so a big frustration i hear constantly from practitioners is that the data teams don't have domain context the data team is separated from the lines of business and as a result they have to constantly context switch and as such there's a lack of alignment so principle number one is focused on putting end-to-end data ownership in the hands of the domain or what i would call the business lines the second principle is data as a product which does cause people's brains to hurt sometimes but it's a key component and if you start sort of thinking about it you'll and talking to people who have done it it actually makes a lot of sense and this leads to principle number three which is a self-serve data infrastructure which we're going to drill into quite a bit today and then the question we always get is when we introduce data meshes how to enforce governance in a federated model so let me bring up a more detailed slide jamar with the dependencies and ask you to comment please sure but as you said the the really the root cause we're trying to address is the siloing of the data external to where the action happens where the data gets produced where the data needs to be shared when the data gets used right in the context of the business so it's about the the really the root cause of the centralization gets addressed by distribution of the accountability end to end back to the domains and these domains this distribution of accountability technical accountability to the domains have already happened in the last you know decade or so we saw the transition from you know one general i.t addressing all of the needs of the organization to technology groups within the itu or even outside of the iit aligning themselves to build applications and services that the different business units need so what data mesh does it just extends that model and say okay we're aligning business with the tech and data now right so both application of the data in ml or inside generation in the domains related to the domain's needs as well as sharing the data that the domains are generating with the rest of the organization but the moment you do that then you have to solve other problems that may arise and that you know gives birth to the second principle which is about um data as a product as a way of preventing data siloing happening within the domain so changing the focus of the domains that are now producing data from i'm just going to create that data i collect for myself and that satisfy my needs to in fact the responsibility of domain is to share the data as a product with all of the you know wonderful characteristics that a product has and i think that leads to really interesting architectural and technical implications of what actually constitutes state has a product and we can have a separate conversation but once you do that then that's the point in the conversation that cio says well how do i even manage the cost of operation if i decentralize you know building and sharing data to my technical teams to my application teams do i need to go and hire another hundred data engineers and i think that's the role of a self-serve data platform in a way that it enables and empowers generalist technologies that we already have in the technical domains the the majority population of our developers these days right so the data platform attempts to mobilize the generalist technologies to become data producers to become data consumers and really rethink what tools these people need um and the last last principle so data platform is really to giving autonomy to domain teams and empowering them and reducing the cost of ownership of the data products and finally as you mentioned the question around how do i still assure that these different data products are interoperable are secure you know respecting privacy now in a decentralized fashion right when we are respecting the sovereignty or the domain ownership of um each domain and that leads to uh this idea of both from operational model um you know applying some sort of a federation where the domain owners are accountable for interoperability of their data product they have incentives that are aligned with global harmony of the data mesh as well as from the technology perspective thinking about this data is a product with a new lens with a lens that all of those policies that need to be respected by these data products such as privacy such as confidentiality can we encode these policies as computational executable units and encode them in every data product so that um we get automation we get governance through automation so that's uh those that's the relationship the complex relationship between the four principles yeah thank you for that i mean it's just a couple of points there's so many important points in there but the idea of the silos and the data as a product sort of breaking down those cells because if you have a product and you want to sell more of it you make it discoverable and you know as a p l manager you put it out there you want to share it as opposed to hide it and then you know this idea of managing the cost you know number three where people say well centralize and and you can be more efficient but that but that essentially was the the failure in your other point related point is generalist versus specialist that's kind of one of the failures of hadoop was you had these hyper specialist roles emerge and and so you couldn't scale and so let's talk about the goals of data mesh for a moment you've said that the objective is to extend exchange you call it a new unit of value between data producers and data consumers and that unit of value is a data product and you've stated that a goal is to lower the cognitive load on our brains i love this and simplify the way in which data are presented to both producers and consumers and doing so in a self-serve manner that eliminates the tapping on the shoulders or emails or raising tickets so how you know i'm trying to understand how data should be used etc so please explain why this is so important and how you've seen organizations reduce the friction across the data flows and the interconnectedness of things like data products across the company yeah i mean this is important um as you mentioned you know initially when this whole idea of a data-driven innovation came to exist and we needed all sorts of you know technology stacks we we centralized um creation of the data and usage of the data and that's okay when you first get started with where the expertise and knowledge is not yet diffused and it's only you know the privilege of a very few people in the organization but as we move to a data driven um you know innovation cycle in the organization as we learn how data can unlock new new programs new models of experience new products then it's really really important as you mentioned to get the consumers and producers talk to each other directly without a broker in the middle because even though that having that centralized broker could be a cost-effective model but if you if we include uh the cost of missed opportunity for something that we could have innovated well we missed that opportunity because of months of looking for the right data then that cost parented the cost benefit parameters and formula changes so um so to to have that innovation um really embedded data-driven innovation embedded into every domain every team we need to enable a model where the producer can directly peer-to-peer discover the data uh use it understand it and use it so the litmus test for that would be going from you know a hypothesis that you know as a data scientist i think there is a pattern and there is an insight in um you know in in the customer behavior that if i have access to all of the different informations about the customer all of the different touch points i might be able to discover that pattern and personalize the experience of my customer the liquid stuff is going from that hypothesis to finding all of the different sources be able to understanding and be able to connect them um and then turn them them into you know training of my machine learning and and the rest is i guess known as an intelligent product got it thank you so i i you know a lot of what we do here in breaking it house is we try to curate and then point people to new resources so we will have some additional resources because this this is not superficial uh what you and your colleagues in the community are creating but but so i do want to you know curate some of the other material that you had so if i bring up this next chart the left-hand side is a curated description both sides of your observations of most of the monolithic data platforms they're optimized for control they serve a centralized team that has hyper-specialized roles as we talked about the operational stacks are running running enterprise software they're on kubernetes and the microservices are isolated from let's say the spark clusters you know which are managing the analytical data etc whereas the data mesh proposes much greater autonomy and the management of code and data pipelines and policy as independent entities versus a single unit and you've made this the point that we have to enable generalists to borrow from so many other examples in the in the industry so it's an architecture based on decentralized thinking that can really be applied to any domain really domain agnostic in a way yes and i think if i pick one key point from that diagram is really um or that comparison is the um the the the data platforms or the the platform capabilities need to present a continuous experience from an application developer building an application that generates some data let's say i have an e-commerce application that generates some data to the data product that now presents and shares that data as as temporal immutable facts that can be used for analytics to the data scientist that uses that data to personalize the experience to the deployment of that ml model now back to that e-commerce application so if we really look at this continuous journey um the walls between these separate platforms that we have built needs to come down the platforms underneath that generate you know that support the operational systems versus supported data platforms versus supporting the ml models they need to kind of play really nicely together because as a user i'll probably fall off the cliff every time i go through these stages of this value stream um so then the interoperability of our data solutions and operational solutions need to increase drastically because so far we've got away with running operational systems an application on one end of the organization running and data analytics in another and build a spaghetti pipeline to you know connect them together neither of the ends are happy i hear from data scientists you know data analyst pointing finger at the application developer saying you're not developing your database the right way and application point dipping you're saying my database is for running my application it wasn't designed for sharing analytical data so so we've got to really what data mesh as a mesh tries to do is bring these two world together closer because and then the platform itself has to come closer and turn into a continuous set of you know services and capabilities as opposed to this disjointed big you know isolated stacks very powerful observations there so we want to dig a little bit deeper into the platform uh jamar can have you explain your thinking here because it's everybody always goes to the platform what do i do with the infrastructure what do i do so you've stressed the importance of interfaces the entries to and the exits from the platform and you've said you use a particular parlance to describe it and and this chart kind of shows what you call the planes not layers the planes of the platform it's complicated with a lot of connection points so please explain these planes and how they fit together sure i mean there was a really good point that you started with that um when we think about capabilities or that enables build of application builds of our data products build their analytical solutions usually we jump too quickly to the deep end of the actual implementation of these technologies right do i need to go buy a data catalog or do i need you know some sort of a warehouse storage and what i'm trying to kind of elevate us up and out is to to to force us to think about interfaces and apis the experiences that the platform needs to provide to run this secure safe trustworthy you know performance mesh of data products and if you focus on then the interfaces the implementation underneath can swap out right you can you can swap one for the other over time so that's the purpose of like having those lollipops and focusing and emphasizing okay what is the interface that provides a certain capability like the storage like the data product life cycle management and so on the purpose of the planes the mesh experience playing data product expense utility plan is really giving us a language to classify different set of interfaces and capabilities that play nicely together to provide that cohesive journey of a data product developer data consumer so then the three planes are really around okay at the bottom layer we have a lot of utilities we have that mad mac turks you know kind of mad data tooling chart so we have a lot of utilities right now they they manage workflow management you know they they do um data processing you've got your spark link you've got your storage you've got your lake storage you've got your um time series of storage you've got a lot of tooling at that level but the layer that we kind of need to imagine and build today we don't buy yet as as long as i know is this linger that allows us to uh exchange that um unit of value right to build and manage these data products so so the language and the apis and interface of this product data product experience plan is not oh i need this storage or i need that you know workflow processing is that i have a data product it needs to deliver certain types of data so i need to be able to model my data it needs to as part of this data product i need to write some processing code that keeps this data constantly alive because it's receiving you know upstream let's say user interactions with a website and generating the profile of my user so i need to be able to to write that i need to serve the data i need to keep the data alive and i need to provide a set of slos and guarantees for my data so that good documentation so that some you know someone who comes to data product knows but what's the cadence of refresh what's the retention of the data and a lot of other slos that i need to provide and finally i need to be able to enforce and guarantee certain policies in terms of access control privacy encryption and so on so as a data product developer i just work with this unit a complete autonomous self-contained unit um and the platform should give me ways of provisioning this unit and testing this unit and so on that's why kind of i emphasize on the experience and of course we're not dealing with one or two data product we're dealing with a mesh of data products so at the kind of mesh level experience we need a set of capabilities and interfaces to be able to search the mesh for the right data to be able to explore the knowledge graph that emerges from this interconnection of data products need to be able to observe the mesh for any anomalies did we create one of these giant master data products that all the data goes into and all the data comes out of how we found ourselves the bottlenecks to be able to kind of do those level machine level capabilities we need to have a certain level of apis and interfaces and once we decide and decide what constitutes that to satisfy this mesh experience then we can step back and say okay now what sort of a tool do i need to build or buy to satisfy them and that's that is not what the data community or data part of our organizations used to i think traditionally we're very comfortable with buying a tool and then changing the way we work to serve to serve the tool and this is slightly inverse to that model that we might be comfortable with right and pragmatists will will to tell you people who've implemented data match they'll tell you they spent a lot of time on figuring out data as a product and the definitions there the organizational the getting getting domain experts to actually own the data and and that's and and they will tell you look the technology will come and go and so to your point if you have those lollipops and those interfaces you'll be able to evolve because we know one thing's for sure in this business technology is going to change um so you you had some practical advice um and i wanted to discuss that for those that are thinking about data mesh i scraped this slide from your presentation that you made and and by the way we'll put links in there your colleague emily who i believe is a data scientist had some really great points there as well that that practitioners should dig into but you made a couple of points that i'd like you to summarize and to me that you know the big takeaway was it's not a one and done this is not a 60-day project it's a it's a journey and i know that's kind of cliche but it's so very true here yes um this was a few starting points for um people who are embarking on building or buying the platform that enables the people enables the mesh creation so it was it was a bit of a focus on kind of the platform angle and i think the first one is what we just discussed you know instead of thinking about mechanisms that you're building think about the experiences that you're enabling uh identify who are the people like what are the what is the persona of data scientists i mean data scientist has a wide range of personas or did a product developer the same what is the persona i need to develop today or enable empower today what skill sets do they have and and so think about experience as mechanisms i think we are at this really magical point i mean how many times in our lifetime we come across a complete blanks you know kind of white space to a degree to innovate so so let's take that opportunity and use a bit of a creativity while being pragmatic of course we need solutions today or yesterday but but still think about the experiences not not mechanisms that you need to buy so that was kind of the first step and and the nice thing about that is that there is an evolutionary there is an iterative path to maturity of your data mesh i mean if you start with thinking about okay which are the initial use cases i need to enable what are the data products that those use cases depend on that we need to unlock and what is the persona of my or general skill set of my data product developer what are the interfaces i need to enable you can start with the simplest possible platform for your first two use cases and then think about okay the next set of data you know data developers they have a different set of needs maybe today i just enable the sql-like querying of the data tomorrow i enable the data scientists file based access of the data the day after i enable the streaming aspect so so have this evolutionary kind of path ahead of you and don't think that you have to start with building out everything i mean one of the things we've done is taking this harvesting approach that we work collaboratively with those technical cross-functional domains that are building the data products and see how they are using those utilities and harvesting what they are building as the solutions for themselves back into the back into the platform but at the end of the day we have to think about mobilization of the large you know largest population of technologies we have we'd have to think about diffusing the technology and making it available and accessible by the generous technologies that you know and we've come a long way like we've we've gone through these sort of paradigm shifts in terms of mobile development in terms of functional programming in terms of cloud operation it's not that we are we're struggling with learning something new but we have to learn something that works nicely with the rest of the tooling that we have in our you know toolbox right now so so again put that generalist as the uh as one of your center personas not the only person of course we will have specialists of course we will always have data scientists specialists but any problem that can be solved as a general kind of engineering problem and i think there's a lot of aspects of data michigan that can be just a simple engineering problem um let's just approach it that way and then create the tooling um to empower those journalists great thank you so listen i've i've been around a long time and so as an analyst i've seen many waves and we we often say language matters um and so i mean i've seen it with the mainframe language it was different than the pc language it's different than internet different than cloud different than big data et cetera et cetera and so we have to evolve our language and so i was going to throw a couple things out here i often say data is not the new oil because because data doesn't live by the laws of scarcity we're not running out of data but i get the analogy it's powerful it powered the industrial economy but it's it's it's bigger than that what do you what do you feel what do you think when you hear the data is the new oil yeah i don't respond to those data as the gold or oil or whatever scarce resource because as you said it evokes a very different emotion it doesn't evoke the emotion of i want to use this i want to utilize it feels like i need to kind of hide it and collect it and keep it to myself and not share it with anyone it doesn't evoke that emotion of sharing i really do think that data and i with it with a little asterisk and i think the definition of data changes and that's why i keep using the language of data product or data quantum data becomes the um the most important essential element of existence of uh computation what do i mean by that i mean that you know a lot of applications that we have written so far are based on logic imperative logic if this happens do that and else do the other and we're moving to a world where those applications generating data that we then look at and and the data that's generated becomes the source the patterns that we can exploit to build our applications as in you know um curate the weekly playlist for dave every monday based on what he has listened to and the you know other people has listened to based on his you know profile so so we're moving to the world that is not so much about applications using the data necessarily to run their businesses that data is really truly is the foundational building block for the applications of the future and then i think in that we need to rethink the definition of the data and maybe that's for a different conversation but that's that's i really think we have to converge the the processing that the data together the substance substance and the processing together to have a unit that is uh composable reusable trustworthy and that's that's the idea behind the kind of data product as an atomic unit of um what we build from future solutions got it now something else that that i heard you say or read that really struck me because it's another sort of often stated phrase which is data is you know our most valuable asset and and you push back a little bit on that um when you hear people call data and asset people people said often have said they think data should be or will eventually be listed as an asset on the balance sheet and i i in hearing what you said i thought about that i said well you know maybe data as a product that's an income statement thing that's generating revenue or it's cutting costs it's not necessarily because i don't share my my assets with people i don't make them discoverable add some color to this discussion i think so i think it's it's actually interesting you mentioned that because i read the new policy in china that cfos actually have a line item around the data that they capture we don't have to go to the political conversation around authoritarian of um collecting data and the power that that creates and the society that leads to but that aside that big conversation little conversation aside i think you're right i mean the data as an asset generates a different behavior it's um it creates different performance metrics that we would measure i mean before conversation around data mesh came to you know kind of exist we were measuring the success of our data teams by the terabytes of data they were collecting by the thousands of tables that they had you know stamped as golden data none of that leads to necessarily there's no direct line i can see between that and actually the value that data generated but if we invert that so that's why i think it's rather harmful because it leads to the wrong measures metrics to measure for success so if you invert that to a bit of a product thinking or something that you share to delight the experience of users your measures are very different your measures are the the happiness of the user they decrease lead time for them to actually use and get value out of it they're um you know the growth of the population of the users so it evokes a very different uh kind of behavior and success metrics i do say if if i may that i probably come back and regret the choice of word around product one day because of the monetization aspect of it but maybe there is a better word to use but but that's the best i think we can use at this point in time why do you say that jamar because it's too directly related to monetization that has a negative connotation or it might might not apply in things like healthcare or you know i think because if we want to take your shortcuts and i remember this conversation years back that people think that the reason to you know kind of collect data or have data so that we can sell it you know it's just the monetization of the data and we have this idea of the data market places and so on and i think that is actually the least valuable um you know outcome that we can get from thinking about data as a product that direct cell an exchange of data as a monetary you know exchange of value so so i think that might redirect our attention to something that really matters which is um enabling using data for generating ultimately value for people for the customers for the organizations for the partners as opposed to thinking about it as a unit of exchange for for money i love data as a product i think you were your instinct was was right on and i think i'm glad you brought that up because because i think people misunderstood you know in the last decade data as selling data directly but you really what you're talking about is using data as a you know ingredient to actually build a product that has value and value either generate revenue cut costs or help with a mission like it could be saving lives but in some way for a commercial company it's about the bottom line and that's just the way it is so i i love data as a product i think it's going to stick so one of the other things that struck me in one of your webinars was one of the q a one of the questions was can i finally get rid of my data warehouse so i want to talk about the data warehouse the data lake jpmc used that term the data lake which some people don't like i know john furrier my business partner doesn't like that term but the data hub and one of the things i've learned from sort of observing your work is that whether it's a data lake a data warehouse data hub data whatever it's it should be a discoverable node on the mesh it really doesn't matter the the technology what are your your thoughts on that yeah i think the the really shift is from a centralized data warehouse to data warehouse where it fits so i think if you just cross that centralized piece uh we are all in agreement that data warehousing provides you know interesting and capable interesting capabilities that are still required perhaps as a edge node of the mesh that is optimizing for certain queries let's say financial reporting and we still want to direct a fair bit of data into a node that is just for those financial reportings and it requires the precision and the um you know the speed of um operation that the warehouse technology provides so i think um definitely that technology has a place where it falls apart is when you want to have a warehouse to rule you know all of your data and model canonically model your data because um it you have to put so much energy into you know kind of try to harness this model and create this very complex the complex and fragile snowflake schemas and so on that that's all you do you spend energy against the entropy of your organization to try to get your arms around this model and the model is constantly out of step with what's happening in reality because reality the model the reality of the business is moving faster than our ability to model everything into into uh into one you know canonical representation i think that's the one we need to you know challenge not necessarily application of data warehousing on a node i want to close by coming back to the issues of standards um you've specifically envisioned data mesh to be technology agnostic as i said before and of course everyone myself included we're going to run a vendor's technology platform through a data mesh filter the reality is per the matt turc chart we showed earlier there are lots of technologies that that can be nodes within the data mesh or facilitate data sharing or governance etc but there's clearly a lack of standardization i'm sometimes skeptical that the vendor community will drive this but maybe like you know kubernetes you know google or some other internet giant is going to contribute something to open source that addresses this problem but talk a little bit more about your thoughts on standardization what kinds of standards are needed and where do you think they'll come from sure i mean the you write that the vendors are not today incentivized to create those open standards because majority of the vet not all of them but some vendors operational model is about bring your data to my platform and then bring your computation to me uh and all will be great and and that will be great for a portion of the clients and portion of environments where that complexity we're talking about doesn't exist so so we need yes other players perhaps maybe um some of the cloud providers or people that are more incentivized to open um open their platform in a way for data sharing so as a starting point i think standardization around data sharing so if you look at the spectrum right now we have um a de facto sound it's not even a standard for something like sql i mean everybody's bastardized to call and extended it with so many things that i don't even know what this standard sql is anymore but we have that for some form of a querying but beyond that i know for example folks at databricks to start to create some standards around delta sharing and sharing the data in different models so i think data sharing as a concept the same way that apis were about capability sharing so we need to have the data apis or analytical data apis and data sharing extended to go beyond simply sql or languages like that i think we need standards around computational prior policies so this is again something that is formulating in the operational world we have a few standards around how do you articulate access control how do you identify the agents who are trying to access with different authentication mechanism we need to bring some of those our ad our own you know our data specific um articulation of policies uh some something as simple as uh identity management across different technologies it's non-existent so if you want to secure your data across three different technologies there is no common way of saying who's the agent that is acting uh to act to to access the data can i authenticate and authorize them so so those are some of the very basic building blocks and then the gravy on top would be new standards around enriched kind of semantic modeling of the data so we have a common language to describe the semantic of the data in different nodes and then relationship between them we have prior work with rdf and folks that were focused on i guess linking data across the web with the um kind of the data web i guess work that we had in the past we need to revisit those and see their practicality in the enterprise con context so so data modeling a rich language for data semantic modeling and data connectivity most importantly i think those are some of the items on my wish list that's good well we'll do our part to try to keep the standards you know push that push that uh uh movement jamaica we're going to leave it there i'm so grateful to have you uh come on to the cube really appreciate your time it's just always a pleasure you're such a clear thinker so thanks again thank you dave that's it's wonderful to be here now we're going to post a number of links to some of the great work that jamark and her team and her books and so you check that out because we remember we publish each week on siliconangle.com and wikibon.com and these episodes are all available as podcasts wherever you listen listen to just search breaking analysis podcast don't forget to check out etr.plus for all the survey data do keep in touch i'm at d vallante follow jamac d z h a m a k d or you can email me at david.velante at siliconangle.com comment on the linkedin post this is dave vellante for the cube insights powered by etrbwell and we'll see you next time you

Published Date : Oct 25 2021

SUMMARY :

all of the you know wonderful

ENTITIES

Entity	Category	Confidence
60-day	QUANTITY	0.99+
one	QUANTITY	0.99+
40 percent	QUANTITY	0.99+
matt turk	PERSON	0.99+
two books	QUANTITY	0.99+
china	LOCATION	0.99+
thousands of tables	QUANTITY	0.99+
dave vellante	PERSON	0.99+
jamaac	PERSON	0.99+
google	ORGANIZATION	0.99+
siliconangle.com	OTHER	0.99+
tomorrow	DATE	0.99+
yesterday	DATE	0.99+
october	DATE	0.99+
boston	LOCATION	0.99+
first step	QUANTITY	0.98+
jamar	PERSON	0.98+
today	DATE	0.98+
jamaica	PERSON	0.98+
both sides	QUANTITY	0.98+
shamak	PERSON	0.98+
dave	PERSON	0.98+
jamark	PERSON	0.98+
first one	QUANTITY	0.98+
o'reilly	ORGANIZATION	0.98+
both	QUANTITY	0.97+
each week	QUANTITY	0.97+
john furrier	PERSON	0.97+
second principle	QUANTITY	0.97+
jamaak dagani shamak	PERSON	0.96+
less than a year ago	DATE	0.96+
earlier this year	DATE	0.96+
three different technologies	QUANTITY	0.96+
jamaa	PERSON	0.95+
each domain	QUANTITY	0.95+
terabytes of data	QUANTITY	0.94+
three planes	QUANTITY	0.94+
july	DATE	0.94+
last decade	DATE	0.93+
about 1500 respondents	QUANTITY	0.93+
decades	QUANTITY	0.93+
first	QUANTITY	0.93+
first two	QUANTITY	0.93+
dot works	ORGANIZATION	0.93+
one key point	QUANTITY	0.93+
first two use cases	QUANTITY	0.92+
last friday	DATE	0.92+
this week	DATE	0.92+
two	QUANTITY	0.92+
three other	QUANTITY	0.92+
ndor	ORGANIZATION	0.92+
first thing	QUANTITY	0.9+
two data	QUANTITY	0.9+
lake	ORGANIZATION	0.89+
four areas	QUANTITY	0.88+
single tool	QUANTITY	0.88+
north america	LOCATION	0.88+
single unit	QUANTITY	0.87+
jamac	PERSON	0.86+
one of	QUANTITY	0.85+
things	QUANTITY	0.85+
david.velante	OTHER	0.83+
past eight quarters	DATE	0.83+
four principles	QUANTITY	0.82+
dave	ORGANIZATION	0.82+
a lot of applications	QUANTITY	0.81+
four main principles	QUANTITY	0.8+
sql	TITLE	0.8+
palo alto	ORGANIZATION	0.8+
emily	PERSON	0.8+
d vallante	PERSON	0.8+

Rob Thomas, IBM | IBM Think 2020

>>From the cube studios in Palo Alto in Boston. It's the cube covering the IBM thing brought to you by IBM. We're back and this is Dave Vellante and you're watching the cube and we're covering wall-to-wall the IBM 2020 I think digital experience. Rob Thomas is here. He's the senior vice president of clouds and data. Right. Warm rub. Always a pleasure to see you. I wish you were face to face, but Hey, we're doing the best we can. As you say, doing the best we can. Great to see you Dave. Hope family safe, healthy, happy as best you can be. Yeah. Ditto. You back out your Robin. Congratulations on on the new role, you and the cube. We've been riding this data wave for quite some time now. It's really been incredible. It really is. And last year I talked to you about how clients, we're slowly making progress on data strategy, starting to experiment with AI. >>We've gotten to the point now where I'd say it's game on for AI, which is exciting to see and that's a lot of what the theme of this year's think is about. Yeah, and I definitely want to dig into that, but I want to start by asking you sort of moves that you saw you're in there seeing your clients make with regard to the cobot night covert 19 crisis. Maybe how you guys are helping them in very interested in what you see as sort of longterm and even, you know, quasi permanent as a result of this. I would first say it this way. I don't, I'm not sure the crisis is going to change businesses as much as it's going to be accelerating. What would have happened anyway, regardless of the industry that you're in. We see clients aggressively looking at how do we get the digital faster? >>How do we automate more than we ever have before? There's the obvious things like business resiliency and business continuity, managing the distributed workforce. So to me, what we've seen is really about, and acceleration, not necessarily in a different direction, but an acceleration on. The thing is that that we're already kind of in the back of their minds or in the back of their plans now that as we'll come to the forefront and I'm encouraged because we see clients moving at a rate and pace that we'd never seen before that's ultimately going to be great for them, great for their businesses. And so I'm really happy to see that you guys have used Watson to really try to get, you know, some good high fidelity answers to the citizens. I wonder if you could explain that initiative. Well, we've had this application called Watson assistant for the last few years and we've been supporting banks, airlines, retailers, companies across all industries and helping them better interact with our customers and in some cases, employees. >>We took that same technology and as we saw the whole covert 19 situation coming, we said, Hey, we can evolve Watson assistant to serve citizens. And so it started by, we started training the models, which are intent based models in Watson assistant on all the publicly available data from the CDC as an example. And we've been able to build a really powerful virtual agent to serve really any citizen that has questions about and what they should be doing. And the response has been amazing. I mean, in the last two weeks we've gone live with 20 organizations, many of which are state and local governments. Okay. Also businesses, the city of Austin children's healthcare of Atlanta. Mmm. They local governments in Spain and Greece all over the world. And in some instances these clients have gotten live in less than 24 hours. Meaning they have a virtual agent that can answer any question. >>They can do that in less than 24 hours. It's actually been amazing to see. So proud of the team that built this over time. And it was kind of proof of the power of technology when we're dealing with any type of a challenge. You know, I had a conversation earlier with Jamie Thomas about quantum and was asking her sort of how your clients are using it. The examples that came up were financial institutions, pharmaceutical know battery manufacturers, um, airlines. And so it strikes me when you think about uh, machine intelligence and AI, the type of AI that you're yeah, at IBM is not consumer oriented AI. It's really designed for businesses. And I wonder if you could sort of add some color to that. Yeah, let's distinguish the difference there. Cause I think you've said it well consumer AI is smart speakers things in our home, you know, music recommendations, photo analysis and that's great and it enriches all of our personal lives. >>AI for business is very different. This is about how do you make better predictions, how do you optimize business processes, how do you automate things that maybe your employees don't want to do in the first time? Our focus in IBM as part of, we've been doing with Watson is really anchoring on three aspects of AI language. So understanding language because the whole business world is about communication of language, trust meaning trusted AI. You understand the models, you understand the data. And then third automation and the whole focus of what we're doing here in the virtual think experience. It's focused on AI for automation. Whether that's automating business processes or the new announcement this week, which is around automating AI opera it operations for a CIO. You, you've talked the years about this notion of an AI ladder. You actually, I actually wrote a book on it and uh, but, but it's been hard for customers to operationalize AI. >>Mmm. We talked about this last year. Thanks. What kind of progress, uh, have we made in the last 12 months? There's been a real recognition of this notion that your AI is only as good as your data. And we use the phrase, there's no AI without IAA, meaning information architecture, it's all the same concept, which is that your data, it has to be ready for AI if you want to too get successful outcomes with AI and the steps of those ladders around how you collect data, how you organize data, how you analyze data, how you infuse that into your business processes. seeing major leaps forward in the last nine months where organizations are understanding that connection and then they're using that to really drive initiatives around AI. So let's talk about that a little bit more. This notion of AI ops, I mean it's essentially the take the concept of dev ops and apply it to the data pipeline if you will. >>Everybody, you know, complains, you know, data scientists complained that all, they spent all their time wrangling data, improving data quality, they don't have line of sight across their organization with regard to other data specialists, whether it's data engineers or even developers. Maybe you could talk a little bit more about that announcement and sort of what you're doing in that area. Sure. So right. Let me put a number on it because the numbers are amazing. Every year organizations lose 2016 point $5 billion of revenue because of outages in it system. That is a staggering number when you think about it. And so then you say, okay, so how do you break down and attack that problem? Well, do you have to get better at fixing problems or you have to get better at avoiding problems altogether. And as you may expect, a little bit of both. You, you want to avoid problems obviously, but in an uncertain world, you're always going to deal with unforeseen challenges. >>So the also the question becomes how fast can you respond and there's no better use of AI. And then to do, I hope you like those tasks, which is understanding your environment, understanding what the systems are saying through their data and identifying issues become before they become outages. And once there is an outage, how do you quickly triage data across all your systems to figure out where is the problem and how you can quickly address it. So we are announcing Watson AI ops, which is the nervous system for a CIO, the manager, all of their systems. What we do is we just collect data, log data from every source system and we build a semantic layer on top that. So Watson understands the systems, understands the normal behavior, understands the acceptable ranges, and then anytime something's not going like it should, Watson raises his hand and says, Hey, you should probably look at this before it becomes a problem. >>We've partnered with companies like Slack, so the UI for Watson AI ops, it's actually in Slack so that companies can use and employees can use a common collaboration tool too. Troubleshoot or look at either systems. It's, it's really powerful. So that we're really proud of. Well I just kind of leads me to my next question, which I mean, IBM got the religion 20 years ago on openness. I mean I can trace it back to the investment you made and Lennox way back when. Um, and of course it's a huge investment last year in red hat, but you know, open source company. So you just mentioned Slack. Talk about open ecosystems and how that it fits into your AI and data strategy. Well, if you think about it, if we're going to take on a challenge this grand, which is AI for all of your it by definition you're going to be dealing with full ecosystem of different providers because every organization has a broad set of capabilities we identified early on. >>That means that our ability to provide open ecosystem interoperability was going to be critical. So we're launching this product with Slack. I mentioned with box, we've got integrations into things like PagerDuty service now really all of the tools of modern it architecture where we can understand the data and help clients better manage those environments. So this is all about an open ecosystem and that's how we've been approaching it. Let's start, it's really about data, applying machine intelligence or AI to that data and about cloud for scale. So I wonder what you're seeing just in terms of that sort of innovation engine. I mean obviously it's gotta be secure. It's, it seems like those are the pillars of innovation for the next 10 plus years. I think you're right. And I would say this whole situation that we're dealing with has emphasized the importance of hybrid deployment because companies have it capabilities on public clouds, on private clouds, really everywhere. >>And so being able to operate that as a single architecture, it's becoming very important. You can use AI to automate tasks across that whole infrastructure that makes a big difference. And to your point, I think we're going to see a massive acceleration hybrid cloud deployments using AI. And this will be a catalyst for that. And so that's something we're trying to help clients with all around the world. You know, you wrote in your book that O'Reilly published that AI is the new electricity and you talked about problems. Okay. Not enough data. If your data is you know, on prem and you're only in the cloud, well that's a problem or too much data. How you deal with all that data, data quality. So maybe we could close on some of the things that you know, you, you talked about in that book, you know, maybe how people can get ahold of it or any other, you know, so the actions you think people should take to get smart on this topic. >>Yeah, so look, really, really excited about this. Paul's the capitalists, a friend of mine and a colleague, we've published this book working with a Riley called the a ladder and it's all the concepts we talked about in terms of how companies can climb this ladder to AI. And we go through a lot of different use cases, scenarios, I think. Yeah. Anybody reading this is going to see their company in one of these examples, our whole ambition was to hopefully plant some seeds of ideas for how you can start to accelerate your journey to AI in any industry right now. Well, Rob, it's always great having you on the cube, uh, your insights over the years and you've been a good friend of ours, so really appreciate you coming on and, uh, and best of luck to you, your family or wider community. I really appreciate it. Thanks Dave. Great to be here and again, wish you and the whole cube team the best and to all of our clients out there around the world. We wish you the best as well. All right. You're watching the cubes coverage of IBM think 20, 20 digital, the vent. We'll be right back right after this short break. This is Dave Volante.

Published Date : May 7 2020

SUMMARY :

the IBM thing brought to you by IBM. and I definitely want to dig into that, but I want to start by asking you sort of moves that you saw you're happy to see that you guys have used Watson to really try to get, you know, I mean, in the last two weeks we've gone live with 20 And I wonder if you could sort of add some color to that. business processes, how do you automate things that maybe your employees don't dev ops and apply it to the data pipeline if you will. And so then you say, okay, so how do you break down and attack that problem? And then to do, I hope you like those tasks, which is understanding and of course it's a huge investment last year in red hat, but you know, open source company. And I would say this whole So maybe we could close on some of the things that you know, you, you talked about in that book, Great to be here and again, wish you and the whole cube team the best and to all

ENTITIES

Entity	Category	Confidence
IBM	ORGANIZATION	0.99+
Spain	LOCATION	0.99+
Dave	PERSON	0.99+
Dave Vellante	PERSON	0.99+
Rob Thomas	PERSON	0.99+
Greece	LOCATION	0.99+
20 organizations	QUANTITY	0.99+
Palo Alto	LOCATION	0.99+
2016	DATE	0.99+
Dave Volante	PERSON	0.99+
Rob	PERSON	0.99+
Jamie Thomas	PERSON	0.99+
Ditto	PERSON	0.99+
$5 billion	QUANTITY	0.99+
Robin	PERSON	0.99+
Atlanta	LOCATION	0.99+
last year	DATE	0.99+
last year	DATE	0.99+
less than 24 hours	QUANTITY	0.99+
Austin	LOCATION	0.99+
Lennox	ORGANIZATION	0.99+
Paul	PERSON	0.98+
Watson assistant	TITLE	0.98+
first	QUANTITY	0.98+
both	QUANTITY	0.98+
third	QUANTITY	0.98+
Boston	LOCATION	0.97+
first time	QUANTITY	0.96+
think 20	COMMERCIAL_ITEM	0.96+
this week	DATE	0.96+
20 years ago	DATE	0.96+
CDC	ORGANIZATION	0.95+
last two weeks	DATE	0.93+
one	QUANTITY	0.92+
this year	DATE	0.92+
Slack	ORGANIZATION	0.91+
last 12 months	DATE	0.91+
last nine months	DATE	0.9+
19	QUANTITY	0.89+
Slack	TITLE	0.88+
20	COMMERCIAL_ITEM	0.88+
last few years	DATE	0.88+
Watson	PERSON	0.87+
Watson	TITLE	0.86+
Think 2020	COMMERCIAL_ITEM	0.85+
single architecture	QUANTITY	0.82+
next 10 plus years	DATE	0.8+
PagerDuty	ORGANIZATION	0.71+
three aspects	QUANTITY	0.71+
O'Reilly	ORGANIZATION	0.69+
Riley	ORGANIZATION	0.62+
19 crisis	EVENT	0.58+
Watson	ORGANIZATION	0.45+
covert	EVENT	0.41+
2020	COMMERCIAL_ITEM	0.37+
cobot night	TITLE	0.3+

Ken O’Reilly, Cisco Stealthwatch | Cisco Live US 2019

>> Narrator: Live from San Diego, California it's theCUBE covering Cisco Live, US, 2019. Brought to you by Cisco and its eco system partners. >> Welcome back to San Diego everybody. This is theCUBE the leader in live tech coverage, My name is Dave Vellante, Stu Miniman is here, Lisa Martin as well but we've got a very special guest now Ken O'Reilly my good friend is here. He's the director of customer experience for Cisco Stealthwatch. Kenny great to see you thanks for coming on. >> Well, thanks for having me, Dave. Good seeing you as well. >> Yes so customer experience, people think about customer experience and security it's not always great right? It's a challenging environment they're constantly sort of chasing their tails it's like the arms race with the bad guy so what is customer experience all about in the context of security? >> So our number one goal for our security customers is to accelerate their value realization so our challenge is to make sure that they get the value out of the product that they're buying because every minute of every day the bad guys are trying to get their assets and their IP and when they buy a technology the quicker you can get it up and running and protect the better it is for our customer. >> So how do you measure like value? It's like reducing the amount of data that you're exposed to losing? Is it increasing the cost of the bad guys getting in? 'cause if I'm a bad guy and it costs me more to get in I would maybe go somewhere else, how do you measure that? >> Right so, you're right, so our whole product strategy is to increase the cost for the bad guy to get the IP or the assets and so for us we have to understand what the value proposition is for our product so that the customers can realize that value, so whether it's tryna help them with the use cases or operationalize the product or in our case what we try to do we have both network users and security users we try to get both groups to adopt the technology and then expand it from there, operation centers to the guys that are doing the thread hunting to the investigations et cetera. So that's how we sort of gauge the value is the number of people that are using the technology and the number of use cases that are actually implemented. >> So we've been talking about security all week Stealthwatch obviously you know one of the flagship products Cisco security business grew 21% last quarter so that's kind of an interesting stat services is 25% of the companies revenue so you're the intersection of two pretty important places for Cisco so specifically when you come into a customer engagement who are you engaging with is it a multidisciplinary are you primarily dealing with the SecOps group or do you touch other parts of the organization? >> Yeah, so typically when a company's looking, it's usually they're looking for network visibility so we're dealing with the network architecture teams and they typically bring in the security architects 'cause today they're working hand in hand, and then from there that's where we say preach the gospel of Stealthwatch we always say you can never have enough Stealthwatch okay? Because you can never have enough visibility 'cause once you turn the lights on and they can see what's going on in their network it's very illuminating for them and then they realize the challenges that they have and what they have to do to protect their assets. >> Yeah I joked at Google Cloud Next it's like the cockroaches all scrambling you know for the corners when you turn the lights on and Stealthwatch at its core is you don't need a lot of fancy AI even though you can apply fancy AI but you start with the basics right? What do ya got, where are the gaps okay, so now once it's exposed what do you do with that information is the customer experience group come in and help implement it faster? That's part of the value so time to value to that? >> So time to value with our experts of course we understand the space we understand our product we understand the challenge and of course our network and security customers are overwhelmed you know the stat that they throw out there is that our large customers have anywhere from 50-100 security products so how do you stand out? So as a vendor our number one goal is to build that relationship with the customer to become the trusted security advisor so we know better than anybody how to get that value how to get it quickly and you know the number one problem that they have Dave is how to operationalize all these tools 'cause Stealthwatch sits in the middle we're a big integration platform we take data, telemetry, NetFlow from a lot of different products and we bring that data together to figure out, to help that customer figure out how to make sense of it update their policies create better policies and really tighten up their security posture. >> Okay so they might like to reduce the number of tools but they really can't right? 'cause their using 'em and so what you do is you bring in a layer to help manage that. >> Absolutely. >> But you're also solving a problem just in terms of exposing gaps and then do you also have tooling to fill those gaps? Or is that partners tools is that Stealthwatch? >> So we have our own what we call integration platform where we have a platform that helps integrate other, not only other Cisco security technologies into our platform but other security technologies as well outside of Cisco so you know it's a platform that we've built it's part of our customer experience sort of tool set but it's a tool set unlike anybody else ever has so that along with what we do with the DevNet group we've built our own set of API's to integrate in with the product API's so we can pump data out to data lakes we can pump data out to SIMS like Splunk and some of the others so you know that's where we are we're a solutions group that's what we do we work on the solutions, long term value you know we work on the lifecycle sort of value chain with customers. We're there with 'em the whole time you know our goal; retention, we want them to renew which means they're investing in us again and of course as Cloud, as their infrastructure is moving the the Cloud and our technologies are moving to the Cloud we have to be there to help them get through all those technology challenges. >> So the pricing model is a subscription model is that right? >> Yeah. >> Or can be or? >> Yes, well we call it term all right? But it's essentially subscription we have switched over the last 18 months from a perm to a term based model. >> Which I mean Chuck Robbins in the conference calls in the earnings calls talks about the importance of you know increasingly having a rateable model and recognizing subscription, so when you say a term so I got to what, sign up for a year, two years, three years or something like that? >> We like three yep. >> So who doesn't right? Okay so you sign up for three years but the price book says monthly I'm sure right so you (laughs) make it look smaller, but it makes sense though because you're not going to start stop, start stop with your security, you really want to get success out of it so you got to have some kind of commitment, let's talk a little bit more about the analytics side of it and how you're applying machine intelligence I mean there's always been some form of analytics largely for reporting and things of that nature but now it's getting more automated so take us on that analytics journey Stealthwatch has been around for what five years? >> 15 yeah over 15 years. >> 15? >> Ken: Yes, yes, yes. >> Oh wow maybe I just found out about it five years ago. >> (laughs) right yeah, not but I mean-- >> Dave: Take us back five years. >> Five years? So the big thing for us in the data that we collect is context. Right so you've talked to TK about the more context you can add to that data the better you are at analyzing that data so for us that's one of the things that we do we add a lot of context to that data through ICE so identity information, what kind of assets they are and that's where we get to through our tools add more context so that our analytical engines so like the cognitive thread analytics, the encrypted thread analytics that we have, that they're able to analyze that data a lot better and that's what we've been doing now for the past three plus years since we were acquired by Cisco is to find a way to add more context to the data so that helps our analytics become much more effective. >> And you can interact with through API's say for instance Splunk you mentioned that so you got that data that you can operate on do you see a point where the machines are actually going to plug the holes? I mean are we on the cusp of that? In other words you see a gap >> Right. >> Dave: Today a human has to take action correct? >> Yes, right, right, right. >> Do you see a point maybe it's two, three, five 10 years but are we going to get to that point? >> I think so down the line I mean because we've seen as we've been able to get better visibility and better context about that data we can make better decisions through the machine all right? So it doesn't take an army of people to read the matrix right, we're getting better at you know synthesizing that matrix down you take our network segmentation capabilities that we've built as part of the Stealthwatch customer experience team we can get to well over 90% identification of the assets on the network which is a lot better than anybody else in the industry all right? So we're getting there and through sort of the final stages of reading that metrics, reading the matrix we're getting to the point where we understand a lot more what's on peoples networks what those assets are. >> So as a security practitioner how do you think we're doing as an industry? I mean I used to go back every year and say okay how much was spent on security? are we more secure, less secure? And it felt like you know as data grew it felt like we were getting more and more and more exposed you've seen the stats where when a company gets infiltrated it takes on average you know 250 days for them to realize they've been infiltrated is that changing, are we getting better as an industry? >> I think in Cisco we are because of the products that we have in that integrated architecture so when we first joined three years ago that was the drum beat and now today we integrate with ICE we're going to integrate with next generation firewall through the integration of the sort of analytics that we've got in the Cloud that's happening right? And we're trying to integrate with other products but you know you go down on the floor and you see the number of point products that is a nightmare for our customers so for us through the customer experience in our organization we're there to take that complexity out and bring all of those technologies together and when you get to that point then you're really making progress with a customer, a customer that's got 50-100 products in the mix that's a recipe for disaster and if it's still like that five years from now customers are still going to be challenged. >> So a big part of your customer experience mission is simplification, speed time, time to value. >> Yes. >> Raise the cost to the bad guys and then do it all over again. >> Yeah, yeah it's just rinse and repeat and that's a life cycle journey and that's what we take our customers through right. >> Now I noticed you have on your phone you got the Bruins logo. >> That's right, right here proud. >> So big game tomorrow any predictions? >> 4-3 in overtime Bruins. >> Oh my God I don't think my heart could take that. >> Could you not take that Dave? It's going to be an overtime game. >> Well it's you know it's rare to have a game seven in any, at the very final one, a lot of game sevens but not to win it all I think the last time at Boston was 1984. >> Ken: Is that right? >> Yeah it's been a long time, so you know I'm excited. >> I know you are (laughs) that's right. >> Warriors fans too we got that thing going out I mean I don't know for all you hoop fans out there so, >> Hopefully there's a game seven for that as well. >> Yeah let's go right, why not? >> Why not, game seven all round. >> All right so Chara is going to play with his broken jaw or whatever's going on. >> Matt Grzelcyk I hope is back. >> Dave: Yeah that would be key. >> That would be key yeah so, >> Dave: sure up the defense >> That's right. (crosstalk) >> Ken: He's a plus minus leader Chara. >> Oh yeah. >> That's right all time. >> Even though we give him a lot of grief. (laughter) he may look slow but he's all time plus minus leader. >> All right Kenny hey thanks so much-- >> All right Dave thanks for having me on all right go Bruins. >> All right keep it right there everybody go Bruins we will be right back Dave Vellante, Stu Miniman and Lisa Martin we're live from Cisco Live in San Diego you're watching theCUBE. (electronic jingle)

Published Date : Jun 11 2019

SUMMARY :

Brought to you by Cisco and its eco system partners. Kenny great to see you thanks for coming on. Good seeing you as well. the quicker you can get it up and running is for our product so that the customers you can never have enough Stealthwatch okay? how to get it quickly and you know the number one 'cause their using 'em and so what you do and some of the others so you know that's where we are we have switched over the last 18 months in the data that we collect is context. at you know synthesizing that matrix down and you see the number of point products is simplification, speed time, time to value. Raise the cost to the bad guys and then and that's what we take our customers through right. you got the Bruins logo. Could you not take that Dave? Well it's you know it's rare to have a game seven All right so Chara is going to play That's right. Even though we give him a lot of grief. All right Dave thanks for having me on go Bruins we will be right back Dave Vellante,

ENTITIES

Entity	Category	Confidence
Dave Vellante	PERSON	0.99+
Ken O'Reilly	PERSON	0.99+
Lisa Martin	PERSON	0.99+
Ken O’Reilly	PERSON	0.99+
Dave	PERSON	0.99+
Cisco	ORGANIZATION	0.99+
Stu Miniman	PERSON	0.99+
Ken	PERSON	0.99+
three years	QUANTITY	0.99+
Chara	PERSON	0.99+
two	QUANTITY	0.99+
250 days	QUANTITY	0.99+
Matt Grzelcyk	PERSON	0.99+
25%	QUANTITY	0.99+
21%	QUANTITY	0.99+
two years	QUANTITY	0.99+
Kenny	PERSON	0.99+
Today	DATE	0.99+
San Diego	LOCATION	0.99+
three	QUANTITY	0.99+
Chuck Robbins	PERSON	0.99+
San Diego, California	LOCATION	0.99+
Bruins	ORGANIZATION	0.99+
Five years	QUANTITY	0.99+
last quarter	DATE	0.99+
both	QUANTITY	0.99+
both groups	QUANTITY	0.99+
15	QUANTITY	0.99+
1984	DATE	0.98+
five years	QUANTITY	0.98+
DevNet	ORGANIZATION	0.98+
three years ago	DATE	0.98+
Warriors	ORGANIZATION	0.98+
five years ago	DATE	0.98+
today	DATE	0.98+
a year	QUANTITY	0.97+
2019	DATE	0.97+
one	QUANTITY	0.97+
over 90%	QUANTITY	0.97+
Stealthwatch	ORGANIZATION	0.96+
game seven	QUANTITY	0.96+
Cisco Stealthwatch	ORGANIZATION	0.96+
50-100	QUANTITY	0.95+
tomorrow	DATE	0.95+
Splunk	TITLE	0.93+
50-100 products	QUANTITY	0.92+
theCUBE	ORGANIZATION	0.88+
first	QUANTITY	0.88+
five 10 years	QUANTITY	0.84+
over 15 years	QUANTITY	0.82+
US	LOCATION	0.82+
game sevens	QUANTITY	0.8+
two pretty	QUANTITY	0.79+
past three plus years	DATE	0.75+
Cisco Live	EVENT	0.74+
Google Cloud Next	TITLE	0.72+
Boston	LOCATION	0.7+
SecOps	ORGANIZATION	0.66+
TK	PERSON	0.65+
ICE	ORGANIZATION	0.59+

J.R. Storment, Cloudability | CUBEConversation, February 2019

[Music] hi I'm Peter Burroughs welcome to another cube conversation from our beautiful Studios here in palo alto california as we do with every cube conversation we come up with a great topic and we find someone who really understands it so they can talk about it we capture them for you so you can learn something about some of the new trends and changes in the industry and we've doing that today too the topic that we're talking about is how do you do a better job of mapping the costs that are being generated by the cloud or that informations coming out of cloud suppliers related to what you're using with the actual business activities that generate the differential capabilities that customers are looking for that's a tough tough challenge and to understand that better we're talking with junior Stormin who's a co-founder of cloud ability Jaron welcome to the cube thanks Peter good to be here so so let's talk about first who are you yeah so I am co-founder of cloud ability and credibility is focused around improving the unit economics of cloud spend so our customers tend to be those who are spending large amounts in AWS or Azure or GC P and we take their billing data their utilization data various metadata about their business and do machine learning and data science on top of it to help them get better visibility into sort of where that spend is going how they're using it but more importantly to give them some controls around how they want to optimize an optimize doesn't necessarily mean save money in a cloud world because most companies who are moving into cloud very heavily are doing that for the innovation for the speed so they can deliver you know better data faster but it's really about fine-tuning the conversation say okay here we want to save money here we want to move faster here we want to focus on quality and really providing a way for the the various groups that aren't normally talking the finance teams with the engineering teams with the procurement teams all these groups to come together and be able to take executive input to say okay how do we want to operate and how do we one improve those your economics as we go well I want to start with just quick comment on this notion of Union I can when people here historically hear the notion of unit unit economics they think of you know increasing scale so the average cost per unit goes down yeah I think you're talking about more than that right are you really also talking about a mapping of what spend is generating to the business activities that actually generate value and ensuring that you get the differential or the optimized Union economics unit cost yeah oh so the mapping is actually really interesting ly challenging in cloud it's hard enough in traditional IT if you look at somebody like AWS they have two hundred thousand SKUs different products you can buy and they now bill at a second level resolution so what this means is you've got all these engineers out there using cloud in a very good way to move quickly and of 8/2 little more features and they kind of have an unlimited credit card that they can go spend on as quickly as they need and they never see the statements they never see the bills and the other side you've got finance teams procurement teams who've sort of lost control of traditionally the power of the PIO that they have to actually rein that in and they're they're struggling just to understand what is the spend and then to the mapping question how do i allocate these hundreds and millions of charges that i have this month into cost centers and business units and getting that sorted in a world where engineers are focused on moving fast or not they're not tagging things based on cost and are typically so once you get that sort of mapping aspect sorted to the next point you brought as is in bringing the business value so how do we start to relate that back there's a concept a lot of you know IT has been a cost center and now it's sexual driver of value in a world where businesses are increasingly delivering their value through software so we need to start tying the spending mapping into the business and then tying that to the value delivered a great example of this I was sitting last week with one of the largest cloud spenders in the world there have been you know nine figures with their primary vendor and in the conversation with the executives we realized that nobody was looking at both sides of that equation you had the the finance people who were saying hey we're tracking the cost and we think I was happening there and then you had the the revenue generators looking at the money coming anyhow the cloud people with that but there wasn't this centralized view to say alright we want to have a conversation about what value I were getting to spend and the question that always comes up what that is I was doing the right amount well let me build on that because it's seat because IT is historically and this is one of the things that we've been doing over the last few years IT has historically done things on a project level yes all right so we had waterfall development we tried to change that with agile we had you know buy the hardware upfront and then deploy the application on a cloud changes that so this project orientation has led to a set of decisions about finance at the moment that the business asides to do it we've changed the practices that we use at a development level we've changed the practices that we use at an asset level is it now time to change the practices that we use at a finance level is that really kind of what's going on here it is that the project analogy is good because what we're seeing is they're shifting from a project basis to a productive basis and products that deliver value increasingly if you think about the change that's happened with DevOps it in the scene and cloud companies are delivering more of their value through software and they're not just using IT for internal projects right it's actually the driver of business how we interact with Airlines and banks and all these things so that's the shift to say okay now we've gotten good at DevOps moving fast and we've gotten good at deploying and building better data stores now we need to bring in this new discipline and the discipline is what the market is calling fin ops which essentially is combining financial financial operations but you simply combine technology applied specifically do a cloud roll and it only can really happen in cloud it can't happen in data centers because data centers have fixed spending right you have to wait to get resources once you make the investment it's a sunk cost there's months of lead time cloud introduced the removal of constraints which means you can get whatever you want as quickly as you want and DevOps meant it's all automated so instead of your collection of 60 servers you've got thousands that are coming up and down all the time so what you don't have to do is bring in all these groups engineers have to think about cost as a new efficiency metric they have to think about the impact of their business at this code this confirmation template they just wrote is going to have and the finance teams have to shift from this mode of I'm under report retro actively and at a quarterly granularity sixty days after it happened and block investment to be I'm going to partner with these teams report in a real-time fashion give them the visibility help forecast and actually bring them together to make better business decisions about the cloud spend so cloud has allowed development to alter practically agile has been around for a long time before the cloud predates the cloud but it became practical and almost demanded as a consequence of what you could do with cloud so cloud change development through agile it changed infrastructure management through DevOps where now you're you're deploying software infrastructure of code and know as code and what you're saying is the third leg of that stool cloud is now changing how you do financial management of technology financial management of IT and we're calling that fin ops yeah and you you you can't really have fin ops without cloud or without DevOps and if you have the two together you alter we need this new set of it's a new operating model the reason this has come to a head of late is you know if you look at going to the Amazon riemeck conferences a few years back it was like well how much is cloud gonna be a thing and okay clouds now gonna be a thing when's it gonna happen now it's about the how and how do we do this better cloud is hitting for the material spend levels now at bigger organizations I mean the you know see the the cloud projections where it's going I think it's now 360 billion the next few years and we're seeing CFO's at public companies look to say okay it's not my biggest line-item yet but it's the most variable and fastest growing cogs expense so it's actually start to affect our margins we needed a new set of process used to actually manage this so one of the things that's coming to market is this new group called the phenoms Foundation which is a non-profit trade association that initially has a few dozen of some of the largest cloudspinners of the world there's the Spotify as the alaskans the nation why it's Autodesk's and they've all come together as a set of best practice practitioners to start to codify this into something that can be you know scaled out in organizations so that group is gonna be putting out a user conference around this area there's a new o'reilly book that's coming out the end of the year that's going to be sort of the treatise and all this stuff pulled together because what we found in you know me is in code ability in the last eight years we bring in technology and platform to show the recommendations of visibility how to do this but the real challenge companies run into is they don't have the internal expertise their finance teams understand what they need to the engineers don't and so you know they came to us last year saying can you help figure out the processes can you educate us and that's really where you know the spin offs foundation is growing bringing together those people to define those processes so the the impact of cloud on each of these different groups on the development group on the infrastructure team and now on the finance team the interest the developer groups I think some of them resisted it but generally speaking it's gone okay and and eventually tooling from a variety different players came along that made it easy to enact best practices and software development through an agile mechanism in the last few years after significant battles within infrastructure teams about whether or not they were going to use software as code we've seen new products new tooling that has facilitated the adoption of those practices what kind of tooling are we going to see introduced that facilitates thin ops so that finance teams procurement teams move from a project orientation to a strategic management of the resource orientation I mean I think the first is on the engineering side is seeing costs become a first-class citizen of an efficiency metric that they need to look at so you know in their build processes baked in the CI CD looking to see am I properly sizing my compute request for the workload that it needs there's some research research just came out showing that I think it's like 80 percent of the market is not using the best discounting options the cloud providers offer you hear these horror stories it's too expensive we said overspend that's not actually a problem with the cloud providers that's a problem with the enterprises not using the tools offered the discounts the reserved ences the infrequent access door exactly so I think at the end of the day it's the first step in this is getting those checks in place to say are we using the things that help drive the right cost for our needs and the other side of that the finance team is really changing the way that they are interacting with their technology teams becoming partners becoming advocates in this versus a passive you know retroactive reporter down the line and this enables these sort of micro optimization discussions that can happen where data center world we bought it some cots is sitting there odd world we can make decisions today that impact you know the business tomorrow so let me make sure I got this so I have a client who who I was having a conversation with them they told me that their their Amazon there AWS bill is 87 gigabytes mm-hmm not that monthly that's 87 pages that's 87 gigabyte yeah so we get we bring this 87 gigabytes in and it's a story about what I consume out of Amazon it's not a story what my business utilizes to achieve its objectives so we're now entering into a world where we're trying to introduce those financial visit that financial visibility into how that spend can be mapped to what the business does so the finance group can look at a common notion of truth and the IT group can look at a combination of troop application owners can look at a common notion of truth and that's what is fin ops is providing if I got that right yeah absolutely and the eighty-seven gigabyte example is the exactly reason why it is fin option not just cloud financial management you can't have a person with a spreadsheet looking at that and trying to make decisions about it right it has to be automated its IT finances code it's got to be baked into the processes you know we we've seen organizations that have hundreds of millions of individual charges hitting them in a consumption based manner the other thing that's come in with the fin ops as a core tenant is we're now seeing a decentralization of accountability for that spend so if you look at the big cloud spenders out there maybe spending tens or hundreds mils a year some of them have thousands of cloud environments gone is the day of we have a centralized Group begins to say we're gonna turn this off turn this off we want to give each of those teams the ability to see there's just their portion of that bill in the right mapped way as you said and to be able to take actions on the back of that so that's changed and they you know you run it you maintain it you understand which shutdown what has sort of come back to the old centralized model is this notion and this is where procurements job is shifted to largely of we deal still want to centralize the rate reduction so engineers you go use less right essentially finance team procurement work together with the cloud vendors to get the best possible rates through reserved instances can be reduced discounts you know volume discounts negotiated rates whatever it is and they become sort of strategic sourcing just say you're gonna use whatever you're going to use and you're gonna watch that to make sure you're using the right amount will targets threshold we're gonna make sure we get the best rate and that's sort of the two sides of the coin well very importantly procurement has always been organized on episodic purchases where the whole point is to bring the price point down and now we're talking about a continuous services where you were literally you're literally basing your business on capabilities provided by a third party and that is a very very very different relation just-in-time purchasing right and it's and it's a new supply chain management process where you have so many SKU options and you are making these purchase decisions sometimes thousands a day and that impacts everything down the road excellent gr storm and co-founder of cloud ability talking about Finn ops and cloud abilities role in helping businesses map the cloud spend to their business activities for a better more optimal views of how they get what they need out of their cloud expenditures Jr thanks very much for being on the connects here and once again I'm Peter burrows and thanks for listening to this acute conversation until next time [Music] you

Published Date : Feb 22 2019

**Summary and Sentiment Analysis are not been shown because of improper transcript**

ENTITIES

Entity	Category	Confidence
Amazon	ORGANIZATION	0.99+
February 2019	DATE	0.99+
80 percent	QUANTITY	0.99+
87 gigabytes	QUANTITY	0.99+
360 billion	QUANTITY	0.99+
60 servers	QUANTITY	0.99+
last week	DATE	0.99+
Jaron	PERSON	0.99+
tens	QUANTITY	0.99+
87 gigabyte	QUANTITY	0.99+
Peter Burroughs	PERSON	0.99+
AWS	ORGANIZATION	0.99+
Peter	PERSON	0.99+
two	QUANTITY	0.99+
last year	DATE	0.99+
sixty days	QUANTITY	0.99+
thousands	QUANTITY	0.99+
Autodesk	ORGANIZATION	0.99+
87	QUANTITY	0.99+
first step	QUANTITY	0.99+
Spotify	ORGANIZATION	0.99+
J.R. Storment	PERSON	0.99+
phenoms Foundation	ORGANIZATION	0.99+
tomorrow	DATE	0.99+
eighty-seven gigabyte	QUANTITY	0.98+
both sides	QUANTITY	0.98+
nine figures	QUANTITY	0.98+
Stormin	PERSON	0.98+
thousands a day	QUANTITY	0.97+
palo alto california	LOCATION	0.97+
third leg	QUANTITY	0.97+
each	QUANTITY	0.97+
today	DATE	0.97+
Jr	PERSON	0.97+
first	QUANTITY	0.96+
DevOps	TITLE	0.96+
one	QUANTITY	0.96+
hundreds mils a year	QUANTITY	0.94+
two sides	QUANTITY	0.94+
two hundred thousand SKUs	QUANTITY	0.92+
last eight years	DATE	0.91+
agile	TITLE	0.89+
second level	QUANTITY	0.89+
o'reilly	ORGANIZATION	0.87+
GC	ORGANIZATION	0.86+
next few years	DATE	0.86+
last few years	DATE	0.84+
Azure	TITLE	0.83+
87 gigabytes	QUANTITY	0.83+
hundreds and	QUANTITY	0.8+
pages	QUANTITY	0.79+
P	TITLE	0.79+
thousands of cloud environments	QUANTITY	0.77+
hundreds of millions of individual charges	QUANTITY	0.76+
one of	QUANTITY	0.73+
last few years	DATE	0.71+
things	QUANTITY	0.71+
Peter burrows	PERSON	0.7+
one of the things	QUANTITY	0.68+
riemeck	EVENT	0.66+
few dozen	QUANTITY	0.64+
end	DATE	0.63+
this month	DATE	0.6+
millions of charges	QUANTITY	0.6+
8/2 little	QUANTITY	0.58+
months	QUANTITY	0.56+
cube	QUANTITY	0.52+

Jason Edelman, Network to Code | Cisco Live EU 2019

>> Live, from Barcelona Spain, it's theCUBE, covering Cisco Live! Europe. Brought to you by Cisco and its ecosystem partners. >> Welcome back to theCUBE, here at Cisco Live! 2019 in Barcelona, Spain, I'm Stu Miniman, happy to welcome to the program a first-time guest, but someone I've known for many years, Jason Edelman, who is the founder of Network to Code. Jason, great to see you, and thanks for joining us. >> Thank you for having me, Stu. >> Alright, Jason, let's first, for our audiences, this is your first time on the program, give us a little bit about your background, and what led to you being the founder of Network to Code. >> Right, so my background is that of a traditional network engineer. I've spent 10+ years managing networks, deploying networks, and really, acting in a pre-sales capacity, supporting Cisco infrastructure. And it was probably around 2012 or 13, working for a large Cisco VAR, that we had access to something called Cisco onePK, and we kind of dove into that as the first SDK to control network devices. We have today iPhone SDKs, SDKs for Android, to program for phone apps, this was one of the first SDKs to program against a router and a switch. And that, for me, was just eye-opening, this is kind of back in 2013 or so, to see what could be done to write code in Python, Seer, Java, against network devices. Now, when this was going on, I didn't know how to code, so I kind of used that as the entrance to ramp up, but that was, for me, the pivot point. And then, the same six-week period, I had a demo of Puppet and Ansible automated networking devices, and so that was the pivot point where it was like, wow, realizing I've spent a career architecture and designing networks, and realizing there's a challenge in operating networks day to day. >> Yeah, Jason, dial back. You've some Cisco certifications in your background? >> Sure, yes, CCIE, yeah. >> Yeah, so I think back, when this all, OpenFlow, and before we even called it Software-Defined Networking, you were blogging about this type of stuff. But, as you said, you weren't a coder. It wasn't your background, you were a network guy, and I think the Network to Code, a lot of the things we've been looking at, career-wise, it's like, does everyone need to become coders? How will the tools mature? Give us a little bit about that journey, as how you got into coding and let's go from there. >> Yeah, it was interesting. In 2010, I started blogging OpenFlow-related, I thought it was going to change the world, saw what NICRO was doing at the time, and then Big Switch at the time, and I just speculated and blogged and really just envisioned this world where networks were different in some capacity. And it took a couple years to really shed light on management and operations of networking, and I made some career shifts. And I remember going back to onePK, at the time, my manager then, who is now our CEO at Network to Code, he actually asked, well, why don't you do it? And it was just like, me? Me, automate our program? What do you mean? And so it was kind of like a moment for me to kind of reflect on what I can do. Now, I will say I don't believe every network engineer should know how to code. That was my on-ramp because of partnership with Cisco at the time, and learning onePK and programming languages, but that was for me, I guess, what I needed as that kick in the butt to say, you know what? I am going to do this. I do believe in the shift that's going to happen in the next couple years, and that was where I kind of just jumped in feet first, and now we are where we are. >> Yeah, Jason, some great points there. I know for myself, I look at, Cisco's gone through so much change. A year ago, up on stage, Cisco's talking about their future is as a software company. You might not even think of us as networking first, you will talk to us about software first. So that initial shift that you saw back in 2010, it's happening. It's a different form than we might have thought originally, and it's not necessarily a product, but we're going through that shift. And I like what you said about how not everybody needs to code, but it's this change in paradigms and what we need to do are different. You've got some connections, we're here in the DevNet Zone. I saw, at the US show in Orlando last year, Network to Code had a small booth, there were a whole bunch of startups in that space. Tell us how you got involved into DevNet, really since the earliest days. >> Yes, since the early days, it was really pre-DevNet. So the emergence of DevNet, I've seen it grow into, the last couple years, Cisco Live! And for us, given what we do at Network to Code, as a network-automation-focused company, we see DevNet in use by our clients, by DevNet solutions and products, things like, mentioned yesterday on a panel, but DevNet has always-on sandboxes, too. One of the biggest barriers we've seen with our clients is getting access to the right lab gear on getting started to automate. So DevNet has these sandboxes always on to hit Nexus API or Catalyst API, right? Things like that. And there's really a very good, structured learning path to get started through DevNet, which usually, where we intersect in our client engagement, so it's kind of like post-DevNet, you're kind of really showing what's possible, and then we'll kind of get in and craft some solutions for our clients. >> Yeah, take us inside some of your clients, if you can. Are most of them hitting the API instead of the COI now when they're engaging? >> Yeah, it's actually a good question. Not usually talked about, but the reality is, APIs are still very new. And so we actively test a lot of the newer APIs from Cisco, as an example. IOS XE has some of the best APIs that exist around RESTCONF, NETCONF, modeled from the same YANG models, and great APIs. But the truth is that a lot of our clients, large enterprises that've been around for 20+ years, the install base is still largely not API-enabled. So a lot of the automation that we do is definitely SSH-based. And when you look at what's possible with platforms, if it is something like a custom in Python, or even an ANSEL off the shelf, a lot of the integrations are hidden from the user, so as long as we're able to accomplish the goal, it's the most important thing right now. And our clients' leaderships sometimes care, and it's true, right? You want the outcome. And initially, it's okay if we're not using the API, but once we do flip that switch, it does provide a bit more structure and safety for automating. But the install base is so large right now that, to automate, you have to use SSH, and we don't believe in waiting 'til every device is API-enabled because it'll just take a while to turn that base. >> Alright, Jason, a major focus of the conference this year has been around multi-cloud. How's that impacting your business and your customers? >> So, it's in our path as a company. Right now, there's a lot of focus around multi-cloud and data center, and the truth is, we're doing a lot of automation in the Campus networking space. Right, automating networks to get deployed in wiring closets and firewalls and load balancers and things like that. So from our standpoint, as we start planning with our clients, we see the services that we offer really port over to multi-cloud and making sure that with whatever automation is being deployed today, regardless of toolset, and look at a tool chain to deploy, if it's a CI/CD Pipeline for networking, be able to do that if you're managing a network in the Campus, a data center network, or multi-cloud network, to make sure we have a uniform-looking field to operations, and doing that. >> Alright, so Jason, you're not only founder of your company, you're also an author. Maybe tell us about the, I believe it's an update, or is it a new book, that recently got out. >> Yes, I'm a co-author of a book with Matt Oswalt and Scott Lowe, and it's an O'Reilly book that was published last year. And look, I'm a believer in education, and to really make a change and change an industry, we have to educate, and I think the book, the goal was to play a small part in really bringing concepts to light. As a network engineer by trade, there's fundamental concepts that network engineers should be aware of, and it could be basics and a lot of these, it could be Python or Jinja templating in YAML and Git and Linux, for that matter. It's just kind of providing that baseline of skills as an entrance into automation. And once you have the baseline, it kind of really uncovers what's possible. So writing the book was great. Great opportunity, and thank you to Matt and Scott for getting involved there. It really took a lot of the work effort and collaborated with them on it. >> Want to get your perception on the show, also. Education, always a key feature of what happens at the show. Not far from us is the Cisco bookshop. I see people getting a lot of the big Cisco books, but I think ten years ago, it was like, everybody, get my CCIE, all my different certifications updated, here. Here in the DevNet Zone, a lot of people, they're building stuff, they're building new pieces, they're playing in the labs, and they're doing some of these environments. What's your experience here at the show? Anything in particular that catches your eye? >> So, I do believe in education. I think to do anything well, you have to be educated on it. And I've read Cisco Press books over the years, probably a dozen of them, for the CCIE and beyond. I think when we look at what's in DevNet, when we look at what's in the bookstore, people have to immerse themselves into the technology, and reading books, like the learning labs that are here in the DevNet Zone, the design sessions that are right behind us. Just amazing for me to have seen the DevNet Zone grow to be what it is today. And really the goal of educating the market of what's possible. See, even from the start, Network to Code, we started as doing a lot of training, because you really can't change the methodology of network operations without being aware of what's possible, and it really does kind of come back to training. Whatever it is, on-demand, streaming, instructor-led, reading a book. Just glad to see this happen here, and a lot more to do around the industry, in the space around community involvement and development, but training, a huge part of it. >> Alright, Jason, want to give you the final word, love the story of network engineer gone entrepreneurial, out of your comfort zone, coding, helping to build a business. So tell us what you see, going forward. >> So, we've grown quite a bit in the past couple years. Right now, we're over 20 engineers strong, and starting from essentially just one a couple years ago, was a huge transformation, and seeing this happen. I believe in bringing on A-players to help make that happen. I think for us as a business, we're continuing to grow and accelerating what we do in this network automation space, but I just think, one thought to throw out there is, oftentimes we talk about lower-level tools, Python, Git, YAML, a lot of new acronyms and buzzwords for network engineers, but also, the flip side is true, too. As our client base evolves, and a lot of them are in the Fortune 100, so large clients, looking at consumption models of technology's super-important, meaning is there ITSM tools deployed today, like a ServiceNow, or Webex teams, or Slack for chat integration. To really think through early on how the internal customers of automation will consume automation, 'cause it really does us no good, Cisco, vendors, or clients no good, if we deploy a great network automation platform, and no one uses it, because it doesn't fit the culture of the brand of the organization. So it's just, as we continue to grow, that's really what's top of mind for us right now. >> Alright, well Jason, congratulations on everything that you've done so far, wish you the best of luck going forward, and thank you so much, of course, for watching. We'll have more coverage, three day, wall-to-wall, here at Cisco Live! 2019 in Barcelona. I'm Stu Miniman, and thanks for watching theCUBE. (electronic music)

Published Date : Jan 30 2019

SUMMARY :

Brought to you by Cisco and its ecosystem partners. Jason, great to see you, and thanks for joining us. and what led to you being the founder of Network to Code. to program for phone apps, this was one of the first You've some Cisco certifications in your background? and I think the Network to Code, as that kick in the butt to say, you know what? And I like what you said about One of the biggest barriers we've seen with our clients instead of the COI now when they're engaging? So a lot of the automation that we do Alright, Jason, a major focus of the conference this year and data center, and the truth is, or is it a new book, that recently got out. And look, I'm a believer in education, and to really Here in the DevNet Zone, a lot of people, the DevNet Zone grow to be what it is today. So tell us what you see, going forward. I believe in bringing on A-players to help make that happen. and thank you so much, of course, for watching.

ENTITIES

Entity	Category	Confidence
Jason	PERSON	0.99+
Cisco	ORGANIZATION	0.99+
Matt	PERSON	0.99+
Jason Edelman	PERSON	0.99+
Stu Miniman	PERSON	0.99+
Matt Oswalt	PERSON	0.99+
2013	DATE	0.99+
2010	DATE	0.99+
Scott Lowe	PERSON	0.99+
Scott	PERSON	0.99+
10+ years	QUANTITY	0.99+
Barcelona	LOCATION	0.99+
Stu	PERSON	0.99+
IOS XE	TITLE	0.99+
Network to Code	ORGANIZATION	0.99+
last year	DATE	0.99+
iPhone	COMMERCIAL_ITEM	0.99+
13	DATE	0.99+
first time	QUANTITY	0.99+
Orlando	LOCATION	0.99+
Barcelona Spain	LOCATION	0.99+
O'Reilly	ORGANIZATION	0.99+
Java	TITLE	0.99+
Python	TITLE	0.99+
Barcelona, Spain	LOCATION	0.99+
three day	QUANTITY	0.99+
six-week	QUANTITY	0.99+
Android	TITLE	0.99+
first-time	QUANTITY	0.98+
20+ years	QUANTITY	0.98+
NICRO	ORGANIZATION	0.98+
yesterday	DATE	0.98+
one	QUANTITY	0.98+
first	QUANTITY	0.98+
ten years ago	DATE	0.98+
Linux	TITLE	0.98+
Git	TITLE	0.97+
Cisco Press	ORGANIZATION	0.97+
DevNet	TITLE	0.97+
A year ago	DATE	0.97+
Cisco Live! 2019	EVENT	0.97+
today	DATE	0.97+
over 20 engineers	QUANTITY	0.96+
this year	DATE	0.96+
YAML	TITLE	0.96+
RESTCONF	TITLE	0.95+
Jinja	TITLE	0.93+
first SDK	QUANTITY	0.93+
next couple years	DATE	0.92+
US	LOCATION	0.92+
first SDKs	QUANTITY	0.92+
Seer	TITLE	0.91+
Nexus API	TITLE	0.91+
theCUBE	ORGANIZATION	0.91+
NETCONF	TITLE	0.88+
Cisco Live EU 2019	EVENT	0.86+
last couple years	DATE	0.86+
2012	DATE	0.85+
One	QUANTITY	0.85+
Catalyst API	TITLE	0.83+
SSH	TITLE	0.81+

Patrick O’Reilly, O’Reilly Venture Partners | Microsoft Ignite 2018

>> Live from Orlando, Florida, it's theCUBE covering Microsoft Ignite. Brought to you by Cohesity and theCUBE's ecosystem partners. >> Welcome back, everyone, to theCUBE's live coverage of Microsoft Ignite. I'm your host, Rebecca Knight, along with my cohost, Stu Miniman. We're joined by Patrick O'Reilly of O'Reilly Venture Partners based in San Francisco. Thanks so much for coming on theCUBE, Patrick. >> Thanks for having me. >> So, you are a serial entrepreneur now working as a VC, what are you doing here? Tell us why you came to Ignite. >> Yeah, well selfishly on the VC side we have a few of our portfolio companies here that have booths, and I wanted to kind of hear what people are asking, you know, why they're interested in the companies and how we're framing, you know, those companies to the end users. I think these type of events are really good to unlock hidden potential, or things that people can tell you that you wouldn't actually have thought about, yeah. >> Yeah, so Patrick, you know, I've known you for a number of years. Usually see you at the opensource shows. Microsoft, you know, publicly very embracing opensource. You know, they love Linux, partnering with Red Hat, even you know, partnering is a lot of things that Microsoft does. They were working with VMware. What's your viewpoint as to how you see Microsoft and the opensource world, and how about this ecosystem? Is this a vibrant ecosystem that, you know, VCs are investing in, or is it just that there's companies of yours that, you know, this is part of the story. >> No, and I think historically we've had the, you know, build versus buy, you know, kind of way of looking at it, but when I typically think of Microsoft, it's more people building glue, you know, code to kind of connect things together, and you tend to have blinders on and not think about what opensource components you can use. You know, you look for like what company has a solution you can buy, or license or OEM, and I think that's changing, you know, over time. You know, Microsoft does an amazing job with developers of giving them very easy to understand languages and amazing tooling, and along with that the documentation and the training, so I kind of felt like you came into development one of two ways. You either were like on the Microsoft track and using the cookie cutter approach, you know, to doing things and getting certified on something, or you were opensource, you learned the scripting language and you just looked at what you can cobble together in the opensource world, and there wasn't a lot of crosspollination, but now I see that those walls kind of dissolving. People are willing to mix and match. >> Yeah, it's interesting, you know, some places I've seen Microsoft, a lot in the Kubernetes show, so you know, first got to know you you were at Kismetic, you know, really the first company around Kubernetes that we knew. You know, I know you're doing a lot of different things but we love your viewpoint on, you know, anything on Microsoft in that space, as well as just what you've seen, you know, as a watcher of the Kubernetes space these days. >> Yeah, I mean I've been... You know, if I step back from Kubernetes, you know, back to like the Apache Mesos and the Mesosphere days, you know, if you rewind all the way back there you kind of had to do a lot of education of like, "What do you mean 'containerization?' "I have VMs, why do I need containers?" And now that we've gotten past that and people actually understand the value of containers, like having an orchestration system in place that works and works with everything, you know, is obviously more important than ever, and it's... I really credit the CNCF and the Linux Foundation for what they've done to kind of bring standards around Kubernetes and shepherd the project, and I think that, you know, the fairly recent announcement from Google that they're fully trusting, you know, CNCF to be the shepherd of that is huge, and it gives a framework for people, like Gabe at Microsoft, to work with, you know, some of the staff at Google, and like, in a collaborative way and move it forward for everyone, and I think, you know, historically containers made a ton of sense on Linux, but now that we have Windows server, you know, supporting containers and theCUBE working, you know, on Windows, I think in the 111... Or sorry, 113 release we'll have full Windows server, you know, support in Kubernetes, like that'll be huge. And just a quick aside, like the reason I even kind of honed in on containers and thought it was interesting is the average server utilization is still so low, but we're not really trained as technologists to care about that, and you know, we're really good at building data centers and tucking them off in places where no one sees, but when the average server's taking like... It's like running a hairdryer on high, you know, for electricity and then they run so hot you have to cool it. Like, we're really not helping the environment, so I think if we can move towards containerization, move towards efficient utilization of our hardware, you know, it'll be better for everyone, not just this ecosystem, so... >> So, talk to, tell our viewers a little bit about your portfolios and your portfolio companies that are here, and how they fit into the ecosystem. >> Yeah, so the one I'm most excited about, or shouldn't probably say it that way, I'll reframe that-- >> Can't have favorites, they're all your babies. (laughs) >> Yeah, they're all my babies. (laughs) >> But Ziften Technologies is great. I think their integration with the Windows, the vendor ATP, you know, advanced threat protection, you know, tool is great. They focus on the Mac and the Linux components and give you that same kind of pane of glass on the Microsoft side to see those endpoints, and like their utilization of AI, like they have an upcoming release where they're using AI to do things, and traditionally in that space it's been like the AB vendors, you know, doing everything and you had kind of, "Here's our signatures, "we're going to scan against those signatures," and it's a creative use of AI now to, like, look for just anomaly detections. These are the things we haven't seen before. Not sure what it is but it looks abnormal, and those are the kind of like spin-outs of companies that I'm looking for, too. Like I want to see people doing more meaningful things, you know, with AI. I think if we look at Azure and what they're offering now, like I don't need to have a bunch of data scientists at my startup. I can implement computer vision just using what off-the-shelf components, you know, from Microsoft and you know, Azure. I can do video indexing, you know, using their services. Like, if I rewind just back three years I would've had to have a team of like four data scientists. They'd be reading whitepapers, they'd be implementing code that like sort of half works, and they would probably take half a year to train some models to get, like, moderate results, and now in a matter of minutes, you know, I can use this off-the-shelf stuff. >> Yeah, it's fascinating, I think back to, you know, we were pretty early at theCUBE at watching the whole big data trend, and back then it was like, "Okay, we're going to "take that two-year project and you know, "drive it down to six months," and now we talk in the AI space is, you know, how can we drive that down even more. In big data there was concern, everything seemed to be custom. In AI we're starting to get to more templatized solutions, rolling out for a lot of industries, and it feels like it's taking off a lot faster than that space is, and I know there's a lot of investment going on in the space, and a lot there, so... Anything in particular, you know, what excites you, what makes a good, you know, AI investment versus, you know, there's just so much happening out there. >> Well, you know, I... I struggle with the name AI a little bit. >> Yeah, no, no, I understand, yeah. >> I'm working on a talk, and you know, I kind of like don't, I don't enjoy the artificial aspect of it because it's really just intelligence, and you know, right now it's a buzzword people are throwing into everything when really they mean, "We use an algorithm." (laughs) You know, it's not truly AI, but when we get to cognition we get, you know, to, you know, someday if we have quantum supremacy we'll have, you know, systems that actually can maybe have a consciousness, you know, and decide things. That's where I'm interested, I'm looking... Like on the devops side I'm looking for people using AI to get away with repetitive tasks. Like I would love to see, you know, someone have a system where it's like, "Hey, we've noticed, you know, 90 times "this week this guy's done this exact "same thing, you know, 99% the same way." Like, let's automate that away. You know, we've been really good in the space to kind of treat infrastructure like code, you know, and be able to tear things up. Like I mean, I've been incredibly excited to see, like just in my career, how we went from, "Okay, you're going to do something meaningful on the web. "You need to build a data center. "You need to, you know, get a bunch of servers, racks," and then you pay all this equipment and oh, by the way, 18 months from now it's going to be obsolete and you're going to have to spend money again, to where now I can just, you know, get some credits to start up in the cloud, you know, try things out and do like really meaningful things. So, just looking for anyone on AI that's going to do something that moves the needle. >> Yeah, now that, yeah, just on the terminology piece, I've lived through the cloud wars and the argument over what was and what isn't, so it's just, you know, the shorthand for this wave that we have there, where AI or ML, or you know, IBM has some interesting terms that they want to call it. We understand that there's intelligence that I can do with software, a lot of machine-to-machine things that are going on, and it's not a lot of, you know, shouldn't be a lot of heavy lifting by people to go in there. Oh, wait, I can train something, I can learn what's happening, so... >> Well, I wanted to ask when... I'm sure a lot of entrepreneurs ears are pricking up when they hear that you want to make these meaningful investments. What is it that you look for in a company, is it... In terms of the leadership team, in terms of any track record, what sort of makes your eyes light up? >> So, I try to go to as many conferences as I can, because I feel that's where, you know, the hallway track and I can meet people. I can see, you know, their talks, see what they're passionate about, so what I'm really looking for is investing more in the people than in the idea, because startups can always pivot, and you look at some of the greatest companies out there, they were pivots from, you know, a slightly different model and they realized that, "Oh, we should go chase down this other thing." So, to me, I'm looking for people that are doing something exciting where they are already, looking to make the leap. You know, for example, like you know, the Spinnaker team or people that do something, you know, like... You know, like if etcd wanted to move off and be a separate company, like things like that where they've done something, they've proven it, and now they want to go start a company around it, and I think right off the bat, like if you've built some interesting technology that people are starting to use you have a decent revenue stream just from support, you know, of that and helping those end users, and I think, you know, with O'Reilly we do something a little different than other people. Like I focus mostly on seed investment, very early stage. Our typical check size is around $500k, and I actually allow people to take us off the cap table and just pay us back. Like you know, I've done nine startups in my career, and it's... Fundraising is one of those things where you only get good at it once you don't need it anymore, (chuckles) and I felt the pain of being on that side of the desk and I want to be in the position where, you know, we can write the checks and not try to, like, have a lot of governance, not try to take a board seat, not give you down pressure, you know, on what you're doing but really be additive. I think moving forward I would love to be in the position where we can help incubate, you know, a lot of companies because we've found that, you know, you all kind of go through, every company goes through the same process like, "Now, we need a real CFO because "we need financial projections." Like, being able to, like, provide those services for portfolio companies where they don't have to go spend their resources chasing that down. >> I'm curious how much some of the big players, or just the gravity of what's happening in the space that you're looking at, so obviously we're here at the Microsoft show, but Google, Amazon, a lot of activity going on and we can call it AI or what you will, VMware even, Oracle, SalesForce, how much of the big players defining and you have to build around them, versus you know, we look at Kubernetes is supposed to make things independent, to be able to be opensource and be able to build solutions, you know, regardless of what platform they're on. >> Yeah, I mean, I think we're living in a world where people have a lot of choice, you know, and we look at even, like we take the example of cloud providers. Like, as long as I don't get vendor lock in and use, you know, their specific features, like I can move around to different cloud providers, I can now say I want to negotiate a better price here and migrate over, and I think just with any of the technologies, like trying to work in ways where companies can work together and be additive, I think that's where we actually move, you know, move down the field. I don't know what analogy's appropriate to use, but you know, I feel like there's a lot of really interesting stuff that we should be doing, and making... Every company doing a slightly different version of the same thing I don't think, you know, makes sense. Like, you know, even silly things like as we mature. Like, you know, back in the day everyone used to have broadcast television. We built all these antennas, we got all this range, you know, and then we moved to digital and we didn't need those antennas, we didn't need that range, so they started decommissioning them, but then companies came along and they're like, "Well, wait, now we have this "unlicensed spectrum we can use." So, now they're using it for internet. You know, you can get 20 megabit connectivity out to a rural farm where now they can put some cheap IoT sensors, and like, do really meaningful things with low cost technologies, like those are the things I'm interesting in. You know, so kids that want to cobble together, you know, IoT sensors and come up with a way to use, you know, what they have in rural areas, and like, and have technology actually help people in a meaningful way, and I think those are a lot of very viable startups, you know, in that space. I do think we live in a world where every company's going to end up graduating into one of the camps, be it, you know, SalesForce, Google, you know, Microsoft, but in that innovation spike, like when they're first starting improving out the companies I think they have a ton of choice, you know. >> You described a very beneficent approach to how you think about VC. Do you think, how would you describe the VC landscape right now? You said you want to be able to just incubate great ideas and help these young companies when they are not good at fundraising and they don't have the smooth, slick deck that will really impress the bigger VC firms. I mean, how, what's wrong with the VC landscape today and what else are you doing to make it better? >> Well, I think the incentives are a little off. You know, I can speak for myself, like when I was... You know, when I was looking to raise VC money and my previous companies, like you know, you get these great offers from people, but then you talk to other entrepreneurs and you're like, you know, I'm not going to call anyone out by name, but you're like, "Well, how is this VC's firm served you," and you start hearing of ways that it was additive, but also kind of put undue pressure on them, or they say things like, "Well, we really didn't "need to raise that round then. "We could've done bridge financing "or we could've figured out how to get a MVP product "out there and brought in some revenue." So, I just think it's the ultrahigh returns that VCs are looking for, and the promises that those VCs are making to their LPs, (chuckles) you know, in their funds to outperform everyone else, and you know, everyone talks to everyone, right? So, if anything's meaningful out there looking for investment kind of the back channel is very vibrant and it's dog-eat-dog, and some of it, I kind of reckon it to, you know, your alma mater, like where you went to school. Like, you know, if you're an MIT person, like MIT's the best place in the world. You know, if you're, you know, some other school, they're the best place in the world, and the VCs tend to kind of, like, fall in those camps, and what I'm looking to do-- >> And those are real biases that impact women and underrepresented minorities, to their detriment. >> Yeah, and you know, and that's the thing I've struggled with, too, when you look at the... Like, let's take Andreessen, you know, for example and you look at the portfolio companies, like you know, you kind of become locked into that ecosystem. Like if you want to go, you know, if I'm on Mesosphere and I want to go partner with someone that's not under that, or they have a company in that portfolio that does similar things, you're going to be pressured into working with the portfolio company over going off and maybe choosing the better, you know, choice for the industry, so I'd like to see, you know, those things change. >> Right, and so, Patrick, we talked a little bit about Ziften, security endpoint, you know, really hot space. I want to give the opportunity, other companies you have here that we should check out. >> Yeah, so we work closely with the team at Turbonomic. I think, you know, what they've done over time, you know, is amazing. I love products where you can just bolt it in and within a short period of time you're getting value. Like, you know, stepping back and just saying one thing about Ziften, like I think it's amazing, because I come from a software development, you know, background, and one thing as a software developer I've always found fascinating is like when you come in wearing the developer hat they give you the keys to the kingdom. They're like, "Oh, here's root access to the servers, "here's where all of our data is, "here's how you do a snapshot of production "to, you know, test it, you know, in staging," and I've always thought that it was a tremendous amount of risk, and you know, on average a company can be hacked for up to 100 days before they even realize that they've had a breach, and like, any kind of company, you know, be it Ziften or anyone in that space, that can showcase that to you. Like, you know, raise up things that you weren't aware of, you know, is really interesting, and then, you know, to the, like, Nico and Turbonomics and the things that they're doing there. Like, to actually get the most out of what you already have, like that's huge to me, because one of the, you know, one of the things I see in cloud computing that we didn't necessarily have, you know, directly owned physical infrastructure is it's almost too easy to spin things up. You know, you've got the guy clicking through the UIs like, "Oh, this instance looks great. "Oh, and it says it's only be $140 this month," and then they end up spinning up 1,000 of those, you know? (laughs) You get that first sticker shock of, like, here's that $250,000 bill that month, (chuckles) you know, for cloud, and companies like Turbonomics can, like, avoid you, you know, making those mistakes. >> Great, Patrick, thank you so much for coming on theCUBE. It was really fun talking. >> Yeah. >> We could talk to you for hours. >> Thanks for having me, I appreciate it. >> I'm Rebecca Knight for Stu Miniman. We will have more from theCUBE's live coverage of Microsoft Ignite coming up in just a little bit. (techy music)

Published Date : Sep 26 2018

SUMMARY :

Brought to you by Cohesity and Welcome back, everyone, to theCUBE's what are you doing here? and how we're framing, you know, Yeah, so Patrick, you know, you know, code to kind of a lot in the Kubernetes show, so you know, and the Mesosphere days, you know, fit into the ecosystem. they're all your babies. Yeah, they're all my babies. and now in a matter of minutes, you know, in the AI space is, you know, Well, you know, I... and you know, right now it's a buzzword you know, the shorthand for this wave What is it that you look and I think, you know, with and be able to build solutions, you know, and use, you know, and what else are you and my previous companies, like you know, minorities, to their detriment. Yeah, and you know, endpoint, you know, really hot space. and then, you know, to the, Great, Patrick, thank you of Microsoft Ignite coming

ENTITIES

Entity	Category	Confidence
Rebecca Knight	PERSON	0.99+
Patrick	PERSON	0.99+
Microsoft	ORGANIZATION	0.99+
Stu Miniman	PERSON	0.99+
Google	ORGANIZATION	0.99+
$250,000	QUANTITY	0.99+
90 times	QUANTITY	0.99+
$140	QUANTITY	0.99+
IBM	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
Patrick O'Reilly	PERSON	0.99+
San Francisco	LOCATION	0.99+
99%	QUANTITY	0.99+
SalesForce	ORGANIZATION	0.99+
two-year	QUANTITY	0.99+
Linux Foundation	ORGANIZATION	0.99+
CNCF	ORGANIZATION	0.99+
O'Reilly	ORGANIZATION	0.99+
six months	QUANTITY	0.99+
Turbonomics	ORGANIZATION	0.99+
Turbonomic	ORGANIZATION	0.99+
Windows	TITLE	0.99+
Oracle	ORGANIZATION	0.99+
Orlando, Florida	LOCATION	0.99+
half a year	QUANTITY	0.99+
Gabe	PERSON	0.99+
Ziften Technologies	ORGANIZATION	0.99+
20 megabit	QUANTITY	0.99+
1,000	QUANTITY	0.99+
Linux	TITLE	0.99+
O'Reilly Venture Partners	ORGANIZATION	0.99+
theCUBE	ORGANIZATION	0.98+
around $500k	QUANTITY	0.98+
nine startups	QUANTITY	0.98+
Kismetic	ORGANIZATION	0.98+
Patrick O’Reilly	PERSON	0.98+
one	QUANTITY	0.98+
Andreessen	PERSON	0.98+
Mac	COMMERCIAL_ITEM	0.98+
Cohesity	ORGANIZATION	0.98+
this week	DATE	0.97+
VMware	ORGANIZATION	0.96+
MIT	ORGANIZATION	0.96+
two ways	QUANTITY	0.96+
first	QUANTITY	0.95+
Kubernetes	TITLE	0.95+
Spinnaker	ORGANIZATION	0.93+
Ziften	ORGANIZATION	0.92+
this month	DATE	0.91+
ATP	ORGANIZATION	0.9+
Ignite	ORGANIZATION	0.89+
O’Reilly	ORGANIZATION	0.86+
Kubernetes	ORGANIZATION	0.86+
today	DATE	0.86+
18 months	QUANTITY	0.85+
Nico and	ORGANIZATION	0.82+
Venture Partners	ORGANIZATION	0.81+
first company	QUANTITY	0.79+
one thing	QUANTITY	0.78+

Kickoff | theCUBE NYC 2018

>> Live from New York, it's theCUBE covering theCUBE New York City 2018. Brought to you by SiliconANGLE Media and its ecosystem partners. (techy music) >> Hello, everyone, welcome to this CUBE special presentation here in New York City for CUBENYC. I'm John Furrier with Dave Vellante. This is our ninth year covering the big data industry, starting with Hadoop World and evolved over the years. This is our ninth year, Dave. We've been covering Hadoop World, Hadoop Summit, Strata Conference, Strata Hadoop. Now it's called Strata Data, I don't know what Strata O'Reilly's going to call it next. As you all know, theCUBE has been present for the creation at the Hadoop big data ecosystem. We're here for our ninth year, certainly a lot's changed. AI's the center of the conversation, and certainly we've seen some horses come in, some haven't come in, and trends have emerged, some gone away, your thoughts. Nine years covering big data. >> Well, John, I remember fondly, vividly, the call that I got. I was in Dallas at a storage networking world show and you called and said, "Hey, we're doing "Hadoop World, get over there," and of course, Hadoop, big data, was the new, hot thing. I told everybody, "I'm leaving." Most of the people said, "What's Hadoop?" Right, so we came, we started covering, it was people like Jeff Hammerbacher, Amr Awadallah, Doug Cutting, who invented Hadoop, Mike Olson, you know, head of Cloudera at the time, and people like Abi Mehda, who at the time was at B of A, and some of the things we learned then that were profound-- >> Yeah. >> As much as Hadoop is sort of on the back burner now and people really aren't talking about it, some of the things that are profound about Hadoop, really, were the idea, the notion of bringing five megabytes of code to a petabyte of data, for example, or the notion of no schema on write. You know, put it into the database and then figure it out. >> Unstructured data. >> Right. >> Object storage. >> And so, that created a state of innovation, of funding. We were talking last night about, you know, many, many years ago at this event this time of the year, concurrent with Strata you would have VCs all over the place. There really aren't a lot of VCs here this year, not a lot of VC parties-- >> Mm-hm. >> As there used to be, so that somewhat waned, but some of the things that we talked about back then, we said that big money and big data is going to be made by the practitioners, not by the vendors, and that's proved true. I mean... >> Yeah. >> The big three Hadoop distro vendors, Cloudera, Hortonworks, and MapR, you know, Cloudera's $2.5 billion valuation, you know, not bad, but it's not a $30, $40 billion value company. The other thing we said is there will be no Red Hat of big data. You said, "Well, the only Red Hat of big data might be "Red Hat," and so, (chuckles) that's basically proved true. >> Yeah. >> And so, I think if we look back we always talked about Hadoop and big data being a reduction, the ROI was a reduction on investment. >> Yeah. >> It was a way to have a cheaper data warehouse, and that's essentially-- Well, what did we get right and wrong? I mean, let's look at some of the trends. I mean, first of all, I think we got pretty much everything right, as you know. We tend to make the calls pretty accurately with theCUBE. Got a lot of data, we look, we have the analytics in our own system, plus we have the research team digging in, so you know, we pretty much get, do a good job. I think one thing that we predicted was that Hadoop certainly would change the game, and that did. We also predicted that there wouldn't be a Red Hat for Hadoop, that was a production. The other prediction was is that we said Hadoop won't kill data warehouses, it didn't, and then data lakes came along. You know my position on data lakes. >> Yeah. >> I've always hated the term. I always liked data ocean because I think it was much more fluidity of the data, so I think we got that one right and data lakes still doesn't look like it's going to be panning out well. I mean, most people that deploy data lakes, it's really either not a core thing or as part of something else and it's turning into a data swamp, so I think the data lake piece is not panning out the way it, people thought it would be. I think one thing we did get right, also, is that data would be the center of the value proposition, and it continues and remains to be, and I think we're seeing that now, and we said data's the development kit back in 2010 when we said data's going to be part of programming. >> Some of the other things, our early data, and we went out and we talked to a lot of practitioners who are the, it was hard to find in the early days. They were just a select few, I mean, other than inside of Google and Yahoo! But what they told us is that things like SQL and the enterprise data warehouse were key components on their big data strategy, so to your point, you know, it wasn't going to kill the EDW, but it was going to surround it. The other thing we called was cloud. Four years ago our data showed clearly that much of this work, the modeling, the big data wrangling, et cetera, was being done in the cloud, and Cloudera, Hortonworks, and MapR, none of them at the time really had a cloud strategy. Today that's all they're talking about is cloud and hybrid cloud. >> Well, it's interesting, I think it was like four years ago, I think, Dave, when we actually were riffing on the notion of, you know, Cloudera's name. It's called Cloudera, you know. If you spell it out, in Cloudera we're in a cloud era, and I think we were very aggressive at that point. I think Amr Awadallah even made a comment on Twitter. He was like, "I don't understand "where you guys are coming from." We were actually saying at the time that Cloudera should actually leverage more cloud at that time, and they didn't. They stayed on their IPO track and they had to because they had everything betted on Impala and this data model that they had and being the business model, and then they went public, but I think clearly cloud is now part of Cloudera's story, and I think that's a good call, and it's not too late for them. It never was too late, but you know, Cloudera has executed. I mean, if you look at what's happened with Cloudera, they were the only game in town. When we started theCUBE we were in their office, as most people know in this industry, that we were there with Cloudera when they had like 17 employees. I thought Cloudera was going to run the table, but then what happened was Hortonworks came out of the Yahoo! That, I think, changed the game and I think in that competitive battle between Hortonworks and Cloudera, in my opinion, changed the industry, because if Hortonworks did not come out of Yahoo! Cloudera would've had an uncontested run. I think the landscape of the ecosystem would look completely different had Hortonworks not competed, because you think about, Dave, they had that competitive battle for years. The Hortonworks-Cloudera battle, and I think it changed the industry. I think it couldn't been a different outcome. If Hortonworks wasn't there, I think Cloudera probably would've taken Hadoop and making it so much more, and I think they wouldn't gotten more done. >> Yeah, and I think the other point we have to make here is complexity really hurt the Hadoop ecosystem, and it was just bespoke, new projects coming out all the time, and you had Cloudera, Hortonworks, and maybe to a lesser extent MapR, doing a lot of the heavy lifting, particularly, you know, Hortonworks and Cloudera. They had to invest a lot of their R&D in making these systems work and integrating them, and you know, complexity just really broke the back of the Hadoop ecosystem, and so then Spark came in, everybody said, "Oh, Spark's going to basically replace Hadoop." You know, yes and no, the people who got Hadoop right, you know, embraced it and they still use it. Spark definitely simplified things, but now the conversation has turned to AI, John. So, I got to ask you, I'm going to use your line on you in kind of the ask-me-anything segment here. AI, is it same wine, new bottle, or is it really substantively different in your opinion? >> I think it's substantively different. I don't think it's the same wine in a new bottle. I'll tell you... Well, it's kind of, it's like the bad wine... (laughs) Is going to be kind of blended in with the good wine, which is now AI. If you look at this industry, the big data industry, if you look at what O'Reilly did with this conference. I think O'Reilly really has not done a good job with the conference of big data. I think they blew it, I think that they made it a, you know, monetization, closed system when the big data business could've been all about AI in a much deeper way. I think AI is subordinate to cloud, and you mentioned cloud earlier. If you look at all the action within the AI segment, Diane Greene talking about it at Google Next, Amazon, AI is a software layer substrate that will be underpinned by the cloud. Cloud will drive more action, you need more compute, that drives more data, more data drives the machine learning, machine learning drives the AI, so I think AI is always going to be dependent upon cloud ends or some sort of high compute resource base, and all the cloud analytics are feeding into these AI models, so I think cloud takes over AI, no doubt, and I think this whole ecosystem of big data gets subsumed under either an AWS, VMworld, Google, and Microsoft Cloud show, and then also I think specialization around data science is going to go off on its own. So, I think you're going to see the breakup of the big data industry as we know it today. Strata Hadoop, Strata Data Conference, that thing's going to crumble into multiple, fractured ecosystems. >> It's already starting to be forked. I think the other thing I want to say about Hadoop is that it actually brought such great awareness to the notion of data, putting data at the core of your company, data and data value, the ability to understand how data at least contributes to the monetization of your company. AI would not be possible without the data. Right, and we've talked about this before. You call it the innovation sandwich. The innovation sandwich, last decade, last three decades, has been Moore's law. The innovation sandwich going forward is data, machine intelligence applied to that data, and cloud for scale, and that's the sandwich of innovation over the next 10 to 20 years. >> Yeah, and I think data is everywhere, so this idea of being a categorical industry segment is a little bit off, I mean, although I know data warehouse is kind of its own category and you're seeing that, but I don't think it's like a Magic Quadrant anymore. Every quadrant has data. >> Mm-hm. >> So, I think data's fundamental, and I think that's why it's going to become a layer within a control plane of either cloud or some other system, I think. I think that's pretty clear, there's no, like, one. You can't buy big data, you can't buy AI. I think you can have AI, you know, things like TensorFlow, but it's going to be a completely... Every layer of the stack is going to be impacted by AI and data. >> And I think the big players are going to infuse their applications and their databases with machine intelligence. You're going to see this, you're certainly, you know, seeing it with IBM, the sort of Watson heavy lift. Clearly Google, Amazon, you know, Facebook, Alibaba, and Microsoft, they're infusing AI throughout their entire set of cloud services and applications and infrastructure, and I think that's good news for the practitioners. People aren't... Most companies aren't going to build their own AI, they're going to buy AI, and that's how they close the gap between the sort of data haves and the data have-nots, and again, I want to emphasize that the fundamental difference, to me anyway, is having data at the core. If you look at the top five companies in terms of market value, US companies, Facebook maybe not so much anymore because of the fake news, though Facebook will be back with it's two billion users, but Apple, Google, Facebook, Amazon, who am I... And Microsoft, those five have put data at the core and they're the most valuable companies in the stock market from a market cap standpoint, why? Because it's a recognition that that intangible value of the data is actually quite valuable, and even though banks and financial institutions are data companies, their data lives in silos. So, these five have put data at the center, surrounded it with human expertise, as opposed to having humans at the center and having data all over the place. So, how do they, how do these companies close the gap? How do the companies in the flyover states close the gap? The way they close the gap, in my view, is they buy technologies that have AI infused in it, and I think the last thing I'll say is I see cloud as the substrate, and AI, and blockchain and other services, as the automation layer on top of it. I think that's going to be the big tailwind for innovation over the next decade. >> Yeah, and obviously the theme of machine learning drives a lot of the conversations here, and that's essentially never going to go away. Machine learning is the core of AI, and I would argue that AI truly doesn't even exist yet. It's machine learning really driving the value, but to put a validation on the fact that cloud is going to be driving AI business is some of the terms in popular conversations we're hearing here in New York around this event and topic, CUBENYC and Strata Conference, is you're hearing Kubernetes and blockchain, and you know, these automation, AI operation kind of conversations. That's an IT conversation, (chuckles) so you know, that's interesting. You've got IT, really, with storage. You've got to store the data, so you can't not talk about workloads and how the data moves with workloads, so you're starting to see data and workloads kind of be tossed in the same conversation, that's a cloud conversation. That is all about multi-cloud. That's why you're seeing Kubernetes, a term I never thought I would be saying at a big data show, but Kubernetes is going to be key for moving workloads around, of which there's data involved. (chuckles) Instrumenting the workloads, data inside the workloads, data driving data. This is where AI and machine learning's going to play, so again, cloud subsumes AI, that's the story, and I think that's going to be the big trend. >> Well, and I think you're right, now. I mean, that's why you're hearing the messaging of hybrid cloud and from the big distro vendors, and the other thing is you're hearing from a lot of the no-SQL database guys, they're bringing ACID compliance, they're bringing enterprise-grade capability, so you're seeing the world is hybrid. You're seeing those two worlds come together, so... >> Their worlds, it's getting leveled in the playing field out there. It's all about enterprise, B2B, AI, cloud, and data. That's theCUBE bringing you the data here. New York City, CUBENYC, that's the hashtag. Stay with us for more coverage live in New York after this short break. (techy music)

Published Date : Sep 12 2018

SUMMARY :

Brought to you by SiliconANGLE Media for the creation at the Hadoop big data ecosystem. and some of the things we learned then some of the things that are profound about Hadoop, We were talking last night about, you know, but some of the things that we talked about back then, You said, "Well, the only Red Hat of big data might be being a reduction, the ROI was a reduction I mean, first of all, I think we got and I think we're seeing that now, and the enterprise data warehouse were key components and I think we were very aggressive at that point. Yeah, and I think the other point and all the cloud analytics are and cloud for scale, and that's the sandwich Yeah, and I think data is everywhere, and I think that's why it's going to become I think that's going to be the big tailwind and I think that's going to be the big trend. and the other thing is you're hearing New York City, CUBENYC, that's the hashtag.

ENTITIES

Entity	Category	Confidence
Apple	ORGANIZATION	0.99+
Microsoft	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
Diane Greene	PERSON	0.99+
Google	ORGANIZATION	0.99+
Facebook	ORGANIZATION	0.99+
John	PERSON	0.99+
Alibaba	ORGANIZATION	0.99+
Dave	PERSON	0.99+
Dave Vellante	PERSON	0.99+
Jeff Hammerbacher	PERSON	0.99+
$30	QUANTITY	0.99+
New York	LOCATION	0.99+
2010	DATE	0.99+
IBM	ORGANIZATION	0.99+
Doug Cutting	PERSON	0.99+
Mike Olson	PERSON	0.99+
Hortonworks	ORGANIZATION	0.99+
Dallas	LOCATION	0.99+
O'Reilly	ORGANIZATION	0.99+
Yahoo	ORGANIZATION	0.99+
Cloudera	ORGANIZATION	0.99+
five	QUANTITY	0.99+
AWS	ORGANIZATION	0.99+
Abi Mehda	PERSON	0.99+
John Furrier	PERSON	0.99+
New York City	LOCATION	0.99+
$2.5 billion	QUANTITY	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
MapR	ORGANIZATION	0.99+
Amr Awadallah	PERSON	0.99+
$40 billion	QUANTITY	0.99+
17 employees	QUANTITY	0.99+
VMworld	ORGANIZATION	0.99+
Today	DATE	0.99+
Impala	ORGANIZATION	0.99+
Nine years	QUANTITY	0.99+
four years ago	DATE	0.98+
last night	DATE	0.98+
last decade	DATE	0.98+
Strata Data Conference	EVENT	0.98+
Strata Conference	EVENT	0.98+
Hadoop Summit	EVENT	0.98+
ninth year	QUANTITY	0.98+
Four years ago	DATE	0.98+
two worlds	QUANTITY	0.97+
five companies	QUANTITY	0.97+
today	DATE	0.97+
Strata Hadoop	EVENT	0.97+
Hadoop World	EVENT	0.96+
CUBE	ORGANIZATION	0.96+
Google Next	ORGANIZATION	0.95+
Twitter	ORGANIZATION	0.95+
this year	DATE	0.95+
Spark	ORGANIZATION	0.95+
US	LOCATION	0.94+
CUBENYC	EVENT	0.94+
Strata O'Reilly	ORGANIZATION	0.93+
next decade	DATE	0.93+

JR Rivers, Cumulus Network | OpenStack Summit 2018

(bright music) >> Hi, I'm Peter Burris. Welcome to another CUBE Conversation from our beautiful studios here in Palo Alto, California. As we do with every CUBE Conversation, we come up with a great topic and we find someone who really understands it so they can talk about it. We capture them for you so you can learn something about some of the new trends and changes in the industry, and we're doing that today too. The topic that we're talking about is, how do you do a better job of mapping the costs that are being generated by the cloud. Well that information's coming out of cloud suppliers related to what you're using with the actual business activities that generate the differential capabilities that customers are looking for. That's a tough, tough challenge, and to understand that better, we're talking with J.R. Storment, who's a co-founder of Cloudability. J.R, welcome to the CUBE. >> Thanks Peter, good to be here. >> So let's talk about... First, who are you? >> Yeah, so I'm co-founder of Cloudability, and Cloudability is focused around improving the unit economics of cloud spend, so our customers tend to be those who are spending large amount in AWS or Azure or GCP. And we take their billing data, their utilization data, various meta data about their business and do machine learning and data science on top of it to help them get better visibility into where that spend is going, how their using it, but more importantly to give them some controls around how they want to optimize. And optimize doesn't necessarily mean save money in a cloud world. Cause most companies who are moving into cloud very heavily are doing that for the innovation, for the speed, so they can deliver better data faster. But it's really about fine-tuning the conversation. Say, "Okay, here we want to save money. "Here we want to move faster. "Here we want to focus on quality." And really providing a way for the various groups that aren't normally talking, the finance teams with the engineering teams with the procurement teams, all these groups to come together, and be able to take executive input to say, "Okay, how do we want to operate? "And how do we want to improve those unit economics as we go?" >> Well, I want to start with just a quick comment on this notion of unit economics. Cause when people historically hear the notion of unit economics, they think of increasing scale so the average cost per unit goes down. But I think you're talking about more than that, right? Aren't you really also talking about a mapping of what spend is generating to the business activities that actually generate value and ensuring that you get the differential or the optimized unit economics or unit cost? >> Yeah, so the mapping is actually really interestingly challenging in cloud. It's hard enough in traditional IT. If you look at somebody like AWS, they have 200,000 SKUs, different products you can buy. And they now bill at a second level resolution. So what this means is you've got all these engineers out there using cloud in a very good way to move quickly, innovate, include more features. And they kind of have an unlimited credit card that they can go spend on as quickly as they need. And they never see the statements. They never see the bills. And the other side, you've got finance teams, procurement teams who've sort of lost control of traditionally the power of the PO that they have to rein that in. And they're struggling just to understand what is the spend. And then to the mapping question, how do I allocate these hundreds of millions of charges that I have this month into cost centers and business units, and getting that sorted in a world where engineers are focused on moving fast. They're not tagging things based on cost center typically. So once you get that sort of mapping aspect sorted to the next point you brought is is then bring in the business value. So how do we start to relate that back. There's a concept a lot of you know, IT has been a cost center, and now it's actual driver of value in a world where businesses are increasingly delivering their value through software. So we need to start tying the spending, mapping of the business and then tying that to the value delivered. A great example of this, I was sitting last week with one of the largest cloud spenders in the world. And they're up in, you know, nine figures with their primary vendor. And in the conversation with the executives, we realized that nobody was looking at both side of that equation. You had the finance people who were saying, "Hey, we're tracking the costs, "and we're figuring out what's happening there." And then you have the revenue generators looking at the money coming in, you know the cloud people with that. But there wasn't this centralized view to say, "Alright, we want to have a conversation about what value are we getting out of this spend." And the question that always comes up with that is are we spending the right amount? I don't know. >> Let me build on that, because IT is historically, and this is one of the things that we've been doing over the last few years, IT has historically done things at a project level. Alright, so we had waterfall development. We tried to change that with Agile. We had buy the hardware upfront and then deploy the application on it, cloud changes that. So this project orientation has led to a set of decisions about finance at the moment that the business decides to do it. We've changed the practices that we use at a development level. We've changed the practices that we use at an asset level. Is it now time to change the practices that we use at a finance level? Is that really kind of what's going on here? >> It is, the project analogy is good. Because what we're seeing is they're shifting from a project basis to a product basis, and products that deliver value. Increasingly if you think about the change that's happened with DevOps in the scene and cloud, companies are delivering more of their value through software, and they're not just using IT for internal projects, right. It's actually the driver of business. It's how we interact with airlines and banks and all these things. So that's the shift to say, okay, now we gotten good at DevOps moving fast, and we've gotten good at deploying and building better data stores. Now we need to bring in this new discipline. And the discipline is what the market is calling FinOps, which essentially combining financial operations. You're essentially combining-- >> Applied to a technology world. >> Applied specifically to a cloud world. And it can only really happen in cloud. It can't happen in data centers. Because data centers have fixed spending, right? You have to wait to get resources. Once you make the investment, it's a sum cost. There's months of lead time. Cloud introduced the removal of constraints, which means you can get whatever you want as quickly as you want. And DevOps meant it's all automated. So instead of your collection of 60 servers, you've got thousands that are coming up and down all the time. So what you now have to do is bring in all these groups. Engineers have to think about cost as a new efficiency metric. They have to think about the impact on their business that this code, this confirmation template they just wrote is going to have. And the finance teams have to shift from this mode of "I'm going to report retroactively at a quarterly granularity, "60 days after it happened and block investment" to be "I'm going to partner with these teams. "Report in a real-time fashion. "Give them the visibility and help forecast. "Actually bring them together and make better business decisions about the cloud spend." >> So cloud has allowed development to alter practically, I mean Agile has been around for a long time, pre-dates the cloud, but it became practical and almost demanded as a consequence of what you could do with cloud. So cloud changed development through Agile. It changed infrastructure management through DevOps. Where now you're deploying software infrastructure as code. And what you're saying is the third leg of that stool, cloud is now changing how you do financial management of technology, financial management of IT. And we're calling that FinOps. >> You can't really have FinOps without cloud or without DevOps, and if you have the two together, you ultimately need this new set of, it's a new operating model. The reason this has sort of come to a head of late is if you look at going to the Amazon re:Invent conferences a few years back, it was like well how much is cloud going to be a thing. And okay, cloud's not going to be a thing. When's it going to happen? Now it's about the how and how do we do this better. Cloud is hitting sort of material spend levels now at big organizations. You always see the cloud projections where it's going, I think it's now 360 billion in the next few years. And we're seeing CFOs at public companies look to say, "Okay, it's not my biggest line item yet. "But it's the most variable and fastest growing "cogs expense, so it's actually "starting to affect our margins. "We need a new set of processes to actually manage this." So one of the things that's coming to market is this new group called the FinOps Foundation, which is a non-profit trade association that initially has a few dozen of some of the largest cloud spenders in the world. There's the Spotifys, the Laciens, the Nationwides, the Autodesks. And they've all come together as a set of best practice practitioners to start to clarify this into something that can be scaled out in organizations. So that group is going to be putting out a user conference around this area. There's a new O'Reilly book that's coming out the end of the year that's going to be sort of the treatise and all this stuff pulled together. Because what we found and you know me, as in Cloudability in the last eight years, we bring in technology and platform to show the recommendations of visibility and how to do this, but the real challenge companies run into is they don't have the internal expertise. Their finance teams understand what they need to. The engineers don't. And so they came to us last year saying, "Can you help figure out the processes? "Can you educate us?" And that's really where this FinOps Foundation has grown out of, of bringing together those people to define those processes. >> So the impact of cloud on each of these different groups, the development group, on the infrastructure team, and now on the finance team. The developer groups, some of them resisted it. But generally speaking, it's gone okay. And eventually tooling from a variety of different players came along that made it easy to enact best practices in software development through an Agile mechanism. In the last few years after significant battles within infrastructure teams about whether or not they were going to use software as code. We've seen new products, new tooling that has facilitate the adoption of those practices. What kind of tooling are we going to see introduced that facilitates FinOps, so that finance teams, procurement teams move from a project orientation to a strategic management of a resource orientation? >> I mean I think the first is on the engineering side is seeing cost become a first class citizen of an efficiency metric that they need to look at. So you know in their build processes baked in the CICD, looking to see am I properly sizing my compute request for the workload that it needs. There's some research that just came out showing that, I think it's 80% of the market is not using the best discounting options that cloud providers offer. You hear these horror stories. Cloud's too expensive, we overspend. That's not actually a problem with the cloud provider. That's a problem with the enterprises not using the tools that offer the discounts, the reservances, the infrequent access. >> Caveat emptor. >> Exactly, so I think at the end of the day, the first step in this is getting those checks in place to say, "Are we using the things that help drive the right cost for our needs?" And on the other side of that, the finance teams really changing the way that they are interacting with the technology teams. Becoming partners, becoming advocates in this versus a passive, retroactive reporter down the line. And this enables these sort of micro-optimization discussions that can happen where data center world, we bought it at some cost, it's sitting there, cloud world, we can make decisions today that impact the business tomorrow. >> So let me make sure I got this. So I have a client who I was having a conversation with him. They told me that their Amazon, their AWS bill, is 87 gigabytes monthly. Not some 87 pages. That's 87 gigabytes. So we bring this 87 gigabytes in, and it's a story about what I consume out of Amazon. It's not a story of what my business utilizes to achieve its objectives. So we're now entering into a world where we're trying to introduce that financial visibility into how that spend can be mapped to what the business does. So the finance group can look at a common notion of truth. And the IT group can look at a common notion of truth. Application owners can looks at a common notion of truth. And that's what FinOps is providing. Have I got that right? >> Absolutely, and the 87 gigabytes example is the exact reason why it is FinOps, and not just cloud financial management. You can't have a person with a spreadsheet looking at data and trying to make decisions about it, right? It has to be automated. It's IT finances code. It's got to be baked into the processes. We've seen organizations that have hundreds of millions of individual charges hitting them in a consumption based manner. The other thing that's come in with the FinOps as a core tenet is we're now seeing a decentralization of accountability for that spend. So if you look at the big cloud spenders out there who are maybe spending tens or hundreds of millions a year, some of them have thousands of cloud environments. Gone is the day of we have a centralized group getting to say, "We're going to turn this off, turn this off." We want to give each of those teams the ability to see just their portion of that bill in the right mapped way, as you said, and to be able to take actions on the back of that. So that's changed in the you know, you run it, you maintain it, you understand what's shut down. What has sort of come back to the old centralized model is this notion, and this is where procurement's job has shifted to largely, of we do still want to centralize the rate reduction. So engineers, you go use less, right? Essentially, finance teams, procurement work together with the cloud vendors to get the best possible rates through reserved instances, can be deduced discounts, you know volume discounts, negotiated rates, whatever it is. And they become sort of strategic sourcing. To say you're going to use whatever you're going to use, and you're going to watch that to make sure you're using the right amount with target thresholds. We're going to make sure we get the best rate for it. And that's sort of the two sides of the coin. >> Well, very important, procurement has always been organized on episodic purchases, where the whole point is to bring the price point down. And now we're talking about a continuous service, where you are literally basing your business on capabilities provided by a third party. And that is a very, very, very different relationship. >> It's just in time purchasing. And it's a new supply-chain management process, where you have so many SKU options, and you are making these purchase decisions, sometimes thousands a day, and that impacts everything down the road. >> Excellent. J.R. Storment, co-founder of Cloudability, talking about FinOps and Cloudability's role in helping businesses map their cloud spend to their business activities for better, more optimal views of how they get what they need out of their cloud expenditures. J.R., thank you very much for being on the CUBE. >> Thanks, Peter. >> And once again, I'm Peter Burris. And thanks for listening to this CUBE Conversation. Until next time.

Published Date : May 24 2018

SUMMARY :

and changes in the industry, So let's talk about... are doing that for the so the average cost per unit goes down. And in the conversation that the business decides to do it. So that's the shift to say, And the finance teams have of what you could do with cloud. So that group is going to be putting out and now on the finance team. that offer the discounts, the reservances, And on the other side of that, And the IT group can look So that's changed in the you know, bring the price point down. and that impacts everything down the road. for being on the CUBE. to this CUBE Conversation.

ENTITIES

Entity	Category	Confidence
Peter	PERSON	0.99+
J.R	PERSON	0.99+
Peter Burris	PERSON	0.99+
tens	QUANTITY	0.99+
J.R. Storment	PERSON	0.99+
80%	QUANTITY	0.99+
Amazon	ORGANIZATION	0.99+
60 servers	QUANTITY	0.99+
AWS	ORGANIZATION	0.99+
Cloudability	ORGANIZATION	0.99+
J.R.	PERSON	0.99+
FinOps Foundation	ORGANIZATION	0.99+
last week	DATE	0.99+
360 billion	QUANTITY	0.99+
Laciens	ORGANIZATION	0.99+
200,000 SKUs	QUANTITY	0.99+
87 gigabytes	QUANTITY	0.99+
last year	DATE	0.99+
first step	QUANTITY	0.99+
Autodesks	ORGANIZATION	0.99+
thousands	QUANTITY	0.99+
First	QUANTITY	0.99+
DevOps	TITLE	0.99+
two sides	QUANTITY	0.99+
Nationwides	ORGANIZATION	0.99+
two	QUANTITY	0.99+
hundreds	QUANTITY	0.99+
Palo Alto, California	LOCATION	0.99+
87 pages	QUANTITY	0.99+
Spotifys	ORGANIZATION	0.99+
first	QUANTITY	0.99+
each	QUANTITY	0.99+
third leg	QUANTITY	0.98+
tomorrow	DATE	0.98+
60 days	QUANTITY	0.98+
second level	QUANTITY	0.98+
hundreds of millions	QUANTITY	0.97+
today	DATE	0.97+
one	QUANTITY	0.97+
CUBE	ORGANIZATION	0.96+
both side	QUANTITY	0.95+
thousands a day	QUANTITY	0.95+
O'Reilly	ORGANIZATION	0.95+
last eight years	DATE	0.94+
hundreds of millions a year	QUANTITY	0.93+
Agile	TITLE	0.93+
FinOps	ORGANIZATION	0.92+
OpenStack Summit 2018	EVENT	0.89+
FinOps	TITLE	0.89+
nine figures	QUANTITY	0.88+
Amazon re:Invent	EVENT	0.88+
Azure	TITLE	0.86+
this month	DATE	0.85+
cloud	QUANTITY	0.83+
next few years	DATE	0.83+
Cumulus Network	ORGANIZATION	0.82+
CUBE	TITLE	0.81+
Conversation	EVENT	0.8+
CUBE Conversation	EVENT	0.79+
few years back	DATE	0.75+
last few years	DATE	0.74+
GCP	TITLE	0.74+
dozen	QUANTITY	0.7+
last	DATE	0.68+
Agile	ORGANIZATION	0.61+
JR Rivers	ORGANIZATION	0.6+
millions	QUANTITY	0.56+
ne	QUANTITY	0.53+
end	DATE	0.51+

W. Curtis Preston, Druva | AWS re:Invent

>> Announcer: Live from Las Vegas, it's theCUBE, covering AWS Reinvent 2017, presented by AWS, intel, and our ecosystem of partners. >> Well, welcome back. We're live here in Las Vegas at Reinvent. AWS putting on it's annual show, and you might notice the volume's gone up a little bit around here. Well, it's 5 o'clock reception time here, so the show floor has a little different vibe to it, you might say, right now. Justin Warren, John Walls, you kind of feel it, don't you, right now? >> Oh yeah, there is an energy just sort of vibrating around. I can feel the energy lifting as the booze starts to flow some more. >> Energy's a good way to put it. >> Yeah. >> Right. We're with W. Curtis Preston, who is the chief technical architect at Druva, and Curtis, thanks for being with us. >> Glad to be here. >> Do you feel a vibe, too? >> I feel the vibe. I feel the vibe standing out in the big line to get in here. >> Yeah, in here. >> And now we're in here, it's yeah, it's a lot of people. >> By the way, for those of you at home not familiar, you were named, this year, on the Deloitte Technology Fast 500 list, 175. >> Hey. >> Quite an honor. >> I assume you're talking about Druva, not me personally. >> Well, yeah, Druva, not you. Although maybe you did, I don't know. >> Yeah, I don't think. >> But that's quite an accomplishment, though. And quite an honor for the company. I mean, tell a little bit about that, about that process, and what do you think that means? What's that stamp of approval for what you guys are doing? >> Well, I think it's just, you know, like a lot of those lists, it's a recognition of the position that we're holding, right? I mean, Druva historically is really well known for their protection of endpoints and SaaS applications. They're expanding into data center and Cloud protection, but I think they're absolutely recognized as the leader in the protection of endpoints. >> Okay, so characterize the Cloud work you guys are doing. Like you said, this is a new move for you, I mean relatively new move, but the market's driving that way, right? >> Yeah. >> People starting to nod their head, and they're thinking, yep, this is where we need to be. >> Yeah, yep. >> So, what has been your strategy then, as far as facilitating what's no longer a trend, it's a way of life. >> Yeah, so I'd say first off, we are definitely unlike a lot of other players. We are a Cloud first company, in that, it's not a strategy, it's a way of life, so our entire application is built in and for the cloud, and by that I mean that it takes advantage of everything that the Cloud offers, right? And when you look at specifically AWS, a lot of backup software products use, well, they all use some kind of database, some kind of catalog to keep track of all the backups. And all of those catalogs, all of those databases, whether it's SQL Server or Db2 or Oracle, they all have scalability limits. We chose to use DynamoDB, which is an incredibly scalable no-SQL database. It's built and available in Amazon as a service, and then all of our products all run in Amazon, right? And so, we can scale both up and down to meet the requirements of a customer. So if we get a new customer. We had a customer that I can't mention by name, but they're a large company that started out with what we consider a small installation of about 10,000 laptops. And that was nice. And then it went well. And then there was a ransomware scare, and so they said, you know what, we're gonna go ahead and do everything. And so suddenly we needed to do 10 times as many laptops. Well, because of the way AWS is, we could scale both the database, the compute, and the storage all instantly to meet the demands of that client. And then once that's done, scale it back down to get back to a state of normal, right? So, for us the Cloud is sort of the core of who we are, and then the only expansion for us is actually protecting the Cloud. So, we've always used the Cloud as our destination, but now our newest offering, Apollo, actually is designed to protect starting with AWS and then expanding into the rest of the Amazon, well, I should say starting with EC2, and then eventually expanding into the rest of the AWS world. >> All right, so, with the tradition of endpoint protection and... >> You're gonna have to speak up, it's really loud in here. >> It's really loud, I'll make sure I'm yelling. So with the heritage that you've got of backing up endpoints and being able to protect endpoints, and now you're moving to protect Cloud workloads, as you say, you've got this Cloud heritage, but you're now looking at protecting workloads that live in the Cloud, what are some of the things that Druva's bringing from that endpoint knowledge that applies to those Cloud type workloads? >> Well I think the idea is that, you know, one of the things about the Cloud, people sort of view, I think there's steps of people using AWS, right? They sort of experiment, and they try out this and that, but once somebody really understands like we did, the things you can do when you can scale your VMs instantly and limitlessly, and your storage and your compute and your databases, once they go down that route, I think the fact that we, it's not necessarily the history of the endpoint itself, but the infrastructure that we built in order to protect those endpoints is already totally scalable and ready to meet the needs of however big of a workload that you'll put in AWS. >> Yeah, I often like to say that Cloud is a state of mind, so if you've already got that state of mind that I want to run my workloads in a Cloud-like way, well I want to be able to protect them in a Cloud-like way, and it sounds like that's really what you're trying to nail there. >> Yeah, and it's a big, because any like, I can look out and see multiple backup products available, and there's a lot of good backup products here. And any of them can run in the cloud, right? You can create a Linux VM or a Windows VM and install your backup software, but it's not going to magically become more scalable because you're running it in Amazon, right? So, designing the product for Amazon and that scalable way of doing things, that's why we talk about being Cloud native. >> Yeah, so how are you attracting customers who would have traditionally thought of you as an endpoint company. It's like, now you're actually saying, look we have these different offerings. So how are you starting to talk to those different kind of customers. How are you finding them and what is it that you're finding resonates with them as compared to some of the other options that they might have? >> Yeah, so as you probably know, I've been in the backup space now for, quarter of a century... (clearing throat) >> Literally wrote the book. >> Literally wrote the book, right? It's on O'Reilly. (laughing) Oreilly.com. >> We'll give you a plug later. >> Don't worry. >> Yeah, yeah, yeah. >> Literally wrote the book. >> Yeah, one thing I can say, there's a couple of things I can say about backups in general, in the average data center. One is, everybody hates their backup software. Right? Like, nobody likes it because it's so hard, right? It's so hard to configure, and using disc as a mechanism instead of tape as a primary mechanism, it's made things better, but it hasn't really solved it, right? It's still this really difficult to manage. There's this massive amount of infrastructure that has to be put in place to do all of that. And because that's so hard and it's so error prone and you're invisible or you're in trouble. No one cares about the millions of backups you get right, only the one restore you got wrong. And so what that translates into is the other truth, which is nobody wants to be the backup guy, right? >> I mean the way I got my first job in backups 24 years ago was a guy named Ron Rodriguez did not want to be the backup guy. >> Curtis, you're it. >> Yeah, you're it. And I within two months, had my first major failure as the backup guy for a 35 billion dollar company, and I thought I was done, I thought I was fired, like so many other backup people, and somehow just accidentally I ended up staying, and so what happens is, it's so hard. So, to go to your question, well what if it was simple, right? The situation is, the current system's not scalable. You're always buying another media server, you're always buying another tape driver, you're always buying another dedupe box. You know, you're always out of something, right? I remember having to go to my boss and being out of tapes. This is, you know, back when tapes were a thing. And I remember saying, "hey, I'm out of tapes." and she was like, we don't have budget." She's like, "what are our choices?" and I go, well, I can stop the backups. She's like, "that's not funny." I'm like, "that's our choice." Right? >> I have so much capacity here. >> These are our choices, right? And so she gave me the tapes that I needed, right? And so it's not scalable, the current system. You're always in need of some piece. It's also super expensive, right? And it's super hard. So we try to be the opposite of that. We try to be scalable, simple, and you know save people money. Right? I know we have a 4S thing. >> It's right there on the tip of your tongue. >> It's right there on the tip of my tongue. But basically we try to be the opposite of everything that backups are. So the big thing is, it's way easier. Just put a piece of software and magic happens, right? And if you're large enough data center that you need to do what we call seeding, where you have to use sneakernet to get the data to us, we have a system for that. If you have a large enough system where the RTO is not going to be able to be met by a copy that's on the other side of the internet, then we have a caching appliance that goes onsite to provide fast recovery. So it's like it's super simple, way less expensive. And I do mean way less expensive. I've seen some TCOs where we compete against other companies, we're even less expensive than renewing what you have, let alone going and buying. >> John: Replacing. >> And replacing it with something, because that happens all the times. Because people are always swapping their backup software, because the problem has got to be the backup software. Right? And I think in the end, it is, right? But, it's because that core architecture, that core way we've done backups, has essentially stayed the same since before I started. All we did was we changed tape to disc, right? And we introduced dedupe, which was great, but there's this technology that we call dedupe, that is really hard when you do it on the backend. You know, there's a company here who makes a lot of money on selling those appliances, right? Except it's really hard to do that, and so it's really expensive to do that. And then you gotta pay for one here and you gotta pay for one over there. With us, you don't buy that. You just go straight to us, and then because we're in AWS, it's already in three locations, right? And it's already offsite. >> Well Curtis, they said 24 years ago it wasn't gonna last, but it did. You made it, congratulations. >> Thanks. >> We appreciate the time here. >> Thanks. >> Thanks for being here with us on theCUBE and onto 175th. Next year who knows where you're going, right? >> Who knows where we're going. >> Excellent, Curtis Preston, joining us here from Druva. Back with more live from Las Vegas. We are at Reinvent at AWS. Back with more in a bit.

Published Date : Nov 29 2017

SUMMARY :

Announcer: Live from Las Vegas, it's theCUBE, and you might notice the volume's I can feel the energy lifting as the booze and Curtis, thanks for being with us. I feel the vibe standing out in the big line to get in here. By the way, for those of you at home not familiar, Although maybe you did, I don't know. What's that stamp of approval for what you guys are doing? of the position that we're holding, right? Okay, so characterize the Cloud work you guys are doing. People starting to nod their head, it's a way of life. and the storage all instantly to meet the demands All right, so, with the tradition You're gonna have to speak up, and being able to protect endpoints, the things you can do when you can Yeah, I often like to say that Cloud is a state of mind, and that scalable way of doing things, Yeah, so how are you attracting customers Yeah, so as you probably know, It's on O'Reilly. No one cares about the millions of backups you get right, I mean the way I got my first job in backups 24 years ago and so what happens is, it's so hard. And so she gave me the tapes that I needed, right? that you need to do what we call seeding, because the problem has got to be the backup software. but it did. Thanks for being here with us on theCUBE and onto 175th. Back with more live from Las Vegas.

ENTITIES

Entity	Category	Confidence
Curtis	PERSON	0.99+
Ron Rodriguez	PERSON	0.99+
Curtis Preston	PERSON	0.99+
Justin Warren	PERSON	0.99+
AWS	ORGANIZATION	0.99+
Las Vegas	LOCATION	0.99+
Amazon	ORGANIZATION	0.99+
10 times	QUANTITY	0.99+
John Walls	PERSON	0.99+
John	PERSON	0.99+
Druva	ORGANIZATION	0.99+
W. Curtis Preston	PERSON	0.99+
5 o'clock	DATE	0.99+
Windows	TITLE	0.99+
Next year	DATE	0.99+
first job	QUANTITY	0.99+
24 years ago	DATE	0.98+
Db2	TITLE	0.98+
about 10,000 laptops	QUANTITY	0.97+
this year	DATE	0.97+
35 billion dollar	QUANTITY	0.97+
SQL Server	TITLE	0.97+
first	QUANTITY	0.97+
both	QUANTITY	0.97+
three locations	QUANTITY	0.96+
one	QUANTITY	0.95+
One	QUANTITY	0.94+
two months	QUANTITY	0.93+
EC2	TITLE	0.91+
Cloud	TITLE	0.9+
175	QUANTITY	0.88+
O'Reilly	ORGANIZATION	0.88+
175th	QUANTITY	0.88+
quarter of a century	QUANTITY	0.87+
first major	QUANTITY	0.87+
Deloitte Technology	ORGANIZATION	0.87+
one thing	QUANTITY	0.84+
DynamoDB	TITLE	0.83+
millions of backups	QUANTITY	0.79+
theCUBE	ORGANIZATION	0.79+
Apollo	ORGANIZATION	0.78+
first company	QUANTITY	0.77+
Reinvent 2017	EVENT	0.77+
Linux VM	TITLE	0.76+
Druva	TITLE	0.69+
4S	COMMERCIAL_ITEM	0.65+
Druva	PERSON	0.65+
Invent	TITLE	0.65+
Oreilly.com	OTHER	0.64+
Oracle	ORGANIZATION	0.58+
SQL	TITLE	0.5+
500	TITLE	0.45+
Reinvent	LOCATION	0.39+

Valentin Bercovici, PencilDATA | Cube Conversation with John Furrier

(light adventurous music) >> Hello everyone, welcome to theCUBE Studios here in Palo Alto. I'm John Furrier, the co-host of theCUBE, co-founder of SiliconANGLE Media. This is our CUBE Conversation Thought Leader Thursday and I'm here with Val Bercovici, who's the founder and CEO of a new startup called PencilDATA. Val, CUBE alumni, been on many times with NetApp and then a variety of other great startups, but now you're doing your own thing around cryptocurrency, blockchain, enterprise-like technical infrastructure. You've been a CTO, now entrepreneur, founder and CEO of PencilDATA. Congratulations, you're on the crypto wave, this wave is coming. >> I believe it's here. >> It's here. >> Timing couldn't be better. >> So, I interviewed Dr. Jian Wang who's the chairman of Alibaba's technology steering committee, also the founder of Alibaba Cloud, just recently in China. Presented by Intel, plug for Intel there, thanks Intel for supporting theCUBE. He said to me, and I put the clip out on Twitter, natively on the video clip, which was, I asked him about blockchain, you know China, they blocked the ICOs, he said, "Blockchain is fundamental, part of the Internet. "It's as fundamental as TCP/IP was." This is the nuance that is attracting a lot of tier one entrepreneurs. Obviously the money side is hyped up beyond all recognition right now. As Don Klein on our team was saying, "It's melting up in terms of hype." But this really speaks to the transformation of the web, and the Internet now, the web is the Internet, from distributed and decentralized. This is a big sea change. Kind of building on the fundamentals of the internet, formerly called the information superhighway, before the web came along, but the web was designed to withstand nuclear disaster, be resilient, be decentralized. >> It reminds me of Back to the Future in many, many ways, because if you're as old as we are, you remember those DARPA origins of the Internet and exactly that decentralized nature, and we've gone away from that, right? As Tim Berners-Lee brought on the HTTP protocol, we've had web protocols, and as major, the FANG vendors have really dominated their usage of that existing layer of technology we've gone away, we've gone to a very, very centralized approach, which as we're seeing with the tech hearings this week, carries all sorts of risks, it's not just business and legal and political. >> And you're referring to the senate hearings, where Facebook, Google, or Alphabet, and Twitter were in front of the senate committee, you're going to tell them about the Russians, the Russian political thing, but they're bringing up the issue of the role of these mega platforms that have all this data and the problem is that this is not what the users bargained for. I mean, I use Facebook as a free app, I love Facebook, Facebook, we love you, WhatsApp here and there, and Instagram, but you know, my bargain was simple. I'll use your free app and I'll let you use some of my data but now you're making billions, $10 billion quarter, fake news has infiltrated the country, I have a poor user experience every day, it's getting worse and worse, a lot of hate and division. This is not what I bargained for. >> Val: Exactly. >> So the world's kind of revolting against these mega-siloed platforms. >> That's the risk of having such centralized control of the technology. If you remember in the old days when Microsoft's dominance was rising, all you had to do was target Windows as a virus platform and you're able to impact thousands of businesses, even in the early Internet days, within hours. And it's the same thing happening right now, there's a weaponization of these social media platforms and Google's search engine technology and so forth. It's the same side effect now, the centralization of that control is the problem. One of the reasons I love the blockstack technology, and blockchain in general is the ability to decentralize these things right now, and the most passionate thing I care about nowadays is being driven out of Europe, where they have a lot more maturity in terms of handling these new scenarios. >> You mean the tech being driven out of Europe. >> The laws. >> The laws, okay. >> Being driven out of Europe. >> Be specific, we'd like an example. >> The major deadline that's coming up in May 25th of 2018 is GDPR, General Data Protection Regulation, where European citizens now in any company, American or otherwise, catering to European citizens, has to respond to things like the right to be forgotten request. You've got 24 hours, as a global corporation with European operations, to respond to European citizens', EU citizens', right to be forgotten request, where all the personally identifiable information, the PII, has to be removed and an audit trail, proving it's been removed, has to be gone from two, three hundred internal systems within 24 hours. And this has teeth by the way, it's not like the $2.7 billion fine that Google just flipped away casually, this has up to 4% of your global profits per incident where you don't meet that requirement. >> Well you bring up a good point, the GDPR is a good one, it has teeth and it's kind of in the weeds with the folks who might not know that regulation, but really it's about the privacy and the rights of the individual. But coming back to Facebook, to connect another dot is, what we're seeing with Facebook, Twitter, and Alphabet with the senate hearings is, and this is why the industry and the media is crumbling, publications are dying, the newspapers, the media's changing, is because knowing your customer is a really important thing. The people who want to be served need to have a closed loop with the publication, and these platforms are bogarting all the data, and so the right of the customer, the users are suffering, and that's what people are generally talking about. You know, personally, a guy can rent a truck and go mow people down in Manhattan, we should know who these people are, like the neighbors, so I think there's going to be a trend towards knowing who your neighbor is, knowing who the customers are, at a level that's not scary privacy violation, but we're going to know who the crazies are, we're going to know what's going on and then that's kind of out there, that's kind of my general feeling. But now, getting back to the impact. GDPR, these big mega platforms where the users are at the center of the value proposition, really comes down to the shift in user expectations around a decentralized Internet. That means agile goes to a whole other level. If I'm a user and I say, "Hey Facebook, "delete my digital exhaust or digital footprints "from Facebook over the past 10 years." I mean, that's hard to do. >> That's hard for them. >> That's not, technically is a really serious problem. >> And it's actually not just a technology challenge, I always love to go back to Conway's Law in these discussions, the org chart, you know, how information, infrastructure is budgeted for, and managed through various different departments within any large enterprise, data-savvy or not, is a challenge, as is coordinating these efforts, actually going beyond the talking phase, towards implementing a master data model. Those are the main challenges right now, and it's a movement that I believe now has political strength to actually migrate across the pond. Over here as well there's a groundswell movement called Digital Sovereignty as a response to GDPR in Europe, where people are realizing that they have the right to be sovereign over their data, their digital exhaust, their digital footprints online and that's a two-way street. You want and demand control over your data, but on the other hand your identity, which you control, has to be authentic as opposed to a fake identity, and your reputation has to be out there as well. >> These signals and these trends you were just referring to, to me are just like little tremors of the tectonic plates that are going to be changing, because if you look at the major shift in technology, let's take blockchain for instance, and look at the impact of a decentralized internet, now global, immutability with the ability now for more agile capability and not just permanent, "I want to erase things" that you're talking about, but three, the younger generation, if we look at what the young kids are doing, I have four kids, my oldest is 22, it's a gaming culture, right? It's a gaming culture, they're online all the time. They're not old like us, my son's like, "Dad, Google Search is for old people." I mean, that's a general sentiment, over-categorizing, but a combination of the new user experience, this younger generation, entrepreneurs and users, and these tremors we're seeing in the marketplace, signaling that, "hey Facebook, you might be too big for your britches," or, "hey Twitter, you got a bot problem, "hey all you gamers using Twitch," this is now a signal, where is it leading to? And where does blockchain in particular impact it? Because this is kind of where everything's converging to. >> So what I'd like to say right now is, you've got Marc Andreessen's premise that software is eating the world. If you extend that, data is feeding it, blockchain is valuing it, and it's AI that's automating it. So in my mind, particularly in my experience earlier this year in the AI industry, you realize that AI today really boils down to machine learning, which in itself boils down to deep learning, which boils down to data, your access to data. Professor Andrew Wang did this at the recent O'Reilly conference up in the city, he got up and lectured as the keynote instead of sharing slides and his number one, two, and three advice to everyone in the audience was, get the right datasets to train your model. If you don't have that you don't have a differentiated business, and that's what inspired PencilDATA, is my encountering of the cold start AI problem where the IP's in a public domain, public datasets are ubiquitous which is fantastic for academics, but as a business you can't differentiate unless you have access to the right datasets to train your models more specifically. >> Okay, as the founder and CEO of PencilDATA, that's your new startup, let's get into some of the reasons why you're starting it. What problem are you attacking? Obviously a pencil, I can see pencil and you erase things, it's got data... >> The internet is no longer written in ink, that's the premise. Now with Pencil you can erase some data. >> Well blockchain is immutable, so this is conflicting in my mind. Help me kind of rationalize this. The benefit of blockchain is everything's permanent, if you're on-chain as they say. >> Exactly. >> If you're off-chain, you could do some things. Is that kind of what we're getting at? >> We're mixing the best of both. So our premise is that again, whether you're an organization or an individual, you need to have, to survive in a new digital economy, control over your data. The blockchain part of it is the visibility side. If you don't know who's doing what to your data, you're far less likely to share it. And once you know who's doing what to your data, in an immutable blockchain, with a detailed audit trail, with strong authentication, of literally who's doing what to your data, gives you that visibility. Then you do what modern asset managers do. You can't really value an asset until you fully control it. And our premise is, you can't control something until you can take it back. So the notion of PencilDATA is the ability to go on-chain for the visibility and off-chain for managing data in encrypted containers, and if a data owner or publisher doesn't like how the subscriber's consuming their data, they have the power to revoke all downloaded copies. >> So is this kind of like a shadow blockchain model? I'm trying to find a mental model because I remember the old days back, I was breaking into the industry in the late '80s, early '90s, WORM drives, write once, read many. And you write it once, it's a laser, it was optical drives at the time. Also, demilitarized zones in networking was an area where there was a safe harbor kind of thing, where people could play around. What metaphor, what mental model can people take away from some of the things that you're trying to solve? Is it like a DMZ, is it like a-- >> The implementation's a lot like a DMZ and the business challenge and opportunity is that there's a lot of tension between protecting data, because we have an epidemic of data breaches right now, I think you're foolish if you're assuming that you haven't been breached yet but you might be, because everyone has been breached, personally and organizationally, so we have to deal with the rising need to protect data more and more. But at the same time, you can't stay in business if you don't optimize the monetization of the data you have. And so PencilDATA walks that fine line between letting you do both, letting you not just protect infrastructure, that's a whole other industry that we're not involved in, but literally protect data at the data level. If you look up terms like crypto anchor you'll see some of the technologies we're taking advantage of there. But being able to monetize data by unlocking all that latent value of data hidden behind firewalls. If you use a physics analogy of potential and kinetic energy, applied to data behind firewalls, there's hundreds of billions of dollars of value in latent data basically, potential data hiding behind firewalls, and when you can safely share it, give the owners control they've never had before, then you expose the value of that data for the first time. >> Alright, so let's take us through where you're at. Obviously super exciting, you're leveraging the blockchain and you've got an ICO, initial coin offering coming up but you're not just doing that for the sake of doing, there's a lot of scams out there, you're taking a little bit more of a pragmatic approach. Give us the status because you're the founder and CEO, what's the makeup of the team, how big are you guys, what are you guys looking for, obviously you're looking for team members most likely. >> We're looking for developers obviously. >> Where in the process are you? >> We are a two-month-old company. We're at the seed stage. And we've actually assembled a world-class team. You hear that a lot, but I'm really, really proud of the team members we have right now. >> World-class, are they from around the world and then they have class? Define world-class. >> They're worldly, like myself, I travel a lot. (laughter) An example, my chief privacy officer is Sheila Fitzpatrick, she's a worldwide recognized leader in data privacy, she's on many, many privacy boards in the US and EU and so forth, and she now is traveling nonstop lecturing on GDPR, itself specifically. She's one of those recognized-- >> Should you see yourself as a solution for GDPR, because that's, again, it does have teeth, I'll just say that we've been reporting on this through Wikibon, our research team as well as theCUBE, it comes up all the time and there's heavy fines associated with it, so it's not like- >> GDPR is the perfect use case because on the one hand, we have that audit trail that proves what you're doing with data. On the other hand we have a kill switch, that revocable use clause for data where you can literally comply with GDPR in minutes or seconds, as opposed to take a full 24 hours to scour database and delete selected records. >> Alright, so what about the product? Give us an example of the product. Will you be, first of all that's right around the corner, it's next year. >> Val: Yeah. >> I think it was a March or April's timeframe, I don't have the exact date but it's pretty soon. >> Public beta before the end of this year, version 1.0 first of second quarter next year. >> For you guys, PencilDATA. >> Yes. >> Clients, are you working with anyone right now, you have a handful? >> So we've actually got really interesting distribution partnerships that we're not in a position to announce right now but the top-tier brand name enterprise cloud vendors, both on the SaaS and infrastructure and database side, they're lining up to work with us. Because we're enabling amazing use cases in healthcare and life sciences, the ability to selectively share patient data with insurers, with healthcare providers, clinical trials now to share more information through differential privacy and collectively have more data to be processed and analyzed. Use cases are just off the charts. >> Well you know we go to all the big data shows, we're horizontally scaled on the event site circuit, but this is the number one thing that comes up, I want to move from batch marketing, batch process, batch business to real-time business, speed is essential, but it's always been a conflict between, how do I enable data to move really fast and be available for applications but protecting the privacy. >> Yeah. >> Do you solve that problem, is that something that you see yourselves solving? >> We aren't necessarily innovating on speed, of data movement, it's going to be a SaaS service. >> So it's availability model. >> It's availability of data that's really never been shared before and I think that's the key here, is we know there's a lot of value locked up behind corporate firewalls. The irony is, we don't even have to sell this outside firewalls initially, when you go to any medium-to-large size enterprise that has more than one site or more than one department, Sales doesn't trust Marketing and vice versa, Engineering doesn't trust Customer Support, neither of the four of them trust each other, so we're actually going to enable more data shared within an enterprise at first. >> So that's a starting point for you guys. >> That's a starting point, that's the easiest low-hanging fruit sale we have. >> Well PencilDATA, it's great stuff, Val, congratulations on that startup. I mean, you've got a world-class management team, and this kind of brings up a point that I've been banging on theCUBE pretty much every time I go out I'll talk about blockchain and ICO because you know, theCUBE is a very decentralized audience and that's a value that we're looking at as well with blockchain. I've got to ask you the personal question, from your own personal perspective, experience, executive and CTO, why is blockchain attracting so many A players? Because you're seeing a lot of what I call A players, entrepreneurs, technical geeks, really jumping into this because they can see it, they can smell the opportunity, and also, it also attracts the scammers as well, but specifically, why are these A players coming in? Is it, what are you hearing, what's the general vibe, what's the anecdotal reason? >> So as you said earlier on, it's a fundamental evolution of the core internet as a technology, as fundamental as HTTP and web was on top of TCP/IP back 20 years ago, but it's got that rare combination of not only being a technical innovation that empowers new use cases on the web, on the internet, it's also got immediate, amazing business applications as a store of value initially, as an actual valuation of various business processes, or datasets in my case, as an ability to exchange that value so transparently, so, in such a friction-less liquid manner, those are some of the amazing innovations it brings to the table and I think the most important thing is not to think of this as being able to do digital transformation or faster analog, it's about completely reimagining the exchange of value, measurement of value, and new kinds of businesses that just weren't possible before. >> And at all points of the stack, not the low levels and at the application level, the business logic, and to the geek side, right? >> Absolutely. >> You agree. I mean, that's great and as you know, theCUBE is looking at a blockchain ICO on down the horizon so keep an eye out for that, CUBEcoins could be in everyone's future, so we're super excited like you. >> I'm looking forward to your presale, just like I'm looking forward to mine. (laughing) >> Well, we'll see. But the bottom line is that this is what the reality is, you know, reimagining the applications is what people are thinking and I think people should beware of the scams out there, and then final question I want to ask you is, obviously we're both in the community together, with our teams. Share your perspective on the ecosystem, because obviously decentralization will change the nature of traditional ecosystems. >> Very much so. >> What's your vision on how the ecosystem will evolve, and how big is it now relative to these early markets? >> We're actually starting to enter the middle innings of the cloud game, if you will, we're seeing a very good maturity, a good diversification of profitable earnings and outcomes for the major cloud players, so I think we've gone well down the cloud path so far. But the decentralized world is in its infancy. It's embryonic right now. And I've always been a proponent of the multi-cloud environment and a multi-cloud world, and decentralization fundamentally is based on and depends on a multi-cloud, not just multi-region, but multi-data-center-in-a-closet scenario as well, to be able to actually have a democratic model for determining where the value is, where the value isn't, blockchain node style. And that is incredibly exciting to me, because that really cements this rebalancing of the pendulum between core and edge in terms of where processing and value happens. >> Yeah, and value exchange obviously now, markup links are becoming the du jour way to exchange value, users are in control, infrastructure equilibrium is interesting. Great stuff. And I'll say, perfect storm for innovation. The waves are coming. (laughing) >> You know, one thing I've learned over the years is, the innovation, change never stops. There's always an opportunity to innovate, and that's what I love about this movement. >> Blockchain, ICO, PencilDATA, check 'em out, Val Bercovici, founder and CEO, great friend of theCUBE, also really strong CTO, check these guys out. This wave of innovation around blockchain ICOs and infrastructure, reimagining, the future is here upon us at theCUBE, be right back with more, thanks for watching. (electronic music)

Published Date : Nov 3 2017

SUMMARY :

I'm John Furrier, the co-host of theCUBE, Kind of building on the fundamentals of the internet, As Tim Berners-Lee brought on the HTTP protocol, the issue of the role of these mega platforms So the world's kind of revolting and blockchain in general is the ability the PII, has to be removed and an audit trail, and it's kind of in the weeds with but on the other hand your identity, which you control, and look at the impact of a decentralized internet, get the right datasets to train your model. some of the reasons why you're starting it. that's the premise. The benefit of blockchain is everything's permanent, Is that kind of what we're getting at? So the notion of PencilDATA is the ability to go from some of the things that you're trying to solve? But at the same time, you can't stay in business what are you guys looking for, of the team members we have right now. and then they have class? in the US and EU and so forth, and she now is traveling because on the one hand, we have that audit trail first of all that's right around the corner, it's next year. I don't have the exact date but it's pretty soon. Public beta before the end of this year, the ability to selectively share patient data available for applications but protecting the privacy. of data movement, it's going to be a SaaS service. neither of the four of them trust each other, That's a starting point, that's the easiest and also, it also attracts the scammers as well, evolution of the core internet as a technology, on down the horizon so keep an eye out for that, I'm looking forward to your presale, reimagining the applications is what people are thinking of the cloud game, if you will, we're seeing a very markup links are becoming the du jour way the innovation, change never stops. the future is here upon us at theCUBE,

ENTITIES

Entity	Category	Confidence
Don Klein	PERSON	0.99+
Sheila Fitzpatrick	PERSON	0.99+
two	QUANTITY	0.99+
Marc Andreessen	PERSON	0.99+
Alibaba	ORGANIZATION	0.99+
John Furrier	PERSON	0.99+
$2.7 billion	QUANTITY	0.99+
Microsoft	ORGANIZATION	0.99+
Palo Alto	LOCATION	0.99+
Val Bercovici	PERSON	0.99+
Valentin Bercovici	PERSON	0.99+
Europe	LOCATION	0.99+
Google	ORGANIZATION	0.99+
Facebook	ORGANIZATION	0.99+
Manhattan	LOCATION	0.99+
Jian Wang	PERSON	0.99+
22	QUANTITY	0.99+
US	LOCATION	0.99+
China	LOCATION	0.99+
Alphabet	ORGANIZATION	0.99+
May 25th of 2018	DATE	0.99+
24 hours	QUANTITY	0.99+
PencilDATA	ORGANIZATION	0.99+
$10 billion	QUANTITY	0.99+
April	DATE	0.99+
Thursday	DATE	0.99+
Andrew Wang	PERSON	0.99+
Twitter	ORGANIZATION	0.99+
next year	DATE	0.99+
Tim Berners-Lee	PERSON	0.99+
March	DATE	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
General Data Protection Regulation	TITLE	0.99+
GDPR	TITLE	0.99+
billions	QUANTITY	0.99+
CUBE	ORGANIZATION	0.99+
first time	QUANTITY	0.99+
One	QUANTITY	0.99+
more than one site	QUANTITY	0.99+
four kids	QUANTITY	0.99+
both	QUANTITY	0.99+
theCUBE	ORGANIZATION	0.99+
theCUBE Studios	ORGANIZATION	0.99+
Intel	ORGANIZATION	0.98+
three	QUANTITY	0.98+
early '90s	DATE	0.98+
late '80s	DATE	0.98+
this week	DATE	0.98+
more than one department	QUANTITY	0.98+
two-way	QUANTITY	0.98+
Wikibon	ORGANIZATION	0.97+
Alibaba Cloud	ORGANIZATION	0.97+
EU	LOCATION	0.97+
Val	PERSON	0.97+
Pencil	ORGANIZATION	0.97+
second quarter next year	DATE	0.96+
end of this year	DATE	0.96+
Russian	OTHER	0.96+
Twitch	ORGANIZATION	0.96+
hundreds of billions of dollars	QUANTITY	0.95+
20 years ago	DATE	0.95+
one	QUANTITY	0.94+
today	DATE	0.94+

Leah Hunter, Forbes | Samsung Developer Conference 2017

>> Narrator: Live from San Francisco, it's TheCUBE. Covering Samsung Developer Conference 2017 brought to you by Samsung. (techno music) >> Hello there and welcome to the special exclusive coverage of Samsung Developer Conference 2017 here at the Moscone West in San Francisco, TheCUBE's coverage. I'm John Furrier, the co-founder of SiliconANGLE Media and co-host of TheCUBE. We're here with Leah Hunter: author, thought leader, covers technology design, women in tech, a variety of things author at O'Reilly's Safari Books, Fast Company, Forbes, among a lot of other things you've done. Welcome to TheCUBE conversation here at the Samsung Developer Conference. >> Thank you. I appreciate it. >> So, Samsung obviously is tied with Google. We saw Google onstage. The story we're seeing here emerging is the edge of the network of mobile devices. That means the humans involved. That means the consumer and the technology are intersecting. This has been a big part of TheCUBE coverage, we've been looking at this for a while. We were just in China talking with Alibaba Cloud and the design ethos culture. Not just creating user experience, that's been out for a while, but it's not about speeds and feeds anymore. It's about enabling human interactions, we're seeing some bad stuff now. The fake news, all that bad behavior, but now, all the data's out there. This is a big part of the developer design now coming forward. What's your thoughts? >> Well, there are two ways that I see that playing out really powerfully and that it can play out powerfully. One, ethnography and social science is getting embedded into what people are creating now and I'm thrilled to see that, because we're at the beginning of a lot of new technologies, augmented reality is one of my specializations, and we're, you know, sure, it's been around for 60 years if you're counting that way, 15 really deeply, but we're just at the cusp of it really taking hold for consumers. And there's this opportunity for anyone developing AR specifically to build social science ethnography user research into their team to create things in a way that is, like, start as you mean to go on. We can be wise about what our future world looks like. And the second thing is around art. You know, when I came here and I sat down, you mentioned at Alibaba there had just been a conversation about art. Well, in my latest book I interviewed someone who is an artist. His name is Alex Mayhew, he did a bunch of work with Peter Gabriel, he's a digital artist who just happened to slide into technology. And because his background is in something entirely different, he approaches AR in a really different way. He just did something for an art museum in Ontario that's really fantastic and worth checking out. You can actually look up the exhibit. It's called ReBlink. I'm going to write about it, but it's there now. >> Well, you've been covering technology many ways, now you're onto AR, and also you're seeing the front range if you will of these new concepts. But before you get it there, define what ethnography is for the folks that might not know what it is. (laughs) >> Thank you. I forget, okay, so I define ethnography as kind of like seeing the world like a five year old. There's an author that I love, her name is Keri Smith. She writes children's books. I found the first copy of this at the Teat Museum. It's called How to be a Life Artist. But her books are all about close observation, collecting everything, paying attention to the world, and finding everything interesting. Being curious in the same way you do when you're a five year old. Well, that's essentially what an ethnographer does in a business context. They observe, they interview people, they go around and collect data the same way that anyone who's on the data side is doing it with numbers. They do it with quotes and observation and pictures and then aggregate that into a story. >> That brings up a great conversation we're seeing here at the Samsung conference as a trend, a mega trend if you will, and that is the blending of analog and digital. Or, they say, physical to digital. Whatever they want to call it. Internet of things is the tech buzzword, >> (Leah) Yeah >> Internet of things being the senses on devices, or wearables, or things of that nature. That is defined as the edge of the network. This is the big wave that's forcing things to be different at the tech level. So this is where this blending comes in. It's the consumerization of tech. This is a big part of these consumer companies who have to kind of get their act together on cloud computing, and a lot of tech detail. So it's coming down from the edge, the infrastructures being redefined, or replatformed as we say. How do you view that, and what does your data show for you around how companies are reacting, what are the consumer expectations? >> Well, I'm going to speak to what I'm seeing in the world because I approach the world like an ethnographer. I wander around, and I collect interesting bits of things, kind of like a magpie. >> (John) Yeah. >> One thing that I saw this week, or I saw two things that were very interesting. I was just in New York, and I walked past an area where it was branded Amazon, but it kind of looked like a carnival. And I was like, what is going on here? And basically, Amazon is doing pop-ups, I believe they said in 18 cities, they just started in New York, but it's a pop-up where you can text in, and you can buy an item on Amazon that you can't get anywhere else. In this instance it was a Nintendo. You go and you pick it up in this physical space that kind of operates like a carnival and has circusy lights and beautiful trucks and whatever. But I thought that that was the coolest blend, and they also gave me their marketing materials that kind of looked like a ticket to a carnival. But I liked that, because it was a new way a digital focused company is operating in the physical world, to your point. It's a new way of blending those. And Amazon doesn't necessarily have to do it. It's just smart marketing. But it also shows the way that companies are pushing from the internet into the physical world. Now that's also happening in reverse. There's a company I really like called Shimmy that basically uses Kinect sensors to measure your body and make custom-made swimsuits for women. They're using that digital information and they're sort of, like, pushing it, so, yeah. >> Yeah, this is a big thing, I mean, this is about reimagining the future. And I think developers, this is a developer conference, so they tried out all the shiny new toys, Bixby, which is personalization now, IOT, which is kind of a geeky message, but ultimately the developers and the ecosystem partners of Samsung have to create the future together. So the question for you is around how you see the ecosystems developing. I see developers learning more about the real world. Less being behind the wall, if you will. Being the super geeks coding away. You're seeing developers on the front lines. And I think that's super important. I do want to get it noted here that you got a book coming out. >> Yes. >> So tell us what you're working on, cause it's going to ship in December? >> Yeah, I... >> What is the book about? I mean, obviously it's chroniclizing this new wave. What is the book about? Tell us a little bit about the book you're writing. >> So I wrote a book, my last book was about industrial augmented reality specifically, and it was sponsored by PTC, so you can actually go and find it for free. They wanted something that would work around industrial AR, and I wrote it in editorial independence so it is truly my perspective, but what was interesting about that at the time I wrote it, I discovered industrial AR was the most powerful place to play, because there were real world examples of AR actually helping people. >> John: Yeah. >> Now, I've broadened that look to see okay, Goldman Sachs said that there's going to be all this growth. Are the areas that they're looking at, things like education, real estate, you know, construction, is there actually growth there? So it's a broad look at a AR. And it's on O'Reilly's Safari Books. >> John: Well, that's interesting. One of the things that's interesting, you know, I've seen many waves myself, I've been through a bunch of cycles. It used to be the consumers that would lead the trends. But you're bringing up an interesting point around AI, augmented reality, even virtual reality. The innovations coming from the enterprise side. So, industrial IOT is really hot right now cause people are connecting physical plant and equipment. You see drones and it's mostly about industrial, AR's industrial because the use cases are so obvious. >> That's right. >> Not necessarily the consumer side has it yet. So it's almost flipped the entire world around. >> But, with, you know, Pokemon Go, that did sort of give consumers the scent, a scent of, okay, this is what it is you know, with AR kit, it hasn't completely lived up to our expectations, but there has been a flurry of activity around people experimenting to see how it can be applied in a consumer way. And, frankly, you know, there are people like DHL who are starting to roll it out in a way that is somewhere between industrial use and consumer in a broad way. So it's moving there. It is nowhere near ready for it yet. >> Leah Hunter here. A thought leader, writer, author, and a new book coming out. I'll give you the final word. What are you up to? What are you going to do after this event? What's next for you? What's the next couple months look like? Obviously, you've got to jam hard on the book, get that done, what else you working on? >> (laughs) I'm an interesting person to ask that question. I produce a television show called Created Here. I'm flying to Austin after this to interview artists and musicians and shoot our next episode of the show. Then we're going to LA and then New York. >> And where are you based out of? >> Me? >> Yeah. >> San Francisco, New York, a little bit Paris, and some New Orleans. >> You're on the plane a lot. >> I am. I like my life. >> Well, you've got a great life, and obviously great work you're doing. Come by TheCUBE studio in Palo Alto, give us an update on what your findings are as you go get that new perspective of art, artistry, artisans are really going to be the craft, we believe that TheCUBE will be the future of intersecting with technology. More exclusive coverage here in Moscone West in San Francisco, this is the Cube's coverage of Samsung Developer Conference. We'll be right back with more coverage after this short break.

Published Date : Oct 19 2017

SUMMARY :

brought to you by Samsung. I'm John Furrier, the co-founder I appreciate it. This is a big part of the developer And the second thing is around art. the front range if you will of these new concepts. Being curious in the same way you do the blending of analog and digital. That is defined as the edge of the network. because I approach the world like an ethnographer. But it also shows the way that companies are pushing So the question for you is around how What is the book about? about that at the time I wrote it, I discovered Are the areas that they're looking at, One of the things that's interesting, you know, So it's almost flipped the entire world around. consumers the scent, a scent of, okay, this is what it is get that done, what else you working on? and musicians and shoot our next episode of the show. and some New Orleans. I like my life. artistry, artisans are really going to be the craft,

ENTITIES

Entity	Category	Confidence
Alex Mayhew	PERSON	0.99+
Keri Smith	PERSON	0.99+
Peter Gabriel	PERSON	0.99+
John Furrier	PERSON	0.99+
John	PERSON	0.99+
Austin	LOCATION	0.99+
Amazon	ORGANIZATION	0.99+
New York	LOCATION	0.99+
Ontario	LOCATION	0.99+
Alibaba	ORGANIZATION	0.99+
DHL	ORGANIZATION	0.99+
San Francisco	LOCATION	0.99+
Leah Hunter	PERSON	0.99+
LA	LOCATION	0.99+
Samsung	ORGANIZATION	0.99+
Palo Alto	LOCATION	0.99+
New Orleans	LOCATION	0.99+
China	LOCATION	0.99+
Google	ORGANIZATION	0.99+
December	DATE	0.99+
O'Reilly	ORGANIZATION	0.99+
first copy	QUANTITY	0.99+
Paris	LOCATION	0.99+
Moscone West	LOCATION	0.99+
two things	QUANTITY	0.99+
Created Here	TITLE	0.99+
two ways	QUANTITY	0.99+
18 cities	QUANTITY	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
Goldman Sachs	ORGANIZATION	0.99+
Samsung Developer Conference 2017	EVENT	0.99+
PTC	ORGANIZATION	0.99+
TheCUBE	ORGANIZATION	0.99+
Pokemon Go	TITLE	0.99+
Nintendo	ORGANIZATION	0.98+
Samsung Developer Conference	EVENT	0.98+
15	QUANTITY	0.98+
Forbes	ORGANIZATION	0.98+
this week	DATE	0.98+
60 years	QUANTITY	0.98+
Fast Company	ORGANIZATION	0.98+
One	QUANTITY	0.97+
second thing	QUANTITY	0.97+
How to be a Life Artist	TITLE	0.96+
One thing	QUANTITY	0.93+
Shimmy	ORGANIZATION	0.92+
one	QUANTITY	0.9+
Cube	ORGANIZATION	0.88+
Safari Books	TITLE	0.87+
Teat Museum	ORGANIZATION	0.87+
next couple months	DATE	0.84+
Alibaba Cloud	ORGANIZATION	0.84+
five year old	QUANTITY	0.84+
Safari Books	ORGANIZATION	0.79+
big wave	EVENT	0.68+
Bixby	ORGANIZATION	0.65+
Kinect	COMMERCIAL_ITEM	0.64+
ReBlink	ORGANIZATION	0.6+
wave	EVENT	0.57+
TheCUBE	TITLE	0.55+

Nenshad Bardoliwalla & Stephanie McReynolds | BigData NYC 2017

>> Live from midtown Manhattan, it's theCUBE covering Big Data New York City 2017. Brought to you by Silicon Angle Media and its ecosystem sponsors. (upbeat techno music) >> Welcome back, everyone. Live here in New York, Day Three coverage, winding down for three days of wall to wall coverage theCUBE covering Big Data NYC in conjunction with Strata Data, formerly Strata Hadoop and Hadoop World, all part of the Big Data ecosystem. Our next guest is Nenshad Bardoliwalla Co-Founder and Chief Product Officer of Paxata, hot start up in the space. A lot of kudos. Of course, they launched on theCUBE in 2013 three years ago when we started theCUBE as a separate event from O'Reilly. So, great to see the success. And Stephanie McReynolds, you've been on multiple times, VP of Marketing at Alation. Welcome back, good to see you guys. >> Thank you. >> Happy to be here. >> So, winding down, so great kind of wrap-up segment here in addition to the partnership that you guys have. So, let's first talk about before we get to the wrap-up of the show and kind of bring together the week here and kind of summarize everything. Tell about your partnership you guys have. Paxata, you guys have been doing extremely well. Congratulations. Prakash was talking on theCUBE. Great success. You guys worked hard for it. I'm happy for you. But partnering is everything. Ecosystem is everything. Alation, their collaboration with data. That's there ethos. They're very user-centric. >> Nenshad: Yes. >> From the founders. Seemed like a good fit. What's the deal? >> It's a very natural fit between the two companies. When we started down the path of building new information management capabilities it became very clear that the market had strong need for both finding data, right? What do I actually have? I need an inventory, especially if my data's in Amazon S3, my data is in Azure Blob storage, my data is on-premise in HDFS, my data is in databases, it's all over the place. And I need to be able to find it. And then once I find it, I want to be able to prepare it. And so, one of the things that really drove this partnership was the very common interests that both companies have. And number one, pushing user experience. I love the Alation product. It's very easy to use, it's very intuitive, really it's a delightful thing to work with. And at the same time they also share our interests in working in these hybrid multicloud environments. So, what we've done and what we announced here at Strata is actually this bi-directional integration between the products. You can start in Alation and find a data set that you want to work with, see what collaboration or notes or business metadata people have created and then say, I want to go see this in Paxata. And in a single click you can then actually open it up in Paxata and profile that data. Vice versa you can also be in Paxata and prepare data, and then with a single click push it back, and then everybody who works with Alation actually now has knowledge of where that data is. So, it's a really nice synergy. >> So, you pushed the user data back to Alation, cause that's what they care a lot about, the cataloging and making the user-centric view work. So, you provide, it's almost a flow back and forth. It's a handshake if you will to data. Am I getting that right? >> Yeah, I mean, the idea's to keep the analyst or the user of that data, data scientist, even in some cases a business user, keep them in the flow of their work as much as possible. But give them the advantage of understanding what others in the organization have done with that data prior and allow them to transform it, and then share that knowledge back with the rest of the community that might be working with that data. >> John: So, give me an example. I like your Excel spreadsheet concept cause that's obvious. People know what Excel spreadsheet is so. So, it's Excel-like. That's an easy TAM to go after. All Microsoft users might not get that Azure thing. But this one, just take me through a usecase. >> So, I've got a good example. >> Okay, take me through. >> It's very common in a data lake for your data to be compressed. And when data's compressed, to a user it looks like a black box. So, if the data is compressed in Avro or Parquet or it's even like JSON format. A business user has no idea what's in that file. >> John: Yeah. >> So, what we do is we find the file for them. It may have some comments on that file of how that data's been used in past projects that we infer from looking at how others have used that data in Alation. >> John: So, you put metadata around it. >> We put a whole bunch of metadata around it. It might be comments that people have made. It might be >> Annotations, yeah. >> actual observations, annotations. And the great thing that we can do with Paxata is open that Avro file or Parquet file, open it up so that you can actually see the data elements themselves. So, all of a sudden, the business user has access without having to use a command line utility or understand anything about compression, and how you open that file up-- >> John: So, as Paxata spitting out there nuggets of value back to you, you're kind of understanding it, translating it to the user. And they get to do their thing, you get to do your thing, right? >> It's making a Avro or a Parquet file as easy to use as Excel, basically. Which is great, right? >> It's awesome. >> Now, you've enabled >> a whole new class of people who can use that. >> Well, and people just >> Get turned off when it's anything like jargon, or like, "What is that? I'm afraid it's phishing. Click on that and oh!" >> Well, the scary thing is that in a data lake environment, in a lot of cases people don't even label the files with extensions. They're just files. (Stephanie laughs) So, what started-- >> It's like getting your pictures like DS, JPEG. It's like what? >> Exactly. >> Right. >> So, you're talking about unlabeled-- >> If you looked on your laptop, and if you didn't have JPEG or DOC or PPT. Okay, I don't know that this file is. Well, what you have in the data lake environment is that you have thousands of these files that people don't really know what they are. And so, with Alation we have the ability to get all the value around the curation of the metadata, and how people are using that data. But then somebody says, "Okay, but I understand that this file exists. What's in it?" And then with Click to Profile from Alation you're immediately taken into Paxata. And now you're actually looking at what's in that file. So, you can very quickly go from this looks interesting to let me understand what's inside of it. And that's very powerful. >> Talk about Alation. Cause I had the CEO on, also their lead investor Greg Sands from Costanoa Ventures. They're a pretty amazing team but it's kind of out there. No offense, it's kind of a compliment actually. (Stephanie laughs) >> They got a symbolic >> Stephanie: Keep going. system Stanford guy, who's like super-smart. >> Nenshad: Yeah. >> They're on something that's really unique but it's almost too simple to be. Like, wait a minute! Google for the data, it's an awesome opportunity. How do you describe Alation to people who say, "Hey, what's this Alation thing?" >> Yeah, so I think that the best way to describe it is it's the browser for all of the distributed data in the enterprise. Sorry, so it's both the catalog, and the browser that sits on top of it. It sounds very simple. Conceptually it's very simple but they have a lot of richness in what they're able to do behind the scenes in terms of introspecting what type of work people are doing with data, and then taking that knowledge and actually surfacing it to the end user. So, for example, they have very powerful scenarios where they can watch what people are doing in different data sources, and then based on that information actually bubble up how queries are being used or the different patterns that people are doing to consume data with. So, what we find really exciting is that this is something that is very complex under the covers. Which Paxata is as well being built upon Spark. But they have put in the hard engineering work so that it looks simple to the end user. And that's the exact same thing that we've tried to do. >> And that's the hard problem. Okay, Stephanie back ... That was a great example by the way. Can't wait to have our little analyst breakdown of the event. But back to Alation for you. So, how do you talk about, you've been VP of Marketing of Alation. But you've been around the block. You know B2B, tech, big data. So, you've seen a bunch of different, you've worked at Trifacta, you worked at other companies, and you've seen a lot of waves of innovation come. What's different about Alation that people might not know about? How do you describe the difference? Because it sounds easy, "Oh, it's a browser! It's a catalog!" But it's really hard. Is it the tech that's the secret? Is it the approach? How do you describe the value of Alation? I think what's interesting about Alation is that we're solving a problem that since the dawn of the data warehouse has not been solved. And that is how to help end users really find and understand the data that they need to do their jobs. A lot of our customers talk about this-- >> John: Hold on. Repeat that. Cause that's like a key thing. What problem hasn't been solved since the data warehouse? >> To be able to actually find and fully understand, understand to the point of trust the data that you want to use for your analysis. And so, in the world of-- >> John: That sounds so simple. >> Stephanie: In the world of data warehousing-- >> John: Why is it so hard? >> Well, because in the world of data warehousing business people were told what data they should use. Someone in IT decided how to model the data, came up with a KPR calculation, and told you as a business person, you as a CEO, this is how you're going to monitor you business. >> John: Yeah. >> What business person >> Wants to be told that by an IT guy, right? >> Well, it was bounded by IT. >> Right. >> Expression and discovery >> Should be unbounded. Machine learning can take care of a lot of bounded stuff. I get that. But like, when you start to get into the discovery side of it, it should be free. >> Well, no offense to the IT team, but they were doing their best to try to figure out how to make this technology work. >> Well, just look at the cost of goods sold for storage. I mean, how many EMC drives? Expensive! IT was not cheap. >> Right. >> Not even 10, 15, 20 years ago. >> So, now when we have more self-service access to data, and we can have more exploratory analysis. What data science really introduced and Hadoop introduced was this ability on-demand to be able to create these structures, you have this more iterative world of how you can discover and explore datasets to come to an insight. The only challenge is, without simplifying that process, a business person is still lost, right? >> John: Yeah. >> Still lost in the data. >> So, we simply call that a catalog. But a catalog is much more-- >> Index, catalog, anthology, there's other words for it, right? >> Yeah, but I think it's interesting because like a concept of a catalog is an inventory has been around forever in this space. But the concept of a catalog that learns from other's behavior with that data, this concept of Behavior I/O that Aaron talked about earlier today. The fact that behavior of how people query data as an input and that input then informs a recommendation as an output is very powerful. And that's where all the machine learning and A.I. comes to work. It's hidden underneath that concept of Behavior I/O but that's there real innovation that drives this rich catalog is how can we make active recommendations to a business person who doesn't have to understand the technology but they know how to apply that data to making a decision. >> Yeah, that's key. Behavior and textual information has always been the two fly wheels in analysis whether you're talking search engine or data in general. And I think what I like about the trends here at Big Data NYC this weekend. We've certainly been seeing it at the hundreds of CUBE events we've gone to over the past 12 months and more is that people are using data differently. Not only say differently, there's baselining, foundational things you got to do. But the real innovators have a twist on it that give them an advantage. They see how they can use data. And the trend is collective intelligence of the customer seems to be big. You guys are doing it. You're seeing patterns. You're automating the data. So, it seems to be this fly wheel of some data, get some collective data. What's your thoughts and reactions. Are people getting it? Is this by people doing it by accident on purpose kind of thing? Did people just fell on their head? Or you see, "Oh, I just backed into this?" >> I think that the companies that have emerged as the leaders in the last 15 or 20 years, Google being a great example, Amazon being a great example. These are companies whose entire business models were based on data. They've generated out-sized returns. They are the leaders on the stock market. And I think that many companies have awoken to the fact that data as a monetizable asset to be turned into information either for analysis, to be turned into information for generating new products that can then be resold on the market. The leading edge companies have figured that out, and our adopting technologies like Alation, like Paxata, to get a competitive advantage in the business processes where they know they can make a difference inside of the enterprise. So, I don't think it's a fluke at all. I think that most of these companies are being forced to go down that path because they have been shown the way in terms of the digital giants that are currently ruling the enterprise tech world. >> All right, what's your thoughts on the week this week so far on the big trends? What are obvious, obviously A.I., don't need to talk about A.I., but what were the big things that came out of it? And what surprised you that didn't come out from a trends standpoint buzz here at Strata Data and Big Data NYC? What were the big themes that you saw emerge and didn't emerge what was the surprise? Any surprises? >> Basically, we're seeing in general the maturation of the market finally. People are finally realizing that, hey, it's not just about cool technology. It's not about what distribution or package. It's about can you actually drive return on investment? Can you actually drive insights and results from the stack? And so, even the technologists that we were talking with today throughout the course of the show are starting to talk about it's that last mile of making the humans more intelligent about navigating this data, where all the breakthroughs are going to happen. Even in places like IOT, where you think about a lot of automation, and you think about a lot of capability to use deep learning to maybe make some decisions. There's still a lot of human training that goes into that decision-making process and having agency at the edge. And so I think this acknowledgement that there should be balance between human input and what the technology can do is a nice breakthrough that's going to help us get to the next level. >> What's missing? What do you see that people missed that is super-important, that wasn't talked much about? Is there anything that jumps out at you? I'll let you think about it. Nenshad, you have something now. >> Yeah, I would say I completely agree with what Stephanie said which we are seeing the market mature. >> John: Yeah. >> And there is a compelling force to now justify business value for all the investments people have made. The science experiment phase of the big data world is over. People now have to show a return on that investment. I think that being said though, this is my sort of way of being a little more provocative. I still think there's way too much emphasis on data science and not enough emphasis on the average business analyst who's doing work in the Fortune 500. >> It should be kind of the same thing. I mean, with data science you're just more of an advanced analyst maybe. >> Right. But the idea that every person who works with data is suddenly going to understand different types of machine learning models, and what's the right way to do hyper parameter tuning, and other words that I could throw at you to show that I'm smart. (laughter) >> You guys have a vision with the Excel thing. I could see how you see that perspective because you see a future. I just think we're not there yet because I think the data scientists are still handcuffed and hamstrung by the fact that they're doing too much provisioning work, right? >> Yeah. >> To you're point about >> surfacing the insights, it's like the data scientists, "Oh, you own it now!" They become the sysadmin, if you will, for their department. And it's like it's not their job. >> Well, we need to get them out of data preparation, right? >> Yeah, get out of that. >> You shouldn't be a data scientist-- >> Right now, you have two values. You've got the use interface value, which I love, but you guys do the automation. So, I think we're getting there. I see where you're coming from, but still those data sciences have to set the tone for the generation, right? So, it's kind of like you got to get those guys productive. >> And it's not a .. Please go ahead. >> I mean, it's somewhat interesting if you look at can the data scientist start to collaborate a little bit more with the common business person? You start to think about it as a little bit of scientific inquiry process. >> John: Yeah. >> Right? >> If you can have more innovators around the table in a common place to discuss what are the insights in this data, and people are bringing business perspective together with machine learning perspective, or the knowledge of the higher algorithms, then maybe you can bring those next leaps forward. >> Great insight. If you want my observations, I use the crazy analogy. Here's my crazy analogy. Years it's been about the engine Model T, the car, the horse and buggy, you know? Now, "We got an engine in the car!" And they got wheels, it's got a chassis. And so, it's about the apparatus of the car. And then it evolved to, "Hey, this thing actually drives. It's transportation." You can actually go from A to B faster than the other guys, and people still think there's a horse and buggy market out there. So, they got to go to that. But now people are crashing. Now, there's an art to driving the car. >> Right. >> So, whether you're a sports car or whatever, this is where the value piece I think hits home is that, people are driving the data now. They're driving the value proposition. So, I think that, to me, the big surprise here is how people aren't getting into the hype cycle. They like the hype in terms of lead gen, and A.I., but they're too busy for the hype. It's like, drive the value. This is not just B.S. either, outcomes. It's like, "I'm busy. I got security. I got app development." >> And I think they're getting smarter about how their valuing data. We're starting to see some economic models, and some ways of putting actual numbers on what impact is this data having today. We do a lot of usage analysis with our customers, and looking at they have a goal to distribute data across more of the organization, and really get people using it in a self-service manner. And from that, you're being able to calculate what actually is the impact. We're not just storing this for insurance policy reasons. >> Yeah, yeah. >> And this cheap-- >> John: It's not some POC. Don't do a POC. All right, so we're going to end the day and the segment on you guys having the last word. I want to phrase it this way. Share an anecdotal story you've heard from a customer, or a prospective customer, that looked at your product, not the joint product but your products each, that blew you away, and that would be a good thing to leave people with. What was the coolest or nicest thing you've heard someone say about Alation and Paxata? >> For me, the coolest thing they said, "This was a social network for nerds. I finally feel like I've found my home." (laughter) >> Data nerds, okay. >> Data nerds. So, if you're a data nerd, you want to network, Alation is the place you want to be. >> So, there is like profiles? And like, you guys have a profile for everybody who comes in? >> Yeah, so the interesting thing is part of our automation, when we go and we index the data sources we also index the people that are accessing those sources. So, you kind of have a leaderboard now of data users, that contract one another in system. >> John: Ooh. >> And at eBay leader was this guy, Caleb, who was their data scientist. And Caleb was famous because everyone in the organization would ask Caleb to prepare data for them. And Caleb was like well known if you were around eBay for awhile. >> John: Yeah, he was the master of the domain. >> And then when we turned on, you know, we were indexing tables on teradata as well as their Hadoop implementation. And all of a sudden, there are table structures that are Caleb underscore cussed. Caleb underscore revenue. Caleb underscore ... We're like, "Wow!" Caleb drove a lot of teradata revenue. (Laughs) >> Awesome. >> Paxata, what was the coolest thing someone said about you in terms of being the nicest or coolest most relevant thing? >> So, something that a prospect said earlier this week is that, "I've been hearing in our personal lives about self-driving cars. But seeing your product and where you're going with it I see the path towards self-driving data." And that's really what we need to aspire towards. It's not about spending hours doing prep. It's not about spending hours doing manual inventories. It's about getting to the point that you can automate the usage to get to the outcomes that people are looking for. So, I'm looking forward to self-driving information. Nenshad, thanks so much. Stephanie from Alation. Thanks so much. Congratulations both on your success. And great to see you guys partnering. Big, big community here. And just the beginning. We see the big waves coming, so thanks for sharing perspective. >> Thank you very much. >> And your color commentary on our wrap up segment here for Big Data NYC. This is theCUBE live from New York, wrapping up great three days of coverage here in Manhattan. I'm John Furrier. Thanks for watching. See you next time. (upbeat techo music)

Published Date : Oct 3 2017

SUMMARY :

Brought to you by Silicon Angle Media and Hadoop World, all part of the Big Data ecosystem. in addition to the partnership that you guys have. What's the deal? And so, one of the things that really drove this partnership So, you pushed the user data back to Alation, Yeah, I mean, the idea's to keep the analyst That's an easy TAM to go after. So, if the data is compressed in Avro or Parquet of how that data's been used in past projects It might be comments that people have made. And the great thing that we can do with Paxata And they get to do their thing, as easy to use as Excel, basically. a whole new class of people Click on that and oh!" the files with extensions. It's like getting your pictures like DS, JPEG. is that you have thousands of these files Cause I had the CEO on, also their lead investor Stephanie: Keep going. Google for the data, it's an awesome opportunity. And that's the exact same thing that we've tried to do. And that's the hard problem. What problem hasn't been solved since the data warehouse? the data that you want to use for your analysis. Well, because in the world of data warehousing But like, when you start to get into to the IT team, but they were doing Well, just look at the cost of goods sold for storage. of how you can discover and explore datasets So, we simply call that a catalog. But the concept of a catalog that learns of the customer seems to be big. And I think that many companies have awoken to the fact And what surprised you that didn't come out And so, even the technologists What do you see that people missed the market mature. in the Fortune 500. It should be kind of the same thing. But the idea that every person and hamstrung by the fact that they're doing They become the sysadmin, if you will, So, it's kind of like you got to get those guys productive. And it's not a .. can the data scientist start to collaborate or the knowledge of the higher algorithms, the car, the horse and buggy, you know? So, I think that, to me, the big surprise here is across more of the organization, and the segment on you guys having the last word. For me, the coolest thing they said, Alation is the place you want to be. Yeah, so the interesting thing is if you were around eBay for awhile. And all of a sudden, there are table structures And great to see you guys partnering. See you next time.

ENTITIES

Entity	Category	Confidence
Stephanie	PERSON	0.99+
Stephanie McReynolds	PERSON	0.99+
Greg Sands	PERSON	0.99+
John	PERSON	0.99+
Caleb	PERSON	0.99+
John Furrier	PERSON	0.99+
Nenshad	PERSON	0.99+
New York	LOCATION	0.99+
Prakash	PERSON	0.99+
Amazon	ORGANIZATION	0.99+
Aaron	PERSON	0.99+
Silicon Angle Media	ORGANIZATION	0.99+
2013	DATE	0.99+
thousands	QUANTITY	0.99+
Costanoa Ventures	ORGANIZATION	0.99+
Manhattan	LOCATION	0.99+
two companies	QUANTITY	0.99+
both companies	QUANTITY	0.99+
Excel	TITLE	0.99+
Trifacta	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
Strata Data	ORGANIZATION	0.99+
Alation	ORGANIZATION	0.99+
Paxata	ORGANIZATION	0.99+
Nenshad Bardoliwalla	PERSON	0.99+
eBay	ORGANIZATION	0.99+
three days	QUANTITY	0.99+
Microsoft	ORGANIZATION	0.99+
two values	QUANTITY	0.99+
NYC	LOCATION	0.99+
hundreds	QUANTITY	0.99+
Big Data	ORGANIZATION	0.99+
first	QUANTITY	0.99+
one	QUANTITY	0.99+
both	QUANTITY	0.99+
Strata Hadoop	ORGANIZATION	0.99+
Hadoop World	ORGANIZATION	0.99+
earlier this week	DATE	0.98+
Paxata	PERSON	0.98+
today	DATE	0.98+
Day Three	QUANTITY	0.98+
Parquet	TITLE	0.96+
three years ago	DATE	0.96+

Rob Thomas, IBM | Big Data NYC 2017

>> Voiceover: Live from midtown Manhattan, it's theCUBE! Covering Big Data New York City 2017. Brought to you by, SiliconANGLE Media and as ecosystems sponsors. >> Okay, welcome back everyone, live in New York City this is theCUBE's coverage of, eighth year doing Hadoop World now, evolved into Strata Hadoop, now called Strata Data, it's had many incarnations but O'Reilly Media running their event in conjunction with Cloudera, mainly an O'Reilly media show. We do our own show called Big Data NYC here with our community with theCUBE bringing you the best interviews, the best people, entrepreneurs, thought leaders, experts, to get the data and try to project the future and help users find the value in data. My next guest is Rob Thomas, who is the General Manager of IBM Analytics, theCUBE Alumni, been on multiple times successfully executing in the San Francisco Bay area. Great to see you again. >> Yeah John, great to see you, thanks for having me. >> You know IBM is really been interesting through its own transformation and a lot of people will throw IBM in that category but you guys have been transforming okay and the scoreboard yet has to yet to show in my mind what's truly happening because if you still look at this industry, we're only eight years into what Hadoop evolved into now as a large data set but the analytics game just seems to be getting started with the cloud now coming over the top, you're starting to see a lot of cloud conversations in the air. Certainly there's a lot of AI washing, you know, AI this, but it's machine learning and deep learning at the heart of it as innovation but a lot more work on the analytics side is coming. You guys are at the center of that. What's the update? What's your view of this analytics market? >> Most enterprises struggle with complexity. That's the number one problem when it comes to analytics. It's not imagination, it's not willpower, in many cases, it's not even investment, it's just complexity. We are trying to make data really simple to use and the way I would describe it is we're moving from a world of products to platforms. Today, if you want to go solve a data governance problem you're typically integrating 10, 15 different products. And the burden then is on the client. So, we're trying to make analytics a platform game. And my view is an enterprise has to have three platforms if they're serious about analytics. They need a data manager platform for managing all types of data, public, private cloud. They need unified governance so governance of all types of data and they need a data science platform machine learning. If a client has those three platforms, they will be successful with data. And what I see now is really mixed. We've got 10 products that do that, five products that do this, but it has to be integrated in a platform. >> You as an IBM or the customer has these tools? >> Yeah, when I go see clients that's what I see is data... >> John: Disparate data log. >> Yeah, they have disparate tools and so we are unifying what we deliver from a product perspective to this platform concept. >> You guys announce an integrated analytic system, got to see my notes here, I want to get into that in a second but interesting you bring up the word platform because you know, platforms have always been kind of reserved for the big supplier but you're talking about customers having a platform, not a supplier delivering a platform per se 'cause this is where the integration thing becomes interesting. We were joking yesterday on theCUBE here, kind of just kind of ad hoc conceptually like the world has turned into a tool shed. I mean everyone has a tool shed or knows someone that has a tool shed where you have the tools in the back and they're rusty. And so, this brings up the tool conversation, there's too many tools out there that try to be platforms. >> Rob: Yes. >> And if you have too many tools, you're not really doing the platform game right. And complexity also turns into when you bought a hammer it turned into a lawn mower. Right so, a lot of these companies have been groping and trying to iterate what their tool was into something else it wasn't built for. So, as the industry evolves, that's natural Darwinism if you will, they will fall to the wayside. So talk about that dynamic because you still need tooling >> Rob: Yes. but tool will be a function of the work as Peter Burris would say, so talk about how does a customer really get that platform out there without sacrificing the tooling that they may have bought or want to get rid of. >> Well, so think about the, in enterprise today, what the data architecture looks like is, I've got this box that has this software on it, use your terms, has these types of tools on it, and it's isolated and if you want a different set of tooling, okay, move that data to this other box where we have the other tooling. So, it's very isolated in terms of how platforms have evolved or technology platforms today. When I talk about an integrated platform, we are big contributors to Kubernetes. We're making that foundational in terms of what we're doing on Private Cloud and Public Cloud is if you move to that model, suddenly what was a bunch of disparate tools are now microservices against a common architecture. And so it totally changes the nature of the data platform in an enterprise. It's a much more fluid data layer. The term I use sometimes is you have data as a service now, available to all your employees. That's totally different than I want to do this project, so step one, make room in the data center, step two, bring in a server. It's a much more flexible approach so that's what I mean when I say platform. >> So operationalizing it is a lot easier than just going down the linear path of provisioning. All right, so let's bring up the complexity issue because integrated and unified are two different concepts that kind of mean the same thing depending on how you look at it. When you look at the data integration problem, you've got all this complexity around governance, it's a lot of moving parts of data. How does a customer actually execute without compromising the integrity of their policies that they need to have in place? So in other words, what are the baby steps that someone can take, the customers take through with what you guys are dealing with them, how do they get into the game, how do they take steps towards the outcome? They might not have the big money to push it all at once, they might want to take a risk of risk management approach. >> I think there's a clear recipe for doing this right and we have experience of doing it well and doing it not so well, so over time we've gotten some, I'd say a pretty good perspective on that. My view is very simple, data governance has to start with a catalog. And the analogy I use is, you have to do for data what libraries do for books. And think about a library, the first thing you do with books, card catalog. You know where, you basically itemize everything, you know exactly where it sits. If you've got multiple copies of the same book, you can distinguish between which one is which. As books get older they go to archives, to microfilm or something like that. That's what you have to do with your data. >> On the front end. >> On the front end. And it starts with a catalog. And that reason I say that is, I see some organizations that start with, hey, let's go start ETL, I'll create a new warehouse, create a new Hadoop environment. That might be the right thing to do but without having a basis of what you have, which is the catalog, that's where I think clients need to start. >> Well, I would just add one more level of complexity just to kind of reinforce, first of all I agree with you but here's another example that would reinforce this step. Let's just say you write some machine learning and some algorithms and a new policy from the government comes down. Hey, you know, we're dealing with Bitcoin differently or whatever, some GPRS kind of thing happens where someone gets hacked and a new law comes out. How do you inject that policy? You got to rewrite the code, so I'm thinking that if you do this right, you don't have to do a lot of rewriting of applications to the library or the catalog will handle it. Is that right, am I getting that right? >> That's right 'cause then you have a baseline is what I would describe it as. It's codified in the form of a data model or in the form on ontology for how you're looking at unstructured data. You have a baseline so then as changes come, you can easily adjust to those changes. Where I see clients struggle is if you don't have that baseline then you're constantly trying to change things on the fly and that makes it really hard to get to this... >> Well, really hard, expensive, they have to rewrite apps. >> Exactly. >> Rewrite algorithms and machine learning things that were built probably by people that maybe left the company, who knows, right? So the consequences are pretty grave, I mean, pretty big. >> Yes. >> Okay, so let's back to something that you said yesterday. You were on theCUBE yesterday with Hortonworks CEO, Rob Bearden and you were commenting about AI or AI washing. You said quote, "You can't have AI without IA." A play on letters there, sequence of letters which was really an interesting comment, we kind of referenced it pretty much all day yesterday. Information architecture is the IA and AI is the artificial intelligence basically saying if you don't have some sort of architecture AI really can't work. Which really means models have to be understood, with the learning machine kind of approach. Expand more on that 'cause that was I think a fundamental thing that we're seeing at the show this week, this in New York is a model for the models. Who trains the machine learning? Machines got to learn somewhere too so there's learning for the learning machines. This is a real complex data problem and a half. If you don't set up the architecture it may not work, explain. >> So, there's two big problems enterprises have today. One is trying to operationalize data science and machine learning that scale, the other one is getting the cloud but let's focus on the first one for a minute. The reason clients struggle to operationalize this at scale is because they start a data science project and they build a model for one discreet data set. Problem is that only applies to that data set, it doesn't, you can't pick it up and move it somewhere else so this idea of data architecture just to kind of follow through, whether it's the catalog or how you're managing your data across multiple clouds becomes fundamental because ultimately you want to be able to provide machine learning across all your data because machine learning is about predictions and it's hard to do really good predictions on a subset. But that pre-req is the need for an information architecture that comprehends for the fact that you're going to build models and you want to train those models. As new data comes in, you want to keep the training process going. And that's the biggest challenge I see clients struggling with. So they'll have success with their first ML project but then the next one becomes progressively harder because now they're trying to use more data and they haven't prepared their architecture for that. >> Great point. Now, switching to data science. You spoke many times with us on theCUBE about data science, we know you're passionate about you guys doing a lot of work on that. We've observed and Jim Kobielus and I were talking yesterday, there's too much work still in the data science guys plate. There's still doing a lot of what I call, sys admin like work, not the right word, but like administrative building and wrangling. They're not doing enough data science and there's enough proof points now to show that data science actually impacts business in whether it's military having data intelligence to execute something, to selling something at the right time, or even for work or play or consume, or we use, all proof is out there. So why aren't we going faster, why aren't the data scientists more effective, what does it going to take for the data science to have a seamless environment that works for them? They're still doing a lot of wrangling and they're still getting down the weeds. Is that just the role they have or how does it get easier for them that's the big catch? >> That's not the role. So they're a victim of their architecture to some extent and that's why they end up spending 80% of their time on data prep, data cleansing, that type of thing. Look, I think we solved that. That's why when we introduced the integrated analytic system this week, that whole idea was get rid of all the data prep that you need because land the data in one place, machine learning and data science is built into that. So everything that the data scientist struggles with today goes away. We can federate to data on cloud, on any cloud, we can federate to data that's sitting inside Hortonworks so it looks like one system but machine learning is built into it from the start. So we've eliminated the need for all of that data movement, for all that data wrangling 'cause we organized the data, we built the catalog, and we've made it really simple. And so if you go back to the point I made, so one issue is clients can't apply machine learning at scale, the other one is they're struggling to get the cloud. I think we've nailed those problems 'cause now with a click of a button, you can scale this to part of the cloud. >> All right, so how does the customer get their hands on this? Sounds like it's a great tool, you're saying it's leading edge. We'll take a look at it, certainly I'll do a review on it with the team but how do I get it, how do I get a hold of this? What do I do, download it, you guys supply it to me, is it some open source, how do your customers and potential customers engage with this product? >> However they want to but I'll give you some examples. So, we have an analytic system built on Spark, you can bring the whole box into your data center and right away you're ready for data science. That's one way. Somebody like you, you're going to want to go get the containerized version, you go download it on the web and you'll be up and running instantly with a highly performing warehouse integrated with machine learning and data science built on Spark using Apache Jupyter. Any developer can go use that and get value out of it. You can also say I want to run it on my desktop. >> And that's free? >> Yes. >> Okay. >> There's a trial version out there. >> That's the open source, yeah, that's the free version. >> There's also a version on public cloud so if you don't want to download it, you want to run it outside your firewall, you can go run it on IBM cloud on the public cloud so... >> Just your cloud, Amazon? >> No, not today. >> John: Just IBM cloud, okay, I got it. >> So there's variety of ways that you can go use this and I think what you'll find... >> But you have a premium model that people can get started out so they'll download it to your data center, is that also free too? >> Yeah, absolutely. >> Okay, so all the base stuff is free. >> We also have a desktop version too so you can download... >> What URL can people look at this? >> Go to datascience.ibm.com, that's the best place to start a data science journey. >> Okay, multi-cloud, Common Cloud is what people are calling it, you guys have Common SQL engine. What is this product, how does it relate to the whole multi-cloud trend? Customers are looking for multiple clouds. >> Yeah, so Common SQL is the idea of integrating data wherever it is, whatever form it's in, ANSI SQL compliant so what you would expect for a SQL query and the type of response you get back, you get that back with Common SQL no matter where the data is. Now when you start thinking multi-cloud you introduce a whole other bunch of factors. Network, latency, all those types of things so what we talked about yesterday with the announcement of Hortonworks Dataplane which is kind of extending the YARN environment across multi-clouds, that's something we can plug in to. So, I think let's be honest, the multi-cloud world is still pretty early. >> John: Oh, really early. >> Our focus is delivery... >> I don't think it really exists actually. >> I think... >> It's multiple clouds but no one's actually moving workloads across all the clouds, I haven't found any. >> Yeah, I think it's hard for latency reasons today. We're trying to deliver an outstanding... >> But people are saying, I mean this is head room I got but people are saying, I'd love to have a preferred future of multi-cloud even though they're kind of getting their own shops in order, retrenching, and re-platforming it but that's not a bad ask. I mean, I'm a user, I want to move from if I don't like IBM's cloud or I got a better service, I can move around here. If Amazon is too expensive I want to move to IBM, you got product differentiation, I might want to to be in your cloud. So again, this is the customers mindset, right. If you have something really compelling on your cloud, do I have to go all in on IBM cloud to run my data? You shouldn't have to, right? >> I agree, yeah I don't think any enterprise will go all in on one cloud. I think it's delusional for people to think that so you're going to have this world. So the reason when we built IBM Cloud Private we did it on Kubernetes was we said, that can be a substrate if you will, that provides a level of standards across multiple cloud type environments. >> John: And it's got some traction too so it's a good bet there. >> Absolutely. >> Rob, final word, just talk about the personas who you now engage with from IBM's standpoint. I know you have a lot of great developers stuff going on, you've done some great work, you've got a free product out there but you still got to make money, you got to provide value to IBM, who are you selling to, what's the main thing, you've got multiple stakeholders, could you just clarify the stakeholders that you're serving in the marketplace? >> Yeah, I mean, the emerging stakeholder that we speak with more and more than we used to is chief marketing officers who have real budgets for data and data science and trying to change how they're performing their job. That's a major stakeholder, CTOs, CIOs, any C level, >> Chief data officer. >> Chief data officer. You know chief data officers, honestly, it's a mixed bag. Some organizations they're incredibly empowered and they're driving the strategy. Others, they're figure heads and so you got to know how the organizations do it. >> A puppet for the CFO or something. >> Yeah, exactly. >> Our ops. >> A puppet? (chuckles) So, you got to you know. >> Well, they're not really driving it, they're not changing it. It's not like we're mandated to go do something they're maybe governance police or something. >> Yeah, and in some cases that's true. In other cases, they drive the data architecture, the data strategy, and that's somebody that we can engage with right away and help them out so... >> Any events you got going up? Things happening in the marketplace that people might want to participate in? I know you guys do a lot of stuff out in the open, events they can connect with IBM, things going on? >> So we do, so we're doing a big event here in New York on November first and second where we're rolling out a lot of our new data products and cloud products so that's one coming up pretty soon. The biggest thing we've changed this year is there's such a craving for clients for education as we've started doing what we're calling Analytics University where we actually go to clients and we'll spend a day or two days, go really deep and open languages, open source. That's become kind of a new focus for us. >> A lot of re-skilling going on too with the transformation, right? >> Rob: Yes, absolutely. >> All right, Rob Thomas here, General Manager IBM Analytics inside theCUBE. CUBE alumni, breaking it down, giving his perspective. He's got two books out there, The Data Revolution was the first one. >> Big Data Revolution. >> Big Data Revolution and the new one is Every Company is a Tech Company. Love that title which is true, check it out on Amazon. Rob Thomas, Bid Data Revolution, first book and then second book is Every Company is a Tech Company. It's theCUBE live from New York. More coverage after the short break. (theCUBE jingle) (theCUBE jingle) (calm soothing music)

Published Date : Oct 2 2017

SUMMARY :

Brought to you by, SiliconANGLE Media Great to see you again. but the analytics game just seems to be getting started and the way I would describe it is and so we are unifying what we deliver where you have the tools in the back and they're rusty. So talk about that dynamic because you still need tooling that they may have bought or want to get rid of. and it's isolated and if you want They might not have the big money to push it all at once, the first thing you do with books, card catalog. That might be the right thing to do just to kind of reinforce, first of all I agree with you and that makes it really hard to get to this... they have to rewrite apps. probably by people that maybe left the company, Okay, so let's back to something that you said yesterday. and you want to train those models. Is that just the role they have the data prep that you need What do I do, download it, you guys supply it to me, However they want to but I'll give you some examples. There's a That's the open source, so if you don't want to download it, So there's variety of ways that you can go use this that's the best place to start a data science journey. you guys have Common SQL engine. and the type of response you get back, across all the clouds, I haven't found any. Yeah, I think it's hard for latency reasons today. If you have something really compelling on your cloud, that can be a substrate if you will, so it's a good bet there. I know you have a lot of great developers stuff going on, Yeah, I mean, the emerging stakeholder that you got to know how the organizations do it. So, you got to you know. It's not like we're mandated to go do something the data strategy, and that's somebody that we can and cloud products so that's one coming up pretty soon. CUBE alumni, breaking it down, giving his perspective. and the new one is Every Company is a Tech Company.

ENTITIES

Entity	Category	Confidence
Jim Kobielus	PERSON	0.99+
Peter Burris	PERSON	0.99+
Amazon	ORGANIZATION	0.99+
IBM	ORGANIZATION	0.99+
John	PERSON	0.99+
Rob Bearden	PERSON	0.99+
Rob Thomas	PERSON	0.99+
O'Reilly Media	ORGANIZATION	0.99+
80%	QUANTITY	0.99+
10	QUANTITY	0.99+
New York	LOCATION	0.99+
10 products	QUANTITY	0.99+
O'Reilly	ORGANIZATION	0.99+
two days	QUANTITY	0.99+
first book	QUANTITY	0.99+
two books	QUANTITY	0.99+
a day	QUANTITY	0.99+
Rob	PERSON	0.99+
Today	DATE	0.99+
yesterday	DATE	0.99+
New York City	LOCATION	0.99+
Hortonworks	ORGANIZATION	0.99+
San Francisco Bay	LOCATION	0.99+
five products	QUANTITY	0.99+
second book	QUANTITY	0.99+
IBM Analytics	ORGANIZATION	0.99+
this week	DATE	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
first	QUANTITY	0.99+
first one	QUANTITY	0.99+
theCUBE	ORGANIZATION	0.99+
eight years	QUANTITY	0.99+
Spark	TITLE	0.99+
SQL	TITLE	0.99+
Common SQL	TITLE	0.98+
datascience.ibm.com	OTHER	0.98+
eighth year	QUANTITY	0.98+
One	QUANTITY	0.98+
one issue	QUANTITY	0.97+
Hortonworks Dataplane	ORGANIZATION	0.97+
three platforms	QUANTITY	0.97+
Strata Hadoop	TITLE	0.97+
today	DATE	0.97+
The Data Revolution	TITLE	0.97+
Cloudera	ORGANIZATION	0.97+
second	QUANTITY	0.96+
NYC	LOCATION	0.96+
two big problems	QUANTITY	0.96+
Analytics University	ORGANIZATION	0.96+
step two	QUANTITY	0.96+
one way	QUANTITY	0.96+
November first	DATE	0.96+
Big Data Revolution	TITLE	0.95+
one	QUANTITY	0.94+
Every Company is a Tech Company	TITLE	0.94+
CUBE	ORGANIZATION	0.93+
this year	DATE	0.93+
two different concepts	QUANTITY	0.92+
one system	QUANTITY	0.92+
step one	QUANTITY	0.92+

Greg Sands, Costanoa | Big Data NYC 2017

(electronic music) >> Host: Live from Midtown Manhattan it's The Cube! Covering Big Data New York City 2017, brought to you by Silicon Angle Media, and its Ecosystem sponsors. >> Okay, welcome back everyone. We are here live, The Cube in New York City for Big Data NYC, this is our fifth year, doing our own event, not with O'Reilly or Cloud Era at Strata Data, which as Hadoop World, Strata Conference, Strata Hadoop, now called Strata Data, probably called Strata AI next year, we're The Cube every year, bringing you all the great data, and what's going on. Entrepreneurs, VCs, thought leaders, we interview them and bring that to you. I'm John Furrier with our next guest, Greg Sands, who's the managing director and founder of Costa Nova ventures in Palo Alto, started out as an entrepreneur himself, then single shingle out there, now he's a big VC firm on a third fund. >> On the third fund. >> Third fund. How much in that fund? >> 175 million dollar fund. >> So now you're a big firm now, congratulations, and really great to see your success. >> Thanks very much. I mean, we're still very much an early stage boutique focused on companies that change the way the world does business, but it is the case that we have a bigger team and a bigger fund, to go do the same thing. >> Well you've been great to work with, I've been following you, we've known each other for a while, watched you left Sir Hill and start Costanova, but what's interesting is that, I can kind of joke and kid you, the VC inside joke about being a big firm, because I know you want to be small, and like to be small, help entrepreneurs, that's your thing. But it's really not a big firm, it's a few partners, but a lot of people helping companies, that's your ethos, that's what you're all about at your firm. Take a minute to just share with the folks the kinds of things you do and how you get involved in companies, you're hands on, you roll up your sleeves. You get out of the way at the right time, you help when you can, share your ethos. >> Yeah, absolutely so the way we think of it is, combining the craft of old school venture capital, with a modern operating team, and so since most founder these days are product-oriented, our job is to think like product people, not think like investors. So we think like product people, we do product level analysis, we do customer discovery, we do, we go ride along on sales calls when we're making investment decisions. And then we do the things that great venture capitalists have done for years, and so for example, at Alatian, who I know has been on the show today, we were able to incubate them in our office for a year, I had many conversations with Sathien after he'd sold the first two or three customers. Okay, who's the next person we hire? Who isn't a founder? Who's going to go out and sell? What does that person look like? Do you go straight to a VP? Or do you hire an individual contributor? Do you hire someone for domain, or do you hire someone for talent? And that's the thing that we love doing. Now we've actually built out an operating team so marketing partner, Martino Alcenco, and Jim Wilson as a sales partner, to really help turn that into a program, so that they can, we can take these founders who find product market fit, and say, how do we help you build the right sales process and marketing process, sales team and marketing team, for your company, your customer, your product? >> Well it's interesting since you mention old school venture capital, I'll get into some of the dynamics that are going on in Silicon valley, but it's important to bring that forward, because now with cloud you can get to critical mass on the fly wheel, on economics, you can see the visibility faster now. >> Greg: Absolutely. >> So the game of the old school venture capitalist is all the same, how do you get to cruising altitude, whatever metaphor you want to use, the key was getting there, and sometimes it took a couple of rounds, but now you can get these companies with five million, maybe $10 million funding, they can have unit economics visibility, scales insight, then the scale game comes in, so that seems to be the secret trick right now in venture is, don't overspend, keep the valuation in range and allows you to look for multiple exits potentially, or growth. Talk about that dynamic, because this is like, I call it the hour glass. You get through the hour glass, everyone's down here, but if you can sneak through and get the visibility on the economics, then you grow quickly. >> Absolutely. I mean, it's exactly right an I haven't heard the hour glass metaphor before but I like it. You want to basically get through the narrows of product market fit and the beginnings of scalable sales and marketing. You don't need to know all the answers, but you can do that in a capital-efficient way, building really solid foundations for future explosive growth, look, everybody loves fast growth and big markets, and being grown into. But the number of people who basically don't build those foundations and then say, go big or go home! And they take a ton of money, and they go spend all the money, doing things that just fundamentally don't work, and they blow themselves up. >> Well this is the hourglass problem. You have, once you get through that unique economics, then you have true scale, and value will increase. Everybody wins there so it's about getting through that, and you can get through it fast with good mentoring, but here's the challenge that entrepreneurs fall into the trap. I call it the, I think I made it trap. And what happens is they think they're on the other side of the hourglass, but they still haven't even gone through the straight and narrow yet, and they don't know it. And what they do is they over fund and implode. That seems to be a major trap I see a lot of entrepreneurs fall into, while I got a 50 million pre on my B round, or some monster valuation, and they get way too much cash, and they're behaving as if they're scaling, and they haven't even nailed it yet. >> Well, I think that's right. So there's certainly, there are stages of product market fit, and so I think people hit that first stage, and they say, oh I've got it. And they try to explode out of the gates. And we, in fact I know one good example of somebody saying, hey, by the way, we're doing great in field sales, and our investors want us to go really fast, so we are going to go inside and we, my job was to hire 50 inside people, without ever having tried it. And so we always preach crawl, walk, run, right? Hire a couple, see how it works. Right, in a new channel. Or a new category, or an adjacent space, and I think that it's helpful to have an investor who has seen the whole picture to say, yeah, I know it looks like light at the end of the tunnel, but see how it's a relatively small dot? You still got to go a little farther, and then the other thing I say is, look, don't build your company to feed your venture capitalist ego. Right? People do these big rounds of big valuations, and the big dog investors say, go, go, go! But, you're the CEO. Your job is analyze the data. >> John: You can find during the day (laughs). >> And say, you know, given what we know, how fast should we go? Which investments should we make? And you've got to own that. And I think sometimes our job is just to be the pulling guard and clear space for the CEO to make good decisions. >> So you know I'm a big fan, so my bias is pretty much out there, love what you guys are doing. Tim Carr is a Pivot North doing the same thing. Really adding value, getting down and dirty, but the question that entrepreneurs always ask me and talk privately, not about you, but in general, I don't want the VC to get in the way. I want them, I don't want them to preach to me, I don't want too many know-it-alls on my board, I want added value, but again, I don't want the preaching, I don't want them to get in the way, 'cause that's the fear. I'm not saying the same about VCs in general, but that's kind of the mentality of an entrepreneur. I want someone who's going to help me, be in the boat with me, but not be in my way. How do you address that concern to the founders who think, not think like that, but might have a fear. >> Well, by the way, I think it's a legitimate fear, and I think it actually is uncorrelated with added value, right? I think the idea that the board has certain responsibilities, and management has certain responsibilities, is incredibly important. And I think, I can speak for myself in saying, I'm quite conscious of not crossing that line, I think you talk. >> John: You got to build a return, that's the thing. >> But ultimately I would say to an entrepreneur, I'd just say, hey look, call references. And by the way, here are 30 names and phone numbers, and call any one of them, because I think that people who are, so a venture capital know-it-all, in the board room, telling CEOs what to do, destroys value. It's sand in the gears, and it's bad for the company. >> Absolutely, I agree 100% >> And some of my, when I talk about being a pulling guard for the CEO, that's what I'm talking about, which is blocking people who are destructive. >> And rolling the block for a touchdown, kind of use the metaphor. Adding value, that's the key, and that's why I wanted to get that out there because most guys don't get that nuance, and entrepreneurs, especially the younger ones. So it's good and important. Okay, let's talk about culture, obviously in Silicon Valley, I get, reading this morning in the Wymo guy, and they're writing it, that's the Silicon Valley, that's not crazy, there's a lot of great people in Silicon Valley, you're one of them. The culture's certainly an innovative culture, there's been some things in the press, inclusion and diversity, obviously is super important. This whole brogrammer thing that's been kind of kicked around. How are you dealing with all that? Because, you know, this is a cultural shift, but I think it's being made out more than it really is, but there's still our core issues, your thoughts on the whole inclusion and diversity, and this whole brogrammer blowback thing. >> Yeah, well so I think, so first of all, really important issues, glad we're talking about them, and we all need to get better. And to me the question for us has been, what role do we play? And because I would say it is a relatively small subset of the tech industry, and the venture capital industry. At the same time the behavior of that has become public is appalling. It's appalling and totally unacceptable, and so the question is, okay, how can we be a part of the stand-up part of the ecosystem, and some of which is calling things out when we see them. Though frankly we work with and hang out with people and we don't see them that often, and then part of which is, how do we find a couple of ways to contribute meaningfully? So for example this summer we ran what we called the Costanova Access Fellowship, intentionally, trying to provide first opportunity and venture capital for people who traditionally haven't had as much access. We created an event in the spring called, Seat at the Table, really, particularly around women in the tech industry, and it went so well that we're running it in New York on October 19th, so if you're a woman in tech in New York, we'd love to see you then. And we're just trying to figure-- >> You're doing it in an authentic way though, you're not really doing it from a promotional standpoint. It's legit. >> Yeah, we're just trying to do, you know, pick off a couple of things that we can do, so that we can be on the side of the good guys. >> So I guess what you're saying is just have high integrity, and be part of the solution not part of the problem. >> That's right, and by the way, both of these initiatives were ones that were kicked off in late 2016, so it's not a reaction to things like binary capital, and the problems at uper, both of which are appalling. >> Self-awareness is critical. Let's get back to the nuts and bolts of the real reason why I wanted you to come on, one was to find out how much money you have to spend for the entrepreneurs that are watching. Give us the update on the last fund, so you got a new fund that you just closed, the new fund, fund three. You have your other funds that are still out there, and some funds reserved, which, what's the number amount, how much are you writing checks for? Give the whole thesis. >> Absoluteley. So we're an early stage investor, so we lead series A and seed financing companies that change the way the world does business, so up and down the stack, a business-facing software, data-driven applications. Machine-learning and AI driven applications. >> John: But the filter is changing the way the world works? >> The way, yes, but in particularly the way the world does business. You can think of it as a business-facing software stack. We're not social media investors, it's not what we know, it's not what we're good at. And it includes security and management, and the data stack and-- >> Joe: Enterprise and emerging tech. >> That's right. And the-- >> And every crazy idea in between. >> That's right. (laughs) Absolutely, and so we're participate in or leave seed financings as most typically are half a million to maybe one and a quarter, and we'll lead series A financing, small ones might be two or two and a half million dollars at the outer edge is probably a six million dollar check. We were just opening up in the next couple of days, a thousand square feet of incubation space at world headquarters at Palo Alto. >> John: Nice. >> So Alation, Acme Ticketing and Zen IQ are companies that we invested in. >> Joe: What location is this going to be at? >> That's, near the Fills in downtown Palo Alto, 164 staff, and those three companies are ones where we effectively invested at formation and incubated it for a year, we love doing that. >> At the hangout at Philsmore and get the data. And so you got some funds, what else do you have going on? 175 million? >> So one was a $100 million fund, and then fund two was $135 million fund, and the last investment of fund two which we announced about three weeks ago was called Roadster, so it's ecommerce enablement for the modern dealerships. So Omnichannel and Mobile First infrastructure for auto-dealers. We have already closed, and had the first board meeting for the first new investment of fund three, which isn't yet announced, but in the land of computer vision and deep learning, so a couple of the subjects that we care deeply about, and spend a lot of time thinking about. >> And the average check size for the A round again, seed and A, what do you know about the? The lowest and highest? >> The average for the seed is half a million to one and a quarter, and probably average for a series A is four or five. >> And you'll lead As. >> And we will lead As. >> Okay great. What's the coolest thing you're working on right now that gets you excited? It doesn't have to be a portfolio company, but the research you're doing, thing, tires you're kicking, in subjects, or domains? >> You know, so honestly, one of the great benefits of the venture capital business is that I get up and my neurons are firing right away every day. And I do think that for example, one of the things that we love is is all of the adulant infrastructure and so we've got our friends at Victor Ops that are in the middle of that space, and the thinking about how the modern programmer works, how everybody-- >> Joe: Is security on your radar? >> Security is very much on our radar, in fact, someone who you should have on your show is Asheesh Guptar, and Casey Ella, so she's just joined Bug Crowd as the CEO and Casey moves over to CTO, and the word Bug Bounty was just entered into the Oxford Dictionary for the first time last week, so that to me is the ultimate in category creation. So security and dev ops tools are among the things that we really like. >> And bounties will become the norm as more and more decentralized apps hit the scene. Are you doing anything on decentralized applications? I'm not saying Blockchain in particular, but Blockchain like apps, distributing computing you're well versed on. >> That's right, well we-- >> Blockchain will have an impact in your area. >> Blockchain will have an impact, we just spent an hour talking about it in the context our off site in Decosona Lodge in Pascadero, it felt like it was important that we go there. And digging into it. I think actually the edge computing is actually more actionable for us right now, given the things that we're, given the things that we're interested in, and we're doing and they, it is just fascinating how compute centralizes and then decentralizes, centralizes and then decentralizes again, and I do think that there are a set of things that are fascinating about what your process at the edge, and what you send back to the core. >> As Pet Gelson here said in the QU, if you're not out in front of that next wave, you're driftwood, a lot of big waves coming in, you've seen a lot of waves, you were part of one that changed the world, Netscape browser, or the business plan for that first project manager, congratulations. Now you're at a whole nother generation. You ready? (laughs) >> Absolutely, I'm totally ready, I'm ready to go. >> Greg Sands here in The Cube in New York City, part of Big Data NYC, more live coverage with The Cube after this short break, thanks for watching. (electronic jingle) (inspiring electronic music)

Published Date : Sep 29 2017

SUMMARY :

brought to you by Silicon Angle Media, and founder of Costa Nova ventures in Palo Alto, How much in that fund? congratulations, and really great to see your success. but it is the case that we have the kinds of things you do and how you get And that's the thing that we love doing. I'll get into some of the dynamics that are going on is all the same, how do you get to But the number of people who basically but here's the challenge that and the big dog investors say, go, go, go! for the CEO to make good decisions. but that's kind of the mentality of an entrepreneur. Well, by the way, I think it's a legitimate fear, And by the way, here are 30 names and phone numbers, And some of my, and entrepreneurs, especially the younger ones. and so the question is, okay, You're doing it in an authentic way though, so that we can be on the side of the good guys. not part of the problem. and the problems at uper, of the real reason why I wanted you to come on, companies that change the way the world does business, and the data stack and-- And the-- and a half million dollars at the outer edge So Alation, Acme Ticketing and Zen IQ That's, near the Fills in downtown Palo Alto, And so you got some funds, and the last investment of fund two The average for the seed is but the research you're doing, and the thinking about how the modern are among the things that we really like. more and more decentralized apps hit the scene. and what you send back to the core. or the business plan for that first I'm ready to go. Greg Sands here in The Cube in New York City,

ENTITIES

Entity	Category	Confidence
Greg Sands	PERSON	0.99+
Asheesh Guptar	PERSON	0.99+
John	PERSON	0.99+
two	QUANTITY	0.99+
Tim Carr	PERSON	0.99+
John Furrier	PERSON	0.99+
Costa Nova	ORGANIZATION	0.99+
Palo Alto	LOCATION	0.99+
Joe	PERSON	0.99+
October 19th	DATE	0.99+
Costanova	ORGANIZATION	0.99+
Silicon Angle Media	ORGANIZATION	0.99+
$10 million	QUANTITY	0.99+
New York	LOCATION	0.99+
$100 million	QUANTITY	0.99+
five million	QUANTITY	0.99+
Casey Ella	PERSON	0.99+
$135 million	QUANTITY	0.99+
Zen IQ	ORGANIZATION	0.99+
Omnichannel	ORGANIZATION	0.99+
50 million	QUANTITY	0.99+
three companies	QUANTITY	0.99+
Pascadero	LOCATION	0.99+
Greg	PERSON	0.99+
New York City	LOCATION	0.99+
100%	QUANTITY	0.99+
50	QUANTITY	0.99+
Silicon valley	LOCATION	0.99+
Jim Wilson	PERSON	0.99+
O'Reilly	ORGANIZATION	0.99+
Casey	PERSON	0.99+
Alation	ORGANIZATION	0.99+
half a million	QUANTITY	0.99+
30 names	QUANTITY	0.99+
Silicon Valley	LOCATION	0.99+
175 million	QUANTITY	0.99+
first	QUANTITY	0.99+
Victor Ops	ORGANIZATION	0.99+
Pet Gelson	PERSON	0.99+
both	QUANTITY	0.99+
last week	DATE	0.99+
four	QUANTITY	0.99+
three customers	QUANTITY	0.99+
late 2016	DATE	0.99+
fifth year	QUANTITY	0.99+
Cloud Era	ORGANIZATION	0.99+
Acme Ticketing	ORGANIZATION	0.98+
164 staff	QUANTITY	0.98+
NYC	LOCATION	0.98+
five	QUANTITY	0.98+
Oxford Dictionary	TITLE	0.98+
Midtown Manhattan	LOCATION	0.98+
Alatian	ORGANIZATION	0.98+
175 million dollar	QUANTITY	0.98+
next year	DATE	0.98+
today	DATE	0.97+
first time	QUANTITY	0.97+
third fund	QUANTITY	0.97+
first board	QUANTITY	0.97+
Costanoa	PERSON	0.97+
a year	QUANTITY	0.97+
six	QUANTITY	0.97+
one	QUANTITY	0.97+
one and a quarter	QUANTITY	0.96+
Strata Conference	EVENT	0.96+
The Cube	TITLE	0.96+
Strata AI	EVENT	0.96+
million dollar	QUANTITY	0.96+
2017	EVENT	0.95+
first project	QUANTITY	0.95+
two and a half million dollars	QUANTITY	0.95+
Hadoop World	EVENT	0.94+
Sathien	PERSON	0.93+
single shingle	QUANTITY	0.93+
first two	QUANTITY	0.93+
an hour	QUANTITY	0.92+
this summer	DATE	0.92+
first stage	QUANTITY	0.92+
Bug Crowd	ORGANIZATION	0.91+

Matt Maccaux, Dell EMC | Big Data NYC 2017

>> Announcer: Live from Midtown Manhattan. It's the CUBE. Covering Big Data New York City 2017. Brought to you by Silicon Angle Media and its ecosystem sponsor. (electronic music) >> Hey, welcome back everyone, live here in New York. This is the CUBE here in Manhattan for Big Data NYC's three days of coverage. We're one day three, things are starting to settle in, starting to see the patterns out there. I'll say it's Big Data week here, in conjunction with Hadoop World, formerly known as Strata Conference, Strata-Hadoop, Strata-Data, soon to be Strata-AI, soon to be Strata-IOT. Big Data, Mike Maccaux who's the Global Big Data Practice Lead at Dell EMC. We've been in this world now for multiple years and, well, what a riot it's been. >> Yeah, it has. It's been really interesting as the organizations have gone from their legacy systems, they have been modernizing. And we've sort of seen Big Data 1.0 a couple years ago. Big Data 2.0 and now we're moving on sort of the what's next? >> Yeah. >> And it's interesting because the Big Data space has really lagged the application space. You talk about microservices-based applications, and deploying in the cloud and stateless things. The data technologies and the data space has not quite caught up. The technology's there, but the thinking around it, and the deployment of those, it seems to be a slower, more methodical process. And so what we're seeing in a lot of enterprises is that the ones that got in early, have built out capabilities, are now looking for that, how do we get to the next level? How do we provide self-service? How do we enable our data scientists to be more productive within the enterprise, right? If you're a startup, it's easy, right? You're somewhere in the public cloud, you're using cloud based API, it's all fine. But if you're an enterprise, with the inertia of those legacy systems and governance and controls, it's a different problem to solve for. >> Let's just face it. We'll just call a spade a spade. Total cost of ownership was out of control. Hadoop was great, but it was built for something that tried to be something else as it evolved. And that's good also, because we need to decentralize and democratize the incumbent big data warehouse stuff. But let's face it, Hadoop is not the game anymore, it's everything else. >> Right, yep. >> Around it so, we've seen that, that's a couple years old. It's about business value right now. That seems to be the big thing. The separation between the players that can deliver value for the customers. >> Matt: Yep. >> And show a little bit of headroom for future AI things, they've seen that. And have the cloud on premise play. >> Yep. >> Right now, to me, that's the call here. What do you, do you agree? >> I absolutely see it. It's funny, you talk to organizations and they say, "We're going cloud, we're doing cloud." Well what does that mean? Can you even put your data in the cloud? Are you allowed to? How are you going to manage that? How are you going to govern that? How are you going to secure that? So many organizations, once they've asked those questions, they've realized, maybe we should start with the model of cloud on premise. And figure out what works and what doesn't. How do users actually want to self serve? What do we templatize for them? And what do we give them the freedom to do themselves? >> Yeah. >> And they sort of get their sea legs with that, and then we look at sort of a hybrid cloud model. How do we be able to span on premise, off premise, whatever your public cloud is, in a seamless way? Because we don't want to end up with the same thing that we had with mainframes decades ago, where it was, IBM had the best, it was the fastest, it was the most efficient, it was the new paradigm. And then 10 years later, organizations realized they were locked in, there was different technology. The same thing's true if you go cloud native. You're sort of locked in. So how do you be cloud agnostic? >> How do you get locked in a cloud native? You mean with Amazon? >> Or any of them, right? >> Okay. >> So they all have their own APIs that are really good for doing certain things. So Google's TensorFlow happens to be very good. >> Yeah. Amazon EMR. >> But you build applications that are using those native APIS, you're sort of locked. And maybe you want to switch to something else. How do you do that? So the idea is to-- >> That's why Kubernetes is so important, right now. That's a very key workload and orchestration container-based system. >> That's right, so we believe that containerization of workloads that you can define in one place, and deploy anywhere is the path forward, right? Deploy 'em on prem, deploy 'em in a private cloud, public cloud, it doesn't matter the infrastructure. Infrastructure's irrelevant. Just like Hadoop is sort of not that important anymore. >> So let me get your reaction on this. >> Yeah. So Dell EMC, so you guys have actually been a supplier. They've been the leading supplier, and now with Dell EMC across the portfolio of everything. From Dell computers, servers and what not, to storage, EMC's run the table on that for many generations. Yeah, there's people nippin' at your heels like Pure, okay that's fine. >> Sure. It's still storage is storage. You got to store the data somewhere, so storage will always be around. Here's what I heard from a CXO. This is the pattern I hear, but I'll just summarize it in one conversation. And then you can give a reaction to it. John, my life is hell. I have application development investment plan, it's just boot up all these new developers. New dev ops guys. We're going to do open source, I got to build that out. I got that, trying to get dev ops going on. >> Yep. >> That's a huge initiative. I got the security team. I'm unbundling from my IT department, into a new, difference in a reporting to the board. And then I got all this data governance crap underneath here, and then I got IOT over the top, and I still don't know where my security holes are. >> Yep. And you want to sell me what? (Matt laughs) So that's the fear. >> That's right. >> Their plates are full. How do you guys help that scenario? You walk in, actually security's pretty much, important obviously you can see that. But how do you walk into that conversation? >> Yeah, it's sort of stop the madness, right? >> (laughs) That's right. >> And all of that matters-- >> No, but this is all critical. Every room in the house is on fire. >> It is. >> And I got to get my house in order, so your comment to me better not be hype. TensorFlow, don't give me this TensorFlow stuff. >> That's right. >> I want real deal. >> Right, I need, my guys are-- >> I love TensorFlow but, doesn't put the fire out. >> They just want spark, right? I need to speed up my-- >> John: All right, so how do you help me? >> So, what we'd do is, we want to complement and augment their existing capabilities with better ways of scaling their architecture. So let's help them containerize their big data workload so that they can deploy them anywhere. Let's help them define centralized security policies that can be defined once and enforced everywhere, so that now we have a way to automate the deployment of environments. And users can bring their own tools. They can bring their data from outside, but because we have intelligent centralized policies, we can enforce that. And so with our elastic data platform, we are doing that with partners in the industry, Blue Talent and Blue Data, they provide that capability on top of whatever the customer's infrastructure is. >> How important is it to you guys that Dell EMC are partnering. I know Michael Dell talks about it all the time, so I know it's important. But I want to hear your reaction. Down in the trenches, you're in the front lines, providing the value, pulling things together. Partnerships seem to be really important. Explain how you look at that, how you guys do your partners. You mentioned Blue Talent and Blue Data. >> That's right, well I'm in the consulting organization. So we are on the front lines. We are dealing with customers day in and day out. And they want us to help them solve their problems, not put more of our kit in their data centers, on their desktops. And so partnering is really key, and our job is to find where the problems are with our customers, and find the best tool for the best job. The right thing for the right workload. And you know what? If the customer says, "We're moving to Amazon," then Dell EMC might not sell any more compute infrastructure to that customer. They might, we might not, right? But it's our job to help them get there, and by partnering with organizations, we can help that seamless. And that strengthens the relationship, and they're going to purchase-- >> So you're saying that you will put the customer over Dell EMC? >> Well, the customer is number one to Dell EMC. Net promoter score is one of the most important metrics that we have-- >> Just want to make sure get on the record, and that's important, 'cause Amazon, and you know, we saw it in Net App. I've got to say, give Net App credit. They heard from customers early on that Amazon was important. They started building into Amazon support. So people saying, "Are you crazy?" VMware, everyone's saying, "Hey you capitulated "by going to Amazon." Turns out that that was a damn good move. >> That's right. >> For Kelsinger. >> Yep. >> Look at VM World. They're going to own the cloud service provider market as an arms dealer. >> Yep. >> I mean, you would have thought that a year ago, no way. And then when they did the deal, they said, >> We have really smart leadership in the organization. Obviously Michael is a brilliant man. And it sort of trickles on down. It's customer first, solve the customer's problems, build the relationship with them, and there will be other things that come, right? There will be other needs, other workloads. We do happen to have a private cloud solution with Virtustream. Some of these customers need that intermediary step, before they go full public, with a hosted private solution using a Virtustream. >> All right, so what's the, final question, so what's the number one thing you're working on right now with customers? What's the pattern? You got the stack rank, you're requests, your deliverables, where you spend your time. What's the top things you're working on? >> The top thing right now is scaling architectures. So getting organizations past, they've already got their first 20 use cases. They've already got lakes, they got pedabytes in there. How do we enable self service so that we can actually bring that business value back, as you mentioned. Bring that business value back by making those data scientists productive. That's number one. Number two is aligning that to overall strategy. So organizations want to monetize their data, but they don't really know what that means. And so, within a consulting practice, we help our customers define, and put a road map in place, to align that strategy to their goals, the policies, the security, the GDP, or the regulations. You have to marry the business and the technology together. You can't do either one in isolation. Or ultimately, you're not going to be efficient. >> All right, and just your take on Big Data NYC this year. What's going on in Manhattan this year? What's the big trend from your standpoint? That you could take away from this show besides it becoming a sprawl of you know, everyone just promoting their wares. I mean it's a big, hyped show that O'Reilly does, >> It is. >> But in general, what's the takeaway from the signal? >> It was good hearing from customers this year. Customer segments, I hope to see more of that in the future. Not all just vendors showing their wares. Hearing customers actually talk about the pain and the success that they've had. So the Barclay session where they went up and they talked about their entire journey. It was a packed room, standing room only. They described their journey. And I saw other banks walk up to them and say, "We're feeling the same thing." And this is a highly competitive financial services space. >> Yeah, we had Packsotta's customer on Standard Bank. They came off about their journey, and how they're wrangling automating. Automating's the big thing. Machine learning, automation, no doubt. If people aren't looking at that, they're dead in my mind. I mean, that's what I'm seeing. >> That's right. And you have to get your house in order before you can start doing the fancy gardening. >> John: Yeah. >> And organizations aspire to do the gardening, right? >> I couldn't agree more. You got to be able to drive the car, you got to know how to drive the car if you want to actually play in this game. But it's a good example, the house. Got to get the house in order. Rooms are on fire (laughs) right? Put the fires out, retrench. That's why private cloud's kicking ass right now. I'm telling you right now. Wikibon nailed it in their true private cloud survey. No other firm nailed this. They nailed it, and it went viral. And that is, private cloud is working and growing faster than some areas because the fact of the matter is, there's some bursting through the clouds, and great use cases in the cloud. But, >> Yep. >> People have to get the ops right on premise. >> Matt: That's right, yep. >> I'm not saying on premise is going to be the future. >> Not forever. >> I'm just saying that the stack and rack operational model is going cloud model. >> Yes. >> John: That's absolutely happening, that's growing. You agree? >> Absolutely, we completely, we see that pattern over and over and over again. And it's the Goldilocks problem. There's the organizations that say, "We're never going to go cloud." There's the organizations that say, "We're going to go full cloud." For big data workloads, I think there's an intermediary for the next couple years, while we figure out operating pulse. >> This evolution, what's fun about the market right now, and it's clear to me that, people who try to get a spot too early, there's too many diseconomies of scale. >> Yep. >> Let the evolution, Kubernetes looking good off the tee right now. Docker containers and containerization in general's happened. >> Yep. >> Happening, dev ops is going mainstream. >> Yep. >> So that's going to develop. While that's developing, you get your house in order, and certainly go to the cloud for bursting, and other green field opportunities. >> Sure. >> No doubt. >> But wait until everything's teed up. >> That's right, the right workload in the right place. >> I mean Amazon's got thousands of enterprises using the cloud. >> Yeah, absolutely. >> It's not like people aren't using the cloud. >> No, they're, yeah. >> It's not 100% yet. (laughs) >> And what's the workload, right? What data can you put there? Do you know what data you're putting there? How do you secure that? And how do you do that in a repeatable way. Yeah, and you think cloud's driving the big data market right now. That's what I was saying earlier. I was saying, I think that the cloud is the unsubtext of this show. >> It's enabling. I don't know if it's driving, but it's the enabling factor. It allows for that scale and speed. >> It accelerates. >> Yeah. >> It accelerates... >> That's a better word, accelerates. >> Accelerates that horizontally scalable. Mike, thanks for coming on the CUBE. Really appreciate it. More live action we're going to have some partners on with you guys. Next, stay with us. Live in Manhattan, this is the CUBE. (electronic music)

Published Date : Sep 29 2017

SUMMARY :

Brought to you by Silicon Angle Media This is the CUBE here in Manhattan sort of the what's next? And it's interesting because the decentralize and democratize the The separation between the players And have the cloud on premise play. Right now, to me, that's the call here. the model of cloud on premise. IBM had the best, it was the fastest, So Google's TensorFlow happens to be very good. So the idea is to-- and orchestration container-based system. and deploy anywhere is the path forward, right? So let me get your So Dell EMC, so you guys have And then you can give a reaction to it. I got the security team. So that's the fear. How do you guys help that scenario? Every room in the house is on fire. And I got to get my house in order, doesn't put the fire out. the deployment of environments. How important is it to you guys And that strengthens the relationship, Well, the customer is number one to Dell EMC. and you know, we saw it in Net App. They're going to own the cloud service provider market I mean, you would have thought that a year ago, no way. build the relationship with them, You got the stack rank, you're the policies, the security, the GDP, or the regulations. What's the big trend from your standpoint? and the success that they've had. Automating's the big thing. And you have to get your house in order But it's a good example, the house. the stack and rack operational model John: That's absolutely happening, that's growing. And it's the Goldilocks problem. and it's clear to me that, Kubernetes looking good off the tee right now. and certainly go to the cloud for bursting, That's right, the right workload in the I mean Amazon's got It's not 100% yet. And how do you do that in a repeatable way. but it's the enabling factor. Mike, thanks for coming on the CUBE.

ENTITIES

Entity	Category	Confidence
John	PERSON	0.99+
Michael	PERSON	0.99+
Amazon	ORGANIZATION	0.99+
Mike Maccaux	PERSON	0.99+
Matt Maccaux	PERSON	0.99+
IBM	ORGANIZATION	0.99+
Matt	PERSON	0.99+
Manhattan	LOCATION	0.99+
Silicon Angle Media	ORGANIZATION	0.99+
Dell	ORGANIZATION	0.99+
EMC	ORGANIZATION	0.99+
New York	LOCATION	0.99+
100%	QUANTITY	0.99+
Blue Data	ORGANIZATION	0.99+
Mike	PERSON	0.99+
Blue Talent	ORGANIZATION	0.99+
Dell EMC	ORGANIZATION	0.99+
Standard Bank	ORGANIZATION	0.99+
Big Data	ORGANIZATION	0.99+
this year	DATE	0.99+
one	QUANTITY	0.99+
VM World	ORGANIZATION	0.99+
Michael Dell	PERSON	0.99+
thousands	QUANTITY	0.99+
Barclay	ORGANIZATION	0.99+
Hadoop	TITLE	0.98+
three days	QUANTITY	0.98+
decades ago	DATE	0.98+
NYC	LOCATION	0.98+
one day	QUANTITY	0.98+
one conversation	QUANTITY	0.98+
Goldilocks	PERSON	0.98+
O'Reilly	ORGANIZATION	0.98+
a year ago	DATE	0.98+
Wikibon	ORGANIZATION	0.98+
Midtown Manhattan	LOCATION	0.98+
10 years later	DATE	0.97+
TensorFlow	ORGANIZATION	0.97+
first 20 use cases	QUANTITY	0.97+
Google	ORGANIZATION	0.97+
Kelsinger	PERSON	0.97+
New York City	LOCATION	0.96+
first	QUANTITY	0.95+
VMware	ORGANIZATION	0.93+
Strata Conference	EVENT	0.93+
Big Data	EVENT	0.92+
Strata-Hadoop	EVENT	0.9+
Strata-Data	EVENT	0.9+
Number two	QUANTITY	0.9+
next couple years	DATE	0.86+
couple years ago	DATE	0.84+
2017	DATE	0.84+
Global Big Data	ORGANIZATION	0.83+
Packsotta	ORGANIZATION	0.83+
Hadoop World	ORGANIZATION	0.83+
Big Data 2.0	TITLE	0.81+
three	QUANTITY	0.79+
couple years	QUANTITY	0.76+
Big Data 1.0	TITLE	0.73+
Net App	TITLE	0.72+
2017	EVENT	0.71+
one place	QUANTITY	0.69+
number one	QUANTITY	0.67+
Kubernetes	ORGANIZATION	0.67+
enterprises	QUANTITY	0.66+

Andrew Gilman and Andrew Burt, Immuta | Big Data NYC 2017

>> Narrator: Live from Midtown Manhattan it's theCUBE! Covering Big Data, New York City 2017. Brought to you by SiliconANGLE Media and its ecosystem sponsor. >> Okay, welcome back everyone. Live here in New York this is theCUBE's coverage of Big Data NYC, our event. We've been doing it for five years, it's our event in conjunction with Strata Data, which is the O'Reilly Media that we run, it's a separate event. But we've been covering the Big Data for eight years since 2010, Hadoop World. This is theCUBE. Of course theCUBE is never going to change, they might call it Strata AI next year, whatever trend that they might see. But we're going to keep it theCUBE. This is in New York City, our eighth year of coverage. Guys, welcome to theCUBE. Our next two guests is Andrew Burt, Chief Privacy Officer and Andrew Gillman, Chief Customer Officer and CMO. It's a start-up so you got all these fancy titles, but you're on the A-team from Immuta. Hot start-up. Welcome to theCUBE. Great to see you again. >> Thanks for having us, appreciate it. >> Okay, so you guys are the start-up feature here this week on theCUBE, our little segment here. I think you guys are the hottest start-up that is out there and that people aren't really talking a lot about. So you guys are brand new, you guys have got a really good reputation. Getting a lot of props inside the community. Especially in the people who know data, data science, and know some of the intelligence organizations. But respectful people like Dan Hutchin says you guys are rockstars and doing great. So why all the buzz inside the community? Now you guys are just starting to go to the market? What's the update on the company? >> So great story. Founded in 2014, (mumbles) Investment, it was announced earlier this year. And the team, group of serial entrepreneurs sold their last company CSC, ran the public sector business for them for a while. Really special group of engineers and technologists and data scientists. Headquartered out of D.C. Customer success organization out of Columbus, Ohio, and we're servicing Fortune 100 companies. >> John: So Immuta, I-M-M-U-T-A. >> Immuta.com we just launched the new website earlier this week in preparation for the show. And the easiest way-- >> Immuta, immutable, I mean-- >> Immutable, I'm sure there's a backstory. >> Immutable, yeah. We do not ever touch the raw data. So we're all about managing risk and managing the integrity of the data. And so risk and integrity and security are baked into everything we do. We want our customers to know that their data will be immutable, and that in using us they'll never pose an additional risk to that underlying data. >> I think of blockchain when I think of immutability, like I'm so into blockchaining these dayS as you guys know, I've been totally into it. >> There's no blockchain in their technology. >> I know, but let's get down to why the motivation to enter the market. There's a lot of noisy stuff out there. Why do we need another unified platform? >> The big opportunity that we saw was, organizations had spent basically the past decade refining and upgrading their application infrastructure. But in doing so under the guise of digital transformation. We've really built that organization's people processes to support monolithic applications. Now those applications are moving to the cloud, they're being rearchitected in a microsurfaces architecture. So we have all this data now, how do we manage it for the new application, which we see is really algorithm-centric? The Amazons of the world have proven, how do you compete against anyone? How do you disrupt any industry? That's operationalize your data in a new way. >> Oh, they were developer-centric right? They were very focused on the developer. You guys are saying you're algorithm-centric, meaning the software within the software kind of thing. >> It's really about, we see the future enterprise to compete. You have to build thousands of algorithms. And each one of those algorithms is going to do something very specific, very precise, but faster than any human can do. And so how do you enable an application, excuse me, an algorithm-centric infrastructure to support that? And today, as we go and meet with our customers and other groups, the people, the processes, the data is everywhere. The governance folks who have to control how the data is used, the laws are dynamic. The tooling is complex. So this whole world looks very much like pre-DevOps IT, or pre-cloud IT. It takes on average between four to six months to get a data scientist up and running on a project. >> Let's get into the company. I wanted to just get that gist out, put some context. I see the problem you solve: a lot of algorithms out there, more and more open sources coming up to the scene. With the Linux Foundation, having their new event Rebrand the Open Source summit, shows exponential growth in open source. So no doubt about it, software's going to be new guys coming on, new gals. Tons of software. What is the company positioning? What do you guys do? How many employees? Let's go down by the numbers and then talk about the problem that you solve. >> Okay, cool. So, company. We'll be about 40 people by Q1. Heavy engineering, go to market. We're operating and working with, as I mentioned, Fortune 100 clients. Highly regulated industries. Financial services, healthcare, government, insurance, et cetera. So where you have lots of data that you need to operationalize, that's very sensitive to use. What else? Company positioning. So we are positioned as data management for data science. So the opportunity that we saw, again, managing data for applications is very different than managing data for algorithm development, data sciences. >> John: So you're selling to the CDO, Chief Data Officer? Are you selling to the analytics? >> In a lot of our customers, like in financial services, we're going right into the line of business. We're working with managing directors who are building next generation analytics infrastructure that need to unify and connect the data in a new way that's dynamic. It's not just the data that they have within their organization, they're looking to bring data in from outside. They want to also work collaboratively with governance professionals and lawyers who in financial services, they are, you know, we always jest in the company that different organizations have these cool new tools, like data scientists have all their new tools. And the data owners have flash disks and they have all this. But the governance people still have Microsoft Word. And maybe the newer tools are like Wikis. So now we can get it off of Word and make it shareable. But what we allow them to do is, and what Andrew Burt has really driven, is the ability for you to take internal logic, internal policies, external regulations, and put them into code that becomes dynamically enforceable as you're querying the data, as you're using it, to train algorithms, and to drive, mathematical decision-making in the enterprise. >> Let's jump into some of the privacy. You're the Chief Privacy Officer, which is codeword for you're doing all the governance stuff. And there's a lot of stuff business-wise that's going on around GDPR which is actually relevant. There's a lot of dollars on table for that too, so it's probably good for business. But there's a lot of policy stuff going on. What's going on with you guys in this area? >> So I think policy is really catching up to the world of big data. We've known for a very long time that data is incredibly important. It's the lifeblood of an increasingly large number of organizations, and because data is becoming more important, laws are starting to catch up. I think GDPR is really, it's hot to talk about. I think it is just the beginning of a larger trend. >> People are scared. People are nervous. It's like they don't know, this could be a blank check that they're signing away. The enforcement side is pretty outrageous. >> So I mean-- >> Is that right? I mean people are scared, or do you think? >> I think people are terrified because they know that its important, and they're also terrified because data scientists, and folks in IT have never really had to think very seriously about implementing complex laws. I think GDPR is the first example of laws, forcing technology to basically blend software and law. The only way, I mean one of our theses is, the only way to actually solve for GDPR is to invent laws within the software you're using. And so, we're moving away from this meetings and memos type approach to governing data, which is very slow and can take months, and we need it to happen dynamically. >> This is why I wanted to bring you guys in. Not only, Andrew, we knew each other from another venture, but what got my attention for you guys was really this intersection between law and society and tech. And this is just the beginning. You look at the tell-signs there. Peter Burris who runs research for Wikibon coined the term programming the real world. Life basically. You've got wearables, you've got IOT, this is happening. Self-driving cars. Who decides what side of the street people walk on now? Law and code are coming together. That's algorithm. There'll be more of them. Is there an algorithm for the algorithms? Who teaches the data set, who shares the data set? Wait a minute, I don't want to share my data set because I have a law that says I can't. Who decides all this stuff? >> Exactly. We're starting to enter a world where governments really, really care about that stuff. Just in-- >> In Silicon Valley, that's not in their DNA. You're seeing it all over the front pages of the news, they can't even get it right in inclusion and diversity. How can they work with laws? >> Tension is brewing. In the U.S. our regulatory environment is a little more lax, we want to see innovation happen first and then regulate. But the EU is completely different. Their laws in China and Russia and elsewhere around the world. And it's basically becoming impossible to be a global organization and still take that approach where you can afford to be scared of the law. >> John: I don't know how I feel about this because I get all kinds of rushes of intoxication to fear. Look at what's going on with Bitcoin and Blockchain, underbelly is a whole new counterculture going on around in-immutable data. Anonymous cultures, where they're complete anonymous underbellies going on. >> I think the risk-factors going up, when you mentioned IOTs, so its where you are and your devices and your home. Now think about 23 and Me, Verily, Freenome, where you're digitizing your DNA. We've already started to do that with MRIs and other operations that we've had. You think about now, I'm handing over my DNA to an organization because I want find out my lineage. I want to learn about where I came from. How do I make sure that the derived data off of that digital DNA is used properly? Not just for me, as Andrew, but for my progeny. That introduces some really interesting ethical issues. It's an intersection of this new wave of investment, to your point, like in Silicon Valley, of bringing healthcare into data science, into technology and the intersection. And the underpinning of the whole thing is the data. How do we manage the data, and what do we do-- >> And AI really is the future here. Even though machine-learning is the key part of AI, we just put out an article this morning on SiliconANGLE from Gina Smith, our new writer. Google Brain Chief: AI tops humans in computer vision, and healthcare will never be the same. They talk about little things, like in 2011 you can barely do character recognition of pictures, now you can 100%. Now you take that forward, in Heidelberg, Germany, the event this week we were covering the Heidelberg Laureate Forum, or HLF 2017. All the top scientists were there talking about this specific issue of, this is society blending in with tech. >> Absolutely. >> This societal impact, legal impact, kind of blending. Algorithms are the only thing that are going to scale in this area. This is what you guys are trying to do, right? >> Exactly, that's the interesting thing. When you look at training models and algorithms in AI, right, AI is the new cloud. We're in New York, I'm walking down the street, and there's the algorithm you're writing, and everything is Ernestine Young. Billboards on algorithms, I mean who would have thought, right? An AI. >> John: theCUBE is going to be an AI pretty soon. "Hey, we're AI! "Brought to you by, hey, Siri, do theCUBE interview." >> But the interesting part of the whole AI and the algorithm is you have n number of models. We have lots of data scientists and AI experts. Siri goes off. >> Sorry Siri, didn't mean to do that. >> She's trying to join the conversation. >> Didn't mean to insult you, Siri. But you know, it's applied math by a different name. And you have n number of models, assuming 90% of all algorithms are single linear regression. What ultimately drives the outcome is going to be how you prepare and manage the data. And so when we go back to the governance story. Governance in applications is very different than governance in data science because how we actually dynamically change the data is going to drive the outcome of that algorithm directly. If I'm in Immuta, we connect the data, we connect the data science tools. We allow you to control the data in a unique way. I refer to that as data personalization. It's not just, can I subscribe to the data? It's what does the data look like based on who I am and what those internal and external policies are? Think about this for example, I'm training a model that doesn't mask against race, and doesn't generalize against age. What do you think is going to happen to that model when it goes to start to interact? Either it's delivered as-- >> Well context is critical. And the usability of data, because it's perishable at this point. Data that comes in quick is worth more, but historically the value goes down. But it's worth more when you train the machine. So it's two different issues. >> Exactly. So it's really about longevity of the model. How can we create and train a model that's going to be able to stay in? It's like the new availability, right? That it's going to stay, it's going to be relevant, and it's going to keep us out of jail, and keep us from getting sued as long as possible. >> Well Jeff Dean, I just want to quote one more thing to add context. I want to ask Andrew over here about his view on this. Jeff Dean, the Google Brain Chief behind all of the stuff is saying AI-enabled healthcare. The sector's set to grow at an annual rate of 40% through 2021, when it's expected to hit 6.6 billion spent on AI-enabled healthcare. 6.6 billion. Today it's around 600 million. That's the growth just in AI healthcare impact. Just healthcare. This is going to go from a policy privacy issue, One, healthcare data has been crippled with HIPPA slowing us down. But where is the innovation going to come from? Where's the data going to be in healthcare? And other verticals. This is one vertical. Financial services is crazy too. >> I mean, honestly healthcare is one of the most interesting examples of applied AI, and it's because there's no other realm, at least now, where people are thinking about AI, and the risk is so apparent. If you get a diagnosis and the doctor doesn't understand why it's very apparent. And if they're using a model that has a very low level of transparency, that ends up being really important. I think healthcare is a really fascinating sector to think about. But all of these issues, all of these different types of risks that have been around for a while are starting to become more and more important as AI takes-- >> John: Alright, so I'm going to wrap up here. Give you guys both a chance, and you can't copy each other's answer. So we'll start with you Andrew over here. Explain Immuta in a simple way. Someone who's not in the industry. What do you guys do? And then do a version for someone in the industry. So elevator pitch for someone who's a friend, who's not in the industry, and someone who is. >> So Immuta is a data management platform for data science. And what that actually gives you is, we take the friction out of trying to access data, and trying to control data, and trying to comply with all of the different rules that surround the use of that data. >> John: Great, now do the one for normal people. >> That was the normal pitch. >> Okay! (laughing) I can't wait to hear the one for the insiders. >> And then for the insiders-- >> Just say, "It's magic". >> It's magic. >> We're magic, you know. >> Coming from the infrastructure role, I like to refer to it as a VMWare for data science. We create an abstraction layer than sits between the data and the data science tools, and we'll dynamically enforce policies based on the values of the organization. But also, it drives better outcomes. Because today, the data owners aren't confident that you're going to do with the data what you say you're going to do. So they try to hold it. Like the old server-huggers, the data-huggers. So we allowed them to unlock that and make it universally available. We allow the governance people to get off those memos, that have to be interpreted by IT and enforced, and actually allow them to write code and have it be enforced as the policy mandates. >> And the number one problem you solve is what? >> Accelerate with confidence. We allow the data scientists to go and build models faster by connecting to the data in a way that they're confident that when they deploy their model, that it's going to go into production, and it's going to stay into production for as long as possible. >> And what's the GDPR angle? You've got the legal brain over here, in policy. What's going on with GDPR? How are you guys going to be a solution for that? >> We have the most, I'd say, robust option of policy enforcement on data, I think, available. We make it incredibly easy to comply with GDPR. We actually put together a sample memo that says, "Here's what it looks like to comply with GDPR." It's written from a governance department, sent to the internal data science department. It's about a page and a half long. We actually make that very onerous process-- >> (mumbles) GDPR, you guys know the size of that market? In terms of spend that's going to be coming around the corner? I think it's like the Y2K problem that's actually real. >> Exactly, it feels the same way. And actually Andrew and his team have taken apart the regulation article by article and have actually built-in product features that satisfy that. It's an interesting and unique--- >> John: I think it's really impressive that you guys bring a legal and a policy mind into the product discussion. I think that's something that I think you guys are doing a little bit different than I see anyone out there. You're bringing legal and policy into the software fabric, which is unique, and I think it's going to be the standard in my opinion. Hopefully this is a good trend, hopefully you guys keep in touch. Thanks for coming on theCUBE, thanks for-- >> Thanks for having us. >> For making time to come over. This is theCUBE, breaking out the start-up action sharing the hot start-ups here, that really are a good position in the marketplace, as the generation of the infrastructure changes. It's a whole new ballgame. Global development platform, called the Internet. The new Internet. It's decentralized, we even get into Blockchain, we want to try that a little later, maybe another segment. It's theCUBE in New York City. More after this short break.

Published Date : Sep 29 2017

SUMMARY :

Brought to you by SiliconANGLE Media Great to see you again. Thanks for having us, and know some of the intelligence organizations. And the team, group of serial entrepreneurs And the easiest way-- managing the integrity of the data. as you guys know, to enter the market. The Amazons of the world have proven, meaning the software within the software kind of thing. And each one of those algorithms is going to do something I see the problem you solve: a lot of algorithms out there, So the opportunity that we saw, again, managing data is the ability for you to take internal logic, What's going on with you guys in this area? It's the lifeblood of an increasingly large It's like they don't know, and folks in IT have never really had to think This is why I wanted to bring you guys in. We're starting to enter a world where governments really, You're seeing it all over the front pages of the news, and elsewhere around the world. because I get all kinds of rushes of intoxication to fear. How do I make sure that the derived data And AI really is the future here. Algorithms are the only thing that are going to scale Exactly, that's the interesting thing. "Brought to you by, hey, Siri, do theCUBE interview." and the algorithm is you have n number of models. is going to be how you prepare and manage the data. And the usability of data, So it's really about longevity of the model. Where's the data going to be in healthcare? and the risk is so apparent. and you can't copy each other's answer. that surround the use of that data. I can't wait to hear the one for the insiders. We allow the governance people to get off those memos, We allow the data scientists to go and build models faster How are you guys going to be a solution for that? We have the most, I'd say, robust option In terms of spend that's going to be coming around the corner? Exactly, it feels the same way. and I think it's going to be the standard in my opinion. that really are a good position in the marketplace,

ENTITIES

Entity	Category	Confidence
Andrew	PERSON	0.99+
Andrew Burt	PERSON	0.99+
Jeff Dean	PERSON	0.99+
John	PERSON	0.99+
2014	DATE	0.99+
Peter Burris	PERSON	0.99+
Gina Smith	PERSON	0.99+
Dan Hutchin	PERSON	0.99+
Ernestine Young	PERSON	0.99+
Andrew Gillman	PERSON	0.99+
Andrew Gilman	PERSON	0.99+
2011	DATE	0.99+
90%	QUANTITY	0.99+
O'Reilly Media	ORGANIZATION	0.99+
D.C.	LOCATION	0.99+
100%	QUANTITY	0.99+
New York City	LOCATION	0.99+
New York	LOCATION	0.99+
Linux Foundation	ORGANIZATION	0.99+
Siri	TITLE	0.99+
GDPR	TITLE	0.99+
6.6 billion	QUANTITY	0.99+
Word	TITLE	0.99+
Silicon Valley	LOCATION	0.99+
Today	DATE	0.99+
40%	QUANTITY	0.99+
five years	QUANTITY	0.99+
2021	DATE	0.99+
EU	ORGANIZATION	0.99+
Columbus, Ohio	LOCATION	0.99+
eighth year	QUANTITY	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
today	DATE	0.99+
China	LOCATION	0.99+
thousands	QUANTITY	0.98+
HLF 2017	EVENT	0.98+
both	QUANTITY	0.98+
Wikibon	ORGANIZATION	0.98+
next year	DATE	0.98+
NYC	LOCATION	0.98+
two different issues	QUANTITY	0.98+
one	QUANTITY	0.98+
Immuta.com	ORGANIZATION	0.98+
Google	ORGANIZATION	0.98+
single	QUANTITY	0.97+
each one	QUANTITY	0.97+
Immuta	ORGANIZATION	0.97+
Heidelberg, Germany	LOCATION	0.97+
four	QUANTITY	0.97+
two guests	QUANTITY	0.97+
this week	DATE	0.97+
earlier this year	DATE	0.97+
Russia	LOCATION	0.97+
around 600 million	QUANTITY	0.97+
eight years	QUANTITY	0.97+
Y2K	ORGANIZATION	0.96+
six months	QUANTITY	0.96+
Big Data	TITLE	0.96+
Heidelberg Laureate Forum	EVENT	0.96+
U.S.	LOCATION	0.95+
theCUBE	ORGANIZATION	0.95+
earlier this week	DATE	0.95+
Freenome	ORGANIZATION	0.95+
Midtown Manhattan	LOCATION	0.93+
past decade	DATE	0.92+
2010	DATE	0.92+
about a page and a half	QUANTITY	0.91+

Arun Murthy, Hortonworks | BigData NYC 2017

>> Coming back when we were a DOS spreadsheet company. I did a short stint at Microsoft and then joined Frank Quattrone when he spun out of Morgan Stanley to create what would become the number three tech investment (upbeat music) >> Host: Live from mid-town Manhattan, it's theCUBE covering the BigData New York City 2017. Brought to you by SiliconANGLE Media and its ecosystem sponsors. (upbeat electronic music) >> Welcome back, everyone. We're here, live, on day two of our three days of coverage of BigData NYC. This is our event that we put on every year. It's our fifth year doing BigData NYC in conjunction with Hadoop World which evolved into Strata Conference, which evolved into Strata Hadoop, now called Strata Data. Probably next year will be called Strata AI, but we're still theCUBE, we'll always be theCUBE and this our BigData NYC, our eighth year covering the BigData world since Hadoop World. And then as Hortonworks came on we started covering Hortonworks' data summit. >> Arun: DataWorks Summit. >> DataWorks Summit. Arun Murthy, my next guest, Co-Founder and Chief Product Officer of Hortonworks. Great to see you, looking good. >> Likewise, thank you. Thanks for having me. >> Boy, what a journey. Hadoop, years ago, >> 12 years now. >> I still remember, you guys came out of Yahoo, you guys put Hortonworks together and then since, gone public, first to go public, then Cloudera just went public. So, the Hadoop World is pretty much out there, everyone knows where it's at, it's got to nice use case, but the whole world's moved around it. You guys have been, really the first of the Hadoop players, before ever Cloudera, on this notion of data in flight, or, I call, real-time data but I think, you guys call it data-in-motion. Batch, we all know what Batch does, a lot of things to do with Batch, you can optimize it, it's not going anywhere, it's going to grow. Real-time data-in-motion's a huge deal. Give us the update. >> Absolutely, you know, we've obviously been in this space, personally, I've been in this for about 12 years now. So, we've had a lot of time to think about it. >> Host: Since you were 12? >> Yeah. (laughs) Almost. Probably look like it. So, back in 2014 and '15 when we, sort of, went public and we're started looking around, the thesis always was, yes, Hadoop is important, we're going to love you to manage lots and lots of data, but a lot of the stuff we've done since the beginning, starting with YARN and so on, was really enable the use cases beyond the whole traditional transactions and analytics. And Drop, our CO calls it, his vision's always been we've got to get into a pre-transactional world, if you will, rather than the post-transactional analytics and BIN and so on. So that's where it started. And increasingly, the obvious next step was to say, look enterprises want to be able to get insights from data, but they also want, increasingly, they want to get insights and they want to deal with it in real-time. You know while you're in you shopping cart. They want to make sure you don't abandon your shopping cart. If you were sitting at at retailer and you're on an island and you're about to walk away from a dress, you want to be able to do something about it. So, this notion of real-time is really important because it helps the enterprise connect with the customer at the point of action, if you will, and provide value right away rather than having to try to do this post-transaction. So, it's been a really important journey. We went and bought this company called Onyara, which is a bunch of geeks like us who started off with the government, built this batching NiFi thing, huge community. Its just, like, taking off at this point. It's been a fantastic thing to join hands and join the team and keep pushing in the whole streaming data style. >> There's a real, I don't mean to tangent but I do since you brought up community I wanted to bring this up. It's been the theme here this week. It's more and more obvious that the community role is becoming central, beyond open-source. We all know open-source, standing on the shoulders before us, you know. And Linux Foundation showing code numbers hitting up from $64 million to billions in the next five, ten years, exponential growth of new code coming in. So open-source certainly blew me. But now community is translating to things you start to see blockchain, very community based. That's a whole new currency market that's changing the financial landscape, ICOs and what-not, that's just one data point. Businesses, marketing communities, you're starting to see data as a fundamental thing around communities. And certainly it's going to change the vendor landscape. So you guys compare to, Cloudera and others have always been community driven. >> Yeah our philosophy has been simple. You know, more eyes and more hands are better than fewer. And it's been one of the cornerstones of our founding thesis, if you will. And you saw how that's gone on over course of six years we've been around. Super-excited to have someone like IBM join hands, it happened at DataWorks Summit in San Jose. That announcement, again, is a reflection of the fact that we've been very, very community driven and very, very ecosystem driven. >> Communities are fundamentally built on trust and partnering. >> Arun: Exactly >> Coding is pretty obvious, you code with your friends. You code with people who are good, they become your friends. There's an honor system among you. You're starting to see that in the corporate deals. So explain the dynamic there and some of the successes that you guys have had on the product side where one plus one equals more than two. One plus one equals five or three. >> You know IBM has been a great example. They've decided to focus on their strengths which is around Watson and machine learning and for us to focus on our strengths around data management, infrastructure, cloud and so on. So this combination of DSX, which is their data science work experience, along with Hortonworks is really powerful. We are seeing that over and over again. Just yesterday we announced the whole Dataplane thing, we were super excited about it. And now to get IBM to say, we'll get in our technologies and our IP, big data, whether it's big Quality or big Insights or big SEQUEL, and the word has been phenomenal. >> Well the Dataplane announcement, finally people who know me know that I hate the term data lake. I always said it's always been a data ocean. So I get redemption because now the data lakes, now it's admitting it's a horrible name but just saying stitching together the data lakes, Which is essentially a data ocean. Data lakes are out there and you can form these data lakes, or data sets, batch, whatever, but connecting them and integrating them is a huge issue, especially with security. >> And a lot of it is, it's also just pragmatism. We start off with this notion of data lake and say, hey, you got too many silos inside the enterprise in one data center, you want to put them together. But then increasingly, as Hadoop has become more and more mainstream, I can't remember the last time I had to explain what Hadoop is to somebody. As it has become mainstream, couple things have happened. One is, we talked about streaming data. We see all the time, especially with HTF. We have customers streaming data from autonomous cars. You have customers streaming from security cameras. You can put a small minify agent in a security camera or smart phone and can stream it all the way back. Then you get into physics. You're up against the laws of physics. If you have a security camera in Japan, why would you want to move it all the way to California and process it. You'd rather do it right there, right? So with this notion of a regional data center becomes really important. >> And that talks to the Edge as well. >> Exactly, right. So you want to have something in Japan that collects all of the security cameras in Tokyo, and you do analysis and push what you want back here, right. So that's physics. The other thing we are increasingly seeing is with data sovereignty rules especially things like GDPR, there's now regulation reasons where data has to naturally stay in different regions. Customer data from Germany cannot move to France or visa versa, right. >> Data governance is a huge issue and this is the problem I have with data governance. I am really looking for a solution so if you can illuminate this it would be great. So there is going to be an Equifax out there again. >> Arun: Oh, for sure. >> And the problem is, is that going to force some regulation change? So what we see is, certainly on the mugi bond side, I see it personally is that, you can almost see that something else will happen that'll force some policy regulation or governance. You don't want to screw up your data. You also don't want to rewrite your applications or rewrite you machine learning algorithms. So there's a lot of waste potential by not structuring the data properly. Can you comment on what's the preferred path? >> Absolutely, and that's why we've been working on things like Dataplane for almost a couple of years now. We is to say, you have to have data and policies which make sense, given a context. And the context is going to change by application, by usage, by compliance, by law. So, now to manage 20, 30, 50 a 100 data lakes, would it be better, not saying lakes, data ponds, >> [Host} Any Data. >> Any data >> Any data pool, stream, river, ocean, whatever. (laughs) >> Jacuzzis. Data jacuzzis, right. So what you want to do is want a holistic fabric, I like the term, you know Forrester uses, they call it the fabric. >> Host: Data fabric. >> Data fabric, right? You want a fabric over these so you can actually control and maintain governance and security centrally, but apply it with context. Last not least, is you want to do this whether it's on frame or on the cloud, or multi-cloud. So we've been working with a bank. They were probably based in Germany but for GDPR they had to stand up something in France now. They had French customers, but for a bunch of new reasons, regulation reasons, they had to sign up something in France. So they bring their own data center, then they had only the cloud provider, right, who I won't name. And they were great, things are working well. Now they want to expand the similar offering to customers in Asia. It turns out their favorite cloud vendor was not available in Asia or they were not available in time frame which made sense for the offering. So they had to go with cloud vendor two. So now although each of the vendors will do their job in terms of giving you all the security and governance and so on, the fact that you are to manage it three ways, one for OnFrame, one for cloud vendor A and B, was really hard, too hard for them. So this notion of a fabric across these things, which is Dataplane. And that, by the way, is based by all the open source technologies we love like Atlas and Ranger. By the way, that is also what IBM is betting on and what the entire ecosystem, but it seems like a no-brainer at this point. That was the kind of reason why we foresaw the need for something like a Dataplane and obviously couldn't be more excited to have something like that in the market today as a net new service that people can use. >> You get the catalogs, security controls, data integration. >> Arun: Exactly. >> Then you get the cloud, whatever, pick your cloud scenario, you can do that. Killer architecture, I liked it a lot. I guess the question I have for you personally is what's driving the product decisions at Hortonworks? And the second part of that question is, how does that change your ecosystem engagement? Because you guys have been very friendly in a partnering sense and also very good with the ecosystem. How are you guys deciding the product strategies? Does it bubble up from the community? Is there an ivory tower, let's go take that hill? >> It's both, because what typically happens is obviously we've been in the community now for a long time. Working publicly now with well over 1,000 customers not only puts a lot of responsibility on our shoulders but it's also very nice because it gives us a vantage point which is unique. That's number one. The second one we see is being in the community, also we see the fact that people are starting to solve the problems. So it's another elementary for us. So you have one as the enterprise side, we see what the enterprises are facing which is kind of where Dataplane came in, but we also saw in the community where people are starting to ask us about hey, can you do multi-cluster Atlas? Or multi-cluster Ranger? Put two and two together and say there is a real need. >> So you get some consensus. >> You get some consensus, and you also see that on the enterprise side. Last not least is when went to friends like IBM and say hey we're doing this. This is where we can position this, right. So we can actually bring in IGSC, you can bring big Quality and bring all these type, >> [Host} So things had clicked with IBM? >> Exactly. >> Rob Thomas was thinking the same thing. Bring in the power system and the horsepower. >> Exactly, yep. We announced something, for example, we have been working with the power guys and NVIDIA, for deep learning, right. That sort of stuff is what clicks if you're in the community long enough, if you have the vantage point of the enterprise long enough, it feels like the two of them click. And that's frankly, my job. >> Great, and you've got obviously the landscape. The waves are coming in. So I've got to ask you, the big waves are coming in and you're seeing people starting to get hip with the couple of key things that they got to get their hands on. They need to have the big surfboards, metaphorically speaking. They got to have some good products, big emphasis on real value. Don't give me any hype, don't give me a head fake. You know, I buy, okay, AI Wash, and people can see right through that. Alright, that's clear. But AI's great. We all cheer for AI but the reality is, everyone knows that's pretty much b.s. except for core machine learning is on the front edge of innovation. So that's cool, but value. [Laughs] Hey I've got the integrate and operationalize my data so that's the big wave that's coming. Comment on the community piece because enterprises now are realizing as open source becomes the dominant source of value for them, they are now really going to the next level. It used to be like the emerging enterprises that knew open source. The guys will volunteer and they may not go deeper in the community. But now more people in the enterprises are in open source communities, they are recruiting from open source communities, and that's impacting their business. What's your advice for someone who's been in the community of open source? Lessons you've learned, what is the best practice, from your standpoint on philosophy, how to build into the community, how to build a community model. >> Yeah, I mean, the end of the day, my best advice is to say look, the community is defined by the people who contribute. So, you get advice if you contribute. Which means, if that's the fundamental truth. Which means you have to get your legal policies and so on to a point that you can actually start to let your employees contribute. That kicks off a flywheel, where you can actually go then recruit the best talent, because the best talent wants to stand out. Github is a resume now. It is not a word doc. If you don't allow them to build that resume they're not going to come by and it's just a fundamental truth. >> It's self governing, it's reality. >> It's reality, exactly. Right and we see that over and over again. It's taken time but it as with things, the flywheel has changed enough. >> A whole new generation's coming online. If you look at the young kids coming in now, it is an amazing environment. You've got TensorFlow, all this cool stuff happening. It's just amazing. >> You, know 20 years ago that wouldn't happen because the Googles of the world won't open source it. Now increasingly, >> The secret's out, open source works. >> Yeah, (laughs) shh. >> Tell everybody. You know they know already but, This is changing some of the how H.R. works and how people collaborate, >> And the policies around it. The legal policies around contribution so, >> Arun, great to see you. Congratulations. It's been fun to watch the Hortonworks journey. I want to appreciate you and Rob Bearden for supporting theCUBE here in BigData NYC. If is wasn't for Hortonworks and Rob Bearden and your support, theCUBE would not be part of the Strata Data, which we are not allowed to broadcast into, for the record. O'Reilly Media does not allow TheCube or our analysts inside their venue. They've excluded us and that's a bummer for them. They're a closed organization. But I want to thank Hortonworks and you guys for supporting us. >> Arun: Likewise. >> We really appreciate it. >> Arun: Thanks for having me back. >> Thanks and shout out to Rob Bearden. Good luck and CPO, it's a fun job, you know, not the pressure. I got a lot of pressure. A whole lot. >> Arun: Alright, thanks. >> More Cube coverage after this short break. (upbeat electronic music)

Published Date : Sep 28 2017

SUMMARY :

the number three tech investment Brought to you by SiliconANGLE Media This is our event that we put on every year. Co-Founder and Chief Product Officer of Hortonworks. Thanks for having me. Boy, what a journey. You guys have been, really the first of the Hadoop players, Absolutely, you know, we've obviously been in this space, at the point of action, if you will, standing on the shoulders before us, you know. And it's been one of the cornerstones Communities are fundamentally built on that you guys have had on the product side and the word has been phenomenal. So I get redemption because now the data lakes, I can't remember the last time I had to explain and you do analysis and push what you want back here, right. so if you can illuminate this it would be great. I see it personally is that, you can almost see that We is to say, you have to have data and policies Any data pool, stream, river, ocean, whatever. I like the term, you know Forrester uses, the fact that you are to manage it three ways, I guess the question I have for you personally is So you have one as the enterprise side, and you also see that on the enterprise side. Bring in the power system and the horsepower. if you have the vantage point of the enterprise long enough, is on the front edge of innovation. and so on to a point that you can actually the flywheel has changed enough. If you look at the young kids coming in now, because the Googles of the world won't open source it. This is changing some of the how H.R. works And the policies around it. and you guys for supporting us. Thanks and shout out to Rob Bearden. More Cube coverage after this short break.

ENTITIES

Entity	Category	Confidence
Asia	LOCATION	0.99+
France	LOCATION	0.99+
Arun	PERSON	0.99+
IBM	ORGANIZATION	0.99+
Rob Bearden	PERSON	0.99+
Germany	LOCATION	0.99+
Arun Murthy	PERSON	0.99+
Japan	LOCATION	0.99+
NVIDIA	ORGANIZATION	0.99+
Tokyo	LOCATION	0.99+
2014	DATE	0.99+
California	LOCATION	0.99+
12	QUANTITY	0.99+
five	QUANTITY	0.99+
Frank Quattrone	PERSON	0.99+
three	QUANTITY	0.99+
two	QUANTITY	0.99+
Onyara	ORGANIZATION	0.99+
$64 million	QUANTITY	0.99+
Microsoft	ORGANIZATION	0.99+
San Jose	LOCATION	0.99+
O'Reilly Media	ORGANIZATION	0.99+
each	QUANTITY	0.99+
Morgan Stanley	ORGANIZATION	0.99+
Linux Foundation	ORGANIZATION	0.99+
One	QUANTITY	0.99+
fifth year	QUANTITY	0.99+
Atlas	ORGANIZATION	0.99+
20	QUANTITY	0.99+
one	QUANTITY	0.99+
Rob Thomas	PERSON	0.99+
three days	QUANTITY	0.99+
eighth year	QUANTITY	0.99+
yesterday	DATE	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
six years	QUANTITY	0.99+
Equifax	ORGANIZATION	0.99+
next year	DATE	0.99+
NYC	LOCATION	0.99+
Hortonworks	ORGANIZATION	0.99+
second part	QUANTITY	0.99+
both	QUANTITY	0.99+
Ranger	ORGANIZATION	0.99+
50	QUANTITY	0.98+
30	QUANTITY	0.98+
Yahoo	ORGANIZATION	0.98+
Strata Conference	EVENT	0.98+
DataWorks Summit	EVENT	0.98+
Hadoop	TITLE	0.98+
'15	DATE	0.97+
20 years ago	DATE	0.97+
Forrester	ORGANIZATION	0.97+
GDPR	TITLE	0.97+
second one	QUANTITY	0.97+
one data center	QUANTITY	0.97+
Github	ORGANIZATION	0.96+
about 12 years	QUANTITY	0.96+
three ways	QUANTITY	0.96+
Manhattan	LOCATION	0.95+
day two	QUANTITY	0.95+
this week	DATE	0.95+
NiFi	ORGANIZATION	0.94+
Dataplane	ORGANIZATION	0.94+
BigData	ORGANIZATION	0.94+
Hadoop World	EVENT	0.93+
billions	QUANTITY	0.93+

Day Two Kickoff | Big Data NYC

(quite music) >> I'll open that while he does that. >> Co-Host: Good, perfect. >> Man: All right, rock and roll. >> This is Robin Matlock, the CMO of VMware, and you're watching theCUBE. >> This is John Siegel of VPA Product Marketing at Dell EMC. You're watching theCUBE. >> This is Matthew Morgan, I'm the chief marketing officer at Druva and you are watching theCUBE. >> Announcer: Live from midtown Manhattan, it's theCUBE. Covering BigData New York City 2017. Brought to you by SiliconANGLE Media and its ecosystem sponsors. (rippling music) >> Hello, everyone, welcome to a special CUBE live presentation here in New York City for theCUBE's coverage of BigData NYC. This is where all the action's happening in the big data world, machine learning, AI, the cloud, all kind of coming together. This is our fifth year doing BigData NYC. We've been covering the Hadoop ecosystem, Hadoop World, since 2010, it's our eighth year really at ground zero for the Hadoop, now the BigData, now the Data Market. We're doing this also in conjunction with Strata Data, which was Strata Hadoop. That's a separate event with O'Reilly Media, we are not part of that, we do our own event, our fifth year doing our own event, we bring in all the thought leaders. We bring all the influencers, meaning the entrepreneurs, the CEOs to get the real story about what's happening in the ecosystem. And of course, we do it with our analyst at Wikibon.com. I'm John Furrier with my cohost, Jim Kobielus, who's the chief analyst for our data piece. Lead analyst Jim, you know the data world's changed. We had commenting yesterday all up on YouTube.com/SiliconAngle. Day one was really set the table. And we kind of get the whiff of what's happening, we can kind of feel the trend, we got a finger on the pulse. Two things going on, two big notable stories is the world's continuing to expand around community and hybrid data and all these cool new data architectures, and the second kind of substory is the O'Reilly show has become basically a marketing. They're making millions of dollars over there. A lot of people were, last night, kind of not happy about that, and what's giving back to the community. So, again, the community theme is still resonating strong. You're starting to see that move into the corporate enterprise, which you're covering. What are you finding out, what did you hear last night, what are you hearing in the hallways? What is kind of the tea leaves that you're reading? What are some of the things you're seeing here? >> Well, all things hybrid. I mean, first of all it's building hybrid applications for hybrid cloud environments and there's various layers to that. So yesterday on theCUBE we had, for example, one layer is hybrid semantic virtualization labels are critically important for bridging workloads and microservices and data across public and private clouds. We had, from AtScale, we had Bruno Aziza and one of his customers discussing what they're doing. I'm hearing a fair amount of this venerable topic of semantic data virtualization become even more important now in the era of hybrid clouds. That's a fair amount of the scuttlebutt in the hallway and atrium talks that I participated in. Also yesterday from BMC we had Basil Faruqi talking about basically talking about automating data pipelines. There are data pipelines in hybrid environments. Very, very important for DevOps, productionizing these hybrid applications for these new multi-cloud environments. That's quite important. Hybrid data platforms of all sorts. Yesterday we had from ActIn Jeff Veis discussing their portfolio for on-prem, public cloud, putting the data in various places, and speeding up the queries and so forth. So hybrid data platforms are going increasingly streaming in real time. What I'm getting is that what I'm hearing is more and more of a layering of these hybrid environments is a critical concern for enterprises trying to put all this stuff together, and future-proof it so they can add on all the new stuff. That's coming along like cirrus clouds, without breaking interoperability, and without having to change code. Just plug and play in a massively multi-cloud environment. >> You know, and also I'm critical of a lot of things that are going on. 'Cause to your point, the reason why I'm kind of critical on the O'Reilly show and particularly the hype factor going on in some areas is two kinds of trends I'm seeing with respect to the owners of some of the companies. You have one camp that are kind of groping for solutions, and you'll see that with they're whitewashing new announcements, this is going on here. It's really kind of-- >> Jim: I think it's AI now, by the way. >> And they're AI-washing it, but you can, the tell sign is they're always kind of doing a magic trick of some type of new announcement, something's happening, you got to look underneath that, and say where is the deal for the customers? And you brought this up yesterday with Peter Burris, which is the business side of it is really the conversation now. It's not about the speeds and feeds and the cluster management, it's certainly important, and those solutions are maturing. That came up yesterday. The other thing that you brought up yesterday I thought was notable was the real emphasis on the data science side of it. And it's that it's still not easy or data science to do their job. And this is where you're seeing productivity conversations come up with data science. So, really the emphasis at the end of the day boils down to this. If you don't have any meat on the bone, you don't have a solution that rubber hits the road where you can come in and provide a tangible benefit to a company, an enterprise, then it's probably not going to work out. And we kind of had that tool conversation, you know, as people start to grow. And so as buyers out there, they got to look, and kind of squint through it saying where's the real deal? So that kind of brings up what's next? Who's winning, how do you as an analyst look at the playing field and say, that's good, that's got traction, that's winning, mm not too sure? What's your analysis, how do you tell the winners from the losers, and what's your take on this from the data science lens? >> Well, first of all you can tell the winners when they have an ample number of referenced customers who are doing interesting things. Interesting enough to get a jaded analyst to pay attention. Doing something that changes the fabric of work or life, whatever, clearly. Solution providers who can provide that are, they have all the hallmarks of a winner meaning they're making money, and they're likely to grow and so forth. But also the hallmarks of a winner are those, in many ways, who have a vision and catalyze an ecosystem around that vision of something that could be made, possibly be done before but not quite as efficiently. So you know, for example, now the way what we're seeing now in the whole AI space, deep learning, is, you know, AI means many things. The core right now, in terms of the buzzy stuff is deep learning for being able to process real time streams of video, images and so forth. And so, what we're seeing now is that the vendors who appear to be on the verge of being winners are those who use deep learning inside some new innovation that has enough, that appeals to a potential mass market. It's something you put on your, like an app or something you put on your smart phone, or it's something you buy at Walmart, install in your house. You know, the whole notion of clearly Alexa, and all that stuff. Anything that takes chatbot technology, really deep learning powers chatbots, and is able to drive a conversational UI into things that you wouldn't normally expect to talk to you and does it well in a way that people have to have that. Those are the vendors that I'm looking for, in terms of those are the ones that are going to make a ton of money selling to a mass market, and possibly, and very much once they go there, they're building out a revenue stream and a business model that they can conceivably take into other markets, especially business markets. You know, like Amazon, 20-something years ago when they got started in the consumer space as the exemplar of web retailing, who expected them 20 years later to be a powerhouse provider of business cloud services? You know, so we're looking for the Amazons of the world that can take something as silly as a conversational UI inside of a, driven by DL, inside of a consumer appliance and 20 years from now, maybe even sooner, become a business powerhouse. So that's what's new. >> Yeah, the thing that comes up that I want to get your thoughts on is that we've seen data integration become a continuing theme. The other thing about the community play here is you start to see customers align with syndicates or partnerships, and I think it's always been great to have customer traction, but, as you pointed out, as a benchmark. But now you're starting to see the partner equation, because this isn't open, decentralized, distributed internet these days. And it is looking like it's going to form differently than they way it was, than the web days and with mobile and connected devices it IoT and AI. A whole new infrastructure's developing, so you're starting to see people align with partnerships. So I think that's something that's signaling to me that the partnership is amping up. I think the people are partnering more. We've had Hortonworks on with IBM, people are partner, some people take a Switzerland approach where they partner with everyone. You had, WANdisco partners with all the cloud guys, I mean, they have unique ITP. So you have this model where you got to go out, do something, but you can't do it alone. Open source is a key part of this, so obviously that's part of the collaboration. This is a key thing. And then they're going to check off the boxes. Data integration, deep learning is a new way to kind of dig deeper. So the question I have for you is, the impact on developers, 'cause if you can connect the dots between open source, 90% of the software written will be already open source, 10% differentiated, and then the role of how people going to market with the enterprise of a partnership, you can almost connect the dots and saying it's kind of a community approach. So that leaves the question, what is the impact to developers? >> Well the impact to developers, first of all, is when you go to a community approach, and like some big players are going more community and partnership-oriented in hot new areas like if you look at some of the recent announcements in chatbots and those technologies, we have sort of a rapprochement between Microsoft and Facebook and so forth, or Microsoft and AWS. The impact for developers is that there's convergence among the companies that might have competed to the death in particular hot new areas, like you know, like I said, chatbot-enabled apps for mobile scenarios. And so it cuts short the platform wars fairly quickly, harmonizes around a common set of APIs for accessing a variety of competing offerings that really overlap functionally in many ways. For developers, it's simplification around a broader ecosystem where it's not so much competition on the underlying open source technologies, it's now competition to see who penetrates the mass market with actually valuable solutions that leverage one or more of those erstwhile competitors into some broader synthesis. You know, for example, the whole ramp up to the future of self-driving vehicles, and it's not clear who's going to dominate there. Will it be the vehicle manufacturers that are equipping their cars with all manner of computerized everything to do whatnot? Or will it be the up-and-comers? Will it be the computer companies like Apple and Microsoft and others who get real deep and invest fairly heavily in self-driving vehicle technology, and become themselves the new generation of automakers in the future? So, what we're getting is that going forward, developers want to see these big industry segments converge fairly rapidly around broader ecosystems, where it's not clear who will be the dominate player in 10 years. The developers don't really care, as long as there is consolidation around a common framework to which they can develop fairly soon. >> And open source is obviously a key role in this, and how is deep learning impacting some of the contributions that are being made, because we're starting to see the competitive advantage in collaboration on the community side is with the contributions from companies. For example, you mentioned TensorFlow multiple times yesterday from Google. I mean, that's a great contribution. If you're a young kind coming into the developer community, I mean, this is not normal. It wasn't like this before. People just weren't donating massive libraries of great stuff already pre-packaged, So all new dynamics emerging. Is that putting pressure on Amazon, is that putting pressure on AWS and others? >> It is. First of all, there is a fair amount of, I wouldn't call it first-mover advantage for TensorFlow, there've been a number of DL toolkits on the market, open source, for the last several years. But they achieved the deepest and broadest adoption most rapidly, and now they are a, TensorFlow is essentially a defacto standard in the way, that we just go back, betraying my age, 30, 40 years ago where you had two companies called SAS and SPSS that quickly established themselves as the go-to statistical modeling tools. And then they got a generation, our generation, of developers, or at least of data scientists, what became known as data scientists, to standardize around you're either going to go with SAS or SPSS if you're going to do data mining. Cut ahead to the 2010s now. The new generation of statistical modelers, it's all things DL and machine learning. And so SAS versus SPSS is ages ago, those companies are, those products still exist. But now, what are you going to get hooked on in school? What are you going to get hooked on in high school, for that matter, when you're just hobby-shopping DL? You'll probably get hooked on TensorFlow, 'cause they have the deepest and the broadest open source community where you learn this stuff. You learn the tools of the trade, you adopt that tool, and everybody else in your environment is using that tool, and you got to get up to speed. So the fact is, that broad adoption early on in a hot new area like DL, means tons. It means that essentially TensorFlow is the new Spark, where Spark, you know, once again, Spark just in the past five years came out real fast. And it's been eclipsed, as it were, on the stack of cool by TensorFlow. But it's a deepening stack of open source offerings. So the new generation of developers with data science workbenches, they just assume that there's Spark, and they're going to increasingly assume that there's TensorFlow in there. They're going to increasingly assume that there are the libraries and algorithms and models and so forth that are floating around in the open source space that they can use to bootstrap themselves fairly quickly. >> This is a real issue in the open source community which we talked, when we were in LA for the Open Source Summit, was exactly that. Is that, there are some projects that become fashionable, so for example, a cloud-native foundation, very relevant but also hot, really hot right now. A lot of people are jumping on board the cloud natives bandwagon, and rightfully so. A lot of work to be done there, and a lot of things to harvest from that growth. However, the boring blocking and tackling projects don't get all the fanfare but are still super relevant, so there's a real challenge of how do you nurture these awesome projects that we don't want to become like a nightclub where nobody goes anymore because it's not fashionable. Some of these open source projects are super important and have massive traction, but they're not as sexy, or flair-ish as some of that. >> Dl is not as sexy, or machine learning, for that matter, not as sexy as you would think if you're actually doing it, because the grunt work, John, as we know for any statistical modeling exercise, is data ingestion and preparation and so forth. That's 75% of the challenge for deep learning as well. But also for deep learning and machine learning, training the models that you build is where the rubber meets the road. You can't have a really strongly predictive DL model in terms of face recognition unless you train it against a fair amount of actual face data, whatever it is. And it takes a long time to train these models. That's what you hear constantly. I heard this constantly in the atrium talking-- >> Well that's a data challenge, is you need models that are adapting and you need real time, and I think-- >> Oh, here-- >> This points to the real new way of doing things, it's not yesterday's model. It's constantly evolving. >> Yeah, and that relates to something I read this morning or maybe it was last night, that Microsoft has made a huge investment in AI and deep learning machinery. They're doing amazing things. And one of the strategic advantages they have as a large, established solution provider with a search engine, Bing, is that from what I've been, this is something I read, I haven't talked to Microsoft in the last few hours to confirm this, that Bing is a source of training data that they're using for machine learning and I guess deep learning modeling for their own solutions or within their ecosystem. That actually makes a lot of sense. I mean, Google uses YouTube videos heavily in its deep learning for training data. So there's the whole issue of if you're a pipsqueak developer, some, you know, I'm sorry, this sounds patronizing. Some pimply-faced kid in high school who wants to get real deep on TensorFlow and start building and tuning these awesome kickass models to do face recognition, or whatever it might be. Where are you going to get your training data from? Well, there's plenty of open source database, or training databases out there you can use, but it's what everybody's using. So, there's sourcing the training data, there's labeling the training data, that's human-intensive, you need human beings to label it. There was a funny recent episode, or maybe it was a last-season episode of Silicone Valley that was all about machine learning and building and training models. It was the hot dog, not hot dog episode, it was so funny. They bamboozle a class on the show, fictionally. They bamboozle a class of college students to provide training data and to label the training data for this AI algorithm, it was hilarious. But where are you going to get the data? Where are you going to label it? >> Lot more work to do, that's basically what you're getting at. >> Jim: It's DevOps, you know, but it's grunt work. >> Well, we're going to kick off day two here. This is the SiliconeANGLE Media theCUBE, our fifth year doing our own event separate from O'Reilly media but in conjunction with their event in New York City. It's gotten much bigger here in New York City. We call it BigData NYC, that's the hashtag. Follow us on Twitter, I'm John Furrier, Jim Kobielus, we're here all day, we've got Peter Burris joining us later, head of research for Wikibon, and we've got great guests coming up, stay with us, be back with more after this short break. (rippling music)

Published Date : Sep 27 2017

SUMMARY :

This is Robin Matlock, the CMO of VMware, This is John Siegel of VPA Product Marketing This is Matthew Morgan, I'm the chief marketing officer Brought to you by SiliconANGLE Media What is kind of the tea leaves that you're reading? That's a fair amount of the scuttlebutt I'm kind of critical on the O'Reilly show is really the conversation now. Doing something that changes the fabric So the question I have for you is, the impact on developers, among the companies that might have competed to the death and how is deep learning impacting some of the contributions You learn the tools of the trade, you adopt that tool, and a lot of things to harvest from that growth. That's 75% of the challenge for deep learning as well. This points to the in the last few hours to confirm this, that's basically what you're getting at. This is the SiliconeANGLE Media theCUBE,

ENTITIES

Entity	Category	Confidence
Jim Kobielus	PERSON	0.99+
Robin Matlock	PERSON	0.99+
Apple	ORGANIZATION	0.99+
Microsoft	ORGANIZATION	0.99+
IBM	ORGANIZATION	0.99+
Matthew Morgan	PERSON	0.99+
Basil Faruqi	PERSON	0.99+
Jim	PERSON	0.99+
John Siegel	PERSON	0.99+
O'Reilly Media	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
John Furrier	PERSON	0.99+
yesterday	DATE	0.99+
90%	QUANTITY	0.99+
Peter Burris	PERSON	0.99+
two companies	QUANTITY	0.99+
New York City	LOCATION	0.99+
SPS	ORGANIZATION	0.99+
SAS	ORGANIZATION	0.99+
AWS	ORGANIZATION	0.99+
John	PERSON	0.99+
75%	QUANTITY	0.99+
LA	LOCATION	0.99+
Silicone Valley	TITLE	0.99+
Facebook	ORGANIZATION	0.99+
10%	QUANTITY	0.99+
Walmart	ORGANIZATION	0.99+
2010s	DATE	0.99+
YouTube	ORGANIZATION	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
AtScale	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
10 years	QUANTITY	0.99+
WANdisco	ORGANIZATION	0.99+
Jeff Veis	PERSON	0.99+
fifth year	QUANTITY	0.99+
one	QUANTITY	0.99+
Yesterday	DATE	0.99+
Dell EMC	ORGANIZATION	0.99+
VMware	ORGANIZATION	0.99+
eighth year	QUANTITY	0.99+
BigData	ORGANIZATION	0.99+
millions of dollars	QUANTITY	0.99+
Bing	ORGANIZATION	0.99+
BMC	ORGANIZATION	0.98+
Amazons	ORGANIZATION	0.98+
last night	DATE	0.98+
two kinds	QUANTITY	0.98+
Spark	TITLE	0.98+
Hortonworks	ORGANIZATION	0.98+
Day one	QUANTITY	0.98+
20 years later	DATE	0.98+
VPA	ORGANIZATION	0.98+
2010	DATE	0.98+
ActIn	ORGANIZATION	0.98+
Open Source Summit	EVENT	0.98+
one layer	QUANTITY	0.98+
Druva	ORGANIZATION	0.97+
Alexa	TITLE	0.97+
day two	QUANTITY	0.97+
Bruno Aziza	PERSON	0.97+
SPSS	TITLE	0.97+
Switzerland	LOCATION	0.97+
Two things	QUANTITY	0.96+
NYC	LOCATION	0.96+
Wikibon	ORGANIZATION	0.96+
30	DATE	0.95+
Wikibon.com	ORGANIZATION	0.95+
SiliconeANGLE Media	ORGANIZATION	0.95+
O'Reilly	ORGANIZATION	0.95+

Day One Kickoff | BigData NYC 2017

(busy music) >> Announcer: Live from Midtown Manhattan, it's the Cube, covering Big Data New York City 2017, brought to you by SiliconANGLE Media and its ecosystem sponsors. >> Hello, and welcome to the special Cube presentation here in New York City for Big Data NYC, in conjunction with all the activity going on with Strata, Hadoop, Strata Data Conference right around the corner. This is the Cube's special annual event in New York City where we highlight all the trends, technology experts, thought leaders, entrepreneurs here inside the Cube. We have our three days of wall to wall coverage, evening event on Wednesday. I'm John Furrier, the co-host of the Cube, with Jim Kobielus, and Peter Burris will be here all week as well. Kicking off day one, Jim, the monster week of Big Data NYC, which now has turned into, essentially, the big data industry is a huge industry. But now, subsumed within a larger industry of AI, IoT, security. A lot of things have just sucked up the big data world that used to be the Hadoop world, and it just kept on disrupting, and creative disruption of the old guard data warehouse market, which now, looks pale in comparison to the disruption going on right now. >> The data warehouse market is very much vibrant and alive, as is the big data market continuing to innovate. But the innovations, John, have moved up the stack to artificial intelligence and deep learning, as you've indicated, driving more of the Edge applications in the new generation of mobile and smart appliances and things that are coming along like smart, self-driving vehicles and so forth. What we see is data professionals and developers are moving towards new frameworks, like TensorFlow and so forth, for development of the truly disruptive applications. But big data is the foundation. >> I mean, the developers are the key, obviously, open source is growing at an enormous rate. We just had the Linux Foundation, we now have the Open Source Summit, they have kind of rebranded that. They're going to see explosion from code from 64 million lines of code to billions of lines of code, exponential growth. But the bigger picture is that it's not just developers, it's the enterprises now who want hybrid cloud, they want cloud technology. I want to get your reaction to a couple of different threads. One is the notion of community based software, which is open source, extending into the enterprise. We're seeing things like blockchain is hot right now, security, two emerging areas that are overlapping in with big data. You obviously have classic data market, and then you've got AI. All these things kind of come in together, kind of just really putting at the center of all that, this core industry around community and software AI, particular. It's not just about machine learning anymore and data, it's a bigger picture. >> Yeah, in terms of a community, development with open source, much of what we see in the AI arena, for example, with the up and coming, they're all open source tools. There's TensorFlow, there's Cafe, there's Theano and so forth. What we're seeing is not just the frameworks for developing AI that are important, but the entire ecosystem of community based development of capabilities to automate the acquisition of training data, which is so critically important for tuning AI, for its designated purpose, be it doing predictions and abstractions. DevOps, what are coming into being are DevOps frameworks to span the entire life cycle of the creation and the training and deployment and iteration of AI. What we're going to see is, like at the last Spark Summit, there was a very interesting discussion from a Stanford researcher, new open source tools that they're developing out in, actually, in Berkeley, I understand, for, related to development of training data in a more automated fashion for these new challenges. The communities are evolving up the stack to address these requirements with fairly bleeding edge capabilities that will come in the next few years into the mainstream. >> I had a chat with a big time CTO last night, he worked at some of the big web scale company, I won't say the name, give it away. But basically, he asked me a question about IoT, how real is it, and obviously, it's hyped up big time, though. But the issue in all this new markets like IoT and AI is the role of security, because a lot of enterprises are looking at the IoT, certainly in the industrial side has the most relevant low hanging fruit, but at the end of the day, the data modeling, as you're pointing out, becomes a critical thing. Connecting IoT devices to, say, an IP network sounds trivial in concept, but at the end of the day, the surface area for security is oak expose, that's causing people to stop what they're doing, not deploying it as fast. You're seeing kind of like people retrenching and replatforming at the core data centers, and then leveraging a lot of cloud, which is why Azure is hot, Microsoft Ignite Event is pretty hot this week. Role of cloud, role of data in IoT. Is IoT kind of stalled in your mind? Or is it bloating? >> I wouldn't say it's stalled or that it's bloating, but IoT is definitely coming along as the new development focus. For the more disruptive applications that can derive more intelligence directly to the end points that can take varying degrees of automated action to achieve results, but also to very much drive decision support in real time to people on their mobiles or in whatever. What I'm getting at is that IoT is definitely a reality in the real world in terms of our lives. It's definitely a reality in terms of the index generation of data applications. But there's a lot of the back end in terms of readying algorithms and in training data for deployment of really high quality IoT applications, Edge applications, that hasn't come together yet in any coherent practice. >> It's emerging, it's emerging. >> It's emerging. >> It's a lot more work to do. OK, we're going to kick off day one, we've got some great guests, we see Rob Bearden in the house, Rob Thomas from IBM. >> Rob Bearden from Hortonworks. >> Rob Bearden from Hortonworks, and Rob Thomas from IBM. I want to bring up, Rob wrote a book just recently. He wrote Big Data Revolution, but he also wrote a new book called, Every Company is a Tech Company. But he mentions, he kind of teases out this concept of a renaissance, so I want to get your thoughts on this. If you look at Strata, Hadoop, Strata Data, the O'Reilly Conference, which has turned into like a marketing machine, right. A lot of hype there. But as the community model grows up, you're starting to see a renaissance of real creative developers, you're starting to see, not just open source, pure, full stack developers doing all the heavy lifting, but real creative competition, in a renaissance, that's really the key. You're seeing a lot more developer action, tons outside of the, what was classically called the data space. The role of data and how it relates to the developer phenomenon that's going on right now. >> Yeah, it's the maker culture. Rob, in fact, about a year or more ago, IBM, at one of their events, they held a very maker oriented event, I think they called it Datapalooza at one point. What it's looking at, what's going on is it's more than just classic software developers are coming to the fore. When you're looking at IoT or Edge applications, it's hardware developers, it's UX developers, it's developers and designers who are trying to change and drive data driven applications into changing the very fabric of how things are done in the real world. What Peter Burris, we had a wiki about him called Programming in the Real World. What that all involves is there's a new set of skill sets that are coming together to develop these applications. It's well beyond just simply software development, it's well beyond simply data scientists. Maker culture. >> Programming in the real world is a great concept, because you need real time, which comes back down to this. I'm looking for this week from the guests we talked to, what their view is of the data market right now. Because if you want to get real time, you've got to move from that batch world to the real time world. I'm not saying batch is over, you've still got to store data, and that's growing at an exponential rate as well. But real time data, how do you use data in real time, how do the modelings work, how do you scale that. How do you take a DevOps culture to the data world is what I'm looking for. What are you looking for this week? >> What I'm looking for this week, I'm looking for DevOps solutions or platforms or environments for teams of data scientists who are building and training and deploying and evaluating, iterating deep learning and machine learning and natural language processing applications in a continuous release pipeline, and productionizing them. At Wikibon, we are going deeper in that whole notion of DevOps for data science. I mean, IBM's called it inside ops, others call it data ops. What we're seeing across the board is that more and more of our customers are focusing on how do we bring it all together, so the maker culture. >> Operationalizing it. >> Operationalizing it, so that the maker cultures that they have inside their value chain can come together and there's a standard pattern workflow of putting this stuff out and productionizing it, AI productionized in the real world. >> Moving in from the proof of concept notion to actually just getting things done, putting it out in the network, and then bringing it to the masses with operational support. >> Right, like the good folks at IBM with Watson data platform, on some levels, is a DevOPs for data science platform, but it's a collaborative environment. That's what I'm looking to see, and there's a lot of other solution providers who are going down that road. >> I mean, to me, if people have the community traction, that is the new benchmark, in my opinion. You heard it here on the Cube. Community continues to scale, you can start seeing it moving out of open source, you're seeing things like blockchain, you're seeing a decentralized Internet now happening everywhere, not just distributed but decentralized. When you have decentralization, community and software really shine. It's the Cube here in New York City all week. Stay with us for wall to wall coverage through Thursday here in New York City for Big Data NYC, in conjunction with Strata Data, this is the Cube, we'll be back with more coverage after this short break. (busy music) (serious electronic music) (peaceful music) >> Hi, I'm John Furrier, the Co-founder of SiliconANGLE Media, and Co-host of the Cube. I've been in the tech business since I was 19, first programming on mini computers in a large enterprise, and then worked at IBM and Hewlett Packard, a total of nine years in the enterprise, various jobs from programming, training, consulting, and ultimately, as an executive sales person, and then started my first company in 1997, and moved to Silicon Valley in 1999. I've been here ever since. I've always loved technology, and I love covering emerging technology. I was trained as a software developer and love business. I love the impact of software and technology to business. To me, creating technology that starts a company and creates value and jobs is probably one of the most rewarding things I've ever been involved in. I bring that energy to the Cube, because the Cube is where all the ideas are, and where the experts are, where the people are. I think what's most exciting about the Cube is that we get to talk to people who are making things happen, entrepreneurs, CEO of companies, venture capitalists, people who are really, on a day in and day out basis, building great companies. In the technology business, there's just not a lot real time live TV coverage, and the Cube is a non-linear TV operation. We do everything that the TV guys on cable don't do. We do longer interviews, we ask tougher questions. We ask, sometimes, some light questions. We talk about the person and what they feel about. It's not prompted and scripted, it's a conversation, it's authentic. For shows that have the Cube coverage, it makes the show buzz, it creates excitement. More importantly, it creates great content, great digital assets that can be shared instantaneously to the world. Over 31 million people have viewed the Cube, and that is the result of great content, great conversations. I'm so proud to be part of the Cube with a great team. Hi, I'm John Furrier, thanks for watching the Cube. >> Announcer: Coming up on the Cube, Tekan Sundar, CTO of Wine Disco. Live Cube coverage from Big Data NYC 2017 continues in a moment. >> Announcer: Coming up on the Cube, Donna Prlich, Chief Product Officer at Pentaho. Live Cube coverage from Big Data New York City 2017 continues in a moment. >> Announcer: Coming up on the Cube, Amit Walia, Executive Vice President and Chief Product Officer at Informatica. Live Cube coverage from Big Data New York City continues in a moment. >> Announcer: Coming up on the Cube, Prakash Nodili, Co-founder and CEO of Pexif. Live Cube coverage from Big Data New York City continues in a moment. (serious electronic music)

Published Date : Sep 27 2017

SUMMARY :

it's the Cube, covering Big Data New York City 2017, and creative disruption of the old guard as is the big data market continuing to innovate. kind of just really putting at the center of all that, and the training and deployment and iteration of AI. and replatforming at the core data centers, in the real world in terms of our lives. It's a lot more work to do. in a renaissance, that's really the key. in the real world. Programming in the real world is a great concept, so the maker culture. Operationalizing it, so that the maker cultures Moving in from the proof of concept notion Right, like the good folks at IBM that is the new benchmark, in my opinion. and that is the result of great content, continues in a moment. continues in a moment. continues in a moment. Prakash Nodili, Co-founder and CEO of Pexif.

ENTITIES

Entity	Category	Confidence
Jim Kobielus	PERSON	0.99+
Donna Prlich	PERSON	0.99+
Rob Bearden	PERSON	0.99+
Amit Walia	PERSON	0.99+
Rob Thomas	PERSON	0.99+
Peter Burris	PERSON	0.99+
Prakash Nodili	PERSON	0.99+
IBM	ORGANIZATION	0.99+
John Furrier	PERSON	0.99+
Jim	PERSON	0.99+
1997	DATE	0.99+
Berkeley	LOCATION	0.99+
Silicon Valley	LOCATION	0.99+
1999	DATE	0.99+
Hewlett Packard	ORGANIZATION	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
Thursday	DATE	0.99+
New York City	LOCATION	0.99+
John	PERSON	0.99+
nine years	QUANTITY	0.99+
Hortonworks	ORGANIZATION	0.99+
Wednesday	DATE	0.99+
Rob	PERSON	0.99+
Pexif	ORGANIZATION	0.99+
Tekan Sundar	PERSON	0.99+
Linux Foundation	ORGANIZATION	0.99+
first company	QUANTITY	0.99+
first	QUANTITY	0.99+
three days	QUANTITY	0.99+
Wikibon	ORGANIZATION	0.99+
Datapalooza	EVENT	0.99+
64 million lines	QUANTITY	0.98+
NYC	LOCATION	0.98+
Midtown Manhattan	LOCATION	0.98+
Big Data	ORGANIZATION	0.98+
19	QUANTITY	0.98+
this week	DATE	0.97+
Over 31 million people	QUANTITY	0.97+
Spark Summit	EVENT	0.97+
last night	DATE	0.97+
Open Source Summit	EVENT	0.97+
Strata	EVENT	0.96+
One	QUANTITY	0.96+
Programming in the Real World	TITLE	0.96+
Big Data	EVENT	0.96+
Informatica	ORGANIZATION	0.96+
day one	QUANTITY	0.96+
Strata Data	ORGANIZATION	0.95+
two emerging areas	QUANTITY	0.95+
billions of lines	QUANTITY	0.93+
Microsoft	ORGANIZATION	0.93+
TensorFlow	TITLE	0.92+
Strata Data Conference	EVENT	0.92+
Day One	QUANTITY	0.92+
Live Cube	COMMERCIAL_ITEM	0.92+
Cube	ORGANIZATION	0.91+
Every Company is a Tech Company	TITLE	0.9+
Azure	TITLE	0.9+
about a year or more ago	DATE	0.9+
Cube	COMMERCIAL_ITEM	0.9+
2017	EVENT	0.89+
Wine Disco	ORGANIZATION	0.89+
Big Data Revolution	TITLE	0.88+
Strata	ORGANIZATION	0.88+
Theano	TITLE	0.88+
Watson	ORGANIZATION	0.85+
DevOps	TITLE	0.84+
Ignite Event	EVENT	0.84+

Day One Wrap | BigData NYC 2017

>> Announcer: Live from midtown Manhattan, it's theCUBE covering BigData New York City 2017. Brought to you by SiliconANGLE Media, and its ecosystem sponsors. >> Hello everyone, welcome back to our day one, at Big Data NYC, of three days of wall to wall coverage. This is theCUBE. I'm John Furrier, with my co-hosts Jim Kobielus and Peter Burris. We do this event every year, this is theCUBE's BigData NYC. It's our event that we run in New York City. We have a lot of great content, we have theCUBE going live, we don't go to Strata anymore. We do our own event in conjunction, they have their own event. You can go pay over there and get the booth space, but we do our media event and attract all the influencers, the VIPs, the executives, the entrepreneurs, we've been doing it for five years, we're super excited, and thank our sponsors for allowing us to get here and really appreciate the community for continuing to support theCUBE. We're here to wrap up day one what's going on in New York, certainly we've had a chance to check out the Strata situations, Strata Data, which is Cloudera, and O'Reilly, mainly O'Reilly media, they run that, kind of old school event, guys. Let's kind of discuss the impact of the event in context to the massive growth that's going outside of their event. And their event is a walled garden, you got to pay to get in, they're very strict. They don't really let a lot of people in, but, okay. Outside of that the event it going global, the activity around big data is going global. It's more than Hadoop, we certainly thought about that's old news, but what's the big trend this year? As the horizontally scalable cloud enters the equation. >> I think the big trend, John, is the, and we've talked about in our research, is that we have finally moved away from big data, being associated with a new type of infrastructure. The emergence of AI, deep learning, machine learning, cognitive, all these different names for relatively common things, are an indications that we're starting to move up into people thinking about applications, people thinking about services they can use to get access, or they can get access to build their applications. There's not enough skills. So I think that's probably the biggest thing is that the days of failure being measured by whether or not you can scale your cluster up, are finally behind us. We're using the cloud, other resources, we have enough expertise, the technologies are becoming simpler and more straightforward to do that. And now we're thinking about how we're going to create value out of all of this, which is how we're going to use the data to learn something new about what we're doing in the organization, combine it with advanced software technologies that actually dramatically reduce the amount of work that's necessary to make a decision. >> And the other trend I would say, on top of that, just to kind of put a little cherry on top of that, kind of the business focus which is again, not the speeds and feeds, although under the hood, lot of great innovation going on from deep learning, and there's a ton of stuff. However, the conversation is the business value, how it's transforming work and, but the one thing that nobody's talking about is, this is why I'm not bullish on these one shows, one show meets all kind of thing like O'Reilly Media does, because there's multiple personas in a company now in the ecosystem. There are now a variety of buyers of some products. At least in the old days, you'd go talk to the IT CIO and you're in. Not anymore. You have an analytics person, a Chief Data Officer, you might have an IT person, you might have a cloud person. So you're seeing a completely broader set of potential buyers that are driving the change. We heard Paxata talk about that. And this is a dynamic. >> Yeah, definitely. We see a fair amount of, what I'm sensing about Strata, how it's evolving these big top shows around data, it's evolving around addressing a broader, what we call maker culture. It's more than software developers. It's business analysts, it's the people who build the hardware for the internet of things into which AI and machine learning models are being containerized and embedded. I've, you know, one of the takeaways from today so far, and the keynotes are tomorrow at Strata, but I've been walking the atrium at the Javits Center having some interesting conversations, in addition, of course, to the ones we've been having here at theCUBE. And what I'm notic-- >> John: What are those hallway conversations that you're having? >> Yeah. >> What's going on over there? >> Yeah, what I've, the conversations I've had today have been focused on, the chief trend that I'm starting to sense here is that the productionization of the machine learning development process or pipeline, is super hot. It spans multiple data platforms, of course. You've got a bit of Hadoop in the refinery layer, you've got a bit of in-memory columnar databases, like the Act In discussed at their own, but the more important, not more important, but just as important is that what users are looking at is how can we build these DevOps pipelines for continuous management of releases of machine learning models for productionization, but also for ongoing evaluation and scoring and iteration and redeployment into business applications. You know there's, I had conversations with Mapbar, I had conversations with IBM, I mean, these were atrium conversations about things that they are doing. IBM had an announcement today on the wires and so forth with some relevance to that. And so I'm seeing a fair, I'm hearing, I'm sensing a fair amount of It's The Apps, it's more than just Hadoop. But it's very much the flow of these, these are the core pieces, like AI, core pieces of intellectual property in the most disruptive applications that are being developed these days in all manner, in business and industry in the consumer space. >> So I did not go over to the show floor yet, I've not been over to the Atrium. But, I'll bet you dollars to donuts this is indicative of something that always happens in a complex technology environment. And again, this is something we've thought about particularly talked about here on theCUBE, in fact we talked to Paxata about it a little bit as well. And that is, as an organization gains experience, it starts to specialize. But there's always moments, there' always inflection points in the process of gaining that experience. And by that, or one of the indications of that is that you end up with some people starting to specialize, but not quite sure what they're specializing in yet. And I think that's one of the things that's happening right now is that the skills gap is significant. At the same time that the skills gap is being significant, we're seeing people start to declare their specializations that they don't have skills, necessarily, to perform yet. And the tools aren't catching up. So there's still this tension model, open source, not necessarily focusing on the core problem. Skills looking for tools, and explosion in the number of tools out there, not focused on how you simplify, streamline, and put into operation. How all these things work together. It's going to be an interesting couple of years, but the good news, ultimately, is that we are starting to see for the first time, even on theCUBE interviews today, the emergence of a common language about how we think about the characteristics of the problem. And I think that that heralds a new round of experience and a new round of thinking about what is all the business analysts, the data scientists, the developer, the infrastructure person, business person. >> You know, you bring up that comment, those comments, about the specialists and the skills. We talked, Jim and I talked on the segment this morning about tool shed. We're talking about there are so many tools out there, and everyone loves a good tool, a hammer. But the old expression is if you're a hammer, everything looks like a nail, that's cliche. But what's happened is there are a plethora of tools, right, and tools are good. Platforms are better. As people start to replatformize everything they could have too many tools. So we asked the C Chief Data Officer, he goes yeah, I try to manage the tool tsunami, but his biggest issue was he buys a hammer, and it turns into a lawnmower. That's a vendor mentality of-- >> What a truck. Well, but that's a classic example of what I'm talking about. >> Or someone's trying to use a hammer to mow the lawn right? Again, so this is what you're getting at. >> Yeah! >> The companies out there are groping for relevance, and that's how you can see the pretenders from the winners. >> Well, a tool, fundamentally, is pedagogical. A tool describes the way work is going to be performed, and that's been a lot of what's been happening over the course of the past few years. Now, businesses that get more experience, they're describing their own way of thinking throughout a problem. And they're still not clear on how to bring the tools together because the tools are being generated, put into the marketplace by an expanding array of folks and companies, and they're now starting to shuffle for position. But I think ultimately, what we're going to see happen over the next year and I think this is an inflection point, going back to this big tent notion, is the idea that ultimately we are going to see greater specialization over the next few years. My guess is that this year will probably, should get better, or should get bigger, I'm not certain it will because it's focused on the problems that we already solved and not moving into the problems that we need to focus on. >> Yeah, I mean, a lot of the problems I have with the O'Reilly show is that they try to throw default leadership out there, and there's some smart people that go to that, but the problem is is that it's too monetization, they try to make too much money from the event when this action's happening. And this is where the tool becomes, the hammer becomes a lawnmower, because what's happening is that the vendor's trying to stay alive. And you mentioned this earlier, to your point, the customers that are buyers of the technology don't want to have something that's not going to be a fit, that's going to be agile from us. They don't want the hammer that they bought to turn into something that they didn't buy it for. And sometimes, teams can't make that leap, skillset-wise, to literally pivot overnight. Especially as a startup. So this is where the selection of the companies makes a big difference. And a lot of the clients, a lot of customers that we're serving on the end user side are reaching the conclusion that the tools themselves, while important, are clearly not where the value is. The value is in how they put them together for their business. And that's something that's going to have to, again, that's a maturation process, roles, responsibilities, the chief data officer, they're going to have a role in that or not, but ultimately, they're going to have to start finding their pipelines, their process for ingestion out to analysis. >> Let me get your reaction, you guys, your reactions to this tape. Because one of the things that I heard today, and I think this validates a bigger trend as we talk about the landscape of the markup from the event to how people are behaving and promoting and building products and companies. The pattern that I'm hearing, we said it multiple times on theCUBE today and one from the guy who's basically reading the script, is, in his interview, explaining 'cause it's so factual, I asked him the straight-up question, how do you deal with suppliers? What's happening is the trend is don't show me sizzle. I want to see the steak. Don't sell me hype, I got too many business things to work on right now, I need to nail down some core things. I got application development, I got security to build out big time, and then I got all those data channels that I need, I don't have time for you to sell me a hammer that might not be a hammer in the future! So I need real results, I need real performance that's going to have a business impact. That is the theme, and that trumps the hype. I see that becoming a huge thing right now. Your thoughts, reactions, guys-- >> Well I'll start-- >> What's your reaction then? True or false on the trend? Be-- >> Peter: True! >> Get down to business. >> I'll say that much, true, but go ahead. >> I'll say true as well, but let me just add some context. I think a show like O'Reilly Strata is good up to a point, especially to catalyze an industry, a growing industry like big data's own understanding of it, of the value that all these piece parts, Hadoop and Spark and so forth, can add, can provide when deployed in a unit according to some emerging patterns, whatever. But at a certain point where a space like this becomes well-established, it just becomes a pure marketing event. And customers, at a certain point say, you know, I come here for ideas about things that I can do in my environ, my business, that could actually many ways help me to do new things. You know, you can't get that at a marketing-oriented, you can get that, as a user, more at a research-oriented show. When it's an emerging market, like let's say Spark has been, like the Spark Summit was in the beginning, those are kind of like, when industries go through the phase those are sort of in the beginning, sort of research-focused shows where industry, the people who are doing the development of this new architecture, they talk ideas. Now I think in 2017, where we're at now, is what the idea is everybody's trying to get their heads around, they're all around AI, what the heck that is. For a show like an O'Reilly Ready show to have relevance in a market that's in this much ferment of really innovation around AI and deep learning, there needs to be a core research focus that you don't get at this point in the lifecycle of Strata, for example. So that's my take on what's going on. >> So, my take is this. And first of all, I agree with everything you said, so it's not in opposition to anything. Many years ago I had this thought that I think still is very true. And that is the value of industry, the value of infrastructure is inversely correlated with the degree to which anybody knows anything about it. So if I know a lot about my infrastructure, it's not creating a lot of business value. In fact, more often than not, it's not working, which is why people end up knowing more about it. But the problem is, the way that technology has always been sold is as a differentiated, some sort of value-add thing. So you end up with this tension. And this is an application domain, a very, very complex application domain like big data. The tension is, my tool is so great that, and it's differentiating all those other stuff, yeah but it becomes valuable to me if and only if nobody knows it exists. So I think, and one of the reasons why I bring this up, John, is many of the companies that are in the big data space today that are most successful are companies that are positioning themselves as a service. There's a lot of interesting SaaS applications for big data analysis, pipeline management, all the other things you can talk about, that are actually being rendered as a service, and not as a product. So that all you need to know is what the tool does. You don't need to know the tool. And I don't know that that's necessarily going to last, but I think it's very, very interesting that a lot of the more successful companies that we're talking to are themselves mere infrastructure SaaS companies. >> Because-- >> AtScale is interesting, though. They came in as a service. But their service has an interesting value proposition. They can allow you to essentially virtualize the data to play with it, so people can actually sandbox data. And if it gets traction, they can then double-down on it. So to me that's a freebie. To me, I'm a customer, I got to love that kind of environment because you're essentially giving almost a developer-like environment-- >> Peter: Value without necessarily-- >> Yeah, the cost, and the guy gets the signal from the marketplace, his customer, of what data resolves. To me that's a very cool scene. I don't, you saying that's bad, or? >> No, no, I think it's interesting. I think it's-- >> So you're saying service is-- >> So what I'm saying is, what I'm saying is, that the value of infrastructure is inversely proportional to the degree to which anybody knows anything about it. But you've got a bunch of companies who are selling, effectively, infrastructure software, so it's a value-add thing, and that creates a problem. And a lot of other companies not only have the ability to sell something as a service as opposed to a product, they can put the service froward, and people are using the service and getting what they need out of it without knowing anything about the tool. >> I like that. Let me just maybe possibly restate what you just said. When a market goes toward a SaaS go-to-market delivery model for solutions, the user, the buyer's focus is shifted away from what the solution can do, I mean, how it works under the cover. >> Peter: Quote, value-add-- >> To what it can do potentially for you. >> The business, that's right. >> But you're not going to, don't get distracted by the implementation details. You have then as a user become laser-focused on, wow, there's a bunch of things that this can do for me. I don't care how it works, really. You SaaS provider, you worry about that stuff. I can worry now about somehow extracting the value. I'm not distracted. >> This show, or this domain, is one of the domains where SaaS has moved, just as we're thinking about moving up the stack, the SaaS business model is moving down the stack in the big data world. >> All right, so, in summary, the stack is changing. Predictions for the next few days. What are we going to see come out of Strata Data, and our BigData NYC? 'Cause remember, this show was always a big hit, but it's very clear from the data on our dashboards, we're seeing all the social data. Microsoft Ignite is going on, and Microsoft Azure, just in the past few years, has burst on the scene. Cloud is sucking the oxygen out of the big data event. Or is it? >> I doubt it was sucking it out of the event, but you know, theCUBE is in, theCUBE is not at Ignite. Where's theCUBE right now? >> John: BigData NYC. >> No, it's here, but it's also at the Splunk show. >> John: That's true. >> And isn't it interesting-- >> John: We're sucking the data out of two events. >> Did a lot of people coming in, exactly. A lot of people coming-- >> We're live streaming in a streaming data kind of-- >> John just said we suck, there's that record saying that. >> We're sucking all the data. >> So we are-- >> We're sharing data. These videos are data-driven. >> Yeah, absolutely, but the point is, ultimately, is that, is that Splunk is an example of a company that's putting forward a service about how you do this and not necessarily a product focus. And a lot of the folks that are coming on theCUBE here are also going on to theCUBE down in Washington D.C., which is where the Splunk show's at. And so I think one of the things, one of the predictions I'll make, is that we're going to hear over the next couple of days more companies talk about their SaaS trash. >> Yeah, I mean I just think, I agree with you, but I also agree with the comments about the technology coming together. And here's one thing I want to throw on the table. I've gotten the sense a few times about connecting the dots on it, we'll put it out publicly for comment right now. The role that communities will play outside of developer, is going to be astronomical. I think we're seeing signals, certainly open-source communities have been around for a long time. They continue to grow shoulders of giants before them. Even these events like O'Reilly, which are a small community that they rely on is now not the only game in town. We're seeing the notion of a community strategy in things like Blockchain, you're seeing it in business, you're seeing people rolling out their recruitment to say, data scientists. You're seeing a community model developing in business, yes or no? >> Yes, but I would say, I would put it this way, John. That it's always been there. The difference is that we're now getting enough experience with things that have occurred, for example, collaboration, communal, communal collaboration in open-source software that people are now saying, and they've developed a bunch of social networking techniques where they can actually analyze how those communities work together, but now they're saying, hmm, I've figured out how to do an assessment analysis understanding that community. I'm going to see if I can take that same concept and apply it over here to how sales works, or how B-to-B engagement works, or how marketing gets conducted, or how sales and marketing work together. And they're discovering that the same way of thinking is actually very fruitful over there. So I totally agree, 100%. >> So they don't rely on other people's version of a community, they can essentially construct their own. >> They are, they are-- >> John: Or enabling their own. >> That's right, they are bringing that approach to thinking about a community-driven business and they're applying it to a lot of new ways, and that's very exciting. >> As the world gets connected with mobile and internet of things as we're seeing, it's one big online community. We're seeing things, I'm writing a post right now, what you could, what B-to-B markets should learn from the fake news problem. And that is content and infrastructure are now contextually tied together. >> Peter: Totally. >> And related. The payload of the fake news is also related to the gamification of the network effect, hence the targeting, hence the weaponization. >> Hey, we wrote the three Cs, we wrote a piece on the three Cs of strategy a year and a half ago. Content, community, context. And at the end of the day, the most important thing to what you're saying about, is that there is, you know, right now people talk about social networking. Social media, you think Facebook. Facebook is a community with a single context, stay in touch with your friends. >> Connections. >> Connections. But what you're really saying is that for the first time we're now going to see an enormous amount of technology being applied to the fullness of all the communities. We're going to see a lot more communities being created with the software, each driven by what content does, creates value, against the context of how it works, where the community's defined in terms of what do we do? >> Let me focus on the fact that bringing, using community as a framework for understanding how the software world is evolving. The software world is evolving towards, I've said this many times in my work about a resurge, the data scientists or data people, data science skills are the core developers in this new era. Now, what is data science all about at its heart? Machine learning, building, and training machine learning models. And so training machine learning models is everything towards making sure that they are fit for their predicted purpose of classification. Training data, where you get all the training data from to feed all, to train all these models? Where do you get all the human resources to label, to do the labeling of the data sets, and so forth, that you need communities, crowdsourcing and whatnot, and you need sustainable communities that can supply the data and the labeling services, and so forth, to be able to sustain the AI and machine learning revolution. So content, creating data and so forth, really rules in this new era, like-- >> The interest in machine learning is at an all-time high, I guess. >> Jim: Yeah, oh yeah, very much so. >> Got it, I agree. I think the social grab, interest grab, value grab is emerging. I think communities, content, context, communities are relevant. I think a lot of things are going to change, and that the scuttlebutt that I'm hearing in this area now is it's not about the big event anymore. It's about the digital component. I think you're seeing people recognize that, but they still want to do the face-to-face. >> You know what, that's right. That's right, they still want, let's put it this way. That there are, that the whole point of community is we do things together. And there are some things that are still easier to do together if we get together. >> But B-to-B marketing, you just can't say, we're not going to do events when there's a whole machinery behind events. Legion batch marketing, we call it. There's a lot of stuff that goes on in that funnel. You can't just say hey, we're going to do a blog post. >> People still need to connect. >> So it's good, but there's some online tools that are happening, so of course. You wanted to say something? >> Yeah, I just want to say one thing. Face to face validates the source of expertise. I don't really fully trust an expert, I can't in my heart engage with them, 'til I actually meet them and figure out in person whether they really do have the goods, or whether they're repurposing some thinking that they got from elsewhere and they gussy it up. So face, there's no substitute for face-to-face to validate the expertise. The expertise that you value enough to want to engage in your solution, or whatever it might be. >> Awesome, I agree. Online activities, the content, we're streaming the data, theCUBE, this is our annual event in New York City. We've got three days of coverage, Tuesday, Wednesday, Thursday, here, theCUBE in Manhattan, right around the corner from Strata Hadoop, the Javits Center of influencers. We're here with the VIPs, with the entrepreneurs, with the CEOs and all the top analysts from WikiBon and around the community. Be there tomorrow all day, day one wrap up is done. Thanks for watching, see you tomorrow. (rippling music)

Published Date : Sep 27 2017

SUMMARY :

Brought to you by SiliconANGLE Media, of the event in context to the massive growth is that the days of failure being measured by of potential buyers that are driving the change. and the keynotes are tomorrow at Strata, is that the productionization of the machine learning is that the skills gap is significant. But the old expression is if you're a hammer, of what I'm talking about. Again, so this is what you're getting at. and that's how you can see the pretenders from the winners. is the idea that ultimately we are going to see And a lot of the clients, a lot of customers from the event to how people are behaving of it, of the value that all these piece parts, And that is the value of industry, So to me that's a freebie. from the marketplace, his customer, of what data resolves. I think it's-- And a lot of other companies not only have the ability for solutions, the user, the buyer's focus To what it can do by the implementation details. is one of the domains where SaaS has moved, Cloud is sucking the oxygen out of the big data event. I doubt it was sucking it out of the event, but you know, Did a lot of people coming in, exactly. We're sharing data. And a lot of the folks that are coming on theCUBE here is now not the only game in town. and apply it over here to how sales works, of a community, they can essentially construct their own. and they're applying it to a lot of new ways, from the fake news problem. hence the targeting, hence the weaponization. And at the end of the day, the most important thing We're going to see a lot more communities being created that can supply the data and the labeling services, is at an all-time high, I guess. and that the scuttlebutt that I'm hearing And there are some things that are still easier to do There's a lot of stuff that goes on in that funnel. that are happening, so of course. The expertise that you value enough to want to engage and around the community.

ENTITIES

Entity	Category	Confidence
Jim Kobielus	PERSON	0.99+
Peter Burris	PERSON	0.99+
O'Reilly	ORGANIZATION	0.99+
Jim	PERSON	0.99+
John	PERSON	0.99+
IBM	ORGANIZATION	0.99+
O'Reilly Media	ORGANIZATION	0.99+
Manhattan	LOCATION	0.99+
2017	DATE	0.99+
John Furrier	PERSON	0.99+
New York City	LOCATION	0.99+
Peter	PERSON	0.99+
Washington D.C.	LOCATION	0.99+
New York	LOCATION	0.99+
tomorrow	DATE	0.99+
five years	QUANTITY	0.99+
two events	QUANTITY	0.99+
100%	QUANTITY	0.99+
Cloudera	ORGANIZATION	0.99+
Microsoft	ORGANIZATION	0.99+
SiliconANGLE Media	ORGANIZATION	0.99+
first time	QUANTITY	0.99+
today	DATE	0.99+
Wednesday	DATE	0.99+
a year and a half ago	DATE	0.99+
Thursday	DATE	0.99+
one	QUANTITY	0.99+
Spark Summit	EVENT	0.99+
three days	QUANTITY	0.99+
Tuesday	DATE	0.98+
Javits Center	LOCATION	0.98+
Splunk	ORGANIZATION	0.98+
Paxata	ORGANIZATION	0.98+
Facebook	ORGANIZATION	0.98+
next year	DATE	0.97+
this year	DATE	0.97+
SaaS	TITLE	0.97+
day one	QUANTITY	0.96+
NYC	LOCATION	0.96+
first	QUANTITY	0.96+
one thing	QUANTITY	0.96+
WikiBon	ORGANIZATION	0.95+
one show	QUANTITY	0.94+
one shows	QUANTITY	0.94+
BigData	ORGANIZATION	0.94+
Many years ago	DATE	0.93+
Strata	LOCATION	0.93+
Strata Hadoop	LOCATION	0.92+
each	QUANTITY	0.91+
three Cs	QUANTITY	0.9+
Javits Center	ORGANIZATION	0.89+
midtown Manhattan	LOCATION	0.88+
theCUBE	ORGANIZATION	0.87+
Strata	TITLE	0.87+
past few years	DATE	0.87+

Sam Ramji, Google Cloud Platform | VMworld 2017

>> Welcome to our presentation here at VM World 2017. I'm John Furrier, co-host of The Cube, with Dave Vellante who's taking a lunch break. We are at VM World on the ground on the floor where we have Google's vice president of product management developer platforms Sam Ramji. Welcome to The Cube conversation. >> Great, thank you very much John. >> So you had a keynote this morning. You know, came up on stage, big announcement. Let's get right to it. That container as a service from Pivotal, VM Ware, and Google announced kind of a joint announcement. It was kind of weird. It wasn't a fully joint but it really came from Pivotal. Clarify what the announcement was. >> Sure, so what we announced is the result of a bunch of co-engineering that we've been doing in the open source with Pivotal around kubernetes running on bosh. So, if you've been paying attention to cloud foundry, you'd know that cloud foundry is the runtime layer and there's something called bosh sitting underneath it that does the cluster management and cluster operations. Pivotal is bringing that to commercial GA later this year. So what we announced with Pivotal and VMWare is that we're going to have cost incompatibility between Pivotal's kubernetes and Google's kubernetes. Google's kubernetes service is called Google Container Engine Pivotal's offering is called Pivotal Container Service. The big deal here is that PKS is going to be the standard way that you can get kubernetes from any of the Dell Group companies, whether that's VMWare, EMC. That gives us one consistent target for compatibility because one of the things that I pointed out in the keynote was inconsistency is the enemy in the data center. That's what makes operations difficult. >> And Kubo was announced at Cloud Foundry, Stu Miniman covered it, but that wasn't commercially available. That's the nuance, right? >> That's right, and that still is available in the open source. So what we've committed to is, we've said, every time that we update Google Container Engine, Pivotal Container Service is also going to update, so we have constant compatibility, that that's delivered on top of VMWare's infrastructure including NSX for networking and then the final twist is a big reason why people choose Google Cloud is because of our services. So Big Table, Big Query, a dynamically scaling data warehouse that we run an enormous amount of Google workloads on. Spanner, right. Which is why all of your data is consisted globally across Google's planet scaled data centers. And finally, all of our new machine learning and AI investments, those services will be delivered down to Pivotal Container Service, right, that's going to be there out of the box at launch and we'll keep adding to that catalog. >> It's just that Google Next was a lot of conversations, Oh Google's catching up to Amazon, Amazon's done a great job no doubt about it. We love Amazon. Andy Jassy was here as well. >> Super capable very competent engineering team. >> There's a lot of workloads in VMWare community that runs on AWS but it's not the only game in town. Jerry Chen, investor in Docker, friend of ours, we know, called this years ago. It's not going to be a one cloud winner take all game. Clearly. But there's the big three lining up, AWS, Microsoft, Google, you guys are doing great. So I got to ask you, what is the biggest misconception that people have about Google Cloud out in the market? 'Cause a lot of enterprises are used to running ops, maybe not as much dev as there is ops, and dev ops comes in with cloud native, there's a lot of confusion, what is the thing that you'd like to clarify about Google that they may not know about? >> The single most important thing to clarify about Google Cloud is our strategy is open-hybrid cloud. We think that we are in an amazing place to run workloads, we also recognize that compute belongs everywhere. We think that the durable state of computing is more of a mosaic than a uni-directional arrow that says everything goes to cloud. We think you want to run your containers and your VM's in clouds. We think you want to run them in your data centers. We also think you want to move them around. So we've been diehard committed to building out the open-source projects, the protocols to let all of that information flow, and then providing services that can get anywhere. So open-hybrid cloud is the strategy, and that's what we've committed to with kubernetes, with tensorflow, with apache beam, with so much of the open-source that we've contributed to Linux and others, and then maintaining open standards compatibility for our services. >> Well, it's great to see you at Google because I know your history, great open source guy, you know open source, it's been really part of your life, and bringing that to Google's great, so congratulations. >> There's a reason for that though, it's pragmatic. This is not a crazy crusade. The value of open source is giving control to the customer. And I think that the most ethical way that you can build businesses and markets is based on customer choice. Giving them the ability to move to where they want. Reducing their costs of switching. If they stay with you, then you're really producing a value-added service. So I've spent time in the operator shoes, in the developer shoes, and in the vendor shoes. When I've spent time buying and running the software on my own, I really always valued and preferred things that would let me move my stuff around. I preferred open source. So that's really the method to the madness here. It's not about opening everything up insanely, giving everything away. It serves customers better and in the long run, the better you serve customers, you'll build a winning business. >> We're here on the ground floor at VMWorld 2017 in Las Vegas, where behind us is the VM Village. And obviously Sam was on stage with the big announcement with Pivotal VMWare. And this is kind of important now, we got to debate now, usually I'm not the contrarian in the group, I'm usually the guy who's like yeah, rah rah, entrepreneurial, optimistic, yeah we can do that! You know that future's here, go to the future! But I was kind of skeptical and I told VMWare and I saw Pat Gelsinger and Michael Dell in the hallways and I'm like, they thought this was going to be the big announcement, and it was their big announcement, but I was kind of like, guys, I mean, it's the long game, these guys in the VMWare community, their operations guys, their not going to connect the dots and there was kind of an applause but not a standing ovation that Google would've gotten at a Google Next conference where the geeks would've been like going crazy. What is the operational dynamic that you're seeing in this market that Google's looking at and bringing value to, so that's the question for you. >> This is what the big change in the industry is is going from only worrying about increasing application velocity to figuring out how to do that with reliability. So there's a whole community of operators that I think many of us have left behind as we've talked about clouds and cloud data. We've done a great job of appealing to developers, enabling them to be more productive, but with operators, we've kind of said, well, your mileage may vary or we don't have time for you, or you have to figure it out yourself. I think the next big phase in adoption of cloud native technology is to say, first of all, open-hybrid, run your stuff wherever you want. >> Well you've got to have experience running cloud. Now you bring that knowledge out here. >> And that's the next piece. How do we offer you the tools and the skills that you need as an operator to have that same consistency, those same guarantees you used to have, and move everything forward in the future? Because if you turn one audience, one community, into the bad people who are holding everything back, that's a losing proposition, you have to give everybody a path to win, right? Everybody wants to be the good guy. So I think, now we need to start paying really close attention to operators and be approachable, right? I would like to see GCP become the most approachable cloud. We're already well known as the most advanced cloud. But can we be the easiest to adopt as well, and that's our challenge, to get the experience. >> You got to get that touch, that these enterprise teams historically have had, but it's interesting I mean, the mosaic you'd mentioned requires some unification, right? You got to be likable. You got to be approachable. And that's where you guys are going, I know you guys are building out for that, but the question is, for you, because Google has a lot of experience, and I know from personal knowledge Google's depth of people and talent, not always the cleanest execution out to the market in terms of the front-facing white glove service that some of these other companies have done, but you guys are certainly strong. >> Well, I think this is where Diane Greene has been driving the transformation, I mean like, she breathes, eats, sleeps, dreams enterprise. So, being both a board member at Google and being the SVP of Google Cloud, she's really bringing the discipline to say, you know, white glove service is mandatory. We have a pretty substantial professional services organization and building out partnerships with Accenture, with PWC, with Deloitte, with everyone to make sure that these things are all serviceable and properly packaged all the way down to the end user. So, no doubt there's more, more room for us to improve, there's miles to go on the journey, but the focus and the drive to make sure that we're delivering the enterprise requirements, Dianne never lets us stop thinking about that. >> It's like math, right, the order of operations is super important, and there's a lot of stuff going on in the cloud right now that's complex. >> Yes. >> Ease of use is the number one thing that we're hearing, because one, it's a moving a train in general, right? But the cloud's growing, a lot of complexity, how do you guys view that? And the question I want to ask you is, we know what cloud looks like today. Amazon, they're doing great. Multi-horse race if you will. But in 2022, the expectations and what it looks like then is going to be completely different, if you just take the trajectory of what's happening. So cleaning up kubernetes, making that a manageable, all the self updates, makes a lot of sense, and I think that's the dots no one's connecting here, I get the long game, but what's the customer's view in your opinion as someone who's sitting back and with the Google perch looking out over the horizon, 2022, what's it like for the customer? >> That's an outstanding question. So I think, 2022, looking back, we've actually absorbed so much of this complexity that we can provide ease of use to every workload and to every segment. Backing into that, ease of use looks different, like, let's think about tooling, ease of use looks different to an electrician verus a carpenter versus a plumber. They're doing different jobs, they need different tools, so I think about those as different audiences and different workloads. So if you're trying to migrate virtual machines to a cloud, ease of use means a thing and it includes taking care of the networking layer, how do we make sure that our cloud network shows up like an on premises network, and you don't have to set up some weird VPC configuration, how can those just look like part of your LAN subject to your same security controls. That's a whole path of engineering for a particular division of the company. For a different division of the company focused on databases ease of use is wow, I've got this enormous database, I'm straining at the edges, how do I move that to the cloud? Well, what kind of database is it, right? Is it a SQL database? Is it a NoSQL database? So engineering that in, that's the key. The other thing that we have to do for ease of use is upscaling. So a lot of things that we talked about before are the need to drive IT efficiency through automation. But who's going to teach people how to do the automation especially while they're being held to a very high SLA standard for their own data center and held to a high standard for velocity movement to the cloud. This is where Google has invented a discipline called SRE or site reliability engineering, and it's basically the meta discipline around what many people call dev ops. We think that this is absolutely teachable, it's learnable, it's becoming a growing community. You can get O'Reilly books on the topics. So I think we have an accountability to the industry to go and teach every operator and every operating group, hey here's what SRE looks like, some of your folks might want to do this, because that will give you the lift to make all of these workloads much easier to manage 'cause it's not just about velocity, it's also about reliability. >> It's interesting, we've got about a minute left or so. I'm just going to get your thoughts on this because you've certainly seen it on the developer side, stack wars, whatever you want to call them, the my stack runs this tech, but last night I heard in the hallway here multiple times the general consensus of two stacks coming together, not just software stacks, hardware stacks, you're seeing things that have never run together or been tested together before. So the site reliability is a very interesting concept and developers get pissed off when stacks don't work, right? So this is a super kind of nuance in this new use case that are emerging because stuff's happened that's never been done before. >> Yeah, so this is where the common tutorials get really interesting, especially as we build out a planetary scale computer at Google. Right, we're no longer thinking about how does the GPU as part of your daughter board, we think about what about racks of GPU's as part of your datacenters using NVDIA K80's, what does it mean to have 180 teraflops of tensor processing capability in a cloud TPU. So getting container centric is crucial and making it really easy to attach to all of those devices by having open source drivers making sure they're all Linux compatible and developers can get to them is going to be part of the substrate to make sure that application developers can target those devices, operators can set a policy that say, yes, I want this to deploy preferentially to environments with a TPU or a GPU and that the whole system can just work and be operable. >> Great, Sam thanks so much for taking the time to stop by. One on one conversation with Sam Ramji who's a Google Cloud, he's a vice president of product management and developer platforms for Google. We'll see you at Google Next. Thanks for spending the time. I'm John Furrier, thanks for watching. >> Thank you John.

Published Date : Aug 29 2017

SUMMARY :

We are at VM World on the ground on the floor Let's get right to it. The big deal here is that PKS is going to be the standard That's the nuance, right? Pivotal Container Service is also going to update, It's just that Google Next was a lot of conversations, that runs on AWS but it's not the only game in town. the open-source projects, the protocols to let all and bringing that to Google's great, so congratulations. So that's really the method to the madness here. You know that future's here, go to the future! We've done a great job of appealing to developers, Now you bring that knowledge out here. and that's our challenge, to get the experience. not always the cleanest execution out to the market but the focus and the drive to make sure It's like math, right, the order of operations And the question I want to ask you is, I'm straining at the edges, how do I move that to the cloud? So the site reliability is a very interesting concept and that the whole system can just work and be operable. Great, Sam thanks so much for taking the time to stop by.

ENTITIES

Entity	Category	Confidence
Dave Vellante	PERSON	0.99+
Sam Ramji	PERSON	0.99+
Jerry Chen	PERSON	0.99+
Dianne	PERSON	0.99+
PWC	ORGANIZATION	0.99+
Deloitte	ORGANIZATION	0.99+
Andy Jassy	PERSON	0.99+
Diane	PERSON	0.99+
Microsoft	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
John Furrier	PERSON	0.99+
AWS	ORGANIZATION	0.99+
John	PERSON	0.99+
Accenture	ORGANIZATION	0.99+
Sam	PERSON	0.99+
Las Vegas	LOCATION	0.99+
Stu Miniman	PERSON	0.99+
Pivotal	ORGANIZATION	0.99+
Michael Dell	PERSON	0.99+
2022	DATE	0.99+
Pat Gelsinger	PERSON	0.99+
EMC	ORGANIZATION	0.99+
O'Reilly	ORGANIZATION	0.99+
Linux	TITLE	0.99+
Greene	PERSON	0.99+
one	QUANTITY	0.99+
single	QUANTITY	0.98+
VM Ware	ORGANIZATION	0.98+
180 teraflops	QUANTITY	0.98+
VMWorld 2017	EVENT	0.98+
Dell Group	ORGANIZATION	0.98+
both	QUANTITY	0.97+
Cloud Foundry	ORGANIZATION	0.97+
PKS	ORGANIZATION	0.97+
two stacks	QUANTITY	0.96+
VMworld	EVENT	0.96+
The Cube	ORGANIZATION	0.96+
VM World 2017	EVENT	0.96+
last night	DATE	0.96+
Docker	ORGANIZATION	0.95+
NVDIA	ORGANIZATION	0.94+
K80	COMMERCIAL_ITEM	0.94+
One	QUANTITY	0.94+
later this year	DATE	0.94+
NoSQL	TITLE	0.93+
Google Cloud	TITLE	0.93+
NSX	ORGANIZATION	0.92+
today	DATE	0.91+
SQL	TITLE	0.9+
one community	QUANTITY	0.89+
one audience	QUANTITY	0.88+
three	QUANTITY	0.87+
a minute	QUANTITY	0.84+
VMWare	ORGANIZATION	0.82+
this morning	DATE	0.79+
Cloud	TITLE	0.79+

Rob O’Reilly & Raja Ramachandran | Food IT 2017

>> Announcer: From the computer history museum, in the heart of Silicon Valley, it's The Cube. Covering food IT, Fork to Farm. Brought to you by Western Digital. >> Hey, welcome back to The Cube. From the food IT event, From Fork to Farm, yep, you heard that right, Fork to Farm. I'm Lisa Martin. Really excited to be joined by my next guests who are influencing the food chain with Big Data, Cloud, IoT and Blockchain in some very, very interesting ways. We have Rob O'Reilly, senior member and technical staff of Analog Devices. Welcome. >> Thank you. >> And we have Raja Ramachandran, the founder and CEO of Ripe.io. Welcome. >> Thank you Lisa. >> So I made that joke about the Fork to Farm because we think so often how trendy it is, farm to table, farm to mouth. And this has been a really interesting event for us to talk with so many different people and companies across the food chain that we often, I think, take for granted. So Rob, wanted to kind of start with you. Analog Devices has been around for 50 years. You serve a lot of markets. So how is, and maybe kind of tell me sort of the genesis, and I know you were involved in this, of Analog Devices evolving to start using Cloud, Big Data, IoT in the food and agriculture space. What was the opportunity that you saw light bulb moment? >> Yup. It's an interesting story. We started with a piece of technology, a sensor that we can connect. I was looking of an app to apply, 'cause it was a full sensor to the Cloud strategy I was working on. And through some conference attendees that I had met and from a fellow who's now our partner, we kind of put together a strategy of "Well we've got the sensor to the Cloud, "where would we apply this?" And we decided though a little bit of banter, tomatoes. And most of it was because, in New England specifically, there's a lot of, there's 7,000 farms in Massachusetts. >> Lisa: Wow. >> Not all of them produce tomatoes, but a lot of them do. So it was like having a test bed right in our backyard. And from that point it's grown to what it is now. >> And I hear that you don't like tomatoes. >> I really don't like tomatoes. >> Lisa: What about heirloom tomatoes? >> I don't like any tomatoes. >> Lisa: Mozzarella, little basil, no? >> No, no. (laughs) I don't mind pasta sauce so much, but that's just because it's all salt. >> Lisa: That's true. >> And sugar. But no, and I've managed to get through this entire project without anybody forcing me to eat a tomato, so. >> That's good, they're respectful. >> I'm proud of that. >> So I was joking earlier, we cover a lot of events across enterprise innovation, and we were at a Hadoop Dataworks events a couple weeks ago and one of the guests was talking about Big Data and how it's influencing shipping, and how shipping companies are leveraging Big Data to determine how often they should clean the ships to remove barnacles 'cause it slows them down. So the funny thing that popped into my mind from that show is, barnacles and Big Data? Never thought that. Today, the wow factor for me, the internet of tomatoes. What is the internet of tomatoes? >> The problem statement when we started was "Why do tomatoes taste like cardboard?" >> Lisa: He really doesn't like tomatoes! (laughs) >> And, you know, in order to go dig into that was let's collect data. So there's a variety of methods that we use to collect the data. We had to create all of this on our own, so we created our own apps for the phones, our own matchups for the web, our own gateways. We built our hardware, we 3-D printed all the housings, and two of us just went off and started to deploy so we could collect data. The second half of it was, "well, what is in the tomato? "and why does it taste the way it does?" So we started doing some chemistry analysis. So a bunch of refractometers and other instruments so we can see what the sugar levels were, what the acid levels were. We infused ourselves into the Boston Tomato Contest, which they have annually. So we showed up, we looked like the Rolling Stones. We showed up with cases of, trap cases of equipment. It took us about 11 and a half hours to test 113, I think it was, tomatoes, and then we compared those to the chefs' scorecards. And in the chef's scorecard, there wasn't just a taste profile, there was the looks and everything else. Well I found a few markers between what the chef's profile said was a good tasting tomato and what the chemistry said. So a year later we showed up with our optical solution and we managed to test 450 tomatoes. >> Wow. >> About 100 of those go to the slicing table, so we had information on 100 of them and we did the same thing. So it got to the point to where we at least had that reconciliation of "what's the farmer doing "and how does it taste?" And by bringing Raja and his group in, we're bringing a lot more of other Big Data, if you will. Other weather data, aerial drone data, you know, anything we could find in a telematic range that would affect the processing or whatever of the tomato. So that in a nutshell is the internet of tomatoes. >> And is this something that, you know, being able to aggregate Big Data from a variety of sources, something that you're planning to then take to, I heard you earlier in the talk, talking about kind of at the relationship building stage. Is this a dialogue that you're having yet with farms? You mentioned 7,000 farms in Massachusets. What's that kind of conversation like? >> Well that's a very interesting dynamic and I think, you know, that data point for the industry is you better go talk to the farmer. It's really been interesting, the hesitation from a farmer to talk to a semiconductor company was odd. But I wasn't John Deer, I wasn't Monsanto, so they were a little more open. And they understand, a lot of these farmers that I'm dealing with now are generational, you know they're fifth, sixth generation. They really haven't made significant change on their farm in 100 years. >> Probably nor do they have a lot of data that's automated, right? There's probably a lot of things that are in Excel. >> And a lot of it is, I mean beyond their first level of contact, say with a seed or a pesticide manufacturer, They have no idea what's going on in the rest of the world. Unlike, you know, a lot of the big, large farms that we see. But at the smaller region, they're regional. And we've still have Hatfield-McCoy type things going on in New England, where families don't talk to each other, they don't share information. So through one of our work groups, we actually invited two of them, and I felt like match maker. We were trying to just get these two to talk. And they did, and they both realized that they were spending way too much money on fertilizer, and they were both over watering. So, it's still Hatfield and McCoys but at least I think they wink at each other every once in a while. >> Right, I love that you bought that up. That was something that was talked about a number of times today is the lack of collaboration maybe that's still in the sort of competitive stage. So Raja, talk to us about Ripe.io. First of all, I think the name is fantastic, but Blockchain and food. What's the synergy? And what opportunity did you see coming from the financial services industry? >> So, you know one of the key points about what we felt brings all this together is creating a web of trust. And so in financial markets, insurance markets, healthcare markets, you know big institutional regulated markets, there's a lot of regulations that really bind together that notion of trust, because you have a way in which you could effectively call out foul. Now, so there's a center of gravity in each of those industries, whether it's a central bank, you know or a state regulator insurance, so the government in healthcare. Here, there's not. It's disparate. It's completely fragmented, yet somehow magically we all get food everyday, ane we're not dead you know. So from that perspective we just marvel at the fact that you're there. So, bringing Blockchain was a way to basically talk to the farmer, talk to the distributor, talk to the buyer, the producer, and all these different constituents, including certifiers, USDA, whomever it might be. And then also even health to health companies, right, so that you can relate it. So the idea is to basically take all of these desperate sets of data, because they don't necessarily collaborate in full, capture it in the way that we're working with ADI so that you can create a real story about where that food came from, how is it curated, how did it get transported, what's in it, you know, do I get it on time, is it ripe, is it tasty and so on, right? And so we looked at Blockchain as a technology, an enabling technology that quickly captures the data, allows each to preserve its own security about it, and then combine it so that you can achieve real outcomes. So you can automate things like, were you sustainable? Were you of quality? Did you meet these taste factors? Was it certified? That's what excited us. We though, this is a perfect place because you've got to feed 9,000,000,000 people and no one trusts their food, you know? >> Lisa: Right. >> So we felt this would be an excellent opportunity to deploy Blockchain. >> And it's interesting that you know, the transparency is one of the things that we hear from the consumers, you know. We want all these things. We want hormone free, cage free, et cetera. We want organic, we want to make sure it is organic, but we also want that transparency. I'm curious since you are talking to the farmers, the distributors and the consumers, what were some of the different requirements coming from each, and how do you blend that to really have that visibility or that traceability from seed to consumption? >> And it's a good point right, because there's all these competing factors where farmers want certain information done, they don't want the price to go to zero because it's so commoditized. The distributor, not entirely sure if they want anybody to know what they do is if they deliver it, they've done their job. The aggregator, a grocery store, a restaurant or whomever, are really feeling the pinch of demographic changes. Not only in America, but globally, you know about this notion that "I need to know more about my food". Millennials are doing it, look at Amazon and Whole Foods. >> Lisa: Yup. >> That is a tipping point of like where this is all going to go. So for us, what Blockchain does allows for each of those drivers to remain clean. And so in essence, what you can do is you take something called smart contracts, not a great word but basically these are codes in which you've got a checklist or if-then statements that you can say, "What does the farmer want?" "What is the distributor doing to get something there?" And of course the buyer. And so in that sense, we've talked a lot about a scorecard or this notion that you can basically highlight and show all of these different values, so that if the consumer is looking for, you know, I definitely want this in my lettuce, in my beets, in whatever it is, and I need to make this type of salad, how acidic should my tomatoes be? Well that's hard to count, like combine all that information. Since we're capturing that data set and validating it to make sure that they're true, then you actually enable that trust for that consumer. So the consumer may want a lot of information, the issue is will they pay for it? There's some evidence that they will. The second part is, you know, does the grocer have the ability to manage wide varietals in their shelf space, and so on. All the techniques that a grocer would go through, yet they want a clean supply chain. >> Lisa: Right. >> So you know, so like what're we're saying is that this is definitely not easy. And so we're taking it where the influencer of the entire chain is able to help drive it, in the meanwhile we're trying to help create a farmer community that creates a level of trust. Bind those together, we believe Blockchain and a lot of the technology that ADI is deploying helps achieve that. >> And it sounds like from a technology perspective, you're leveraging Blockchain, Big Data, aggregating that to help farmers, even consumers, grocers, retailers, become more data-driven businesses. >> Oh absolutely. I mean in one instance we've got, you know a customer that they're learning how Blockchain can be used to open up their markets and improve their existing customer service. So what they have are like data sets, you know Rob would definitely understand this, but basically you have data set on like what's best for apples, pears, avocados to ripen, you know. Now, they know it in their heads, right? But the issue is, they don't know when there's conditions that change. The grocery store says I want Braeburn apples to be 20% more crisper, well they actually have the answer but they don't know how to tie all that together. >> Lisa: Right. >> So this data-driven capability exposes automation, so that you can fulfill on that. Create new markets, 'cause if your growers don't have it you can go find it from elsewhere. And for the consumer, you're going to deliver that component on time. And so in that sense, you know these things are revealed as ways to, not only like lower cost you know, because in the end Blockchain has this sort of notion that it lowers costs. Like any technology, if you insert it, it typically adds costs. And I'm not saying that our Blockchain does, but the greater value is branding, preserving it, you know. A better economic consequence about it, a better customer satisfaction because I now have knowledge in transparency. >> Lisa: Right. >> So you can't value these things right, because I'm a millennial like all of a sudden I got all my information, well how did you value it? I just paid $60 at Whole Foods, or is it something else? >> Lisa: Right. >> So we think that there's whole new economic revitalization about the entire farming system and the food nag system, because if you show the transparency, you've got something. >> That's so interesting. Last question, and we're almost out of time, Rob you mentioned a lot of small farms in Massachusets. Where are those small farms in terms of readiness to look at technologies and the influence of Big Data? Is it still fairly early in those discussions, or is your market more the larger farms that ... >> I said it earlier, we're at the beginning of the beginning. I was actually shocked, excuse me, when I went out and started talking to them. I was under some assumption that a lot of this was already going on. And it turns out it's not, certainly at that level. So we were like new to these guys, and the fact that we had a technology that would help them was unique to them. The issue was, well how do you communicate with them? How would you sell that? What's the distribution channel? So through a lot of the workshops that we do with the farmers we ask the question, "If their is new technology and you want to go get it, "what do you do?" They google it. I said, "Okay, that's probably not the answer "I was looking for." (laughs) But no, the supporting infrastructure, the rest of the ecosystem they need to take advantage just isn't there yet. So a lot of that I think is slow for the adoption, but it's also kind of helped us because we're working on technologies. You know, timing is everything. So the fact that we've had time to catch up to what we thought was really needed, and then learned more from the farmer, well no, no this is really what they want. So we've been able to iterate. You know, we're a very small team. We've been able to fail miserably many, many times. But the good news is, when we're successful that's all people see. And the farmers are starting to see that, that hey, we're getting actionable data. You're telling me things that I kind of knew, 'cause they fly by the seat of their pants a lot. >> They want it validated, verified. >> Oh yeah, they're very frugal. >> Trustworthy, as you said Raja. >> There's a big push back to spend any money on anything at a farm. That's just the way it is, it's not anything unique. So when you show up now with some technology that could help them, they just want to make sure that you're spot on, you can predict what it is, and when they hand me the money they can start planning on the return on their investment. >> Well gentlemen, we want to thank you so much for sharing your insights, Blockchain of food, what ADI is doing in their 50th year. Sounds like the beginning is very exciting and we wish you the best of luck. I'm not going to hold my breath that you're going to like tomatoes but, you know. (laughs) We wish you the best of luck and enjoy the rest of today. We want to thank you for watching The Cube at the Food IT event, From Fork to Farm. I'm Lisa Martin, thanks for watching. (upbeat pop music)

Published Date : Jun 29 2017

SUMMARY :

Brought to you by Western Digital. From the food IT event, From Fork to Farm, And we have Raja Ramachandran, So I made that joke about the Fork to Farm a sensor that we can connect. And from that point it's grown to what it is now. I don't mind pasta sauce so much, But no, and I've managed to get through this entire project and one of the guests was talking about Big Data And in the chef's scorecard, there wasn't just So that in a nutshell is the internet of tomatoes. And is this something that, you know, and I think, you know, that data point for the industry a lot of data that's automated, right? Unlike, you know, a lot of the big, large farms that we see. And what opportunity did you see coming from So the idea is to basically So we felt this would be an excellent opportunity one of the things that we hear from the consumers, you know. Not only in America, but globally, you know And so in essence, what you can do is you take So you know, so like what're we're saying is aggregating that to help farmers, even consumers, apples, pears, avocados to ripen, you know. And so in that sense, you know these things are revealed because if you show the transparency, you've got something. Rob you mentioned a lot of small farms in Massachusets. And the farmers are starting to see that, So when you show up now and we wish you the best of luck.

ENTITIES

Entity	Category	Confidence
Lisa Martin	PERSON	0.99+
Lisa	PERSON	0.99+
Rob O'Reilly	PERSON	0.99+
Raja Ramachandran	PERSON	0.99+
two	QUANTITY	0.99+
Whole Foods	ORGANIZATION	0.99+
America	LOCATION	0.99+
New England	LOCATION	0.99+
Western Digital	ORGANIZATION	0.99+
Rob	PERSON	0.99+
Rob O’Reilly	PERSON	0.99+
20%	QUANTITY	0.99+
fifth	QUANTITY	0.99+
Massachusetts	LOCATION	0.99+
$60	QUANTITY	0.99+
450 tomatoes	QUANTITY	0.99+
Amazon	ORGANIZATION	0.99+
USDA	ORGANIZATION	0.99+
Excel	TITLE	0.99+
7,000 farms	QUANTITY	0.99+
Silicon Valley	LOCATION	0.99+
50th year	QUANTITY	0.99+
9,000,000,000 people	QUANTITY	0.99+
Ripe.io	ORGANIZATION	0.99+
Hatfield	ORGANIZATION	0.99+
second part	QUANTITY	0.99+
Analog Devices	ORGANIZATION	0.99+
a year later	DATE	0.99+
one	QUANTITY	0.99+
100 years	QUANTITY	0.99+
today	DATE	0.99+
second half	QUANTITY	0.99+
Monsanto	ORGANIZATION	0.99+
Massachusets	LOCATION	0.99+
first level	QUANTITY	0.99+
both	QUANTITY	0.98+
Today	DATE	0.98+
sixth generation	QUANTITY	0.98+
zero	QUANTITY	0.98+
The Cube	TITLE	0.98+
Raja	PERSON	0.97+
each	QUANTITY	0.97+
Big Data	TITLE	0.97+
about 11 and a half hours	QUANTITY	0.97+
Food IT	EVENT	0.97+
Hadoop Dataworks	EVENT	0.96+
113	QUANTITY	0.96+
one instance	QUANTITY	0.96+
First	QUANTITY	0.96+
Boston Tomato Contest	EVENT	0.95+
John Deer	PERSON	0.95+
ADI	ORGANIZATION	0.95+
About 100	QUANTITY	0.93+
Fork to Farm	EVENT	0.93+
Braeburn	ORGANIZATION	0.9+
100 of them	QUANTITY	0.88+
Fork	EVENT	0.88+
Rolling Stones	ORGANIZATION	0.82+
Food IT 2017	EVENT	0.81+
50 years	QUANTITY	0.81+
couple weeks ago	DATE	0.8+

Recommend Videos

Sentiment Analysis

AWS Comprehend

Search Results for O'Reilly: