Image Title

Search Results for Sarbanes:

Lie 1, The Most Effective Data Architecture Is Centralized | Starburst


 

(bright upbeat music) >> In 2011, early Facebook employee and Cloudera co-founder Jeff Hammerbacher famously said, "The best minds of my generation are thinking about how to get people to click on ads, and that sucks!" Let's face it. More than a decade later, organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile and data-driven enterprise. What does that even mean, you ask? Well, it means that everyone in the organization has the data they need when they need it in a context that's relevant to advance the mission of an organization. Now, that could mean cutting costs, could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data warehouses, data marts, data hubs, and yes even data lakes were broken and left us wanting for more. Welcome to The Data Doesn't Lie... Or Does It? A series of conversations produced by theCUBE and made possible by Starburst Data. I'm your host, Dave Vellante, and joining me today are three industry experts. Justin Borgman is the co-founder and CEO of Starburst, Richard Jarvis is the CTO at EMIS Health, and Teresa Tung is cloud first technologist at Accenture. Today, we're going to have a candid discussion that will expose the unfulfilled, and yes, broken promises of a data past. We'll expose data lies: big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth inevitable? Will the data warehouse ever have feature parity with the data lake or vice versa? Is the so-called modern data stack simply centralization in the cloud, AKA the old guards model in new cloud close? How can organizations rethink their data architectures and regimes to realize the true promises of data? Can and will an open ecosystem deliver on these promises in our lifetimes? We're spanning much of the Western world today. Richard is in the UK, Teresa is on the West Coast, and Justin is in Massachusetts with me. I'm in theCUBE studios, about 30 miles outside of Boston. Folks, welcome to the program. Thanks for coming on. >> Thanks for having us. >> Okay, let's get right into it. You're very welcome. Now, here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >> Yeah, definitely a lie. My first startup was a company called Hadapt, which was an early SQL engine for IDU that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem, data in the cloud. Those companies were acquiring other companies and inheriting their data architecture. So despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >> So Richard, from a practitioner's point of view, what are your thoughts? I mean, there's a lot of pressure to cut cost, keep things centralized, serve the business as best as possible from that standpoint. What does your experience show? >> Yeah, I mean, I think I would echo Justin's experience really that we as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in a platform that's close to data experts people who really understand healthcare data from pharmacies or from doctors. And so, although if you were starting from a greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that businesses just don't grow up like that. And it's just really impossible to get that academic perfection of storing everything in one place. >> Teresa, I feel like Sarbanes-Oxley have kind of saved the data warehouse, right? (laughs) You actually did have to have a single version of the truth for certain financial data, but really for some of those other use cases I mentioned, I do feel like the industry has kind of let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralize? >> I think you got to have centralized governance, right? So from the central team, for things like Sarbanes-Oxley, for things like security, for certain very core data sets having a centralized set of roles, responsibilities to really QA, right? To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise, you're not going to be able to scale, right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately, you're going to collaborate with your partners. So partners that are not within the company, right? External partners. We're going to see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >> So Justin, you guys last, jeez, I think it was about a year ago, had a session on data mesh. It was a great program. You invited Zhamak Dehghani. Of course, she's the creator of the data mesh. One of our fundamental premises is that you've got this hyper specialized team that you've got to go through if you want anything. But at the same time, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess, a question for you Richard. How do you deal with that? Do you organize so that there are a few sort of rock stars that build cubes and the like or have you had any success in sort of decentralizing with your constituencies that data model? >> Yeah. So we absolutely have got rockstar data scientists and data guardians, if you like. People who understand what it means to use this data, particularly the data that we use at EMIS is very private, it's healthcare information. And some of the rules and regulations around using the data are very complex and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a consulting type experience from a set of rock stars to help a more decentralized business who needs to understand the data and to generate some valuable output. >> Justin, what do you say to a customer or prospect that says, "Look, Justin. I got a centralized team and that's the most cost effective way to serve the business. Otherwise, I got duplication." What do you say to that? >> Well, I would argue it's probably not the most cost effective, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you for many, many years to come. I think that's the story at Oracle or Teradata or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams, as much as they are experts in the technology, they don't necessarily understand the data itself. And this is one of the core tenets of data mesh that Zhamak writes about is this idea of the domain owners actually know the data the best. And so by not only acknowledging that data is generally decentralized, and to your earlier point about Sarbanes-Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for those laws to be compliant. But I think the reality is the data mesh model basically says data's decentralized and we're going to turn that into an asset rather than a liability. And we're going to turn that into an asset by empowering the people that know the data the best to participate in the process of curating and creating data products for consumption. So I think when you think about it that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two models comparing and contrasting. >> So do you think the demise of the data warehouse is inevitable? Teresa, you work with a lot of clients. They're not just going to rip and replace their existing infrastructure. Maybe they're going to build on top of it, but what does that mean? Does that mean the EDW just becomes less and less valuable over time or it's maybe just isolated to specific use cases? What's your take on that? >> Listen, I still would love all my data within a data warehouse. I would love it mastered, would love it owned by a central team, right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date, I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's going to be a new technology that's going to emerge that we're going to want to tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this new mesh layer that still takes advantage of the things I mentioned: the data products in the systems that are meaningful today, and the data products that actually might span a number of systems. Maybe either those that either source systems with the domains that know it best, or the consumer-based systems or products that need to be packaged in a way that'd be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >> So, Richard, let me ask you. Take Zhamak's principles back to those. You got the domain ownership and data as product. Okay, great. Sounds good. But it creates what I would argue are two challenges: self-serve infrastructure, let's park that for a second, and then in your industry, one of the most regulated, most sensitive, computational governance. How do you automate and ensure federated governance in that mesh model that Teresa was just talking about? >> Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to centralize the security and the governance of the data. And I think although a data warehouse makes that very simple 'cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at EMIS is we have a single security layer that sits on top of our data mesh, which means that no matter which user is accessing which data source, we go through a well audited, well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is audited in a very kind of standard way regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible, understanding where your source of truth is and securing that in a common way is still a valuable approach, and you can do it without having to bring all that data into a single bucket so that it's all in one place. And so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform, and ensuring that only data that's available under GDPR and other regulations is being used by the data users. >> Yeah. So Justin, we always talk about data democratization, and up until recently, they really haven't been line of sight as to how to get there, but do you have anything to add to this because you're essentially doing analytic queries with data that's all dispersed all over. How are you seeing your customers handle this challenge? >> Yeah, I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, the people who know the data the best, to create data as a product ultimately to be consumed. And we try to represent that in our product as effectively, almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization, and then you can start to consume them as you'd like. And so really trying to build on that notion of data democratization and self-service, and making it very easy to discover and start to use with whatever BI tool you may like or even just running SQL queries yourself. >> Okay guys, grab a sip of water. After the short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence. Keep it right there. (bright upbeat music)

Published Date : Aug 22 2022

SUMMARY :

has the data they need when they need it Now, here's the first lie. has proven that to be a lie. of pressure to cut cost, and all of the tooling have kind of saved the data So from the central team, for that build cubes and the like and to generate some valuable output. and that's the most cost effective way is that the reality is those of the data warehouse is inevitable? and making sure that the mesh one of the most regulated, most sensitive, and processes that you put as to how to get there, aspect of the answer to that. or open platforms are the best path

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

RichardPERSON

0.99+

Justin BorgmanPERSON

0.99+

JustinPERSON

0.99+

Richard JarvisPERSON

0.99+

Teresa TungPERSON

0.99+

Jeff HammerbacherPERSON

0.99+

TeresaPERSON

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

MassachusettsLOCATION

0.99+

Zhamak DehghaniPERSON

0.99+

UKLOCATION

0.99+

2011DATE

0.99+

two challengesQUANTITY

0.99+

HadaptORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

StarburstORGANIZATION

0.99+

two modelsQUANTITY

0.99+

thousandsQUANTITY

0.99+

BostonLOCATION

0.99+

FacebookORGANIZATION

0.99+

Sarbanes-OxleyORGANIZATION

0.99+

EachQUANTITY

0.99+

first lieQUANTITY

0.99+

AccentureORGANIZATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.98+

todayDATE

0.98+

SQLTITLE

0.98+

Starburst DataORGANIZATION

0.98+

EMIS HealthORGANIZATION

0.98+

ClouderaORGANIZATION

0.98+

oneQUANTITY

0.98+

first startupQUANTITY

0.98+

one placeQUANTITY

0.98+

about 30 milesQUANTITY

0.98+

OneQUANTITY

0.97+

More than a decade laterDATE

0.97+

EMISORGANIZATION

0.97+

single bucketQUANTITY

0.97+

first technologistQUANTITY

0.96+

three industry expertsQUANTITY

0.96+

single toolQUANTITY

0.96+

single versionQUANTITY

0.94+

ZhamakPERSON

0.92+

theCUBEORGANIZATION

0.91+

single sourceQUANTITY

0.9+

West CoastLOCATION

0.87+

one vendorQUANTITY

0.84+

single security layerQUANTITY

0.81+

about a year agoDATE

0.75+

IDUORGANIZATION

0.68+

IsTITLE

0.65+

a secondQUANTITY

0.64+

EDWORGANIZATION

0.57+

examplesQUANTITY

0.55+

echoCOMMERCIAL_ITEM

0.54+

twofoldQUANTITY

0.5+

LieTITLE

0.35+

Starburst The Data Lies FULL V2b


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data-driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting cost could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data marts, data hubs, and yes, even data lakes were broken and left us wanting from more welcome to the data doesn't lie, or doesn't a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have featured parody with the data lake or vice versa is the so-called modern data stack, simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for hit that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience show? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know, right. You actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like star Oxley, for things like security for certainly very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited Jamma, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come. I think that's the story at Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenants of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about SAR, brain Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but what does that mean? Does that mean the E D w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems, maybe either those that either source systems for the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got to, you know, domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue are two, you know, challenges, self-serve infrastructure let's park that for a second. And then in your industry, the one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at emus is we have a single security layer that sits on top of our data match, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin, I mean, Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, do an analytic queries and with data that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah. I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively a almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself, >>Okay. G guys grab a sip of water. After this short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence, keep it right there. >>Your company has more data than ever, and more people trying to understand it, but there's a problem. Your data is stored across multiple systems. It's hard to access and that delays analytics and ultimately decisions. The old method of moving all of your data into a single source of truth is slow and definitely not built for the volume of data we have today or where we are headed while your data engineers spent over half their time, moving data, your analysts and data scientists are left, waiting, feeling frustrated, unproductive, and unable to move the needle for your business. But what if you could spend less time moving or copying data? What if your data consumers could analyze all your data quickly? >>Starburst helps your teams run fast queries on any data source. We help you create a single point of access to your data, no matter where it's stored. And we support high concurrency, we solve for speed and scale, whether it's fast, SQL queries on your data lake or faster queries across multiple data sets, Starburst helps your teams run analytics anywhere you can't afford to wait for data to be available. Your team has questions that need answers. Now with Starburst, the wait is over. You'll have faster access to data with enterprise level security, easy connectivity, and 24 7 support from experts, organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact our Trino experts to get started. >>We're back with Jess Borgman of Starburst and Richard Jarvis of EVAs health. Okay, we're gonna get to lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you you'll never get performance because you need to be column there. You need to store data in a column format. And then, you know, column formats we're introduced to, to data apes, you have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again like iceberg and Delta and Hodi that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a line and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, look closed is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect. And what you don't want to end up is done is backed itself into a corner that then prevents it from innovating. So if you have chosen a technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage and your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, obviously her vision is there's an open source that, that the data meshes open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but to come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well, I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to Haddo and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in Haddo back then. And I think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, that's interesting reminded when I, you know, I see the, the gas price, the tees or gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up, you mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down. Cause I thought it was an amazing quote. He said, it buys us the ability to be unsure of the future. Th that that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use spark to train a machine learning model and you want to use Starbust to query via sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you in and locks you in. >>So I, I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers, but, but a lot of Oracle customers and they, you know, they'll admit, yeah, you know, they're jamming us on price and the license cost they give, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast in ROI? >>I think the answer to that is it can depend a bit. It depends on your businesses skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run at enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud-based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like PJI Mohan said, you know what? I think it's actually harder to play in the data engineering. So I E it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you commander 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities, because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years. And in world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse fit in this, in this world? >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage, or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a deal lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understand it holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access controls so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern, or is it the same wine new bottle? When it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage. >>Your data is capable of producing incredible results, but data consumers are often left in the dark without fast access to the data they need. Starers makes your data visible from wherever it lives. Your company is acquiring more data in more places, more rapidly than ever to rely solely on a data centralization strategy. Whether it's in a lake or a warehouse is unrealistic. A single source of truth approach is no longer viable, but disconnected data silos are often left untapped. We need a new approach. One that embraces distributed data. One that enables fast and secure access to any of your data from anywhere with Starburst, you'll have the fastest query engine for the data lake that allows you to connect and analyze your disparate data sources no matter where they live Starburst provides the foundational technology required for you to build towards the vision of a decentralized data mesh Starburst enterprise and Starburst galaxy offer enterprise ready, connectivity, interoperability, and security features for multiple regions, multiple clouds and everchanging global regulatory requirements. The data is yours. And with Starburst, you can perform analytics anywhere in light of your world. >>Okay. We're back with Justin Boardman. CEO of Starbust Richard Jarvis is the CTO of EMI health and Theresa tongue is the cloud first technologist from Accenture. We're on July number three. And that is the claim that today's modern data stack is actually modern. So I guess that's the lie it's it is it's is that it's not modern. Justin, what do you say? >>Yeah. I mean, I think new isn't modern, right? I think it's the, it's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually are exactly the same as what we've had for 40 years, rather than Terra data. You have snowflake rather than Informatica you have five trend. So it's the same general stack, just, you know, a cloud version of it. And I think a lot of the challenges that it plagued us for 40 years still maintain. >>So lemme come back to you just, but okay. But, but there are differences, right? I mean, you can scale, you can throw resources at the problem. You can separate compute from storage. You really, you know, there's a lot of money being thrown at that by venture capitalists and snowflake, you mentioned it's competitors. So that's different. Is it not, is that not at least an aspect of, of modern dial it up, dial it down. So what, what do you say to that? >>Well, it, it is, it's certainly taking, you know, what the cloud offers and taking advantage of that, but it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same sort of structural constraints that exist with the old enterprise data warehouse model OnPrem still exist just yes, a little bit more elastic now because the cloud offers that. >>So Theresa, let me go to you cuz you have cloud first in your, in your, your title. So what's what say you to this conversation? >>Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, that's not even how the cloud providers are looking at it. They have news query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our, the future goes, right, that that's gonna very much fall the same thing. There was gonna be more edge. There's gonna be more on premise because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers. Right? So there's a lot of reasons why the modern, I guess, the next modern generation of the data staff needs to be much more federated. >>Okay. So Richard, how do you deal with this? You you've obviously got, you know, the technical debt, the existing infrastructure it's on the books. You don't wanna just throw it out. A lot of, lot of conversation about modernizing applications, which a lot of times is a, you know, a microservices layer on top of leg legacy apps. How do you think about the modern data stack? >>Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just cuz you can scale CPU and storage doesn't mean you can get more people to use your data, to generate you more, more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data, but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams, we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business, not just the technology itself. >>Well thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five, seven years cloud obviously has given a different pricing model. De-risked experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm, I'm taking away that that's not enough based on what Richard just said. The modern data stack has to serve the business and enable the business to build data products. I, I buy that. I'm a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about, you know, paying, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and, and principles there >>Of, of how it should look like or, or how >>It's yeah. What it should be. >>Yeah. Yeah. Well, I think, you know, in, in Theresa mentioned this in, in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I, I certainly agree with that. So by no means, are we suggesting that, you know, snowflake or Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's, it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of, of idealism. They had the benefit of it was starting with a clean slate that does not reflect the vast majority of enterprises. >>And even those companies, as they grow up mature out of that ideal state, they go buy a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding, you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really Futureproof yourself from the inevitable change that you will, you won't encounter over time. >>So thank you. So there, based on what Justin just said, I, my takeaway there is it's inclusive, whether it's a data Mar data hub, data lake data warehouse, it's a, just a node on the mesh. Okay. I get that. Does that include there on Preem data? O obviously it has to, what are you seeing in terms of the ability to, to take that data mesh concept on Preem? I mean, most implementations I've seen in data mesh, frankly really aren't, you know, adhering to the philosophy. They're maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing. Hello, fresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >>I mean, I, I think it's a killer case for data. Me, the fact that you have valuable data sources, OnPrem, and then yet you still wanna modernize and take the best of cloud cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both worlds. You can start using the data products on-prem or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or, or maybe just tap into better analytics to get better insights, right? So you're gonna be able to stretch and really have the best of both worlds. That, again, going back to Richard's point, that is meaningful by the business. Not everything has to have that one size fits all set a tool. >>Okay. Thank you. So Richard, you know, talking about data as product, wonder if we could give us your perspectives here, what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >>So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients' demographics about their, their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight. And in any business, that's clearly not a desirable outcome, but when that insight is so critical, as it might be in healthcare or some security settings, you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured, managed way, even if that data comes from a variety of different sources in, in, in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to researchers. >>So that data product through whatever APIs is, is accessible, it's discoverable, but it's obviously gotta be governed as well. You mentioned you, you appropriately provided to internally. Yeah. But also, you know, external folks as well. So the, so you've, you've architected that capability today >>We have, and because the data is standard, it can generate value much more quickly and we can be sure of the security and, and, and value that that's providing because the data product isn't just about formatting the data into the correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context, what does this data mean? And what does it mean to process this data for a particular use case? >>Yeah, it makes sense. It's got the context. If the, if the domains own the data, you, you gotta cut through a lot of the, the, the centralized teams, the technical teams that, that data agnostic, they don't really have that context. All right. Let's send Justin, how does Starburst fit into this modern data stack? Bring us home. >>Yeah. So I think for us, it's really providing our customers with, you know, the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know, and optionality provides the ability to reduce costs, store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, you can really create and, and curate, you know, data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know, model and make that an appropriate compliment to, you know, the, the, the modern data stack that people have today. >>Excellent. Hey, I wanna thank Justin Theresa and Richard for joining us today. You guys are great. I big believers in the, in the data mesh concept, and I think, you know, we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are gonna be available on the cube.net for on-demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and, and, and they have awesome resources, lots of data mesh conversations over there, and really good stuff in, in the resource section. So check that out. Thanks for watching the data doesn't lie or does it made possible by Starburst data? This is Dave Valante for the cube, and we'll see you next time. >>The explosion of data sources has forced organizations to modernize their systems and architecture and come to terms with one size does not fit all for data management today. Your teams are constantly moving and copying data, which requires time management. And in some cases, double paying for compute resources. Instead, what if you could access all your data anywhere using the BI tools and SQL skills your users already have. And what if this also included enterprise security and fast performance with Starburst enterprise, you can provide your data consumers with a single point of secure access to all of your data, no matter where it lives with features like strict, fine grained, access control, end to end data encryption and data masking Starburst meets the security standards of the largest companies. Starburst enterprise can easily be deployed anywhere and managed with insights where data teams holistically view their clusters operation and query execution. So they can reach meaningful business decisions faster, all this with the support of the largest team of Trino experts in the world, delivering fully tested stable releases and available to support you 24 7 to unlock the value in all of your data. You need a solution that easily fits with what you have today and can adapt to your architecture. Tomorrow. Starbust enterprise gives you the fastest path from big data to better decisions, cuz your team can't afford to wait. Trino was created to empower analytics anywhere and Starburst enterprise was created to give you the enterprise grade performance, connectivity, security management, and support your company needs organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact us to get started.

Published Date : Aug 22 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to the data warehouse ever have featured parody with the data lake or vice versa is So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? although if you were starting from a Greenfield site and you were building something brand new, Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, And you can think of them Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come. But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing that the mesh actually allows you to use all of them. But it creates what I would argue are two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around those do an analytic queries and with data that's all dispersed all over the, how are you seeing your the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of data But what if you could spend less you create a single point of access to your data, no matter where it's stored. give you the performance and control that you can get with a proprietary system. I remember in the very early days, people would say, you you'll never get performance because And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven it is an evolving, you know, spectrum, but, but from your perspective, And what you don't want to end up So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, And I think similarly, you know, being able to connect to an external table that lives in an open data format, Well, that's interesting reminded when I, you know, I see the, the gas price, And I think, you know, I loved what Richard said. not as many te data customers, but, but a lot of Oracle customers and they, you know, And so for those different teams, they can get to an ROI more quickly with different technologies that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts So the advice that I saw years ago was if you have open source technologies, And in world of Oracle, you know, normally it's the staff, easy to discover and consume via, you know, the creation of data products as well. really modern, or is it the same wine new bottle? And with Starburst, you can perform analytics anywhere in light of your world. And that is the claim that today's So it's the same general stack, just, you know, a cloud version of it. So lemme come back to you just, but okay. So a lot of the same sort of structural constraints that exist with So Theresa, let me go to you cuz you have cloud first in your, in your, the data staff needs to be much more federated. you know, a microservices layer on top of leg legacy apps. So I think the stack needs to support a scalable So you think about the past, you know, five, seven years cloud obviously has given What it should be. And I think that's the paradigm shift that needs to occur. data that lives outside of the data warehouse, maybe living in open data formats in a data lake seen in data mesh, frankly really aren't, you know, adhering to So the mesh allows you to have the best of both worlds. So Richard, you know, talking about data as product, wonder if we could give us your perspectives is expecting means that you generate the wrong insight. But also, you know, around the data to say in a very clear business context, It's got the context. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, This is Dave Valante for the cube, and we'll see you next time. You need a solution that easily fits with what you have today and can adapt

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Dave LantaPERSON

0.99+

Jess BorgmanPERSON

0.99+

JustinPERSON

0.99+

TheresaPERSON

0.99+

Justin BorgmanPERSON

0.99+

TeresaPERSON

0.99+

Jeff OckerPERSON

0.99+

Richard JarvisPERSON

0.99+

Dave ValantePERSON

0.99+

Justin BoardmanPERSON

0.99+

sixQUANTITY

0.99+

DaniPERSON

0.99+

MassachusettsLOCATION

0.99+

20 centsQUANTITY

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

JammaPERSON

0.99+

UKLOCATION

0.99+

FINRAORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

Kurt MonashPERSON

0.99+

20%QUANTITY

0.99+

twoQUANTITY

0.99+

fiveQUANTITY

0.99+

JessPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

10QUANTITY

0.99+

AccentureORGANIZATION

0.99+

seven yearsQUANTITY

0.99+

thousandsQUANTITY

0.99+

pythonsTITLE

0.99+

BostonLOCATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.99+

two modelsQUANTITY

0.99+

Zolando ComcastORGANIZATION

0.99+

GemmaPERSON

0.99+

StarbustORGANIZATION

0.99+

JPMCORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

JavasTITLE

0.99+

todayDATE

0.99+

AWSORGANIZATION

0.99+

millionsQUANTITY

0.99+

first lieQUANTITY

0.99+

10DATE

0.99+

12 yearsQUANTITY

0.99+

one placeQUANTITY

0.99+

TomorrowDATE

0.99+

Starburst The Data Lies FULL V1


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data-driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting cost could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data marts, data hubs, and yes, even data lakes were broken and left us wanting from more welcome to the data doesn't lie, or doesn't a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have featured parody with the data lake or vice versa is the so-called modern data stack, simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for hit that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience show? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know, right. You actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like star Oxley, for things like security for certainly very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited Jamma, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come. I think that's the story at Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenants of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about SAR, brain Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but what does that mean? Does that mean the E D w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems, maybe either those that either source systems for the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got to, you know, domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue are two, you know, challenges, self-serve infrastructure let's park that for a second. And then in your industry, the one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at emus is we have a single security layer that sits on top of our data match, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin, I mean, Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, do an analytic queries and with data that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah. I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively a almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself, >>Okay. G guys grab a sip of water. After this short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence, keep it right there. >>Your company has more data than ever, and more people trying to understand it, but there's a problem. Your data is stored across multiple systems. It's hard to access and that delays analytics and ultimately decisions. The old method of moving all of your data into a single source of truth is slow and definitely not built for the volume of data we have today or where we are headed while your data engineers spent over half their time, moving data, your analysts and data scientists are left, waiting, feeling frustrated, unproductive, and unable to move the needle for your business. But what if you could spend less time moving or copying data? What if your data consumers could analyze all your data quickly? >>Starburst helps your teams run fast queries on any data source. We help you create a single point of access to your data, no matter where it's stored. And we support high concurrency, we solve for speed and scale, whether it's fast, SQL queries on your data lake or faster queries across multiple data sets, Starburst helps your teams run analytics anywhere you can't afford to wait for data to be available. Your team has questions that need answers. Now with Starburst, the wait is over. You'll have faster access to data with enterprise level security, easy connectivity, and 24 7 support from experts, organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact our Trino experts to get started. >>We're back with Jess Borgman of Starburst and Richard Jarvis of EVAs health. Okay, we're gonna get to lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you you'll never get performance because you need to be column there. You need to store data in a column format. And then, you know, column formats we're introduced to, to data apes, you have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again like iceberg and Delta and Hodi that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a line and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, look closed is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect. And what you don't want to end up is done is backed itself into a corner that then prevents it from innovating. So if you have chosen a technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage and your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, obviously her vision is there's an open source that, that the data meshes open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but to come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well, I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to Haddo and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in Haddo back then. And I think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, that's interesting reminded when I, you know, I see the, the gas price, the tees or gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up, you mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down. Cause I thought it was an amazing quote. He said, it buys us the ability to be unsure of the future. Th that that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use spark to train a machine learning model and you want to use Starbust to query via sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you in and locks you in. >>So I, I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers, but, but a lot of Oracle customers and they, you know, they'll admit, yeah, you know, they're jamming us on price and the license cost they give, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast in ROI? >>I think the answer to that is it can depend a bit. It depends on your businesses skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run at enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud-based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like PJI Mohan said, you know what? I think it's actually harder to play in the data engineering. So I E it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you commander 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities, because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years. And in world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse fit in this, in this world? >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage, or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a deal lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understand it holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access controls so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern, or is it the same wine new bottle? When it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage. >>Your data is capable of producing incredible results, but data consumers are often left in the dark without fast access to the data they need. Starers makes your data visible from wherever it lives. Your company is acquiring more data in more places, more rapidly than ever to rely solely on a data centralization strategy. Whether it's in a lake or a warehouse is unrealistic. A single source of truth approach is no longer viable, but disconnected data silos are often left untapped. We need a new approach. One that embraces distributed data. One that enables fast and secure access to any of your data from anywhere with Starburst, you'll have the fastest query engine for the data lake that allows you to connect and analyze your disparate data sources no matter where they live Starburst provides the foundational technology required for you to build towards the vision of a decentralized data mesh Starburst enterprise and Starburst galaxy offer enterprise ready, connectivity, interoperability, and security features for multiple regions, multiple clouds and everchanging global regulatory requirements. The data is yours. And with Starburst, you can perform analytics anywhere in light of your world. >>Okay. We're back with Justin Boardman. CEO of Starbust Richard Jarvis is the CTO of EMI health and Theresa tongue is the cloud first technologist from Accenture. We're on July number three. And that is the claim that today's modern data stack is actually modern. So I guess that's the lie it's it is it's is that it's not modern. Justin, what do you say? >>Yeah. I mean, I think new isn't modern, right? I think it's the, it's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually are exactly the same as what we've had for 40 years, rather than Terra data. You have snowflake rather than Informatica you have five trend. So it's the same general stack, just, you know, a cloud version of it. And I think a lot of the challenges that it plagued us for 40 years still maintain. >>So lemme come back to you just, but okay. But, but there are differences, right? I mean, you can scale, you can throw resources at the problem. You can separate compute from storage. You really, you know, there's a lot of money being thrown at that by venture capitalists and snowflake, you mentioned it's competitors. So that's different. Is it not, is that not at least an aspect of, of modern dial it up, dial it down. So what, what do you say to that? >>Well, it, it is, it's certainly taking, you know, what the cloud offers and taking advantage of that, but it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same sort of structural constraints that exist with the old enterprise data warehouse model OnPrem still exist just yes, a little bit more elastic now because the cloud offers that. >>So Theresa, let me go to you cuz you have cloud first in your, in your, your title. So what's what say you to this conversation? >>Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, that's not even how the cloud providers are looking at it. They have news query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our, the future goes, right, that that's gonna very much fall the same thing. There was gonna be more edge. There's gonna be more on premise because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers. Right? So there's a lot of reasons why the modern, I guess, the next modern generation of the data staff needs to be much more federated. >>Okay. So Richard, how do you deal with this? You you've obviously got, you know, the technical debt, the existing infrastructure it's on the books. You don't wanna just throw it out. A lot of, lot of conversation about modernizing applications, which a lot of times is a, you know, a microservices layer on top of leg legacy apps. How do you think about the modern data stack? >>Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just cuz you can scale CPU and storage doesn't mean you can get more people to use your data, to generate you more, more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data, but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams, we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business, not just the technology itself. >>Well thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five, seven years cloud obviously has given a different pricing model. De-risked experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm, I'm taking away that that's not enough based on what Richard just said. The modern data stack has to serve the business and enable the business to build data products. I, I buy that. I'm a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about, you know, paying, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and, and principles there >>Of, of how it should look like or, or how >>It's yeah. What it should be. >>Yeah. Yeah. Well, I think, you know, in, in Theresa mentioned this in, in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I, I certainly agree with that. So by no means, are we suggesting that, you know, snowflake or Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's, it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of, of idealism. They had the benefit of it was starting with a clean slate that does not reflect the vast majority of enterprises. >>And even those companies, as they grow up mature out of that ideal state, they go buy a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding, you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really Futureproof yourself from the inevitable change that you will, you won't encounter over time. >>So thank you. So there, based on what Justin just said, I, my takeaway there is it's inclusive, whether it's a data Mar data hub, data lake data warehouse, it's a, just a node on the mesh. Okay. I get that. Does that include there on Preem data? O obviously it has to, what are you seeing in terms of the ability to, to take that data mesh concept on Preem? I mean, most implementations I've seen in data mesh, frankly really aren't, you know, adhering to the philosophy. They're maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing. Hello, fresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >>I mean, I, I think it's a killer case for data. Me, the fact that you have valuable data sources, OnPrem, and then yet you still wanna modernize and take the best of cloud cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both worlds. You can start using the data products on-prem or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or, or maybe just tap into better analytics to get better insights, right? So you're gonna be able to stretch and really have the best of both worlds. That, again, going back to Richard's point, that is meaningful by the business. Not everything has to have that one size fits all set a tool. >>Okay. Thank you. So Richard, you know, talking about data as product, wonder if we could give us your perspectives here, what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >>So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients' demographics about their, their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight. And in any business, that's clearly not a desirable outcome, but when that insight is so critical, as it might be in healthcare or some security settings, you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured, managed way, even if that data comes from a variety of different sources in, in, in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to researchers. >>So that data product through whatever APIs is, is accessible, it's discoverable, but it's obviously gotta be governed as well. You mentioned you, you appropriately provided to internally. Yeah. But also, you know, external folks as well. So the, so you've, you've architected that capability today >>We have, and because the data is standard, it can generate value much more quickly and we can be sure of the security and, and, and value that that's providing because the data product isn't just about formatting the data into the correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context, what does this data mean? And what does it mean to process this data for a particular use case? >>Yeah, it makes sense. It's got the context. If the, if the domains own the data, you, you gotta cut through a lot of the, the, the centralized teams, the technical teams that, that data agnostic, they don't really have that context. All right. Let's send Justin, how does Starburst fit into this modern data stack? Bring us home. >>Yeah. So I think for us, it's really providing our customers with, you know, the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know, and optionality provides the ability to reduce costs, store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, you can really create and, and curate, you know, data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know, model and make that an appropriate compliment to, you know, the, the, the modern data stack that people have today. >>Excellent. Hey, I wanna thank Justin Theresa and Richard for joining us today. You guys are great. I big believers in the, in the data mesh concept, and I think, you know, we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are gonna be available on the cube.net for on-demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and, and, and they have awesome resources, lots of data mesh conversations over there, and really good stuff in, in the resource section. So check that out. Thanks for watching the data doesn't lie or does it made possible by Starburst data? This is Dave Valante for the cube, and we'll see you next time. >>The explosion of data sources has forced organizations to modernize their systems and architecture and come to terms with one size does not fit all for data management today. Your teams are constantly moving and copying data, which requires time management. And in some cases, double paying for compute resources. Instead, what if you could access all your data anywhere using the BI tools and SQL skills your users already have. And what if this also included enterprise security and fast performance with Starburst enterprise, you can provide your data consumers with a single point of secure access to all of your data, no matter where it lives with features like strict, fine grained, access control, end to end data encryption and data masking Starburst meets the security standards of the largest companies. Starburst enterprise can easily be deployed anywhere and managed with insights where data teams holistically view their clusters operation and query execution. So they can reach meaningful business decisions faster, all this with the support of the largest team of Trino experts in the world, delivering fully tested stable releases and available to support you 24 7 to unlock the value in all of your data. You need a solution that easily fits with what you have today and can adapt to your architecture. Tomorrow. Starbust enterprise gives you the fastest path from big data to better decisions, cuz your team can't afford to wait. Trino was created to empower analytics anywhere and Starburst enterprise was created to give you the enterprise grade performance, connectivity, security management, and support your company needs organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact us to get started.

Published Date : Aug 20 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to the data warehouse ever have featured parody with the data lake or vice versa is So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? although if you were starting from a Greenfield site and you were building something brand new, Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, And you can think of them Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come. But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing that the mesh actually allows you to use all of them. But it creates what I would argue are two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around those do an analytic queries and with data that's all dispersed all over the, how are you seeing your the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of data But what if you could spend less you create a single point of access to your data, no matter where it's stored. give you the performance and control that you can get with a proprietary system. I remember in the very early days, people would say, you you'll never get performance because And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven it is an evolving, you know, spectrum, but, but from your perspective, And what you don't want to end up So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, And I think similarly, you know, being able to connect to an external table that lives in an open data format, Well, that's interesting reminded when I, you know, I see the, the gas price, And I think, you know, I loved what Richard said. not as many te data customers, but, but a lot of Oracle customers and they, you know, And so for those different teams, they can get to an ROI more quickly with different technologies that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts So the advice that I saw years ago was if you have open source technologies, And in world of Oracle, you know, normally it's the staff, easy to discover and consume via, you know, the creation of data products as well. really modern, or is it the same wine new bottle? And with Starburst, you can perform analytics anywhere in light of your world. And that is the claim that today's So it's the same general stack, just, you know, a cloud version of it. So lemme come back to you just, but okay. So a lot of the same sort of structural constraints that exist with So Theresa, let me go to you cuz you have cloud first in your, in your, the data staff needs to be much more federated. you know, a microservices layer on top of leg legacy apps. So I think the stack needs to support a scalable So you think about the past, you know, five, seven years cloud obviously has given What it should be. And I think that's the paradigm shift that needs to occur. data that lives outside of the data warehouse, maybe living in open data formats in a data lake seen in data mesh, frankly really aren't, you know, adhering to So the mesh allows you to have the best of both worlds. So Richard, you know, talking about data as product, wonder if we could give us your perspectives is expecting means that you generate the wrong insight. But also, you know, around the data to say in a very clear business context, It's got the context. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, This is Dave Valante for the cube, and we'll see you next time. You need a solution that easily fits with what you have today and can adapt

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Dave LantaPERSON

0.99+

Jess BorgmanPERSON

0.99+

JustinPERSON

0.99+

TheresaPERSON

0.99+

Justin BorgmanPERSON

0.99+

TeresaPERSON

0.99+

Jeff OckerPERSON

0.99+

Richard JarvisPERSON

0.99+

Dave ValantePERSON

0.99+

Justin BoardmanPERSON

0.99+

sixQUANTITY

0.99+

DaniPERSON

0.99+

MassachusettsLOCATION

0.99+

20 centsQUANTITY

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

JammaPERSON

0.99+

UKLOCATION

0.99+

FINRAORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

Kurt MonashPERSON

0.99+

20%QUANTITY

0.99+

twoQUANTITY

0.99+

fiveQUANTITY

0.99+

JessPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

10QUANTITY

0.99+

AccentureORGANIZATION

0.99+

seven yearsQUANTITY

0.99+

thousandsQUANTITY

0.99+

pythonsTITLE

0.99+

BostonLOCATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.99+

two modelsQUANTITY

0.99+

Zolando ComcastORGANIZATION

0.99+

GemmaPERSON

0.99+

StarbustORGANIZATION

0.99+

JPMCORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

JavasTITLE

0.99+

todayDATE

0.99+

AWSORGANIZATION

0.99+

millionsQUANTITY

0.99+

first lieQUANTITY

0.99+

10DATE

0.99+

12 yearsQUANTITY

0.99+

one placeQUANTITY

0.99+

TomorrowDATE

0.99+

Starburst Panel Q1


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting costs could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data, Mars, data hubs, and yes, even data lakes were broken and left us wanting for more welcome to the data doesn't lie, or does it a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have feature parody with the data lake or vice versa is the so-called modern data stack simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for IDU that was acquired by Teradata. And when I got to Teradata, of course, Terada is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on-prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience, Joe? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know? Right. But you actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like swans Oxley, for things like security, for certain very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited JAK, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come, I think that's the story of Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenets of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about, so Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and con contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but the, what does that mean? Does that mean the ed w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's gonna be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems. Maybe either those that either source systems, the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to lose all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got, you know, the domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue or two, you know, challenges self-serve infrastructure let's park that for a second. And then in your industry, one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And, and I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at EMI is we have a single security layer that sits on top of our data mesh, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. >>And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin mean Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, doing analytic queries and with data, that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah, I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively, almost eCommerce, like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself. >>Okay. G guys grab a sip of water. After the short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence. Keep it right there.

Published Date : Aug 2 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to Teresa is on the west coast and Justin is in Massachusetts with me. So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? you might be able to centralize all the data and all of the tooling and teams in one place. Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? of rock stars that, that, you know, build cubes and, and the like, And you can think of them like consultants Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come, I think that's the story of Oracle or Terra data or other proprietary But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing you know, new mesh layer that still takes advantage of the things. But it creates what I would argue or two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around And, and so having done that and investing quite heavily in making that possible But do you have anything to add to this because you're essentially taking, you know, the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave LantaPERSON

0.99+

DaniPERSON

0.99+

RichardPERSON

0.99+

Justin BorgmanPERSON

0.99+

JustinPERSON

0.99+

Jeff OckerPERSON

0.99+

TheresaPERSON

0.99+

Richard JarvisPERSON

0.99+

TeresaPERSON

0.99+

MassachusettsLOCATION

0.99+

TeradataORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

OracleORGANIZATION

0.99+

UKLOCATION

0.99+

twoQUANTITY

0.99+

JoePERSON

0.99+

GDPRTITLE

0.99+

JAKPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

BostonLOCATION

0.99+

thousandsQUANTITY

0.99+

two modelsQUANTITY

0.99+

EMIORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

GemmaPERSON

0.99+

TeradaORGANIZATION

0.99+

AccentureORGANIZATION

0.99+

EachQUANTITY

0.99+

first lieQUANTITY

0.99+

todayDATE

0.99+

first startupQUANTITY

0.98+

ClouderaORGANIZATION

0.98+

TodayDATE

0.98+

SQLTITLE

0.98+

first technologistQUANTITY

0.97+

one placeQUANTITY

0.97+

DemocratORGANIZATION

0.97+

singleQUANTITY

0.97+

about 30 milesQUANTITY

0.97+

oneQUANTITY

0.96+

three industry expertsQUANTITY

0.95+

more than a decade laterDATE

0.94+

OneQUANTITY

0.94+

hit adaptORGANIZATION

0.94+

Terra dataORGANIZATION

0.93+

GreenfieldLOCATION

0.92+

single sourceQUANTITY

0.91+

single toolQUANTITY

0.91+

OxleyPERSON

0.91+

one vendorQUANTITY

0.9+

single bucketQUANTITY

0.9+

single versionQUANTITY

0.88+

about a year agoDATE

0.85+

Theresa tonguePERSON

0.83+

emosORGANIZATION

0.82+

MarsORGANIZATION

0.8+

swans OxleyPERSON

0.77+

IDUTITLE

0.69+

firstQUANTITY

0.59+

a secondQUANTITY

0.55+

Sarbanes OxleyORGANIZATION

0.53+

MasteredPERSON

0.45+

Q1QUANTITY

0.37+

Manoj Suvarna, Deloitte LLP & Arte Merritt, AWS | Amazon re:MARS 2022


 

(upbeat music) >> Welcome back, everyone. It's theCUBE's coverage here in Las Vegas. I'm John Furrier, your host of theCUBE with re:MARS. Amazon re:MARS stands for machine learning, automation, robotics, and space. Lot of great content, accomplishment. AI meets meets robotics and space, industrial IoT, all things data. And we've got two great guests here to unpack the AI side of it. Manoj Suvarna, Managing Director at AI Ecosystem at Deloitte and Arte Merritt, Conversational AI Lead at AWS. Manoj, it's great to see you CUBE alumni. Art, welcome to theCUBE. >> Thanks for having me. I appreciate it. >> So AI's the big theme. Actually, the big disconnect in the industry has been the industrial OT versus IT, and that's happening. Now you've got space and robotics meets what we know is machine learning and AI which we've been covering. This is the confluence of the new IoT market. >> It absolutely is. >> What's your opinion on that? >> Yeah, so actually it's taking IoT beyond the art of possible. One area that we have been working very closely with AWS. We're strategic alliance with them. And for the past six years, we have been investing a lot in transformations. Transformation as it relate to the cloud, transformation as it relate to data modernization. The new edge is essentially on AI and machine learning. And just this week, we announced a new solution which is more focused around enhancing contact center intelligence. So think about the edge of the contact center, where we all have experiences around dealing with customer service and how to really take that to the next level, challenges that clients are facing in every part of that business. So clearly. >> Well, Conversational AI is a good topic. Talk about the relationship with Deloitte and Amazon for a second around AI because you guys have some great projects going on right now. That's well ahead of the curve on solving the scale problem 'cause there's a scale and problem, practical problem and then scale. What's the relationship with Amazon and Deloitte? >> We have a great alliance and relationship. Deloitte brings that expertise to help folks build high quality, highly effective conversational AI and enterprises are implementing these solutions to really try to improve the overall customer experience. So they want to help agents improve productivity, gain insights into the reasons why folks are calling but it's really to provide that better user experience being available 24/7 on channels users prefer to interact. And the solutions that Deloitte is building are highly advanced, super exciting. Like when we show demos of them to potential customers, the eyes light up and they want those solutions. >> John: Give an example when their eyes light up. What are you showing there? >> One solution, it's called multimodal interfaces. So what this is, is when you're call into like a voice IVR, Deloitte's solution will send the folks say a mobile app or a website. So the person can interact with both the phone touching on the screen and the voice and it's all kept in sync. So imagine you call the doctor's office or say I was calling a airline and I want to change my flight or sorry, change the seat. If they were to say, seat 20D is available. Well, I don't know what that means, but if you see the map while you're talking, you can say, oh, 20D is the aisle. I'm going to select that. So Deloitte's doing those kind of experiences. It's incredible. >> Manoj, this is where the magic comes into play when you bring data together and you have integration like this. Asynchronously or synchronously, it's all coming together. You have different platforms, phone, voice, silo databases potentially, the old way. Now, the new ways integrating. What makes it all work? What's the key to success? >> Yeah, it's certainly not a trivial feat. Bringing together all of these ecosystems of relationships, technologies all put together. We cannot do it alone. This is where we partner with AWS with some of our other partners like Salesforce and OneReach and really trying to bring a symphony of some of these solutions to bear. When you think about, going back to the example of contact center, the challenges that the pandemic posed in the last couple of years was the fact that who's a humongous rise in volume of number of calls. You can imagine people calling in asking for all kinds of different things, whether it's airlines whether it is doctor's office and retail. And then couple with that is the fact that there's the labor shortage. And how do you train agents to get them to be productive enough to be able to address hundreds or thousands of these calls? And so that's where we have been starting to, we have invested in those solutions bringing those technologies together to address real client problems, not just slideware but actual production environments. And that's where we launched this solution called TrueServe as of this week, which is really a multimodal solution that is built with preconceived notions of technologies and libraries where we can then be industry agnostic and be able to deliver those experiences to our clients based on whatever vertical or industry they're in. >> Take me through the client's engagement here because I can imagine they want to get a practical solution. They're going to want to have it up and running, not like a just a chatbot, but like they completely integrated system. What's the challenge and what's the outcome first set of milestones that you see that they do first? Do they just get the data together? Are they deploying a software solution? What's the use cases? >> There's a couple different use cases. We see there's the self-service component that we're talking about with the chatbots or voice IVR solutions. There's also use cases for helping the agents, so real-time agent assist. So you call into a contact center, it's transcribed in real time, run through some sort of knowledge base to give the agents possible answers to help the user out, tying in, say the Salesforce data, CRM data, to know more about the user. Like if I was to call the airline, it's going to say, "Are you calling about your flight to San Francisco tomorrow?" It knows who I am. It leverages that stuff. And then the key piece is the analytics knowing why folks are calling, not just your metrics around, length of calls or deflections, but what were the reasons people were calling in because you can use that data to improve your underlying products or services. These are the things that enterprise are looking for and this is where someone like Deloitte comes in, brings that expertise, speeds up the time to market and really helps the customers. >> Manoj, what was the solution you mentioned that you guys announced? >> Yeah, so this is called Deloitte TrueServe. And essentially, it's a combination of multiple different solutions combinations from AWS, from Salesforce, from OneReach. All put together with our joint engineering and really delivering that capability. Enhancing on that is the analytics component, which is really critical, especially because when you think about the average contact center, less than 10% of the data gets analyzed today, and how do you then extract value out of that data and be able to deliver business outcomes. >> I was just talking to some of the other day about Zoom. Everyone records their zoom meetings, and no one watches them. I mean, who's going to wade through that. Call center is even more high volume. We're talking about massive data. And so will you guys automate that? Do you go through every single piece of data, every call and bring it down? Is that how it works? >> Go ahead. >> There's just some of the things you can do. Analyze the calls for common themes, like figuring out like topic modeling, what are the reasons people are calling in. Summarizing that stuff so you can see what those underlying issues are. And so that could be, like I was mentioning, improving the product or service. It could also be for helping train the agents. So here's how to answer that question. And it could even be reinforcing positive experiences maybe an agent had a particular great call and that could be a reference for other folks. >> Yeah, and also during the conversation, when you think about within 60 to 90 seconds, how do you identify the intonation, the sentiments of the client customer calling in and be able to respond in real time for the challenges that they might be facing and the ability to authenticate the customer at the same time be able to respond to them. I think that is the advancements that we are seeing in the market. >> I think also your point about the data having residual values also excellent because this is a long tail of value in this data, like for predictions and stuff. So NASA was just on before you guys came on, talking about the Artemis project and all the missions and they have to run massive amounts of simulations. And this is where I've kind of seen the dots connect here. You can run with AI, run all the heavy lifting without human touching it to get that first ingestion or analysis, and then iterating on the data based upon what else happens. >> Manoj: Absolutely. >> This is now the new normal, right? Is this? >> It is. And it's transverse towards across multiple domains. So the example we gave you was around Conversational AI. We're now looking at that for doing predictive analytics. Those are some examples that we are doing jointly with AWS SageMaker. We are working on things like computer vision with some of the capabilities and what computer vision has to offer. And so when you think about the continuum of possibilities of what we can bring together from a tools, technology, services perspective, really the sky is the limit in terms of delivering these real experiences to our clients. >> So take me through a customer. Pretending I'm a customer, I get it. I got to do this. It's a competitive advantage. What are the outcomes that they are envisioning? What are some of the patterns you're seeing with customers? What outcomes are they expecting and what kind of high level upside you see them envisioning coming out of the data? >> So when you think about the CxOs today and the board, a lot of them are thinking about, okay, how do you build more efficiency in those system? How do you enable a technology or solution for them to not only increase their top line but as well as their bottom line? How do you enhance the customer experience, which in this case is spot on because when you think about, when customers go repeat to a vendor, it's based on quality, it's based on price. Customer experience is now topping that where your first experience, whether it's through a chat or a virtual assistant or a phone call is going to determine the longevity of that customer with you as a vendor. And so clearly, when you think about how clients are becoming AI fuel, this is where we are bringing in new technologies, new solutions to really push the art to the limit and the art of possible. >> You got a playbook too to do this? >> Yeah, yeah, absolutely. We have done that. And in fact, we are now taking that to the next level up. So something that I've mentioned about this before, which is how do you trust an AI system as it's building up. >> Hold on, I need to plug in. >> Yeah, absolutely. >> I put this here for a reason to remind me. No, but also trust is a big thing. Just put that trustworthy. This is an AI ethics question. >> Arte: It's a big. >> Let's get into it. This is huge. Data's data. Data can be biased from coming in >> Part of it, there are concerns you have to look at the bias in the data. It's also how you communicate through these automated channels, being empathetic, building trust with the customer, being concise in the answers and being accessible to all sorts of different folks and how they might communicate. So it's definitely a big area. >> I mean, you think about just normal life. We all lived situations where we got a text message from a friend or someone close to us where, what the hell, what are you saying? And they had no contextual bad feelings about it or, well, there's misunderstandings 'cause the context isn't there 'cause you're rapid fire them on the subway. I'm riding my bike. I stop and text, okay, I'm okay. Church response could mean I'm busy or I'm angry. Like this is now what you said about empathy. This is now a new dynamic in here. >> Oh, the empathy is huge, especially if you're say a financial institution or building that trust with folks and being empathetic. If someone's reaching out to a contact center, there's a good chance they're upset about something. So you have to take that. >> John: Calm them down first. >> Yeah, and not being like false like platitude kind of things, like really being empathetic, being inclusive in the language. Those are things that you have conversation designers and linguistics folks that really look into that. That's why having domain expertise from folks like Deloitte come in to help with that. 'Cause maybe if you're just building the chat on your own, you might not think of those things. But the folks with the domain expertise will say like, Hey, this is how you script it. It's the power of words, getting that message across clearly. >> The linguistics matter? >> Yeah, yeah. >> It does. >> By vertical too, I mean, you could pick any the tribe, whatever orientation and age, demographics, genders. >> All of those things that we take for granted as a human. When you think about trust, when you think about bias, when you think about ethics, it just gets amplified. Because now you're dealing with millions and millions of data points that may or may not be the right direction in terms of somebody's calling in depending on what age group they're in. Some questions might not be relevant for that age group. Now a human can determine that, but a bot cannot. And so how do you make sure that when you look at this data coming in, how do you build models that are ethically aware of the contextual algorithms and the alignment with it and also enabling that experience to be much enhanced than taking it backwards, and that's really. >> I can imagine it getting better with as people get scaled up a bit 'cause then you're going to have to start having AI to watch the AI at some point, as they say. Where are we in the progress in the industry right now? Because I know there's been a lot of news stories around, ethics and AI and bias and it's a moving train actually, but still problems are going to be solved. Are we at the tipping point yet? Are we still walking in before we crawl or crawling before we walk? I should say, I mean, where are we? >> I think we are in between a crawling or walk phase. And the reason for that is because it varies depending on whether you're regulated industry or unregulated. In the regulated industry, there are compliance regulations requirements, whether it's government whether it's banking, financial institutions where they have to meet Sarbanes-Oxley and all kinds of compliance requirements, whereas an unregulated industry like retail and consumer, it is anybody's gain. And so the reality of it is that there is more of an awareness now. And that's one of the reasons why we've been promoting this jointly with AWS. We have a framework that we have established where there are multiple pillars of trust, bias, privacy, and security that companies and organizations need to think about. Our data scientists, ML engineers need to be familiar with it, but because while they're super great in terms of model building and development, when it comes to the business, when it comes to the client or a customer, it is super important for them to trust this platform, this algorithm. And that is where we are trying to build that momentum, bring that awareness. One of my colleagues has written this book "Trustworthy AI". We're trying to take the message out to the market to say, there is a framework. We can help you get there. And certainly that's what we are doing. >> Just call Deloitte up and you're going to take care of them. >> Manoj: Yeah. >> On the Amazon side, Amazon Web Services. I always interview Swami every year at re:Invent and he always get the updates. He's been bullish on this for a long time on this Conversational AI. What's the update on the AWS side? Where are you guys at? What's the current trends that you're riding? What wave are you riding right now? >> So some of the trends we see in customer interest, there's a couple of things. One is the multimodal interfaces we we're just chatting about where the voice IVA is synced with like a web or mobile experience, so you take that full advantage of the device. The other is adding additional AI into the Conversational AI. So one example is a customer that included intelligent document processing as part of the chatbot. So instead of typing your name and address, take a photo of your driver's license. It was an insurance onboarding chatbot, so you could take a photo of your existing insurance policy. It'll extract that information to build the new insurance policy. So folks get excited about that. And the third area we see interest is what's called multi-bot orchestration. And this is where you can have one main chatbot. Marshall user across different sub-chatbots based on the use case persona or even language. So those things get people really excited and then AWS is launching all sorts of new features. I don't know which one is coming out. >> I know something's coming out tomorrow. He's right at corner. He's big smile on his face. He wouldn't tell me. It's good. >> We have for folks like the closer alliance relationships, we we're able to get previews. So there a preview of all the new stuff. And I don't know what I could, it's pretty exciting stuff. >> You get in trouble if you spill the beans here. Don't, be careful. I'll watch you. We'll talk off camera. All exciting stuff. >> Yeah, yeah. I think the orchestrator bot is interesting. Having the ability to orchestrate across different contextual datasets is interesting. >> One of the areas where it's particularly interesting is in financial services. Imagine a bank could have consumer accounts, merchant accounts, investment banking accounts. So if you were to chat with the chatbot and say I want to open account, well, which account do you mean? And so it's able to figure out that context to navigate folks to those sub-chatbots behind the scenes. And so it's pretty interesting style. >> Awesome. Manoj while we're here, take a minute to quickly give a plug for Deloitte. What your program's about? What customers should expect if they work with you guys on this project? Give a quick commercial for Deloitte. >> Yeah, no, absolutely. I mean, Deloitte has been continuing to lead the AI field organization effort across our client base. If you think about all the Fortune 100, Fortune 500, Fortune 2000 clients, we certainly have them where they are in advanced stages of multiple deployments for AI. And we look at it all the way from strategy to implementation to operational models. So clients don't have to do it alone. And we are continuing to build our ecosystem of relationships, partnerships like the alliances that we have with AWS, building the ecosystem of relationships with other emerging startups, to your point about how do you continue to innovate and bring those technologies to your clients in a trustworthy environment so that we can deliver it in production scale. That is essentially what we're driving. >> Well, Arte, there's a great conversation and the AI will take over from here as we end the segment. I see a a bot coming on theCUBE later and there might be CUBE be replaced with robots. >> Right, right, right, exactly. >> I'm John Furrier, calling from Palo Alto. >> Someday, CUBE bot. >> You can just say, Alexa do my demo for me or whatever it is. >> Or digital twin for John. >> We're going to have a robot on earlier do a CUBE interview and that's Dave Vellante. He'd just pipe his voice in and be fun. Well, thanks for coming on, great conversation. >> Thank you. Thanks for having us. >> CUBE coverage here at re:MARS in Las Vegas. Back to the event circle. We're back in the line. Got re:Inforce and don't forget re:Invent at the end of the year. CUBE coverage of this exciting show here. Machine learning, automation, robotics, space. That's MARS, it's re:MARS. I'm John Furrier. Thanks for watching. (gentle music)

Published Date : Jun 24 2022

SUMMARY :

Manoj, it's great to see you CUBE alumni. I appreciate it. of the new IoT market. And for the past six years, on solving the scale problem And the solutions that What are you showing there? So the person can interact What's the key to success? and be able to deliver those What's the use cases? it's going to say, "Are you and be able to deliver business outcomes. of the other day about Zoom. the things you can do. and the ability to and they have to run massive So the example we gave you What are some of the patterns And so clearly, when you that to the next level up. a reason to remind me. Data can be biased from coming in being concise in the answers 'cause the context isn't there Oh, the empathy is huge, But the folks with the domain you could pick any the tribe, and the alignment with it in the industry right now? And so the reality of it is that you're going to take care of them. and he always get the updates. So some of the trends we I know something's coming out tomorrow. We have for folks like the if you spill the beans here. Having the ability to orchestrate One of the areas where with you guys on this project? So clients don't have to do it alone. and the AI will take over from I'm John Furrier, You can just say, We're going to have a robot Thanks for having us. We're back in the line.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DeloitteORGANIZATION

0.99+

Dave VellantePERSON

0.99+

AmazonORGANIZATION

0.99+

JohnPERSON

0.99+

NASAORGANIZATION

0.99+

AWSORGANIZATION

0.99+

Amazon Web ServicesORGANIZATION

0.99+

millionsQUANTITY

0.99+

John FurrierPERSON

0.99+

hundredsQUANTITY

0.99+

Manoj SuvarnaPERSON

0.99+

Las VegasLOCATION

0.99+

San FranciscoLOCATION

0.99+

less than 10%QUANTITY

0.99+

ManojPERSON

0.99+

AI EcosystemORGANIZATION

0.99+

first experienceQUANTITY

0.99+

SwamiPERSON

0.99+

OneQUANTITY

0.99+

Arte MerrittORGANIZATION

0.99+

tomorrowDATE

0.99+

oneQUANTITY

0.99+

OneReachORGANIZATION

0.99+

90 secondsQUANTITY

0.98+

Palo AltoLOCATION

0.98+

one exampleQUANTITY

0.98+

firstQUANTITY

0.98+

todayDATE

0.98+

bothQUANTITY

0.97+

AlexaTITLE

0.97+

this weekDATE

0.97+

thousandsQUANTITY

0.97+

Deloitte LLPORGANIZATION

0.97+

One solutionQUANTITY

0.96+

two great guestsQUANTITY

0.95+

theCUBEORGANIZATION

0.95+

SalesforceORGANIZATION

0.94+

Sarbanes-OxleyORGANIZATION

0.94+

one main chatbotQUANTITY

0.94+

third areaQUANTITY

0.94+

One areaQUANTITY

0.92+

Fortune 500ORGANIZATION

0.92+

60QUANTITY

0.92+

first setQUANTITY

0.91+

Trustworthy AITITLE

0.9+

CUBETITLE

0.89+

last couple of yearsDATE

0.88+

CUBEORGANIZATION

0.88+

pandemicEVENT

0.84+

MARSTITLE

0.83+

past six yearsDATE

0.8+

Fortune 100ORGANIZATION

0.78+

InventEVENT

0.76+

Predictions 2022: Top Analysts See the Future of Data


 

(bright music) >> In the 2010s, organizations became keenly aware that data would become the key ingredient to driving competitive advantage, differentiation, and growth. But to this day, putting data to work remains a difficult challenge for many, if not most organizations. Now, as the cloud matures, it has become a game changer for data practitioners by making cheap storage and massive processing power readily accessible. We've also seen better tooling in the form of data workflows, streaming, machine intelligence, AI, developer tools, security, observability, automation, new databases and the like. These innovations they accelerate data proficiency, but at the same time, they add complexity for practitioners. Data lakes, data hubs, data warehouses, data marts, data fabrics, data meshes, data catalogs, data oceans are forming, they're evolving and exploding onto the scene. So in an effort to bring perspective to the sea of optionality, we've brought together the brightest minds in the data analyst community to discuss how data management is morphing and what practitioners should expect in 2022 and beyond. Hello everyone, my name is Dave Velannte with theCUBE, and I'd like to welcome you to a special Cube presentation, analysts predictions 2022: the future of data management. We've gathered six of the best analysts in data and data management who are going to present and discuss their top predictions and trends for 2022 in the first half of this decade. Let me introduce our six power panelists. Sanjeev Mohan is former Gartner Analyst and Principal at SanjMo. Tony Baer, principal at dbInsight, Carl Olofson is well-known Research Vice President with IDC, Dave Menninger is Senior Vice President and Research Director at Ventana Research, Brad Shimmin, Chief Analyst, AI Platforms, Analytics and Data Management at Omdia and Doug Henschen, Vice President and Principal Analyst at Constellation Research. Gentlemen, welcome to the program and thanks for coming on theCUBE today. >> Great to be here. >> Thank you. >> All right, here's the format we're going to use. I as moderator, I'm going to call on each analyst separately who then will deliver their prediction or mega trend, and then in the interest of time management and pace, two analysts will have the opportunity to comment. If we have more time, we'll elongate it, but let's get started right away. Sanjeev Mohan, please kick it off. You want to talk about governance, go ahead sir. >> Thank you Dave. I believe that data governance which we've been talking about for many years is now not only going to be mainstream, it's going to be table stakes. And all the things that you mentioned, you know, the data, ocean data lake, lake houses, data fabric, meshes, the common glue is metadata. If we don't understand what data we have and we are governing it, there is no way we can manage it. So we saw Informatica went public last year after a hiatus of six. I'm predicting that this year we see some more companies go public. My bet is on Culebra, most likely and maybe Alation we'll see go public this year. I'm also predicting that the scope of data governance is going to expand beyond just data. It's not just data and reports. We are going to see more transformations like spark jawsxxxxx, Python even Air Flow. We're going to see more of a streaming data. So from Kafka Schema Registry, for example. We will see AI models become part of this whole governance suite. So the governance suite is going to be very comprehensive, very detailed lineage, impact analysis, and then even expand into data quality. We already seen that happen with some of the tools where they are buying these smaller companies and bringing in data quality monitoring and integrating it with metadata management, data catalogs, also data access governance. So what we are going to see is that once the data governance platforms become the key entry point into these modern architectures, I'm predicting that the usage, the number of users of a data catalog is going to exceed that of a BI tool. That will take time and we already seen that trajectory. Right now if you look at BI tools, I would say there a hundred users to BI tool to one data catalog. And I see that evening out over a period of time and at some point data catalogs will really become the main way for us to access data. Data catalog will help us visualize data, but if we want to do more in-depth analysis, it'll be the jumping off point into the BI tool, the data science tool and that is the journey I see for the data governance products. >> Excellent, thank you. Some comments. Maybe Doug, a lot of things to weigh in on there, maybe you can comment. >> Yeah, Sanjeev I think you're spot on, a lot of the trends the one disagreement, I think it's really still far from mainstream. As you say, we've been talking about this for years, it's like God, motherhood, apple pie, everyone agrees it's important, but too few organizations are really practicing good governance because it's hard and because the incentives have been lacking. I think one thing that deserves mention in this context is ESG mandates and guidelines, these are environmental, social and governance, regs and guidelines. We've seen the environmental regs and guidelines and posts in industries, particularly the carbon-intensive industries. We've seen the social mandates, particularly diversity imposed on suppliers by companies that are leading on this topic. We've seen governance guidelines now being imposed by banks on investors. So these ESGs are presenting new carrots and sticks, and it's going to demand more solid data. It's going to demand more detailed reporting and solid reporting, tighter governance. But we're still far from mainstream adoption. We have a lot of, you know, best of breed niche players in the space. I think the signs that it's going to be more mainstream are starting with things like Azure Purview, Google Dataplex, the big cloud platform players seem to be upping the ante and starting to address governance. >> Excellent, thank you Doug. Brad, I wonder if you could chime in as well. >> Yeah, I would love to be a believer in data catalogs. But to Doug's point, I think that it's going to take some more pressure for that to happen. I recall metadata being something every enterprise thought they were going to get under control when we were working on service oriented architecture back in the nineties and that didn't happen quite the way we anticipated. And so to Sanjeev's point it's because it is really complex and really difficult to do. My hope is that, you know, we won't sort of, how do I put this? Fade out into this nebula of domain catalogs that are specific to individual use cases like Purview for getting data quality right or like data governance and cybersecurity. And instead we have some tooling that can actually be adaptive to gather metadata to create something. And I know its important to you, Sanjeev and that is this idea of observability. If you can get enough metadata without moving your data around, but understanding the entirety of a system that's running on this data, you can do a lot. So to help with the governance that Doug is talking about. >> So I just want to add that, data governance, like any other initiatives did not succeed even AI went into an AI window, but that's a different topic. But a lot of these things did not succeed because to your point, the incentives were not there. I remember when Sarbanes Oxley had come into the scene, if a bank did not do Sarbanes Oxley, they were very happy to a million dollar fine. That was like, you know, pocket change for them instead of doing the right thing. But I think the stakes are much higher now. With GDPR, the flood gates opened. Now, you know, California, you know, has CCPA but even CCPA is being outdated with CPRA, which is much more GDPR like. So we are very rapidly entering a space where pretty much every major country in the world is coming up with its own compliance regulatory requirements, data residents is becoming really important. And I think we are going to reach a stage where it won't be optional anymore. So whether we like it or not, and I think the reason data catalogs were not successful in the past is because we did not have the right focus on adoption. We were focused on features and these features were disconnected, very hard for business to adopt. These are built by IT people for IT departments to take a look at technical metadata, not business metadata. Today the tables have turned. CDOs are driving this initiative, regulatory compliances are beating down hard, so I think the time might be right. >> Yeah so guys, we have to move on here. But there's some real meat on the bone here, Sanjeev. I like the fact that you called out Culebra and Alation, so we can look back a year from now and say, okay, he made the call, he stuck it. And then the ratio of BI tools to data catalogs that's another sort of measurement that we can take even though with some skepticism there, that's something that we can watch. And I wonder if someday, if we'll have more metadata than data. But I want to move to Tony Baer, you want to talk about data mesh and speaking, you know, coming off of governance. I mean, wow, you know the whole concept of data mesh is, decentralized data, and then governance becomes, you know, a nightmare there, but take it away, Tony. >> We'll put this way, data mesh, you know, the idea at least as proposed by ThoughtWorks. You know, basically it was at least a couple of years ago and the press has been almost uniformly almost uncritical. A good reason for that is for all the problems that basically Sanjeev and Doug and Brad we're just speaking about, which is that we have all this data out there and we don't know what to do about it. Now, that's not a new problem. That was a problem we had in enterprise data warehouses, it was a problem when we had over DoOP data clusters, it's even more of a problem now that data is out in the cloud where the data is not only your data lake, is not only us three, it's all over the place. And it's also including streaming, which I know we'll be talking about later. So the data mesh was a response to that, the idea of that we need to bait, you know, who are the folks that really know best about governance? It's the domain experts. So it was basically data mesh was an architectural pattern and a process. My prediction for this year is that data mesh is going to hit cold heart reality. Because if you do a Google search, basically the published work, the articles on data mesh have been largely, you know, pretty uncritical so far. Basically loading and is basically being a very revolutionary new idea. I don't think it's that revolutionary because we've talked about ideas like this. Brad now you and I met years ago when we were talking about so and decentralizing all of us, but it was at the application level. Now we're talking about it at the data level. And now we have microservices. So there's this thought of have we managed if we're deconstructing apps in cloud native to microservices, why don't we think of data in the same way? My sense this year is that, you know, this has been a very active search if you look at Google search trends, is that now companies, like enterprise are going to look at this seriously. And as they look at it seriously, it's going to attract its first real hard scrutiny, it's going to attract its first backlash. That's not necessarily a bad thing. It means that it's being taken seriously. The reason why I think that you'll start to see basically the cold hearted light of day shine on data mesh is that it's still a work in progress. You know, this idea is basically a couple of years old and there's still some pretty major gaps. The biggest gap is in the area of federated governance. Now federated governance itself is not a new issue. Federated governance decision, we started figuring out like, how can we basically strike the balance between getting let's say between basically consistent enterprise policy, consistent enterprise governance, but yet the groups that understand the data and know how to basically, you know, that, you know, how do we basically sort of balance the two? There's a huge gap there in practice and knowledge. Also to a lesser extent, there's a technology gap which is basically in the self-service technologies that will help teams essentially govern data. You know, basically through the full life cycle, from develop, from selecting the data from, you know, building the pipelines from, you know, determining your access control, looking at quality, looking at basically whether the data is fresh or whether it's trending off course. So my prediction is that it will receive the first harsh scrutiny this year. You are going to see some organization and enterprises declare premature victory when they build some federated query implementations. You going to see vendors start with data mesh wash their products anybody in the data management space that they are going to say that where this basically a pipelining tool, whether it's basically ELT, whether it's a catalog or federated query tool, they will all going to get like, you know, basically promoting the fact of how they support this. Hopefully nobody's going to call themselves a data mesh tool because data mesh is not a technology. We're going to see one other thing come out of this. And this harks back to the metadata that Sanjeev was talking about and of the catalog just as he was talking about. Which is that there's going to be a new focus, every renewed focus on metadata. And I think that's going to spur interest in data fabrics. Now data fabrics are pretty vaguely defined, but if we just take the most elemental definition, which is a common metadata back plane, I think that if anybody is going to get serious about data mesh, they need to look at the data fabric because we all at the end of the day, need to speak, you know, need to read from the same sheet of music. >> So thank you Tony. Dave Menninger, I mean, one of the things that people like about data mesh is it pretty crisply articulate some of the flaws in today's organizational approaches to data. What are your thoughts on this? >> Well, I think we have to start by defining data mesh, right? The term is already getting corrupted, right? Tony said it's going to see the cold hard light of day. And there's a problem right now that there are a number of overlapping terms that are similar but not identical. So we've got data virtualization, data fabric, excuse me for a second. (clears throat) Sorry about that. Data virtualization, data fabric, data federation, right? So I think that it's not really clear what each vendor means by these terms. I see data mesh and data fabric becoming quite popular. I've interpreted data mesh as referring primarily to the governance aspects as originally intended and specified. But that's not the way I see vendors using it. I see vendors using it much more to mean data fabric and data virtualization. So I'm going to comment on the group of those things. I think the group of those things is going to happen. They're going to happen, they're going to become more robust. Our research suggests that a quarter of organizations are already using virtualized access to their data lakes and another half, so a total of three quarters will eventually be accessing their data lakes using some sort of virtualized access. Again, whether you define it as mesh or fabric or virtualization isn't really the point here. But this notion that there are different elements of data, metadata and governance within an organization that all need to be managed collectively. The interesting thing is when you look at the satisfaction rates of those organizations using virtualization versus those that are not, it's almost double, 68% of organizations, I'm sorry, 79% of organizations that were using virtualized access express satisfaction with their access to the data lake. Only 39% express satisfaction if they weren't using virtualized access. >> Oh thank you Dave. Sanjeev we just got about a couple of minutes on this topic, but I know you're speaking or maybe you've always spoken already on a panel with (indistinct) who sort of invented the concept. Governance obviously is a big sticking point, but what are your thoughts on this? You're on mute. (panelist chuckling) >> So my message to (indistinct) and to the community is as opposed to what they said, let's not define it. We spent a whole year defining it, there are four principles, domain, product, data infrastructure, and governance. Let's take it to the next level. I get a lot of questions on what is the difference between data fabric and data mesh? And I'm like I can't compare the two because data mesh is a business concept, data fabric is a data integration pattern. How do you compare the two? You have to bring data mesh a level down. So to Tony's point, I'm on a warpath in 2022 to take it down to what does a data product look like? How do we handle shared data across domains and governance? And I think we are going to see more of that in 2022, or is "operationalization" of data mesh. >> I think we could have a whole hour on this topic, couldn't we? Maybe we should do that. But let's corner. Let's move to Carl. So Carl, you're a database guy, you've been around that block for a while now, you want to talk about graph databases, bring it on. >> Oh yeah. Okay thanks. So I regard graph database as basically the next truly revolutionary database management technology. I'm looking forward for the graph database market, which of course we haven't defined yet. So obviously I have a little wiggle room in what I'm about to say. But this market will grow by about 600% over the next 10 years. Now, 10 years is a long time. But over the next five years, we expect to see gradual growth as people start to learn how to use it. The problem is not that it's not useful, its that people don't know how to use it. So let me explain before I go any further what a graph database is because some of the folks on the call may not know what it is. A graph database organizes data according to a mathematical structure called a graph. The graph has elements called nodes and edges. So a data element drops into a node, the nodes are connected by edges, the edges connect one node to another node. Combinations of edges create structures that you can analyze to determine how things are related. In some cases, the nodes and edges can have properties attached to them which add additional informative material that makes it richer, that's called a property graph. There are two principle use cases for graph databases. There's semantic property graphs, which are use to break down human language texts into the semantic structures. Then you can search it, organize it and answer complicated questions. A lot of AI is aimed at semantic graphs. Another kind is the property graph that I just mentioned, which has a dazzling number of use cases. I want to just point out as I talk about this, people are probably wondering, well, we have relation databases, isn't that good enough? So a relational database defines... It supports what I call definitional relationships. That means you define the relationships in a fixed structure. The database drops into that structure, there's a value, foreign key value, that relates one table to another and that value is fixed. You don't change it. If you change it, the database becomes unstable, it's not clear what you're looking at. In a graph database, the system is designed to handle change so that it can reflect the true state of the things that it's being used to track. So let me just give you some examples of use cases for this. They include entity resolution, data lineage, social media analysis, Customer 360, fraud prevention. There's cybersecurity, there's strong supply chain is a big one actually. There is explainable AI and this is going to become important too because a lot of people are adopting AI. But they want a system after the fact to say, how do the AI system come to that conclusion? How did it make that recommendation? Right now we don't have really good ways of tracking that. Machine learning in general, social network, I already mentioned that. And then we've got, oh gosh, we've got data governance, data compliance, risk management. We've got recommendation, we've got personalization, anti money laundering, that's another big one, identity and access management, network and IT operations is already becoming a key one where you actually have mapped out your operation, you know, whatever it is, your data center and you can track what's going on as things happen there, root cause analysis, fraud detection is a huge one. A number of major credit card companies use graph databases for fraud detection, risk analysis, tracking and tracing turn analysis, next best action, what if analysis, impact analysis, entity resolution and I would add one other thing or just a few other things to this list, metadata management. So Sanjeev, here you go, this is your engine. Because I was in metadata management for quite a while in my past life. And one of the things I found was that none of the data management technologies that were available to us could efficiently handle metadata because of the kinds of structures that result from it, but graphs can, okay? Graphs can do things like say, this term in this context means this, but in that context, it means that, okay? Things like that. And in fact, logistics management, supply chain. And also because it handles recursive relationships, by recursive relationships I mean objects that own other objects that are of the same type. You can do things like build materials, you know, so like parts explosion. Or you can do an HR analysis, who reports to whom, how many levels up the chain and that kind of thing. You can do that with relational databases, but yet it takes a lot of programming. In fact, you can do almost any of these things with relational databases, but the problem is, you have to program it. It's not supported in the database. And whenever you have to program something, that means you can't trace it, you can't define it. You can't publish it in terms of its functionality and it's really, really hard to maintain over time. >> Carl, thank you. I wonder if we could bring Brad in, I mean. Brad, I'm sitting here wondering, okay, is this incremental to the market? Is it disruptive and replacement? What are your thoughts on this phase? >> It's already disrupted the market. I mean, like Carl said, go to any bank and ask them are you using graph databases to get fraud detection under control? And they'll say, absolutely, that's the only way to solve this problem. And it is frankly. And it's the only way to solve a lot of the problems that Carl mentioned. And that is, I think it's Achilles heel in some ways. Because, you know, it's like finding the best way to cross the seven bridges of Koenigsberg. You know, it's always going to kind of be tied to those use cases because it's really special and it's really unique and because it's special and it's unique, it's still unfortunately kind of stands apart from the rest of the community that's building, let's say AI outcomes, as a great example here. Graph databases and AI, as Carl mentioned, are like chocolate and peanut butter. But technologically, you think don't know how to talk to one another, they're completely different. And you know, you can't just stand up SQL and query them. You've got to learn, know what is the Carl? Specter special. Yeah, thank you to, to actually get to the data in there. And if you're going to scale that data, that graph database, especially a property graph, if you're going to do something really complex, like try to understand you know, all of the metadata in your organization, you might just end up with, you know, a graph database winter like we had the AI winter simply because you run out of performance to make the thing happen. So, I think it's already disrupted, but we need to like treat it like a first-class citizen in the data analytics and AI community. We need to bring it into the fold. We need to equip it with the tools it needs to do the magic it does and to do it not just for specialized use cases, but for everything. 'Cause I'm with Carl. I think it's absolutely revolutionary. >> Brad identified the principal, Achilles' heel of the technology which is scaling. When these things get large and complex enough that they spill over what a single server can handle, you start to have difficulties because the relationships span things that have to be resolved over a network and then you get network latency and that slows the system down. So that's still a problem to be solved. >> Sanjeev, any quick thoughts on this? I mean, I think metadata on the word cloud is going to be the largest font, but what are your thoughts here? >> I want to (indistinct) So people don't associate me with only metadata, so I want to talk about something slightly different. dbengines.com has done an amazing job. I think almost everyone knows that they chronicle all the major databases that are in use today. In January of 2022, there are 381 databases on a ranked list of databases. The largest category is RDBMS. The second largest category is actually divided into two property graphs and IDF graphs. These two together make up the second largest number databases. So talking about Achilles heel, this is a problem. The problem is that there's so many graph databases to choose from. They come in different shapes and forms. To Brad's point, there's so many query languages in RDBMS, in SQL. I know the story, but here We've got cipher, we've got gremlin, we've got GQL and then we're proprietary languages. So I think there's a lot of disparity in this space. >> Well, excellent. All excellent points, Sanjeev, if I must say. And that is a problem that the languages need to be sorted and standardized. People need to have a roadmap as to what they can do with it. Because as you say, you can do so many things. And so many of those things are unrelated that you sort of say, well, what do we use this for? And I'm reminded of the saying I learned a bunch of years ago. And somebody said that the digital computer is the only tool man has ever device that has no particular purpose. (panelists chuckle) >> All right guys, we got to move on to Dave Menninger. We've heard about streaming. Your prediction is in that realm, so please take it away. >> Sure. So I like to say that historical databases are going to become a thing of the past. By that I don't mean that they're going to go away, that's not my point. I mean, we need historical databases, but streaming data is going to become the default way in which we operate with data. So in the next say three to five years, I would expect that data platforms and we're using the term data platforms to represent the evolution of databases and data lakes, that the data platforms will incorporate these streaming capabilities. We're going to process data as it streams into an organization and then it's going to roll off into historical database. So historical databases don't go away, but they become a thing of the past. They store the data that occurred previously. And as data is occurring, we're going to be processing it, we're going to be analyzing it, we're going to be acting on it. I mean we only ever ended up with historical databases because we were limited by the technology that was available to us. Data doesn't occur in patches. But we processed it in patches because that was the best we could do. And it wasn't bad and we've continued to improve and we've improved and we've improved. But streaming data today is still the exception. It's not the rule, right? There are projects within organizations that deal with streaming data. But it's not the default way in which we deal with data yet. And so that's my prediction is that this is going to change, we're going to have streaming data be the default way in which we deal with data and how you label it and what you call it. You know, maybe these databases and data platforms just evolved to be able to handle it. But we're going to deal with data in a different way. And our research shows that already, about half of the participants in our analytics and data benchmark research, are using streaming data. You know, another third are planning to use streaming technologies. So that gets us to about eight out of 10 organizations need to use this technology. And that doesn't mean they have to use it throughout the whole organization, but it's pretty widespread in its use today and has continued to grow. If you think about the consumerization of IT, we've all been conditioned to expect immediate access to information, immediate responsiveness. You know, we want to know if an item is on the shelf at our local retail store and we can go in and pick it up right now. You know, that's the world we live in and that's spilling over into the enterprise IT world We have to provide those same types of capabilities. So that's my prediction, historical databases become a thing of the past, streaming data becomes the default way in which we operate with data. >> All right thank you David. Well, so what say you, Carl, the guy who has followed historical databases for a long time? >> Well, one thing actually, every database is historical because as soon as you put data in it, it's now history. They'll no longer reflect the present state of things. But even if that history is only a millisecond old, it's still history. But I would say, I mean, I know you're trying to be a little bit provocative in saying this Dave 'cause you know, as well as I do that people still need to do their taxes, they still need to do accounting, they still need to run general ledger programs and things like that. That all involves historical data. That's not going to go away unless you want to go to jail. So you're going to have to deal with that. But as far as the leading edge functionality, I'm totally with you on that. And I'm just, you know, I'm just kind of wondering if this requires a change in the way that we perceive applications in order to truly be manifested and rethinking the way applications work. Saying that an application should respond instantly, as soon as the state of things changes. What do you say about that? >> I think that's true. I think we do have to think about things differently. It's not the way we designed systems in the past. We're seeing more and more systems designed that way. But again, it's not the default. And I agree 100% with you that we do need historical databases you know, that's clear. And even some of those historical databases will be used in conjunction with the streaming data, right? >> Absolutely. I mean, you know, let's take the data warehouse example where you're using the data warehouse as its context and the streaming data as the present and you're saying, here's the sequence of things that's happening right now. Have we seen that sequence before? And where? What does that pattern look like in past situations? And can we learn from that? >> So Tony Baer, I wonder if you could comment? I mean, when you think about, you know, real time inferencing at the edge, for instance, which is something that a lot of people talk about, a lot of what we're discussing here in this segment, it looks like it's got a great potential. What are your thoughts? >> Yeah, I mean, I think you nailed it right. You know, you hit it right on the head there. Which is that, what I'm seeing is that essentially. Then based on I'm going to split this one down the middle is that I don't see that basically streaming is the default. What I see is streaming and basically and transaction databases and analytics data, you know, data warehouses, data lakes whatever are converging. And what allows us technically to converge is cloud native architecture, where you can basically distribute things. So you can have a node here that's doing the real-time processing, that's also doing... And this is where it leads in or maybe doing some of that real time predictive analytics to take a look at, well look, we're looking at this customer journey what's happening with what the customer is doing right now and this is correlated with what other customers are doing. So the thing is that in the cloud, you can basically partition this and because of basically the speed of the infrastructure then you can basically bring these together and kind of orchestrate them sort of a loosely coupled manner. The other parts that the use cases are demanding, and this is part of it goes back to what Dave is saying. Is that, you know, when you look at Customer 360, when you look at let's say Smart Utility products, when you look at any type of operational problem, it has a real time component and it has an historical component. And having predictive and so like, you know, my sense here is that technically we can bring this together through the cloud. And I think the use case is that we can apply some real time sort of predictive analytics on these streams and feed this into the transactions so that when we make a decision in terms of what to do as a result of a transaction, we have this real-time input. >> Sanjeev, did you have a comment? >> Yeah, I was just going to say that to Dave's point, you know, we have to think of streaming very different because in the historical databases, we used to bring the data and store the data and then we used to run rules on top, aggregations and all. But in case of streaming, the mindset changes because the rules are normally the inference, all of that is fixed, but the data is constantly changing. So it's a completely reversed way of thinking and building applications on top of that. >> So Dave Menninger, there seem to be some disagreement about the default. What kind of timeframe are you thinking about? Is this end of decade it becomes the default? What would you pin? >> I think around, you know, between five to 10 years, I think this becomes the reality. >> I think its... >> It'll be more and more common between now and then, but it becomes the default. And I also want Sanjeev at some point, maybe in one of our subsequent conversations, we need to talk about governing streaming data. 'Cause that's a whole nother set of challenges. >> We've also talked about it rather in two dimensions, historical and streaming, and there's lots of low latency, micro batch, sub-second, that's not quite streaming, but in many cases its fast enough and we're seeing a lot of adoption of near real time, not quite real-time as good enough for many applications. (indistinct cross talk from panelists) >> Because nobody's really taking the hardware dimension (mumbles). >> That'll just happened, Carl. (panelists laughing) >> So near real time. But maybe before you lose the customer, however we define that, right? Okay, let's move on to Brad. Brad, you want to talk about automation, AI, the pipeline people feel like, hey, we can just automate everything. What's your prediction? >> Yeah I'm an AI aficionados so apologies in advance for that. But, you know, I think that we've been seeing automation play within AI for some time now. And it's helped us do a lot of things especially for practitioners that are building AI outcomes in the enterprise. It's helped them to fill skills gaps, it's helped them to speed development and it's helped them to actually make AI better. 'Cause it, you know, in some ways provide some swim lanes and for example, with technologies like AutoML can auto document and create that sort of transparency that we talked about a little bit earlier. But I think there's an interesting kind of conversion happening with this idea of automation. And that is that we've had the automation that started happening for practitioners, it's trying to move out side of the traditional bounds of things like I'm just trying to get my features, I'm just trying to pick the right algorithm, I'm just trying to build the right model and it's expanding across that full life cycle, building an AI outcome, to start at the very beginning of data and to then continue on to the end, which is this continuous delivery and continuous automation of that outcome to make sure it's right and it hasn't drifted and stuff like that. And because of that, because it's become kind of powerful, we're starting to actually see this weird thing happen where the practitioners are starting to converge with the users. And that is to say that, okay, if I'm in Tableau right now, I can stand up Salesforce Einstein Discovery, and it will automatically create a nice predictive algorithm for me given the data that I pull in. But what's starting to happen and we're seeing this from the companies that create business software, so Salesforce, Oracle, SAP, and others is that they're starting to actually use these same ideals and a lot of deep learning (chuckles) to basically stand up these out of the box flip-a-switch, and you've got an AI outcome at the ready for business users. And I am very much, you know, I think that's the way that it's going to go and what it means is that AI is slowly disappearing. And I don't think that's a bad thing. I think if anything, what we're going to see in 2022 and maybe into 2023 is this sort of rush to put this idea of disappearing AI into practice and have as many of these solutions in the enterprise as possible. You can see, like for example, SAP is going to roll out this quarter, this thing called adaptive recommendation services, which basically is a cold start AI outcome that can work across a whole bunch of different vertical markets and use cases. It's just a recommendation engine for whatever you needed to do in the line of business. So basically, you're an SAP user, you look up to turn on your software one day, you're a sales professional let's say, and suddenly you have a recommendation for customer churn. Boom! It's going, that's great. Well, I don't know, I think that's terrifying. In some ways I think it is the future that AI is going to disappear like that, but I'm absolutely terrified of it because I think that what it really does is it calls attention to a lot of the issues that we already see around AI, specific to this idea of what we like to call at Omdia, responsible AI. Which is, you know, how do you build an AI outcome that is free of bias, that is inclusive, that is fair, that is safe, that is secure, that its audible, et cetera, et cetera, et cetera, et cetera. I'd take a lot of work to do. And so if you imagine a customer that's just a Salesforce customer let's say, and they're turning on Einstein Discovery within their sales software, you need some guidance to make sure that when you flip that switch, that the outcome you're going to get is correct. And that's going to take some work. And so, I think we're going to see this move, let's roll this out and suddenly there's going to be a lot of problems, a lot of pushback that we're going to see. And some of that's going to come from GDPR and others that Sanjeev was mentioning earlier. A lot of it is going to come from internal CSR requirements within companies that are saying, "Hey, hey, whoa, hold up, we can't do this all at once. "Let's take the slow route, "let's make AI automated in a smart way." And that's going to take time. >> Yeah, so a couple of predictions there that I heard. AI simply disappear, it becomes invisible. Maybe if I can restate that. And then if I understand it correctly, Brad you're saying there's a backlash in the near term. You'd be able to say, oh, slow down. Let's automate what we can. Those attributes that you talked about are non trivial to achieve, is that why you're a bit of a skeptic? >> Yeah. I think that we don't have any sort of standards that companies can look to and understand. And we certainly, within these companies, especially those that haven't already stood up an internal data science team, they don't have the knowledge to understand when they flip that switch for an automated AI outcome that it's going to do what they think it's going to do. And so we need some sort of standard methodology and practice, best practices that every company that's going to consume this invisible AI can make use of them. And one of the things that you know, is sort of started that Google kicked off a few years back that's picking up some momentum and the companies I just mentioned are starting to use it is this idea of model cards where at least you have some transparency about what these things are doing. You know, so like for the SAP example, we know, for example, if it's convolutional neural network with a long, short term memory model that it's using, we know that it only works on Roman English and therefore me as a consumer can say, "Oh, well I know that I need to do this internationally. "So I should not just turn this on today." >> Thank you. Carl could you add anything, any context here? >> Yeah, we've talked about some of the things Brad mentioned here at IDC and our future of intelligence group regarding in particular, the moral and legal implications of having a fully automated, you know, AI driven system. Because we already know, and we've seen that AI systems are biased by the data that they get, right? So if they get data that pushes them in a certain direction, I think there was a story last week about an HR system that was recommending promotions for White people over Black people, because in the past, you know, White people were promoted and more productive than Black people, but it had no context as to why which is, you know, because they were being historically discriminated, Black people were being historically discriminated against, but the system doesn't know that. So, you know, you have to be aware of that. And I think that at the very least, there should be controls when a decision has either a moral or legal implication. When you really need a human judgment, it could lay out the options for you. But a person actually needs to authorize that action. And I also think that we always will have to be vigilant regarding the kind of data we use to train our systems to make sure that it doesn't introduce unintended biases. In some extent, they always will. So we'll always be chasing after them. But that's (indistinct). >> Absolutely Carl, yeah. I think that what you have to bear in mind as a consumer of AI is that it is a reflection of us and we are a very flawed species. And so if you look at all of the really fantastic, magical looking supermodels we see like GPT-3 and four, that's coming out, they're xenophobic and hateful because the people that the data that's built upon them and the algorithms and the people that build them are us. So AI is a reflection of us. We need to keep that in mind. >> Yeah, where the AI is biased 'cause humans are biased. All right, great. All right let's move on. Doug you mentioned mentioned, you know, lot of people that said that data lake, that term is not going to live on but here's to be, have some lakes here. You want to talk about lake house, bring it on. >> Yes, I do. My prediction is that lake house and this idea of a combined data warehouse and data lake platform is going to emerge as the dominant data management offering. I say offering that doesn't mean it's going to be the dominant thing that organizations have out there, but it's going to be the pro dominant vendor offering in 2022. Now heading into 2021, we already had Cloudera, Databricks, Microsoft, Snowflake as proponents, in 2021, SAP, Oracle, and several of all of these fabric virtualization/mesh vendors joined the bandwagon. The promise is that you have one platform that manages your structured, unstructured and semi-structured information. And it addresses both the BI analytics needs and the data science needs. The real promise there is simplicity and lower cost. But I think end users have to answer a few questions. The first is, does your organization really have a center of data gravity or is the data highly distributed? Multiple data warehouses, multiple data lakes, on premises, cloud. If it's very distributed and you'd have difficulty consolidating and that's not really a goal for you, then maybe that single platform is unrealistic and not likely to add value to you. You know, also the fabric and virtualization vendors, the mesh idea, that's where if you have this highly distributed situation, that might be a better path forward. The second question, if you are looking at one of these lake house offerings, you are looking at consolidating, simplifying, bringing together to a single platform. You have to make sure that it meets both the warehouse need and the data lake need. So you have vendors like Databricks, Microsoft with Azure Synapse. New really to the data warehouse space and they're having to prove that these data warehouse capabilities on their platforms can meet the scaling requirements, can meet the user and query concurrency requirements. Meet those tight SLS. And then on the other hand, you have the Oracle, SAP, Snowflake, the data warehouse folks coming into the data science world, and they have to prove that they can manage the unstructured information and meet the needs of the data scientists. I'm seeing a lot of the lake house offerings from the warehouse crowd, managing that unstructured information in columns and rows. And some of these vendors, Snowflake a particular is really relying on partners for the data science needs. So you really got to look at a lake house offering and make sure that it meets both the warehouse and the data lake requirement. >> Thank you Doug. Well Tony, if those two worlds are going to come together, as Doug was saying, the analytics and the data science world, does it need to be some kind of semantic layer in between? I don't know. Where are you in on this topic? >> (chuckles) Oh, didn't we talk about data fabrics before? Common metadata layer (chuckles). Actually, I'm almost tempted to say let's declare victory and go home. And that this has actually been going on for a while. I actually agree with, you know, much of what Doug is saying there. Which is that, I mean I remember as far back as I think it was like 2014, I was doing a study. I was still at Ovum, (indistinct) Omdia, looking at all these specialized databases that were coming up and seeing that, you know, there's overlap at the edges. But yet, there was still going to be a reason at the time that you would have, let's say a document database for JSON, you'd have a relational database for transactions and for data warehouse and you had basically something at that time that resembles a dupe for what we consider your data life. Fast forward and the thing is what I was seeing at the time is that you were saying they sort of blending at the edges. That was saying like about five to six years ago. And the lake house is essentially on the current manifestation of that idea. There is a dichotomy in terms of, you know, it's the old argument, do we centralize this all you know in a single place or do we virtualize? And I think it's always going to be a union yeah and there's never going to be a single silver bullet. I do see that there are also going to be questions and these are points that Doug raised. That you know, what do you need for your performance there, or for your free performance characteristics? Do you need for instance high concurrency? You need the ability to do some very sophisticated joins, or is your requirement more to be able to distribute and distribute our processing is, you know, as far as possible to get, you know, to essentially do a kind of a brute force approach. All these approaches are valid based on the use case. I just see that essentially that the lake house is the culmination of it's nothing. It's a relatively new term introduced by Databricks a couple of years ago. This is the culmination of basically what's been a long time trend. And what we see in the cloud is that as we start seeing data warehouses as a check box items say, "Hey, we can basically source data in cloud storage, in S3, "Azure Blob Store, you know, whatever, "as long as it's in certain formats, "like, you know parquet or CSP or something like that." I see that as becoming kind of a checkbox item. So to that extent, I think that the lake house, depending on how you define is already reality. And in some cases, maybe new terminology, but not a whole heck of a lot new under the sun. >> Yeah. And Dave Menninger, I mean a lot of these, thank you Tony, but a lot of this is going to come down to, you know, vendor marketing, right? Some people just kind of co-op the term, we talked about you know, data mesh washing, what are your thoughts on this? (laughing) >> Yeah, so I used the term data platform earlier. And part of the reason I use that term is that it's more vendor neutral. We've tried to sort of stay out of the vendor terminology patenting world, right? Whether the term lake houses, what sticks or not, the concept is certainly going to stick. And we have some data to back it up. About a quarter of organizations that are using data lakes today, already incorporate data warehouse functionality into it. So they consider their data lake house and data warehouse one in the same, about a quarter of organizations, a little less, but about a quarter of organizations feed the data lake from the data warehouse and about a quarter of organizations feed the data warehouse from the data lake. So it's pretty obvious that three quarters of organizations need to bring this stuff together, right? The need is there, the need is apparent. The technology is going to continue to converge. I like to talk about it, you know, you've got data lakes over here at one end, and I'm not going to talk about why people thought data lakes were a bad idea because they thought you just throw stuff in a server and you ignore it, right? That's not what a data lake is. So you've got data lake people over here and you've got database people over here, data warehouse people over here, database vendors are adding data lake capabilities and data lake vendors are adding data warehouse capabilities. So it's obvious that they're going to meet in the middle. I mean, I think it's like Tony says, I think we should declare victory and go home. >> As hell. So just a follow-up on that, so are you saying the specialized lake and the specialized warehouse, do they go away? I mean, Tony data mesh practitioners would say or advocates would say, well, they could all live. It's just a node on the mesh. But based on what Dave just said, are we gona see those all morphed together? >> Well, number one, as I was saying before, there's always going to be this sort of, you know, centrifugal force or this tug of war between do we centralize the data, do we virtualize? And the fact is I don't think that there's ever going to be any single answer. I think in terms of data mesh, data mesh has nothing to do with how you're physically implement the data. You could have a data mesh basically on a data warehouse. It's just that, you know, the difference being is that if we use the same physical data store, but everybody's logically you know, basically governing it differently, you know? Data mesh in space, it's not a technology, it's processes, it's governance process. So essentially, you know, I basically see that, you know, as I was saying before that this is basically the culmination of a long time trend we're essentially seeing a lot of blurring, but there are going to be cases where, for instance, if I need, let's say like, Upserve, I need like high concurrency or something like that. There are certain things that I'm not going to be able to get efficiently get out of a data lake. And, you know, I'm doing a system where I'm just doing really brute forcing very fast file scanning and that type of thing. So I think there always will be some delineations, but I would agree with Dave and with Doug, that we are seeing basically a confluence of requirements that we need to essentially have basically either the element, you know, the ability of a data lake and the data warehouse, these need to come together, so I think. >> I think what we're likely to see is organizations look for a converge platform that can handle both sides for their center of data gravity, the mesh and the fabric virtualization vendors, they're all on board with the idea of this converged platform and they're saying, "Hey, we'll handle all the edge cases "of the stuff that isn't in that center of data gravity "but that is off distributed in a cloud "or at a remote location." So you can have that single platform for the center of your data and then bring in virtualization, mesh, what have you, for reaching out to the distributed data. >> As Dave basically said, people are happy when they virtualized data. >> I think we have at this point, but to Dave Menninger's point, they are converging, Snowflake has introduced support for unstructured data. So obviously literally splitting here. Now what Databricks is saying is that "aha, but it's easy to go from data lake to data warehouse "than it is from databases to data lake." So I think we're getting into semantics, but we're already seeing these two converge. >> So take somebody like AWS has got what? 15 data stores. Are they're going to 15 converge data stores? This is going to be interesting to watch. All right, guys, I'm going to go down and list do like a one, I'm going to one word each and you guys, each of the analyst, if you would just add a very brief sort of course correction for me. So Sanjeev, I mean, governance is going to to be... Maybe it's the dog that wags the tail now. I mean, it's coming to the fore, all this ransomware stuff, which you really didn't talk much about security, but what's the one word in your prediction that you would leave us with on governance? >> It's going to be mainstream. >> Mainstream. Okay. Tony Baer, mesh washing is what I wrote down. That's what we're going to see in 2022, a little reality check, you want to add to that? >> Reality check, 'cause I hope that no vendor jumps the shark and close they're offering a data niche product. >> Yeah, let's hope that doesn't happen. If they do, we're going to call them out. Carl, I mean, graph databases, thank you for sharing some high growth metrics. I know it's early days, but magic is what I took away from that, so magic database. >> Yeah, I would actually, I've said this to people too. I kind of look at it as a Swiss Army knife of data because you can pretty much do anything you want with it. That doesn't mean you should. I mean, there's definitely the case that if you're managing things that are in fixed schematic relationship, probably a relation database is a better choice. There are times when the document database is a better choice. It can handle those things, but maybe not. It may not be the best choice for that use case. But for a great many, especially with the new emerging use cases I listed, it's the best choice. >> Thank you. And Dave Menninger, thank you by the way, for bringing the data in, I like how you supported all your comments with some data points. But streaming data becomes the sort of default paradigm, if you will, what would you add? >> Yeah, I would say think fast, right? That's the world we live in, you got to think fast. >> Think fast, love it. And Brad Shimmin, love it. I mean, on the one hand I was saying, okay, great. I'm afraid I might get disrupted by one of these internet giants who are AI experts. I'm going to be able to buy instead of build AI. But then again, you know, I've got some real issues. There's a potential backlash there. So give us your bumper sticker. >> I'm would say, going with Dave, think fast and also think slow to talk about the book that everyone talks about. I would say really that this is all about trust, trust in the idea of automation and a transparent and visible AI across the enterprise. And verify, verify before you do anything. >> And then Doug Henschen, I mean, I think the trend is your friend here on this prediction with lake house is really becoming dominant. I liked the way you set up that notion of, you know, the data warehouse folks coming at it from the analytics perspective and then you get the data science worlds coming together. I still feel as though there's this piece in the middle that we're missing, but your, your final thoughts will give you the (indistinct). >> I think the idea of consolidation and simplification always prevails. That's why the appeal of a single platform is going to be there. We've already seen that with, you know, DoOP platforms and moving toward cloud, moving toward object storage and object storage, becoming really the common storage point for whether it's a lake or a warehouse. And that second point, I think ESG mandates are going to come in alongside GDPR and things like that to up the ante for good governance. >> Yeah, thank you for calling that out. Okay folks, hey that's all the time that we have here, your experience and depth of understanding on these key issues on data and data management really on point and they were on display today. I want to thank you for your contributions. Really appreciate your time. >> Enjoyed it. >> Thank you. >> Thanks for having me. >> In addition to this video, we're going to be making available transcripts of the discussion. We're going to do clips of this as well we're going to put them out on social media. I'll write this up and publish the discussion on wikibon.com and siliconangle.com. No doubt, several of the analysts on the panel will take the opportunity to publish written content, social commentary or both. I want to thank the power panelists and thanks for watching this special CUBE presentation. This is Dave Vellante, be well and we'll see you next time. (bright music)

Published Date : Jan 7 2022

SUMMARY :

and I'd like to welcome you to I as moderator, I'm going to and that is the journey to weigh in on there, and it's going to demand more solid data. Brad, I wonder if you that are specific to individual use cases in the past is because we I like the fact that you the data from, you know, Dave Menninger, I mean, one of the things that all need to be managed collectively. Oh thank you Dave. and to the community I think we could have a after the fact to say, okay, is this incremental to the market? the magic it does and to do it and that slows the system down. I know the story, but And that is a problem that the languages move on to Dave Menninger. So in the next say three to five years, the guy who has followed that people still need to do their taxes, And I agree 100% with you and the streaming data as the I mean, when you think about, you know, and because of basically the all of that is fixed, but the it becomes the default? I think around, you know, but it becomes the default. and we're seeing a lot of taking the hardware dimension That'll just happened, Carl. Okay, let's move on to Brad. And that is to say that, Those attributes that you And one of the things that you know, Carl could you add in the past, you know, I think that what you have to bear in mind that term is not going to and the data science needs. and the data science world, You need the ability to do lot of these, thank you Tony, I like to talk about it, you know, It's just a node on the mesh. basically either the element, you know, So you can have that single they virtualized data. "aha, but it's easy to go from I mean, it's coming to the you want to add to that? I hope that no vendor Yeah, let's hope that doesn't happen. I've said this to people too. I like how you supported That's the world we live I mean, on the one hand I And verify, verify before you do anything. I liked the way you set up We've already seen that with, you know, the time that we have here, We're going to do clips of this as well

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave MenningerPERSON

0.99+

DavePERSON

0.99+

Dave VellantePERSON

0.99+

Doug HenschenPERSON

0.99+

DavidPERSON

0.99+

Brad ShimminPERSON

0.99+

DougPERSON

0.99+

Tony BaerPERSON

0.99+

Dave VelanntePERSON

0.99+

TonyPERSON

0.99+

CarlPERSON

0.99+

BradPERSON

0.99+

Carl OlofsonPERSON

0.99+

MicrosoftORGANIZATION

0.99+

2014DATE

0.99+

Sanjeev MohanPERSON

0.99+

Ventana ResearchORGANIZATION

0.99+

2022DATE

0.99+

OracleORGANIZATION

0.99+

last yearDATE

0.99+

January of 2022DATE

0.99+

threeQUANTITY

0.99+

381 databasesQUANTITY

0.99+

IDCORGANIZATION

0.99+

InformaticaORGANIZATION

0.99+

SnowflakeORGANIZATION

0.99+

DatabricksORGANIZATION

0.99+

twoQUANTITY

0.99+

SanjeevPERSON

0.99+

2021DATE

0.99+

GoogleORGANIZATION

0.99+

OmdiaORGANIZATION

0.99+

AWSORGANIZATION

0.99+

SanjMoORGANIZATION

0.99+

79%QUANTITY

0.99+

second questionQUANTITY

0.99+

last weekDATE

0.99+

15 data storesQUANTITY

0.99+

100%QUANTITY

0.99+

SAPORGANIZATION

0.99+

Nenshad Bardoliwalla, DataRobot | AWS re:Invent 2021


 

>>Welcome back everybody to AWS reinvent. You're watching the cube, the leader in high tech coverage. My name is Dave Volante with my co-host David Nicholson. We're here all week. We got two sets, 20 plus thousand people here live at AWS reinvent. 21 of course last year was virtual. We got a hybrid event running. We had two studios running before the show running. A lot of pre-records really excited to have ninja Bardelli Walla, who is the chief product officer at data robot. Really interesting AI company. We're going to talk about insights with machine intelligence and then shout. It's great to see you again. It's been awhile. >>Great to see you as well. And I'm so happy to be on the cube. I think eight years since I first came on. >>When you launched the company that you founded back then Peck Sada on the cube, that was part >>Of the inner robot >>Family part of data, robot family. And of course, friend of the cube. Chris Lynch is the executive chairman of data robot. So a lot of connections, I always joke a hundred people in our industry, 99 seats, but tell us about data robot. What's the, what's the scoop these days. >>Thanks. Thanks very much for the opportunity to speak with both of you. Uh, I think we're seeing some very interesting trends. Uh, we've all been in the industry long enough to recognize, uh, that hype cycles they're cycles. They go in waves and, uh, the level of interest in AI has never been higher. Uh, every company in the world is looking for the opportunity to take advantage of AI, to improve their business processes, whether it's to improve their revenue it's to lower their cost profile or it's to lower their risk. What we're seeing that's most interesting is that, uh, we spend a lot of time working with companies on what we consider applied AI. That is how do we solve real business problems, uh, with the technology and not just run a bunch of experiments. You know, it's very tempting for a lot of us, Dave and David, uh, to, to do, uh, you know, spin up a spark cluster with 10,000 nodes and slosh a bunch of data through it. >>But the question we always ask at data robot is what is the business value of doing this? Why are we using these AI techniques and in order to solve what problem? So the biggest trend we see a data robot and one that we feel we're very well positioned to solve is that companies are coming out of that experimental phase. There's still a lot of experimentation going on and they're saying, okay, we, we stood up a cluster. Uh, we got a bunch of Python notebooks running around here, but we haven't really seen a return on our investment yet data robot, can you help us actually make AI real and concrete in terms of achieving a specific business outcome for us? >>Well, and I want to test something on your niche. That's something we've talked about a lot on the cube is a change in the way in which companies are architecting their data. When we first, it was like, okay, create a Hadoop cluster. And that spark came along to make that easier, but it was still this highly technical, highly centralized, hyper specialized roles where the business, people who have a really good understanding of the outcome had to kind of beg to get what they wanted because it was so technical and the success was defined as, Hey, it worked or we ran the experiment and it looks like it has promise. So now it seems like with companies like data robot, you're democratizing AI, allowing organizations to inject AI into their business processes, their applications. And it seems to be more business led. One of you could comment on that. >>I think that is a various dude observation. Uh, we launched this concept a little bit earlier this year of AI cloud. And the idea behind AI cloud is if you want to democratize AI, which is in fact has been DataRobot's vision since 2012, we were the first company on the cloud. The first AI cloud that ever existed was data robots in 2014. And the entire idea was that we knew that data scientists would always play a very important role in an organization, but yet the demand for AI would vastly outstrip the supply. And so in order to solve that challenge, we built AI cloud. We've actually spent over a million engineering hours in building this technology over the, over the last decade and put this together in a way where all of the different personas and the organizations, you have people who create AI applications. >>Those are the folks we usually think about, but those are the data scientists. Those are the analysts, those are the data engineers, but then you actually have to put it into production. You've got to run the system. So you also have to democratize this capability for the folks who are going to operate the system for the folks in risk and compliance. We're actually going to, uh, ensure that the system is operating in accordance with your policies and compliance regimes. And then the third wave of democratization, which we've just embarked on is then how do you bring AI into the hands of the actual business people? How do you put on a mobile device or a web browser, or in context, in an application with the decision, the ability for AI to drive a decision in your organization, which leads to an action, which helps drive you towards the outcome you're trying to optimize for. >>So AI cloud is about this pervasive tapestry, bringing together the creators, the consumers, the individuals who operate these systems into a single system that can lower the barrier to entry for people who don't have the skills, but allow you to plug in and go deep underneath the covers and modify whatever you need to, if you have that level of technical skill and that ability for us to kind of slide, slide the slider in one direction or the other, I could slide it to the right and say, I want all automation, something data robot has pioneered and is absolutely the leader in, but we can also, especially in these last couple of years, say, I want to be able to use as much code as I want to bring in. And the beauty of the model is that customers can choose how much they want to let the machine drive or how much they want to let the human being drive. David. I love that, >>That idea of a slider, because now you're talking about generalists getting access to really powerful tools. >>Yeah, no, exactly. And I, I'm curious, what's your view on where we are culturally with AI at this point? And what I mean by culturally is the idea that, okay, that's great. You put powerful tools in the hands of business users. Um, do most of us still need to have a lot of visibility under the covers to understand the inner workings so that we trust what we're being told? You know, I'm fine pulling a lever and having a little biscuit come out of SWOT as long as I've gotten a tour of the kitchen at some point in time. Yes. I mean, where are we with that? Where where's the level of >>Absolutely fantastic question and it's one that's, it's actually pervasive to the way data robot operates. So trust gets, uh, engendered by multiple different capabilities that you build throughout the platform. The first one is around, uh, explainability. So when you get a prediction from a system, just like you mentioned, you know, if, if the stakes are not very high, you know, you, uh, we're here in Las Vegas, of course I'm thinking of slot machines. If you get a biscuit at the end of it and it tastes pretty good. Hey, great. Right? When you're making a mission critical business decision, you don't want to be in the position where you don't understand why the system is making the decision. It does. So we have historically invested an enormous amount of effort in explainability tools, having the system actually at a prediction level, explain to you, why is it making the recommendation it's making? >>For example, the system says this customer has a high likelihood of churn. Why? Because their account balance has been declining over the last five months. Uh, number two, because their credit score has been going down. And what gives you the trust is actually the machine and the human able to communicate in the same language and same vernacular about the business value. So that's one part of it. The second part is about transparency, right? So one of the things that the automated machine learning movement, that data robot pioneered, uh, has been, I'd say rightfully criticized for frankly, is that it's too much of a black box. It's too much magic. I load my dataset. I press the start button and data robot does everything else for me. Well, that's not very satisfying when you have a 10 or a hundred million dollar decision coming on the other side, even if the technology is actually doing the job correctly, which data robot usually does. >>So where we've morphed and evolved our position in the market and where I have driven our technology portfolio at data robot is to say, you know what? There is a very important aspect of trust that needs to be brought to bear here, which is that if somebody wants to see code, let them see code. And in fact, the beauty of AI cloud is that on the same platform, the people who don't like code, but are, are very good at understanding the business domain con uh, the business domain knowledge and the context. They now have the ability to do that. But when they're at the stage before they're going to deploy anything to production. Now you can raise your hand at data robot and actually use our workflow and say, I need a coder to review this. I want the professional data scientist who has all this knowledge who understands and has read up on the latest advances in hyper parameter tuning to look at the model and tell me that this is going to be okay. And so we allow both the less technical folks and the very deep technical data scientists, the ability to collaborate on the same environment, which allows you to build trust in terms of the human side of, Hey, I don't want to just let anybody throw a model into production. I like, >>I mean, I see those, the transparency and the explainability is almost two sides of the same coin, right? Because you know, if you're gonna be accused of gender bias, you can say, no, here's how the system may, it's not like, you know, you think about the internet. It tells you it's a cat, but you don't really know how the machine determined that you're breaking apart, blowing away that black box. And the other thing I like what you said was you have data producers and data consumers, and you also talked about context because a lot of times the data producers, they don't necessarily care about the context or the PI data pipeline. People necessarily care about the context. So, okay. So now we're at the point where you're democratizing data, you're doing some great work. What are some of the blockers that you see today that you're obliterating with data robot? Maybe you could talk about that a little bit. Sure. >>So, so I think, uh, you know, one very important concept is that, uh, in a democracy, we talked about democratization. You still have rules, you still have governance. It's not a free for all the free for all version of that is called NRG. That's not what any company wants, right? So we have to blend the freedom and flexibility that we want businesses to have with the compliance and regulatory observability that we need in order to be successful. So what we're seeing in, in our, in our customer base and what companies are coming to data robot to discuss is, okay, we've tried these experiments. Now we want to actually get to real business value. And one of the things that's really unique about data robot is that we have put, uh, we have, we've worked in our system on over 1 million projects, training models, inside data robot. >>We have seen every type of use case across different industries, whether it's healthcare or manufacturing, uh, or, or retail, uh, we have the ability to understand those different data sets and actually to come up with models. So we have that breadth of information there if you aggregate that over time, right? So again, we did not come to AI. This is not a fad for us. We didn't start as one kind of company than slap the AI label on and say, Hey, we're an AI company now, right? We have been AI native since day one. And in that process, what we have found is working on these, this million plus projects on these data sets across these industries, we have a very good sense of which projects will actually deliver value and which don't. And that gets to a previous point that you were making, which is that you have to know and partner with an organization who it's not just about the technology. So we have fantastic people who we call our customer facing data scientists who will tell the customer, look, I know you think this is a really high value use case, but we've tried it at other customers. And unfortunately it didn't work very well. Let's steer you, cause you need with a, with a technology that is largely at the early stage and the maturity that organizations have with it, you need to help them in order to deliver success. And no vendor has delivered more successful production deployment of AI than data road. >>No, don't go down that path. It's a dead end as a cul-de-sac. So just avoid it. So we talked about transparency, explainability governance. Can you get that to the point where it's self-serve as you, as you put data in the hands of business, people where the context lives, the domain experts, can you get to self-serve and federate that governance? Yes. >>So you can, uh, that's one of the key principles of what we, what we do at data robot. And it comes back to a concept that I learned, uh, you, you both will remember. We were in the Sarbanes-Oxley crazy world of, I dunno, was that 15 years of saved data warehousing. >>Everybody wanted to talk about socks. You know, my wife would hear me on the phone. She'd be like, what is your sudden obsession with socks? I'm like, no, no, it's not what you fit. And so, um, but what came from Sarbanes Oxley are, are these, uh, longstanding principles around the segregation of duties and segregation of responsibilities. You can have democracy democratization with governance, if you have the right segregation of duties. So for example, I have somebody who can generate lots of different models, right? But I don't allow them to, to, uh, in a self-service way, just deploy into production. I actually have a workflow system which will go through multiple rigorous approvals and say, these three people have signed off, they've done an audit, uh, an, an audit assessment of this model. It's good to go, let's go and drop it into production. So the way that you get to self-service with governance is to have the right controls and policies and frameworks that surround the self-service model with the right checks and balances that implement the segregation of duties I'm talking >>And you get that right. And then you can automate it and then you can really scale, right? You gotta have your back because it's such a great topic. We, we barely scratched the surface. It was great to see you again, congratulations on all the success. And, uh, as I say any time, let's do this again. Fantastic. Thank >>You so much. All right, you're welcome. And thank you for watching you watching the cubes coverage of AWS reinvent 2021, Dave Volante for David Nicholson. Keep it right there. You're watching the cube, the leader in high-tech coverage.

Published Date : Dec 2 2021

SUMMARY :

It's great to see you again. Great to see you as well. And of course, friend of the cube. Dave and David, uh, to, to do, uh, you know, spin up a spark cluster with 10,000 So the biggest trend we see a data robot and one that we feel we're very well positioned to the outcome had to kind of beg to get what they wanted because it was so And the idea behind AI cloud is if you want So you also have to democratize this capability for the folks who are going to operate the system that can lower the barrier to entry for people who don't have the skills, That idea of a slider, because now you're talking about generalists getting access to really the inner workings so that we trust what we're being told? So when you get a prediction from a system, just like you mentioned, you know, if, if the stakes are not very high, And what gives you the trust is actually the same environment, which allows you to build trust in terms of the human side of, And the other thing I like what you said And one of the things that's really unique about data robot is that we have put, the maturity that organizations have with it, you need to help them in order to deliver success. people where the context lives, the domain experts, can you get to self-serve and federate that governance? And it comes back to a concept that I learned, uh, you, you both will remember. So the way that you get to self-service And then you can automate it and then you can really scale, right? And thank you for watching you watching the cubes coverage of AWS reinvent 2021,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
David NicholsonPERSON

0.99+

2014DATE

0.99+

Dave VolantePERSON

0.99+

Chris LynchPERSON

0.99+

DavidPERSON

0.99+

DavePERSON

0.99+

Las VegasLOCATION

0.99+

15 yearsQUANTITY

0.99+

99 seatsQUANTITY

0.99+

second partQUANTITY

0.99+

one partQUANTITY

0.99+

two setsQUANTITY

0.99+

two studiosQUANTITY

0.99+

2012DATE

0.99+

last yearDATE

0.99+

Nenshad BardoliwallaPERSON

0.99+

three peopleQUANTITY

0.99+

Bardelli WallaPERSON

0.99+

10QUANTITY

0.99+

AWSORGANIZATION

0.99+

10,000 nodesQUANTITY

0.99+

PythonTITLE

0.99+

firstQUANTITY

0.99+

bothQUANTITY

0.99+

first oneQUANTITY

0.99+

eight yearsQUANTITY

0.98+

oneQUANTITY

0.98+

todayDATE

0.98+

one directionQUANTITY

0.97+

over a millionQUANTITY

0.97+

20 plus thousand peopleQUANTITY

0.96+

DataRobotORGANIZATION

0.96+

OneQUANTITY

0.95+

first companyQUANTITY

0.95+

over 1 million projectsQUANTITY

0.95+

21QUANTITY

0.92+

last decadeDATE

0.9+

SarbanesLOCATION

0.9+

two sidesQUANTITY

0.89+

one kindQUANTITY

0.89+

single systemQUANTITY

0.89+

third wave of democratizationEVENT

0.87+

million plus projectsQUANTITY

0.87+

NRGORGANIZATION

0.86+

earlier this yearDATE

0.85+

a hundred peopleQUANTITY

0.83+

a hundred million dollarQUANTITY

0.81+

InventEVENT

0.74+

last couple of yearsDATE

0.73+

last five monthsDATE

0.67+

number twoQUANTITY

0.66+

Peck SadaORGANIZATION

0.6+

2021DATE

0.6+

Sarbanes OxleyORGANIZATION

0.59+

AWS reinventEVENT

0.57+

conceptQUANTITY

0.55+

day oneQUANTITY

0.53+

OxleyLOCATION

0.36+

Breaking Analysis: How Snowflake Plans to Change a Flawed Data Warehouse Model


 

>> From theCUBE Studios in Palo Alto in Boston, bringing you data-driven insights from theCUBE in ETR. This is Breaking Analysis with Dave Vellante. >> Snowflake is not going to grow into its valuation by stealing the croissant from the breakfast table of the on-prem data warehouse vendors. Look, even if snowflake got 100% of the data warehouse business, it wouldn't come close to justifying its market cap. Rather Snowflake has to create an entirely new market based on completely changing the way organizations think about monetizing data. Every organization I talk to says it wants to be, or many say they already are data-driven. why wouldn't you aspire to that goal? There's probably nothing more strategic than leveraging data to power your digital business and creating competitive advantage. But many businesses are failing, or I predict, will fail to create a true data-driven culture because they're relying on a flawed architectural model formed by decades of building centralized data platforms. Welcome everyone to this week's Wikibon Cube Insights powered by ETR. In this Breaking Analysis, I want to share some new thoughts and fresh ETR data on how organizations can transform their businesses through data by reinventing their data architectures. And I want to share our thoughts on why we think Snowflake is currently in a very strong position to lead this effort. Now, on November 17th, theCUBE is hosting the Snowflake Data Cloud Summit. Snowflake's ascendancy and its blockbuster IPO has been widely covered by us and many others. Now, since Snowflake went public, we've been inundated with outreach from investors, customers, and competitors that wanted to either better understand the opportunities or explain why their approach is better or different. And in this segment, ahead of Snowflake's big event, we want to share some of what we learned and how we see it. Now, theCUBE is getting paid to host this event, so I need you to know that, and you draw your own conclusions from my remarks. But neither Snowflake nor any other sponsor of theCUBE or client of SiliconANGLE Media has editorial influence over Breaking Analysis. The opinions here are mine, and I would encourage you to read my ethics statement in this regard. I want to talk about the failed data model. The problem is complex, I'm not debating that. Organizations have to integrate data and platforms with existing operational systems, many of which were developed decades ago. And as a culture and a set of processes that have been built around these systems, and they've been hardened over the years. This chart here tries to depict the progression of the monolithic data source, which, for me, began in the 1980s when Decision Support Systems or DSS promised to solve our data problems. The data warehouse became very popular and data marts sprung up all over the place. This created more proprietary stovepipes with data locked inside. The Enron collapse led to Sarbanes-Oxley. Now, this tightened up reporting. The requirements associated with that, it breathed new life into the data warehouse model. But it remained expensive and cumbersome, I've talked about that a lot, like a snake swallowing a basketball. The 2010s ushered in the big data movement, and Data Lakes emerged. With a dupe, we saw the idea of no schema online, where you put structured and unstructured data into a repository, and figure it all out on the read. What emerged was a fairly complex data pipeline that involved ingesting, cleaning, processing, analyzing, preparing, and ultimately serving data to the lines of business. And this is where we are today with very hyper specialized roles around data engineering, data quality, data science. There's lots of batch of processing going on, and Spark has emerged to improve the complexity associated with MapReduce, and it definitely helped improve the situation. We're also seeing attempts to blend in real time stream processing with the emergence of tools like Kafka and others. But I'll argue that in a strange way, these innovations actually compound the problem. And I want to discuss that because what they do is they heighten the need for more specialization, more fragmentation, and more stovepipes within the data life cycle. Now, in reality, and it pains me to say this, it's the outcome of the big data movement, as we sit here in 2020, that we've created thousands of complicated science projects that have once again failed to live up to the promise of rapid cost-effective time to insights. So, what will the 2020s bring? What's the next silver bullet? You hear terms like the lakehouse, which Databricks is trying to popularize. And I'm going to talk today about data mesh. These are other efforts they look to modernize datalakes and sometimes merge the best of data warehouse and second-generation systems into a new paradigm, that might unify batch and stream frameworks. And this definitely addresses some of the gaps, but in our view, still suffers from some of the underlying problems of previous generation data architectures. In other words, if the next gen data architecture is incremental, centralized, rigid, and primarily focuses on making the technology to get data in and out of the pipeline work, we predict it's going to fail to live up to expectations again. Rather, what we're envisioning is an architecture based on the principles of distributed data, where domain knowledge is the primary target citizen, and data is not seen as a by-product, i.e, the exhaust of an operational system, but rather as a service that can be delivered in multiple forms and use cases across an ecosystem. This is why we often say the data is not the new oil. We don't like that phrase. A specific gallon of oil can either fuel my home or can lubricate my car engine, but it can't do both. Data does not follow the same laws of scarcity like natural resources. Again, what we're envisioning is a rethinking of the data pipeline and the associated cultures to put data needs of the domain owner at the core and provide automated, governed, and secure access to data as a service at scale. Now, how is this different? Let's take a look and unpack the data pipeline today and look deeper into the situation. You all know this picture that I'm showing. There's nothing really new here. The data comes from inside and outside the enterprise. It gets processed, cleanse or augmented so that it can be trusted and made useful. Nobody wants to use data that they can't trust. And then we can add machine intelligence and do more analysis, and finally deliver the data so that domain specific consumers can essentially build data products and services or reports and dashboards or content services, for instance, an insurance policy, a financial product, a loan, that these are packaged and made available for someone to make decisions on or to make a purchase. And all the metadata associated with this data is packaged along with the dataset. Now, we've broken down these steps into atomic components over time so we can optimize on each and make them as efficient as possible. And down below, you have these happy stick figures. Sometimes they're happy. But they're highly specialized individuals and they each do their job and they do it well to make sure that the data gets in, it gets processed and delivered in a timely manner. Now, while these individual pieces seemingly are autonomous and can be optimized and scaled, they're all encompassed within the centralized big data platform. And it's generally accepted that this platform is domain agnostic. Meaning the platform is the data owner, not the domain specific experts. Now there are a number of problems with this model. The first, while it's fine for organizations with smaller number of domains, organizations with a large number of data sources and complex domain structures, they struggle to create a common data parlance, for example, in a data culture. Another problem is that, as the number of data sources grows, organizing and harmonizing them in a centralized platform becomes increasingly difficult, because the context of the domain and the line of business gets lost. Moreover, as ecosystems grow and you add more data, the processes associated with the centralized platform tend to get further genericized. They again lose that domain specific context. Wait (chuckling), there are more problems. Now, while in theory organizations are optimizing on the piece parts of the pipeline, the reality is, as the domain requires a change, for example, a new data source or an ecosystem partnership requires a change in access or processes that can benefit a domain consumer, the reality is the change is subservient to the dependencies and the need to synchronize across these discrete parts of the pipeline or actually, orthogonal to each of those parts. In other words, in actuality, the monolithic data platform itself remains the most granular part of the system. Now, when I complain about this faulty structure, some folks tell me this problem has been solved. That there are services that allow new data sources to really easily be added. A good example of this is Databricks Ingest, which is, it's an auto loader. And what it does is it simplifies the ingestion into the company's Delta Lake offering. And rather than centralizing in a data warehouse, which struggles to efficiently allow things like Machine Learning frameworks to be incorporated, this feature allows you to put all the data into a centralized datalake. More so the argument goes, that the problem that I see with this, is while the approach does definitely minimizes the complexities of adding new data sources, it still relies on this linear end-to-end process that slows down the introduction of data sources from the domain consumer beside of the pipeline. In other words, the domain experts still has to elbow her way into the front of the line or the pipeline, in this case, to get stuff done. And finally, the way we are organizing teams is a point of contention, and I believe is going to continue to cause problems down the road. Specifically, we've again, we've optimized on technology expertise, where for example, data engineers, well, really good at what they do, they're often removed from the operations of the business. Essentially, we created more silos and organized around technical expertise versus domain knowledge. As an example, a data team has to work with data that is delivered with very little domain specificity, and serves a variety of highly specialized consumption use cases. All right. I want to step back for a minute and talk about some of the problems that people bring up with Snowflake and then I'll relate it back to the basic premise here. As I said earlier, we've been hammered by dozens and dozens of data points, opinions, criticisms of Snowflake. And I'll share a few here. But I'll post a deeper technical analysis from a software engineer that I found to be fairly balanced. There's five Snowflake criticisms that I'll highlight. And there are many more, but here are some that I want to call out. Price transparency. I've had more than a few customers telling me they chose an alternative database because of the unpredictable nature of Snowflake's pricing model. Snowflake, as you probably know, prices based on consumption, just like AWS and other cloud providers. So just like AWS, for example, the bill at the end of the month is sometimes unpredictable. Is this a problem? Yes. But like AWS, I would say, "Kill me with that problem." Look, if users are creating value by using Snowflake, then that's good for the business. But clearly this is a sore point for some users, especially for procurement and finance, which don't like unpredictability. And Snowflake needs to do a better job communicating and managing this issue with tooling that can predict and help better manage costs. Next, workload manage or lack thereof. Look, if you want to isolate higher performance workloads with Snowflake, you just spin up a separate virtual warehouse. It's kind of a brute force approach. It works generally, but it will add expense. I'm kind of reminded of Pure Storage and its approach to storage management. The engineers at Pure, they always design for simplicity, and this is the approach that Snowflake is taking. Usually, Pure and Snowflake, as I have discussed in a moment, is Pure's ascendancy was really based largely on stealing share from Legacy EMC systems. Snowflake, in my view, has a much, much larger incremental market opportunity. Next is caching architecture. You hear this a lot. At the end of the day, Snowflake is based on a caching architecture. And a caching architecture has to be working for some time to optimize performance. Caches work well when the size of the working set is small. Caches generally don't work well when the working set is very, very large. In general, transactional databases have pretty small datasets. And in general, analytics datasets are potentially much larger. Is it Snowflake in the analytics business? Yes. But the good thing that Snowflake has done is they've enabled data sharing, and it's caching architecture serves its customers well because it allows domain experts, you're going to hear this a lot from me today, to isolate and analyze problems or go after opportunities based on tactical needs. That said, very big queries across whole datasets or badly written queries that scan the entire database are not the sweet spot for Snowflake. Another good example would be if you're doing a large audit and you need to analyze a huge, huge dataset. Snowflake's probably not the best solution. Complex joins, you hear this a lot. The working set of complex joins, by definition, are larger. So, see my previous explanation. Read only. Snowflake is pretty much optimized for read only data. Maybe stateless data is a better way of thinking about this. Heavily right intensive workloads are not the wheelhouse of Snowflake. So where this is maybe an issue is real-time decision-making and AI influencing. A number of times, Snowflake, I've talked about this, they might be able to develop products or acquire technology to address this opportunity. Now, I want to explain. These issues would be problematic if Snowflake were just a data warehouse vendor. If that were the case, this company, in my opinion, would hit a wall just like the NPP vendors that proceeded them by building a better mouse trap for certain use cases hit a wall. Rather, my promise in this episode is that the future of data architectures will be really to move away from large centralized warehouses or datalake models to a highly distributed data sharing system that puts power in the hands of domain experts at the line of business. Snowflake is less computationally efficient and less optimized for classic data warehouse work. But it's designed to serve the domain user much more effectively in our view. We believe that Snowflake is optimizing for business effectiveness, essentially. And as I said before, the company can probably do a better job at keeping passionate end users from breaking the bank. But as long as these end users are making money for their companies, I don't think this is going to be a problem. Let's look at the attributes of what we're proposing around this new architecture. We believe we'll see the emergence of a total flip of the centralized and monolithic big data systems that we've known for decades. In this architecture, data is owned by domain-specific business leaders, not technologists. Today, it's not much different in most organizations than it was 20 years ago. If I want to create something of value that requires data, I need to cajole, beg or bribe the technology and the data team to accommodate. The data consumers are subservient to the data pipeline. Whereas in the future, we see the pipeline as a second class citizen, with a domain expert is elevated. In other words, getting the technology and the components of the pipeline to be more efficient is not the key outcome. Rather, the time it takes to envision, create, and monetize a data service is the primary measure. The data teams are cross-functional and live inside the domain versus today's structure where the data team is largely disconnected from the domain consumer. Data in this model, as I said, is not the exhaust coming out of an operational system or an external source that is treated as generic and stuffed into a big data platform. Rather, it's a key ingredient of a service that is domain-driven and monetizable. And the target system is not a warehouse or a lake. It's a collection of connected domain-specific datasets that live in a global mesh. What is a distributed global data mesh? A data mesh is a decentralized architecture that is domain aware. The datasets in the system are purposely designed to support a data service or data product, if you prefer. The ownership of the data resides with the domain experts because they have the most detailed knowledge of the data requirement and its end use. Data in this global mesh is governed and secured, and every user in the mesh can have access to any dataset as long as it's governed according to the edicts of the organization. Now, in this model, the domain expert has access to a self-service and obstructed infrastructure layer that is supported by a cross-functional technology team. Again, the primary measure of success is the time it takes to conceive and deliver a data service that could be monetized. Now, by monetize, we mean a data product or data service that it either cuts cost, it drives revenue, it saves lives, whatever the mission is of the organization. The power of this model is it accelerates the creation of value by putting authority in the hands of those individuals who are closest to the customer and have the most intimate knowledge of how to monetize data. It reduces the diseconomies at scale of having a centralized or a monolithic data architecture. And it scales much better than legacy approaches because the atomic unit is a data domain, not a monolithic warehouse or a lake. Zhamak Dehghani is a software engineer who is attempting to popularize the concept of a global mesh. Her work is outstanding, and it's strengthened our belief that practitioners see this the same way that we do. And to paraphrase her view, "A domain centric system must be secure and governed with standard policies across domains." It has to be trusted. As I said, nobody's going to use data they don't trust. It's got to be discoverable via a data catalog with rich metadata. The data sets have to be self-describing and designed for self-service. Accessibility for all users is crucial as is interoperability, without which distributed systems, as we know, fail. So what does this all have to do with Snowflake? As I said, Snowflake is not just a data warehouse. In our view, it's always had the potential to be more. Our assessment is that attacking the data warehouse use cases, it gave Snowflake a straightforward easy-to-understand narrative that allowed it to get a foothold in the market. Data warehouses are notoriously expensive, cumbersome, and resource intensive, but they're a critical aspect to reporting and analytics. So it was logical for Snowflake to target on-premise legacy data warehouses and their smaller cousins, the datalakes, as early use cases. By putting forth and demonstrating a simple data warehouse alternative that can be spun up quickly, Snowflake was able to gain traction, demonstrate repeatability, and attract the capital necessary to scale to its vision. This chart shows the three layers of Snowflake's architecture that have been well-documented. The separation of compute and storage, and the outer layer of cloud services. But I want to call your attention to the bottom part of the chart, the so-called Cloud Agnostic Layer that Snowflake introduced in 2018. This layer is somewhat misunderstood. Not only did Snowflake make its Cloud-native database compatible to run on AWS than Azure in the 2020 GCP, what Snowflake has done is to obstruct cloud infrastructure complexity and create what it calls the data cloud. What's the data cloud? We don't believe the data cloud is just a marketing term that doesn't have any substance. Just as SAS is Simplified Application Software and iOS made it possible to eliminate the value drain associated with provisioning infrastructure, a data cloud, in concept, can simplify data access, and break down fragmentation and enable shared data across the globe. Snowflake, they have a first mover advantage in this space, and we see a number of fundamental aspects that comprise a data cloud. First, massive scale with virtually unlimited compute and storage resource that are enabled by the public cloud. We talk about this a lot. Second is a data or database architecture that's built to take advantage of native public cloud services. This is why Frank Slootman says, "We've burned the boats. We're not ever doing on-prem. We're all in on cloud and cloud native." Third is an obstruction layer that hides the complexity of infrastructure. and fourth is a governed and secured shared access system where any user in the system, if allowed, can get access to any data in the cloud. So a key enabler of the data cloud is this thing called the global data mesh. Now, earlier this year, Snowflake introduced its global data mesh. Over the course of its recent history, Snowflake has been building out its data cloud by creating data regions, strategically tapping key locations of AWS regions and then adding Azure and GCP. The complexity of the underlying cloud infrastructure has been stripped away to enable self-service, and any Snowflake user becomes part of this global mesh, independent of the cloud that they're on. Okay. So now, let's go back to what we were talking about earlier. Users in this mesh will be our domain owners. They're building monetizable services and products around data. They're most likely dealing with relatively small read only datasets. They can adjust data from any source very easily and quickly set up security and governance to enable data sharing across different parts of an organization, or, very importantly, an ecosystem. Access control and governance is automated. The data sets are addressable. The data owners have clearly defined missions and they own the data through the life cycle. Data that is specific and purposely shaped for their missions. Now, you're probably asking, "What happens to the technical team and the underlying infrastructure and the cluster it's in? How do I get the compute close to the data? And what about data sovereignty and the physical storage later, and the costs?" All these are good questions, and I'm not saying these are trivial. But the answer is these are implementation details that are pushed to a self-service layer managed by a group of engineers that serves the data owners. And as long as the domain expert/data owner is driving monetization, this piece of the puzzle becomes self-funding. As I said before, Snowflake has to help these users to optimize their spend with predictive tooling that aligns spend with value and shows ROI. While there may not be a strong motivation for Snowflake to do this, my belief is that they'd better get good at it or someone else will do it for them and steal their ideas. All right. Let me end with some ETR data to show you just how Snowflake is getting a foothold on the market. Followers of this program know that ETR uses a consistent methodology to go to its practitioner base, its buyer base each quarter and ask them a series of questions. They focus on the areas that the technology buyer is most familiar with, and they ask a series of questions to determine the spending momentum around a company within a specific domain. This chart shows one of my favorite examples. It shows data from the October ETR survey of 1,438 respondents. And it isolates on the data warehouse and database sector. I know I just got through telling you that the world is going to change and Snowflake's not a data warehouse vendor, but there's no construct today in the ETR dataset to cut a data cloud or globally distributed data mesh. So you're going to have to deal with this. What this chart shows is net score in the y-axis. That's a measure of spending velocity, and it's calculated by asking customers, "Are you spending more or less on a particular platform?" And then subtracting the lesses from the mores. It's more granular than that, but that's the basic concept. Now, on the x-axis is market share, which is ETR's measure of pervasiveness in the survey. You can see superimposed in the upper right-hand corner, a table that shows the net score and the shared N for each company. Now, shared N is the number of mentions in the dataset within, in this case, the data warehousing sector. Snowflake, once again, leads all players with a 75% net score. This is a very elevated number and is higher than that of all other players, including the big cloud companies. Now, we've been tracking this for a while, and Snowflake is holding firm on both dimensions. When Snowflake first hit the dataset, it was in the single digits along the horizontal axis and continues to creep to the right as it adds more customers. Now, here's another chart. I call it the wheel chart that breaks down the components of Snowflake's net score or spending momentum. The lime green is new adoption, the forest green is customers spending more than 5%, the gray is flat spend, the pink is declining by more than 5%, and the bright red is retiring the platform. So you can see the trend. It's all momentum for this company. Now, what Snowflake has done is they grabbed a hold of the market by simplifying data warehouse. But the strategic aspect of that is that it enables the data cloud leveraging the global mesh concept. And the company has introduced a data marketplace to facilitate data sharing across ecosystems. This is all about network effects. In the mid to late 1990s, as the internet was being built out, I worked at IDG with Bob Metcalfe, who was the publisher of InfoWorld. During that time, we'd go on speaking tours all over the world, and I would listen very carefully as he applied Metcalfe's law to the internet. Metcalfe's law states that the value of the network is proportional to the square of the number of connected nodes or users on that system. Said another way, while the cost of adding new nodes to a network scales linearly, the consequent value scores scales exponentially. Now, apply that to the data cloud. The marginal cost of adding a user is negligible, practically zero, but the value of being able to access any dataset in the cloud... Well, let me just say this. There's no limitation to the magnitude of the market. My prediction is that this idea of a global mesh will completely change the way leading companies structure their businesses and, particularly, their data architectures. It will be the technologists that serve domain specialists as it should be. Okay. Well, what do you think? DM me @dvellante or email me at david.vellante@siliconangle.com or comment on my LinkedIn? Remember, these episodes are all available as podcasts, so please subscribe wherever you listen. I publish weekly on wikibon.com and siliconangle.com, and don't forget to check out etr.plus for all the survey analysis. This is Dave Vellante for theCUBE Insights powered by ETR. Thanks for watching. Be well, and we'll see you next time. (upbeat music)

Published Date : Nov 14 2020

SUMMARY :

This is Breaking Analysis and the data team to accommodate.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

Frank SlootmanPERSON

0.99+

Bob MetcalfePERSON

0.99+

Zhamak DehghaniPERSON

0.99+

MetcalfePERSON

0.99+

AWSORGANIZATION

0.99+

100%QUANTITY

0.99+

Palo AltoLOCATION

0.99+

November 17thDATE

0.99+

75%QUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

fiveQUANTITY

0.99+

2020DATE

0.99+

SnowflakeTITLE

0.99+

1,438 respondentsQUANTITY

0.99+

2018DATE

0.99+

OctoberDATE

0.99+

david.vellante@siliconangle.comOTHER

0.99+

todayDATE

0.99+

more than 5%QUANTITY

0.99+

theCUBE StudiosORGANIZATION

0.99+

FirstQUANTITY

0.99+

2020sDATE

0.99+

Snowflake Data Cloud SummitEVENT

0.99+

SecondQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

both dimensionsQUANTITY

0.99+

theCUBEORGANIZATION

0.99+

iOSTITLE

0.99+

DSSORGANIZATION

0.99+

1980sDATE

0.99+

each companyQUANTITY

0.99+

decades agoDATE

0.98+

zeroQUANTITY

0.98+

firstQUANTITY

0.98+

2010sDATE

0.98+

each quarterQUANTITY

0.98+

ThirdQUANTITY

0.98+

20 years agoDATE

0.98+

DatabricksORGANIZATION

0.98+

earlier this yearDATE

0.98+

bothQUANTITY

0.98+

PureORGANIZATION

0.98+

fourthQUANTITY

0.98+

IDGORGANIZATION

0.97+

TodayDATE

0.97+

eachQUANTITY

0.97+

Decision Support SystemsORGANIZATION

0.96+

BostonLOCATION

0.96+

single digitsQUANTITY

0.96+

siliconangle.comOTHER

0.96+

oneQUANTITY

0.96+

SparkTITLE

0.95+

Legacy EMCORGANIZATION

0.95+

KafkaTITLE

0.94+

LinkedInORGANIZATION

0.94+

SnowflakeEVENT

0.92+

first moverQUANTITY

0.92+

AzureTITLE

0.91+

InfoWorldORGANIZATION

0.91+

dozens andQUANTITY

0.91+

mid toDATE

0.91+

Kent Graziano and Felipe Hoffa, Snowflake | Snowflake Data Cloud Summit 2020


 

(upbeat music) >> From the CUBE studios in Palo Alto, in Boston, connecting with thought leaders all around the world. This is a CUBE conversation. >> Hi everyone, this is Dave Vellante from the CUBE. And we're getting ready for the Snowflake Data cloud summit four geographies, eight tracks more than 40 sessions for this global event. Starts on November 17th, where we're tracking the rise of the Data cloud. You're going to hear a lot about that, now, by now, you know, the story of Snowflake or you know, what maybe you don't but a new type of cloud native database was introduced in the middle part of last decade. And a new set of analytics workloads has emerged that is powering a transformation within organizations. And it's doing this by putting data at the core of businesses and organizations. You know, for years we marched to the cadence of Moore's law. That was the innovation engine of our industry, but now that's changed it's data plus machine intelligence plus cloud. That's the new innovation cocktail for the technology industry and industries overall. And at the Data cloud summit we'll hear from Snowflake executives, founders, technologists, customers, and ecosystems partners. And of course, you going to hear from interviews on the CUBE. So, let's dig in a little bit more and help me are two Snowflake experts. Felipe Hoffa is a data cloud advocate and Kent Graziano is a chief technical evangelist post at Snowflake. Gents, great to see you. Thanks for coming on. >> Yeah, thanks for having us on, this is great. >> Thank you. >> So guys first, I got to congratulate you on getting to this point. You've achieved beyond escape velocity and obviously one of the most important IPOs of the year, but you got a lot of work to do. I know that what, what are the substantive aspects behind the Data cloud? >> I mean, it's a new concept right? We've been talking about infrastructure clouds and SaaS applications living in application clouds and Data cloud is the ability to really share all that data that we've been collected. You know, we've spent what how many a decade or more with big data now but have we been able to use it effectively? And that's really where the Data cloud is coming in and Snowflake and making that a more seamless, friendly, easy experience to get access to the data. I've been in data warehousing for nearly 30 years now. And our dream has always been to be able to augment an organization's analytics with data from outside their organization. And that's just been a massive pain in the neck with having to move files around and replicate the data and maybe losing track of where it came from or where it went. And the Data cloud is really giving our customers the ability to do that in a much more governed way, a much more seamless way and really make it push button to give anyone access to the data they need and have the performance to do the analytics in near real time. It's total game changer is as you already know and just it's crazy what we're able to do today compared it to what we could do when I started out in my career. >> Well, I'm going to come back to that 'cause I want to tap your historical perspective, but Felipe let me ask you, So, why did you join Snowflake? You're you're the newbie here? What attracted you? >> Exactly? I'm the newbie, I used to work at Google until August. I was there for 10 years. I was a developer advocate there also for data you might have heard about the BigQuery. I was doing a lot of that. And then as time went by Snowflake started showing up more and more in my feeds within my customers in my community. And it came the time, well, I felt that like, you know, when wherever you're working, once in a while you think I should leave this place I should try something new, I should move my career forward. While at Google, I thought that so many times, as anyone would do, and it was only when Snowflake showed up, like where Snowflake is going now, why Snowflake is being received by all the customers that I saw this opportunity. And I decided that moving to Snowflake would be a step forward for me. And so far I'm pretty happy, like the timing has been incredible, but more than the timing and everything, it's really, really a great place for data. What I love first is data, sharing data, analyzing data and how Snowflake is doing it's for me to mean phenomenal. >> So, Kent, I want to come back to you and I say tap maybe your historical perspective here. And you said it's always been a dream that you could do these other things bringing in external data. I would say this, that I don't want to push a little bit on this because I have often said that the EDW marketplace really never lived up to its promises of 360 degree views of the customer real time or near real time analytics. And, and it really has been as you kind of described are a real challenge for a lot of organizations. When Hadoop came in we got excited that it was going to actually finally live up to that vision and, and duped it a lot and don't get me wrong, I mean, the whole concept of bring that compute to data and lowering the cost and so forth. But it certainly didn't minimize complexity. And, and it seems like, feels like Snowflake is on the cusp of actually delivering on that promise that we've been talking about for 30 years. I wonder, if you could share your perspective is it, are we going to get there this time? >> Yeah. And as far as I can tell working with all of our customers some of them are there. I mean, they thought through those struggles that you were talking about that I saw throughout my career and now with getting on Snowflake they're delivering customer 360 they're integrating weblogs and IOT data with structured data from their ERP systems or CRM systems, their supply chain systems. And it really is coming to fruition. I mean, the industry leaders, you know, Bill Inman and Claudia Imhoff, they've had this vision the whole time but the technology just wasn't able to support it. And the cloud, as we said about the internet, changed everything. And then Ben wine teary, and they're in their vision and building the system, taking the best concepts from the Hadoop world and the data Lake world and the enterprise data warehouse world and putting it all together into this, this architecture that's now Snowflake and the Data cloud solve it. I mean, it's the classic benefit of hindsight is 2020 after years in the industry, they'd seen these problems and said like, how can we solve them? Does the Cloud let us solve these problems? And the answer was yes, but it did require writing everything from scratch and starting over with, because the architecture of the Cloud just allows you to do things that you just couldn't do before. >> Yeah. I'm glad you brought up you know, some of the originators of the data warehouse because it really wasn't their fault. They were trying to solve a problem. It was the marketers that took it and really kind of made promises that they couldn't keep. But, the reality is when you talk to customers in the so old EDW days and this is the other thing I want to tap you guys' brains on. It was very challenging. I mean, one customer one time referred to it as a snake, swallowing a basketball. And what he meant by that is every time there's a change Sarbanes Oxley comes and we have to ingest all this new data. It's like, Oh, it's to say everything slows down to a grinding halt. Every time Intel came out with a new microprocessor, they would go out and grab a new server as fast as they possibly could. He called it chasing the chips and it was this endless cycle of pain. And so, you know, the originators of the data whereas they didn't have the compute power they didn't have the Cloud. And so, and of course they didn't have the 30, 40 years of pain to draw upon. But I wonder if you could, could maybe talk a little bit about the kinds of things that can be done now that we haven't been able to do here to form. >> Well, yeah. I remember early on having a conversation with Bill about this idea of near real time data warehousing and saying, is this real, is this something really people need? And at the time he was a couple of decades ago, he said now to them they just want to load their data sooner than once a month. That was the goal. And that was going to be near real time for them. And, but now I'm seeing it with our customers. It's like, now we can do it, you know, with things like the Kafka technology and snow pipe in Snowflake that people are able to get that refresh way faster and have near real time analytics access to that data in a much more timely manner. And so it really is coming true. And the, the compute power that's there, as you said, we've now got this compute power in the Cloud that we never dreamed of. I mean, you would think of only certain, very large, massive global companies or governments could afford super computers. And that's what it would have taken. And now we've got nearly the power of a super computer in our mobile device that we all carry around with us. So being able to harness all that now in the Cloud is really opening up opportunities to do things with data and access data in a way that, again really, we just kind of dreamed of before as like we can democratize data when we get to this point. And I think that's where we are. We're at that inflection point where now it's possible to do it. So the challenge on organizations is going to be how do we do it effectively? How do we do it with agility? And how do we do it in a governed manner? You mentioned Sarbanes Oxley, GDPR, CCPA, all of those are out there. And so we have all of that as well. And so that's where we're going to get into it, right into the governance and being able to do that in a very quick, flexible, extensible manner and Snowflakes really letting people do it now. >> Well, yeah. And you know, again, we've been talking about Hadoop and I, again, for all my fond thoughts of that era, and it's not like Hadoop is gone but it was a lot of excitement around it, but governance was a huge problem. And it was kind of a bolt on. Now, Felipe I going to ask you, like, when you think about a company like Google, your former employer, you know, data is at the core of their business. And so many companies the data is not at the core of their business. Something else is, it's a process or a manufacturing facility or whatever it is. And the data is sort of on the outskirts. You know, we often talk about in, in stove pipes. And so we're now seeing organizations really put data at the core of their, it becomes central to their DNA. I'm curious as to your thoughts on that. And also, if you've got a lot of experience with developers, is there a developer angle here in this new data world? >> For sure, I mean, I love seeing everything like throughout my career at Google and my two months here and talking to so many companies, you never thought before like these are database companies but they are the ones that keep rowing. The ones that keep moving to the next stage of their development is because they are focusing on data. They are adapting the processes, they are learning from it. Me, I focus a lot on developers. So, I met when I started this career as an advocate of first, I was a software engineer and my work so far, has we worked, I really loved talking to the engineers on the other companies. Like, maybe I'm not the one solving the business problem, but at the end of the day, when these companies have a business problem that they want to grow, they want to have data. There are other engineers that are scientists like me that want to work for the company and bring the best technology to solve the problems. And Yeah, there's so much where data can help, yes, as we evolved the system for the company, and also for us, for understanding the systems things like of survivability, and recently there was a big company a big launch on survivability (indistinct) whether they are running all of their data warehousing needs. And all of that needs on snowflake, just because running these massive systems and being able to see how they're working generates a lot of data. And then how do you manage it? How do you analyze it? Or Snowflake is really there to help cover the two areas. >> It's interesting my business partner, John farrier cohost of the CUBE, he said, gosh I would say middle of the last decade, maybe even around the time 2013, when Snowflake was just coming out, he said, he predicted the data would be the new development kit. And it's really at the center of a lot of the data life cycle the what I call the data pipelines. I know people use that term differently but I'm very excited about the Data cloud summit and what we're going to learn there. And I get to interview a lot of really cool people. So, I appreciate you guys coming up, but, Kent who should attend the Data cloud summit, I mean, what should they expect to learn? >> Well, as you said earlier, Dave, there's so many tracks and there's really kind of something for everyone. So, we've got a track on unlocking the value of the Data cloud, which is really going to speak to the business leaders, you know, as to what that vision is, what can we do from an organizational perspective with the Data cloud to get that value from the data to move our businesses forward. But we've also done for the technicians migrating to snowflake. Sessions on how to do the migration, modernizing your data Lake, data science, how to do analytics with the, and data science in Snowflake and in the Data cloud, and even down to building apps. So the developers and building data products. So, you know, we've got stuff for developers, we've got stuff for data scientists. We've got stuff for the data architects like myself and the data engineers on how to build all of this out. And then there's going to be some industry solution spotlights as well. So we can talk about different verticals folks in FinTech and healthcare, there's going to be stuff for them. And then for our data superheroes we have a hallway track where we're going to get talks from the folks that are in our data superheroes which is really our community advocacy program. So these are folks who are out there in the trenches using Snowflake delivering value at their organizations. And they're going to talk down and dirty. How did they make this stuff happen? So it's going to be to some hope, really something for everyone, fireside chats with our executives. Of course something I'm really looking forward to myself. So was fun to hear from Frank and Christian and Benoit about what's the next big thing, what are we doing now? Where are we going with all of this? And then there is going to be a some awards we'll be giving out our data driver awards for our most innovative customers. So this is going to be a lot, a lot for everybody to consume and enjoy and learn about this, this new space of, of the Data cloud. >> Well, thank you for that Kent. And I'll second that, at least there's going to be a lot for everybody. If you're an existing Snowflake customer there's going to be plenty of two or one content, we can get in to the how to use and the best practice, if you're really not that familiar with Snowflake, or you're not a customer, there's a lot of one-on-one content going on. So, Felipe, I'd love to hear from you what people can expect at the Data cloud summit. >> Totally, so I would like to plus one to everyone that can say we have a phenomenal schedule that they, the executive will be there. I really wanted to especially highlight the session I'm preparing with Trevor Noah. I'm sure you might have heard of him. And we are having him at the Data cloud summit and we are going to have a session. We are going to talk about data. We are preparing a session. That's all about how people that love data that people that want to make that actionable. How can they bring storytelling and make it more, have more impact as he has well learn to do through his life? >> That's awesome, So, we have Trevor Noah, we're not just going to totally geek out here. we're going to have some great entertainment as well. So, I want you to go to snowflake.com and click on Data cloud summit 2020 there's four geos. It starts on November 17th and then runs through the week and in the following week in Japan. So, so check that out. We'll see you there. This is Dave Vellante for the CUBE. Thanks for watching. (upbeat music)

Published Date : Oct 20 2020

SUMMARY :

From the CUBE studios And at the Data cloud summit Yeah, thanks for having and obviously one of the most our customers the ability to do that And I decided that moving to Snowflake of the customer real time And the cloud, as we in the so old EDW days And at the time he was And the data is sort of on the outskirts. and bring the best technology And it's really at the center of a lot and in the Data cloud, and and the best practice, if at the Data cloud summit and in the following week in Japan.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

Felipe HoffaPERSON

0.99+

Claudia ImhoffPERSON

0.99+

FelipePERSON

0.99+

30QUANTITY

0.99+

GoogleORGANIZATION

0.99+

Palo AltoLOCATION

0.99+

DavePERSON

0.99+

John farrierPERSON

0.99+

JapanLOCATION

0.99+

November 17thDATE

0.99+

Trevor NoahPERSON

0.99+

Bill InmanPERSON

0.99+

10 yearsQUANTITY

0.99+

Kent GrazianoPERSON

0.99+

two monthsQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

BostonLOCATION

0.99+

CUBEORGANIZATION

0.99+

KentPERSON

0.99+

360 degreeQUANTITY

0.99+

2020DATE

0.99+

IntelORGANIZATION

0.99+

oneQUANTITY

0.99+

twoQUANTITY

0.99+

BillPERSON

0.99+

AugustDATE

0.99+

FrankPERSON

0.99+

2013DATE

0.98+

eight tracksQUANTITY

0.98+

one timeQUANTITY

0.98+

firstQUANTITY

0.98+

Data cloud summit 2020EVENT

0.98+

two areasQUANTITY

0.98+

Data cloudEVENT

0.97+

HadoopTITLE

0.97+

todayDATE

0.97+

30 yearsQUANTITY

0.96+

one customerQUANTITY

0.96+

last decadeDATE

0.96+

nearly 30 yearsQUANTITY

0.96+

SnowflakeTITLE

0.96+

more than 40 sessionsQUANTITY

0.96+

BenoitPERSON

0.96+

MoorePERSON

0.96+

four geosQUANTITY

0.96+

once a monthQUANTITY

0.95+

EDWORGANIZATION

0.94+

Snowflake Data cloudEVENT

0.94+

GDPRTITLE

0.94+

Data cloudORGANIZATION

0.93+

Snowflake Data Cloud Summit 2020EVENT

0.92+

SnowflakeEVENT

0.92+

four geographiesQUANTITY

0.92+

ChristianPERSON

0.91+

BigQueryTITLE

0.9+

couple of decades agoDATE

0.89+

HadoopLOCATION

0.88+

Data cloud summitEVENT

0.87+

40 yearsQUANTITY

0.84+

Sarbanes OxleyCOMMERCIAL_ITEM

0.84+

BenPERSON

0.82+

KafkaORGANIZATION

0.79+

snowflake.comOTHER

0.75+

a decadeQUANTITY

0.74+

Sarbanes OxleyORGANIZATION

0.7+

Kent Graziano and Felipe Hoffa, Snowflake | Snowflake Data Cloud Summit 2020


 

>> (Instructor)From the cube studios in Palo Alto, in Boston, connecting with thought leaders all around the world. This is a cube conversation. >> Hi everyone. This is Dave Volante, the cube, and we're getting ready for the snowflake data cloud summit four geographies eight tracks, more than 40 sessions for this global event starts on November 17th, where we're tracking the rise of the data cloud. You're going to hear a lot about that now by now, you know the story of Snowflake or you know, what maybe you don't, but a new type of cloud native database was introduced in the middle part of last decade. And a new set of analytics workloads has emerged that is powering a transformation within organizations. And it's doing this by putting data at the core of businesses and organizations. You know for years, we marched to the cadence of Moore's law. That was the innovation engine of our industry, but now that's changed it's data plus machine intelligence plus cloud. That's the new innovation cocktail for the technology industry and industries overall. And at the data cloud summit, we'll hear from snowflake executives, founders, technologists, customers, and ecosystems partners. And of course, you're going to hear from interviews on the cube. So let's dig in a little bit more and to help me, are two snowflake experts, Filipe Hoffa is a data cloud advocate and Kent Graziano is a chief technical evangelists post at Snowflake. Gents great to see you. Thanks for coming on. >> Yeah thanks for having us on this is great. >> Thank you. >> So guys, first, I got to congratulate you on getting to this point. You've achieved beyond escape velocity, and obviously one of the most important IPOs of the year, but you got a lot of work to do I know that Filipe, let me start with you data cloud. What's a data cloud and what are we going to learn about it at the data cloud summit? >> Oh, that's an excellent question. And let me tell you a little bit about our story here. And I really, really, really admire what Kent has done. I joined the snowflake like less than two months ago, and for me it's been a huge learning experience. And I look up to Kent a lot on how we deliver the message and how do we deliver all of that. So I would love to hear his answer first. >> Okay, that's cool. Okay Kent later on. So talk of data cloud, that's a catchy phrase, right? But it vectors into at least two of the components of my innovation, innovation cocktail. What, what are the substantive substantive aspects behind the data cloud? >> I mean, it's a, it's a new concept, right? We've been talking about infrastructure clouds and SAS applications living in an application clouds so data cloud is the ability to really share all that data that we've been collecting. You know, we've, we've spent what, how many days a decade or more with big data now, but have we been able to use it effectively? And that's, that's really where the data cloud is coming in and snowflake in making that a more seamless, friendly, easy experience to get access to the data. I've been in data warehousing for nearly 30 years now. And our dream has always been to be able to augment an organization's analytics with data from outside their organization. And that's just been a massive pain in the neck with having to move files around and replicate the data and maybe losing track of where it came from or where it went. And the data cloud is really giving our customers the ability to do that in a much more governed way, a much more seamless way, and really make it push button to give anyone access to the data they need and have the performance to do the analytics in near real time. It's it's total game changer as, as you already know, and just it's crazy what we're able to do today, compared to what we could do when I started out in my career. >> Well, I'm going to come back to that cause I want to tap your historical perspective, but Filipe, let me ask you. So why did you join snowflake? You're you're the newbie here. What attracted you? >> Exactly, I'm the newbie. I used to work at Google until August. I was there for 10 years. I was a developer advocate there also for data, you might have heard about a big query. I was doing a lot of that and then as time went by, Snowflake started showing up more and more in my feeds, within my customers, in my community. And it came the time. When, I felt that like, you know, when wherever you're working, once in a while you think I should leave this place, I should try something new. I should move my career forward. While at Google, I thought that so many times as anyone would do, and it was only when snowflake showed up, like where snowflake is going now, how snowflake is, is being received by all the customers that I saw this opportunity. And I decided that moving to Snowflake would be a step forward for me. And so far I'm pretty happy. Like the timing has been incredible, but more than the timing and everything, it's really, really a great place for data. What I love first is data sharing data, analyzing data and how Snowflake is doing it it promotes me in phenomena. >> So Ken, I want to come back to you and I say, tap, maybe your historical perspective here. And you said, you know, it's always been a dream that you could do these other things bring in external data. I would say this, that I don't want to push a little bit on this because I have often said that the EDW marketplace really never lived up to its promises of 360 degree views of the customer in real time or near real time analytics. And, and it really has been, as you kind of described are a real challenge for a lot of organizations when Hadoop came in you know, we had, we we we got excited that it was kind of going to actually finally live up to that vision and and and we duped it a lot. And it don't get me wrong. I mean, the whole concept of, you know, bring the compute to data and the lowering the cost and so forth, but it certainly didn't minimize complexity. And, and it seems like, feels like Snowflake is on the cusp of actually delivering that promise that we've been talking about for 30 years. I wonder if you could share your perspective, is it, are we going to get there this time? >> Yeah. And as far as I can tell working with all of our customers, some of them are there. I mean, they're, they Fought through those struggles that you were talking about that I saw throughout my career and now with getting on Snowflake they're, they're delivering customer 360, they're integrating weblogs and IOT data with structured data from their ERP systems or CRM systems, their supply chain systems. And it really is coming to fruition. I mean, the, you know, the industry leaders, you know, Bill Inman and Claudia M Hoff, they've had this vision the whole time, but the technology just wasn't able to support it. And the cloud, as we said about the internet, changed everything and then Ben Y and Terry, in their vision and building the system, taking the best concepts from the Hadoop world and the data Lake world and the enterprise data warehouse world, and putting it all together into this, this architecture, that's now, you know Snowflake and the data cloud solved it. I mean, it's the, you know, the, the classic benefit of her insight is 2020 after years in the industry, they had seen these problems and said like, how can we solve them? Does the cloud let us solve these problems? And the answer was yes, but it did require writing everything from scratch and starting over with because the architecture the cloud just allows you to do things that you just couldn't do before. Yeah I'm glad you brought up, you know, some of the originators of the data warehouse, because it really wasn't their fault. They were trying to solve a problem. That was the marketers that took it and really kind of made promises that they couldn't keep. But the reality is when you talk to customers in the, in the, so the old EDW days, and this is the other thing I want to, I want to tap your guys' brains on. It was very challenging. I mean, one, one customer, one time referred to it as a snake, swallowing a basketball. And what he meant by that is you know, every time there's a change, you know, Sarbanes Oxley comes and we have to ingest all this new data. It's like, Oh, it's just everything slows down to a grinding halt. Every time Intel came out with a new microprocessor, they would go out and grab a new server as fast as they possibly could. He called it chasing the chips, and it was this endless cycle of pain. And so, you know, the originators of the data whereas they didn't, they didn't have you know the compute power, they didn't have the cloud. >> Yeah. >> And so, and of course they didn't have the 30- 40 years of pain to draw upon. But, but I wonder if you could, could maybe talk a little bit about the kinds of things that can be done now that we haven't been able to do here before. >> Well, yeah I remember early on having a conversation with, with Bill about this idea of near real time data warehousing and saying, is this real? Is this something really need people need? And at the time it was, was a couple of decades ago, he said no to them they just want to load their data sooner than once a month. >> Yeah. >> That was the goal. And that was going to be near real time for them. And, but now I'm seeing it with our customers. It's like, now we can do it, you know, with things like the Kafka technology and snow pipe in, in Snowflake, that people are able to get that refresh way faster and have near real time analytics access to that data in a much more timely manner. And so it really is coming true. And the, the compute power that's there, as you said, you know we, we've now got this compute power in the cloud that we never dreamed of. I mean, you would think of only certain very large, massive global companies or governments could afford supercomputers. And that's what it would have taken. And now we've got nearly the power of a supercomputer in our mobile device that we all carry around with us. So being able to harness all that now in the cloud is really opening up opportunities to do things with data and access data in a way that again really we just kind of dreamed of before. It's like, we can, we can democratize data when we get to this point. And I think that's the, that's where we are, we're at that inflection point where now it's, it's possible to do it. So the challenge on organizations is going to be, how do we do it effectively? How do we do it with agility? And how do we do it in a governed manner? You mentioned Sarbanes Oxley, GDPR, CCPA, all of those are out there. And so we have all of that as well. And so that's where, that's where we're going to get into it, right. Is into the governance and being able to do that in a very quick, flexible, extensible manner and you know, Snowflakes really letting people do it now. >> Well, yeah and you know, again, we've been talking about Hadoop and again, for all my, my fond thoughts of that era, and it's not like hadoop is gone, but, but it was a lot of excitement around it but but governance was a huge problem and it was kind of a ball tough enough. Felipe I got to ask you, like when you think about a company like Google your former employer, you know, data is at the core of their business. And so many companies, the data is not at the core of their business. Something else is it's a process or a manufacturing facility or you know whatever it is. And the data is sort of on the outskirts. You know, we often talk about in, in stove pipes. And so we're now seeing organizations really put data at the core of their it becomes, you know, central to their, to their DNA. I'm curious as to your thoughts on that. And also if you've got a lot of experience with developers, is there, is there a developer angle here in this new data world? >> Oh, for sure. I mean, I love seeing every, like throughout my career at Google and my two months here and talking to so many companies, you never thought before, like these are database companies, but the the ones that keep rowing. The ones that keep moving to the next stage of their development is because they are focusing on data. They are adapting the processes they learning from it. And me, I focus a lot on developers. So I mean when I started This career as an advocate. First I was a software engineer and my work so far, has been work, I really loved talking to the engineers on the other companies. Like maybe I'm not the one solving the business problem, but at the end of the day, when these companies have a business problem that they want to row, they want to have data. There are other engineers that are scientists likes me that are, that, that want to work for work for the company and bring the best technology to solve the problems. Yeah, there's so much where data can help as we evolve the system for the company. And also for us for understanding the systems, things like observability and recently, there was a big company, a big launch on observability the company name is observable, where they are running all of their data warehousing needs. And all of their data needs on Snowflake, just because running these massive systems and being able to see how they're working generates a lot of data. And then how do you manage it? How do you analyze it? Or snowflake is already there to help. >> Well you know >> I covered the two areas. >> It's interesting my, my business partner, John farrier, cohost of the cube, he said, gosh, I would say middle of the last decade, maybe even around the time, you know, 2013, when Snowflake was just coming out, he said, he predicted the data would be the new development kit. And you know, it's really at the center of a lot of, you know, the data life cycle, the, the, what I call the data pipelines. I know people use that term differently, but, but I'm, I'm very excited about the data cloud summit and what we're going to learn there. And I get to interview a lot of really cool people. And so I appreciate you guys coming on, but Kent, who, who should attend the data cloud summit, I mean, what, what are the, what should they expect to learn? >> Well, as you said earlier, Dave, there's, there's so many tracks and there's really kind of something for everyone. So we've got a track on unlocking the value of the data cloud, which is really going to speak to, you know, the business leaders, you know, as to what that vision is, what can we do from an organizational perspective, with the data cloud to get that value from the data to, to move our businesses forward. But we've also got, you know, for the technicians migrating to Snowflake training sessions on how to do the migration, modernizing your data like data science, you know how to do analytics with the, and data science in Snowflake and in the data cloud and even down to building apps. So the developers and building data products. So, you know, we've got stuff for developers, we've got stuff for data scientists. We've got stuff for the, the data architects like myself and the data engineers on how to, how to build all of this out. And then there's going to be some industry solutions spotlights as well. So we can talk about different verticals of folks in FinTech and, and in healthcare. There's going to be stuff for them. And then for our, our data superheroes, we have a hallway track where we're going to get talks from the folks that are in our data superheroes, which is really our community advocacy program. So these are folks who are out there in the trenches using Snowflake, delivering value at, at their organizations. And they're going to talk you know down and dirty. How did they make this stuff happen? So there's going to be just really something for everyone, fireside chats with our executives, of course, something I'm really looking forward to in myself. It's always fun to, to hear from Frank and Christian. And Benwah about, you know, what's the next big thing, you know, what are we doing now? Where are we going with all of this? And then there is going to be some awards. We'll be giving out our data driver awards for our most innovative customers. So this is going to be a lot, a lot for everybody to consume and enjoy and learn about this, this new space of, of the data cloud. >> Well, thank you for that Kent. And I'll second that, I mean, there's going to be a lot for everybody. If you're an existing Snowflake customer, there's going to be plenty of two on one content we can get in to the how to's and the best practice. If you're really not that familiar with Snowflake, or you're not a customer, there's a lot of one-on-one content going on. If you're an investor and you want to figure out, okay, what is this vision? And can, you know, will this company grow into its massive valuation and how are they going to do that? I think you're going to, you're going to hear about the data cloud and really try get a perspective. And you can make your own judgment as to, to, you know, whether or not you think that it's going to be as large a market as many people think. So Felipe, I'd love to hear from you what people can expect at the data cloud summit. >> Totally, so I would love to plus one to everyone that Kent said. We have a phenomenal schedule that the the executive will be there. And I really wanted to specially highlight the session I'm preparing with Trevor Noah. I'm sure you might have heard of him. And we are having him at the data cloud summit, and we are going to have a session. We're going to talk about data. We are preparing a session, That's all about how people that love data, that people that want to make data actionable. How can they bring storytelling and make it more, have more impact as he has well learned to do through his life. >> That's awesome, So yeah, Trevor Noah, we're not just going to totally geek out here. We're going to, we're going to have some great entertainment as well. So I want you to go to snowflake.com and click on data cloud summit, 2020 there's four geos. It starts on November 17th and then runs through the week and then the following week in Japan. So, so check that out. We'll see you there. This is Dave Volante for the cube. Thanks for watching. (soft music)

Published Date : Oct 16 2020

SUMMARY :

(Instructor)From the cube And at the data cloud summit, us on this is great. and obviously one of the most And let me tell you a little behind the data cloud? And the data cloud is to that cause I want to tap And I decided that moving to Snowflake I mean, the whole concept of, you know, and the data cloud solved it. bit about the kinds of things And at the time it was, was and you know, Snowflakes really And the data is sort of on the outskirts. and bring the best technology And I get to interview a and in the data cloud and So Felipe, I'd love to hear from you We have a phenomenal schedule that the This is Dave Volante for the cube.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Claudia M HoffPERSON

0.99+

Dave VolantePERSON

0.99+

John farrierPERSON

0.99+

Palo AltoLOCATION

0.99+

Filipe HoffaPERSON

0.99+

30QUANTITY

0.99+

DavePERSON

0.99+

FilipePERSON

0.99+

JapanLOCATION

0.99+

Felipe HoffaPERSON

0.99+

November 17thDATE

0.99+

FelipePERSON

0.99+

Bill InmanPERSON

0.99+

GoogleORGANIZATION

0.99+

2020DATE

0.99+

KenPERSON

0.99+

Trevor NoahPERSON

0.99+

10 yearsQUANTITY

0.99+

360 degreeQUANTITY

0.99+

Kent GrazianoPERSON

0.99+

two areasQUANTITY

0.99+

BenwahPERSON

0.99+

BostonLOCATION

0.99+

BillPERSON

0.99+

two monthsQUANTITY

0.99+

Ben YPERSON

0.99+

2013DATE

0.99+

TerryPERSON

0.99+

KentPERSON

0.99+

IntelORGANIZATION

0.99+

FrankPERSON

0.99+

more than 40 sessionsQUANTITY

0.99+

eight tracksQUANTITY

0.99+

AugustDATE

0.99+

FirstQUANTITY

0.98+

twoQUANTITY

0.98+

MoorePERSON

0.98+

firstQUANTITY

0.98+

one timeQUANTITY

0.98+

30 yearsQUANTITY

0.97+

last decadeDATE

0.97+

four geosQUANTITY

0.97+

SnowflakeTITLE

0.96+

nearly 30 yearsQUANTITY

0.96+

Snowflake Data Cloud Summit 2020EVENT

0.96+

SnowflakeORGANIZATION

0.96+

once a monthQUANTITY

0.96+

SnowflakeEVENT

0.96+

oneQUANTITY

0.95+

EDWORGANIZATION

0.95+

GDPRTITLE

0.94+

ChristianPERSON

0.93+

two snowflakeQUANTITY

0.93+

Sarbanes OxleyORGANIZATION

0.92+

todayDATE

0.89+

HadoopTITLE

0.89+

snowflake.comOTHER

0.88+

snowflakeEVENT

0.87+

couple of decades agoDATE

0.86+

SnowflakePERSON

0.86+

one customerQUANTITY

0.84+

less than two months agoDATE

0.83+

hadoopTITLE

0.82+

four geographiesQUANTITY

0.82+

data cloudEVENT

0.79+

KentORGANIZATION

0.79+

40 yearsQUANTITY

0.76+

a decadeQUANTITY

0.72+

snowflake dataEVENT

0.72+

CCPAORGANIZATION

0.72+

HadoopLOCATION

0.71+

KafkaORGANIZATION

0.68+

Frank Slootman Dave Vellante Cube Conversation


 

>>from the Cube Studios in Palo Alto in Boston, connecting with thought leaders all around >>the world. This is a cute conversation high, but this is Day Volonte. And as you know, we've been tracking the next generation of clouds. Sometimes we call it Cloud to two point. Frank's Lukman is here to really unpack this with me. Frank. Great to see you. Thanks for coming on. >>Yeah, you as well. They could see it >>s o obviously hot off your AIPO A lot of buzz around that. Uh, that's fine. We could we could talk about that, but I really want to talk about the future. What? Before we get off the I p o. That was something you told me when you're CEO service. Now you said, hey, we're priced to perfection, so it looks like snowflakes gonna be priced to perfection. It's a marathon, though. You You made that clear. I presume it's not any different here for you. Yeah, >>well, I think you know the service now. Journey was different in the sense that we were kind of under the underdogs, and people sort of discovered over the years the full potential of the company and I think there's stuff like they pretty much discovered a day. One. It's a little bit more, More sometimes it's nice to be an underdog. Were a bit of an over dog in this, uh, this particular scenario, but, you know, it is what it is, Andre. You know, it's all about execution delivering the results, delivering on our vision, Uh, you know, being great with our customers. And, uh, hopefully the chips will fall where they where they may. At that point, >>yeah, you're you're You're a poorly kept secret at this point, Frank. After a while, I wanted, you know, I've got some excerpts of your book that that I've been reading. And, of course, I've been following your career since the two thousands. You're off sailing. You mentioned in your book that you were kind of retired. You were done, and then you get sucked back in now. Why? I mean, are you in this for the sport? What's the story here? >>Uh, actually, that that's not a bad way of characterizing it. I think I am in that, uh, you know, for the sport, uh, you know the only way to become the best version of yourself is to be to be under the gun and, uh, you know, every single day. And that's that's certainly what we are. It sort of has its own rewards building great products, building great companies, regardless off you know what the spoils. Maybe it has its own rewards. And I It's hard for people like us to get off the field and, you know, hang it up. So here we are. >>You know, you're putting forth this vision now the data cloud, which obviously it's good marketing, but I'm really happy because I don't like the term Enterprise Data Warehouse. I don't think it reflects what you're trying to accomplish. E D. W. It's slow on Lee. A few people really know how to use it. The time value of data is gone by the time you know, your business is moving faster than the data in the D. W. And it really became a savior because of Sarbanes Oxley. That's really what it came a reporting mechanism. So I've never seen What you guys are doing is is e d w. So I want you to talk about the data cloud. I want to get into the to the vision a little bit and maybe challenge you on a couple things so our audience can better understand it. Yes. So >>the notion of a data cloud is is actually, uh, you know, type of cloud that we haven't had. I mean, data has been been fragmented and locked up in a million different places in different clouds. Different cloud regions, obviously on premise, um, And for data science teams, you know, they're trying thio drive analysis across datasets, which is incredibly hard, Which is why you know, a lot of this resorts to, you know, programming on bond things of that sort of. ITT's hardly scalable because the data is not optimized. The economics are not optimized. There's no governance model and so on. But a data cloud is actually the ability thio loosely couple and lightly Federated uh, data, regardless of where it is. So it doesn't have scale limitations or performance limitations. Uh, the way traditional data warehouses have had it. So we really have a fighting chance off really killing the silos and unlocking the bunkers and allowing the full promise of data sciences and ml On day I thio really happen. I mean, a lot of lot of the analysis that happens on data is on the single data set because it's just too damn hard, you know, to drive analysis across multiple data sets. And, you know, when we talk to our customers, they have very precise designs on what they're trying to do. They say, Look, we are trying to discover, you know, through through through deep learning You know what the patterns are that lead to transactions. You know, whether it's if you're streaming company. Maybe it's that you're signing up for a channel or you're buying a movie or whatever it is. What is the pattern you know, of data points that leads us to that desired outcome. Once you have a very accurate description of the data relationships, you know that results in that outcome, you can then search for it and scale it, you know, tens of million times over. That's what digital enterprises do, right? So in order to discover these patterns enriched the data to the point where the patterns become incredibly predictive. Uh, that's that's what snowflake is formed, right? But it requires a completely Federated Data mo because you're not gonna find a data pattern in the in the single data set per se right? So that's that's what it's all about. I mean, the outcomes of a data cloud are very, very closely related to the business outcomes that the user is seeking, right? It's not some infrastructure process. It has a very remote relationship with business outcome. This is very, very closely related. >>So it doesn't take a brain surgeon to look at the Trillion Years Club. And so I could see that I could see the big you know, trillion dollars apple $2 trillion market cap companies. They got data at the core, whereas most companies most incumbents. Yeah, it might be a bottling plant that the core, some manufacturing or some other processes they put, they put data around it in these silos. It seems like you're trying toe really? Bring that innovation and put data at the core. And you've got an architecture to do that. You talk about your multi cluster shared storage architecture. You mentioned you mentioned data sharing it. Will this, in your opinion, enable, for instance, incumbents to do what a lot of the startups were able to do with the cloud days? I mean they got access to data centers, which they they couldn't have before the cloud you're trying to do with something similar with data. >>Yeah, so So, you know, obviously there's no doubt that the cloud is a critical enabler. This wouldn't be happening. Uh, you know what? I was at the same time, the trails that have been blessed by the likes of Facebook and Google. Uh, e the reason those enterprises are so extraordinary valuable is is because of what they know. Uh, you know, through data and how they can monetize what they know through data. But that is now because that power is now becoming available, you know, to every single enterprise out there. Right, Because the data platform, the underlying cloud capabilities, we are now delivering that to anybody who wants it. Now, you still need to have strong date engineering data science capabilities. It's not like falling off a log, but fundamentally, those capabilities are now, you know, broadly accessible in the marketplace. >>So we're talking upfront about some of the differences between what you've done earlier in your career. Like I said, you're the worst kept secret, you know, Data domain. I would say it was sort of somewhat of a niche market. You you blew it up until it was very disruptive, but it was somewhat limited in what could be done. Uh, and and maybe some of that limitation, you know, wouldn't have occurred if you stay the price, uh, independent company service. Now you mop the table up because you really had no competition there, Not the case here. You you've got some of the biggest competitors in the world, so talk about that. And what gives you confidence that you can continue to dominate, >>But, you know, it's actually interesting that you bring up these companies. I mean, data. The man was a scenario where we were constrained on market and literally we were a data backup company. As you recall, we needed to move into backup software. Need to move the primary storage. While we knew it, we couldn't execute on it because it took tremendous resource is which, back in the day, it was much harder than one of this right now. So we ended up selling the company to E M. C and and now part of Dell. But way short, uh, we're left with some trauma from that experience, Uh, that, you know, why couldn't we, you know, execute on that transformation? So coming to service now, we were extremely. I'm certainly need personally, extremely attuned to the challenges that we have endured in our prior company. One of the reasons why you saw service now break out at scale at tremendous growth rights is because of what we have learned from the prior journey. We're not gonna ever get caught again in a situation where we could not sustain our markets and sustain our growth. So if service I was very much the execution model was very much a reaction to what we had encountered in the prior company. Now coming into snowflake totally different deal. Because not only is there's a large market, this is a developing market. I think you've pointed out in some of your broadcasting that this market is very much in flux on the reason is that you know, technology is now capable of doing things for for people and enterprises that they could never do before. So people are spending way mawr resource is than they ever thought possible on these new capability. So you can't think in terms of static markets and static data definitions, it means nothing. Okay, These things are so in transition right now, it's very difficult for people you know to to scope that the scale of this opportunity. >>Yeah. I wanna understand you're thinking around and, you know, I've written about the TAM, and can Snowflake grow into its valuation and the way I drew it, I said, Okay, you got data Lakes and you got Enterprise Data Warehouse. That's pretty well understood. But I called it data as a service to cover the closest analogy to your data cloud. And then even beyond that, when you start bringing in the edge and real time data, uh, talk about how you're thinking about that, Tam. And what what you have to do to participate. You have toe, you know, bring adjacent capabilities, ISAT this read data sharing that will get you there. In other words, you're not like a transaction system. You hear people talking about converge databases, you hear? Talk about real time inference at the edge that today anyway, isn't what snowflake is about. Does that vision of data sharing and the data cloud does that allow you to participate in that massive, multi $100 billion tam that that I laid out and probably others as well. >>Yeah, well, it is always difficult. Thio defined markets based on historical concept that probably not gonna apply whole lot for much longer. I mean, the way we think of it is that data is the beating heart of the digital enterprise on, uh, you know, digital enterprises today. What do you look at? People against the car door dash or so on. Um, they were built from the ground up to be digital on the prices and data Is the beating heart off their operation Data operations is their manufacturing, if you will, um, every other enterprise out there is is working very hard to become digital or part digital and is going to learn to develop data platforms like what we're talking about here to data Cloud Azaz. Well, as the expertise in terms of data engineering and data scientist to really fully become a digital enterprise, right. So, you know, we view data as driving operations off the digital enterprise. That's really what it iss right data, and it's completely data driven. And there's no people involved. People are developing and supporting the process. But in the execution, it is end to end. Data driven. Being that data is the is the signal that initiates the process is technol assess. Their there being a detective, and then they fully execute the entire machinery probe Problematic machinery, if you will, um, you know, of the processes that have been designed, for example, you know, I may fit a certain pattern. You know, that that leads to some transactional context. But I've not fully completed that pattern until I click on some Lincoln. And all of a sudden proof I have become, you know, a prime prospect system, the text that in the real time and then unleashes Oh, it's outreach and capabilities to get me to transact me. You and I are experiencing this every day. You know, when we're when we're online, you just may not fully re election. That's what's happening behind the scenes. That's really what this is all about. So and so to me, this is sort of the new online transaction processing is enter and, uh, you know, data digital. Uh, no process that is continually acquiring, analyzing and acting on data. >>Well, you've talked about the time time value of of data. It loses value over time. And to the extent that you can actually affect decisions, maybe before you lose the customer before you lose the patient even even more importantly or before you lose the battle. Uh, there's all kinds of, you know, mental models that you can apply this. So automation is a key part of that. And then again, I think a lot of people like you said, if you just try to look at historical markets, you can't really squint through those and apply them. You really have toe open up your mind and think about the new possibilities. And so I could see your your component of automation. I I see what's happening in the r P. A space and and I could see See these this massive opportunities Thio really change society, change business, your last thoughts. >>There's just there's just no scenario that I can envision where data is not completely core in central to a digital enterprise, period. >>Yeah, I think I really do think, Frank, your your your Your vision is misunderstood somewhat. I think people say Okay. Hey, we'll bet on salute men Scarpelli the team. That's great to do that. But I think this is gonna unfold in a way that people may be having predicted that maybe you guys, yourselves and your founders, you know, haven't have aren't able to predict as well. But you've got that good, strong architectural philosophy that you're pursuing and it just kind of feels right, doesn't it? >>You know, I mean, one of the 100 conversations and, uh, you know, things is the one of the reasons why we also wrote our book. You know, the rights of the data cloud is to convey to the marketplace that this is not an incremental evolution, that this is not sort of building on the past. There is a real step function here on the way to think about it is that typically enterprises and institutions will look at a platform like snowflakes from a workload context. In other words, I have this business. I have this workload. This is very much historically defined, by the way. And then they benchmark us, you know, against what they're what they're already doing on some legacy platform. And they decided, like, Yeah, this is a good fit. We're gonna put Snowflake here. Maybe there, but it's still very workload centric, which means that we are essentially perpetuating the mentality off the past. Right? We were doing it. Wanna work, load of the time We're creating the new silos and the new bunkers of data in the process. And we're really not approaching this with level of vision that the data science is really required to drive maximum benefit from data. So our arguments and this is this is not an easy arguments is to say, toc IOS on any other sea level person that wants to listen to that look, you know, just thinking about, you know, operational context and operational. Excellent. It's like we have toe have a platform that allows us unfettered access to the data that, you know, we may need to, you know, bring the analytical power to right. If you have to bring in political power to a diversity of data sets, how are we going to do that right? The data lives in, like, 500 different places. It's just not possible, right, other than with insane amounts of programming and complexity, and then we don't have the performance, and we don't have to economics, and we don't have the governance and so on. So you really want to set yourself up with a data cloud so that you can unleash your data science, uh, capabilities, your machine learning your deep learning capabilities, aan den, you really get the full throttle advantage. You know of what the technology can do if you're going to perpetuate the silo and bunkering of data by doing it won't work. Load of the time. You know, 5, 10 years from now, we're having the same conversation we've been having over the last 40 years, you know? >>Yeah. Operationalize ing your data is gonna require busting down those those silos, and it's gonna require something like the data cloud to really power that to the next decade and beyond. Frank's movement Thanks so much for coming in. The Cuban helping us do a preview here of what's to come. >>You bet, Dave. Thanks. >>All right. Thank you for watching. Everybody says Dave Volonte for the Cube will see you next time

Published Date : Oct 16 2020

SUMMARY :

And as you know, we've been tracking the next generation of clouds. Yeah, you as well. Before we get off the I p o. That was something you told me when you're CEO service. this particular scenario, but, you know, it is what it is, Andre. I wanted, you know, I've got some excerpts of your book that that I've been reading. uh, you know, for the sport, uh, you know the only way to become the best version of yourself is to it. The time value of data is gone by the time you know, your business is moving faster than the data is on the single data set because it's just too damn hard, you know, to drive analysis across And so I could see that I could see the big you know, trillion dollars apple Uh, you know, through data and how they can monetize what Uh, and and maybe some of that limitation, you know, wouldn't have occurred if you stay the price, Uh, that, you know, why couldn't we, you know, execute on and the data cloud does that allow you to participate in that massive, And all of a sudden proof I have become, you know, a prime prospect system, Uh, there's all kinds of, you know, mental models that you completely core in central to a digital enterprise, period. maybe you guys, yourselves and your founders, you know, haven't have aren't able to predict as well. You know, I mean, one of the 100 conversations and, uh, you know, things and it's gonna require something like the data cloud to really power that to the next Everybody says Dave Volonte for the Cube will see you next time

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VolontePERSON

0.99+

FrankPERSON

0.99+

Frank SlootmanPERSON

0.99+

ScarpelliPERSON

0.99+

DellORGANIZATION

0.99+

DavePERSON

0.99+

Palo AltoLOCATION

0.99+

appleORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

LeePERSON

0.99+

IOSTITLE

0.99+

AndrePERSON

0.99+

BostonLOCATION

0.99+

Cube StudiosORGANIZATION

0.99+

Trillion Years ClubORGANIZATION

0.99+

two thousandsQUANTITY

0.99+

100 conversationsQUANTITY

0.99+

trillion dollarsQUANTITY

0.98+

todayDATE

0.98+

ITTORGANIZATION

0.98+

oneQUANTITY

0.98+

$2 trillionQUANTITY

0.98+

OneQUANTITY

0.97+

a dayQUANTITY

0.97+

singleQUANTITY

0.97+

Cloud AzazORGANIZATION

0.96+

next decadeDATE

0.96+

TAMORGANIZATION

0.96+

$100 billionQUANTITY

0.96+

Enterprise Data WarehouseORGANIZATION

0.95+

Dave VellantePERSON

0.95+

500 differentQUANTITY

0.94+

two pointQUANTITY

0.93+

D. W.LOCATION

0.92+

Sarbanes OxleyPERSON

0.91+

5QUANTITY

0.9+

SnowflakeTITLE

0.87+

SnowflakeORGANIZATION

0.86+

single data setQUANTITY

0.86+

tens of million timesQUANTITY

0.85+

10 yearsQUANTITY

0.83+

E M. CORGANIZATION

0.83+

LincolnPERSON

0.82+

Day VolontePERSON

0.82+

LukmanPERSON

0.82+

CubanPERSON

0.8+

last 40 yearsDATE

0.77+

snowflakesTITLE

0.75+

single enterpriseQUANTITY

0.64+

TamORGANIZATION

0.63+

ThioPERSON

0.62+

millionQUANTITY

0.53+

single dayQUANTITY

0.49+

CubePERSON

0.36+

Kent Graziano and Felipe Hoffa V1


 

>> Narrator: From theCUBE Studios in Palo Alto in Boston, connecting with thought leaders all around the world, this is theCUBE Conversation. >> Hi everyone, this is Dave Vellante at theCUBE, and we're getting ready for the Snowflake Data Cloud Summit. four geographies, eight tracks, more than 40 sessions for this global event. starts on November 17th, where we're tracking the rise of the data cloud. You're going to hear a lot about that. Now, by now, you know the story of Snowflake or, you know what? Maybe you don't. But a new type of cloud-native database was introduced in the middle part of the last decade. And a new set of analytics workloads has emerged, that is powering a transformation within the organizations. And it's doing this by putting data at the core of businesses and organizations. For years, we marched to the cadence of Moore's law. That was the innovation engine of our industry, but now that's changed. It's data, plus machine intelligence, plus cloud. That's the new innovation cocktail for the technology industry and industries overall. And in the Data Cloud Summit, we'll hear from Snowflake executives, founders, technologists, customers, and ecosystems partners. And of course, you're going to hear from interviews on theCUBE. So let's dig in a little bit more. And to help me are two Snowflake experts. Felipe Hoffa is a data cloud advocate and Kent Graziano is a chief technical evangelist, both at Snowflake. Gents, great to see you, thanks for coming on. >> Thanks for having us on, this is great. >> Thank you. >> So guys, first, I got to congratulate you on getting to this point. You've achieved beyond escape velocity and obviously one of the most important IPOs of the year, but you got a lot of work to do and I know that. Felipe, let me start with you. Data cloud, what's a data cloud and what are we going to learn about it at the Data Cloud Summit? >> Oh, that's an excellent question. And, let me tell you a little bit about our story here. And I really, really, really admire what Kent has done. I joined Snowflake like less than two months ago and for me, it's been a huge learning experience. And I look up to Kent a lot on how we deliver the method here, how do we deliver all of that? So, I would love to hear his answer first. >> Dave: Okay, that's cool. Okay Kent, leader on. (Kent laughing) So we took it. Data cloud, that's a catchy phrase, right? But it vectors into at least two of the components of my innovation cocktail. What are the substantive aspects behind the data cloud? >> I mean, it's a new concept, right? We've been talking about infrastructure clouds and SaaS applications living in the application cloud, so data cloud is the ability to really share all that data that we've been collecting. We've spent what? How many da-- A decade or more with big data now, but have we been able to use it effectively? And that's really where the data cloud is coming in and Snowflake, in making that a more seamless, friendly, easy experience to get access to the data. I've been in data warehousing for nearly 30 years now. And our dream has always been to be able to augment an organization's analytics with data from outside their organization. And that's just been a massive pain in the neck with having to move files around and replicate the data and maybe losing track of where it came from or where it went. And the data cloud is really giving our customers the ability to do that in a much more governed way, a much more seamless way, and really make it push button to give anyone access to the data they need and have the performance to do the analytics in near real-time. It's a total game changer as you already know. And just, it's crazy what we're able to do today compared to what we could do when I started out in my career. >> Well, I'm going to come back to that 'cause I want to tap your historical perspective. But Felipe, let me ask you, so why did you join Snowflake? You're the newbie here, what attracted you? >> And finally, I'm the newbie. I used to work at Google until August. I was there for 10 years, I was a developer advocate there also for data, you might have heard about the BigQuery, I was doing a lot of that. And though as time went by, Snowflake started showing up more and more in my feeds, within my customers, in my community. And it came the time when I felt like-- Wherever you're working, once in a while you think, "I should leave this place, "I should try something new, "I should move my career forward." While at Google, I thought that so many times as anyone will do. And it was only when Snowflake showed up, like where Snowflake is going now, how Snowflake is being received by all the customers, that I saw this opportunity. And I decided that moving to Snowflake would be a step forward for me. And so far I'm pretty happy, like the timing has been incredible, but more than the timing and everything, it's really, really a great place for data. What I love first is data, sharing data, analyzing data and how Snowflake is doing it, its promising phenomena. >> So, Kent, I want to come back to you and I said, tap maybe your historical perspective here. And you said, it's always been a dream that you could do these other things, bring in external data. I would say this, that I would want to push a little bit on this because I have often said that the EDW marketplace really never lived up to its promises of 360 degree views of the customer, in real-time or near real-time analytics. And it really has been, as you kind of described it, a real challenge for a lot of organizations. When Hadoop came in, we had-- We got excited that it was going to actually finally live up to that vision and Hadoop did a lot. And don't get me wrong, I mean, the whole concept of, bring the computer data and lowering the cost and so forth. But it certainly didn't minimize complexity. And it seems like, feels like Snowflake is on the cusp of actually delivering on that promise that we've been talking about for 30 years. I wonder if you could share your perspective as an o-- Are we going to get there this time? >> Yeah. And as far as I can tell working with all of our customers, some of them are there. I mean, they thought through those struggles that you were talking about, that I saw throughout my career. And now with getting on Snowflake they're delivering customer 360, they're integrating weblogs and IOT data with structured data from their ERP systems or CRM systems, their supply chain systems and it really is coming to fruition. I mean, the industry leaders, Bill Inmon and Claudia Imhoff, they've had this vision the whole time, but the technology just wasn't able to support it and the cloud, as we said about the internet, changed everything. And then Benoit and Thierry in their vision in building the system, taking the best concepts from the Hadoop world and the data lake world and the enterprise data warehouse world, and putting it all together into this architecture, that's now Snowflake and the data cloud, solved it. I mean, it's-- The classic benefit of hindsight is 20/20, after years in the industry, they had seen these problems and said like, "How can we solve them? "Does the cloud let us solve these problems?" And the answer was, yes, but it did require writing everything from scratch and starting over with, because the architecture of the cloud just allows you to do things that you just couldn't do before. >> Yeah, I'm glad you brought up some of the originators of the data warehouse, because it really wasn't their fault, they were trying to solve a problem. It was the marketers that took it and really kind of made promises that they couldn't keep. But, the reality is when you talk to customers in the sort of the old EDW days, and this is the other thing I want to tap you guys' brains on, it was very challenging. I mean, and one customer one time referred to it as a snake swallowing a basketball. And what he meant by that is, every time there's a change, or Sarbanes-Oxley comes and we have to ingest all this new data. It's like aargh! It's just everything slows down to a grinding halt. Every time Intel came out with a new microprocessor they would go out and grab a new server as fast as they possibly could, he called it chasing the chips. And it was this endless cycle of pain. And so, the originators of the data warehouse, they didn't have the compute power, they didn't have the cloud. And so-- And of course they didn't have like 30, 40 years of pain to draw upon. But I wonder if you could maybe talk a little bit about the kinds of things that can be done now that we haven't been able to do here tofore. >> Well, yeah. I remember early on having a conversation with Bill about this idea of near real-time data warehousing and saying, "Is this real? "Is this something really people need?" And at the time, it was a couple of decades ago, he said, "No, to them, they just want to load their data "sooner than once a month." That was the goal. And they-- That was going to be near real-time for them. And, but now I'm seeing it with our customers. It's like, now we can do it. With things like the Kafka technology and Snowpipe in Snowflake, that people are able to get that refresh way faster and have near real-time analytics access to that data in a much more timely manner. And so it really is coming true. And the compute power that's there, as you said, we've now got this compute power in the cloud that we never dreamed of. I mean, you would think of only certain, very large, massive global companies or governments could afford supercomputers. And that's what it would have taken. And now we've got nearly the power of a super computer in our mobile device that we all carry around with us. So being able to harness all of that now in the cloud, is really opening up opportunities to do things with data and access data in a way that, again, really, we just kind of dreamed of before. Its like, we can democratize data when we get to this point. And I think that's where we are, we're at that inflection point, where now it's possible to do it. So the challenge on organizations is going to be how do we do it effectively? How do we do it with agility? And how do we do it in a governed manner? You mentioned Sarbanes-Oxley, GDPR, CCPA, all of those are out there. And so we have all of that as well. And so that's where we're going to get into it, ride us into the governance and being able to do that in a very quick, flexible, extensible manner. And Snowflakes really letting people do it now. >> Well, yeah. And again, we've been talking about Hadoop, and again, for all my fond thoughts of that era, and it's not like Hadoop is gone, but there was a lot of excitement around it, but governance was a huge problem. And it was kind of a bolt on. And now, Felipe I got to ask you, when you think about a company like Google, your former employer, data is at the core of their business. And so many companies, the data is not at the core of their business, something else is, it's a process or a manufacturing facility or whatever it is. And the data is sort of on the outskirts. We often talk about in stovepipes. And so we're now seeing organizations really, put data at the core of their... And it becomes central to their DNA. I'm curious as to your thoughts on that. And also, if you've got a lot of experience with developers, is there a developer angle here in this new data world? >> Oh, for sure. I mean, I love seeing every-- Like throughout my career at Google and my two months here, I'm talking to so many companies, that you never thought before, like these are database companies. But the ones that keep growing, the ones that keep moving to the next stage of their development is because they are focusing on data, they are adopting the processes, They are learning from it. And, me per-- I focus a lot on developers, so I mean, when I started this career as an advocate, first, I was a software engineer. And my work so far, has been... (mumbles) I really love talking to the engineers on the other companies, like... Maybe I'm not the one solving the business problem, but at the end of the day, when these companies have a business problem through out the world, they want to have data. There are other engineers that are scientists like me that are... That want to work for the company and bring the best technology to solve the problems. Yeah, for example, there's so much where data can help. If, as we evolve the systems for the company and also for us for understanding these systems, things like observability. And recently, there was a big company, a big launch on observability, on the company names of Cyberroam, where they are running all of their data warehousing needs and all of their data needs on Snowflake. Just because running these massive systems and being able to see how they're working, generates a lot of data. And then how do you manage it? How do you analyze it? Snowflake is ready there to help and support the two areas. >> It's interesting, my business partner, John Furrier, co-host of theCUBE, he said, gosh, I would say the middle of the last decade, maybe even around the time, 2013, when Snowflake was just coming out. He said... He predicted that data would be the new development kit. And, it's really at the center of a lot of the data life cycle, the-- What I call the data pipelines, I know people use that term differently. But, I'm very excited about the Data Cloud Summit and what we're going to learn there. And I get to interview a lot of really cool people. And so I appreciate you guys coming on. But Kent, who should attend the Data Cloud Summit? I mean, what are the-- What should they expect to learn? >> Well, as you said earlier Dave, there's so many tracks and there's really kind of something for everyone. So we've got a track on unlocking the value of the data cloud, which is really going to speak to the business leaders, as to what that vision is, what can we do from an organizational perspective with the data cloud to get them value from the data to move our businesses forward? But we've also got for the technicians, migrating to Snowflake. Training sessions on how to do the migration and modernizing your data lake, data science. How to do analytics with, and data science in Snowflake and in the data cloud. And even down to building apps, for the developers and building data products. So, we've got stuff for developers, we've got stuff for data scientists, we've got stuff for the data architects like myself and the data engineers, on how to build all of this out. And then there's going to be some industry solutions spotlights as well. So we can talk about different verticals, folks in FinTech and in healthcare, there's going to be stuff for them. And then for our data superheroes, we have a hallway track where we're going to get talks from the folks that are in our data superheroes, which is really our community advocacy program. So these are folks that are out there in the trenches using Snowflake, delivering value at their organizations. And they're going to talk down and dirty of how did they make this stuff happen? So there's going to be just really, something for everyone. Fireside chats with our executives, of course, something I'm really looking forward to myself. It's always fun to hear from Frank and Christian and Benoit, about what's the next big thing, what are we doing now? Where are we going with all of this? And then there is going to be some awards. We'll be giving out our Data Driver Awards for our most innovative customers. So there's going to be a lot for everybody to consume and enjoy and learn about this new space of the data cloud. >> Well, thank you for that Kent and I'll second that, and there's going to be a lot for everybody. If you're an existing Snowflake customer, there's going to be plenty of two of one content, where we can get in to the how tos and the best practice. If you're really not that familiar with Snowflake or you're not a customer, there's a lot of one-on-one content going on. If you're an investor and you want to figure out, "Okay, what is this vision? "And can, will this company grow into its massive valuation? "And how are they going to do that?" I think you're going to hear about the data cloud and really try to get a perspective and you can make your own judgment as to whether or not you think that it's going to be as large a market as many people think. So Felipe, I'd love to hear from you what people can expect at the Data Cloud Summit. >> Totally. So I would love to plus one to every one that Kent said, we have a phenomenal schedule that day, the executives will be there. But I really wanted to especially highlight the session I'm preparing with Trevor Noah. I'm sure you must have heard of him. And we are having him at the Data Cloud Summit, and we are going to have a session. We are going to talk about data. We are preparing a session that's all about how people that love data, that people that want to make that actionable, how can they bring storytelling and make it have more impact as he has well learned to do through his life. >> That's awesome. So, yeah, Trevor Noah, we're not just going to totally geek out here. We're going to have some great entertainment as well. So I want you to go to snowflake.com and click on Data Cloud Summit 2020. There's four geos. It starts on November 17th and then runs through the week and then the following week in Japan. So, check that out, we'll see you there. This is Dave Vellante for theCUBE. Thanks for watching. (upbeat music)

Published Date : Oct 15 2020

SUMMARY :

leaders all around the world, And in the Data Cloud Summit, this is great. and obviously one of the most And I look up to Kent a lot aspects behind the data cloud? And the data cloud is You're the newbie here, And I decided that moving to Snowflake and lowering the cost and so forth. and the cloud, as we And so, the originators And at the time, it was And the data is sort of on the outskirts. and bring the best technology And I get to interview a and in the data cloud. and there's going to the Data Cloud Summit, We're going to have some

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

DavePERSON

0.99+

Claudia ImhoffPERSON

0.99+

Felipe HoffaPERSON

0.99+

John FurrierPERSON

0.99+

30QUANTITY

0.99+

GoogleORGANIZATION

0.99+

FelipePERSON

0.99+

JapanLOCATION

0.99+

November 17thDATE

0.99+

Bill InmonPERSON

0.99+

Palo AltoLOCATION

0.99+

Trevor NoahPERSON

0.99+

Kent GrazianoPERSON

0.99+

10 yearsQUANTITY

0.99+

360 degreeQUANTITY

0.99+

KentPERSON

0.99+

BillPERSON

0.99+

twoQUANTITY

0.99+

two monthsQUANTITY

0.99+

eight tracksQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

Snowflake Data Cloud SummitEVENT

0.99+

FrankPERSON

0.99+

bothQUANTITY

0.99+

IntelORGANIZATION

0.99+

two areasQUANTITY

0.99+

HadoopTITLE

0.99+

more than 40 sessionsQUANTITY

0.99+

Data Cloud SummitEVENT

0.99+

AugustDATE

0.98+

30 yearsQUANTITY

0.98+

theCUBEORGANIZATION

0.98+

one timeQUANTITY

0.98+

ThierryPERSON

0.98+

Data Cloud Summit 2020EVENT

0.98+

Cloud SummitEVENT

0.98+

MoorePERSON

0.97+

BostonLOCATION

0.97+

last decadeDATE

0.97+

firstQUANTITY

0.97+

four geographiesQUANTITY

0.97+

2013DATE

0.97+

theCUBE StudiosORGANIZATION

0.96+

SnowflakeLOCATION

0.96+

CyberroamORGANIZATION

0.96+

BenoitPERSON

0.96+

todayDATE

0.96+

nearly 30 yearsQUANTITY

0.95+

ChristianPERSON

0.95+

one customerQUANTITY

0.95+

40 yearsQUANTITY

0.94+

snowflake.comOTHER

0.94+

once a monthQUANTITY

0.92+

four geosQUANTITY

0.92+

HadoopLOCATION

0.92+

SnowflakeEVENT

0.92+

GDPRTITLE

0.91+

couple of decades agoDATE

0.9+

EDWORGANIZATION

0.89+

20/20DATE

0.87+

A decadeQUANTITY

0.87+

oneQUANTITY

0.87+

Data Driver AwardsEVENT

0.87+

Frank & Dave CConvo V1


 

>> Narrator: From "theCUBE" studios in Palo Alto, in Boston, connecting with thought leaders all around the world, this is "theCUBE" conversation. >> Hi everybody, this is Dave Vellante. and as you know, we've been tracking the next generation of cloud. Sometimes we call it cloud 2.0, Frank Slootman is here to really unpack this with me. Frank, great to see you. Thanks for coming on. >> Yeah, you as well Dave, good to see you. >> Yeah so, obviously hot off your IPO, a lot of buzz around that, that's fine. We could talk about that, but I really want to talk about the future. What, before we get off the IPO though, was something you told me when you were CEO of ServiceNow. You said, "Hey, we're priced to perfection." So it looks like Snowflake is going to be priced to perfection, it's a marathon though. You made that clear. I presume it's not any different here for you. >> Well, I think, you know, the ServiceNow journey was different in the sense that we were kind of underdogs and people sort of discovered over the years, the full potential of the company. And, I think with Snowflake, they pretty much just discovered it day one (laughs). It's a little bit more, sometimes it's nice to be an underdog or a bit of an overdog in this particular scenario, but yeah, it is what it is. And, it's all about execution, delivering the results, being great with our customers and, hopefully the (indistinct) where they may at that point. >> Yeah, you're a poorly kept secret at this point, Frank, after a while. I've got some excerpts of your book that I've been reading and of course I've been following your career since the 2000's. You're off sailing. You mentioned in your book that you were kind of retired, you were done, and then you got sucked back in. Now, why? Are you in this for the sport? What's the story here? >> Actually, that's not a bad way of characterizing it. I think I am in it, for the sport, the only way to become the best version of yourself is to be under the gun, every single day. And that's certainly what we are. It sort of has its own rewards, building great products, building great companies, regardless of what the spoils may be, it has its own reward. It's hard for people like us to get off the field and hang it up, so here we are. >> You're putting forth this vision now, the data cloud, which obviously it's good marketing, but I'm really happy because I don't like the term enterprise data warehouse. I don't think it reflects, what you're trying to accomplish. EDW, it's slow, only a few people really know how to use it, the time value of data is gone by the time, your business is moving faster than the data in EDW. And it really became, as the savior because of Sarbanes-Oxley, that's really, what became of a reporting mechanism. So I have never seen what you guys are doing as EDW. So I want you to talk about the data cloud, I want to get into the vision a little bit and maybe challenge you on a couple things so our audience can better understand it. >> Yeah, so the notion of a data cloud is actually a type of cloud that we haven't had. Data has been fragmented and locked up in a million different places, in different clouds, different cloud regions, obviously, on premise. And for data science teams, they're trying to drive analysis across datasets, which is incredibly hard, which is why a lot of this resorts to programming and things of that sort. It's hardly scalable because the data is not optimized, the economics are not optimized, there's no governance model and so on. But the data cloud is actually the ability to loosely couple and lightly federate data, regardless of where it is, so it doesn't have scale limitations or performance limitations the way traditional data warehouses have had it. So we really have a fighting chance of really killing the silos and unlocking the bunkers and allowing the full promise of data sciences and ML and AI to really happen. A` lot of the analysis that happens on data is on a single dataset because it's just too damn hard to drive analysis across multiple datasets. When we talk to our customers, they have very precise designs on what they're trying to do. They say, "Look, we are trying to discover through deep learning what the patterns are that lead to transactions, whether it's... If you're a streaming company, maybe it's that you're signing up for a channel or you're buying a movie or whatever it is. What is the pattern of datapoints that leads us to that desired outcome?" Once you have a very accurate description of the data relationships that result in that outcome, you can then search for it and scale it tens of million times over. That's what digital enterprises do, right? So in order to discover these patterns, enrich the data to the point where the patterns become incredibly predictive, that's what Snowflake is for, right? But it requires a completely federated data model because you're not going to find a data pattern in a single dataset, per se, right? So that's what it's all about. The outcomes of a data cloud are very, very closely related to the business outcomes that the user is seeking, right? It's not some infrastructure process. that has a very remote relationship with business outcome. This is very, very closely related. >> So it doesn't take a brain surgeon to look at the trillionaires' club. (chuckles) So I can see that. I can see the big trillion dollars, Apple, $2 trillion market cap companies, they get data at the core. Whereas most companies, most incumbents, it might be a bottling plant at the core or some manufacturing or some of the process, and they put data rounded in these silos. It seems like you're trying to really bring that innovation and put data at the core and you've got an architecture to do that. You're talking about your multi cluster shared storage architecture. You mentioned data sharing. Will this, in your opinion enable, for instance, incumbents to do what a lot of the startups were able to do with the cloud days. hey got access to data centers which they couldn't have before the cloud. Are you trying to do something similar with data? >> Yeah, so obviously there's no doubt that the cloud is a critical enabler. This wouldn't be happening without it. At the same time, the trails that have been blazed by Alexa, Facebook and Google. The reason that those enterprises are so extraordinarily valuable is because of what they know through data and how they can monetize what they know through data. But that power is now becoming available to every single enterprise out there, right. Because the data platforms and the underlying cloud capabilities, we are now delivering that to anybody who wants it. Now you still need to have strong data engineering, data science capabilities. It's not like falling off of a log, but fundamentally those capabilities are now broadly accessible in the marketplace. >> So we talking up front about some of the differences between what you've done early in your career, like I said, you're the worst kept secret, Data Domain I would say it was somewhat of a niche market. You blew it up until it was very disruptive, but it was somewhat limited in what could be done. And maybe some of that limitation, you know, wouldn't have occurred if you stayed an independent company. ServiceNow, you mopped the table up 'cause you really had no competition there. Not the case here. You've got some of the biggest competitors in the world. So talk about that and what gives you confidence that you can continue to dominate? >> It's actually interesting that you bring up these companies. Data Domain, it was a scenario where we were constrained on market and we were a data backup company as you recall, we needed to move into backup software, needed to move into primary storage. While we knew it, we couldn't execute on it because it took tremendous resources which, back in the day, it was much harder than what it is right now. So we ended up selling the company to EMC and now part of Dell, but we're left with some trauma from that experience in the sense that, why couldn't we execute on that transformation? So coming to ServiceNow, we were extremely, and certainly me personally, extremely attuned to the challenges that we had endured in our prior company, and one of the reasons why you saw ServiceNow break out at scale, at tremendous growth rates is because of what we learned from the prior journey. We were not going to ever get caught again in the situation where we could not sustain our markets and sustain our growth. So ServiceNow is very much, the execution model, very much a reaction to what we had encountered in the prior company. Now coming into Snowflake a totally different deal because not only is this a large market this is a developing market. I think you've pointed out in some of your broadcasting, that this market is very much influx. And the reason is that technology is now capable of doing things for people and enterprises that they could never do before. So people are spending way more resources than they ever thought possible on these new capabilities. So you can't think in terms of static markets and static data definitions, it means nothing. Okay, these things are so in transition right now. It's very difficult for people to scope the scale of this opportunity. >> Yeah, I want to understand your thinking around and, you know, I've written about the TAM and can Snowflake grow into it's valuation and the way I drew it, I said, okay, I've got data lakes and you've got an enterprise data warehouse, that's pretty well understood but I called it data as a service company the closest analogy to your data cloud. And then even beyond that when you start bringing in the Edge and real time data. Talk about how you're thinking about that TAM what you have to do to participate. Do you have to bring adjacent capabilities? Or is it this read data sharing that will get you there? In other words, you're not like a transaction system. You hear people talking about converged databases. You're going to talk about real time inference at the Edge that today anyway, isn't what Snowflake is about. Does that vision of data sharing in the data cloud, does that allow you to participate in that massive multi hundred billion dollar TAM that I laid out and probably others as well? >> Yeah, well, it's always difficult to define markets based on historical concept that probably not going to apply a whole lot or for much longer. I mean the way we think of it is that data is the beating heart of the digital enterprise and digital enterprises today, what are you looking at people like the car door dash or so on. They were built from the ground up to be digital enterprises. And data is the beating heart of their operation, data operations is their manufacturing if you will. Every other enterprise out there is working very hard to become digital or part digital and is going to learn to develop a data platform like what we're talking about here to data cloud as well as the expertise in terms of data engineering and data sciences to really fully become a digital enterprise, right? So we view data as driving operations, all the all the digital enterprise, that's really what it is, right? And it's completely data driven end-to-end. There's no people involved and the people are developing and supporting the process but in the execution, it is end-to-end data driven. Meaning that data is the signal that initiates the process he's taking, but as they're, as they're being detected, and then they fully execute the entire machinery, programmatic machinery if you will, all of the processes have been designed. Now for example, I may fit a certain pattern, that leads to some transactional law context, but that's not fully completed, that pattern until I click on some link and all of a sudden, poof, I have become a prime prospect. System detects that in the real time and then unleashes all its outreach and capabilities to get me to transact. You and I are experiencing this every day. When we're, when we're online, you just may not fully realize (laughs) that that's what's happening behind the scenes. That's really what this is all about. So to me, this is sort of the new online transaction processing is an end to end data digital process that is continually acquiring, analyzing and acting on data. >> Well, you've talked about the time, time value of, of data. It loses value over time. And to the extent that you can actually affect decisions, maybe prior, before you lose the customer, before you lose the patient, even even more importantly, or before you lose the battle. There's all kinds of mental models that you can apply this. So automation is a key part of that and then again, I think a lot of people, like you said, if you just try to look at historical markets, you can't really squint through those and apply them. You really have to open up your mind and think about the new possibilities. And so I could see >> Exactly. >> Your component of automation. I see what's happening in the RPA space, and I could see these just massive opportunities to really change society, change business. Your last thoughts. >> While there's just no scenario that I can envision where data is not completely core and central to a digital enterprise period. >> Yeah, I think I really do think Frank, your vision is misunderstood somewhat. I think people say, "Okay hey, we'll bet on Slootman, "Scott Pelley, the team." That's great to do that, but I think this is going to unfold in a way that people maybe haven't predicted and maybe you guys yourselves and your founders you know haven't, aren't able to predict as well, but you've got that good, strong architectural philosophy that you're pursuing and it just kind of feels right, doesn't it? >> One of the harder conversations and the this is one of the reasons why we also wrote our book "The Rise of the Data Cloud" is to convey to the marketplace that this is not an incremental evolution. It is just not sort of building on the past. There is a real step function here. And the way to think about it is that typically enterprises and institutions will look at a platform like Snowflake from a workload context. In other words, I have this business, I have this workload, which is very much historically defined by the way, and then they benchmark us against what they're already doing on some legacy platform and they decide, "Yeah, this is a good fit, we're going to put Snowflake here, maybe there." But, it's still very workload centric which means that we are, essentially, perpetuating the mentality of the past. We were doing it one workload at a time, we're creating the new silos and the new bunkers of data in the process. And we're really not approaching this with the level of vision that the data scientists really require to drive maximum benefit from data. So our argument, and this is not an easy argument, is to say to CIOs and any other C-level person that wants to listen and say, "Look, just thinking about operational context and operational excellence, it's like we have to have a platform that allows use unfettered access to the data that we may need to bring the analytical power to." If you have to bring analytical power to a diversity of datasets, how are we going to do that? The data lives in 500 different places, it's just not possible, other than with insane amounts of programming and complexity and then we don't have the performance and we don't have the economics and we don't have the governance and so on. So you really want to set yourself up with a data cloud so that you can unleash your data science capabilities, your machine learning, your deep learning capabilities and then really get the full throttle advantage of what the technology can do. If you going to perpetuate the silo-ing and bunkering of data by doing it one workload at a time, five, 10 years from now, we're having the same conversations we've been having over the last 40 years. >> Yeah, operationalizing your data is going to require busting down those silos and it's going to require something like the data cloud to really power that to the next decade and beyond. Frank Slootman, thanks so much for coming to "theCUBE" and helping us do a preview here of what's to come. >> You bet Dave, thanks. >> All right, thank you for watching everybody. This is Dave Vellante from the "theCUBE". We'll see you next time.

Published Date : Oct 14 2020

SUMMARY :

leaders all around the world, and as you know, we've been tracking Yeah, you as well talk about the future. the full potential of the company. that you were kind of retired, the only way to become the is gone by the time, enrich the data to the and put data at the core no doubt that the cloud is that you can continue to dominate? and one of the reasons why the closest analogy to your data cloud. System detects that in the real time And to the extent that you to really change society, change business. to a digital enterprise period. but I think this is going to that the data scientists and it's going to require This is Dave Vellante from the "theCUBE".

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
FrankPERSON

0.99+

Frank SlootmanPERSON

0.99+

Scott PelleyPERSON

0.99+

DavePERSON

0.99+

AppleORGANIZATION

0.99+

Dave VellantePERSON

0.99+

Palo AltoLOCATION

0.99+

EMCORGANIZATION

0.99+

SlootmanPERSON

0.99+

DellORGANIZATION

0.99+

$2 trillionQUANTITY

0.99+

BostonLOCATION

0.99+

fiveQUANTITY

0.99+

GoogleORGANIZATION

0.99+

oneQUANTITY

0.99+

The Rise of the Data CloudTITLE

0.99+

ServiceNowORGANIZATION

0.99+

FacebookORGANIZATION

0.98+

tens of million timesQUANTITY

0.98+

todayDATE

0.98+

SnowflakeTITLE

0.97+

Dave CConvoPERSON

0.97+

next decadeDATE

0.96+

AlexaTITLE

0.96+

EDWORGANIZATION

0.95+

500 different placesQUANTITY

0.95+

TAMORGANIZATION

0.95+

10 yearsQUANTITY

0.95+

SnowflakeEVENT

0.94+

2000'sDATE

0.94+

multi hundred billion dollarQUANTITY

0.93+

theCUBEORGANIZATION

0.92+

single datasetQUANTITY

0.92+

OneQUANTITY

0.82+

SarbanesORGANIZATION

0.82+

single enterpriseQUANTITY

0.75+

big trillion dollarsQUANTITY

0.74+

single dayQUANTITY

0.71+

SnowflakeORGANIZATION

0.65+

millionQUANTITY

0.64+

-OxleyPERSON

0.64+

last 40DATE

0.55+

yearsQUANTITY

0.49+

Breaking Analysis: Competition Heats up for Cloud Analytic Databases


 

(enlightening music) >> From theCUBE's studios in Palo Alto and Boston, connecting with thought leaders all around the world, this is a CUBE conversation. >> As we've been reporting, there's a new class of workloads emerging in the cloud. Early cloud was all about IaaS, spinning up storage, compute, and networking infrastructure to support startups, SaaS, easy experimentation, dev test, and increasingly moving business workloads into the cloud. Modern cloud workloads are combining data. They're infusing machine intelligence into application's AI. They're simplifying analytics and scaling with the cloud to deliver business insights in near real time. And at the center of this mega trend is a new class of data stores and analytic databases, what some called data warehouses, a term that I think is outdated really for today's speed of doing business. Welcome to this week's Wikibon CUBE Insights, powered by ETR. In this breaking analysis, we update our view of the emerging cloud native analytic database market. Today, we want to do three things. First, we'll update you on the basics of this market, what you really need to know in the space. The next thing we're going to do, take a look into the competitive environment, and as always, we'll dig into the ETR spending data to see which companies have the momentum in the market, and maybe, ahead of some of the others. Finally, we're going to close with some thoughts on how the competitive landscape is likely to evolve. And we want to answer the question will the cloud giants overwhelm the upstarts, or will the specialists continue to thrive? Let's take a look at some of the basics of this market. We're seeing the evolution of the enterprise data warehouse market space. It's an area that has been critical to supporting reporting and governance requirements for companies, especially post Sarbanes-Oxley, right? However, historically, as I've said many times, EDW has failed to deliver on its promises of a 360-degree view of the business and real-time customer insights. Classic enterprise data warehouses are too cumbersome, they're too complicated, they're too slow, and don't keep pace with the speed of the business. Now, EDW is about a $20 billion market, but the analytic database opportunity in the cloud, we think is much larger, why is that? It's because cloud computing unlocks the ability to rapidly combine multiple data sources, bring data science tooling into the mix, very quickly analyze data, and deliver insights to the business. More importantly, even more importantly, allow a line of business pros to access data in a self service mode. It's a new paradigm that uses the notion of DevOps as applied to the data pipeline, agile data or what we sometimes called DataOps. This is a highly competitive marketplace. In the early part of last decade, you saw Google bring BigQuery to market, Snowflake was founded, AWS did a one-time license deal to acquire the IP to ParAccel, an MPP database, on which it built Redshift. In the latter part of the decade, Microsoft threw his hat in the ring with SQL DW, which Microsoft has now evolved into Azure Synapse. They did so at the Build conference, a few weeks ago. There are other players as well like IBM. So you can see, there's a lot at stake here. The cloud vendors want your data, because they understand this is one of the key ingredients of the next decade of innovation. No longer is Moore's Law, the mainspring of growth. We've said this many times. Rather today, it's data driven, and AI to push insights and scale with the cloud. Here's the interesting dynamic that is emerging in the space. Snowflake is a cloud specialist in this field, having raised more than a billion dollars in venture, a billion four, a billion five. And it's up against the big cloud players, who are moving fast and often stealing moves from Snowflake and driving customers to their respective platforms. Here's an example that we reported on at last year's re:Invent. It's an article by Tony Baer. He wrote this on ZDNet talking about how AWS RA3 separates compute from storage, and of course, this was a founding architectural principle for Snowflake. Here's another example from the information. They were reporting on Microsoft here turning up the heat on Snowflake. And you can see the highlighted text, where the author talks about Microsoft trying to divert customers to its database. So you got this weird dynamic going on. Snowflake doesn't run on-prem, it only runs in the cloud. Runs on AWS, runs on Azure, runs on GCP. The cloud players again, they all want your data to go into their database. So they want you to put their data into their respective platforms. At the same time, they need SaaS ISVs to run in the cloud because it sells infrastructure services. So, is Snowflake, are they going to pivot to run on-prem to try to differentiate from the cloud giants? I asked Frank Slootman, Snowflake's CEO, about the on-prem opportunity, and his perspective earlier this year. Let's listen to what he said. >> Okay, we're not doing this endless hedging that people have done for 20 years, sort of keeping a leg in both worlds. Forget it, this will only work in the public cloud because this is how the utility model works, right? I think everybody is coming to this realization, right? I mean the excuses are running out at this point. We think that it'll, people will come to the public cloud a lot sooner than we will ever come to the private cloud. It's not that we can't run a private cloud, it just diminishes the potential and the value that we that we bring. >> Okay, so pretty definitive statements by Slootman. Now, the question I want to pose today is can Snowflake compete, given the conventional wisdom that we saw in the media articles that the cloud players are going to hurt Snowflake in this market. And if so, how will they compete? Well, let's see what the customers are saying and bring in some ETR survey data. This chart shows two of our favorite metrics from the ETR data set. That is Net Score, which is on the y-axis. Net Score, remember is a measure of spending momentum and market share, which is on the x-axis. Market share is a measure of pervasiveness in the data set. And what we show here are some of the key players in the EDW and cloud native analytic database market. I'll make a couple of points, and we'll dig into this a little bit further. First thing I want to share is you can see from this data, this is the April ETR survey, which was taken at the height of the US lockdown for the pandemic. The survey captured respondents from more than 1,200 CIOs and IT buyers, asking about their spending intentions for analytic databases for the companies that we show here on this kind of x-y chart. So the higher the company is on the vertical axis, the stronger the spending momentum relative to last year, and you could see Snowflake has a 77% Net Score. It leaves all players with AWS Redshift showing very strong, as well. Now in the box in the lower right, you see a chart. Those are the exact Net Scores for all the vendors in the Shared N. A Shared N is a number of citations for that vendor within the N of the 1,269. So you can see the N's are quite large, certainly large enough to feel comfortable with some of the conclusions that we're going to make today. Microsoft, they have a huge footprint. And they somewhat skew the data with its very high market share due to its volume. And you could see where Google sits, it's at good momentum, not as much presence in the marketplace. We've also added a couple of on-prem vendors, Teradata and Oracle primarily on-prem, just for context. They're two companies that compete, they obviously have some cloud offerings, but again, most of their base is on-prem. So what I want to do now is drill into this a little bit more by looking at Snowflake within the individual clouds. So let's look at Snowflake inside of AWS. That's what this next chart shows. So it's customer spending momentum Net Score inside of AWS accounts. And we cut the data to isolate those ETR survey respondents running AWS, so there's an N there of 672 that you can see. The bars show the Net Score granularity for Snowflake and Amazon Redshift. Now, note that we show 96 Shared N responses for Snowflake and 213 for Redshift within the overall N of 672 AWS accounts. The colors show 2020 spending intentions relative to 2019. So let's read left to right here. The replacements are red. And then, the bright red, then, you see spending less by 6% or more, that's the pinkish, and then, flat spending, the gray, increasing spending by more than 6%, that's the forest green, and then, adding to the platform new, that's the lime green. Now, remember Net Score is derived by subtracting the reds from the greens. And you can see that Snowflake has more spending momentum in the AWS cloud than Amazon Redshift, by a small margin, but look at, 80% of the AWS accounts plan to spend more on Snowflake with 35%, they're adding new. Very strong, 76% of AWS customers plan to spend more in 2020 relative to 2019 on Redshift with only 12% adding the platform new. But nonetheless, both are very, very strong, and you can see here, the key point is minimal red and pink, but not a lot of people leaving, not a lot of people spending less. It's going to be critical to see in the June ETR survey, which is in the field this month, if Snowflake is able to hold on to these new accounts that it's gained in the last couple of months. Now, let's look at how Snowflake is doing inside of Azure and compare it to Microsoft. So here's the data from the ETR survey, same view of the data here except we isolate on Azure accounts. The N there is 677 Azure accounts. And we show Snowflake and Microsoft cuts for analytic databases with 83 and 393 Shared N responses respectively. So again, enough I feel to draw some conclusions from this data. Now, note the Net Scores. Snowflake again, winning with 78% versus 51% from Microsoft. 51% is strong but 78% is there's a meaningful lead for Snowflake within the Microsoft base, very interesting. And once again, you see massive new ads, 41% for Snowflake, whereas Microsoft's Net Score is being powered really by growth from existing customers, that forest green. And again, very little red for both companies. So super positive there. Okay, let's take a look now at how Snowflake's doing inside of Google accounts, GCP, Google Cloud Platform. So here's the ETR data, same view of that data, but now, we isolate on GCP accounts. There are fewer, 298 running, then, you got those running Snowflake and Google Analytic databases, largely BigQuery, but could be some others in there but the Snowflake Shared N is 49, it's smaller than on the other clouds, because the company just announced support for GCP, just about a year ago. I think it was last June, but still large enough to draw conclusions from the data. I feel pretty comfortable with that. We're not slicing and dicing it too finely. And you could see Google Shared N at 147. Look at the story. I sound like a broken record. Snowflake is again winning by a meaningful margin if you measure this Net Score or spending momentum. So 77.6% Net Score versus Google at 54%, with Snowflake at 80% in the green. Both companies, very little red. So this is pretty impressive. Snowflake has greater spending momentum than the captive cloud providers in all three of the big US-based clouds. So the big question is can Snowflake hold serve, and continue to grow, and how are they going to to be able to do that? Look, as I said before, this is a very competitive market. We reported that how Snowflake is taking share from some of the legacy on-prem data warehouse players like Teradata and IBM, and from what our data suggests, Lumen and Oracle too. I've reported how IBM is stretched thin on its research and development budget, spends about $6 billion a year, but it's got to spend it across a lot of different lines. Oracle's got more targeted spending R&D. They can target more toward database and direct more of its free cash flow to database than IBM can. But Amazon, and Microsoft, and Google, they don't have that problem. They spend a ton of dough on R&D. And here's an example of the challenge that Snowflake faces. Take a look at this partial list that I drew together of recent innovations. And we show here a set of features that Snowflake has launched in 2020, and AWS since re:Invent last year. I don't have time to go into these, but we do know this that AWS is no slouch at adding features. Amazon, as a company, spends two x more on research and development than Snowflake is worth as a company. So why do I like Snowflake's chances. Well, there are several reasons. First, every dime that Snowflake spends on R&D, go-to market, and ecosystem, goes into making its databases better for its customers. Now, I asked Frank Slootman in the middle of the lockdown how he was allocating precious capital during the pandemic. Let's listen to his response. I've said, there's no layoffs on our radar, number one. Number two, we are hiring. And number three is, we have a higher level of scrutiny on the hires that we're making. And I am very transparent. In other words, I tell people, "Look, I prioritize the roles that are closest "to the drivetrain of the business." Right, it's kind of common sense. But I wanted to make sure that this is how we're thinking about this. There are some roles that are more postponable than others. I'm hiring in engineering, without any reservation because that is the long term, strategic interest of the company. >> But you know, that's only part of the story. And so I want to spend a moment here on some other differentiation, which is multi-cloud. Now, as many of you know, I've been sort of cynical of multi-cloud up until recently. I've said that multi-cloud is a symptom, more of a symptom of multi-vendor and largely, a bunch of vendor marketing hooey today. But that's beginning to change. I see multi-cloud as increasingly viable and important to organizations, not only because CIOs are being asked to clean up the crime scene, as I've often joked, but also because it's increasingly becoming a strategy, right cloud for the right workload. So first, let me reiterate what I said at the top. New workloads are emerging in the cloud, real-time AI, insights extraction, and real-time inferencing is going to be a competitive differentiator. It's all about the data. The new innovation cocktail stems from machine intelligence applied to that data with data science tooling and simplified interfaces that enable scaling with the cloud. You got to have simplicity if you're going to scale and cloud is the best way to scale. It's really the only way to scale globally. So as such, we see cross-cloud exploitation is a real differentiator for Snowflake and others that build high quality cloud native capabilities from multiple clouds, and I want to spend a minute on this topic generally and talk about what it means for Snowflake specifically. Now, we've been pounding the table lately saying that building capabilities natively for the cloud versus putting a wrapper around your stack and making it run in the cloud is key. It's a big difference, why is this? Because cloud native means taking advantage of the primitive capabilities within respective clouds to create the highest performance, the lowest latency, the most efficient services, for that cloud, and the most secure, really exploiting that cloud. And this is enabled only by natively building in the cloud, and that's why Slootman is so dogmatic on this issue. Multi-cloud can be a differentiator for Snowflake. We can think about data lives everywhere. And you want to keep data, where it lives ideally, you don't want to have to move it, whether it's on AWS, Azure, whatever cloud is holding that data. If the answer to your query requires tapping data that lives in multiple clouds across a data network, and the app needs fast answers, then, you need low latency access to that data. So here's what I think. I think Snowflake's game is to automate by extracting, abstracting, sorry, the complexity around the data location, of course, latency is a part of that, metadata, bandwidth concerns, the time to get to query and answers. All those factors that build complexity into the data pipeline and then optimizing that to get insights, irrespective of data location. So a differentiating formula is really to not only be the best analytic database but be cloud agnostic. AWS, for example, they got a cloud agenda, as do Azure and GCP. Their number one answer to multi-cloud is put everything on our cloud. Yeah, Microsoft and Google Anthos, they would argue against that but we know that behind the scenes, that's what they want. They got offerings across clouds but Snowflake is going to make this a top priority. They can lead with that, and they must be best at it. And if Snowflake can do this, it's going to have a very successful future, in our opinion. And by all accounts, and the data that we shared, Snowflake is executing well. All right, so that's a wrap for this week's CUBE Insights, powered by ETR. Don't forget, all these breaking analysis segments are available as podcasts, just Google breaking analysis with Dave Vellante. I publish every week on wikibon.com and siliconangle.com. Check out etr.plus. That's where all the survey data is and reach out to me, I'm @dvellante on Twitter, or you can hit me up on my LinkedIn posts, or email me at david.vellante@siliconangle.com. Thanks for watching, everyone. We'll see you next time. (enlightening music)

Published Date : Jun 5 2020

SUMMARY :

all around the world, this and maybe, ahead of some of the others. I mean the excuses are that the cloud players are going to and cloud is the best way to scale.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
IBMORGANIZATION

0.99+

Frank SlootmanPERSON

0.99+

AmazonORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

OracleORGANIZATION

0.99+

Tony BaerPERSON

0.99+

GoogleORGANIZATION

0.99+

Dave VellantePERSON

0.99+

AWSORGANIZATION

0.99+

SlootmanPERSON

0.99+

BostonLOCATION

0.99+

twoQUANTITY

0.99+

TeradataORGANIZATION

0.99+

Palo AltoLOCATION

0.99+

2020DATE

0.99+

77.6%QUANTITY

0.99+

LumenORGANIZATION

0.99+

360-degreeQUANTITY

0.99+

USLOCATION

0.99+

SnowflakeEVENT

0.99+

TodayDATE

0.99+

80%QUANTITY

0.99+

77%QUANTITY

0.99+

two companiesQUANTITY

0.99+

EDWORGANIZATION

0.99+

41%QUANTITY

0.99+

677QUANTITY

0.99+

2019DATE

0.99+

78%QUANTITY

0.99+

54%QUANTITY

0.99+

FirstQUANTITY

0.99+

JuneDATE

0.99+

35%QUANTITY

0.99+

20 yearsQUANTITY

0.99+

SnowflakeTITLE

0.99+

more than a billion dollarsQUANTITY

0.99+

david.vellante@siliconangle.comOTHER

0.99+

51%QUANTITY

0.99+

last yearDATE

0.99+

76%QUANTITY

0.99+

AprilDATE

0.99+

Both companiesQUANTITY

0.99+

672QUANTITY

0.99+

more than 1,200 CIOsQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

more than 6%QUANTITY

0.99+

firstQUANTITY

0.99+

last JuneDATE

0.99+

todayDATE

0.99+

1,269QUANTITY

0.99+

both companiesQUANTITY

0.99+

Jamil Jaffer, IronNet | RSAC USA 2020


 

>>Bye from San Francisco. It's the cube covering RSA conference, 2020 San Francisco brought to you by Silicon angle media. >>Hey, welcome back. Everyone's keeps coverage here in San Francisco at the Moscone center for RSA conference 2020 I'm John, your host, as cybersecurity goes to the next generation as the new cloud scale, cyber threats are out there, the real impact a company's business and society will be determined by the industry. This technology and the people that a cube alumni here, caramel Jaffer, SVP, senior vice president of strategy and corporate development for iron net. Welcome back. Thanks to Shawn. Good to be here. Thanks for having so iron net FC general Keith Alexander and you got to know new CEO of there. Phil Welsh scaler and duo knows how to scale up a company. He's right. Iron is doing really well. The iron dome, the vision of collaboration and signaling. Congratulations on your success. What's a quick update? >> Well look, I mean, you know, we have now built the capability to share information across multiple companies, multiple industries with the government in real time at machine speed. >>Really bringing people together, not just creating collected security or clip to defense, but also collaborating real time to defend one another. So you're able to divide and conquer Goliath, the enemy the same way they come after you and beat them at their own game. >> So this is the classic case of offense defense. Most corporations are playing defense, whack-a-mole, redundant, not a lot of efficiencies, a lot of burnout. Exactly. Not a lot of collaboration, but everyone's talking about the who the attackers are and collaborating like a team. Right? And you guys talk about this mission. Exactly. This is really the new way to do it. It has, the only way it works, >> it is. And you know, you see kids doing it out there when they're playing Fortnite, right? They're collaborating in real time across networks, uh, to, you know, to play a game, right? You can imagine that same construct when it comes to cyber defense, right? >>There's no reason why one big company, a second big company in a small company can't work together to identify all the threats, see that common threat landscape, and then take action on it. Trusting one another to take down the pieces they have folk to focus on and ultimately winning the battle. There's no other way a single company is gonna be able defend itself against a huge decency that has virtually unlimited resources and virtually unlimited human capital. And you've got to come together, defend across multiple industries, uh, collectively and collaboratively. >> Do you mean, we talked about this last time and I want to revisit this and I think it's super important. I think it's the most important story that's not really being talked about in the industry. And that is that we were talking last time about the government protects businesses. If someone dropped troops on the ground in your neighborhood, the government would protect you digitally. >>That's not happening. So there's really no protection for businesses. Do they build their own militia? Do they build their own army? Who was going to, who's going to be their heat shield? So this is a big conversation and a big, it brings a question. The role of the government. We're going to need a digital air force. We're going to need a digital army, Navy, Navy seals. We need to have that force, and this has to be a policy issue, but in the short term, businesses and individuals are sitting out there being attacked by sophisticated mission-based teams of hackers and nation States, right? Either camouflaging or hiding, but attacking still. This is a huge issue. What's going on? Are people talking about this in D C well, >> John, look not enough. People are talking about it, right? And forget DC. We need to be talking about here, out here in the Silicon Valley with all these companies here at the RSA floor and bring up the things you're bringing up because this is a real problem we're facing as a nation. >>The Russians aren't coming after one company, one state. They're coming after our entire election infrastructure. They're coming after us as a nation. The Chinese maybe come after one company at a time, but their goal is to take our electoral properties, a nation, repurpose it back home. And when the economic game, right, the Iranians, the North Koreans, they're not focused on individual actors, but they are coming after individual actors. We can't defend against those things. One man, one woman, one company on an Island, one, one agency, one state. We've got to come together collectively, right? Work state with other States, right? If we can defend against the Russians, California might be really good at it. Rhode Island, small States can be real hard, defends against the Russians, but if California, Rhode Island come together, here's the threats. I see. Here's what it's. You see share information, that's great. Then we collaborate on the defense and work together. >>You take these threats, I'll take those threats and now we're working as a team, like you said earlier, like those kids do when they're playing fortnight and now we're changing the game. Now we're really fighting the real fight. >> You know, when I hear general Keith Alexander talking about his vision with iron net and what you guys are doing, I'm inspired because it's simply put, we have a mission to protect our nation, our people, and a good businesses, and he puts it into kind of military, military terms, but in reality, it's a simple concept. Yeah, we're being attacked, defend and attack back. Just basic stuff. But to make it work as the sharing. So I got to ask you, I'm first of all, I love the, I love what he has, his vision. I love what you guys are doing. How real are we? What's the progression? >>Where are we on the progress bar of that vision? Well, you know, a lot's changed to the last year and a half alone, right? The threats gotten a lot, a lot more real to everybody, right? Used to be the industry would say to us, yeah, we want to share with the government, but we want something back for, right. We want them to show us some signal to today. Industry is like, look, the Chinese are crushing us out there, right? We can beat them at a, at some level, but we really need the governor to go do its job too. So we'll give you the information we have on, on an anonymized basis. You do your thing. We're going to keep defending ourselves and if you can give us something back, that's great. So we've now stood up in real time of DHS. We're sharing with them huge amounts of data about what we're seeing across six of the top 10 energy companies, some of the biggest banks, some of the biggest healthcare companies in the country. >>Right? In real time with DHS and more to come on that more to come with other government agencies and more to come with some our partners across the globe, right? Partners like those in Japan, Singapore, Eastern Europe, right? Our allies in the middle East, they're all the four lenses threat. We can bring their better capability. They can help us see what's coming at us in the future because as those enemies out there testing the weapons in those local areas. I want to get your thoughts on the capital markets because obviously financing is critical and you're seeing successful venture capital formulas like forge point really specialized funds on cyber but not classic industry formation sectors. Like it's not just security industry are taking a much more broader view because there's a policy implication is that organizational behavior, this technology up and down the stack. So it's a much broad investment thesis. >>What's your view of that? Because as you do, you see that as a formula and if so, what is this new aperture or this new lens of investing to be successful in funding? Companies will look, it's really important what companies like forge point are doing. Venture capital funds, right? Don Dixon, Alberta Pez will land. They're really innovating here. They've created a largest cybersecurity focused fund. They just closed the recently in the world, right? And so they really focus on this industry. Partners like, Kleiner Perkins, Ted Schlein, Andrea are doing really great work in this area. Also really important capital formation, right? And let's not forget other funds. Ron Gula, right? The founder of tenable started his own fund out there in DC, in the DMV area. There's a lot of innovation happening this country and the funding on it's critical. Now look, the reality is the easy money's not going to be here forever, right? >>It's the question is what comes when that inevitable step back. We don't. Nobody likes to talk about it. I said the guy who who bets on the other side of the craps game in Vegas, right? You don't wanna be that guy, but let's be real. I mean that day will eventually come. And the question is how do you bring some of these things together, right? Bring these various pieces together to really create long term strategies, right? And that's I think what's really innovative about what Don and Alberto are doing is they're building portfolio companies across a range of areas to create sort of an end to end capability, right? Andrea is doing things like that. Ted's doing stuff like that. It's a, that's really innovation. The VC market, right? And we're seeing increased collaboration VC to PE. It's looking a lot more similar, right? And now we're seeing innovative vehicles like stacks that are taking some of these public sort of the reverse manner, right? >>There's a lot of interests. I've had to be there with Hank Thomas, the guys chief cyber wrenches. So a lot of really cool stuff going on in the financing world. Opportunities for young, smart entrepreneurs to really move out in this field and to do it now. And money's still silver. All that hasn't come as innovation on the capital market side, which is awesome. Let's talk about the ecosystem in every single market sector that I've been over, my 30 year career has been about a successful entrepreneurship check, capital two formation of partnerships. Okay. You're on the iron net, front lines here. As part of that ecosystem, how do you see the ecosystem formula developing? Is it the same kind of model? Is it a little bit different? What's your vision of the ecosystem? Look, I mean partnerships channel, it's critical to every cyber security company. You can't scale on your own. >>You've got to do it through others, right? I was at a CrowdStrike event the other day. 91% of the revenue comes from the channel. That's an amazing number. You think about that, right? It's you look at who we're trying to talk about partnering with. We're talking about some of the big cloud players. Amazon, Microsoft, right? Google, right on the, on the vendor side. Pardon me? Splunk crashes, so these big players, right? We want to build with them, right? We want to work with them because there's a story to tell here, right? When we were together, the AECOS through self is defendant stronger. There's no, there's no anonymity here, right? It's all we bring a specialty, you bring specialty, you work together, you run out and go get the go get the business and make companies safer. At the end of the day, it's all about protecting the ecosystem. What about the big cloud player? >>Cause he goes two big mega trends. Obviously cloud computing and scale, right? Multi-cloud on the horizon, hybrids, kind of the bridge between single public cloud and multi-cloud and then AI you've got the biggies are generally will be multiple generations of innovation and value creation. What's your vision on the impact of the big waves that are coming? Well, look, I mean cloud computing is a rate change the world right? Today you can deploy capability and have a supercomputer in your fingertips in in minutes, right? You can also secure that in minutes because you can update it in real time. As the machine is functioning, you have a problem, take it down, throw up a new virtual machine. These are amazing innovations that are creating more and more capability out there in industry. It's game changing. We're happy, we're glad to be part of that and we ought to be helping defend that new amazing ecosystem. >>Partnering with companies like Microsoft. They didn't AWS did, you know, you know, I'm really impressed with your technical acumen. You've got a good grasp of the industry, but also, uh, you have really strong on the societal impact policy formulation side of government and business. So I want to get your thoughts for the young kids out there that are going to school, trying to make sense of the chaos that's going on in the world, whether it's DC political theater or the tech theater, big tech and in general, all of the things with coronavirus, all this stuff going on. It's a, it's a pretty crazy time, but a lot of work has to start getting done that are new problems. Yeah. What is your advice as someone who's been through the multiple waves to the young kids who have to figure out what half fatigue, what problems are out there, what things can people get their arms around to work on, to specialize in? >>What's your, what's your thoughts and expertise on that? Well, John, thanks for the question. What I really like about that question is is we're talking about what the future looks like and here's what I think the future looks like. It's all about taking risks. Tell a lot of these young kids out there today, they're worried about how the world looks right? Will America still be strong? Can we, can we get through this hard time we're going through in DC with the world challenges and what I can say is this country has never been stronger. We may have our own troubles internally, but we are risk takers and we always win. No matter how hard it gets them out of how bad it gets, right? Risk taking a study that's building the American blood. It's our founders came here taking a risk, leaving Eagle to come here and we've succeeded the last 200 years. >>There is no question in my mind that trend will continue. So the young people out there, I don't know what the future has to hold. I don't know if the new tape I was going to be, but you're going to invent it. And if you don't take the risks, we're not succeed as a nation. And that's what I think is key. You know, most people worry that if they take too many risks, they might not succeed. Right? But the reality is most people you see around at this convention, they all took risks to be here. And even when they had trouble, they got up, they dust themselves off and they won. And I believe that everybody in this country, that's what's amazing about the station is we have this opportunity to, to try, if we fail to get up again and succeed. So fail fast, fail often, and crush it. >>You know, some of the best innovations have come from times where you had the cold war, you had, um, you had times where, you know, the hippie revolution spawn the computer. So you, so you have the culture of America, which is not about regulation and stunting growth. You had risk-taking, you had entrepreneurship, but yet enough freedom for business to operate, to solve new challenges, accurate. And to me the biggest imperative in my mind is this next generation has to solve a lot of those new questions. What side of the street is the self driving cars go on? I see bike lanes in San Francisco, more congestion, more more cry. All this stuff's going on. AI could be a great enabler for that. Cyber security, a direct threat to our country and global geopolitical landscape. These are big problems. State and local governments, they're not really tech savvy. They don't really have a lot ID. >>So what do they do? How do they serve their, their constituents? You know, look John, these are really important and hard questions, but we know what has made technology so successful in America? What's made it large, successful is the governor state out of the way, right? Industry and innovators have had a chance to work together and do stuff and change the world, right? You look at California, you know, one of the reasons California is so successful and Silicon Valley is so dynamic. You can move between jobs and we don't enforce non-compete agreements, right? Because you can switch jobs and you can go to that next higher value target, right? That shows the value of, you know, innovation, creating innovation. Now there's a real tendency to say, when we're faced with challenges, well, the government has to step in and solve that problem, right? The Silicon Valley and what California's done, what technology's done is a story about the government stayed out and let innovators innovate, and that's a real opportunity for this nation. >>We've got to keep on down that path, even when it seemed like the easier answer is, come on in DC, come on in Sacramento, fix this problem for us. We have demonstrated as a country that Americans and individual are good at solve these problems. We should allow them to do that and innovate. Yeah. One of my passions is to kind of use technology and media to end communities to get to the truth faster. A lot of, um, access to smart minds out there, but young minds, young minds, uh, old minds, young minds though. It's all there. You gotta get the data out and that's going to be a big thing. That's the, one of the things that's changing is the dark arts of smear campaigns. The story of Bloomberg today, Oracle reveals funding for dark money, group biting, big tech internet accountability projects. Um, and so the classic astroturfing get the Jedi contract, Google WASU with Java. >>So articles in the middle of all this, but using them as an illustrative point. The lawyers seem to be running the kingdom right now. I know you're an attorney, so I'm recovering, recovering. I don't want to be offensive, but entrepreneurship cannot be stifled by regulation. Sarbanes Oxley slowed down a lot of the IPO shifts to the latest stage capital. So regulation, nest and every good thing. But also there's some of these little tactics out in the shadows are going to be revealed. What's the new way to get this straightened out in your mind? We'll look, in my view, the best solution for problematic speech or pragmatic people is more speech, right? Let's shine a light on it, right? If there are people doing shady stuff, let's talk about it's an outfit. Let's have it out in the open. Let's fight it out. At the end of the day, what America's really about is smart ideas. >>Winning. It's a, let's get the ideas out there. You know, we spent a lot of time, right now we're under attack by the Russians when it comes to our elections, right? We spent a lot of time harping at one another, one party versus another party. The president versus that person. This person who tells committee for zap person who tells committee. It's crazy when the real threat is from the outside. We need to get past all that noise, right? And really get to the next thing which is we're fighting a foreign entity on this front. We need to face that enemy down and stop killing each other with this nonsense and turn the lights on. I'm a big believer of if something can be exposed, you can talk about it. Why is it happening exactly right. This consequences with that reputation, et cetera. You got it. >>Thanks for coming on the queue. Really appreciate your insight. Um, I want to just ask you one final question cause you look at, look at the industry right now. What is the most important story that people are talking about and what is the most important story that people should be talking about? Yeah. Well look, I think the one story that's out there a lot, right, is what's going on in our politics, what's going on in our elections. Um, you know, Chris Krebs at DHS has been out here this week talking a lot about the threat that our elections face and the importance about States working with one another and States working with the federal government to defend the nation when it comes to these elections in November. Right? We need to get ahead of that. Right? The reality is it's been four years since 2016 we need to do more. That's a key issue going forward. What are the Iranians North Koreans think about next? They haven't hit us recently. We know what's coming. We got to get ahead of that. I'm going to come again at a nation, depending on staff threat to your meal. Great to have you on the QSO is great insight. Thanks for coming on sharing your perspective. I'm John furrier here at RSA in San Francisco for the cube coverage. Thanks for watching.

Published Date : Feb 27 2020

SUMMARY :

RSA conference, 2020 San Francisco brought to you by Silicon The iron dome, the vision of collaboration and Well look, I mean, you know, time to defend one another. Not a lot of collaboration, but everyone's talking about the who the attackers are and collaborating like a And you know, you see kids doing it out there when they're playing Fortnite, take down the pieces they have folk to focus on and ultimately winning the battle. the government would protect you digitally. and this has to be a policy issue, but in the short term, businesses and individuals are sitting out there out here in the Silicon Valley with all these companies here at the RSA floor and bring up the things you're bringing Rhode Island, small States can be real hard, defends against the Russians, You take these threats, I'll take those threats and now we're working as a team, like you said earlier, You know, when I hear general Keith Alexander talking about his vision with iron net and what you guys are doing, We're going to keep defending ourselves and if you can give us something back, Our allies in the middle East, they're all the four lenses threat. Now look, the reality is the easy And the question is how do you bring some of these things together, right? So a lot of really cool stuff going on in the financing world. 91% of the revenue comes from the channel. on the impact of the big waves that are coming? You've got a good grasp of the industry, but also, uh, you have really strong on the societal impact policy Risk taking a study that's building the American blood. But the reality is most people you see around at this convention, they all took risks to be here. You know, some of the best innovations have come from times where you had the cold war, you had, That shows the value of, you know, innovation, creating innovation. You gotta get the data out and that's going to be a big thing. Sarbanes Oxley slowed down a lot of the IPO shifts to the latest stage capital. It's a, let's get the ideas out there. Great to have you on the QSO is

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JohnPERSON

0.99+

AmazonORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

AndreaPERSON

0.99+

Chris KrebsPERSON

0.99+

Ron GulaPERSON

0.99+

Keith AlexanderPERSON

0.99+

Jamil JafferPERSON

0.99+

JapanLOCATION

0.99+

GoogleORGANIZATION

0.99+

SacramentoLOCATION

0.99+

30 yearQUANTITY

0.99+

DHSORGANIZATION

0.99+

AWSORGANIZATION

0.99+

San FranciscoLOCATION

0.99+

Don DixonPERSON

0.99+

NovemberDATE

0.99+

DonPERSON

0.99+

Silicon ValleyLOCATION

0.99+

ShawnPERSON

0.99+

SingaporeLOCATION

0.99+

OracleORGANIZATION

0.99+

91%QUANTITY

0.99+

VegasLOCATION

0.99+

TodayDATE

0.99+

one companyQUANTITY

0.99+

DCLOCATION

0.99+

sixQUANTITY

0.99+

TedPERSON

0.99+

caramel JafferPERSON

0.99+

Rhode IslandLOCATION

0.99+

one womanQUANTITY

0.99+

AlbertoPERSON

0.99+

JavaTITLE

0.99+

Ted SchleinPERSON

0.99+

AmericaLOCATION

0.99+

RSACORGANIZATION

0.99+

BloombergORGANIZATION

0.99+

four yearsQUANTITY

0.99+

one final questionQUANTITY

0.99+

Phil WelshPERSON

0.99+

FortniteTITLE

0.99+

John furrierPERSON

0.98+

one stateQUANTITY

0.98+

Eastern EuropeLOCATION

0.98+

tenableORGANIZATION

0.98+

todayDATE

0.98+

One manQUANTITY

0.98+

one partyQUANTITY

0.98+

RSAORGANIZATION

0.98+

oneQUANTITY

0.98+

singleQUANTITY

0.98+

this weekDATE

0.97+

Hank ThomasPERSON

0.97+

IronNetORGANIZATION

0.97+

twoQUANTITY

0.96+

one agencyQUANTITY

0.96+

AECOSORGANIZATION

0.96+

AmericaORGANIZATION

0.96+

DMVLOCATION

0.96+

MosconeLOCATION

0.95+

AmericansPERSON

0.95+

OneQUANTITY

0.95+

IraniansPERSON

0.95+

2016DATE

0.94+

cold warEVENT

0.94+

RussiansPERSON

0.93+

RSA conferenceEVENT

0.93+

middle EastLOCATION

0.93+

single companyQUANTITY

0.93+

CaliforniaLOCATION

0.92+

four lensesQUANTITY

0.91+

EagleORGANIZATION

0.91+

second bigQUANTITY

0.91+

Chris Degnan, Snowflake & Anthony Brooks Williams, HVR | AWS re:Invent 2019


 

>>LA Las Vegas. It's the cube hovering AWS reinvent 2019 brought to you by Amazon web services and along with its ecosystem partners. >>Hey, welcome back to the cube. Our day one coverage of AWS reinvent 19 continues. Lisa Martin with Dave Volante. Dave and I have a couple of guests we'd like you to walk up. We've got Anthony Brooks billions, the CEO of HBR back on the cube. You're alumni. We should get you a pin and snowflake alumni. But Chris, your new Chris Dagon, chief revenue officer from snowflake. Chris, welcome to the program. Excited to be here. All right guys. So even though both companies have been on before, Anthony, let's start with you. Give our audience a refresher about HVR, who you guys are at, what you do. >>Sure. So we're in the data integration space, particularly a real time data integration. So we move data to the cloud in the in the most efficient way and we make sure it's secure and it's accurate and you're moving into environments such as snowflake. Um, and that's where we've got some really good customers that we happy to talk about joint custody that we're doing together. But Chris can tell us a little bit about snowflake. >>Sure. And snowflake is a cloud data warehousing company. We are cloud native, we are on AWS or on GCP and we're on Azure. And if you look at the competitive landscape, we compete with our friends at Amazon. We compete with our friends at Microsoft and our friends at Google. So it's super interesting place to be, but it very exciting at the same time and super excited to partner with Anthony and some others who aren't really a friends. That's correct. So I wonder if we could start by just talking about the data warehouse sort of trends that you guys see. When I talk to practitioners in the old days, they used to say to me things like, Oh, infrastructure management, it's such a nightmare. It's like a snake swallowing a basketball every time until it comes out with a new chips. We chase it because we just need more performance and we can't get our jobs done fast enough. And there's only three. There's three guys that we got to go through to get any answers and it was just never really lived up to the promise of 360 degree view of your business and realtime analytics. How has that changed? >>Well, there's that too. I mean obviously the cloud has had a big difference on that illustrious city. Um, what you would find is in, in, in yesterday, customers have these, a retail customer has these big events twice a year. And so to do an analysis on what's being sold and Casper's transactions, they bought this big data warehouse environment for two events a year typically. And so what's happening that's highly cost, highly costly as we know to maintain and then cause the advances in technology and trips and stuff. And then you move into this cloud world which gives you that Lester city of scale up, scale down as you need to. And then particular where we've got Tonies snowflake that is built for that environment and that elicited city. And so you get someone like us that can move this data at today's scale and volume through these techniques we have into an environment that then bleeds into helping them solve the challenge that you talk about of Yesi of >>these big clunky environments. That side, I think you, I think you kind of nailed it. I think like early days. So our founders are from Oracle and they were building Oracle AI nine nine, 10 G. and when I interviewed them I was the first sales rep showing up and day one I'm like, what the heck am I selling? And when I met them I said, tell me what the benefit of snowflake is. And they're like, well at Oracle, and we'd go talk to customers and they'd say, Oracles, you know, I have this problem with Oracle. They'd say, Hey, that's, you know, seven generations ago were Oracle. Do you have an upgraded to the latest code? So one of the things they talked about as being a service, Hey, we want to make it really easy. You never have to upgrade the service. And then to your point around, you have a fixed amount of resources on premise, so you can't all of a sudden if you have a new project, do you want to bring on the first question I asked when I started snowflake to customers was how long does it take you to kick off a net new workload onto your data, onto your Vertica and it take them nine to 12 months because they'd have to go procure the new hardware, install it, and guess what? >>With snowflake, you can make an instantaneous decision and because of our last test city, because the benefits of our partner from Amazon, you can really grow with your demand of your business. >>Many don't have the luxury of nine to 12 months anymore, Chris, because we all know if, if an enterprise legacy business isn't thinking, there's somebody not far behind me who has the elasticity, who has the appetite, who's who understands the opportunity that cloud provides. If you're not thinking that, as auntie Jessie will say, you're going to be on the wrong end of that equation. But for large enterprises, that's hard. The whole change culture is very hard to do. I'd love to get your perspective, Chris, what you're seeing in terms of industries shifting their mindsets to understand the value that they could unlock with this data, but how are big industries legacy industries changing? >>I'd say that, look, we were chasing Amad, we were chasing the cloud providers early days, so five years ago, we're selling to ad tech and online gaming companies today. What's happened in the industry is, and I'll give you a perfect example, is Ben wa and I, one of our founders went out to one of the largest investment banks on wall street five years ago, and they said, and they have more money than God, and they say, Hey, we love what you've built. We love, when are you going to run on premise? And Ben, Ben wa uttered this phrase of, Hey, you will run on the public cloud before we ever run in the private cloud. And guess what? He was a truth teller because five years later, they are one of our largest customers today. And they made the decision to move to the cloud and we're seeing financial services at a blistering face moved to the cloud. >>And that's where, you know, partnering with folks from HR is super important for us because we don't have the ability to just magically have this data appear in the cloud. And that's where we rely quite heavily on on instance. So Anthony, in the financial services world in particular, it used to be a cloud. Never that was an evil word. Automation. No, we have to have full control and in migration, never digital transformation to start to change those things. It's really become an imperative, but it's by in particular is really challenging. So I wonder if we could dig into that a little bit and help us understand how you solve that problem. >>Yes. A customer say they want to adopt some of these technologies. So there's the migration route. They may want to go adopt some of these, these cloud databases, the cloud data warehouses. And so we have some areas where we, you know, we can do that and keep the business up and running at the same time. So the techniques we use are we reading the transactional logs, other databases or something called CDC. And so there'll be an initial transfer of the bulk of the data initiative stantiating or refresh. At that same time we capturing data out of the transaction logs, wildlife systems live and doing a migration to the new environment or into snowflakes world, capturing data where it's happening, where the data is generated and moving that real time securely, accurately into this environment for somewhere like 1-800-FLOWERS where they can do this, make better decisions to say the cost is better at point of sale. >>So have all their business divisions pulling it in. So there's the migration aspects and then there's the, the use case around the realtime reporting as well. So you're essentially refueling the plane. Well while you're in mid air. Um, yeah, that's a good one. So what does the customer see? How disruptive is it? How do you minimize that disruption? Well, the good thing is, well we've all got these experienced teams like Chris said that have been around the block and a lot of us have done this. What we do, what ed days fail for the last 15 years, that companies like golden gate that we sold to Oracle and those things. And so there's a whole consultative approach to them versus just here's some software, good luck with it. So there's that aspect where there's a lot of planning that goes into that and then through that using our technologies that are well suited to this Appleton shows some good success and that's a key focus for us. And in our world, in this subscription by SAS top world, customer success is key. And so we have to build a lot of that into how we make this successful as well. >>I think it's a barrier to entry, like going, going from on premise to the cloud. That's the number one pushback that we get when we go out and say, Hey, we have a cloud native data warehouse. Like how the heck are we going to get the data to the cloud? And that's where, you know, a partnership with HR. Super important. Yeah. >>What are some of the things that you guys encountered? Because we many businesses live in the multi-cloud world most of the time, not by strategy, right? A lot of the CIO say, well we sort of inherited this, or it's M and a or it's developers that have preference. How do you help customers move data appropriately based on the value that the perceived value that it can give in what is really a multi world today? Chris, we'll start with you. >>Yeah, I think so. So as we go into customers, I think the biggest hurdle for them to move to the cloud is security because they think the cloud is not secure. So if we, if you look at our engagement with customers, we go in and we actually have to sell the value snowflake and then they say, well, okay great, go talk to the security team. And then we talked to security team and say, Hey, let me show you how we secure data. And then then they have to get comfortable around how they're going to actually move, get the data from on premise to the cloud. And that's again, when we engage with partners like her. So yeah, >>and then we go through a whole process with a customer. There's a taking some of that data in a, in a POC type environment and proving that after, as before it gets rolled out. And a lot of, you know, references and case studies around it as well. >>Depends on the customer that you have some customers who are bold and it doesn't matter the size. We have a fortune 100 customer who literally had an on premise Teradata system that they moved from on prem, from on premise 30 to choose snowflake in 111 days because they were all in. You have other customers that say, Hey, I'm going to take it easy. I'm going to workload by workload. And it just depends. And the mileage may vary is what can it give us an example of maybe a customer example or in what workloads they moved? Was it reporting? What other kinds? Yeah. >>Oh yeah. We got a couple of, you mean we could talk a little bit about 1-800-FLOWERS. We can talk about someone like Pitney Bowes where they were moving from Oracle to secret server. It's a bunch of SAP data sitting in SAP ECC. So there's some complexity around how you acquire, how you decode that data, which we ever built a unique ability to do where we can decode the cluster and pool tables coupled with our CDC technique and they had some stringent performance loads, um, that a bunch of the vendors couldn't meet the needs between both our companies. And so we were able to solve their challenge for them jointly and move this data at scale in the performance that they needed out with these articles, secret server enrollments into, into snowflake. >>I almost feel like when you have an SAP environment, it's almost stuck in SAP. So to get it out is like, it's scary, right? And this is where it's super awesome for us to do work like this. >>On that front, I wanted to understand your thoughts on transformation. It's a word, it's a theme of reinvent 2019. It's a word that we hear at every event, whether we're talking about digital transformation, workforce, it, et cetera. But one of the things that Andy Jassy said this morning was that got us start. It's this is more than technology, right? This, the next gen cloud is more than technology. It's about getting those senior leaders on board. Chris, your perspective, looking at financial services first, we were really surprised at how quickly they've been able to move. Understanding presumably that if they don't, there's going to be other businesses. But are you seeing that as the chief revenue officer or your conversations starting at that CEO level? >>It kinda has to like in the reason why if you do in bottoms up approach and say, Hey, I've got a great technology and you sell this great technology to, you know, a tech person. The reality is unless the C E O CIO or CTO has an initiative to do digital transformation and move to the cloud, you'll die. You'll die in security, you'll die in legal lawyers love to kill deals. And so those are the two areas that I see D deals, you know, slow down significantly. And that's where, you know, we, it's, it's getting through those processes and finding the champion at the CEO level, CIO level, CTO level. If you're, if you're a modern day CIO and you do not have a a cloud strategy, you're probably going to get replaced >>in 18 months. So you know, you better get on board and you'd better take, you know, taking advantage of what's happening in the industry. >>And I think that coupled with the fact that in today's world, you mean, you said there's a, it gets thrown around as a, as a theme and particularly the last couple of years, I think it's, it's now it is actually a strategy and, and reality because what Josephine is that there's as many it tech savvy people sit in the business side of organizations today that used to sit in legacy it. And I think it's that coupled with the leadership driving it that's, that's demanding it, that demanding to be able to access that certain type of data in a geo to make decisions that affect the business. Right now. >>I wonder if we could talk a little bit more about some of the innovations that are coming up. I mean I've been really hard on data. The data warehouse industry, you can tell I'm jaded. I've been around a long time. I mean I've always said that that Sarbanes Oxley saved the old school BI and data warehousing and because all the reporting requirements, and again that business never lived up to its promises, but it seems like there's this whole new set of workloads emerging in the cloud where you take a data warehouse like a snowflake, you may be bringing in some ML tools, maybe it's Databricks or whatever. You HVR helping you sort of virtualize the data and people are driving new workloads that are, that are bringing insights that they couldn't get before in near real time. What are you seeing in terms of some of those gestalt trends and how are companies taking advantage of these innovations? >>I think one is just the general proliferation of data. There's just more data and like you're saying from many different sources, so they're capturing data from CNC machines in factories, you know like like we do for someone like GE, that type of data is to data financial data that's sitting in a BU taking all of that and going there's just as boss some of data, how can we get a total view of our business and at a board level make better decisions and that's where they got put it in I snowflake in this an elastic environment that allows them to do this consolidated view of that whole organization, but I think it's largely been driven by things that digitize their sensors on everything and there's just a sheer volume of data. I think all of that coming together is what's, what's driven it >>is is data access. We talked about security a little bit, but who has rights to access the data? Is that a challenge? How are you guys solving that or is it, I mean I think it's like anything like once people start to understand how a date where we're an acid compliant date sequel database, so we whatever your security you use on your on premise, you can use the same on snowflake. It's just a misperception that the industry has that being on, on in a data center is more secure than being in the cloud and it's actually wrong. I guess my question is not so much security in the cloud, it's more what you were saying about the disparate data sources that coming in hard and fast now. And how do you keep track of who has access to the data? I mean is it another security tool or is it a partnership within owes? >>Yeah, absolutely man. So there's also, there's in financial data, there's certain geos, data leaves, certain geos, whether it be in the EU or certain companies, particularly this end, there's big banks now California, there's stuff that we can do from a security perspective in the data that we move that's secure, it's encrypted. If we capturing data from multiple different sources, items we have that we have the ability to take it all through one, one proxy in the firewall, which does, it helps him a lot in that aspect. Something unique in our technology. But then there's other tools that they have and largely you sit down with them and it's their sort of governance that they have in the, in the organization to go, how do they tackle that and the rules they set around it, you know? >>Well, last question I have is, so we're seeing, you know, I look at the spending data and my breaking analysis, go on my LinkedIn, you'll see it snowflakes off the charts. It's up there with, with robotic process automation and obviously Redshift. Very strong. Do you see those two? I think you addressed it before, but I'd love to get you on record sort of coexisting and thriving. Really, that's not the enemy, right? It's the, it's the Terra data's and the IBM's and the Oracles. The, >>I think, look, uh, you know, Amazon, our relationship with Amazon is like a, you know, a 20 year marriage, right? Sometimes there's good days, sometimes there's bad days. And I think, uh, you know, every year about this time, you know, we get a bat phone call from someone at Amazon saying, Hey, you know, the Redshift team's coming out with a snowflake killer. And I've heard that literally for six years now. Um, it turns out that there's an opportunity for us to coexist. Turns out there's an opportunity for us to compete. Um, and it's all about how they handle themselves as a business. Amazon has been tremendous in separation of that, of, okay, are going to partner here, we're going to compete here, and we're okay if you guys beat us. And, and so that's how they operate. But yes, it is complex and it's, it's, there are challenges. >>Well, the marketplace guys must love you though because you're selling a lot of computers. >>Well, yeah, yeah. This is three guys. They, when they left, we have a summer thing. You mean NWS have a technological DMS, their data migration service, they work with us. They refer opportunities to us when it's these big enterprises that are use cases, scale complexity, volume of data. That's what we do. We're not necessary into the the smaller mom and pop type shops that just want to adopt it, and I think that's where we all both able to go coexist together. There's more than enough. >>All right. You're right. It's like, it's like, Hey, we have champions in the Esri group, the EEC tuna group, that private link group, you know, across all the Amazon products. So there's a lot of friends of ours. Yeah, the red shift team doesn't like us, but that's okay. I can live in >>healthy coopertition, but it just goes to show that not only do customers and partners have toys, but they're exercising it. Gentlemen, thank you for joining David knee on the key of this afternoon. We appreciate your time. Thank you for having us. Pleasure our pleasure for Dave Volante. I'm Lisa Martin. You're watching the queue from day one of our coverage of AWS reinvent 19 thanks for watching.

Published Date : Dec 3 2019

SUMMARY :

AWS reinvent 2019 brought to you by Amazon web services Dave and I have a couple of guests we'd like you to walk up. So we move data to the cloud in the in the most efficient way and we make sure it's secure and And if you look at the competitive landscape, And then you move into this cloud world which gives you that Lester city of scale to customers was how long does it take you to kick off a net new workload onto your data, from Amazon, you can really grow with your demand of your business. Many don't have the luxury of nine to 12 months anymore, Chris, And they made the decision to move to the cloud and we're seeing financial services And that's where, you know, partnering with folks from HR is super important for us because And so we have some areas where we, And so we have to build a lot of that into how we make this successful And that's where, you know, a partnership with HR. What are some of the things that you guys encountered? And then we talked to security team and say, Hey, let me show you how we secure data. And a lot of, you know, references and case studies around it as well. Depends on the customer that you have some customers who are bold and it doesn't matter the size. So there's some complexity around how you acquire, how you decode that data, I almost feel like when you have an SAP environment, it's almost stuck in SAP. But are you seeing that And that's where, you know, So you know, you better get on board and you'd better take, you know, taking advantage of what's happening And I think that coupled with the fact that in today's world, you mean, you said there's a, it gets thrown around as a, like there's this whole new set of workloads emerging in the cloud where you take a factories, you know like like we do for someone like GE, that type of is not so much security in the cloud, it's more what you were saying about the disparate in the organization to go, how do they tackle that and the rules they set around it, Well, last question I have is, so we're seeing, you know, I look at the spending data and my breaking analysis, separation of that, of, okay, are going to partner here, we're going to compete here, and we're okay if you guys to us when it's these big enterprises that are use cases, scale complexity, that private link group, you know, across all the Amazon products. Gentlemen, thank you for joining David knee on the key of this afternoon.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Lisa MartinPERSON

0.99+

ChrisPERSON

0.99+

AmazonORGANIZATION

0.99+

Dave VolantePERSON

0.99+

OracleORGANIZATION

0.99+

IBMORGANIZATION

0.99+

AnthonyPERSON

0.99+

BenPERSON

0.99+

Andy JassyPERSON

0.99+

Chris DagonPERSON

0.99+

DavidPERSON

0.99+

JessiePERSON

0.99+

six yearsQUANTITY

0.99+

three guysQUANTITY

0.99+

GoogleORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

HBRORGANIZATION

0.99+

NWSORGANIZATION

0.99+

Ben waPERSON

0.99+

Chris DegnanPERSON

0.99+

first questionQUANTITY

0.99+

Anthony BrooksPERSON

0.99+

OraclesORGANIZATION

0.99+

360 degreeQUANTITY

0.99+

111 daysQUANTITY

0.99+

CasperORGANIZATION

0.99+

nineQUANTITY

0.99+

five years laterDATE

0.99+

12 monthsQUANTITY

0.99+

oneQUANTITY

0.99+

five years agoDATE

0.99+

yesterdayDATE

0.99+

LinkedInORGANIZATION

0.99+

twoQUANTITY

0.99+

EsriORGANIZATION

0.99+

AWSORGANIZATION

0.99+

CaliforniaLOCATION

0.99+

both companiesQUANTITY

0.99+

GEORGANIZATION

0.98+

two areasQUANTITY

0.98+

bothQUANTITY

0.98+

18 monthsQUANTITY

0.98+

todayDATE

0.98+

SASORGANIZATION

0.98+

twice a yearQUANTITY

0.98+

golden gateORGANIZATION

0.97+

JosephinePERSON

0.96+

EULOCATION

0.96+

TerraORGANIZATION

0.95+

threeQUANTITY

0.93+

day oneQUANTITY

0.93+

this morningDATE

0.93+

one proxyQUANTITY

0.93+

CTOORGANIZATION

0.93+

LA Las VegasLOCATION

0.92+

C E O CIOORGANIZATION

0.91+

Anthony Brooks WilliamsPERSON

0.91+

Tony Higham, IBM | IBM Data and AI Forum


 

>>live from Miami, Florida It's the Q covering IBM is data in a I forum brought to you by IBM. >>We're back in Miami and you're watching the cubes coverage of the IBM data and a I forum. Tony hi. Amiss here is a distinguished engineer for Ditch the Digital and Cloud Business Analytics at IBM. Tony, first of all, congratulations on being a distinguished engineer. That doesn't happen often. Thank you for coming on the Cube. Thank you. So your area focus is on the B I and the Enterprise performance management space. >>Um, and >>if I understand it correctly, a big mission of yours is to try to modernize those make himself service, making cloud ready. How's that going? >>It's going really well. I mean, you know, we use things like B. I and enterprise performance management. When you really boil it down, there's that's analysis of data on what do we do with the data this useful that makes a difference in the world, and then this planning and forecasting and budgeting, which everyone has to do whether you are, you know, a single household or whether you're an Amazon or Boeing, which are also some of our clients. So it's interesting that we're going from really enterprise use cases, democratizing it all the way down to single user on the cloud credit card swipe 70 bucks a month >>so that was used to be used to work for Lotus. But Cognos is one of IBM's largest acquisitions in the software space ever. Steve Mills on his team architected complete transformation of IBM is business and really got heavily into it. I think I think it was a $5 billion acquisition. Don't hold me to that, but massive one of the time and it's really paid dividends now when all this sort of 2000 ten's came in and said, Oh, how Duke's gonna kill all the traditional b I traditional btw that didn't happen, that these traditional platforms were a fundamental component of people's data strategies, so that created the imperative to modernize and made sure that there could be things like self service and cloud ready, didn't it? >>Yeah, that's absolutely true. I mean, the work clothes that we run a really sticky were close right when you're doing your reporting, your consolidation or you're planning of your yearly cycle, your budget cycle on these technologies, you don't rip them out so easily. So yes, of course, there's competitive disruption in the space. And of course, cloud creates on opportunity for work loads to be wrong, Cheaper without your own I t people. And, of course, the era of digital software. I find it myself. I tried myself by it without ever talking to a sales person creates a democratization process for these really powerful tools that's never been invented before in that space. >>Now, when I started in the business a long, long time ago, it was called GSS decision support systems, and they at the time they promised a 360 degree view with business That never really happened. You saw a whole new raft of players come in, and then the whole B I and Enterprise Data Warehouse was gonna deliver on that promise. That kind of didn't happen, either. Sarbanes Oxley brought a big wave of of imperative around these systems because compliance became huge. So that was a real tailwind for it. Then her duke was gonna solve all these problems that really didn't happen. And now you've got a I, and it feels like the combination of those systems of record those data warehouse systems, the traditional business intelligence systems and all this new emerging tech together are actually going to be a game changer. I wonder if you could comment on >>well so they can be a game changer, but you're touching on a couple of subjects here that are connected. Right? Number one is obviously the mass of data, right? Cause data has accelerated at a phenomenal pace on then you're talking about how do I then visualize or use that data in a useful manner? And that really drives the use case for a I right? Because A I in and of itself, for augmented intelligence as we as we talk about, is only useful almost when it's invisible to the user cause the user needs to feel like it's doing something for them that super intuitive, a bit like the sort of transition between the electric car on the normal car. That only really happens when the electric car can do what the normal car can do. So with things like Imagine, you bring a you know, how do cluster into a B. I solution and you're looking at that data Well. If I can correlate, for example, time profit cost. Then I can create KP eyes automatically. I can create visualizations. I know which ones you like to see from that. Or I could give you related ones that I can even automatically create dashboards. I've got the intelligence about the data and the knowledge to know what? How you might what? Visualize adversity. You have to manually construct everything >>and a I is also going to when you when you spring. These disparage data sets together, isn't a I also going to give you an indication of the confidence level in those various data set. So, for example, you know, you're you're B I data set might be part of the General ledger. You know of the income statement and and be corporate fact very high confidence level. More sometimes you mention to do some of the unstructured data. Maybe not as high a confidence level. How our customers dealing with that and applying that first of all, is that a sort of accurate premise? And how is that manifesting itself in terms of business? Oh, >>yeah. So it is an accurate premise because in the world in the world of data. There's the known knowns on the unknown knowns, right? No, no's are what you know about your data. What's interesting about really good B I solutions and planning solutions, especially when they're brought together, right, Because planning and analysis naturally go hand in hand from, you know, one user 70 bucks a month to the Enterprise client. So it's things like, What are your key drivers? So this is gonna be the drivers that you know what drives your profit. But when you've got massive amounts of data and you got a I around that, especially if it's a I that's gone ontology around your particular industry, it can start telling you about drivers that you don't know about. And that's really the next step is tell me what are the drivers around things that I don't know. So when I'm exploring the data, I'd like to see a key driver that I never even knew existed. >>So when I talk to customers, I'm doing this for a while. One of the concerns they had a criticisms they had of the traditional systems was just the process is too hard. I got to go toe like a few guys I could go to I gotta line up, you know, submit a request. By the time I get it back, I'm on to something else. I want self serve beyond just reporting. Um, how is a I and IBM changing that dynamic? Can you put thes tools in the hands of users? >>Right. So this is about democratizing the cleverness, right? So if you're a big, broad organization, you can afford to hire a bunch of people to do that stuff. But if you're a startup or an SNB, and that's where the big market opportunity is for us, you know, abilities like and this it would be we're building this into the software already today is I'll bring a spreadsheet. Long spreadsheets. By definition, they're not rows and columns, right? Anyone could take a Roan Collin spreadsheet and turn into a set of data because it looks like a database. But when you've got different tabs on different sets of data that may or may not be obviously relatable to each other, that ai ai ability to be on introspect a spreadsheet and turn into from a planning point of view, cubes, dimensions and rules which turn your spreadsheet now to a three dimensional in memory cube or a planning application. You know, the our ability to go way, way further than you could ever do with that planning process over thousands of people is all possible now because we don't have taken all the hard work, all the lifting workout, >>so that three dimensional in memory Cuba like the sound of that. So there's a performance implication. Absolutely. On end is what else? Accessibility Maw wraps more users. Is that >>well, it's the ability to be out of process water. What if things on huge amounts of data? Imagine you're bowing, right? Howdy, pastors. Boeing How? I don't know. Three trillion. I'm just guessing, right? If you've got three trillion and you need to figure out based on the lady's hurricane report how many parts you need to go ship toe? Where that hurricane reports report is you need to do a water scenario on massive amounts of data in a second or two. So you know that capability requires an old lap solution. However, the rest of the planet other than old people bless him who are very special. People don't know what a laugh is from a pop tart, so democratizing it right to the person who says, I've got a set of data on as I still need to do what if analysis on things and probably at large data cause even if you're a small company with massive amounts of data coming through, people click. String me through your website just for example. You know what if I What if analysis on putting a 5% discount on this product based on previous sales have that going to affect me from a future sales again? I think it's the democratizing as the well is the ability to hit scale. >>You talk about Cloud and analytics, how they've they've come together, what specifically IBM has done to modernize that platform. And I'm interested in what customers are saying. What's the adoption like? >>So So I manage the Global Cloud team. We have night on 1000 clients that are using cloud the cloud implementations of our software growing actually so actually Maur on two and 1/2 1000. If you include the multi tenant version, there's two steps in this process, right when you've got an enterprise software solution, your clients have a certain expectation that your software runs on cloud just the way as it does on premise, which means in practical terms, you have to build a single tenant will manage cloud instance. And that's just the first step, right? Because getting clients to see the value of running the workload on cloud where they don't need people to install it, configure it, update it, troubleshoot it on all that other sort of I t. Stuff that subtracts you from doing running your business value. We duel that for you. But the future really is in multi tenant on how we can get vast, vast scale and also greatly lower costs. But the adoptions been great. Clients love >>it. Can you share any kind of indication? Or is that all confidential or what kind of metrics do you look at it? >>So obviously we look, we look a growth. We look a user adoption, and we look at how busy the service. I mean, let me give you the best way I can give you is a is a number of servers, volume numbers, right. So we have 8000 virtual machines running on soft layer or IBM cloud for our clients business Analytics is actually the largest client for IBM Cloud running those workloads for our clients. So it's, you know, that the adoption has been really super hard on the growth continues. Interestingly enough, I'll give you another factoid. So we just launched last October. Cognos Alex. Multi tenant. So it is truly multi infrastructure. You try, you buy, you give you credit card and away you go. And you would think, because we don't have software sellers out there selling it per se that it might not adopt as much as people are out there selling software. Okay, well, in one year, it's growing 10% month on month cigarette Ally's 10% month on month, and we're nearly 1400 users now without huge amounts of effort on our part. So clearly this market interest in running those softwares and then they're not want Tuesdays easer. Six people pretending some of people have 150 people pretending on a multi tenant software. So I believe that the future is dedicated is the first step to grow confidence that my own premise investments will lift and shift the cloud, but multi tenant will take us a lot >>for him. So that's a proof point of existing customer saying okay, I want to modernize. I'm buying in. Take 1/2 step of the man dedicated. And then obviously multi tenant for scale. And just way more cost efficient. Yes, very much. All right. Um, last question. Show us a little leg. What? What can you tell us about the road map? What gets you excited about the future? >>So I think the future historically, Planning Analytics and Carlos analytics have been separate products, right? And when they came together under the B I logo in about about a year ago, we've been spending a lot of our time bringing them together because, you know, you can fight in the B I space and you can fight in the planning space. And there's a lot of competitors here, not so many here. But when you bring the two things together, the connected value chain is where we really gonna win. But it's not only just doing is the connected value chain it and it could be being being vice because I'm the the former Lotus guy who believes in democratization of technology. Right? But the market showing us when we create a piece of software that starts at 15 bucks for a single user. For the same power mind you write little less less of the capabilities and 70 bucks for a single user. For all of it, people buy it. So I'm in. >>Tony, thanks so much for coming on. The kid was great to have you. Brilliant. Thank you. Keep it right there, everybody. We'll be back with our next guest. You watching the Cube live from the IBM data and a I form in Miami. We'll be right back.

Published Date : Oct 23 2019

SUMMARY :

IBM is data in a I forum brought to you by IBM. is on the B I and the Enterprise performance management How's that going? I mean, you know, we use things like B. I and enterprise performance management. so that created the imperative to modernize and made sure that there could be things like self service and cloud I mean, the work clothes that we run a really sticky were close right when you're doing and it feels like the combination of those systems of record So with things like Imagine, you bring a you know, and a I is also going to when you when you spring. that you know what drives your profit. By the time I get it back, I'm on to something else. You know, the our ability to go way, way further than you could ever do with that planning process So there's a performance implication. So you know that capability What's the adoption like? t. Stuff that subtracts you from doing running your business value. or what kind of metrics do you look at it? So I believe that the future is dedicated What can you tell us about the road map? For the same power mind you write little less less of the capabilities and 70 bucks for a single user. The kid was great to have you.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Tony HighamPERSON

0.99+

Steve MillsPERSON

0.99+

AmazonORGANIZATION

0.99+

IBMORGANIZATION

0.99+

BoeingORGANIZATION

0.99+

MiamiLOCATION

0.99+

$5 billionQUANTITY

0.99+

15 bucksQUANTITY

0.99+

TonyPERSON

0.99+

70 bucksQUANTITY

0.99+

three trillionQUANTITY

0.99+

5%QUANTITY

0.99+

Three trillionQUANTITY

0.99+

360 degreeQUANTITY

0.99+

150 peopleQUANTITY

0.99+

Miami, FloridaLOCATION

0.99+

two stepsQUANTITY

0.99+

Six peopleQUANTITY

0.99+

1000 clientsQUANTITY

0.99+

two thingsQUANTITY

0.99+

twoQUANTITY

0.99+

first stepQUANTITY

0.99+

last OctoberDATE

0.99+

OneQUANTITY

0.97+

one yearQUANTITY

0.97+

DukeORGANIZATION

0.97+

Ditch the DigitalORGANIZATION

0.97+

todayDATE

0.97+

CubaLOCATION

0.96+

AmissPERSON

0.96+

Planning AnalyticsORGANIZATION

0.96+

single userQUANTITY

0.96+

LotusTITLE

0.95+

nearly 1400 usersQUANTITY

0.95+

TuesdaysDATE

0.92+

oneQUANTITY

0.92+

10% monthQUANTITY

0.92+

B IORGANIZATION

0.91+

aboutDATE

0.91+

over thousands of peopleQUANTITY

0.91+

Global CloudORGANIZATION

0.91+

Carlos analyticsORGANIZATION

0.91+

10% monthQUANTITY

0.9+

1/2 1000QUANTITY

0.87+

AlexPERSON

0.87+

firstQUANTITY

0.81+

70 bucks a monthQUANTITY

0.81+

8000 virtual machinesQUANTITY

0.8+

AllyORGANIZATION

0.79+

Enterprise Data WarehouseORGANIZATION

0.79+

single tenantQUANTITY

0.79+

a year agoDATE

0.79+

CollinPERSON

0.78+

single userQUANTITY

0.76+

1/2 stepQUANTITY

0.73+

Sarbanes OxleyPERSON

0.73+

single householdQUANTITY

0.7+

Cloud Business AnalyticsORGANIZATION

0.7+

a secondQUANTITY

0.68+

coupleQUANTITY

0.65+

CognosPERSON

0.59+

2000 tenDATE

0.58+

cloudTITLE

0.57+

RoanORGANIZATION

0.56+

IBM CloudORGANIZATION

0.53+

CubePERSON

0.37+

Show Wrap | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's three Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back. We're here to wrap up the M I T. Chief data officer officer, information quality. It's hashtag m i t CDO conference. You're watching the Cube. I'm David Dante, and Paul Gill is my co host. This is two days of coverage. We're wrapping up eyes. Our analysis of what's going on here, Paul, Let me let me kick it off. When we first started here, we talked about that are open. It was way saw the chief data officer role emerged from the back office, the information quality role. When in 2013 the CEO's that we talked to when we asked them what was their scope. We heard things like, Oh, it's very wide. Involves analytics, data science. Some CEOs even said Oh, yes, security is actually part of our purview because all the cyber data so very, very wide scope. Even in some cases, some of the digital initiatives were sort of being claimed. The studios were staking their claim. The reality was the CDO also emerged out of highly regulated industries financialservices healthcare government. And it really was this kind of wonky back office role. And so that's what my compliance, that's what it's become again. We're seeing that CEOs largely you're not involved in a lot of the emerging. Aye, aye initiatives. That's what we heard, sort of anecdotally talking to various folks At the same time. I feel as though the CDO role has been more fossilized than it was before. We used to ask, Is this role going to be around anymore? We had C I. Ose tell us that the CEO Rose was going to disappear, so you had both ends of the spectrum. But I feel as though that whatever it's called CDO Data's our chief analytics off officer, head of data, you know, analytics and governance. That role is here to stay, at least for for a fair amount of time and increasingly, issues of privacy and governance. And at least the periphery of security are gonna be supported by that CD a role. So that's kind of takeaway Number one. Let me get your thoughts. >> I think there's a maturity process going on here. What we saw really in 2016 through 2018 was, ah, sort of a celebration of the arrival of the CDO. And we're here, you know, we've got we've got power now we've got an agenda. And that was I mean, that was a natural outcome of all this growth and 90% of organizations putting sea Dios in place. I think what you're seeing now is a realization that Oh, my God, this is a mess. You know what I heard? This year was a lot less of this sort of crowing about the ascendance of sea Dios and Maura about We've got a big integration problem of big data cleansing problem, and we've got to get our hands down to the nitty gritty. And when you talk about, as you said, we had in here so much this year about strategic initiatives, about about artificial intelligence, about getting involved in digital business or customer experience transformation. What we heard this year was about cleaning up data, finding the data that you've got organizing it, applying meditator, too. It is getting in shape to do something with it. There's nothing wrong with that. I just think it's part of the natural maturation process. Organizations now have to go through Tiu to the dirty process of cleaning up this data before they can get to the next stage, which was a couple of three years out for most of >> the second. Big theme, of course. We heard this from the former head of analytics. That G s K on the opening keynote is the traditional methods have failed the the Enterprise Data Warehouse, and we've actually studied this a lot. You know, my analogy is often you snake swallowing a basketball, having to build cubes. E D W practitioners would always used to call it chasing the chips until we come up with a new chip. Oh, we need that because we gotta run faster because it's taking us hours and hours, weeks days to run these analytics. So that really was not an agile. It was a rear view mirror looking thing. And Sarbanes Oxley saved the E. D. W. Business because reporting became part of compliance thing perspective. The master data management piece we've heard. Do you consistently? We heard Mike Stone Breaker, who's obviously a technology visionary, was right on. It doesn't scale through this notion of duping. Everything just doesn't work and manually creating rules. It's just it's just not the right approach. This we also heard the top down data data enterprise data model doesn't works too complicated, can operationalize it. So what they do, they kick the can to governance. The Duke was kind of a sidecar, their big data that failed to live up to its promises. And so it's It's a big question as to whether or not a I will bring that level of automation we heard from KPMG. Certainly, Mike Stone breaker again said way heard this, uh, a cz well, from Andy Palmer. They're using technology toe automate and scale that big number one data science problem, which is? They spend all their time wrangling data. We'll see if that if that actually lives up >> to his probable is something we did here today from several of our guests. Was about the promise of machine learning to automate this day to clean up process and as ah Mark Ramsay kick off the conference saying that all of these efforts to standardize data have failed in the past. This does look, He then showed how how G s K had used some of the tools that were represented here using machine learning to actually clean up the data at G S. K. So there is. And I heard today a lot of optimism from the people we talked to about the capability of Chris, for example, talking about the capability of machine learning to bring some order to solve this scale scale problem Because really organizing data creating enterprise data models is a scale problem, and the only way you can solve that it's with with automation, Mike Stone breaker is right on top of that. So there was optimism at this event. There was kind of an ooh, kind of, ah, a dismay at seeing all the data problems they have to clean up, but also promised that tools are on the way that could do that. >> Yeah, The reason I'm an optimist about this role is because data such a hard problem. And while there is a feeling of wow, this is really a challenge. There's a lot of smart people here who are up for the challenge and have the d n a for it. So the role, that whole 360 thing. We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, which is really bringing machine intelligence to the table. We haven't heard that as much at this event. It's now front and center. It's just another example of a I injecting itself into virtually every aspect every corner of the industry. And again, I often jokes. Same wine, new bottle. Our industry has a habit of doing that, but it's cyclical, but it is. But we seem to be making consistent progress. >> And the machine learning, I thought was interesting. Several very guest spoke to machine learning being applied to the plumbing projects right now to cleaning up data. Those are really self contained projects. You can manage those you can. You can determine out test outcomes. You can vet the quality of the of the algorithms. It's not like you're putting machine learning out there in front of the customer where it could potentially do some real damage. There. They're vetting their burning in machine, learning in a environment that they control. >> Right, So So, Amy, Two solid days here. I think that this this conference has really grown when we first started here is about 130 people, I think. And now it was 500 registrants. This'd year. I think 600 is the sort of the goal for next year. Moving venues. The Cube has been covering this all but one year since 2013. Hope to continue to do that. Paul was great working with you. Um, always great work. I hope we can, uh we could do more together. We heard the verdict is bringing back its conference. You put that together. So we had column. Mahoney, um, had the vertical rock stars on which was fun. Com Mahoney, Mike Stone breaker uh, Andy Palmer and Chris Lynch all kind of weighed in, which was great to get their perspectives kind of the days of MPP and how that's evolved improving on traditional relational database. And and now you're Stone breaker. Applying all these m i. Same thing with that scale with Chris Lynch. So it's fun to tow. Watch those guys all Boston based East Coast folks some news. We just saw the news hit President Trump holding up jet icon contractors is we've talked about. We've been following that story very closely and I've got some concerns over that. It's I think it's largely because he doesn't like Bezos in The Washington Post Post. Exactly. You know, here's this you know, America first. The Pentagon says they need this to be competitive with China >> and a I. >> There's maybe some you know, where there's smoke. There's fire there, so >> it's more important to stick in >> the eye. That's what it seems like. So we're watching that story very closely. I think it's I think it's a bad move for the executive branch to be involved in those type of decisions. But you know what I know? Well, anyway, Paul awesome working with you guys. Thanks. And to appreciate you flying out, Sal. Good job, Alex Mike. Great. Already wrapping up. So thank you for watching. Go to silicon angle dot com for all the news. Youtube dot com slash silicon angles where we house our playlist. But the cube dot net is the main site where we have all the events. It will show you what's coming up next. We've got a bunch of stuff going on straight through the summer. And then, of course, VM World is the big kickoff for the fall season. Goto wicked bond dot com for all the research. We're out. Thanks for watching Dave. A lot day for Paul Gillon will see you next time.

Published Date : Aug 1 2019

SUMMARY :

Brought to you by in 2013 the CEO's that we talked to when we asked them what was their scope. And that was I mean, And Sarbanes Oxley saved the E. data models is a scale problem, and the only way you can solve that it's with with automation, We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, And the machine learning, I thought was interesting. We just saw the news hit President Trump holding up jet icon contractors There's maybe some you know, where there's smoke. And to appreciate you flying out, Sal.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Andy PalmerPERSON

0.99+

David DantePERSON

0.99+

Chris LynchPERSON

0.99+

ChrisPERSON

0.99+

2013DATE

0.99+

PaulPERSON

0.99+

Paul GillPERSON

0.99+

Mike StonePERSON

0.99+

2016DATE

0.99+

Paul GillonPERSON

0.99+

Mike Stone BreakerPERSON

0.99+

Silicon Angle MediaORGANIZATION

0.99+

2018DATE

0.99+

RosePERSON

0.99+

Alex MikePERSON

0.99+

BezosPERSON

0.99+

G s KORGANIZATION

0.99+

MahoneyPERSON

0.99+

BostonLOCATION

0.99+

KPMGORGANIZATION

0.99+

90%QUANTITY

0.99+

SalPERSON

0.99+

third pieceQUANTITY

0.99+

DavePERSON

0.99+

500 registrantsQUANTITY

0.99+

two daysQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

todayDATE

0.99+

next yearDATE

0.99+

Mark RamsayPERSON

0.99+

360QUANTITY

0.99+

this yearDATE

0.99+

MauraPERSON

0.99+

G S. K.ORGANIZATION

0.98+

YoutubeORGANIZATION

0.98+

AmyPERSON

0.98+

PentagonORGANIZATION

0.98+

C I. OsePERSON

0.98+

Sarbanes OxleyPERSON

0.97+

firstQUANTITY

0.97+

This yearDATE

0.96+

one yearQUANTITY

0.96+

Mike Stone breakerPERSON

0.95+

Enterprise Data WarehouseORGANIZATION

0.95+

DiosPERSON

0.94+

Two solid daysQUANTITY

0.94+

secondQUANTITY

0.94+

three yearsQUANTITY

0.92+

about 130 peopleQUANTITY

0.91+

600QUANTITY

0.9+

DukeORGANIZATION

0.89+

VM WorldEVENT

0.88+

dot comORGANIZATION

0.85+

ChinaORGANIZATION

0.84+

E. D. W.ORGANIZATION

0.83+

CubeORGANIZATION

0.8+

MITORGANIZATION

0.77+

East CoastLOCATION

0.75+

M I T.PERSON

0.75+

2019DATE

0.74+

President TrumpPERSON

0.71+

both endsQUANTITY

0.71+

threeQUANTITY

0.68+

M I T.EVENT

0.64+

cube dot netORGANIZATION

0.59+

ChiefPERSON

0.58+

The Washington Post PostTITLE

0.57+

AmericaORGANIZATION

0.56+

Goto wickedORGANIZATION

0.54+

CEOPERSON

0.54+

coupleQUANTITY

0.54+

CDOORGANIZATION

0.45+

StonePERSON

0.43+

CDOIQTITLE

0.24+

Keynote Analysis | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's The Cube! Covering MIT Chief Data Officer and Information Qualities Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome to Cambridge, Massachusetts everybody. You're watching The Cube, the leader in live tech coverage. My name is Dave Vellante and I'm here with my cohost Paul Gillin. And we're covering the 13th annual MIT CDOIQ conference. The Cube first started here in 2013 when the whole industry Paul, this segment of the industry was kind of moving out of the ashes of the compliance world and the data quality world and kind of that back office role, and it had this tailwind of the so called big data movement behind it. And the Chief Data Officer was emerging very strongly within as we've talked about many times in theCube, within highly regulated industries like financial services and government and healthcare and now we're seeing data professionals from all industries join this symposium at MIT as I say 13th year, and we're now seeing a lot of discussion about not only the role of the Chief Data Officer, but some of what we heard this morning from Mark Ramsey some of the failures along the way of all these north star data initiatives, and kind of what to do about it. So this conference brings together several hundred practitioners and we're going to be here for two days just unpacking all the discussions the major trends that touch on data. The data revolution, whether it's digital transformation, privacy, security, blockchain and the like. Now Paul, you've been involved in this conference for a number of years, and you've seen it evolve. You've seen that chief data officer role both emerge from the back office into a c-level executive role, and now spanning a very wide scope of responsibilities. Your thoughts? >> It's been like being part of a soap opera for the last eight years that I've been part of this conference because as you said Dave, we've gone through all of these transitions. In the early days this conference actually started as an information qualities symposium. It has evolved to become about chief data officer and really about the data as an asset to the organization. And I thought that the presentation we saw this morning, Mark Ramsey's talk, we're going to have him on later, very interesting about what they did at GlaxoSmithKline to get their arms around all of the data within that organization. Now a project like that would've unthinkable five years ago, but we've seen all of these new technologies come on board, essentially they've created a massive search engine for all of their data. We're seeing organizations beginning to get their arms around this massive problem. And along the way I say it's a soap opera because along the way we've seen failure after failure, we heard from Mark this morning that data governance is a failure too. That was news to me! All of these promising initiatives that have started and fallen flat or failed to live up to their potential, the chief data officer role has emerged out of that to finally try to get beyond these failures and really get their arms around that organizational data and it's a huge project, and it's something that we're beginning to see some organization succeed at. >> So let's talk a little bit about the role. So the chief data officer in many ways has taken a lot of the heat off the chief information officer, right? It used to be CIO stood for career is over. Well, when you throw all the data problems at an individual c-level executive, that really is a huge challenge. And so, with the cloud it's created opportunities for CIOs to actually unburden themselves of some of the crapplications and actually focus on some of the mission critical stuff that they've always been really strong at and focus their budgets there. But the chief data officer has had somewhat of an unclear scope. Different organizations have different roles and responsibilities. And there's overlap with the chief digital officer. There's a lot of emphasis on monetization whether that's increasing revenue or cutting costs. And as we heard today from the keynote speaker Mark Ramsey, a lot of the data initiatives have failed. So what's your take on that role and its viability and its longterm staying power? >> I think it's coming together. I think last year we saw the first evidence of that. I talked to a number of CDOs last year as well as some of the analysts who were at this conference, and there was pretty good clarity beginning to emerge about what they chief data officer role stood for. I think a lot of what has driven this is this digital transformation, the hot buzz word of 2019. The foundation of digital transformation is a data oriented culture. It's structuring the entire organization around data, and when you get to that point when an organization is ready to do that, then the role of the CDO I think becomes crystal clear. It's not so much just an extract transform load discipline. It's not just technology, it's not just governance. It really is getting that data, pulling that data together and putting it at the center of the organization. That's the value that the CDO can provide, I think organizations are coming around to that. >> Yeah and so we've seen over the last 10 years the decrease, the rapid decrease in cost, the cost of storage. Microprocessor performance we've talked about endlessly. And now you've got the machine intelligence piece layering in. In the early days Hadoop was the hot tech, and interesting now nobody talks even about Hadoop. Rarely. >> Yet it was discussed this morning. >> It was mentioned today. It is a fundamental component of infrastructures. >> Yeah. >> But what it did is it dramatically lowered the cost of storing data, and allowing people to leave data in place. The old adage of ship a five megabytes of code to a petabyte of data versus the reverse. Although we did hear today from Mark Ramsey that they copied all the data into a centralized location so I got some questions on that. But the point I want to make is that was really early days. We're now entered an era and it's underscored by if you look at the top five companies in terms of market cap in the US stock market, obviously Microsoft is now over a trillion. Microsoft, Apple, Amazon, Google and Facebook. Top five. They're data companies, their assets are all data driven. They've surpassed the banks, the energy companies, of course any manufacturing automobile companies, et cetera, et cetera. So they're data companies, and they're wrestling with big issues around security. You can't help but open the paper and see issues on security. Yesterday was the big Capital One. The Equifax issue was resolved in terms of the settlement this week, et cetera, et cetera. Facebook struggling mightily with whether or not how to deal fake news, how to deal with deep fakes. Recently it shut down likes for many Instagram accounts in some countries because they're trying to protect young people who are addicted to this. Well, they need to shut down likes for business accounts. So what kids are doing is they're moving over to the business Instagram accounts. Well when that happens, it exposes their emails automatically so they've all kinds of privacy landmines and people don't know how to deal with them. So this data explosion, while there's a lot of energy and excitement around it, brings together a lot of really sticky issues. And that falls right in the lap of the chief data officer, doesn't it? >> We're in uncharted territory and all of the examples you used are problems that we couldn't have foreseen, those companies couldn't have foreseen. A problem may be created but then the person who suffers from that problem changes their behavior and it creates new problems as you point out with kids shifting where they're going to communicate with each other. So these are all uncharted waters and I think it's got to be scary if you're a company that does have large amounts of consumer data in particular, consumer packaged goods companies for example, you're looking at what's happening to these big companies and these data breaches and you know that you're sitting on a lot of customer data yourself, and that's scary. So we may see some backlash to this from companies that were all bought in to the idea of the 360 degree customer view and having these robust data sources about each one of your customers. Turns out now that that's kind of a dangerous place to be. But to your point, these are data companies, the companies that business people look up to now, that they emulate, are companies that have data at their core. And that's not going to change, and that's certainly got to be good for the role of the CDO. >> I've often said that the enterprise data warehouse failed to live up to its expectations and its promises. And Sarbanes-Oxley basically saved EDW because reporting became a critical component post Enron. Mark Ramsey talked today about EDW failing, master data management failing as kind of a mapping and masking exercise. The enterprise data model which was a top down push for a sort of distraction layer, that failed. You had all these failures and so we turned to governance. That failed. And so you've had this series of issues. >> Let me just point out, what do all those have in common? They're all top down. >> Right. >> All top down initiatives. And what Glaxo did is turn that model on its head and left the data where it was. Went and discovered it and figured it out without actually messing with the data. That may be the difference that changes the game. >> Yeah and it's prescription was basically taking a tactical approach to that problem, start small, get quick hits. And then I think they selected a workload that was appropriate for solving this problem which was clinical trials. And I have some questions for him. And of the big things that struck me is the edge. So as you see a new emerging data coming out of the edge, how are organizations going to deal with that? Because I think a lot of what he was talking about was a lot of legacy on-prem systems and data. Think about JEDI, a story we've been following on SiliconANGLE the joint enterprise defense infrastructure. This is all about the DOD basically becoming cloud enabled. So getting data out into the field during wartime fast. We're talking about satellite data, you're talking about telemetry, analytics, AI data. A lot of distributed data at the edge bringing new challenges to how organizations are going to deal with data problems. It's a whole new realm of complexity. >> And you talk about security issues. When you have a lot of data at the edge and you're sending data to the edge, you're bringing it back in from the edge, every device in the middle is from the smart thermostat. at the edge all the way up to the cloud is a potential failure point, a potential vulnerability point. These are uncharted waters, right? We haven't had to do this on a large scale. Organizations like the DOD are going to be the ones that are going to be the leaders in figuring this out because they are so aggressive. They have such an aggressive infrastructure and place. >> The other question I had, striking question listening to Mark Ramsey this morning. Again Mark Ramsey was former data God at GSK, GlaxoSmithKline now a consultant. We're going to hear from a number of folks like him and chief data officers. But he basically kind of poopooed, he used the example of build it and they will come. You know the Kevin Costner movie Field of Dreams. Don't go after the field of dreams. So my question is, and I wonder if you can weigh in on this is, everywhere we go we hear about digital transformation. They have these big digital transformation projects, they generally are top down. Every CEO wants to get digital right. Is that the wrong approach? I want to ask Mark Ramsey that. Are they doing field of dreams type stuff? Is it going to be yet another failure of traditional legacy systems to try to compete with cloud native and born in data era companies? >> Well he mentioned this morning that the research is already showing that digital transformation most initiatives are failing. Largely because of cultural reasons not technical reasons, and I think Ramsey underscored that point this morning. It's interesting that he led off by mentioning business process reengineering which you remember was a big fad in the 1990s, companies threw billions of dollars at trying to reinvent themselves and most of them failed. Is digital transformation headed down the same path? I think so. And not because the technology isn't there, it's because creating a culture where you can break down these silos and you can get everyone oriented around a single view of the organizations data. The bigger the organization the less likely that is to happen. So what does that mean for the CDO? Well, chief information officer at one point we said the CIO stood for career is over. I wonder if there'll be a corresponding analogy for the CDOs at some of these big organizations when it becomes obvious that pulling all that data together is just not feasible. It sounds like they've done something remarkable at GSK, maybe we'll learn from that example. But not all organizations have the executive support, which was critical to what they did, or just the organizational will to organize themselves around that central data storm. >> And I also said before I think the CDO is taking a lot of heat off the CIO and again my inference was the GSK use case and workload was actually quite narrow in clinical trials and was well suited to success. So my takeaway in this, if I were CDO what I would be doing is trying to figure out okay how does data contribute to the monetization of my organization? Maybe not directly selling the data, but what data do I have that's valuable and how can I monetize that in terms of either saving money, supply chain, logistics, et cetera, et cetera, or making money? Some kind of new revenue opportunity. And I would super glue myself for the line of business executive and go after a small hit. You're talking about digital transformations being top down and largely failing. Shadow digital transformations is maybe the answer to that. Aligning with a line of business, focusing on a very narrow use case, and building successes up that way using data as the ingredient to drive value. >> And big ideas. I recently wrote about Experian which launched a service last called Boost that enables the consumers to actually impact their own credit scores by giving Experian access to their bank accounts to see that they are at better credit risk than maybe portrayed in the credit store. And something like 600,000 people signed up in the first six months of this service. That's an example I think of using inspiration, creating new ideas about how data can be applied And in the process by the way, Experian gains data that they can use in other context to better understand their consumer customers. >> So digital meets data. Data is not the new oil, data is more valuable than oil because you can use it multiple times. The same data can be put in your car or in your house. >> Wish we could do that with the oil. >> You can't do that with oil. So what does that mean? That means it creates more data, more complexity, more security risks, more privacy risks, more compliance complexity, but yet at the same time more opportunities. So we'll be breaking that down all day, Paul and myself. Two days of coverage here at MIT, hashtag MITCDOIQ. You're watching The Cube, we'll be right back right after this short break. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

and Information Qualities Symposium 2019. and the data quality world and really about the data as an asset to the organization. and actually focus on some of the mission critical stuff and putting it at the center of the organization. In the early days Hadoop was the hot tech, It is a fundamental component of infrastructures. And that falls right in the lap of and all of the examples you used I've often said that the enterprise data warehouse what do all those have in common? and left the data where it was. And of the big things that struck me is the edge. Organizations like the DOD are going to be the ones Is that the wrong approach? the less likely that is to happen. and how can I monetize that in terms of either saving money, that enables the consumers to actually Data is not the new oil, You can't do that with oil.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Mark RamseyPERSON

0.99+

MicrosoftORGANIZATION

0.99+

Dave VellantePERSON

0.99+

AmazonORGANIZATION

0.99+

PaulPERSON

0.99+

AppleORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

Paul GillinPERSON

0.99+

GoogleORGANIZATION

0.99+

2013DATE

0.99+

RamseyPERSON

0.99+

Kevin CostnerPERSON

0.99+

EnronORGANIZATION

0.99+

last yearDATE

0.99+

DODORGANIZATION

0.99+

ExperianORGANIZATION

0.99+

2019DATE

0.99+

GlaxoSmithKlineORGANIZATION

0.99+

DavePERSON

0.99+

GSKORGANIZATION

0.99+

GlaxoORGANIZATION

0.99+

Two daysQUANTITY

0.99+

five megabytesQUANTITY

0.99+

360 degreeQUANTITY

0.99+

two daysQUANTITY

0.99+

todayDATE

0.99+

Cambridge, MassachusettsLOCATION

0.99+

Field of DreamsTITLE

0.99+

billions of dollarsQUANTITY

0.99+

MarkPERSON

0.99+

EquifaxORGANIZATION

0.99+

YesterdayDATE

0.99+

over a trillionQUANTITY

0.99+

1990sDATE

0.98+

600,000 peopleQUANTITY

0.98+

USLOCATION

0.98+

this weekDATE

0.98+

SiliconANGLE MediaORGANIZATION

0.98+

first six monthsQUANTITY

0.98+

InstagramORGANIZATION

0.98+

The CubeTITLE

0.98+

five years agoDATE

0.97+

Capital OneORGANIZATION

0.96+

first evidenceQUANTITY

0.96+

bothQUANTITY

0.96+

firstQUANTITY

0.95+

MITORGANIZATION

0.93+

this morningDATE

0.91+

HadoopTITLE

0.88+

one pointQUANTITY

0.87+

13th yearQUANTITY

0.86+

MIT CDOIQ conferenceEVENT

0.84+

MITCDOIQTITLE

0.84+

each oneQUANTITY

0.82+

hundred practitionersQUANTITY

0.82+

EDWORGANIZATION

0.81+

last eight yearsDATE

0.81+

MIT Chief Data Officer andEVENT

0.81+

Sarbanes-OxleyPERSON

0.8+

top five companiesQUANTITY

0.78+

The CubeORGANIZATION

0.75+

Top fiveQUANTITY

0.74+

single viewQUANTITY

0.7+

last 10 yearsDATE

0.69+

BoostTITLE

0.68+

a petabyte of dataQUANTITY

0.65+

EDWTITLE

0.64+

SiliconANGLEORGANIZATION

0.64+

Action Item | Big Data SV Preview Show - Feb 2018


 

>> Hi, I'm Peter Burris and once again, welcome to a Wikibon Action Item. (lively electronic music) We are again broadcasting from the beautiful theCUBE Studios here in Palo Alto, California, and we're joined today by a relatively larger group. So, let me take everybody through who's here in the studio with us. David Floyer, George Gilbert, once again, we've been joined by John Furrier, who's one of the key CUBE hosts, and on the remote system is Jim Kobielus, Neil Raden, and another CUBE host, Dave Vellante. Hey guys. >> Hi there. >> Good to be here. >> Hey. >> So, one of the things we're, one of the reasons why we have a little bit larger group here is because we're going to be talking about a community gathering that's taking place in the big data universe in a couple of weeks. Large numbers of big data professionals are going to be descending upon Strata for the purposes of better understanding what's going on within the big data universe. Now we have run a CUBE show next to that event, in which we get the best thought leaders that are possible at Strata, bring them in onto theCUBE, and really to help separate the signal from the noise that Strata has historically represented. We want to use this show to preview what we think that signal's going to be, so that we can help the community better understand what to look for, where to go, what kinds of things to be talking about with each other so that it can get more out of that important event. Now, George, with that in mind, what are kind of the top level thing? If it was one thing that we'd identify as something that was different two years ago or a year ago, and it's going to be different from this show, what would we say it would be? >> Well, I think the big realization that's here is that we're starting with the end in mind. We know the modern operational analytic applications that we want to build, that anticipate or influence a user interaction or inform or automate a business transaction. And for several years, we were experimenting with big data infrastructure, but that was, it wasn't solution-centric, it was technology-centric. And we kind of realized that the do it yourself, assemble your own kit, opensource big data infrastructure created too big a burden on admins. Now we're at the point where we're beginning to see a more converged set of offerings take place. And by converged, I mean an end to end analytic pipeline that is uniform for developers, uniform for admins, and because it's pre-integrated, is lower latency. It helps you put more data through one single analytic latency budget. That's what we think people should look for. Right now, though, the hottest new tech-centric activity is around Machine Learning, and I think the big thing we have to do is recognize that we're sort of at the same maturity level as we were with big data several years ago. And people should, if they're going to work with it, start with the knowledge, for the most part, that they're going to be experimenting, 'cause the tooling isn't quite mature enough, we don't have enough data scientists for people to be building all these pipelines bespoke. And the third-party applications, we don't have a high volume of them where this is embedded yet. >> So if I can kind of summarize what you're saying, we're seeing bifurcation occur within the ecosystem associated with big data that's driving toward simplification on the infrastructure side, which increasingly is being associated with the term big data, and new technologies that can apply that infrastructure and that data to new applications, including things like AI, ML, DL, where we think about modeling and services, and a new way of building value. Now that suggests that one or the other is more or less hot, but Neil Raden, I think the practical reality is that here in Silicon Valley, we got to be careful about getting too far out in front of our skis. At the end of the day, there's still a lot of work to be done inside how you simply do things like move data from one place to the other in a lot of big enterprises. Would you agree with that? >> Oh absolutely. I've been talking to a lot clients this week and, you know, we don't talk about the fact that they're still running their business on what we would call legacy systems, and they don't know how to, you know, get out of them or transform from them. So they're still starting to plan for this, but the problem is, you know, it's like talking about the 27 rocket engines on the whatever it was that he launched into space, launching a Tesla into space. But you can talk about the engineering of those engines and that's great, but what about all the other things you're going to have to do to get that (laughs) car into space? And it's the same thing. A year ago, we were talking about Hadoop and big data and, to a certain extent, Machine Learning, maybe more data science. But now people are really starting to say, How do we actually do this, how do we secure it, how do we govern it, how do we get some sort of metadata or semantics on the data we're working with so people know what they're using. I think that's where we are in a lot of companies. >> Great, so that's great feedback, Neil. So as we look forward, Jim Kobielus, the challenges associated with what it means to better improve the facilities of your infrastructure, but also use that as a basis for increasing your capability on some of the new applications services, what are we looking for, what should folks be looking for as they explore the show in the next couple of weeks on the ML side? What new technologies, what new approaches? Going back to what George said, we're in experimentation mode. What are going to be the experiments that are going to generate greatest results over the course of the next year? >> Yeah, for the data scientists, who flock to Strata and similar conferences, automation of the Machine Learning pipeline is super hot in terms of investments by the solution providers. Everybody from Google to IBM to AWS, and others, are investing very heavily in automation of, not just the data engine, that problem's been had a long time ago. It's automation of more of the feature engineering and the trending. These very manual, often labor intensive, jobs have to be sped up and automated to a great degree to enable the magic of productivity by the data scientists in the new generation of app developers. So look for automation of Machine Learning to be a super hot focus. Related to that is, look for a new generation of development suites that focus on DevOps, speeding the Machine Learning in DL and AI from modeling through training and evaluation deployment in iteration. We've seen a fair upswing in the number of such toolkits on the market from a variety of startup vendors, like the DataRobots of the world. But also coming to say, AWS with SageMaker, for example, that's hot. Also, look for development toolkits that automate more of the cogeneration, you know, a low-code tools, but the new generation of low-code tools, as highlighted in a recent Wikibons study, use ML to drive more of the actual production of fairly decent, good enough code, as a first rough prototype for a broad range of applications. And finally we're seeing a fair amount of ML-generated code generation inside of things like robotic process automation, RPA, which I believe will probably be a super hot theme at Strata and other shows this year going forward. So there's a, you mentioned the idea of better tooling for DevOps and the relationship between big data and ML, and what not, and DevOps. One of the key things that we've been seeing over the course of the last few years, and it's consistent with the trends that we're talking about, is increasing specialization in a lot of the perspectives associated with changes within this marketplace, so we've seen other shows that have emerged that have been very, very important, that we, for example, are participating in. Places like Splunk, for example, that is the vanguard, in many respects, of a lot of these trends in big data and how big data can applied to business problems. Dave Vellante, I know you've been associated with a number of, participating in these shows, how does this notion of specialization inform what's going to happen in San Jose, and what kind of advice and counsel should we tell people to continue to explore beyond just what's going to happen in San Jose in a couple weeks? >> Well, you mentioned Splunk as an example, a very sort of narrow and specialized company that solves a particular problem and has a very enthusiastic ecosystem and customer base around that problem. LAN files to solve security problems, for example. I would say Tableau is another example, you know, heavily focused on Viz. So what you're seeing is these specialized skillsets that go deep within a particular domain. I think the thing to think about, especially when we're in San Jose next week, is as we talk about digital disruption, what are the skillsets required beyond just the domain expertise. So you're sort of seeing this bifurcated skillsets really coming into vogue, where if somebody understands, for example, traditional marketing, but they also need to understand digital marketing in great depth, and the skills that go around it, so there's sort of a two-tool player. We talk about five-tool player in baseball. At least a multidimensional skillset in digital. >> And that's likely to occur not just in a place like marketing, but across the board. David Floyer, as folks go to the show and start to look more specifically about this notion of convergence, are there particular things that they should think about that, to come back to the notion of, well, you know, hardware is going to make things more or less difficult for what the software can do, and software is going to be created that will fill up the capabilities of hardware. What are some of the underlying hardware realities that folks going to the show need to keep in mind as they evaluate, especially the infrastructure side, these different infrastructure technologies that are getting more specialized? >> Well, if we look historically at the big data area, the solution has been to put in very low cost equipment as nodes, lots of different nodes, and move the data to those nodes so that you get a parallelization of the, of the data handling. That is not the only way of doing it. There are good ways now where you can, in fact, have a single version of that data in one place in very high speed storage, on flash storage, for example, and where you can allow very fast communication from all of the nodes directly to that data. And that makes things a lot simpler from an operational point of view. So using current Batch Automation techniques that are in existence, and looking at those from a new perspective, which is I do IUs apply these to big data, how do I automate these things, can make a huge difference in just the practicality in the elapsed time for some of these large training things, for example. >> Yeah, I was going to say that to many respects, what you're talking about is bringing things like training under a more traditional >> David: Operational, yeah. >> approach and operational set of disciplines. >> David: Yes, that's right. >> Very, very important. So John Furrier, I want to come back to you, or I want to come to you, and say that there are some other technologies that, while they're the bright shiny objects and people think that they're going to be the new kind of Harry Potter technologies of magic everywhere, Blockchain is certainly going to become folded into this big data concept, because Blockchain describes how contracts, ownership, authority ultimately get distributed. What should folks look for as the, as Blockchain starts to become part of these conversations? >> That's a good point, Peter. My summary of the preview for BigData SV Silicon Valley, which includes the Strata show, is two things: Blockchain points to the future and GDPR points to the present. GDPR is probably the most, one of the most fundamental impacts to the big data market in a long time. People have been working on it for a year. It is a nightmare. The technical underpinnings of what companies have to do to comply with GDPR is a moving train, and it's complete BS. There's no real solutions out there, so if I was going to tell everyone to think about that and what to look for: What is happening with GDPR, what's the impact of the databases, what's the impact of the architectures? Everyone is faking it 'til they make it. No one really has anything, in my opinion from what I can see, so it's a technical nightmare. Where was that database? So it's going to impact how you store the data, and the sovereignty issue is another issue. So the Blockchain then points to the sovereignty issue of the data, both in terms of the company, the country, and the user. These things are going to impact software development, application development, and, ultimately, cloud choice and the IoT. So to me, GDPR is not just a one and done thing and Blockchain is kind of a future thing to look at. So I would look out of those two lenses and say, Do you have a direction or a narrative that supports me today with what GDPR will impact throughout the organization. And then, what's going on with this new decentralized infrastructure and the role of data, and the sovereignty of that data, with respect to company, country, and user. So to me, that's the big issue. >> So George Gilbert, if we think about this question of these fundamental technologies that are going to become increasingly important here, database managers are not dead as a technology. We've seen a relative explosion over the last few years in at least invention, even if it hasn't been followed with, as Neil talked about, very practical ways of bringing new types of disciplines into a lot of enterprises. What's going to happen with the database world, and what should people be looking for in a couple of weeks to better understand how some of these data management technologies are going to converge and, or involve? >> It's a topic that will be of intense interest and relevance to IT professionals, because it's become the common foundation of all modern apps. But I think what we can do is we can see, for instance, a leading indicator of what's going to happen with the legacy vendors, where we have in-memory technologies from both transaction processing and analytics, and we have more advanced analytics embedded in the database engine, including Machine Learning, the model training, as well as model serving. But the, what happened in the big data community is that we disassembled the DBMS into the data manipulation language, which is an analytic language, like, could be Spark, could be Flink, even Hive. We had the Catalog, which I think Jim has talked about or will be talking about, where we're not looking, it's not just a dictionary of what's in one DBMS, but it's a whole way of tracking and governing data across many stores. And then there's the Storage Manager, could be the file system, an object store, could be just something like Kudu, which is a MPP way of, in parallel, performing a bunch of operations on data that's stored. The reason I bring all this up is, following on David's comment about the evolution of hardware, databases are fundamentally meant to expose capabilities in the hardware and to mediate access to data, using these hardware capabilities. And now that we have this, what's emerging as this unigrid, with memory-intensive architectures and super low latency to get from any point or node on that cluster to any other node, like with only a five microsecond lag, relative to previous architectures. We can now build databases that scale up with the same knowledge base that we built databases... I'm sorry, that scale out, that we used to build databases that scale up. In other words, it democratizes the ability to build databases of enormous scale, and that means that we can have analytics and the transactions working together at very low latency. >> Without binding them. Alright, so I think it's time for the action items. We got a lot to do, so guys, keep it really tight, really simple. David Floyer, let me start with you. Action item. >> So action item on big data should be focus on technologies that are going to reduce the elapse time of solutions in the data center, and those are many and many of them, but it's a production problem, it's becoming a production problem, treat it as a production problem, and put it in the fundamental procedures and technologies to succeed. >> And look for vendors >> Who can do that, yes. >> that do that. George Gilbert, action item. >> So I talked about convergence before. The converged platform now is shifting, it's center of gravity is shifting to continuous processing, where the data lake is a reference data repository that helps inform the creation of models, but then you run the models against the streaming continuous data for the freshest insights-- >> Okay, Jim Kobielus, action item. >> Yeah, focus on developer productivity in this new era of big data analytics. Specifically focus on the next generation of developers, who are data scientists, and specifically focus on automating most of what they do, so they can focus on solving problems and sifting through data. Put all the grunt work or training, and all that stuff, take and carry it by the infrastructure, the tooling. >> Peter: Neil Raden, action item. >> Well, one thing I learned this week is that everything we're talking about is about the analytical problem, which is how do you make better decisions and take action? But companies still run on transactions, and it seems like we're running on two different tracks and no one's talking about the transactions anymore. We're like the tail wagging the dog. >> Okay, John Furrier, action item. >> Action item is dig into GDPR. It is a really big issue. If you're not proactive, it could be a nightmare. It's going to have implications that are going to be far-reaching in the technical infrastructure, and it's the Sarbanes-Oxley, what they did for public companies, this is going to be a nightmare. And evaluate the impact of Blockchains. Two things. >> David Vellante, action item. >> So we often say that digital is data, and just because your industry hasn't been upended by digital transformations, don't think it's not coming. So it's maybe comfortable to sit back and say, Well, we're going to wait and see. Don't sit back and wait and see. All industries are susceptible to digital transformation. >> Alright, so I'll give the action item for the team. We've talked a lot about what to look for in the community gathering that's taking place next week in Silicon Valley around strata. Our observations as the community, it descends upon us, and what to look for is, number one, we're seeing a bifurcation in the marketplace, in the thought leadership, and in the tooling. One set of group, one group is going more after the infrastructure, where it's focused more on simplification, convergence; another group is going more after the developer, AI, ML, where it's focused more on how to create models, training those models, and building applications with the services associated with those models. Look for that. Don't, you know, be careful about vendors who say that they do it all. Be careful about vendors that say that they don't have to participate in a converged approach to doing this. The second thing I think we need to look for, very importantly, is that the role of data is evolving, and data is becoming an asset. And the tooling for driving velocity of data through systems and applications is going to become increasingly important, and the discipline that is necessary to ensure that the business can successfully do that with a high degree of predictability, bringing new production systems are also very important. A third area that we take a look at is that, ultimately, the impact of this notion of data as an asset is going to really come home to roost in 2018 through things like GDPR. As you scan the show, ask a simple question: Who here is going to help me get up to compliance and sustain compliance, as the understanding of privacy, ownership, etc. of data, in a big data context, starts to evolve, because there's going to be a lot of specialization over the next few years. And there's a final one that we might add: When you go to the show, do not just focus on your favorite brands. There's a lot of new technology out there, including things like Blockchain. They're going to have an enormous impact, ultimately, on how this marketplace unfolds. The kind of miasma that's occurred in big data is starting to specialize, it's starting to break down, and that's creating new niches and new opportunities for new sources of technology, while at the same time, reducing the focus that we currently have on things like Hadoop as a centerpiece. A lot of convergence is going to create a lot of new niches, and that's going to require new partnerships, new practices, new business models. Once again, guys, I want to thank you very much for joining me on Action Item today. This is Peter Burris from our beautiful Palo Alto theCUBE Studio. This has been Action Item. (lively electronic music)

Published Date : Feb 24 2018

SUMMARY :

We are again broadcasting from the beautiful and it's going to be different from this show, And the third-party applications, we don't have Now that suggests that one or the other is more or less hot, but the problem is, you know, it's like talking about the What are going to be the experiments that are going to in a lot of the perspectives associated with I think the thing to think about, that folks going to the show need to keep in mind and move the data to those nodes and people think that they're going to be So the Blockchain then points to the sovereignty issue What's going to happen with the database world, in the hardware and to mediate access to data, We got a lot to do, so guys, focus on technologies that are going to that do that. that helps inform the creation of models, Specifically focus on the next generation of developers, and no one's talking about the transactions anymore. and it's the Sarbanes-Oxley, So it's maybe comfortable to sit back and say, and sustain compliance, as the understanding of privacy,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

Jim KobielusPERSON

0.99+

GeorgePERSON

0.99+

David FloyerPERSON

0.99+

George GilbertPERSON

0.99+

Dave VellantePERSON

0.99+

Neil RadenPERSON

0.99+

NeilPERSON

0.99+

Peter BurrisPERSON

0.99+

David VellantePERSON

0.99+

IBMORGANIZATION

0.99+

San JoseLOCATION

0.99+

John FurrierPERSON

0.99+

PeterPERSON

0.99+

Feb 2018DATE

0.99+

Silicon ValleyLOCATION

0.99+

JimPERSON

0.99+

AWSORGANIZATION

0.99+

2018DATE

0.99+

GoogleORGANIZATION

0.99+

GDPRTITLE

0.99+

next weekDATE

0.99+

two thingsQUANTITY

0.99+

Palo Alto, CaliforniaLOCATION

0.99+

SplunkORGANIZATION

0.99+

bothQUANTITY

0.99+

A year agoDATE

0.99+

two lensesQUANTITY

0.99+

a year agoDATE

0.99+

two years agoDATE

0.99+

this weekDATE

0.99+

Palo AltoLOCATION

0.99+

firstQUANTITY

0.99+

third areaQUANTITY

0.98+

CUBEORGANIZATION

0.98+

one groupQUANTITY

0.98+

second thingQUANTITY

0.98+

27 rocketQUANTITY

0.98+

todayDATE

0.98+

next yearDATE

0.98+

Two thingsQUANTITY

0.97+

theCUBE StudiosORGANIZATION

0.97+

two-tool playerQUANTITY

0.97+

five microsecondQUANTITY

0.96+

One setQUANTITY

0.96+

TableauORGANIZATION

0.94+

a yearQUANTITY

0.94+

single versionQUANTITY

0.94+

oneQUANTITY

0.94+

WikibonsORGANIZATION

0.91+

WikibonORGANIZATION

0.91+

two different tracksQUANTITY

0.91+

five-tool playerQUANTITY

0.9+

several years agoDATE

0.9+

this yearDATE

0.9+

StrataTITLE

0.87+

Harry PotterPERSON

0.85+

one thingQUANTITY

0.84+

yearsDATE

0.83+

one placeQUANTITY

0.82+

Jeremy Almond, PayStand | CUBE Conversation, Feb 2018


 

(orchestral string music) >> Welcome to this special Cube Conversation here in our Palo Alto studios, the Cube office here. I'm John Furrier, the co-founder of SiliconAngle Media, and also the co-host of the Cube. Our next guest is Jeremy Almond who is the CEO of PayStand, a hot startup doing some really new things in and around Blockchain, decentralized, and really targeting the B to B space on a really compelling and an interesting topic that a lot of people are interested in. Jeremy, welcome to this Cube Conversation. >> Awesome, thank you John. >> John: Hey, so tell me a little bit about the company and set the table for us...Paystand, what you guys are doing, why you were founded, and what's the disruptive enabler you guys are taking? What's the angle of your business? >> Jeremy: Yeah, sure...so Paystand, like you mentioned, is a B to B software platform specifically focused on payment. So you can imagine what PayPal or Venmo does from the consumer level. We do for complicated commercial transactions between accounts receivable and accounts payable departments that normally would be paying with paper checks in a manual process. >> John: So basically, accounting, ledger, I'm kind of guessing...nice fit for Blockchain... >> Correct, yeah, yeah. So what we do is we apply Blockchain technology to help a company speed up their time-to-cash, automate their business process, and dramatically lower their transaction costs. >> I'll get your thoughts on this...I interviewed Don Tapscott at an event and we were riffing on this notion of the nature of the firm, right? People would come to an office, you'd have accounting, all these things that you'd have to put in place of systems. Now with this decentralized world we're living in, internet, and with Blockchain in particular, and a crypto-currency market that's pretty frothy but, you know, you look at Blockchain and separate those two for a minute, you really can look at ways to change how work is organized. How do you guys view that? I mean, it's obviously a new, big wave coming. Then you got businesses who are just trying to operate and make money, right? Keep the lights on, but they almost have to start rethinking about the future. So, what is this block wave...Blockchain wave coming? How do you talk about that? Is it that disruptive? I mean, certainly centralized databases aren't going away anytime soon, but it's coming. What's your thoughts in reaction to that? >> It's coming, you know...I think it's... It will affect the enterprise which is where we spend our time and space, in a lot of ways like Cloud did, right? So I've spent probably 15 years doing un-sexy B to B tech, in some way, shape, or form. And what we've seen is digital transformation in the enterprise has happened in a few key areas. CRM is now in the Cloud, right? You have companies like SalesForce that have become significant. ERP is now in the Cloud, your financial software is now automated, right? Kind of ironically, the last mile piece, that part that lubricates the business, the core of the business, the money-movement piece, is actually still really, really manual. So, you have humans that sit around and they take an invoice and then a paper check and then they move it, and that process is very, very ineffecient. And so, having a more automatic, smart financial system can improve the business's life in really significant ways. >> Also, you know, one of the things we've been commenting on and opining here on the Cube is... I made a statement a couple weeks ago, "Oh, MarTech"...you know, marketing technology wave, all those logos on those landscape slides, "didn't really pan out 'cause the Cloud kind of changes that." I mean, it's panning out, but not the way people thought. FinTech...financial tech...is also certainly important. Banks, subsidy trading, you see that. What is the inhibitor for these new trends? Because you mentioned they're moving paper around. I mean, it's money, they probably don't want to mess with an operational system that's core to their business. Is it fear? Is it tech? Both? What's your view on why it's taking so long? Or is it moving along at a speed you think it's going to... Be adopted? >> Jeremy: Yeah, it's actually kind of a unique point in time right now. I think on one hand, financial services in general, part of their job is to manage risk, right? And so they're going to be a lagging, in some ways, industry. And so, digital transformation, right? The internet has opened up and democratized media. It's opened up so many other areas. Blockchain now is the entry point for digital transformation of financial services, and so the time is probably right, right now. We've been in the space...we started the company in 2014. And, you know, I've seen over the last three years, hearing banks, other large institutions, large enterprises, go from skepticism to curiosity. >> John: What's the technology stack look like? Obviously four years is, like, decades in the Blockchain world, and obviously, people are running as fast as they can. It's kind of a moving train at many levels. Business model side as well as a tech stack. And this is really the opportunity. A lot of these systems... I mean, some of the e-commerce systems are 20-year-old tech stacks, some are even older. >> Jeremy: Yeah. >> Just going back four years, since you were founded... What's the big moving glacier, if you will, of change and how are you guys managing that? How should people think about managing the risk of the tech stack? >> Jeremy: Yeah, I mean I think...you know... On the Blockchain-specific side... in the early days, a lot of it was about currency, and actual payment, right? I think what we're seeing now is the opportunity for Blockchain, particularly in the enterprise, to actually dramatically improve their operations side, right? Ethereum, private Blockchains... actually have the ability to not just decentralize how money movement or networks operate, but how an internal system operates. I'll give you an example...we used the Blockchain to... A private Blockchain to actually control approval workflow. So when a payment goes out, oftentimes you need your accounts payable person to send a payment out, but the controller or the treasury or someone else has to sign off on it, right? So that signature, you need it to be valid, trusted, the identity around it, right? And you want an audit record. And so Blockchain's a really, really good use case for something like that. That's not peer-play payments, it's not peer-play settlement. It doesn't require, you know, a million people to get on. It just can operate in the business in a really critical function, in a better way than the current technology does. >> John: It's interesting, I love these new technology opportunities 'cause... There's always going to be a tipping point and the famous Steve Jobs quote is, "Hey if I was asked to build a better "phone in 2005, I would have built an excellent... "better Blackberry." But he...then he built the iPhone, so he thought differently. No one was really asking for the iPhone. The question I get a lot from skeptics in Blockchain is, "No one's really asking for Blockchain." So, again, this is kind of like...you could always say, "I'm building a better centralized database system "in a distributed computing environment." Okay, we've done that. >> Yeah. >> Are people asking for Blockchain, or are they just asking for it in a different way? What's your thoughts to that? >> Yeah, I would say that there's... There's a big picture question of, "Are people asking for it." And I'd say society's actually asking for it. Part of my personal story is, you know, my family, blue collar family, they... My mother's side immigrated here, her generation. My brick-layer father, they spent their entire lives getting their first home. And you know, 800 square foot home, that's nothing special, but it was their American dream. In 2008, in the financial crisis, they lost the house. And so I think, you know, society said, "Financial services and core parts of our economy "actually could...we could do better, right?" And I think the magic thing about technology is we get to imagine the world not as it is but as it ought to be. So one, I think society is actually asking for... Can the core parts of our economy actually do better? Can we dream up something better? And I think that's the purest part of what the folks in the Blockchain movement are trying to do. That's, you know, at a very high level. And then I think, practically, right, for businesses like we operate day in and day out...you know... If there's technology that allows them to be able to operate their business more efficiently, drop their costs and grow faster, you know... How would that work, right? It's in some ways like Cloud. How does Cloud work? You know, I think... now we're really getting into the deep mess of it, but you know, Cloud was transformative to the business, right? VOIP was transformative to some businesses. Inbound marketing was transformative to some businesses. Blockchain is the same kind of concept. >> I mean, and Cloud, too...there was a lot of naysayers. I remember I used the first EC2 instances of Amazon when it came out, being an entrepreneur, I'm like, "I don't have to provision servers? "This is amazing, I can put my credit card down "and pay a few bucks..." And then even still, up until, I would say, even three or four years ago they were dismissed as relevant. >> Jeremy: Yeah. >> And again, the rest is history, look what they've done. So there's always going to be those naysayers. But to the point about Cloud and Blockchains, and even crypto, this is a wave, and we've, you know... We're very bullish on this movement because we see the wave coming way out there and it's huge. This is probably bigger than the other waves combined, in our opinion. So you mentioned societal change. This is a big deal. I mean, you're seeing regulations right now in GDPR in Europe, kind of trying to govern an old database market that's...it's a mess, database wise. But it makes sense from a society standpoint. People want to pull their data out. This is a trend. You got societal forces, and then technical legacy. I mean, this could be an opportunity for Blockchain to say, "Hey, optimize for the new wave." Don't try to retrofit, say, an old wave. What's your thoughts? >> Jeremy: Correct, yeah, I mean I think there's a... ...a number of areas... Even in the data cyber society. Take an Enron scandal, right? That happened a decade plus ago. Out of that came regulation called Sarbanes-Oxley, right? And Sarbanes-Oxley's concept, right, is to ensure that companies publicly account for their records in a proper way, right? If there's an audit trail, that they don't sort of take their financial systems and misrepresent them, right? Blockchain, because it's a source of truth that's immutable, meaning it can't be changed, is a great way, right, to have more efficiency in that process. Today there's a whole industry that's popped up just for Sarbanes-Oxley, just to regulate the financial system, just to ensure that the books actually say what they're supposed to say, right? That's kind of the definition of what a smart contract can and should do. >> John: This is really an opportunity for entrepreneurs, if you think about it. I mean, a lot of alpha entrepreneurs are really licking their chops on Blockchain because they can see how it could disrupt industries. And I showed you some of the things we're working on, and what we're thinking about for SiliconAngle about media and data. But it brings up things that we obviously see every day in the press: the election, weaponizing content for bad things. Facebook's having a challenge right now on how they optimize their data for their own self-service reasons. This is a problem, this is a revolution. People are kind of tired, so...what's your view of the role of data to the human? I mean, obviously, you know, the cliche: "Oh, the users are in charge, "they should own their own data." Okay I got that. But how...how do you see that vision playing out? I mean not just from a Facebook which is a social network example, but how does data impact a user going forward in your vision? Because they could really change from the outside in. >> Yeah, I mean I think...part of what's critical with data is two things: one, identity really matters, right? How do you manage identity? And so I think there's a number of really fascinating Blockchain companies that are specifically focused on the identity question, right? And that's...that's true around the social media side, it's true around...how do I actually manage where I move... Identity around? So I think that's one side that's really, really critical to solve. I don't know that we've got a crystal ball yet on what it will ultimately look like. But the Blockchain model for identity allows us to... rethink the fabrics of what privacy is, what permission looks like, and what trust looks like with people I want to engage with and with people I don't want to engage with. It's interesting, you talk about the Blockchain culture being more societal and mission-driven. My word, but you're kind of implying that. I remember when the Cloud came out, it was... The network guys were in charge, and the app guys were like, have to feed off the network requirements. And then that sea-change flipped around. The app guys are in charge, data driving requirements for the network. Question for you is: Do you see a day, soon, where societal requirements will dictate technology? I mean, you're seeing... you're seeing that pattern kind of emerging now, it's kind of not yet been fully thought through in public commentary but, you know...we see these pressure points potentially impacting tech design. >> Jeremy: Yeah, yeah...I think there's actually a good tug-of-war or balance, right? So entrepreneurs naturally are going to run as fast as they can to see innovation hopefully with means of improving society, right? And then, you know, you have regulators and you have government agencies who are looking and saying, "Okay, you might be thinking about one myopic view "and we need to make sure "we're looking at the good of society." And so I think that tug-of-war you saw with the internet, right, where how much do we regulate the internet, right? And I think the balance was mostly healthy. And we're sort of seeing that through today with Blockchain as well, where...you know, things like ICOs have good and bad implications. The regulators have been watching it relatively closely. But they also haven't completely came down and clamped down on it, you know, even this week there's... There was a relative balance in the discussions that came out. >> John: The SEC's done a good job, they've... >> Correct. >> John: They whipped a few people in shape to send the signal, but they weren't foreclosing any innovation. >> Jeremy: That's correct, yeah. >> And ICOs...certainly there're some scams. What's the good sides of ICO? Obviously the scams are out there. What's the good side? The fundraising? Democratization? What's your take on the ICO? Initial coin offering opportunity. >> Yeah, you know, I think...in some ways, democratization has become such a buzzword it's lost its meaning, right? But if you think about what it really is, it's so powerful, because it's this concept, right, that we distribute power and control to the hands of many. And so, you know, I think there are a lot of public good technologies that actually can use that concept, right? The internet is a public good. You could argue Wikipedia is a public good, right? And so, utility-type tokens actually are valuable because they can have a dual nature to them. I think the other thing that I'm particularly interested in watching how ICOs evolve is...I think there's some danger in ICOs...coming in and... in the early stage market. Because early stage companies tend to be... They're so nascent that they need guidance, right? And I actually...I might be contradictory here to most people in the Blockchain space, but I actually think early stage investors have a lot of value in that space. And so, I am actually fascinated about what happens in later stage rounds and what do ICOs become there. So I think utility, and later stage rounds are actually two fascinating areas of ICOs. >> John: Sure, that's a great point. I would also say that the trend that we're seeing is... There's an early stage component that needs mentoring and needs some nurturing, I would agree with that. That's a classic VC, maybe some token economics in there, but again, different playbook. The tokenization of business is really interesting 'cause now you have token economics being applied to a preexisting, proven business, with a disruptive nature on the other side, is super interesting. So I have to ask you: Are we going to have a chief economic officer as a new role soon? Or, is that going to be...'cause remember, if you think about token economics, it's about opening up and changing the distribution of data and wealth, you can argue both are the same, but...how do you view that? Because that's a trend we're seeing. The tokenization of a business to disrupt an industry incumbent...set of incumbents. >> Correct, yeah, and I think it's a... it's really, really early days and what... You have really early stage companies that are thinking about tokenizing their business before they exist, right? And then you have other companies which are maybe past the innovation curve and they're trying to apply tokens to their business. >> A pivot of an existing business. >> Yeah, so we've seen these, right? Public companies that have added Blockchain to the name. I think the fascinating thing will become where... Fast-growing, real businesses, where there's a there there, they've crossed the chasm, go, "Okay how do we apply "tokenization to our company? "And how do we think about it, from both a... "commercial economic part of the business, "and then how do we think about it "from tokenizing the business?" And we haven't seen many cases yet, but I actually think that's one of the next waves we'll see. >> John: Great insight. I got to ask you on a personal level. You're doing some talking, obviously the founder of the company, CEO. What's going on? What do you talk about these days? What are you passionate about? I know you were talking to some folks at UC Santa Barbara. You mentioned going to teach down there. What are you talking about? What are you sharing publicly? what's on your mind these days? >> Yeah, I mean, I think...I'm personally deeply motivated every day by waking up and going, you know, "The financial service industry can go through a massive transformation, right? And I think there's a lot of really good companies that are doing that at the consumer level, and so, you know, I think our space...we have a unique place in time to be working at the commercial level. So the commercial level affects big parts of our economic infrastructure in ways that we don't think about. The Equifax breach was a pretty big deal to people, right? The financial crisis was a big deal to people. So, how do we imagine those kinds of industries, right? Supply chain, title, logistics, right? And how do we think about those industries, democratizing them with Blockchain? Those, to me, are the unsung heroes of what Blockchain will ultimately help transform society. >> John: It's interesting, you said you were kind of humble when you came on earlier. "I'm in boring areas of B to B..." But I got to say to your point about Cloud earlier, there's a calm before the storm, these boring areas that are, say, calm are really the grounds where you see disruption, and I think that's an area... Not just high-frequency trading, that's going to be, you know, always an issue, but in terms of real financial plumbing. >> Yeah. >> Perfect for a ledger, perfect for those things. Okay, take a plug for your company. How are people using you guys? What's the value proposition? What are some of the things that you guys are involved in? How does someone engage with you guys? Give the plug for PayStand. >> Yeah, so at PayStand, we tend to work with companies where there are high volumes of paper checks in the process. So if you have a $100,000 invoice that goes out, for example, with a company that you've been working at for a decade, and you have a contract that says it's a Net 60 contract, right? The challenge is, it's a paper check, you want to move it digitally, what do you move it digitally to? And the reality is the consumer payment companies that are focused on credit cards are not really an ideal solution for that because their business model is a percentage business model. There's nothing wrong with a percentage business model that charges a company two or three percent if I'm swiping for a five dollar cup of coffee, right? If it's a $100,000 payment that I owe someone that I know, and I have contract terms, I'm not going to pay the bank $3,000 to move ones and zeroes from this bank database to this bank database. So what we do with our network is we make that money movement fast, instant, automatic, verified, validated, right, with control, in a way where we can automate the process. >> It's so funny what jumped in my mind is punch cards to computers, tape to storage. This is interesting. So paper checks, probably big, I don't know what the numbers are, you might have them handy. People are doing paper checks, so you're building a system around paper checks, did I get that right? >> Yeah, so we digitize what would have been a paper check. Today over 50 percent of all commercial payments are still done in paper checks. So they're gone in our digital world, right? Like, you and I, we Venmo each other. But when a business goes to write a check, when they get an invoice, they send out a check. And so we digitize the whole process. The moment that the invoice is ready to go, to the moment it gets in the bank, it all becomes digital space. >> John: And the alternative is what, I got to go check when it was mailed, was it received, was it cashed, did it get put into the accounting system? And that's kind of... >> Jeremy: That's correct. >> That's the manual... >> Jeremy: That's the manual. So they spend...they'll spend a week tracking down the payment from the moment the controller says, "Okay to pay," to the time it sits in their bank account, that's humans, time, money. >> And an old, antiquated system that doesn't change because of...what? >> Jeremy: Well, it's legacy infrastructure in one way. But in another, you know, even the banking infrastructure, the...most of the banking infrastructure that are for commercial payments was designed in the 60s and 70s. And last time I checked, the 60s and 70s was before the internet of today. So they weren't really designed for digital realtime payments. And they weren't designed for commercial use cases like today. >> Is fraud a factor, or is that not a factor? Is that part of it, or...yes? >> Jeremy: Yeah and I think a key thing with what we do, enterprise payments, security is really, really important. We take it very, very seriously. And this is, again, one of the downsides to the legacy commercial infrastructure. When you have a check, right? You have this checking and routing number on it. Anybody takes that, in theory, that's all that identifies you and your company and your account. And so money can actually be moved and ran against in that case. With a network like ours, we can validate that you are who you say you are, you have the money in your account, it moved when it should, and you've actually authorized it. These are all things that we should know, but we just don't. >> John: And you take the data around it, you take that check, put it into the system. Okay so when does a company want...should be calling you. Is it like, "I'm overloaded with paper, "I want a new system, I'm doing a refresh." I mean, when do people call PayStand? What's the signals that would give your buyer some indicator of time to call PayStand? >> Yeah, so generally it's after...it's when they have high-volumes of checks and they're growing, and/or that they've basically taken their ERP, and they've done an ERP Cloud migration, right? And so now they've got their general ledger, and that financial system's not in a shoebox anymore, right? It's in a critical core ERP system. And so what they're finding is they've bought digital transformation for financial services and their accountant only sort of has half the solution. And so they come in and they use us to close the last mile. >> John: Okay, so I'm going to put my naysayer hat on and ask you the question: I love it, but what's this Blockchain thing? I'm an accounting guy, took one computer class, whatever, I get blockchained. How do you stay up to date, how do you ensure that I'm going to have a system that's going to be working? I know that Blockchain standards are changing. How do you guys mitigate that? How do you handle that question? >> Jeremy: Yeah, I mean I think the critical thing for our customers, right, is... For us, our customers, money moves in dollars, right? It leaves their bank account, and goes into their supplier's bank account, the supplier's bank account goes into their customer's bank account, right? Their financial system does not change. We're actually very, very sensitive to that. We think about this very different than a consumer solution, which is...consumer solutions almost have a... A critical mass question. They need everybody to get into the system for it to work. For commercial, you don't actually want to change the business process of your partners, right? It's really important, they've been doing this...so... So we are very thoughtful about our software doesn't change business process, it doesn't require you to enter into some kind of new economy or a new currency. You simply do what you're always doing, with the systems you're already using, right? And we just digitize the process to make them faster, cheaper, and automated. >> Awesome. Talk about your goals for the year at PayStand. Where are you guys at, company-wise? Funding, goals, hiring, what's going on? Give a quick final word on the company. >> Jeremy: Yeah, I mean I think we...you know... We're blessed right now, I would say we're one of, if not the fastest B to B payment company... fastest-growing B to B payment company today. So, you know, I think we have a long way to go... I would call this inning two for us, right? We ultimately...I think much more about what does 10 years look like than 12 months look like. Because this is the beginning of the commercial financial service wave. And so, you know, I think we ultimately believe the digital transformation is going to reinvent our industry. And if we can go lead the way, we'll be very happy. QAnd for us that just means continue growing, continue serving our customers, continue hiring, you know. I think if we do all that, you know, right place right time. >> John: Awesome...final question for you. To the folks out there watching, you're an expert in the industry...again, fintech as well as computer engineering. If my sister who is not savvy says, "Jeremy, what is Blockchain?" How would you describe Blockchain to someone who's interested and needs to know the definition and importance of Blockchain? >> Jeremy: Okay, so Blockchain, to me, is basically a way to be able to take information like you might have on your checkbook, or you might have in a spreadsheet, and use it where anybody can access it in a way that's actually easily, controllable, visible, secure, and automated. That doesn't sound very sexy, but the important thing is how we keep records affects all of society, right? We have records of who owns their houses, we have records of how much money we have in our account, we have records of who did we vote on, right? Those records are the foundation for our society. Currently companies own those records. Companies are fallible, right? And so what Blockchain does is it allows us to make a more infallible system to keep access to those records you and I care about. >> John: And this is an infrastructure opportunity, not so much crypto currency... kind of a distinction between the two, right? >> That's right, that's right. I would say crypto currency and money is like the first pillar app on top of Blockchain. >> John: Jeremy Almond, CEO, founder of PayStand, hot company, doing something really good in a growing, changing market called checks, paper checks, and if you have them and groan, digitize them. Great entry strategy for Blockchain. Thanks for coming on this Cube Conversation. And thanks for joining us here in Palo Alto. I'm John Furrier in the Cube Studios for Cube Conversations. Thanks for watching. (exciting orchestral music)

Published Date : Feb 13 2018

SUMMARY :

decentralized, and really targeting the B to B space and what's the disruptive enabler you guys are taking? Jeremy: Yeah, sure...so Paystand, like you mentioned, John: So basically, accounting, ledger, to help a company speed up their time-to-cash, Keep the lights on, but they almost have to start ERP is now in the Cloud, your financial software I mean, it's panning out, but not the way people thought. of financial services, and so the time is probably right, I mean, some of the e-commerce systems What's the big moving glacier, if you will, of change actually have the ability to not just decentralize and the famous Steve Jobs quote is, And so I think, you know, society said, "I don't have to provision servers? And again, the rest is history, look what they've done. the financial system, just to ensure that the books of the role of data to the human? in public commentary but, you know...we see these And so I think that tug-of-war you saw with the internet, to send the signal, What's the good sides of ICO? And so, you know, I think there are a lot Or, is that going to be...'cause remember, if you think about And then you have other companies which are maybe Public companies that have added Blockchain to the name. I got to ask you on a personal level. that are doing that at the consumer level, and so, you know, But I got to say to your point about Cloud earlier, What are some of the things that you guys are involved in? And the reality is the consumer payment companies you might have them handy. The moment that the invoice is ready to go, John: And the alternative is what, I got to go check Jeremy: That's the manual. And an old, antiquated system that doesn't change But in another, you know, even the banking infrastructure, Is fraud a factor, or is that not a factor? With a network like ours, we can validate that you are What's the signals that would give your buyer And so what they're finding is they've bought and ask you the question: the business process of your partners, right? Where are you guys at, company-wise? I think if we do all that, you know, right place right time. in the industry...again, fintech as well as like you might have on your checkbook, kind of a distinction between the two, right? the first pillar app on top of Blockchain. and if you have them and groan, digitize them.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeremy AlmondPERSON

0.99+

JeremyPERSON

0.99+

JohnPERSON

0.99+

2005DATE

0.99+

2014DATE

0.99+

$3,000QUANTITY

0.99+

twoQUANTITY

0.99+

EuropeLOCATION

0.99+

$100,000QUANTITY

0.99+

PayStandORGANIZATION

0.99+

John FurrierPERSON

0.99+

FacebookORGANIZATION

0.99+

Palo AltoLOCATION

0.99+

15 yearsQUANTITY

0.99+

Don TapscottPERSON

0.99+

SiliconAngle MediaORGANIZATION

0.99+

Steve JobsPERSON

0.99+

two thingsQUANTITY

0.99+

MarTechORGANIZATION

0.99+

iPhoneCOMMERCIAL_ITEM

0.99+

three percentQUANTITY

0.99+

800 square footQUANTITY

0.99+

PaystandORGANIZATION

0.99+

BlackberryORGANIZATION

0.99+

Feb 2018DATE

0.99+

threeDATE

0.99+

bothQUANTITY

0.99+

2008DATE

0.99+

60sDATE

0.99+

TodayDATE

0.99+

AmazonORGANIZATION

0.99+

BothQUANTITY

0.99+

12 monthsQUANTITY

0.99+

70sDATE

0.99+

10 yearsQUANTITY

0.99+

SECORGANIZATION

0.99+

EquifaxORGANIZATION

0.99+

first homeQUANTITY

0.99+

PayPalORGANIZATION

0.99+

UC Santa BarbaraORGANIZATION

0.98+

four yearsQUANTITY

0.98+

over 50 percentQUANTITY

0.98+

oneQUANTITY

0.98+

this weekDATE

0.98+

todayDATE

0.98+

CloudTITLE

0.98+

firstQUANTITY

0.97+

Cube StudiosORGANIZATION

0.97+

Jeremy Almond, PayStand | CUBE Conversation, Feb 2018


 

(orchestral music) >> Welcome to the special Cube Conversation here at Palo Alto studios, at the Cube office yeah I'm John Furrier, the co-founder of SiliconANGLE Media, and also co-host the Cube. Our next guest is Jeremy Almond is the CEO of Paystand, hot startup doin' some really new things in and around blockchain, decentralize, and really targeting the B2B space on a really compelling and interesting topic that a lot of people are interested in. Jeremy welcome to this Cube conversation. >> Awesome, thank you John. >> Hey so talk a little about the company, set the table for us, PayStand, what you guys are doing, why you were founded, and what's the disruptive enabler that you guys are taking, and what's the angle of your business? >> Yeah sure so, PayStand like you mentioned, is a B2B software platform, specifically focused on payment. So, you can imagine what Paypal or Venmo does from the consumer level, we do for complicated commercial transactions between accounts receivable and accounts payable departments that normally would be paying with paper checks in manual process. >> So basically accounting, ledger, I'm kind of guessing. Nice bit for blockchain. >> Correct, yeah, yeah. So, what we do is we apply blockchain technology to help a company speed up their time to cash, automate their business process and dramatically lower their transaction cost. >> I'll get your thoughts on this. I interviewed Don Tapscott at an event and we were riffing on this notion of the nature of the firm, right? People would come to an office, you'd have accounting, all these things that you'd have to put in place as systems. Now with this decentralized world we're living in, internet and with blockchain in particular, and a cryptocurrency market that's pretty frothy. But, you look at blockchain and you separate those two for a minute. You really can look at ways to change how work is organized. How do you guys view that, I mean, It's obviously a new, big wave coming. Then you've got businesses who are just trying to operate and make money, right? So, keep the lights on, but they also have to start rethinking about the future. So, what is this block wave, blockchain wave coming? How do you talk about that? Is it that disruptive? I mean, certainly centralized databases aren't going a away any time soon, but it's coming. What's your thoughts and reaction to that? >> It's coming. You know, I think it's... It will effect the enterprise, which is where we spend our time and space. In a lot of ways like cloud did. So, I've spent probably 15 years doing unsexy B2B tech in some way shape or form. And what we've seen is digital transformation in the enterprise has happened in a few key areas. CRM is now in the cloud. You have companies like Salesforce that have become significant. ERP is now in the cloud, you're financial software is now automated. Kind of ironically the last mile piece, the part that lubricates the business, the core of the business, the money movement piece, is actually still really, really manual. So, you have humans that sit around and they take an invoice and then a paper check and then they move it. And that process is very, very inefficient. And so, having a more automatic, smart financial system can improve the business's life in really significant ways. >> Also, you know, one of the things we've been commenting on opining here on the Cube is, I made a statement a couple weeks ago, OMAR tech, marketing technology, Wave, all those logos on those landscape slides, didn't really pan out cause the cloud kind of changed that. It's panning out, but not the way people thought. FinTech, financial tech, is also certainly important. Banks of safe trading, you see that. What is the inhibitor for these new trends? Because you mentioned moving paper around. I mean it's money, they probably don't want to mess with an operational system that's a quarter of their business. Is it fear? Is it tech? Both? What's your view on why it's taking so long? Or is it moving along at a speed you think is going to... Being adopted? >> Yeah, it's actually kind of a unique point in time right now. I think in one hand, financial services in general, part of their job is to manage risk. And so, they're going to be a lagging, in some ways, industry. And so, digital transformation, the internet has opened up and democratized media. It's opened up so many other areas. Blockchain, now, is the entry point for digital transformation of financial services. And so, the time is probably right now. We've been in the space. We started the company in 2014. I'd seen over the last three years, hearing banks, other large institutions, large enterprises go from skepticism to curiosity. >> What's the technology stack look like? Obviously, four years is like decades in the blockchain world. Obviously, people are running as fast as they can. It's kind of a moving train, at many levels, business model side, as well as the tech stack. And this is really the opportunity a lot of these systems-- I mean some of the e-commerce systems are 20 year old tech stacks, some are even older. Just going back four years since you were founded, what's the big moving glacier, if you will, of change and how are you guys managing that? And how should people think about managing the risk of the tech stack? >> Yeah, I mean, I think on the blockchain specific side, in the early days a lot of it was about currency and actual payment. I think what we're seeing now is the opportunity for blockchain, particularly in the enterprise, to actually dramatically improve their operations side. So, Ethereum, private blockchains, actually have the ability to, not just decentralize how money movement or networks operate, but how an internal system operates. I'll give you an example, we used the blockchain to-- A private blockchain to actually control approval work flow. So when a payment goes out, often times you need your accounts payable person to send a payment out, but the controller or the treasury or someone else has to sign off on it. So that signature, you need it to be valid, trusted, the identity around it, right? And you want an audit record. And so blockchains a really, really good use case for something like that. That's not pure play payments, it's not pure play settlement, it doesn't require a million people to get on. It just can operate in the business in a really critical function in a better way than the current technology does. >> It's interesting. I love these new technology opportunities, cause there's always going to be a tipping point and the famous Steve Jobs quote is, "Hey, if I was asked to build a better phone "in 2005 I would've built an excellent, better Blackberry" But then he built the Iphone, so he thought differently. No one was really asking for the Iphone. So, the question I get a lot from skeptics in blockchain is, no one's really asking for blockchain. So, again, this is kind of like, you could always say, I'm building a better centralized database system in a distributive computing environment. Okay, we've done that. >> Yeah >> Are people asking for blockchain? Or are they just asking for it in a different way? What's your thoughts to that? >> Yeah, I would say that there's a big picture question of are people ask for it? And I'd say, society is actually asking for it. Part of my personal story is, my family, blue collar family, my mother's side immigrated here, her generation. My brick layer father, they spent their entire lives getting their first home. 800 square foot home, it was nothing special, but it was their American dream. In 2008, in the financial crisis, they lost that house. And so I think society said financial services and core parts of our economy actually could-- we could do better. And I think the magic thing about technology is we get to imagine the world not as it is, but as it ought to be. So, one, I think society is actually asking for can the core parts of our economy actually do better? Can we dream up something better? I think that's the purest part of what the folks in the blockchain movement are trying to do. That's at a very high level. And then I think practically, right, for businesses like we operate day in and day out, if there's technology that allows them to be able to operate their business more efficiently, drop their costs and grow faster, how would that work? It's in some ways like cloud. How does cloud work? I think now we're really getting into the deep mess of it. But cloud was transformative to the business. VOIP was transformative to some businesses. Inbound marketing was transformative to some businesses. Blockchain is the same kind of concept. >> And cloud too, there's a lot of naysayers. I remember I use the first EC2 instances of Amazon, when it came out being an entrepreneur I don't have to have provision servers. This is amazing. And I can put credit card down and pay a few bucks? And then even still, up until three or four years ago, they were dismissed as relevant. And, again, the rest is history. Look what they've done. So, there's always going to be those naysayers. But, to the point about cloud and blockchain, and say even crypto, this is a wave and we are very bullish on this movement because we see the waves coming way out there and it's huge. And this is probably bigger than the other waves combined, in our opinion. So, you mentioned societal change. This is a big deal. You're seeing regulations right now in GDPR, in Europe. Trying to govern an old database market, that's even in an own problem. It's a mess database wise. But it makes sense from a society standpoint. People want to pull their data out. This is a trend. You've got societal forces and then technical legacy. This could be an opportunity for a blockchain to saying, hey optimize for the new wave, don't try to retrofit, say, an old wave. What's your thoughts? >> Correct. Yeah, I think there's a number of areas, even in the data side with society. Take an Enron scandal that happened a decade plus ago. Out of that came regulation called Sarbanes-Oxley. And Sarbanes-Oxley's concept is to ensure that companies publicly account for their records in the proper way. That there's an audit trail. That they don't, sort of, pick their financial systems and misrepresent them. The blockchain, because it's a source of truth that's immutable, meaning it can't be changed, is a great way to have more efficiency in that process. Today, there's a whole industry that's popped up just for Sarbanes-Oxley, just to regulate the financial system, just to ensure that the books actually say what their supposed to say. That's kind of the definition of what a smart contract can and should do. >> This is though, really an opportunity for entrepreneurs when you think about it. A lot of alpha entrepreneurs are really lickin' their chops on blockchain, because they can see how it could disrupt industries. And this is really, again, I showed you some things we're working on and what we're thinking about with SiliconANGLE about media and data. But it brings up things that we, obviously, see every day in the press. The election, weaponizing content for bed, things-- Facebooks having a challenge right now in how they optimize their data for their own self service reasons. This is a problem. This is a revolution. People are kind of tired. So, what's your view of the role of data to the human? Obviously the cliche, oh the users are in charge, they should own their own data. Okay, I get that, but how do you see that vision playing out? Not just from Facebook, that's just a social network example. But how does data impact a user going forward in your vision? Because they could really change from the outside in. >> Yeah, I think part of what's critical with data is two things. One, identity really matters. How do you manage identity? So, I think there's a number of really fascinating blockchain companies that are specifically focused on the identity question. And that's true around the social media side. It's true around, how do I actually manage where I move identity around? So, I think that's one side that's really, really critical to solve. I don't know that we've got a crystal ball yet on what it will ultimately look like. But the blockchain model for identity allows us to rethink the fabrics of what privacy is, what permission looks like and what trust looks like with people I want to engage with and with people I don't want to engage with. >> That's interesting. You talk about the blockchain culture being more societal and mission driven, my word, but you're kind of implying that. I remember when the cloud came out. It was, the network guys were in charge and the app guys had to feed off the network requirements. And then that seat changed, flipped around. The app guys are in charge, data is driving requirements for the network. Question for you is do you see a day soon where societal requirements will dictate technology? You're seeing that pattern kind of emerging now, kind of not yet been fully thought through in public commentary. We see the pressure points potentially impacting tech design. >> Yeah, I think there's actually a good tug of war balance. So, entrepreneurs naturally are going to run as fast as they can to see innovation. Hopefully with means of improving society. And then you have regulators and you have government agencies who are looking and saying okay, you might be thinking about one myopic view and we need to make sure we're looking at the good of society. And so, I think that tug of war you saw with the internet, where how much do we regulate the internet? And I think the balance was mostly healthy. And we're sort of seeing that through today with blockchain as well. Where things like ICOs have good and bad implications. The regulators have been watching it relatively closely. But they also haven't completely came down and clamped down on it. Even this week there's... There was a relative balance in the discussions that came out. >> The SECs done a great job. >> Correct. >> They've whipped a few people into shape, sent the signal, but they weren't foreclosing any innovation. >> That's correct. >> And ICOs certainly had some scams. What's the good sides of ICOs? Obviously the scams are out there. What's the good sides? The fundraising, democratization? What's your take on the ICO, initially coin offering opportunity? >> Yeah, I think in some ways democratization has become such a buzz word it's lost it's meaning. But if you think about what it really is it's so powerful, because it's this concept that we distribute power and control to the hands of many. And so, I think there are a lot of public, good technologies that actually can use that concept. The internet is a public good. You could agree Wikipedia is a public good. And so, utility type tokens actually are valuable, because they can have a dual nature to them. I think the other thing, that I'm particularly interested in watching how ICOs evolve, is-- I think there's some danger in ICOs coming in, in the early stage market. Because early stage companies tend to be... They're so nascent that they need guidance. And I actually, I might be contradictory here to most people in the blockchain space, but I actually think early stage investors have a lot of value in that space. And so, I am actually fascinated about what happens in later stage rounds and what do ICOs become there. So, I think utility and later stage rounds are actually two fascinating areas of ICOs. >> Jeremy, that's a great point. I would also say that the trend that we're seeing is: there's an early stage component that needs mentoring and needs some nurturing, I would agree with that. That's a classic VC-- Maybe some token economics in there, but again different playbook. The tokenization of business is really interesting, cause now you have token economics being applied to a pre-existing proven business with a disruptive nature on the other side. >> Correct. >> Is super interesting. So, I have to ask you. Are we going to have a chief economic officer as a new role soon? Or is that going to be-- Cause it made me think about token economics it's about opening up and changing the distribution, or data and wealth, you could argue both are the same. But how do you view that? Because that's a trend were seeing. The tokenization of a business to disrupt an industry incumbent, set of incumbents. >> Correct, yeah. And I think it's really, really early days in what... You have really early stage companies that are thinking about tokenizing their business before they exist. And then you have other companies which are maybe past the innovation curve and their trying to apply tokens to their business. >> A pivot of an old, existing business. >> Yeah, so we've seen these, right? Public companies that have added blockchain to their name. I think the fascinating thing will become where fast growing, real businesses where there's a there, there. They've crossed the chasm. Go, okay, how do we apply tokenization to our company? And how do we think about it, from both a commercial economic part of the business and then how do we think about it from tokenizing the business? We haven't seen many cases yet, but I actually think that's one of the next waves we'll see. >> Great. Great insight. I got to ask you, on a personal level, you're doing some talking, obviously your the founder of the company, CEO, what's goin' on? What are you talking about these days? What are you passionate about? I know your talking to some folks at University of Santa Barbara. You mentioned going to teach down there. What are you talking about? What are you sharing publicly? What's on your mind these days? >> Yeah, I think I'm personally deeply motivated every day by waking up and going. The financial service industry can go through a massive transformation. And I think there's a lot of really good companies doing that at the consumer level. And so, I think our space, we have a unique place and time to be working at the commercial level. So, the commercial level effects big parts of our economic infrastructures in ways that we don't think about. The Equifax breach was a pretty big deal to people, right? The financial crisis was a big deal to people. So, how do we imagine those kinds of industries? Supply chain, title, logistics. And how do we think about those industries democratizing them with blockchain? Those, to me, are the unsung heroes of what blockchain will ultimately help transform society. >> That's interesting. You said you were kind of humble when you came on earlier. I'm in boring areas of B2B, but I got to say, to see your point about cloud earlier. There's a calm before the storm, these boring areas that are, say, calm, are really the grounds where you see disruption. I think that's an area-- Not just high frequency trading, that's going to be always an issue, but in terms of real financial plumbing. Perfect for a ledger, perfect for those things. Okay, explain-- Take a plug for your company. How are people using you guys? What's the value proposition? What are some of the things you guys are involved in? How does someone engage with you guys? Give the plug for Paystand. >> Yeah, so at Paystand we tend to work with companies where there are high volumes of paper checks in the process. So if you have a hundred thousand dollar invoice that goes out, for example, with a company you've been working out with for a decade. And you have a contract that says it's a net 60 contract. The challenge is, it's paper check. You want to move it digitally. What do you move it digitally to? And the reality is, the consumer payment companies that are focused on credit cards are not really an ideal solution for that because their business model is a percentage business model. And there's nothing wrong with a percentage business model that charges a company two or three percent if I'm swiping for a five dollar cup of coffee. If it's a hundred thousand dollar payment that I owe someone that I know and I have a contract terms. I'm not going to pay the bank 3,000 dollars to move ones and zeros from this bank database to this bank database. So, what we do with our network is we make that money movement fast, instant, automatic, verified, validated with control, in a way that we can automate the process. >> It's so funny. What jumps into my mind is punchcards to computers, tape to duck storage. This is interesting. So, paper checks, probably big, I don't know what the numbers are, you might have them handy. People are doing paper checks. So, you're doing a system around paper checks, did I get that right? >> Yeah, so we digitized what would have been a paper check. Today, over 50 % of all commercial payments are still done in paper checks. So, they're gone in our digital world. You and I, we Venmo each other. But when the business goes to write a check, when they get an invoice they send out a check. And so we digitized the whole process. The moment that the invoice is ready to go to the moment it gets in the bank. It all becomes digital space. >> And the alternative is what? I got to go check when it was mailed, was it received, was it cashed, did it get put into the accounting system? And that's kind of, that's the manual-- >> That's the manual. So, they'll spend a week tracking down the payment. From the moment the controller says okay to pay, to the time it sits in their bank account. That's humans, time, money. >> And an old antiquated system that doesn't change because of what? >> Well it's legacy infrastructure in one way. But in another, even the banking infrastructure-- Most of the banking infrastructure that are for commercial payments was designed in the 60s and 70s. And last time I checked, the 60s and 70s was before the internet today. So, they weren't really designed for digital real time payments. And they weren't designed for commercially used cases like today. >> Is fraud a factor or is that not a factor? Or is that not a part of it? Or yes? >> Yeah, I think a key thing with what we do, enterprise payments, is security is really, really important. We take it very, very seriously. And this is, again, one of the down sides to the legacy commercial infrastructure is when you have a check, you have this checking and routing number on it. Anybody takes that, in theory, that's all that identifies you and your company and your account. Money can actually be moved and ran against in that case. With a network like ours, we can validate that you are who you say you are, you have the money in your account, it moved when it should and you've actually authorized it. These are all things that we should know, but we just don't. >> And you put the data around it. You take that payload, aka check, put it into the system. So, when does a company want-- Should be calling you? Is it like, I'm overloaded with paper. I want a new system. I'm doing a refresh. When do people call Paystand? What's the signals that would give your buyer some indicator of time to call Paystand? >> Yeah, so generally it's after-- It's when they have high volumes of checks and they're growing. And, or, that they've basically taken their ERP and they've done an ERP cloud migration. So, now they've got their general ledger and that financial system is not in a shoebox anymore, it's in a critical, core ERP system. And so, what they're finding is they bought digital transformation for financial services and their accountant only sort of has half the solution. And so they come in and they use us to close the last mile. >> Okay, so I'm going to put my naysayer hat on and ask you the question. I love it, but what's this blockchain thing? I'm an accounting guy. Look at one computer class or whatever, I get blockchain. How do you stay up to date? How do you ensure that I'm going to have a system that's going to be working? I know that blockchain standards are changing. How do you guys mitigate that? How do you handle that question? >> Yeah, I think the critical thing for our customers is for us, our customers, money moves in dollars. It leaves their bank account and goes into their supplier's bank account. The supplier's bank account goes into their customer's bank account. Their financial system does not change. We're actually very, very sensitive to that. We think about this very different than a consumers solution. Which is, consumer solutions almost have a critical mass question. They need everybody to get into the system for it to work. For commercial, you don't actually want to change the business process of your partners. It's really important, they've been doing this. So, we are very thoughtful about our software. It doesn't change business process. It doesn't require you to enter into come kind of new economy or new currency. You simply do what your always doing with the systems you're already using. And we just digitize the process to make them faster, cheaper and automated. >> Awesome. Talk about your goals for the year with Paystand. Where you guys at company wise? Funding? Goals? Hiring? What's going on? Give a quick final word on the company. >> Yeah, I think we're blessed right now. I would say we're one of, if not the fastest B2B payment companies, fastest growing B2B payment companies today. I think we have a long way to go. I would call this inning two for us. We ultimately-- I think much more about what does ten years look like than twelve months look like because this is the beginning of the commercial financial service way. And so, I think we ultimately believe that digital transformation is going to reinvent our industry. And if we can go lead the way we'll be very happy. And for us that just means continue growing, continue serving our customers, continue hiring. I think if we do all that... Right place, right time. >> Awesome. Final question for you. The folks out there watching, your an expert in the industry, again, FinTech as well as computer engineering. If my sister, who is not savvy, says Jeremy what is blockchain? How would you describe blockchain to someone who's interested and needs to know the importance definition and the importance of blockchain? >> Okay, so blockchain to me is basically a way to be able to take information like you might have on your techbook or you might have in a spreadsheet and use it where anybody can access it in a way that's actually easily controllable, visible, secure and automated. That doesn't sound very sexy, but the important thing is how we keep records effects all of society. We have records of who owns our houses. We have records of how much money we have in our account. We have records of who did we vote on. Those records are the foundation for our society. Currently, companies own those records. Companies are fallible And so, what blockchain does, is it allows us to make a more infallible system to keep access to those records you and I care about. >> And this is an infrastructure opportunity, not so much cryptocurrency, kind of a distinction between those two, right? >> That's right. I would say, cryptocurrency and money is like the first pillar app on top of blockchain. >> Jeremy Almond, CEO, founder of Paystand, hot company doing something really good in a growing, changing market called checks, paper checks. And if you have um', grow um', digitize them. Great entry strategy for blockchain. Thanks for coming on this Cube Conversation. Thanks for joining us here in Palo Alto. I'm John Furrier in the Cube studios For Cube conversation, thanks for watching. (orchastral music)

Published Date : Feb 8 2018

SUMMARY :

is the CEO of Paystand, hot startup So, you can imagine what Paypal I'm kind of guessing. to help a company speed up their time to cash, of the nature of the firm, right? ERP is now in the cloud, you're financial software What is the inhibitor for these new trends? And so, the time is probably right now. I mean some of the e-commerce systems in the early days a lot of it was about currency and the famous Steve Jobs quote is, And I think the magic thing about technology I don't have to have provision servers. And Sarbanes-Oxley's concept is to ensure that I showed you some things we're working on But the blockchain model for identity and the app guys had to feed off the network requirements. And I think the balance was mostly healthy. but they weren't foreclosing any innovation. What's the good sides of ICOs? And so, I think there are a lot of public, cause now you have token economics Or is that going to be-- And then you have other companies And how do we think about it, I got to ask you, on a personal level, And so, I think our space, we have a unique What are some of the things you guys are involved in? And the reality is, the consumer payment companies I don't know what the numbers are, The moment that the invoice is ready to go From the moment the controller says okay to pay, But in another, even the banking infrastructure-- is when you have a check, you have this You take that payload, aka check, put it into the system. And so they come in and they use us to close the last mile. and ask you the question. And we just digitize the process Where you guys at company wise? And so, I think we ultimately believe in the industry, again, FinTech but the important thing is how we keep records is like the first pillar app on top of blockchain. And if you have um', grow um', digitize them.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jeremy AlmondPERSON

0.99+

2014DATE

0.99+

EuropeLOCATION

0.99+

twoQUANTITY

0.99+

JeremyPERSON

0.99+

2005DATE

0.99+

PaystandORGANIZATION

0.99+

Palo AltoLOCATION

0.99+

JohnPERSON

0.99+

John FurrierPERSON

0.99+

15 yearsQUANTITY

0.99+

Feb 2018DATE

0.99+

FacebookORGANIZATION

0.99+

ten yearsQUANTITY

0.99+

three percentQUANTITY

0.99+

Don TapscottPERSON

0.99+

IphoneCOMMERCIAL_ITEM

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

two thingsQUANTITY

0.99+

3,000 dollarsQUANTITY

0.99+

Steve JobsPERSON

0.99+

first homeQUANTITY

0.99+

OMARORGANIZATION

0.99+

FacebooksORGANIZATION

0.99+

twelve monthsQUANTITY

0.99+

70sDATE

0.99+

2008DATE

0.99+

TodayDATE

0.99+

PayStandORGANIZATION

0.99+

800 square footQUANTITY

0.99+

four yearsQUANTITY

0.99+

AmazonORGANIZATION

0.99+

GDPRTITLE

0.99+

60sDATE

0.99+

BothQUANTITY

0.99+

bothQUANTITY

0.99+

BlackberryORGANIZATION

0.98+

over 50 %QUANTITY

0.98+

WikipediaORGANIZATION

0.98+

a weekQUANTITY

0.98+

zerosQUANTITY

0.97+

todayDATE

0.97+

firstQUANTITY

0.97+

one wayQUANTITY

0.97+

a decade plus agoDATE

0.97+

OneQUANTITY

0.97+

PaypalORGANIZATION

0.97+

oneQUANTITY

0.96+

VenmoORGANIZATION

0.96+

EC2TITLE

0.95+

CubeORGANIZATION

0.95+

EquifaxORGANIZATION

0.95+

halfQUANTITY

0.95+

this weekDATE

0.95+

a decadeQUANTITY

0.95+

University of Santa BarbaraORGANIZATION

0.95+

20 year oldQUANTITY

0.94+

60 contractQUANTITY

0.94+

one sideQUANTITY

0.94+

five dollar cupQUANTITY

0.94+

block waveEVENT

0.93+

one computer classQUANTITY

0.91+

SiliconANGLEORGANIZATION

0.9+

MonDATE

0.89+

threeQUANTITY

0.89+

decadesQUANTITY

0.88+

first pillarQUANTITY

0.88+

Robert Herjavec & Atif Ghaur, Herjavec Group | Splunk .conf2017


 

>> Announcer: Live from Washington, DC it's theCUBE. Covering .conf2017. Brought to you by Splunk. >> Welcome back here on theCUBE continuing our coverage of .conf2017 sponsored by Get Together in your nations capitol, we are live here at the Walter Washington Convention Center in Washington, DC. Along with Dave Vellante I'm John Walls Joined now by a couple CUBE alums, actually, you guys were here about a year ago. Yeah, Robert Herjavec, with the Herjavec Group of course you all know him from Shark Tank fame answer Atif Ghauri who is the VP of Customer Service Success at the Herjavec Group. I love that title, Atif we're going to get into that in just a little bit. Welcome. >> Thank you. >> Good to see you all. >> We're more like CUBE groupies We're more like CUBE groupies. >> Alums. >> Alums, okay, yeah. >> If we had a promo reel. >> Yeah, we love it here. We get free mugs with the beautiful Splunk. >> That doesn't happen all the time does it. >> Where did you get those? >> They're everywhere. >> Dave, I'll share. >> So again for folks who don't, what brings you here what, what's the focus here for the Herjavec Group in in terms of what you're seeing in the Splunk community and I assume it's very security driven. >> Yeah, well we've been part of the Splunk community for many years going on gosh, eight, nine years. We're Splunkers and we use Splunk as our core technology to provide our managed service and we manage a lot of customer environments with Splunk and we've been really forefront of Splunk as a SIM technology for a long time. >> Atif, excuse me, David, just the title, VP of Customer Service Success, what's under that umbrella? >> Yeah, it's actually pretty simple and straightforward given especially that Splunk's aligned the same way. Christmas success is King, right. If our customers aren't successful then how are we successful? So what we're trying to do there is putting the customer first and help in growing accounts and growing our services starting with our customers that we have today. >> It was actually Doug Maris, I have to give him full credit him and I were on a flight, and I said to him what's really critical to you growing revenue, efficiency, innovation and he said, number one for us is customer success. So we're very happy to steal other people's ideas if they're better. >> So security's changing so fast. You mentioned SIM, Splunk's narrative is that things are shifting from a traditional SIM world to one of an analytic driven remediation world. I wonder if you could talk about what you're seeing in the customer base, are people actually shifting their spending and how fast and where do you see it all going? >> Yeah, so the days of chasing IOC's is a dead end. Because that's just a nonstop effort. What's really happening now is technique detection. Defining, looking at how hackers are doing their trade craft and then parroting that. So Splunk has ideas and other vendors have ideas on how to go about trying to detect pattern recognition of attacker trade craft. And so what definitely was driving what's next when it comes to security automation, security detection, for our customers today. >> You know, we always tell people and it's just dead on but the challenge is people want to buy the, sexy, exciting thing and why I always try to say to customers is you're a dad and you have three kids, and you have a minivan. You don't really want to own a minivan, you want a really nice Ferrari or Corvette but at the end of the day, you have three kids and you got to get to the store. And in the security world it's a little bit like that. People talk about artificial intelligence and better threat metrics and analytics but the core, foundational basis still is logs. You have to manage your log infrastructure. And the beauty of Splunk is, it does it better than anyone and gives you an upstream in fact to be able to do the analytics and all those other things. But you still got to do the foundation. You still got to get three kids into the minivan and bring back groceries. >> So there's been a lot of focus, obviously security's become a Board level topic. You hear that all the time, you used to not hear it all the time, used to be IT problem. >> Absolutely, the only way I could get a meeting with the CEO or CIO was because I was on Shark Tank. But as a security guy, I would never meet any executives. Oh yeah I spend 80% of my time meeting with CEO, not just CIO's, but CEO's and Boards and that kind of stuff, absolutely. >> How should the CIO be communicating the Board about security, how often, what should be the narrative you know, transparency, I wonder if you could give us your thoughts. >> It's a great question. There's a new financial regulation that's coming out where CISO's and CIO's actually have to sign off on financial statements related to cyber security. And there's a clause in there that says if they knowingly are negligent, it carries criminal charges. So the regulations coming into cyber security are very similar to what we're seeing and Sarbanes Oxley like if a CEO signs an audit statement that he suspects might have some level of negligence to it I'm not talking about outright criminal fraud but just some level of negligence, it carries a criminal offense. If you look at the latest Equifax breach, a lot of the media around it was that there should be criminal charges around it. And so as soon as as you use words like criminal, compliance, audit, CEO's, executives really care. So the message from the CIO has to be we're doing everything in our power, based on industry standards, to be as secure as we can number one. And number two we have the systems in place that if we are breached, we can detect it as quickly as possible. >> So I was watching CNBC the other day and what you don't want to see as a Board member, every Board members picture from Equifax up there, with the term breach. >> Is that true? >> Yeah, yeah. >> See, but, isn't that different. Like you never, like if we think back on all the big breaches, Target and Sony they were all seminal in their own way. Target was seminal because the CEO got fired. And that was the first time it happened. I think we're going to remember Equifax, I didn't know that about the Board. >> For 50 seconds it was up there. I the sound off. >> You don't want to be a Board member. >> I mean, I hate to say it, but it's got to be great for your business, first of all it's another reason not to be a public company is one more hurdle. But if you are they need help. >> They absolutely need help. And on point I don't want to lose is that what we're seeing with CISO's, Chief Information Security Officers, Is that that role's transcending, that role is actually reporting directly to in to CEO's now. Directly into CFO's now, away from the CIO, because there's some organizational dynamics that keep the CISO from telling, what's really going on. >> Fox in henhouse. >> Exactly. >> You want to separate those roles. You're you're seeing that more often. What percent of the CISO's and CIO's are separate in your experience? >> Organizations that have a mature security program. That have evolved to where it's really a risk-based decision, and then the security function becomes more like risk management, right. Just what you they've been doing for decades. But now you have a choice security person leading that charge. >> So what we really always saying theCUBE, it's not a matter of if, it's when you're going to get infiltrated. Do you feel as though that the Boards and CIO's are transparent about that? Do Boards understand that that it's really the remediation and the response that's most important now, or there's still some education that has to go on there? >> You know, Robert speaks to Boards are the time he can comment on that, but they really want to know two things, how bad is it and how much money do you need. And those are the key questions that's driving from a Board perspective what's going to happen next. >> What's worse that Equifax got breached or that Equifax was breached for months and didn't know about it. I mean, as a Board member the latter is much worse. There's an acceptance like I have a beautiful house and I have big windows a lots of alarms and a dog, not a big dog, but still, I have a dog. >> A yipper. >> Yeah, I have a yipper. It's worse to me if somebody broke into my house, was there for a while and my wife came home at night and the person was still there. That to me is fundamentally worse than getting an alarm and saying, somebody broke the window, went in, stole a picture frame. You're going to get breached, it's how quickly you respond and what the assets are. >> And is it all shapes and sizes, too I mean, we talk about big companies here you've mentioned three but is it the mid-level guys and do smaller companies have the same concerns or same threats and risks right now? >> See these are the you heard about. What about all the breaches you don't know. >> That's the point, how big of a problem are we talking about? >> It's a wide scaling problem right and to the previous question, the value now in 2017, is what is the quality of your intelligence? Like what actions can I take, with the software that you're giving me, or with the service that you're giving me because you could detect all day but what are you going to do about it? And you're going to be held accountable for that. >> I'm watching the service now screen over here and I've seen them flash the stat 191 days to detect an infiltration. >> That sounds optimistic to me. I think most people would be happy with that if they could guarantee that. >> I would think the number's 250 to 300 so that now maybe they're claiming they can squeeze that down but, are you seeing any compression in that number? I mean it's early days I know. >> I think that the industry continues to be extremely complicated. There's a lot of vendors, there's a lot of products. The average Fortune 500 company has 72 security products. There's a stat that RSA this year that there's 1500 new security start ups every year. Every single year. How are they going to survive? And which ones do you have to buy because they're critical and provide valuable insights. And which ones are going to be around for a year or two and you're never going to hear about again. So it's a extremely challenging complex environment. >> From the bad guys are so much more sophisticated going from hacktivists to whatever State sponsored or criminal. >> That's the bottom line, I mean the bad guys are better, the bad guys are winning. The white hats fought their way out to the black hats, right. The white hats are trying, trying hard, we're trying to get organized, we're trying to win battles but the war is clearly won by the by the black hats. And that's something that as an industry we're getting better at working towards. >> Robert, as an investor what's your sentiment around valuations right now and do you feel as though. >> Not high enough. >> Oh boy. >> Managed security companies should be trading way higher value. >> Do you feel like they're somewhat insulated? >> Its a really good question, we're in that space you know we're we're about a $200 million private company. We're the largest privately held, managed security company in the world actually. And so I always think every time we're worth more I think wow, we couldn't be worth more, the market can't get bigger. Because your values always based for potential size. Nobody values you for what you're worth today. Because an investor doesn't buy history an investor doesn't buy present state, an investor buys future state. So if the valuations are increasing, it's a direct correlation because the macro factors are getting bigger. And so the answer to your question is values are going to go up because the market is just going to be fundamentally bigger. Is everybody going to survive? No, but I think you're going to see valuations continue to increase. >> Well in digital business everybody talks about digital business. We look at digital business as how well you leverage data. We think the value of data is going through the roof but I'm not sure customers understand the intrinsic value of the data or have a method to actually value their data. If they did, we feel like they would find it's way more valuable and they need to protect it better. What are you seeing in that regard with customers? >> There's an explosion of data in that with IoT, internet of things, and the amount of additional data that's come now. But, to your point, how do you sequence and label data? That's been a multi-decade old question more organizations struggle with. Many have gone to say that, it's all important so let's protect it all, right. And verses having layers of approach. So, it's a challenging problem, I don't think across all our customer base. That's something that each wrestling with to try to solve individually for their companies. >> Well, I think you also have the reality though of money. So, it's easy to say all the data is important, Structured unstructured, but you look at a lot of the software and tools that you need around this floor are sold to you on a per user or per ingestion model. So, even though all your data is critical. You can't protect all your data. It's like your house, you can't protect every single component of it, you try, and every year gets better maybe get a better alarm maybe I'll get rid the yappy dog and get a Doberman you know you're constantly upgrading. But you can't protect everything, because reality is you still live in an unstructured, unsafe world. >> So is that the complexity then, because the a simple question is why does it take so long to find out if there's something wrong with your house? >> I think it's highly complex because we're dealing with people who are manipulating what we know to their benefit in ways we've never done it. The Wannacry breach was done in a way that had not been done before. If it had done before we could have created some analytics around it, we could created some, you know, metrics around it but these are attacks that are happening in a way we've never seen before and so it's this element of risk and data and then you always have human nature. Gary Moore was that the Council this morning. The writer of Crossing the Chasm, legendary book, and he said something very interesting which was Why do people always get on a flight and say, good luck with the flight, hope you fly safe. But they don't think twice about hopping in their car and driving to the grocery store. Whereas statistically, your odds of dying in that car are fundamentally greater, and it's human nature, it's how we perceive risk. So it's the same with security and data in cyber security. >> As security experts I'm curious and we're here in DC, how much time you think about and what your thoughts might be in the geopolitical implications of security, cyber war, you know it's Stuxnet, fast forward, whatever, ten years. What are you thoughts as security practitioners in that regard? >> The longest and most heated battles in the next World War, will not be on Earth, they'll be in cyberspace. It's accepted as a given. That's the way this Country is moving. That's the way our financial systems are tied together and that's the way we're moving forward. >> It's interesting we had Robert Gates on last year and he was saying you know we have to be really careful because while we have the United States has the best security technologies, we also have the most to lose with our infrastructure and it's a whole new you know gamification or game theory balance we have to play. >> I would agree with him that we have some of the best security technology in the world but I would say that our barometer and our limiter is the freedom of our society. By nature what we love about our country and Canada is that we love freedom. And we love giving people access to information and data and free speech. By nature we have countries that may not have as good a security, but have the ability to limit access to outsiders, and I'm not saying that's good by any means but it does make security a little bit easier from that perspective. Whereas in our system, we're never going to go to that, we shouldn't go to that. So now we have to have better security just to stay even. >> To Dave's point talking about the geopolitical pressures, the regulatory environment being what it is, you know legislators, if they smell blood right, it in terms of compliance and what have you, what are you seeing in terms of that shift focus from the Hill. >> Great question. I did a speech to about two thousand CIO's, CISO's not long ago and I said, how many people in this room buy security to be more secure and how many people buy because you have to be compliant. 50/50, even the security ones admitted that how they got budget was leveraging the compliance guys. It was easier to walk into CEO's office and say look, we have to buy this to meet some kind of a political, compliance, Board issue. Than it was to say this will make us better. Better is a hard sell. So that, has to go to the head to pull the trigger to do some of that. >> You know, I think in this geopolitical environment it's look at the elections, look at all the rhetoric. It's just there is going to be more of that stuff. >> A lot's changed in crypto and its potential applications in security. More money poured into ICO's in the first half than venture backed crypto opportunities. >> There are practical applications of blockchain technology all across the board, right, but as you mentioned is fundamentally built on pathology. On core gut security work and making a community of people decide whether something's authentic or not. It's a game changer, as far what what we could do from a platform standpoint to secure our financial systems and short answer it's volatile. As you saw with the fluctuation of Bitcoin and then the currency of Bitcoin, how it's gone up and down. It's quite volatile right now because there's a lot of risk So I say what's the next Bitcoin in six months or eighteen months and what's going to happen to the old Bitcoin and then all the money that into there, where is that going to go? So that's a discuss the pivot point I think for the financial services industry and more and more their larger institutions are just trying to get involved with that whole network of blockchain. >> Crypto currencies really interesting. In some ways it's the fuel that's funding the cyber security ransomeware. I mean it's one of the easiest ways to send money and be completely anonymous. If you didn't have crypto currency, how would you pay for ransomware? You give them your checking account? You deposit into their checking account? So, I think that you're seeing a big surge of it but if you look at the history of money or even checks, checks were developed by company called Deluxe here in the United States 104 years ago. They're a customer of ours, that's why I know this, but the basis of it is that somebody, a real institution with bricks and mortar and people in suits is backing that check, or that currency. Who's backing crypto currency today? So you have, by nature, you have this element of volatility and I don't know if it's going to make it or it's not going to make it. But inevitably has to cross from a purely electronic crypto form to some element of a note or a tender that I can take from that world and get backing on it. >> That's kind of what Warren Buffet has said about it. I mean I would respond that it's the community, whatever that means, that's backing it. I mean, what backs the greenback, it's the US Government and the US military. It's an interesting. >> Right like, at the end of the day I would still rather take a US dollar than even a Canadian dollar or a UK dollar. >> Gentlemen thanks for being with us. >> Great to see you. >> Thank you for the coffee mug. >> This is incredible. >> There's actually stuff in it too so be careful. >> I drank it is that okay? >> Can I go to the hospital. >> Atif, thanks for the time and Robert good luck with that new dog. (all laughing) >> Don't tell my wife I got rid of her dog. >> In time. >> In time. All things a time, theCUBE continues live here Washington DC at .conf2017 right after this.

Published Date : Sep 27 2017

SUMMARY :

Brought to you by Splunk. of Customer Service Success at the Herjavec Group. We're more like CUBE groupies Yeah, we love it here. for the Herjavec Group in in terms of We're Splunkers and we use Splunk as that Splunk's aligned the same way. what's really critical to you growing revenue, I wonder if you could talk about what you're seeing Yeah, so the days of chasing IOC's is a dead end. but at the end of the day, you have three kids You hear that all the time, you used to Absolutely, the only way I could get a meeting How should the CIO be communicating the Board So the message from the CIO has to be and what you don't want to see as a Board member, I didn't know that about the Board. I the sound off. You don't want to be I mean, I hate to say it, but it's got to be great that keep the CISO from telling, what's really going on. What percent of the CISO's and CIO's Just what you they've been doing for decades. the remediation and the response that's most important now, and how much money do you need. I mean, as a Board member the latter is much worse. and the person was still there. What about all the breaches you don't know. and to the previous question, the value now 191 days to detect an infiltration. That sounds optimistic to me. that down but, are you seeing And which ones do you have to buy From the bad guys are so much more sophisticated are better, the bad guys are winning. around valuations right now and do you feel as though. be trading way higher value. And so the answer to your question is values the intrinsic value of the data or have a method There's an explosion of data in that with IoT, of the software and tools that you need around this floor and say, good luck with the flight, hope you fly safe. and we're here in DC, how much time you think about and that's the way we're moving forward. and it's a whole new you know gamification but have the ability to limit access that shift focus from the Hill. and how many people buy because you have to be compliant. it's look at the elections, look at all the rhetoric. More money poured into ICO's in the first half all across the board, right, but as you mentioned I mean it's one of the easiest ways to send money it's the US Government and the US military. end of the day I would still rather take a US dollar Thank you for the in it too so be careful. Atif, thanks for the time and Robert good luck In time.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Atif GhauriPERSON

0.99+

DavidPERSON

0.99+

DavePERSON

0.99+

RobertPERSON

0.99+

2017DATE

0.99+

Dave VellantePERSON

0.99+

TargetORGANIZATION

0.99+

SonyORGANIZATION

0.99+

Crossing the ChasmTITLE

0.99+

three kidsQUANTITY

0.99+

Gary MoorePERSON

0.99+

AtifPERSON

0.99+

80%QUANTITY

0.99+

EquifaxORGANIZATION

0.99+

Doug MarisPERSON

0.99+

Washington DCLOCATION

0.99+

Warren BuffetPERSON

0.99+

DCLOCATION

0.99+

EarthLOCATION

0.99+

Herjavec GroupORGANIZATION

0.99+

Washington, DCLOCATION

0.99+

last yearDATE

0.99+

SplunkORGANIZATION

0.99+

John WallsPERSON

0.99+

50 secondsQUANTITY

0.99+

Atif GhaurPERSON

0.99+

eightQUANTITY

0.99+

ten yearsQUANTITY

0.99+

Robert GatesPERSON

0.99+

two thingsQUANTITY

0.99+

a yearQUANTITY

0.99+

CUBEORGANIZATION

0.99+

72 security productsQUANTITY

0.99+

191 daysQUANTITY

0.99+

RSAORGANIZATION

0.99+

twoQUANTITY

0.99+

World WarEVENT

0.99+

twiceQUANTITY

0.99+

first halfQUANTITY

0.99+

threeQUANTITY

0.99+

Shark TankTITLE

0.98+

eighteen monthsQUANTITY

0.98+

United StatesLOCATION

0.98+

Walter Washington Convention CenterLOCATION

0.98+

Robert HerjavecPERSON

0.98+

104 years agoDATE

0.98+

US GovernmentORGANIZATION

0.97+

six monthsQUANTITY

0.97+

$200 millionQUANTITY

0.97+

this yearDATE

0.97+

oneQUANTITY

0.97+

todayDATE

0.97+

nine yearsQUANTITY

0.97+

FoxORGANIZATION

0.96+

.conf2017EVENT

0.96+

300QUANTITY

0.96+

CNBCORGANIZATION

0.96+

first timeQUANTITY

0.96+

50/50QUANTITY

0.95+

FerrariORGANIZATION

0.95+

250QUANTITY

0.94+

ChristmasEVENT

0.93+

DeluxeORGANIZATION

0.92+

CoveringEVENT

0.92+

a year agoDATE

0.91+

US militaryORGANIZATION

0.91+

WannacryEVENT

0.91+

multi-decadeQUANTITY

0.9+

this morningDATE

0.89+

eachQUANTITY

0.89+

1500 new security startQUANTITY

0.88+

decadesQUANTITY

0.88+

Jane Allen & Jay Cline | Veritas Vision 2017


 

>> Male: Live from Las Vegas. It's theCUBE covering Veritas Vision 2017. Brought to you by Veritas. (upbeat music) >> Welcome to Las Vegas, everybody. This is the Cube and we are here covering Veritas Vision 2017. It's the hashtag Vtas, V-T-A-S Vision, and this is Day one of two days of coverage here. I'm with Stu Miniman. My name is Dave Vellante. Jane Allen and Jay Cline are here from PwC. Jane is a partner and principal and Jay is a partner. Folks, welcome to the Cube, good to see you. >> Thank you. >> Thank you. >> Thanks for having us. >> So PwC leading global consultancy, I would say one of the top three, four, easily. Top 2. Maybe even top 1. >> Jane: Yes. >> I mean, you guys are gold standard for global. You solve problems that most people can't even begin to touch, except for a handful of companies. Jane, let's start with you. What's hot these days in your world? >> So I lead a practice, an information governance practice here at PwC, founded in a lot of folks with technology, legal support, regulatory backgrounds. And it pertains to all companies these days, right? How do you manage your data, to manage all the risks and reap the benefits of it. Certainly a hot topic and certainly with your privacy regulations on board, cyber risk, and just again all the benefits of data that companies are trying to take advantage of. It's been a growing consultancy practice and something that's very relevant to companies of all industries. >> Jay, we've heard a lot today about GDPR. I know it's something that you've been knee-deep in. What do people need to know about GDPR? >> I think GDPR boils down to one proposition, being able to prove that you have control over people's data. I think that summarizes the 72 different requirements of GDPR. >> Yeah, so GDPR, for those of you who don't know, General Data Protection Regulation, came out of the EU. One person on theCUBE called it a socialist agenda. (Jane laughs) But it's serious business, and if you can't ... I mean, actually, Jay, summarize, you know, what people should know about the exposure. I mean, essentially you have to be able to identify personal information and be able to delete that personal information on request, right, for any European Union citizen? >> Resident or citizen. >> Right, okay. >> That's right. >> So if somebody walks into Joe's pizza shop and says I want to sign up a bingo card to get, you know, mailings and your emailings, technically speaking, that person, if they wanted to do business in the EU, is responsible, is that right? >> You've got to know 360 degree view of all the personal data that you have of your employees, your consumers, your customers. You've got to be able to produce evidence on demand that you have this level of control. And whenever somebody comes in and asks for access to their data, to correct it, to export it, to their email, or to erase it, you've got to know whether you can deny that request or do you have to fulfill it, and you usually only have 30 days to fulfill it. >> So is this one of the hotter topics going on in your world these days? And what percent of your clients are actually prepared? >> I'll let Jay comment on how many are prepared, but you know, I think most companies, frankly, are trying to figure out how to be compliant and what is it they actually need to do. But it is a hot topic. I think even before GDPR, the landscape was already complex, right? People are trying to respond to litigation investigations, retention requirements from regulations, cyber risk, how do we manage it? And it's all about, what data do we have, where is it, and what are we doing with it, and how are we controlling it? And those questions are already there. GDPR highlights it. And with a May 2018 deadline, I mean, it's really putting the spotlight on this topic. >> Oh, yeah, that's one little, the fact that we forgot to mention, the clock is ticking. We're down under a year. So how about customer readiness? >> I think when we cross the one-year milestone in May, a lot of boards got exercised. The phone started ringing off the hooks, because they realized, we only have one more budget cycle to get this done. And so now I think, they're realizing that because GDPR hits the tech stack, and the IT budgets had already been planned for, the release cycles had already been put in place, they're now starting to ask, well, we can't get everything done by next May. What are the most important high-risk things that we do need to get done? And there's going to be more spillover work after May, I think. >> I think this highlights something that was already present in terms of the need for cross-functional senior leadership to pay attention to this, right? This isn't just a legal or privacy topic. It isn't just an IT topic. This really hits across organization and these folks need to work together. >> Jane, could you help us kind of uplevel a little bit. If I look at information governance, you mentioned it's super complex. You know, every company I talked to, they're deploying more and more sass. In the keynote this morning, Veritas said most of their customers have at least three clouds. We find, you know, absolutely it's, the strategy, especially if I start, oh, well, just different groups start using things, then how do I govern it? Do I even worry about security and backup and everything like that? How does this fit in the overall picture for most customers? >> Well, I guess that's what's interesting, right? There's no one right way of doing this right. And so it depends on your business, your industry, your customer base, your geographic location and outreach, and the data landscape. And you have to make smart decisions of what works within your corporate business culture even, of what is it that we need to keep and how we need to keep it and enable, you know, our engineers, our users, our customers, to leverage data, but also manage our risks. And there's just not one way to look at it. But again it goes down to really knowing what control you have, what you have, and where is it, right? But that's what's interesting, is for every company to figure out how is the best way for them to tackle it. >> So who's driving the information governance bus these days? I mean, with Sarbanes-Oxley it was the CFO. With the federal rules of civil procedure, it was kind of the general council. Who's really sort of in charge today? >> Well, I mean, depending on who owns it in an organization, looks a little different, usually legal and/or privacy, and oftentimes they are within the same group. >> Dave: So a chief privacy officer? >> Yeah. >> General counsel obviously involved, IT? >> Sometimes the compliance office again, depending how that's structured, but generally in that legal compliance privacy realm. >> Right. Okay, and when I think about some of those previous, you know, generations, Sox in particular, but also I guess FRO, CP. There was an effort within the company, because the ROI was just like, oh, we got to do this. It was like, okay, what does it cost to not comply, you know. >> Jane: Yeah. >> They would try to thread that needle. But there was always a faction that said, hey, we can... And consultancies were part of this. We can actually get value out of this. It's an opportunity to clean up your data, maybe to get rid of stuff, maybe you can reclaim some wasted space or, you know, et cetera, et cetera. Is that the way it is today with GDPR? And maybe we could unpack that a little bit. >> Yeah. One of the first steps that you have to take for GDPR, is to discover where all of your European personal data is, so data discovery effort. And in doing that, we've had a number of clients that for the first time, they've really put together a view of how they make money using data. And they're finding data, their chief marketing officer is finding data they didn't know they had. And so now they're able to monetize that data if they can use it responsibly within the privacy regulations of GDPR. So marketing is oftentimes funding, helping IT and Legal fund their GDPR efforts. >> And I think one of the other benefits is, if you have to go through this exercise to be compliant, but then you get additional insights in your data and you know where to invest more for those additional business opportunities, then at least hopefully you're reaping, again, more ROI off the effort. >> Well, I know the clock's ticking and there's a sort of virtual gun to organization's heads, but getting into that whole value notion, monetization, most organizations that we talked to, they don't really have an understanding of how data fuels monetization. Not necessarily monetizing the data, but how it contributes to monetization. What do you see in the customer base? >> This is the biggest area I think where GDPR is going to morph after May of 2018. I think the companies that can protect their exposure to this regulation, by going through the same processes to find out where their data is, they are positioned to monetize that data, to take advantage of new market opportunities, in Europe in particular. >> Okay. By the way, we should mention that this actually, the law is in effect, it's just the penalties aren't being-- >> Jay: Right. >> invoked at this point in time, right? >> Jay: That's right. >> So the recital is one-year grace period? And a lot of people are thinking, well, maybe we'll get another year of grace period. It's going to be really interesting to see how that goes down. And presumably the EU's going to go after the big pockets, right? I mean, those are the guys who have to be most concerned about this. But what about that midsize company? For your midsize clients, what are you advising them, that may not have the budgets of the big guys? >> We've been advising our clients that there are actually three ways that you can get hit by GDPR. The one that everybody's talking about is the famous 4% fine on your global revenues. That's what the regulators would impose on you if they discovered that you had an egregious violation of privacy. But there's another way that people aren't talking about that's going to be live on May 25th of 2018. And that's a new litigation risk for B2C. Anybody in the B2C space, even if you're midsize, if you violate the rights of a class of people, they can sue you on May 25th. And you can bet there are going to be law firms that are going to take advantage of this new situation. >> Dave: So they can sue you as individuals? >> As a class of individuals. There's also for people in the B2B space, we're seeing right away the contracting risk. And RFPs, they're saying as a condition to bid for this work, you've got to be able to sign that you are GDPR compliant. So you'll be locked out of the European market if you're B2B and you're not ready on May 2018. >> So we were talking off-camera. I was sort of struggling with trying to understand the direct fit with technology, Jay, and I thought you had a good answer. So what's technology's role in all of this? I mean, technology, can it help us get out of this problem? >> There's two parts where technology's very important. First is just discovering where your data is. That takes a lot of technology tools based on your tech stack, to be able to have an ongoing real-time data map. But the other one, the harder part, is responding to these individual rights requests, to ask for where their data is, to correct it, to delete it, to have that 360 view of individuals throughout your information environment. I think that takes IT to a new level. It hits all parts of the tech stack. >> All right. Because an individual can essentially say, I need to know what you know about me, right, that's part of it? >> Well, exactly. And a lot these companies that collect customer data and structured systems, they weren't really built for this type of exercise, to go through and search for something and actually dispose of it. And so companies are having to think very tactically. Okay, can I do this across all my different systems? And then certainly an unstructured data stores, again, what's there and how do we figure that out? >> So in the keynote this morning, we heard about GDPR. It looked like there was... I called it the doomsday clock, what was up on the wall. Can you bring back, how is Veritas doing? How are they helping customers with information governance and GDPR? >> Well, I think one of the really exciting things they demoed and talked about there is some of the data scanning or data profiling information, whether it be the classification or reporting out in terms of what is in this unstructured stores. Again, in order for companies to figure out what it is that they need to do process and technology wise is, what do we have out there again? And they're giving and enabling customers with some of their tools to be able to get some insights there, which I think is really transformative. I think people have been talking about these things from either a legal discovery standpoint, certainly a cyber risk. And I think this is just really adding on. So again, these tools help enable all of them, but certainly for GDPR. >> You have to get this first step right, the data discovery and classification, because if you scope GDPR too big, your compliance costs are through the roof. But if you scope it too small, your exposure's too big. So having a good discovery and classification approach, is critical to the success of your GDPR program. >> Has the industry solved the classification problem? I mean, for years, you really struggled to classify data. You could classify, you know, maybe data in an email archive, but data became so distributed by its very nature. Has that problem been solved? >> I would say no, but I've certainly seen a huge uptick in companies that actually finally just biting the bullet and getting themselves organized. But again, at least doing it because, hey, we need to figure it out for GDPR and privacy, we need to figure it out for cyber security controls, we need to figure it out for e-discovery, and just regular records management and how long we need to keep things. And so I think they recognize, wait, this satisfies a lot of different needs. But I don't know that there's an easy solution to it either. >> And the best practice organizations have automated that presumably, 'cause otherwise it's not going to scale, right? >> In the long-term that's what they're seeking, right, but you need to get the structure right, so you need to have file plans and organization of the information that makes sense to your employees and the way you do work, and then hopefully tie that back, knowing the data life cycle, to be able to classify things based on role, based on access, based on data type. So there's a lot of upfront work, but ultimately that's the-- >> So that's a taxonomical exercise, is that right? >> It is. That's a fancy word. >> Okay. But that's a heavy lift. And then it changes. >> It is, it is. But I think. Again, there's multiple benefits to that. >> Sure. >> And then going forward, you've got things in order for all those reasons. You can leverage the power of the technology, and then your functional groups and what work they do. People know what work they do, how long it generally it needs to be kept. And if you kind of can marry those two things from the business, the technology side, you can get set up and lauch. >> And then you can automate the policies around data retention. >> Exactly. >> What's your relationship specifically with Veritas? >> Well, you know, they're a client of ours, but we're also a client of theirs. >> Dave: Okay. >> I guess we're friends on a number of different angels and whatnot. But our practice tends to... Or we are technology agnostic in general, but we definitely want to stay on top of the different leaders in the industry. So that when we go to our clients, we can recommend, hey, these these are the top two or three that we believe could help you based on your situation, based on your data landscape, and be able to advise in that regard. So Veritas, between the backup tools, their e-discovery, and certainly some of the things they're doing on, you know, information governance and GDPR, is certainly one of the key providers that our clients should consider. >> So, I have sort of set up this discussion with a little background on PwC, clearly one of the leading consultancies out there. I would point to global, footprint, your deep industry expertise, you understand technology, you've been around, you know, you've got deep relationships. So other than those, what's the big difference, you know? Why PwC? And you can repeat some of those if you want. Probably be more articulate than I was. >> I think one thing that's different is what we call the end-to-end approach, where there might be other companies that have some of the qualities that you've talked about. But with GDPRs, it hits across five to ten different budgets in an enterprise. And we'll take a company through a transformational journey across all of them. We have auditors, and we have lawyers, and technologists, forensic scientists. GDPR really hits across all the functions of the enterprise. Because of our scale, we can hit all of these. Whereas other providers will take different slices of that. >> I would also add, PwC looks at our clients as forever clients. We're not looking for a one transaction and see you later. I mean, we look at them in terms of we want to be a firm that supports and partners them, whether it be on the consulting side, audit, tax, whatnot. And so we look at that that way in terms of trying to support them. And maybe that's just one point solution, maybe it's broader. But we'll bring the right experts to the table that fits for that client. And so we always want to think about it that way. While we might have ways and approaches that we leverage, hey, if they've got a specific need or a specific specialty, we'll bring the right expert to the firm. >> So that leads me to like my last question, which is, so it sounds like GDPR, and in chain of the context of that answer, is not just a tactical sort of pain relief project. Is it part of more strategic digital transformations? Are you able to make that connection? Or are people just in too much of a rush to fix the pain? >> No. Jay and I were talking about this earlier today. I mean, I'll use the example of some of the cloud transformation that companies are going through, right, if they haven't already, and thinking about their data and how they operate differently. And wait a minute, we don't need to forklift all of our data over. Let's think about it. And oh, by the way, let's make sure we're compliant with GDPR, right? So there's a number of different ways that you can kind of pull in different pieces that are helpful to clients. I think there were a number of different aspects to that, that we were talking about. So it's certainly something front and center, but it's not a one time, let's check the box and move on exercise either. >> Awesome. All right. We got to go. Thanks very much for coming the Cube. >> Thank you. >> Thanks. >> It's good to meet you guys. All right, keep it right there, everybody. We'll be back with our next guests. This is theCUBE. We're live from Veritas Vision 2017 in Las Vegas. We'll be right back. (techno music)

Published Date : Sep 19 2017

SUMMARY :

Brought to you by Veritas. This is the Cube I would say one of the top three, I mean, you guys are gold standard for global. and just again all the benefits of data I know it's something that you've been knee-deep in. I think GDPR boils down to one proposition, I mean, essentially you have to be able to identify of all the personal data that you have I mean, it's really putting the spotlight on this topic. the fact that we forgot to mention, And there's going to be more spillover work and these folks need to work together. In the keynote this morning, Veritas said And you have to make smart decisions the information governance bus these days? and oftentimes they are within the same group. Sometimes the compliance office again, what does it cost to not comply, you know. It's an opportunity to clean up your data, And so now they're able to monetize that data but then you get additional insights in your data but how it contributes to monetization. This is the biggest area I think where GDPR it's just the penalties aren't being-- the EU's going to go after the big pockets, right? And you can bet there are going to be law firms that you are GDPR compliant. and I thought you had a good answer. I think that takes IT to a new level. I need to know what you know about me, right, And so companies are having to think very tactically. So in the keynote this morning, we heard about GDPR. that they need to do process and technology wise is, is critical to the success of your GDPR program. You could classify, you know, But I don't know that there's an easy solution to it either. and organization of the information that makes sense That's a fancy word. And then it changes. Again, there's multiple benefits to that. And if you kind of can marry those two things And then you can automate the policies Well, you know, they're a client of ours, and certainly some of the things they're doing on, you know, And you can repeat some of those if you want. some of the qualities that you've talked about. And so we always want to think about it that way. and in chain of the context of that answer, And oh, by the way, We got to go. It's good to meet you guys.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

JayPERSON

0.99+

JanePERSON

0.99+

Jane AllenPERSON

0.99+

DavePERSON

0.99+

May 25thDATE

0.99+

Stu MinimanPERSON

0.99+

May 2018DATE

0.99+

30 daysQUANTITY

0.99+

Jay ClinePERSON

0.99+

General Data Protection RegulationTITLE

0.99+

MayDATE

0.99+

May 25th of 2018DATE

0.99+

fiveQUANTITY

0.99+

Las VegasLOCATION

0.99+

two partsQUANTITY

0.99+

PwCORGANIZATION

0.99+

EuropeLOCATION

0.99+

FirstQUANTITY

0.99+

May of 2018DATE

0.99+

threeQUANTITY

0.99+

GDPRTITLE

0.99+

360 degreeQUANTITY

0.99+

one-yearQUANTITY

0.99+

two thingsQUANTITY

0.99+

4%QUANTITY

0.99+

two daysQUANTITY

0.99+

oneQUANTITY

0.99+

one propositionQUANTITY

0.99+

VeritasORGANIZATION

0.99+

EUORGANIZATION

0.99+

next MayDATE

0.99+

three waysQUANTITY

0.98+

first stepQUANTITY

0.98+

first timeQUANTITY

0.98+

first stepsQUANTITY

0.98+

todayDATE

0.98+

SoxORGANIZATION

0.98+

one transactionQUANTITY

0.98+

fourQUANTITY

0.97+

under a yearQUANTITY

0.96+

72 different requirementsQUANTITY

0.95+

360 viewQUANTITY

0.94+

OneQUANTITY

0.94+

one timeQUANTITY

0.94+

one thingQUANTITY

0.94+

one wayQUANTITY

0.93+

Paul Farrell, Nehemiah & Jason Cook, The Chertoff Group | Security in the Boardroom


 

>> Hey Jeff Rick here with the cube. We're here in Palo Alto at the Chertoff event, its called security in the boardroom. We're talking about the security conversations that need to happen in the boardroom not just at the IT department and locking down your phone and your VPN. Its really how do we elevate the conversation, especially as things continue to change, digital transformation is forcing people to move quickly and everyone's becoming a digital company. All our assets are becoming digital. So it needs to get elevated. We're excited to have, our next guest, he's Paul Farrell, he's the CEO of Nehemiah. Paul welcome. >> Thank you. >> And joining us again, Jason Cook from the Chertoff Group. Good to see you again. >> Hi. Alright so lets jump into it, so you're CEO... Well before you get it, first tell people about Nehemiah, you are familiar with the company. >> Nehemiah has a cyber security suite where we know, manage and help protect organizations and the knowing part is what we're probably going to talk more about today which is our risk quantifier software. >> Well lets jump in what is risk quantifier software? >> We take a bottoms up look at the organization to get a high fidelity copy of the corporate network and then we layer business applications on top of it so boards can get a look at what the business exposure is to the cyber security risk. >> So the network and the application. So very techy piece of it, how much of it, in terms of the process and the people get filled into that piece as well. >> We call that process BIA or Business Impact Analysis and a lot of the Fortune 500 firms have already been doing this to be compliant with Sarbanes Oxley and other regulations. And its being able to work with them to take some of that information out of the system and combine it with the cyber information we have, to give them a good look at risk. So if I'm looking to invest $2 million dollars, what's my risk buy down. Is it 10 million? Is it two million? Is it nothing? I just need to do it. So these are some of the questions we're trying to help boards answer. >> I'm just curious, from a why do we need to do this point of view. How much of it is compliance and governance and regulation? And how much of it is not? Its just, we need to protect ourselves from the bad guys. I would imagine especially financial services and healthcare, a lot of it was driven by compliance before but is that percentage going down? >> Go ahead. >> So, no not at all. >> Not at all, still mainly governance, compliance regulation. >> And what you have to bring together now is security risk and compliance. Its all the one thing. And at the board level, you don't have those as separate agenda topics anymore and that's why we talk about a risk management program. Especially the Fortune 500 boards becoming very educated and also actioning and taking forward and that's really where that stuff comes together. Compliance, especially if you look at the finance industry, health care industry for example, its always going to be there cause its a duty of care as to the industry, how to run the business and to all of the consumers at the end of the day at the end of that. So you need a bit of (indistinct talking) and its a very useful tool, if you apply risk management to it, if you're applying security to it and bring those things together. Many CSOs will talk about situational awareness and one of things they need to do, if they've got a seat at the board table, is, what do I have, what's my assets? And that's no longer just purely from a technical perspective. You hear the phrase, many organizations have technology silos, that don't talk, that don't come together, perhaps different business units that are running those silos. And at the board level how do you ascertain what you've got when you have an issue and that situational awareness then, is also going to help drive, what parties do I take when I have to take action. So that's something that Nehemiah's security is really focusing on. So they're saying let us put together for you and work with you to assemble your silos of IT network and everything else there. Essentially underpinning your digital footprint as you go on that digital journey. But then how do you have actionable business intelligence that's going to help you prioritize how to run that, how to secure it but also how to invest and run your business through this journey. >> You're going to say summn? >> I think its the word that Jason used a lot is the journey and there's a lot of things we should be doing just because its cyber hygiene and its intelligence, is what we should do to run our business by taking the business information and marrying what we got up and then communicate it in language that the board knows. Which is key, don't be talking about WannaCry viruses and all that and SNB ports. That doesn't make any sense to them, they make business decisions every day, so its we're investing X and you take a risk profile overtime and you say, this will help reduce our exposure here, but its good and we need to do it. Whether compliance says it or not, we need to be protecting our data. That's one of the things that... Compliance is a checklist and we need to check, make sure that's done and everybody does audited financial statements and that's great, we should do it every year but there's somethings that are basic we should do basic stuff in finance, we should do basic stuff in cyber hygiene as well as updating our systems, keeping them current, educating our employees on scams and stuff that happen. These are things that need to happen over time and so its a journey for the board and for the senior management but for every employee, to be able to know these things and to actually integrate it as part of their everyday job, in my opinion. >> It sounds like the cyber hygiene stuff is still just not (laughs), we're not hygienic enough (laughs) as we should be. Its amazing that just continues to be a recurring thing. >> One of the ethos approaches that Nehemiah is taking to this is, they call it know. What do you know about your environment and it starts there. To say so, especially for an organization, as many are on a digital journey. Well what is underpinning all of our digital footprint. Do you know that? And unfortunately so many organizations out there have bits of it but they don't maintain that. So when you have, for example, the famous WannaCry incident, they kicked off very very large organizations as well as many small one were impacted. Why? Well cause they didn't actually understand what they had and they didn't have the business intelligence and the business analytics to make a prioritization to say, we need to invest our focus and time and effort here to respond to this activity from a hygiene perspective. And until those things are addressed, you're not actually going to truly be able to go on your digital journey as an organization. So if anything, what this is doing is heightening the awareness at the board level that you need to have an articulated dialogue, where at the board level you can understand the impact to the business of what's going on here but then take all of that and take all the knowledge that you're building to then drive actionable intelligence, business as well as technology coming together, which underpins risk management in that context. >> And I would imagine those types of incidents are helpful in terms of helping to define what is that risk. >> Tragically helpful. >> Yeah tragically helpful but still without those types of things its probably harder or harder to really monetize what is the risk so that I can come up with a portfolio that then I can validate my investment. >> Its about being prepared. Its about thinking about what are your critical business systems. And so when you got something happening, no matter what it is, lets make sure that critical business systems are protected first and then we'll get to the the less priority systems. Its not that they're not all important, its just that there're some that are more critical. Inventory systems or sales at the end of the quarter, it tends to be we find to be, not only the systems but also the time of the year. If you're selling seeds, March and April, North America is really big. If you're Amazon its Christmas time. The inventory system and order entry system has got to be going so but its taking that step back now and saying; what are our critical business systems, what are the risks and then, the only thing we also look at that we've talked to Jason about is, we know what the risks are but what's the probability those risks are going to hit you. Everybody's not a 100%, some people are 20%. So when you go to the board you got to give them a true idea of, this is the true risk that we're seeing and we've tempered it down by saying if it was a 100 million at risk but you only have a 20% chance of getting that exploit then its really just $20 million that we're talking about not 100 cause the days are gone where we slam our hand on the board that you must do this, you must do this. Boards are more cyber aware now than ever and they don't want to just pay people throw information at them they want to understand it to be able to respond properly and not react. >> Right. So really the Net Nat is speaking a language, boil it down into language in the decision making process in which they're use to doing. Cause its not a zero sum game, it not a one or zero anymore, its really a probability decision and the risk assessment. >> Yeah that happens over time. That's the whole thing. There's ebbs and flows of the year and you look at things over time and I think that's the other thing that we'd like to talk about. And its renassessing, and one of the things that we talk is, we talk with a lot of people and the chief information security officers are embracing us because they're looking for new ways to be able to communicate properly and succinctly to the boards and that's one of the big things that we see. >> Good cause when they get bumped up the agenda items on the board that's what you want to see right. (laughing) >> Absolutely. >> Well Paul and Jason thanks for stopping by really appreciate your time >> Thank you. >> I'm Jeff Rick you're watching the cube, we'll see you next time, thanks for watching.

Published Date : Aug 25 2017

SUMMARY :

that need to happen in the boardroom Good to see you again. Well before you get it, first tell people about Nehemiah, and the knowing part is what we're probably going to talk and then we layer business applications on top of it So the network and the application. and a lot of the Fortune 500 firms and healthcare, a lot of it was driven by compliance before Not at all, still mainly governance, and one of things they need to do, and so its a journey for the board Its amazing that just continues to be a recurring thing. and the business analytics to make a prioritization in terms of helping to define what is that risk. or harder to really monetize what is the risk it tends to be we find to be, not only the systems So really the Net Nat is speaking a language, and that's one of the big things that we see. on the board that's what you want to see right. we'll see you next time, thanks for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jason CookPERSON

0.99+

Paul FarrellPERSON

0.99+

JasonPERSON

0.99+

PaulPERSON

0.99+

Jeff RickPERSON

0.99+

100 millionQUANTITY

0.99+

Palo AltoLOCATION

0.99+

20%QUANTITY

0.99+

two millionQUANTITY

0.99+

Chertoff GroupORGANIZATION

0.99+

10 millionQUANTITY

0.99+

100%QUANTITY

0.99+

$20QUANTITY

0.99+

AmazonORGANIZATION

0.99+

millionQUANTITY

0.99+

oneQUANTITY

0.99+

todayDATE

0.98+

NehemiahPERSON

0.98+

zeroQUANTITY

0.98+

AprilDATE

0.98+

OneQUANTITY

0.97+

WannaCryEVENT

0.97+

North AmericaLOCATION

0.97+

$2 million dollarsQUANTITY

0.97+

Sarbanes OxleyORGANIZATION

0.97+

The Chertoff GroupORGANIZATION

0.96+

firstQUANTITY

0.96+

MarchDATE

0.95+

100QUANTITY

0.91+

NehemiahORGANIZATION

0.77+

ChristmasEVENT

0.76+

ChertoffORGANIZATION

0.75+

NatORGANIZATION

0.71+

thingsQUANTITY

0.63+

yearQUANTITY

0.62+

Fortune 500ORGANIZATION

0.61+

Fortune 500 firmsQUANTITY

0.45+