Image Title

Search Results for Jarvis:

Jarvis Sam, Snap Inc. | Grace Hopper 2017


 

>> Announcer: Live from Orlando, Florida. It's the Cube. Covering, Grace Hopper Celebration of Women in Computing brought to you by Silicon Angle Media. >> Welcome back to the Cube's coverage of the Grace Hopper Conference here in Orlando, Florida. I'm your host Rebecca Knight. We're joined by Jarvis Sam, he is the manager of global diversity issues at Snap Inc. Welcome. >> Thank you so much for having me. I'm really happy to be here. >> So, I've gotta--first of all, you're wearing a Rosie the Riveter shirt, we've got these tchotchkes here, can you explain to our viewers a little bit about them? We got to, we got to talk about these first. >> Of course, so, the shirt was actually inspired by our Lady Chilla, that's our local women employee resource group at Snap. The idea was take the ghost, a representative mascot of Snap Inc. and parlay that with the idea of Rosie the Riveter, of course powerful in her own right. >> Rebecca: Alright, I love it, and then these spectacles are...? >> Yeah, so spectacles are Snap Inc.'s first ever hardware product released earlier this year. They allow for you to take an in-the-moment Snap, to be featured on your phone, using Bluetooth technology for iPhones and then WiFi technology for Android. They allow individual users to record Snaps on their phone, while of course not distorting the experience of being able to use their hands in the moment. >> Rebecca: So, I love it, these are the recruiting tactics: your own products. >> Exactly >> Want to play with these toys? Come work for us? >> Yes! >> So, tell us a little bit about what you do, Jarvis. Before you were at Snap, you were at Google. You were interested in really engaging in these diversity issues. So what do you at Snap? >> Yeah, so, at Snap, I manage our global diversity effort. What that includes is analyzing the diversity framework across three key verticals; first on the pipeline layer. So, what are we doing by way of K-12 education to ensure communities of color as well as women-- >> Rebecca: K-12? Wow. >> Exactly. >> Have specific opportunities in the space to be impactful. We often create this framework or archetype for what we think is ineffective software engineer for example or account manager. Reframing that by providing access and opportunity is showcase to people that the image that we have is not always the image that we want to portray, is critical. Next then we focus heavily on the idea of the candidate, so candidate experience. Deep diving into understanding key talent acquisition measures as well as key HR practices that will allow for us to create the best experience, moves us forward in that regard. But then finally, and this is where we get to the whole global perspective. Is the idea of the employee. Creating a nurturing community where the idea of psychological safety is not only bolstered but ensuring that your community feels empowered to the idea of inclusion. Making sure inclusion is not just a seat at the table but rather a voice in the conversation that can be actioned upon. >> So I want to dig into that a little bit, this voice in the conversation. Before the cameras were rolling you were talking about these very difficult candid conversations that employees at same have. Tell our viewers a little bit more about that. >> Yeah, so I think one of the greatest challenges across the tech industry and at Snap as well is the idea of referral networks. The tech industry on its own right has grown so greatly out of referral networks. People that you have worked with perviously, people that have the same academic or pedagogical experience as you. The problem with that is, the traditional network analysis would seem to let us know that you often refer people who look like you, or come from a similar internal dimension background as yourself. In a community that's largely rooted in a dominated discourse by white or Asian males. That means that we're continuing to perpetuate that exact same type of rhetoric. >> Rebecca: That's who you're recruiting. >> Exactly. And so then idea of getting more women or communities of color involved in that space can often be distorted. So that remains a challenge that we as a company as well as the tech industry need to overcome is understanding; one, how do we encourage more diverse referrals over time. But then two, creating an ecosystem where this seems natural and not like an artificial standard. >> Okay, so how do you do it? I mean that we've pinpointed the problem and it absolutely is a problem, but what are the kinds of things that Snap is doing to improve the referral process? >> So it's the idea of being innovative by design. One thing that's unique about Snap in particular is that we are an LA-based company. >> So based out of Venice Beach and Santa Monica, California. We don't face a lot of the core challenges that we see in Silicon Valley. And as a result have the opportunity to be more innovative in our approach. As a result when we look to referral networks in particular. One thing that Snap has focused on is the idea of diversity recruiting as a core pillar or tenant of all of our employee research groups. Not only do they join us to attend conferences like Grace Hopper, like the National Society of Black Engineers. But we actually do sourcing jams. Where we sit down with them and mine their networks. Either on LinkedIn-- >> Rebecca: Sourcing jams? >> Yes >> Rebecca: I love it. >> Yes Either on LinkedIn or GitHub or any of the various professional networking sites that they work on. Or technical networking sites to find out who are great talents that they've worked with before. >> Who do you know? Who can join us? >> Exactly. And what's more significant than that, is creating a sense of empowerment where we actually having them reach out to their network as opposed to a recruiter. This creates more of a warm and welcoming environment for the candidate. Where the idea of being a simple passive candidate is further explored by activating them to showcase how your experience has been great. >> And how are you also ensuring that the experience at Snap is great, particularly for women and people of color? >> Yes, so one area is our employee resource group. So we have a couple, so Lady Chilla is of course what I am wearing today. But Snap Noir for the black community. Snap Pride for the LGTBQ plus community and Low Snaps for the Latin X community. >> Rebecca: How big is Snap, we should just-- >> Yeah, about 3,000 people globally. >> Okay, 3,000. Okay, wow. >> And so one of the exciting things that we do is ERG that. So it's where we bring all of our employee resource groups together and they hold massive events every single quarter. To encourage other communities that are either allies or individuals of the sociological out group to understand what they do. But this deploys in so many different ways. In June, for Pride for example, we held drag bingo. Where our LGTBQ plus community participated. In March, we did a whole series of events celebrating women in engineering, women in sales, and women in media that resulted in a large expanse of events allowing for people to come in and learn about, not only the female experience more broadly, but particularly at Snap and some of the great endeavors that they're working on. >> And I know you are also working with other organizations like Girls Who Code, Women Who Code, Made with Code. Can you tell the viewers a little bit more about Snap's involvement. >> 100% Made with Code is one of the most exciting projects that I've had the opportunity to work on. It was for me personally this great combination of working with my previous employer Google, and Snap. So Google's Made with Code project is an idea that started to empower teen girls to code, ages 13 to 18 primarily. What they found is was that's exactly the same demographic that primarily uses our product. And so about three months ago, we decided to come together to launch an imitative where we'd have teen girls make geofilters, one of Snap's core products. The project actually launched one week ago, and teen girls are using Blocky technology to actually go about creating their own geofilters. And then writing a 100 word personal statement defining what their vision for the future of technology is. I'm personally exciting to say after checking the numbers this morning, more than 22,000 girls have already submitted responses to participate. And they will culminate in an event, November 1 through 3. Where we will take the top five finalists to TED Women in New Orleans. To not only showcase women who have done incredible things in the past and present. But also showcase their work at participating in this competition, as the women of technology for the future. >> Rebecca: And the next generation. >> Exactly. >> So we're running out of time here, but I want to just talk finally about the headlines. It's very depressing, you know the Google Manifesto, the sexism that we've seen against women. The racism in the industry. These are are-- we don't want to talk about it at this celebration of computing because we want to focus on the positives. And yet, where do you feel, particularly because you have worked at large tech companies, on these issues for a while now? >> Not facing challenges head on is going to be the greatest threat to the tech industry. The idea of avoiding conversation and avoiding sheer communication of these challenging issues will continue to raise-- >> Rebecca: And ignoring the bad behavior. >> Exactly, and it results in negative rhetoric that inherently put these communities out of wanting to work in this specific industry. But arguably given that technology not only represents the face of the future but how every single product and entity is made for the future, we have to include individuals. Everyone often wants to highlight the McKinsey study from Diversity Matters. Highlighting all of these great ways of diversity impacting business, but we need to look at it in addition from an ethic standpoint. The idea that technology represents how we are building our future. Leaving entire communities out of that primarily focusing on people of color and women, will result in a space where these communities will never have access, opportunity and thus employment to exist in this space. Being able to attack these issues head on, address the bad behavior, highlight what the potential implication is step one. Step two though is being proactive in everything that we're doing, to attempt to ameliorate that from the beginning. You'll notice one thing that's very different about Snap's diversity strategy is we seek to build infrastructure first, then focus on talent acquisition. Once we can ensure that communities of color and women are entering a space that is psychologically safe, open, and inviting. Then we can focus on how we're bringing in talent effectively so that the idea of retention and advancement is not an afterthought but rather top of mind. >> Right, because you can't recruit them if they haven't had the opportunities to begin with. >> Exactly, and that's what Snap often upholds the value of the idea that diversity is our determination, while inclusion is our imperative. >> Jarvis, I love it. >> Thank you so much. >> This has been really fun talking to you. >> Thank you. >> We will have more from Orlando, Florida at the Grace Hopper Celebration of Women in Computing just after this. (upbeat music)

Published Date : Oct 12 2017

SUMMARY :

brought to you by Silicon Angle Media. We're joined by Jarvis Sam, he is the manager of global I'm really happy to be here. Rosie the Riveter shirt, we've got these Rosie the Riveter, of course powerful in her own right. and then these spectacles are...? to be featured on your phone, using Bluetooth technology Rebecca: So, I love it, these are the recruiting tactics: So what do you at Snap? What that includes is analyzing the diversity framework Rebecca: K-12? Have specific opportunities in the space to be impactful. Before the cameras were rolling you were talking people that have the same academic the tech industry need to overcome is understanding; So it's the idea of being innovative by design. And as a result have the opportunity to be more of the various professional networking sites Where the idea of being a simple passive candidate and Low Snaps for the Latin X community. Okay, 3,000. And so one of the exciting things that we do is ERG that. And I know you are also working with other organizations that I've had the opportunity to work on. The racism in the industry. the greatest threat to the tech industry. talent effectively so that the idea of retention if they haven't had the opportunities to begin with. the value of the idea that diversity is our determination, at the Grace Hopper Celebration of Women in Computing

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

OdiePERSON

0.99+

Mitzi ChangPERSON

0.99+

RubaPERSON

0.99+

Rebecca KnightPERSON

0.99+

Lisa MartinPERSON

0.99+

CiscoORGANIZATION

0.99+

AliciaPERSON

0.99+

Peter BurrisPERSON

0.99+

JoshPERSON

0.99+

ScottPERSON

0.99+

JarvisPERSON

0.99+

Rick EchevarriaPERSON

0.99+

2012DATE

0.99+

RebeccaPERSON

0.99+

BrucePERSON

0.99+

AcronisORGANIZATION

0.99+

JohnPERSON

0.99+

InfosysORGANIZATION

0.99+

ThomasPERSON

0.99+

JeffPERSON

0.99+

DeloitteORGANIZATION

0.99+

AnantPERSON

0.99+

MaheshPERSON

0.99+

Scott ShadleyPERSON

0.99+

AdamPERSON

0.99+

EuropeLOCATION

0.99+

Alicia HalloranPERSON

0.99+

Savannah PetersonPERSON

0.99+

Nadir SalessiPERSON

0.99+

Miami BeachLOCATION

0.99+

Mahesh RamPERSON

0.99+

Dave VolantePERSON

0.99+

Pat GelsingerPERSON

0.99+

January of 2013DATE

0.99+

AmericaLOCATION

0.99+

Amazon Web ServicesORGANIZATION

0.99+

Bruce BottlesPERSON

0.99+

John FurrierPERSON

0.99+

GoogleORGANIZATION

0.99+

Asia PacificLOCATION

0.99+

MarchDATE

0.99+

David CopePERSON

0.99+

AmazonORGANIZATION

0.99+

Rick EchavarriaPERSON

0.99+

AmazonsORGANIZATION

0.99+

John WallsPERSON

0.99+

ChinaLOCATION

0.99+

July of 2017DATE

0.99+

AWSORGANIZATION

0.99+

CatalinaLOCATION

0.99+

NewportLOCATION

0.99+

ZapposORGANIZATION

0.99+

NGD SystemsORGANIZATION

0.99+

50 terabytesQUANTITY

0.99+

Lie 3, Today’s Modern Data Stack Is Modern | Starburst


 

(energetic music) >> Okay, we're back with Justin Borgman, CEO of Starburst, Richard Jarvis is the CTO of EMIS Health, and Teresa Tung is the cloud first technologist from Accenture. We're on to lie number three. And that is the claim that today's "Modern Data Stack" is actually modern. So (chuckles), I guess that's the lie. Or, is that it's not modern. Justin, what do you say? >> Yeah, I think new isn't modern. Right? I think it's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually, are exactly the same as what we've had for 40 years. Rather than Teradata, you have Snowflake. Rather than Informatica, you have Fivetran. So, it's the same general stack, just, y'know, a cloud version of it. And I think a lot of the challenges that have plagued us for 40 years still maintain. >> So, let me come back to you Justin. Okay, but there are differences, right? You can scale. You can throw resources at the problem. You can separate compute from storage. You really, there's a lot of money being thrown at that by venture capitalists, and Snowflake you mentioned, its competitors. So that's different. Is it not? Is that not at least an aspect of modern dial it up, dial it down? So what do you say to that? >> Well, it is. It's certainly taking, y'know what the cloud offers and taking advantage of that. But it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data's still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same structural constraints that exist with the old enterprise data warehouse model on-preem still exist. Just yes, a little bit more elastic now because the cloud offers that. >> So Teresa, let me go to you, 'cause you have cloud-first in your title. So, what's say you to this conversation? >> Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud as we know it, maybe data lake, data warehouse in the central place, that's not even how the cloud providers are looking at it. They have use query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our- the future goes, right? That's going to very much fall the same thing. There was going to be more edge. There's going to be more on-premise, because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers, right? So, there's a lot of reasons why the modern, I guess, the next modern generation of the data stack needs to be much more federated. >> Okay, so Richard, how do you deal with this? You've obviously got, you know, the technical debt, the existing infrastructure, it's on the books. You don't want to just throw it out. A lot of conversation about modernizing applications, which a lot of times is, you know, of microservices layer on top of legacy apps. How do you think about the Modern Data Stack? >> Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just 'cause you can scale CPU and storage doesn't mean you can get more people to use your data to generate you more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business not just the technology itself. >> Well thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five seven years cloud obviously has given a different pricing model. Derisked experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm taking away that that's not enough. Based on what Richard just said, the modern data stack has to serve the business and enable the business to build data products. I buy that. I'm you a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about you know, the, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and, and principles there >> Of how it should look like or, or how >> Yeah. What it should be? >> Yeah. Yeah. Well, I think, you know, in, in Theresa mentioned this in in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I certainly agree with that. So by no means, are we suggesting that, you know Snowflake or what Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of of idealism. They had the benefit of starting with a clean slate that does not reflect the vast majority of enterprises. And even those companies, as they grow up, mature out of that ideal state, they go by a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really future proof yourself from the inevitable change that you will you won't encounter over time. >> So thank you. So Theresa, based on what Justin just said, I I might take away there is it's inclusive whether it's a data mart, data hub, data lake, data warehouse, just a node on the mesh. Okay. I get that. Does that include Theresa on, on Preem data? Obviously it has to. What are you seeing in terms of the ability to, to take that data mesh concept on Preem I mean most implementations I've seen and data mesh, frankly really aren't, you know adhering to the philosophy there. Maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing, HelloFresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >> I mean, I think it's a killer case for data mesh. The fact that you have valuable data sources on Preem, and then yet you still want to modernize and take the best of cloud. Cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both world. You can start using the data products on Preem, or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or or maybe just tapping into better analytics to get better insights, right? So you're going to be able to stretch and really have the best of both worlds. That, again, going back to Richard's point, that is meaningful by the business. Not everything has to have that one size fits all set a tool. >> Okay. Thank you. So Richard, you know, talking about data as product wonder if we could give us your perspectives here what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >> So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients, demographics about their their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight and in any business that's clearly not a desirable outcome but when that insight is so critical as it might be in healthcare or some security settings you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured managed way, even if that data comes from a variety of different sources in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to researchers. >> So that data product through whatever APIs is is accessible, it's discoverable, but it's obviously got to be governed as well. You mentioned appropriately provided to internally. >> Yeah. >> But also, you know, external folks as well. So the, so you've, you've architected that capability today? >> We have and because the data is standard it can generate value much more quickly and we can be sure of the security and value that that's providing, because the data product isn't just about formatting the data into the correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context what does this data mean, and what does it mean to process this data for a particular use case. >> Yeah, it makes sense. It's got the context. If the, if the domains on the data, you know you got to cut through a lot of the, the centralized teams, the technical teams that that data agnostic, they don't really have that context. All right, let's end. Justin. How does Starburst fit into this modern data stack? Bring us home. >> Yeah. So I think for us it's really providing our customers with, you know the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know and optionality provides the ability to reduce costs store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know incorporated into our offering as well you can really create and, and curate, you know data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know model and make that an appropriate compliment to you know, the modern data stack that people have today. >> Excellent. Hey, I want to thank Justin, Teresa, and Richard for joining us today. You guys are great. Big believers in the in the data mesh concept, and I think, you know we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are going to be available on the cube.net for on demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and they have awesome resources. Lots of data mesh conversations over there and really good stuff in, in the resource section. So check that out. Thanks for watching the "Data Doesn't Lie... or Does It?" made possible by Starburst data. This is Dave Vellante for the Cube, and we'll see you next time. (upbeat music)

Published Date : Aug 22 2022

SUMMARY :

And that is the claim It's the cloud data stack, So, let me come back to you Justin. that the cloud data warehouses out there So Teresa, let me go to you, So the centralized cloud as we know it, it's on the books. the first thing to say is of the modern data stack. from the inevitable change that you will What's the answer to that Theresa? So the mesh allows you to in the modern data stack? or having the data not presented So that data product But also, you know, around the data to say in a on the data, you know enable the data mesh, you know in the data mesh concept,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Teresa TungPERSON

0.99+

JustinPERSON

0.99+

TeresaPERSON

0.99+

Dave VellantePERSON

0.99+

Justin BorgmanPERSON

0.99+

Richard JarvisPERSON

0.99+

40 yearsQUANTITY

0.99+

TheresaPERSON

0.99+

StarburstORGANIZATION

0.99+

JPMCORGANIZATION

0.99+

AWSORGANIZATION

0.99+

InformaticaORGANIZATION

0.99+

AccentureORGANIZATION

0.99+

both worldsQUANTITY

0.99+

todayDATE

0.99+

EMIS HealthORGANIZATION

0.99+

first technologistQUANTITY

0.98+

one elementQUANTITY

0.98+

bothQUANTITY

0.98+

first thingQUANTITY

0.98+

five seven yearsQUANTITY

0.98+

oneQUANTITY

0.97+

TeradataORGANIZATION

0.97+

OracleORGANIZATION

0.97+

cube.netOTHER

0.96+

MongoORGANIZATION

0.95+

one sizeQUANTITY

0.93+

CubeORGANIZATION

0.92+

PreemTITLE

0.92+

both worldQUANTITY

0.91+

one placeQUANTITY

0.91+

Today’sTITLE

0.89+

FivetranORGANIZATION

0.86+

Data Doesn't Lie... or Does It?TITLE

0.86+

single locationQUANTITY

0.85+

HelloFreshORGANIZATION

0.84+

first placeQUANTITY

0.83+

CEOPERSON

0.83+

LieTITLE

0.82+

single sourceQUANTITY

0.79+

firstQUANTITY

0.75+

one nodeQUANTITY

0.72+

SnowflakeORGANIZATION

0.66+

SnowflakeTITLE

0.66+

threeQUANTITY

0.59+

CTOPERSON

0.53+

Data StackTITLE

0.53+

RedshiftTITLE

0.52+

starburst.ioOTHER

0.48+

COVIDTITLE

0.37+

Lie 2, An Open Source Based Platform Cannot Give You Performance and Control | Starburst


 

>>We're back with Jess Borgman of Starburst and Richard Jarvis of EVAs health. Okay. We're gonna get into lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you'll, you'll never get performance because you need to be column. You need to store data in a column format. And then, you know, column formats were introduced to, to data lake. You have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again, like iceberg and Delta and hoote that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a quote from, you know, Kurt Monash many years ago where he said, you know, it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a lie and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, the clothes is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect and what you don't want to end up done is backed itself into a corner that then prevents it from innovating. So if you have chosen the technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Jess, let me play devil's advocate here a little bit, and I've talked to JAK about this and you know, obviously her vision is there's an open source that, that data mesh is open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to hit and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in hit back then. And I think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, it's interesting remind of when I, you know, I see the, the gas price, the TSR gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you, you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up, you mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down. Cause I thought it was an amazing quote. He said, it buys us the ability to be unsure of the future. That that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use spark to train a machine learning model and you wanna use Starbust to query via sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you and, and locks you in. >>So I, I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers there, but, but a lot of Oracle customers and they, you know, they'll admit yeah, you know, the Jammin us on price and the license cost, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast an ROI? >>I think the answer to that is it can depend a bit. It depends on your business's skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run at enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud-based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is always a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like PJI Mohan said, you know what? I think it's actually harder to play in the data engineering. So IE, it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you command a 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years and in the world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse, it, it fit in this, in this world. >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a data lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understanding holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access controls so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern or is it the same wine new bottle when it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage.

Published Date : Aug 22 2022

SUMMARY :

give you the performance and control that you can get with a proprietary We got, you know, largely over the performance hurdle, you know, more recently people will say, And I remember a quote from, you know, Kurt Monash many years ago where he said, you know, it is an evolving, you know, spectrum, but, but from your perspective, in a, a direction, slightly different to what people expect and what you don't want to end up So Jess, let me play devil's advocate here a little bit, and I've talked to JAK about this and you know, And I think similarly, you know, being able to connect to an external table that lives in an open data format, Well, it's interesting remind of when I, you know, I see the, the gas price, the TSR gas price And I think, you know, I loved what Richard said. you know, the Jammin us on price and the license cost, but we do get value out And so for those different teams, they can get to an you know, the data brick snowflake, you know, thing is always a lot of fun for analysts like me. So the advice that I saw years ago was if you have open source technologies, years and in the world of Oracle, you know, normally it's the staff, to discover and consume via, you know, the creation of data products as well. data model that we see emerging and the so-called modern data stack is

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Jess BorgmanPERSON

0.99+

RichardPERSON

0.99+

20 centsQUANTITY

0.99+

sixQUANTITY

0.99+

JustinPERSON

0.99+

Richard JarvisPERSON

0.99+

OracleORGANIZATION

0.99+

Kurt MonashPERSON

0.99+

20%QUANTITY

0.99+

JessPERSON

0.99+

pythonsTITLE

0.99+

seven yearsQUANTITY

0.99+

TodayDATE

0.99+

JavasTITLE

0.99+

TeradataORGANIZATION

0.99+

VMwareORGANIZATION

0.98+

millionsQUANTITY

0.98+

EVAsORGANIZATION

0.98+

JAKPERSON

0.98+

StarburstORGANIZATION

0.98+

bothQUANTITY

0.97+

10DATE

0.97+

12 years agoDATE

0.97+

StarbustTITLE

0.96+

todayDATE

0.95+

Apache icebergORGANIZATION

0.94+

GoogleORGANIZATION

0.93+

12 yearsQUANTITY

0.92+

single pointQUANTITY

0.92+

two worldsQUANTITY

0.92+

10QUANTITY

0.91+

HuduLOCATION

0.91+

UnixTITLE

0.9+

one thingQUANTITY

0.87+

trillions of recordsQUANTITY

0.83+

first data lakeQUANTITY

0.82+

StarburstTITLE

0.8+

PJIORGANIZATION

0.79+

years agoDATE

0.76+

IETITLE

0.75+

Lie 2TITLE

0.72+

many years agoDATE

0.72+

over a couple timesQUANTITY

0.7+

TCOORGANIZATION

0.7+

ParqueORGANIZATION

0.67+

Number twoQUANTITY

0.64+

KubernetesORGANIZATION

0.59+

a decadeQUANTITY

0.58+

plus yearsDATE

0.57+

AzureTITLE

0.57+

S3TITLE

0.55+

DeltaTITLE

0.54+

20QUANTITY

0.49+

lastDATE

0.48+

MohanPERSON

0.44+

ORCORGANIZATION

0.27+

Lie 1, The Most Effective Data Architecture Is Centralized | Starburst


 

(bright upbeat music) >> In 2011, early Facebook employee and Cloudera co-founder Jeff Hammerbacher famously said, "The best minds of my generation are thinking about how to get people to click on ads, and that sucks!" Let's face it. More than a decade later, organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile and data-driven enterprise. What does that even mean, you ask? Well, it means that everyone in the organization has the data they need when they need it in a context that's relevant to advance the mission of an organization. Now, that could mean cutting costs, could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data warehouses, data marts, data hubs, and yes even data lakes were broken and left us wanting for more. Welcome to The Data Doesn't Lie... Or Does It? A series of conversations produced by theCUBE and made possible by Starburst Data. I'm your host, Dave Vellante, and joining me today are three industry experts. Justin Borgman is the co-founder and CEO of Starburst, Richard Jarvis is the CTO at EMIS Health, and Teresa Tung is cloud first technologist at Accenture. Today, we're going to have a candid discussion that will expose the unfulfilled, and yes, broken promises of a data past. We'll expose data lies: big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth inevitable? Will the data warehouse ever have feature parity with the data lake or vice versa? Is the so-called modern data stack simply centralization in the cloud, AKA the old guards model in new cloud close? How can organizations rethink their data architectures and regimes to realize the true promises of data? Can and will an open ecosystem deliver on these promises in our lifetimes? We're spanning much of the Western world today. Richard is in the UK, Teresa is on the West Coast, and Justin is in Massachusetts with me. I'm in theCUBE studios, about 30 miles outside of Boston. Folks, welcome to the program. Thanks for coming on. >> Thanks for having us. >> Okay, let's get right into it. You're very welcome. Now, here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >> Yeah, definitely a lie. My first startup was a company called Hadapt, which was an early SQL engine for IDU that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem, data in the cloud. Those companies were acquiring other companies and inheriting their data architecture. So despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >> So Richard, from a practitioner's point of view, what are your thoughts? I mean, there's a lot of pressure to cut cost, keep things centralized, serve the business as best as possible from that standpoint. What does your experience show? >> Yeah, I mean, I think I would echo Justin's experience really that we as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in a platform that's close to data experts people who really understand healthcare data from pharmacies or from doctors. And so, although if you were starting from a greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that businesses just don't grow up like that. And it's just really impossible to get that academic perfection of storing everything in one place. >> Teresa, I feel like Sarbanes-Oxley have kind of saved the data warehouse, right? (laughs) You actually did have to have a single version of the truth for certain financial data, but really for some of those other use cases I mentioned, I do feel like the industry has kind of let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralize? >> I think you got to have centralized governance, right? So from the central team, for things like Sarbanes-Oxley, for things like security, for certain very core data sets having a centralized set of roles, responsibilities to really QA, right? To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise, you're not going to be able to scale, right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately, you're going to collaborate with your partners. So partners that are not within the company, right? External partners. We're going to see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >> So Justin, you guys last, jeez, I think it was about a year ago, had a session on data mesh. It was a great program. You invited Zhamak Dehghani. Of course, she's the creator of the data mesh. One of our fundamental premises is that you've got this hyper specialized team that you've got to go through if you want anything. But at the same time, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess, a question for you Richard. How do you deal with that? Do you organize so that there are a few sort of rock stars that build cubes and the like or have you had any success in sort of decentralizing with your constituencies that data model? >> Yeah. So we absolutely have got rockstar data scientists and data guardians, if you like. People who understand what it means to use this data, particularly the data that we use at EMIS is very private, it's healthcare information. And some of the rules and regulations around using the data are very complex and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a consulting type experience from a set of rock stars to help a more decentralized business who needs to understand the data and to generate some valuable output. >> Justin, what do you say to a customer or prospect that says, "Look, Justin. I got a centralized team and that's the most cost effective way to serve the business. Otherwise, I got duplication." What do you say to that? >> Well, I would argue it's probably not the most cost effective, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you for many, many years to come. I think that's the story at Oracle or Teradata or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams, as much as they are experts in the technology, they don't necessarily understand the data itself. And this is one of the core tenets of data mesh that Zhamak writes about is this idea of the domain owners actually know the data the best. And so by not only acknowledging that data is generally decentralized, and to your earlier point about Sarbanes-Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for those laws to be compliant. But I think the reality is the data mesh model basically says data's decentralized and we're going to turn that into an asset rather than a liability. And we're going to turn that into an asset by empowering the people that know the data the best to participate in the process of curating and creating data products for consumption. So I think when you think about it that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two models comparing and contrasting. >> So do you think the demise of the data warehouse is inevitable? Teresa, you work with a lot of clients. They're not just going to rip and replace their existing infrastructure. Maybe they're going to build on top of it, but what does that mean? Does that mean the EDW just becomes less and less valuable over time or it's maybe just isolated to specific use cases? What's your take on that? >> Listen, I still would love all my data within a data warehouse. I would love it mastered, would love it owned by a central team, right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date, I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's going to be a new technology that's going to emerge that we're going to want to tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this new mesh layer that still takes advantage of the things I mentioned: the data products in the systems that are meaningful today, and the data products that actually might span a number of systems. Maybe either those that either source systems with the domains that know it best, or the consumer-based systems or products that need to be packaged in a way that'd be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >> So, Richard, let me ask you. Take Zhamak's principles back to those. You got the domain ownership and data as product. Okay, great. Sounds good. But it creates what I would argue are two challenges: self-serve infrastructure, let's park that for a second, and then in your industry, one of the most regulated, most sensitive, computational governance. How do you automate and ensure federated governance in that mesh model that Teresa was just talking about? >> Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to centralize the security and the governance of the data. And I think although a data warehouse makes that very simple 'cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at EMIS is we have a single security layer that sits on top of our data mesh, which means that no matter which user is accessing which data source, we go through a well audited, well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is audited in a very kind of standard way regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible, understanding where your source of truth is and securing that in a common way is still a valuable approach, and you can do it without having to bring all that data into a single bucket so that it's all in one place. And so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform, and ensuring that only data that's available under GDPR and other regulations is being used by the data users. >> Yeah. So Justin, we always talk about data democratization, and up until recently, they really haven't been line of sight as to how to get there, but do you have anything to add to this because you're essentially doing analytic queries with data that's all dispersed all over. How are you seeing your customers handle this challenge? >> Yeah, I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, the people who know the data the best, to create data as a product ultimately to be consumed. And we try to represent that in our product as effectively, almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization, and then you can start to consume them as you'd like. And so really trying to build on that notion of data democratization and self-service, and making it very easy to discover and start to use with whatever BI tool you may like or even just running SQL queries yourself. >> Okay guys, grab a sip of water. After the short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence. Keep it right there. (bright upbeat music)

Published Date : Aug 22 2022

SUMMARY :

has the data they need when they need it Now, here's the first lie. has proven that to be a lie. of pressure to cut cost, and all of the tooling have kind of saved the data So from the central team, for that build cubes and the like and to generate some valuable output. and that's the most cost effective way is that the reality is those of the data warehouse is inevitable? and making sure that the mesh one of the most regulated, most sensitive, and processes that you put as to how to get there, aspect of the answer to that. or open platforms are the best path

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

RichardPERSON

0.99+

Justin BorgmanPERSON

0.99+

JustinPERSON

0.99+

Richard JarvisPERSON

0.99+

Teresa TungPERSON

0.99+

Jeff HammerbacherPERSON

0.99+

TeresaPERSON

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

MassachusettsLOCATION

0.99+

Zhamak DehghaniPERSON

0.99+

UKLOCATION

0.99+

2011DATE

0.99+

two challengesQUANTITY

0.99+

HadaptORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

StarburstORGANIZATION

0.99+

two modelsQUANTITY

0.99+

thousandsQUANTITY

0.99+

BostonLOCATION

0.99+

FacebookORGANIZATION

0.99+

Sarbanes-OxleyORGANIZATION

0.99+

EachQUANTITY

0.99+

first lieQUANTITY

0.99+

AccentureORGANIZATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.98+

todayDATE

0.98+

SQLTITLE

0.98+

Starburst DataORGANIZATION

0.98+

EMIS HealthORGANIZATION

0.98+

ClouderaORGANIZATION

0.98+

oneQUANTITY

0.98+

first startupQUANTITY

0.98+

one placeQUANTITY

0.98+

about 30 milesQUANTITY

0.98+

OneQUANTITY

0.97+

More than a decade laterDATE

0.97+

EMISORGANIZATION

0.97+

single bucketQUANTITY

0.97+

first technologistQUANTITY

0.96+

three industry expertsQUANTITY

0.96+

single toolQUANTITY

0.96+

single versionQUANTITY

0.94+

ZhamakPERSON

0.92+

theCUBEORGANIZATION

0.91+

single sourceQUANTITY

0.9+

West CoastLOCATION

0.87+

one vendorQUANTITY

0.84+

single security layerQUANTITY

0.81+

about a year agoDATE

0.75+

IDUORGANIZATION

0.68+

IsTITLE

0.65+

a secondQUANTITY

0.64+

EDWORGANIZATION

0.57+

examplesQUANTITY

0.55+

echoCOMMERCIAL_ITEM

0.54+

twofoldQUANTITY

0.5+

LieTITLE

0.35+

Starburst The Data Lies FULL V2b


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data-driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting cost could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data marts, data hubs, and yes, even data lakes were broken and left us wanting from more welcome to the data doesn't lie, or doesn't a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have featured parody with the data lake or vice versa is the so-called modern data stack, simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for hit that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience show? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know, right. You actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like star Oxley, for things like security for certainly very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited Jamma, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come. I think that's the story at Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenants of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about SAR, brain Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but what does that mean? Does that mean the E D w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems, maybe either those that either source systems for the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got to, you know, domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue are two, you know, challenges, self-serve infrastructure let's park that for a second. And then in your industry, the one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at emus is we have a single security layer that sits on top of our data match, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin, I mean, Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, do an analytic queries and with data that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah. I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively a almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself, >>Okay. G guys grab a sip of water. After this short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence, keep it right there. >>Your company has more data than ever, and more people trying to understand it, but there's a problem. Your data is stored across multiple systems. It's hard to access and that delays analytics and ultimately decisions. The old method of moving all of your data into a single source of truth is slow and definitely not built for the volume of data we have today or where we are headed while your data engineers spent over half their time, moving data, your analysts and data scientists are left, waiting, feeling frustrated, unproductive, and unable to move the needle for your business. But what if you could spend less time moving or copying data? What if your data consumers could analyze all your data quickly? >>Starburst helps your teams run fast queries on any data source. We help you create a single point of access to your data, no matter where it's stored. And we support high concurrency, we solve for speed and scale, whether it's fast, SQL queries on your data lake or faster queries across multiple data sets, Starburst helps your teams run analytics anywhere you can't afford to wait for data to be available. Your team has questions that need answers. Now with Starburst, the wait is over. You'll have faster access to data with enterprise level security, easy connectivity, and 24 7 support from experts, organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact our Trino experts to get started. >>We're back with Jess Borgman of Starburst and Richard Jarvis of EVAs health. Okay, we're gonna get to lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you you'll never get performance because you need to be column there. You need to store data in a column format. And then, you know, column formats we're introduced to, to data apes, you have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again like iceberg and Delta and Hodi that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a line and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, look closed is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect. And what you don't want to end up is done is backed itself into a corner that then prevents it from innovating. So if you have chosen a technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage and your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, obviously her vision is there's an open source that, that the data meshes open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but to come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well, I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to Haddo and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in Haddo back then. And I think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, that's interesting reminded when I, you know, I see the, the gas price, the tees or gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up, you mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down. Cause I thought it was an amazing quote. He said, it buys us the ability to be unsure of the future. Th that that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use spark to train a machine learning model and you want to use Starbust to query via sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you in and locks you in. >>So I, I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers, but, but a lot of Oracle customers and they, you know, they'll admit, yeah, you know, they're jamming us on price and the license cost they give, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast in ROI? >>I think the answer to that is it can depend a bit. It depends on your businesses skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run at enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud-based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like PJI Mohan said, you know what? I think it's actually harder to play in the data engineering. So I E it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you commander 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities, because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years. And in world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse fit in this, in this world? >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage, or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a deal lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understand it holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access controls so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern, or is it the same wine new bottle? When it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage. >>Your data is capable of producing incredible results, but data consumers are often left in the dark without fast access to the data they need. Starers makes your data visible from wherever it lives. Your company is acquiring more data in more places, more rapidly than ever to rely solely on a data centralization strategy. Whether it's in a lake or a warehouse is unrealistic. A single source of truth approach is no longer viable, but disconnected data silos are often left untapped. We need a new approach. One that embraces distributed data. One that enables fast and secure access to any of your data from anywhere with Starburst, you'll have the fastest query engine for the data lake that allows you to connect and analyze your disparate data sources no matter where they live Starburst provides the foundational technology required for you to build towards the vision of a decentralized data mesh Starburst enterprise and Starburst galaxy offer enterprise ready, connectivity, interoperability, and security features for multiple regions, multiple clouds and everchanging global regulatory requirements. The data is yours. And with Starburst, you can perform analytics anywhere in light of your world. >>Okay. We're back with Justin Boardman. CEO of Starbust Richard Jarvis is the CTO of EMI health and Theresa tongue is the cloud first technologist from Accenture. We're on July number three. And that is the claim that today's modern data stack is actually modern. So I guess that's the lie it's it is it's is that it's not modern. Justin, what do you say? >>Yeah. I mean, I think new isn't modern, right? I think it's the, it's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually are exactly the same as what we've had for 40 years, rather than Terra data. You have snowflake rather than Informatica you have five trend. So it's the same general stack, just, you know, a cloud version of it. And I think a lot of the challenges that it plagued us for 40 years still maintain. >>So lemme come back to you just, but okay. But, but there are differences, right? I mean, you can scale, you can throw resources at the problem. You can separate compute from storage. You really, you know, there's a lot of money being thrown at that by venture capitalists and snowflake, you mentioned it's competitors. So that's different. Is it not, is that not at least an aspect of, of modern dial it up, dial it down. So what, what do you say to that? >>Well, it, it is, it's certainly taking, you know, what the cloud offers and taking advantage of that, but it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same sort of structural constraints that exist with the old enterprise data warehouse model OnPrem still exist just yes, a little bit more elastic now because the cloud offers that. >>So Theresa, let me go to you cuz you have cloud first in your, in your, your title. So what's what say you to this conversation? >>Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, that's not even how the cloud providers are looking at it. They have news query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our, the future goes, right, that that's gonna very much fall the same thing. There was gonna be more edge. There's gonna be more on premise because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers. Right? So there's a lot of reasons why the modern, I guess, the next modern generation of the data staff needs to be much more federated. >>Okay. So Richard, how do you deal with this? You you've obviously got, you know, the technical debt, the existing infrastructure it's on the books. You don't wanna just throw it out. A lot of, lot of conversation about modernizing applications, which a lot of times is a, you know, a microservices layer on top of leg legacy apps. How do you think about the modern data stack? >>Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just cuz you can scale CPU and storage doesn't mean you can get more people to use your data, to generate you more, more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data, but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams, we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business, not just the technology itself. >>Well thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five, seven years cloud obviously has given a different pricing model. De-risked experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm, I'm taking away that that's not enough based on what Richard just said. The modern data stack has to serve the business and enable the business to build data products. I, I buy that. I'm a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about, you know, paying, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and, and principles there >>Of, of how it should look like or, or how >>It's yeah. What it should be. >>Yeah. Yeah. Well, I think, you know, in, in Theresa mentioned this in, in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I, I certainly agree with that. So by no means, are we suggesting that, you know, snowflake or Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's, it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of, of idealism. They had the benefit of it was starting with a clean slate that does not reflect the vast majority of enterprises. >>And even those companies, as they grow up mature out of that ideal state, they go buy a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding, you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really Futureproof yourself from the inevitable change that you will, you won't encounter over time. >>So thank you. So there, based on what Justin just said, I, my takeaway there is it's inclusive, whether it's a data Mar data hub, data lake data warehouse, it's a, just a node on the mesh. Okay. I get that. Does that include there on Preem data? O obviously it has to, what are you seeing in terms of the ability to, to take that data mesh concept on Preem? I mean, most implementations I've seen in data mesh, frankly really aren't, you know, adhering to the philosophy. They're maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing. Hello, fresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >>I mean, I, I think it's a killer case for data. Me, the fact that you have valuable data sources, OnPrem, and then yet you still wanna modernize and take the best of cloud cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both worlds. You can start using the data products on-prem or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or, or maybe just tap into better analytics to get better insights, right? So you're gonna be able to stretch and really have the best of both worlds. That, again, going back to Richard's point, that is meaningful by the business. Not everything has to have that one size fits all set a tool. >>Okay. Thank you. So Richard, you know, talking about data as product, wonder if we could give us your perspectives here, what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >>So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients' demographics about their, their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight. And in any business, that's clearly not a desirable outcome, but when that insight is so critical, as it might be in healthcare or some security settings, you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured, managed way, even if that data comes from a variety of different sources in, in, in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to researchers. >>So that data product through whatever APIs is, is accessible, it's discoverable, but it's obviously gotta be governed as well. You mentioned you, you appropriately provided to internally. Yeah. But also, you know, external folks as well. So the, so you've, you've architected that capability today >>We have, and because the data is standard, it can generate value much more quickly and we can be sure of the security and, and, and value that that's providing because the data product isn't just about formatting the data into the correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context, what does this data mean? And what does it mean to process this data for a particular use case? >>Yeah, it makes sense. It's got the context. If the, if the domains own the data, you, you gotta cut through a lot of the, the, the centralized teams, the technical teams that, that data agnostic, they don't really have that context. All right. Let's send Justin, how does Starburst fit into this modern data stack? Bring us home. >>Yeah. So I think for us, it's really providing our customers with, you know, the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know, and optionality provides the ability to reduce costs, store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, you can really create and, and curate, you know, data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know, model and make that an appropriate compliment to, you know, the, the, the modern data stack that people have today. >>Excellent. Hey, I wanna thank Justin Theresa and Richard for joining us today. You guys are great. I big believers in the, in the data mesh concept, and I think, you know, we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are gonna be available on the cube.net for on-demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and, and, and they have awesome resources, lots of data mesh conversations over there, and really good stuff in, in the resource section. So check that out. Thanks for watching the data doesn't lie or does it made possible by Starburst data? This is Dave Valante for the cube, and we'll see you next time. >>The explosion of data sources has forced organizations to modernize their systems and architecture and come to terms with one size does not fit all for data management today. Your teams are constantly moving and copying data, which requires time management. And in some cases, double paying for compute resources. Instead, what if you could access all your data anywhere using the BI tools and SQL skills your users already have. And what if this also included enterprise security and fast performance with Starburst enterprise, you can provide your data consumers with a single point of secure access to all of your data, no matter where it lives with features like strict, fine grained, access control, end to end data encryption and data masking Starburst meets the security standards of the largest companies. Starburst enterprise can easily be deployed anywhere and managed with insights where data teams holistically view their clusters operation and query execution. So they can reach meaningful business decisions faster, all this with the support of the largest team of Trino experts in the world, delivering fully tested stable releases and available to support you 24 7 to unlock the value in all of your data. You need a solution that easily fits with what you have today and can adapt to your architecture. Tomorrow. Starbust enterprise gives you the fastest path from big data to better decisions, cuz your team can't afford to wait. Trino was created to empower analytics anywhere and Starburst enterprise was created to give you the enterprise grade performance, connectivity, security management, and support your company needs organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact us to get started.

Published Date : Aug 22 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to the data warehouse ever have featured parody with the data lake or vice versa is So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? although if you were starting from a Greenfield site and you were building something brand new, Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, And you can think of them Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come. But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing that the mesh actually allows you to use all of them. But it creates what I would argue are two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around those do an analytic queries and with data that's all dispersed all over the, how are you seeing your the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of data But what if you could spend less you create a single point of access to your data, no matter where it's stored. give you the performance and control that you can get with a proprietary system. I remember in the very early days, people would say, you you'll never get performance because And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven it is an evolving, you know, spectrum, but, but from your perspective, And what you don't want to end up So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, And I think similarly, you know, being able to connect to an external table that lives in an open data format, Well, that's interesting reminded when I, you know, I see the, the gas price, And I think, you know, I loved what Richard said. not as many te data customers, but, but a lot of Oracle customers and they, you know, And so for those different teams, they can get to an ROI more quickly with different technologies that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts So the advice that I saw years ago was if you have open source technologies, And in world of Oracle, you know, normally it's the staff, easy to discover and consume via, you know, the creation of data products as well. really modern, or is it the same wine new bottle? And with Starburst, you can perform analytics anywhere in light of your world. And that is the claim that today's So it's the same general stack, just, you know, a cloud version of it. So lemme come back to you just, but okay. So a lot of the same sort of structural constraints that exist with So Theresa, let me go to you cuz you have cloud first in your, in your, the data staff needs to be much more federated. you know, a microservices layer on top of leg legacy apps. So I think the stack needs to support a scalable So you think about the past, you know, five, seven years cloud obviously has given What it should be. And I think that's the paradigm shift that needs to occur. data that lives outside of the data warehouse, maybe living in open data formats in a data lake seen in data mesh, frankly really aren't, you know, adhering to So the mesh allows you to have the best of both worlds. So Richard, you know, talking about data as product, wonder if we could give us your perspectives is expecting means that you generate the wrong insight. But also, you know, around the data to say in a very clear business context, It's got the context. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, This is Dave Valante for the cube, and we'll see you next time. You need a solution that easily fits with what you have today and can adapt

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Dave LantaPERSON

0.99+

Jess BorgmanPERSON

0.99+

JustinPERSON

0.99+

TheresaPERSON

0.99+

Justin BorgmanPERSON

0.99+

TeresaPERSON

0.99+

Jeff OckerPERSON

0.99+

Richard JarvisPERSON

0.99+

Dave ValantePERSON

0.99+

Justin BoardmanPERSON

0.99+

sixQUANTITY

0.99+

DaniPERSON

0.99+

MassachusettsLOCATION

0.99+

20 centsQUANTITY

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

JammaPERSON

0.99+

UKLOCATION

0.99+

FINRAORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

Kurt MonashPERSON

0.99+

20%QUANTITY

0.99+

twoQUANTITY

0.99+

fiveQUANTITY

0.99+

JessPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

10QUANTITY

0.99+

AccentureORGANIZATION

0.99+

seven yearsQUANTITY

0.99+

thousandsQUANTITY

0.99+

pythonsTITLE

0.99+

BostonLOCATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.99+

two modelsQUANTITY

0.99+

Zolando ComcastORGANIZATION

0.99+

GemmaPERSON

0.99+

StarbustORGANIZATION

0.99+

JPMCORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

JavasTITLE

0.99+

todayDATE

0.99+

AWSORGANIZATION

0.99+

millionsQUANTITY

0.99+

first lieQUANTITY

0.99+

10DATE

0.99+

12 yearsQUANTITY

0.99+

one placeQUANTITY

0.99+

TomorrowDATE

0.99+

Starburst The Data Lies FULL V1


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data-driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting cost could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data marts, data hubs, and yes, even data lakes were broken and left us wanting from more welcome to the data doesn't lie, or doesn't a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have featured parody with the data lake or vice versa is the so-called modern data stack, simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for hit that was acquired by Teradata. And when I got to Teradata, of course, Teradata is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience show? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know, right. You actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like star Oxley, for things like security for certainly very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited Jamma, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come. I think that's the story at Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenants of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about SAR, brain Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but what does that mean? Does that mean the E D w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's going to be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems, maybe either those that either source systems for the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to use all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got to, you know, domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue are two, you know, challenges, self-serve infrastructure let's park that for a second. And then in your industry, the one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at emus is we have a single security layer that sits on top of our data match, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin, I mean, Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, do an analytic queries and with data that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah. I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively a almost eCommerce like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself, >>Okay. G guys grab a sip of water. After this short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence, keep it right there. >>Your company has more data than ever, and more people trying to understand it, but there's a problem. Your data is stored across multiple systems. It's hard to access and that delays analytics and ultimately decisions. The old method of moving all of your data into a single source of truth is slow and definitely not built for the volume of data we have today or where we are headed while your data engineers spent over half their time, moving data, your analysts and data scientists are left, waiting, feeling frustrated, unproductive, and unable to move the needle for your business. But what if you could spend less time moving or copying data? What if your data consumers could analyze all your data quickly? >>Starburst helps your teams run fast queries on any data source. We help you create a single point of access to your data, no matter where it's stored. And we support high concurrency, we solve for speed and scale, whether it's fast, SQL queries on your data lake or faster queries across multiple data sets, Starburst helps your teams run analytics anywhere you can't afford to wait for data to be available. Your team has questions that need answers. Now with Starburst, the wait is over. You'll have faster access to data with enterprise level security, easy connectivity, and 24 7 support from experts, organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact our Trino experts to get started. >>We're back with Jess Borgman of Starburst and Richard Jarvis of EVAs health. Okay, we're gonna get to lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you you'll never get performance because you need to be column there. You need to store data in a column format. And then, you know, column formats we're introduced to, to data apes, you have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again like iceberg and Delta and Hodi that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a line and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, look closed is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect. And what you don't want to end up is done is backed itself into a corner that then prevents it from innovating. So if you have chosen a technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage and your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, obviously her vision is there's an open source that, that the data meshes open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but to come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well, I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to Haddo and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in Haddo back then. And I think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, that's interesting reminded when I, you know, I see the, the gas price, the tees or gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up, you mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down. Cause I thought it was an amazing quote. He said, it buys us the ability to be unsure of the future. Th that that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use spark to train a machine learning model and you want to use Starbust to query via sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you in and locks you in. >>So I, I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers, but, but a lot of Oracle customers and they, you know, they'll admit, yeah, you know, they're jamming us on price and the license cost they give, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast in ROI? >>I think the answer to that is it can depend a bit. It depends on your businesses skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run at enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud-based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like PJI Mohan said, you know what? I think it's actually harder to play in the data engineering. So I E it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you commander 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities, because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years. And in world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse fit in this, in this world? >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage, or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a deal lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understand it holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access controls so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern, or is it the same wine new bottle? When it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage. >>Your data is capable of producing incredible results, but data consumers are often left in the dark without fast access to the data they need. Starers makes your data visible from wherever it lives. Your company is acquiring more data in more places, more rapidly than ever to rely solely on a data centralization strategy. Whether it's in a lake or a warehouse is unrealistic. A single source of truth approach is no longer viable, but disconnected data silos are often left untapped. We need a new approach. One that embraces distributed data. One that enables fast and secure access to any of your data from anywhere with Starburst, you'll have the fastest query engine for the data lake that allows you to connect and analyze your disparate data sources no matter where they live Starburst provides the foundational technology required for you to build towards the vision of a decentralized data mesh Starburst enterprise and Starburst galaxy offer enterprise ready, connectivity, interoperability, and security features for multiple regions, multiple clouds and everchanging global regulatory requirements. The data is yours. And with Starburst, you can perform analytics anywhere in light of your world. >>Okay. We're back with Justin Boardman. CEO of Starbust Richard Jarvis is the CTO of EMI health and Theresa tongue is the cloud first technologist from Accenture. We're on July number three. And that is the claim that today's modern data stack is actually modern. So I guess that's the lie it's it is it's is that it's not modern. Justin, what do you say? >>Yeah. I mean, I think new isn't modern, right? I think it's the, it's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually are exactly the same as what we've had for 40 years, rather than Terra data. You have snowflake rather than Informatica you have five trend. So it's the same general stack, just, you know, a cloud version of it. And I think a lot of the challenges that it plagued us for 40 years still maintain. >>So lemme come back to you just, but okay. But, but there are differences, right? I mean, you can scale, you can throw resources at the problem. You can separate compute from storage. You really, you know, there's a lot of money being thrown at that by venture capitalists and snowflake, you mentioned it's competitors. So that's different. Is it not, is that not at least an aspect of, of modern dial it up, dial it down. So what, what do you say to that? >>Well, it, it is, it's certainly taking, you know, what the cloud offers and taking advantage of that, but it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same sort of structural constraints that exist with the old enterprise data warehouse model OnPrem still exist just yes, a little bit more elastic now because the cloud offers that. >>So Theresa, let me go to you cuz you have cloud first in your, in your, your title. So what's what say you to this conversation? >>Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, that's not even how the cloud providers are looking at it. They have news query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our, the future goes, right, that that's gonna very much fall the same thing. There was gonna be more edge. There's gonna be more on premise because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers. Right? So there's a lot of reasons why the modern, I guess, the next modern generation of the data staff needs to be much more federated. >>Okay. So Richard, how do you deal with this? You you've obviously got, you know, the technical debt, the existing infrastructure it's on the books. You don't wanna just throw it out. A lot of, lot of conversation about modernizing applications, which a lot of times is a, you know, a microservices layer on top of leg legacy apps. How do you think about the modern data stack? >>Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just cuz you can scale CPU and storage doesn't mean you can get more people to use your data, to generate you more, more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data, but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams, we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business, not just the technology itself. >>Well thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five, seven years cloud obviously has given a different pricing model. De-risked experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm, I'm taking away that that's not enough based on what Richard just said. The modern data stack has to serve the business and enable the business to build data products. I, I buy that. I'm a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about, you know, paying, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and, and principles there >>Of, of how it should look like or, or how >>It's yeah. What it should be. >>Yeah. Yeah. Well, I think, you know, in, in Theresa mentioned this in, in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I, I certainly agree with that. So by no means, are we suggesting that, you know, snowflake or Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's, it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of, of idealism. They had the benefit of it was starting with a clean slate that does not reflect the vast majority of enterprises. >>And even those companies, as they grow up mature out of that ideal state, they go buy a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding, you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really Futureproof yourself from the inevitable change that you will, you won't encounter over time. >>So thank you. So there, based on what Justin just said, I, my takeaway there is it's inclusive, whether it's a data Mar data hub, data lake data warehouse, it's a, just a node on the mesh. Okay. I get that. Does that include there on Preem data? O obviously it has to, what are you seeing in terms of the ability to, to take that data mesh concept on Preem? I mean, most implementations I've seen in data mesh, frankly really aren't, you know, adhering to the philosophy. They're maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing. Hello, fresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >>I mean, I, I think it's a killer case for data. Me, the fact that you have valuable data sources, OnPrem, and then yet you still wanna modernize and take the best of cloud cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both worlds. You can start using the data products on-prem or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or, or maybe just tap into better analytics to get better insights, right? So you're gonna be able to stretch and really have the best of both worlds. That, again, going back to Richard's point, that is meaningful by the business. Not everything has to have that one size fits all set a tool. >>Okay. Thank you. So Richard, you know, talking about data as product, wonder if we could give us your perspectives here, what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >>So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients' demographics about their, their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight. And in any business, that's clearly not a desirable outcome, but when that insight is so critical, as it might be in healthcare or some security settings, you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured, managed way, even if that data comes from a variety of different sources in, in, in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to researchers. >>So that data product through whatever APIs is, is accessible, it's discoverable, but it's obviously gotta be governed as well. You mentioned you, you appropriately provided to internally. Yeah. But also, you know, external folks as well. So the, so you've, you've architected that capability today >>We have, and because the data is standard, it can generate value much more quickly and we can be sure of the security and, and, and value that that's providing because the data product isn't just about formatting the data into the correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context, what does this data mean? And what does it mean to process this data for a particular use case? >>Yeah, it makes sense. It's got the context. If the, if the domains own the data, you, you gotta cut through a lot of the, the, the centralized teams, the technical teams that, that data agnostic, they don't really have that context. All right. Let's send Justin, how does Starburst fit into this modern data stack? Bring us home. >>Yeah. So I think for us, it's really providing our customers with, you know, the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know, and optionality provides the ability to reduce costs, store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, you can really create and, and curate, you know, data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know, model and make that an appropriate compliment to, you know, the, the, the modern data stack that people have today. >>Excellent. Hey, I wanna thank Justin Theresa and Richard for joining us today. You guys are great. I big believers in the, in the data mesh concept, and I think, you know, we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are gonna be available on the cube.net for on-demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and, and, and they have awesome resources, lots of data mesh conversations over there, and really good stuff in, in the resource section. So check that out. Thanks for watching the data doesn't lie or does it made possible by Starburst data? This is Dave Valante for the cube, and we'll see you next time. >>The explosion of data sources has forced organizations to modernize their systems and architecture and come to terms with one size does not fit all for data management today. Your teams are constantly moving and copying data, which requires time management. And in some cases, double paying for compute resources. Instead, what if you could access all your data anywhere using the BI tools and SQL skills your users already have. And what if this also included enterprise security and fast performance with Starburst enterprise, you can provide your data consumers with a single point of secure access to all of your data, no matter where it lives with features like strict, fine grained, access control, end to end data encryption and data masking Starburst meets the security standards of the largest companies. Starburst enterprise can easily be deployed anywhere and managed with insights where data teams holistically view their clusters operation and query execution. So they can reach meaningful business decisions faster, all this with the support of the largest team of Trino experts in the world, delivering fully tested stable releases and available to support you 24 7 to unlock the value in all of your data. You need a solution that easily fits with what you have today and can adapt to your architecture. Tomorrow. Starbust enterprise gives you the fastest path from big data to better decisions, cuz your team can't afford to wait. Trino was created to empower analytics anywhere and Starburst enterprise was created to give you the enterprise grade performance, connectivity, security management, and support your company needs organizations like Zolando Comcast and FINRA rely on Starburst to move their businesses forward. Contact us to get started.

Published Date : Aug 20 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to the data warehouse ever have featured parody with the data lake or vice versa is So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? although if you were starting from a Greenfield site and you were building something brand new, Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, And you can think of them Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come. But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing that the mesh actually allows you to use all of them. But it creates what I would argue are two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around those do an analytic queries and with data that's all dispersed all over the, how are you seeing your the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of data But what if you could spend less you create a single point of access to your data, no matter where it's stored. give you the performance and control that you can get with a proprietary system. I remember in the very early days, people would say, you you'll never get performance because And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, know it takes six or seven it is an evolving, you know, spectrum, but, but from your perspective, And what you don't want to end up So Jess, let me play devil's advocate here a little bit, and I've talked to Shaak about this and you know, And I think similarly, you know, being able to connect to an external table that lives in an open data format, Well, that's interesting reminded when I, you know, I see the, the gas price, And I think, you know, I loved what Richard said. not as many te data customers, but, but a lot of Oracle customers and they, you know, And so for those different teams, they can get to an ROI more quickly with different technologies that strike me, you know, the data brick snowflake, you know, thing is, oh, is a lot of fun for analysts So the advice that I saw years ago was if you have open source technologies, And in world of Oracle, you know, normally it's the staff, easy to discover and consume via, you know, the creation of data products as well. really modern, or is it the same wine new bottle? And with Starburst, you can perform analytics anywhere in light of your world. And that is the claim that today's So it's the same general stack, just, you know, a cloud version of it. So lemme come back to you just, but okay. So a lot of the same sort of structural constraints that exist with So Theresa, let me go to you cuz you have cloud first in your, in your, the data staff needs to be much more federated. you know, a microservices layer on top of leg legacy apps. So I think the stack needs to support a scalable So you think about the past, you know, five, seven years cloud obviously has given What it should be. And I think that's the paradigm shift that needs to occur. data that lives outside of the data warehouse, maybe living in open data formats in a data lake seen in data mesh, frankly really aren't, you know, adhering to So the mesh allows you to have the best of both worlds. So Richard, you know, talking about data as product, wonder if we could give us your perspectives is expecting means that you generate the wrong insight. But also, you know, around the data to say in a very clear business context, It's got the context. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, This is Dave Valante for the cube, and we'll see you next time. You need a solution that easily fits with what you have today and can adapt

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Dave LantaPERSON

0.99+

Jess BorgmanPERSON

0.99+

JustinPERSON

0.99+

TheresaPERSON

0.99+

Justin BorgmanPERSON

0.99+

TeresaPERSON

0.99+

Jeff OckerPERSON

0.99+

Richard JarvisPERSON

0.99+

Dave ValantePERSON

0.99+

Justin BoardmanPERSON

0.99+

sixQUANTITY

0.99+

DaniPERSON

0.99+

MassachusettsLOCATION

0.99+

20 centsQUANTITY

0.99+

TeradataORGANIZATION

0.99+

OracleORGANIZATION

0.99+

JammaPERSON

0.99+

UKLOCATION

0.99+

FINRAORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

Kurt MonashPERSON

0.99+

20%QUANTITY

0.99+

twoQUANTITY

0.99+

fiveQUANTITY

0.99+

JessPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

10QUANTITY

0.99+

AccentureORGANIZATION

0.99+

seven yearsQUANTITY

0.99+

thousandsQUANTITY

0.99+

pythonsTITLE

0.99+

BostonLOCATION

0.99+

GDPRTITLE

0.99+

TodayDATE

0.99+

two modelsQUANTITY

0.99+

Zolando ComcastORGANIZATION

0.99+

GemmaPERSON

0.99+

StarbustORGANIZATION

0.99+

JPMCORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

JavasTITLE

0.99+

todayDATE

0.99+

AWSORGANIZATION

0.99+

millionsQUANTITY

0.99+

first lieQUANTITY

0.99+

10DATE

0.99+

12 yearsQUANTITY

0.99+

one placeQUANTITY

0.99+

TomorrowDATE

0.99+

Starburst panel Q3


 

>>Okay. We're back with Justin Boorman CEO of Starburst. Richard Jarvis is the CTO of EMI health and Teresa tongue is the cloud first technologist from Accenture. We're on July number three. And that is the claim that today's modern data stack is actually modern. So I guess that's the lie or it's it is it's is that it's not modern, Justin, what do you say? >>Yeah, I mean, I think new isn't modern, right? I think it's, the's the new data stack. It's the cloud data stack, but that doesn't necessarily mean it's modern. I think a lot of the components actually are exactly the same as what we've had for 40 years, rather than Terra data. You have snowflake rather than Informatica you have five trend. So it's the same general stack, just, you know, a cloud version of it. And I think a lot of the challenges that it plagued us for 40 years still maintain. >>So lemme come back to you just this, but okay. But, but there are differences, right? I mean, you can scale, you can throw resources at the problem. You can separate compute from storage. You really, you know, there's a lot of money being thrown at that by venture capitalists and snowflake, you mentioned it's competitors. So that's different. Is it not, is that not at least an aspect of, of modern dial it up, dial it down. So what, what do you say to that? >>Well, it, it is, it's certainly taking, you know, what the cloud offers and taking advantage of that, but it's important to note that the cloud data warehouses out there are really just separating their compute from their storage. So it's allowing them to scale up and down, but your data's still stored in a proprietary format. You're still locked in. You still have to ingest the data to get it even prepared for analysis. So a lot of the same sort of structural constraints that exist with the old enterprise data warehouse model OnPrem still exists just, yes, a little bit more elastic now because the cloud offers that. >>So Theresa, let me go to you cuz you have cloud first in your, in your, your title. So what's what say you to this conversation? >>Well, even the cloud providers are looking towards more of a cloud continuum, right? So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, that's not even how the cloud providers are looking at it. They have news query services. Every provider has one that really expands those queries to be beyond a single location. And if we look at a lot of where our, the future goes, right, that that's gonna very much fall the same thing. There was gonna be more edge. There's gonna be more on premise because of data sovereignty, data gravity, because you're working with different parts of the business that have already made major cloud investments in different cloud providers. Right? So there's a lot of reasons why the modern, I guess the next modern generation of the data staff needs to be much more federated. >>Okay. So Richard, how do you deal with this? You you've obviously got, you know, the technical debt, the existing infrastructure it's on the books. You don't wanna just throw it out. A lot of, lot of conversation about modernizing applications, which a lot of times is a, you know, of microservices layer on top of leg legacy apps. Ho how do you think about the modern data stack? >>Well, I think probably the first thing to say is that the stack really has to include the processes and people around the data as well is all well and good changing the technology. But if you don't modernize how people use that technology, then you're not going to be able to, to scale because just cuz you can scale CPU and storage doesn't mean you can get more people to use your data, to generate you more value for the business. And so what we've been looking at is really changing in very much aligned to data products and, and data mesh. How do you enable more people to consume the service and have the stack respond in a way that keeps costs low? Because that's important for our customers consuming this data, but also allows people to occasionally run enormous queries and then tick along with smaller ones when required. And it's a good job we did because during COVID all of a sudden we had enormous pressures on our data platform to answer really important life threatening queries. And if we couldn't scale both our data stack and our teams, we wouldn't have been able to answer those as quickly as we had. So I think the stack needs to support a scalable business, not just the technology itself. >>Oh thank you for that. So Justin let's, let's try to break down what the critical aspects are of the modern data stack. So you think about the past, you know, five, seven years cloud obviously has given a different pricing model. Drisk experimentation, you know that we talked about the ability to scale up scale down, but it's, I'm, I'm taking away that that's not enough based on what Richard just said. The modern data stack has to serve the business and enable the business to build data products. I, I buy that I'm, you know, a big fan of the data mesh concepts, even though we're early days. So what are the critical aspects if you had to think about, you know, the paying, maybe putting some guardrails and definitions around the modern data stack, what does that look like? What are some of the attributes and principles there >>Of, of how it should look like or, or how >>Yeah. What it should be? >>Yeah. Yeah. Well, I think, you know, in Theresa mentioned this in, in a previous segment about the data warehouse is not necessarily going to disappear. It just becomes one node, one element of the overall data mesh. And I, I certainly agree with that. So by no means, are we suggesting that, you know, snowflake or Redshift or whatever cloud data warehouse you may be using is going to disappear, but it's, it's not going to become the end all be all. It's not the, the central single source of truth. And I think that's the paradigm shift that needs to occur. And I think it's also worth noting that those who were the early adopters of the modern data stack were primarily digital, native born in the cloud young companies who had the benefit of, of idealism. They had the benefit of starting with a clean slate that does not reflect the vast majority of enterprises. >>And even those companies, as they grow up mature out of that ideal state, they go by a business. Now they've got something on another cloud provider that has a different data stack and they have to deal with that heterogeneity that is just change and change is a part of life. And so I think there is an element here that is almost philosophical. It's like, do you believe in an absolute ideal where I can just fit everything into one place or do I believe in reality? And I think the far more pragmatic approach is really what data mesh represents. So to answer your question directly, I think it's adding, you know, the ability to access data that lives outside of the data warehouse, maybe living in open data formats in a data lake or accessing operational systems as well. Maybe you want to directly access data that lives in an Oracle database or a Mongo database or, or what have you. So creating that flexibility to really Futureproof yourself from the inevitable change that you will, you won't encounter over time. >>So thank you. So there, based on what Justin just said, I, I might take away there is it's inclusive, whether it's a data Mart, data hub, data lake data warehouse, it's a, just a node on the mesh. Okay. I get that. Does that include Theresa on, on Preem data? Obviously it has to, what are you seeing in terms of the ability to, to take that data mesh concept on pre I mean most implementations I've seen and data mesh, frankly really aren't, you know, adhering to the philosophy there. Maybe, maybe it's data lake and maybe it's using glue. You look at what JPMC is doing. Hello, fresh, a lot of stuff happening on the AWS cloud in that, you know, closed stack, if you will. What's the answer to that Theresa? >>I mean, I, I think it's a killer case for data mesh. The fact that you have valuable data sources, OnPrem, and then yet you still wanna modernize and take the best of cloud cloud is still, like we mentioned, there's a lot of great reasons for it around the economics and the way ability to tap into the innovation that the cloud providers are giving around data and AI architecture. It's an easy button. So the mesh allows you to have the best of both world. You can start using the data products on-prem or in the existing systems that are working already. It's meaningful for the business. At the same time, you can modernize the ones that make business sense because it needs better performance. It needs, you know, something that is, is cheaper or, or maybe just tap into better analytics to get better insights, right? So you're gonna be able to stretch and really have the best of both worlds that, again, going back to Richard's point, that is needful by the business. Not everything has to have that one size fits all set a tool. >>Okay. Thank you. So Richard, you know, you're talking about data as product. Wonder if we could give us your perspectives here, what are the advantages of treating data as a product? What, what role do data products have in the modern data stack? We talk about monetizing data. What are your thoughts on data products? >>So for us, one of the most important data products that we've been creating is taking data that is healthcare data across a wide variety of different settings. So information about patients' demographics about their, their treatment, about their medications and so on, and taking that into a standards format that can be utilized by a wide variety of different researchers because misinterpreting that data or having the data not presented in the way that the user is expecting means that you generate the wrong insight and in any business, that's clearly not a desirable outcome, but when that insight is so critical, as it might be in healthcare or some security settings, you really have to have gone to the trouble of understanding the data, presenting it in a format that everyone can clearly agree on. And then letting people consume in a very structured and managed way, even if that data comes from a variety of different sources in, in, in the first place. And so our data product journey has really begun by standardizing data across a number of different silos through the data mesh. So we can present out both internally and through the right governance externally to, to research is >>So that data product through whatever APIs is, is accessible, it's discoverable, but it's obviously gotta be governed as well. You mentioned appropriately provided to internally. Yeah. But also, you know, external folks as well. So the, so you've, you've architected that capability today >>We have and because the data is standard, it can generate value much more quickly and we can be sure of the security and, and, and value that that's providing because the data product isn't just about formatting the data into the right, correct tables, it's understanding what it means to redact the data or to remove certain rows from it or to interpret what a date actually means. Is it the start of the contract or the start of the treatment or the date of birth of a patient? These things can be lost in the data storage without having the proper product management around the data to say in a very clear business context, what does this data mean? And what does it mean to process this data for a particular use >>Case? Yeah, it makes sense. It's got the context. If the, if the domains on the data, you, you gotta cut through a lot of the, the, the centralized teams, the technical teams that, that data agnostic, they don't really have that context. All right. Let's end, Justin, how does Starburst fit into this modern data stack? Bring us home. >>Yeah. So I think for us, it's really providing our customers with, you know, the flexibility to operate and analyze data that lives in a wide variety of different systems. Ultimately giving them that optionality, you know, and optionality provides the ability to reduce costs, store more in a data lake rather than data warehouse. It provides the ability for the fastest time to insight to access the data directly where it lives. And ultimately with this concept of data products that we've now, you know, incorporated into our offering as well, you can really create and, and curate, you know, data as a product to be shared and consumed. So we're trying to help enable the data mesh, you know, model and make that an appropriate compliment to, you know, the, the, the modern data stack that people have today. >>Excellent. Hey, I wanna thank Justin Teresa and Richard for joining us today. You guys are great. I big believers in the, in the data mesh concept, and I think, you know, we're seeing the future of data architecture. So thank you. Now, remember, all these conversations are gonna be available on the cube.net for on-demand viewing. You can also go to starburst.io. They have some great content on the website and they host some really thought provoking interviews and, and, and they have awesome resources, lots of data mesh conversations over there, and really good stuff in, in the resource section. So check that out. Thanks for watching the data doesn't lie or does it made possible by Starburst data? This is Dave ante for the, and we'll see you next time.

Published Date : Aug 2 2022

SUMMARY :

And that is the claim that today's So it's the same general stack, So lemme come back to you just this, but okay. So a lot of the same sort of structural So Theresa, let me go to you cuz you have cloud first in your, in your, So the centralized cloud, as we know it, maybe data lake data warehouse in the central place, a, you know, of microservices layer on top of leg legacy apps. you can get more people to use your data, to generate you more value for the business. So you think about the past, you know, five, seven years cloud obviously has given And I think that's the paradigm shift that needs to occur. from the inevitable change that you will, you won't encounter over time. and data mesh, frankly really aren't, you know, adhering to So the mesh allows you to have the best of both world. So Richard, you know, you're talking about data as product. that data or having the data not presented in the way that the user But also, you know, external folks as well. the proper product management around the data to say in a very clear business It's got the context. So we're trying to help enable the data mesh, you know, I big believers in the, in the data mesh concept, and I think, you know,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

TheresaPERSON

0.99+

Richard JarvisPERSON

0.99+

JustinPERSON

0.99+

Justin BoormanPERSON

0.99+

DavePERSON

0.99+

AWSORGANIZATION

0.99+

fiveQUANTITY

0.99+

40 yearsQUANTITY

0.99+

StarburstORGANIZATION

0.99+

AccentureORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

JPMCORGANIZATION

0.99+

bothQUANTITY

0.99+

Justin TeresaPERSON

0.99+

both worldsQUANTITY

0.99+

todayDATE

0.98+

first thingQUANTITY

0.98+

TeresaPERSON

0.98+

first technologistQUANTITY

0.98+

OracleORGANIZATION

0.98+

firstQUANTITY

0.98+

one elementQUANTITY

0.97+

InformaticaORGANIZATION

0.97+

cube.netOTHER

0.97+

MongoORGANIZATION

0.97+

starburst.ioOTHER

0.96+

seven yearsQUANTITY

0.95+

oneQUANTITY

0.95+

data MartORGANIZATION

0.91+

one placeQUANTITY

0.88+

both worldQUANTITY

0.85+

COVIDTITLE

0.83+

single locationQUANTITY

0.8+

OnPremORGANIZATION

0.8+

TerraORGANIZATION

0.77+

single sourceQUANTITY

0.74+

one sizeQUANTITY

0.73+

EMI healthORGANIZATION

0.73+

July numberDATE

0.7+

dataORGANIZATION

0.64+

five trendQUANTITY

0.63+

moneyQUANTITY

0.51+

threeQUANTITY

0.37+

Starburst Panel Q2


 

>>We're back with Jess Borgman of Starburst and Richard Jarvis of emus health. Okay. We're gonna get into lie. Number two, and that is this an open source based platform cannot give you the performance and control that you can get with a proprietary system. Is that a lie? Justin, the enterprise data warehouse has been pretty dominant and has evolved and matured. Its stack has mature over the years. Why is it not the default platform for data? >>Yeah, well, I think that's become a lie over time. So I, I think, you know, if we go back 10 or 12 years ago with the advent of the first data lake really around Hudu, that probably was true that you couldn't get the performance that you needed to run fast, interactive, SQL queries in a data lake. Now a lot's changed in 10 or 12 years. I remember in the very early days, people would say, you'll, you'll never get performance because you need to be column. You need to store data in a column format. And then, you know, column formats were introduced to, to data lakes. You have Parque ORC file in aro that were created to ultimately deliver performance out of that. So, okay. We got, you know, largely over the performance hurdle, you know, more recently people will say, well, you don't have the ability to do updates and deletes like a traditional data warehouse. >>And now we've got the creation of new data formats, again like iceberg and Delta and DY that do allow for updates and delete. So I think the data lake has continued to mature. And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, it takes six or seven years to build a functional database. I think that's that's right. And now we've had almost a decade go by. So, you know, these technologies have matured to really deliver very, very close to the same level performance and functionality of, of cloud data warehouses. So I think the, the reality is that's become a lie and now we have large giant hyperscale internet companies that, you know, don't have the traditional data warehouse at all. They do all of their analytics in a data lake. So I think we've, we've proven that it's very much possible today. >>Thank you for that. And so Richard, talk about your perspective as a practitioner in terms of what open brings you versus, I mean, the closed is it's open as a moving target. I remember Unix used to be open systems and so it's, it is an evolving, you know, spectrum, but, but from your perspective, what does open give you that you can't get from a proprietary system where you are fearful of in a proprietary system? >>I, I suppose for me open buys us the ability to be unsure about the future, because one thing that's always true about technology is it evolves in a, a direction, slightly different to what people expect. And what you don't want to end up is done is backed itself into a corner that then prevents it from innovating. So if you have chosen the technology and you've stored trillions of records in that technology and suddenly a new way of processing or machine learning comes out, you wanna be able to take advantage and your competitive edge might depend upon it. And so I suppose for us, we acknowledge that we don't have perfect vision of what the future might be. And so by backing open storage technologies, we can apply a number of different technologies to the processing of that data. And that gives us the ability to remain relevant, innovate on our data storage. And we have bought our way out of the, any performance concerns because we can use cloud scale infrastructure to scale up and scale down as we need. And so we don't have the concerns that we don't have enough hardware today to process what we want to do, but want to achieve. We can just scale up when we need it and scale back down. So open source has really allowed us to maintain the being at the cutting edge. >>So Justin, let me play devil's advocate here a little bit, and I've talked to JAK about this and you know, obviously her vision is there's an open source that, that data mesh is open source, an open source tooling, and it's not a proprietary, you know, you're not gonna buy a data mesh. You're gonna build it with, with open source toolings and, and vendors like you are gonna support it, but come back to sort of today, you can get to market with a proprietary solution faster. I'm gonna make that statement. You tell me if it's a lie and then you can say, okay, we support Apache iceberg. We're gonna support open source tooling, take a company like VMware, not really in the data business, but how, the way they embraced Kubernetes and, and you know, every new open source thing that comes along, they say, we do that too. Why can't proprietary systems do that and be as effective? >>Yeah, well, I think at least with the, within the data landscape saying that you can access open data formats like iceberg or, or others is, is a bit dis disingenuous because really what you're selling to your customer is a certain degree of performance, a certain SLA, and you know, those cloud data warehouses that can reach beyond their own proprietary storage drop all the performance that they were able to provide. So it is, it reminds me kind of, of, again, going back 10 or 12 years ago when everybody had a connector to hit and that they thought that was the solution, right? But the reality was, you know, a connector was not the same as running workloads in had back then. And I think, think similarly, you know, being able to connect to an external table that lives in an open data format, you know, you're, you're not going to give it the performance that your customers are accustomed to. And at the end of the day, they're always going to be predisposed. They're always going to be incentivized to get that data ingested into the data warehouse, cuz that's where they have control. And you know, the bottom line is the database industry has really been built around vendor lockin. I mean, from the start, how, how many people love Oracle today, but our customers, nonetheless, I think, you know, lockin is, is, is part of this industry. And I think that's really what we're trying to change with open data formats. >>Well, it's interesting reminded when I, you know, I see the, the gas price, the TSR gas price I, I drive up and then I say, oh, that's the cash price credit card. I gotta pay 20 cents more, but okay. But so the, the argument then, so let me, let me come back to you, Justin. So what's wrong with saying, Hey, we support open data formats, but yeah, you're gonna get better performance if you, if you, you keep it into our closed system, are you saying that long term that's gonna come back and bite you cuz you're gonna end up. You mentioned Oracle, you mentioned Teradata. Yeah. That's by, by implication, you're saying that's where snowflake customers are headed. >>Yeah, absolutely. I think this is a movie that, you know, we've all seen before. At least those of us who've been in the industry long enough to, to see this movie play over a couple times. So I do think that's the future. And I think, you know, I loved what Richard said. I actually wrote it down cause I thought it was amazing quote. He said, it buys us the ability to be unsure of the future. That that pretty much says it all the, the future is unknowable and the reality is using open data formats. You remain interoperable with any technology you want to utilize. If you want to use smart to train a machine learning model and you wanna use Starbust to query be a sequel, that's totally cool. They can both work off the same exact, you know, data, data sets by contrast, if you're, you know, focused on a proprietary model, then you're kind of locked in again to that model. I think the same applies to data, sharing to data products, to a wide variety of, of aspects of the data landscape that a proprietary approach kind of closes you and, and locks you in. >>So I would say this Richard, I'd love to get your thoughts on it. Cause I talked to a lot of Oracle customers, not as many te data customers, but, but a lot of Oracle customers and they, you know, they'll admit yeah, you know, they Jimin some price and the license cost they give, but we do get value out of it. And so my question to you, Richard, is, is do the, let's call it data warehouse systems or the proprietary systems. Are they gonna deliver a greater ROI sooner? And is that in allure of, of that customers, you know, are attracted to, or can open platforms deliver as fast an ROI? >>I think the answer to that is it can depend a bit. It depends on your business's skillset. So we are lucky that we have a number of proprietary teams that work in databases that provide our operational data capability. And we have teams of analytics and big data experts who can work with open data sets and open data formats. And so for those different teams, they can get to an ROI more quickly with different technologies for the business though, we can't do better for our operational data stores than proprietary databases. Today we can back off very tight SLAs to them. We can demonstrate reliability from millions of hours of those databases being run enterprise scale, but for an analytics workload where increasing our business is growing in that direction, we can't do better than open data formats with cloud based data mesh type technologies. And so it's not a simple answer. That one will always be the right answer for our business. We definitely have times when proprietary databases provide a capability that we couldn't easily represent or replicate with open technologies. >>Yeah. Richard, stay with you. You mentioned, you know, you know, some things before that, that strike me, you know, the data brick snowflake, you know, thing is a lot of fun for analysts like me. You've got data bricks coming at it. Richard, you mentioned you have a lot of rockstar, data engineers, data bricks coming at it from a data engineering heritage. You get snowflake coming at it from an analytics heritage. Those two worlds are, are colliding people like P Sanji Mohan said, you know what? I think it's actually harder to play in the data engineering. So I E it's easier to for data engineering world to go into the analytics world versus the reverse, but thinking about up and coming engineers and developers preparing for this future of data engineering and data analytics, how, how should they be thinking about the future? What, what's your advice to those young people? >>So I think I'd probably fall back on general programming skill sets. So the advice that I saw years ago was if you have open source technologies, the pythons and Javas on your CV, you command a 20% pay, hike over people who can only do proprietary programming languages. And I think that's true of data technologies as well. And from a business point of view, that makes sense. I'd rather spend the money that I save on proprietary licenses on better engineers, because they can provide more value to the business that can innovate us beyond our competitors. So I think I would my advice to people who are starting here or trying to build teams to capitalize on data assets is begin with open license, free capabilities, because they're very cheap to experiment with. And they generate a lot of interest from people who want to join you as a business. And you can make them very successful early, early doors with, with your analytics journey. >>It's interesting. Again, analysts like myself, we do a lot of TCO work and have over the last 20 plus years and in the world of Oracle, you know, normally it's the staff, that's the biggest nut in total cost of ownership, not an Oracle. It's the it's the license cost is by far the biggest component in the, in the blame pie. All right, Justin, help us close out this segment. We've been talking about this sort of data mesh open, closed snowflake data bricks. Where does Starburst sort of as this engine for the data lake data lake house, the data warehouse, it fit in this, in this world. >>Yeah. So our view on how the future ultimately unfolds is we think that data lakes will be a natural center of gravity for a lot of the reasons that we described open data formats, lowest total cost of ownership, because you get to choose the cheapest storage available to you. Maybe that's S3 or Azure data lake storage, or Google cloud storage, or maybe it's on-prem object storage that you bought at a, at a really good price. So ultimately storing a lot of data in a data lake makes a lot of sense, but I think what makes our perspective unique is we still don't think you're gonna get everything there either. We think that basically centralization of all your data assets is just an impossible endeavor. And so you wanna be able to access data that lives outside of the lake as well. So we kind of think of the lake as maybe the biggest place by volume in terms of how much data you have, but to, to have comprehensive analytics and to truly understand your business and understand it holistically, you need to be able to go access other data sources as well. And so that's the role that we wanna play is to be a single point of access for our customers, provide the right level of fine grained access control so that the right people have access to the right data and ultimately make it easy to discover and consume via, you know, the creation of data products as well. >>Great. Okay. Thanks guys. Right after this quick break, we're gonna be back to debate whether the cloud data model that we see emerging and the so-called modern data stack is really modern, or is it the same wine new bottle when it comes to data architectures, you're watching the cube, the leader in enterprise and emerging tech coverage.

Published Date : Aug 2 2022

SUMMARY :

cannot give you the performance and control that you can get with We got, you know, largely over the performance hurdle, you know, more recently people will say, And I remember a, a quote from, you know, Kurt Monash many years ago where he said, you know, open systems and so it's, it is an evolving, you know, spectrum, And what you don't want to end up So Justin, let me play devil's advocate here a little bit, and I've talked to JAK about this and you know, And I think, think similarly, you know, being able to connect to an external table that lives in an open data Well, it's interesting reminded when I, you know, I see the, the gas price, And I think, you know, I loved what Richard said. not as many te data customers, but, but a lot of Oracle customers and they, you know, I think the answer to that is it can depend a bit. that strike me, you know, the data brick snowflake, you know, thing is a lot of fun for analysts So the advice that I saw years ago was if you have open source technologies, years and in the world of Oracle, you know, normally it's the staff, it easy to discover and consume via, you know, the creation of data products as well. data model that we see emerging and the so-called modern data stack

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
RichardPERSON

0.99+

Jess BorgmanPERSON

0.99+

JustinPERSON

0.99+

sixQUANTITY

0.99+

OracleORGANIZATION

0.99+

Richard JarvisPERSON

0.99+

20 centsQUANTITY

0.99+

20%QUANTITY

0.99+

Kurt MonashPERSON

0.99+

P Sanji MohanPERSON

0.99+

TodayDATE

0.99+

seven yearsQUANTITY

0.99+

pythonsTITLE

0.99+

TeradataORGANIZATION

0.99+

JAKPERSON

0.99+

JavasTITLE

0.99+

10DATE

0.99+

todayDATE

0.98+

StarbustTITLE

0.98+

StarburstORGANIZATION

0.97+

VMwareORGANIZATION

0.97+

bothQUANTITY

0.97+

12 years agoDATE

0.96+

single pointQUANTITY

0.96+

millions of hoursQUANTITY

0.95+

10QUANTITY

0.93+

UnixTITLE

0.92+

12 yearsQUANTITY

0.92+

GoogleORGANIZATION

0.9+

two worldsQUANTITY

0.9+

DYORGANIZATION

0.87+

first data lakeQUANTITY

0.86+

HuduLOCATION

0.85+

trillionsQUANTITY

0.85+

one thingQUANTITY

0.83+

many years agoDATE

0.79+

Apache icebergORGANIZATION

0.79+

over a couple timesQUANTITY

0.77+

emus healthORGANIZATION

0.75+

JiminPERSON

0.73+

StarburstTITLE

0.73+

years agoDATE

0.72+

AzureTITLE

0.7+

KubernetesORGANIZATION

0.67+

TCOORGANIZATION

0.64+

S3TITLE

0.62+

DeltaORGANIZATION

0.6+

plus yearsDATE

0.59+

Number twoQUANTITY

0.58+

a decadeQUANTITY

0.56+

icebergTITLE

0.47+

ParqueORGANIZATION

0.47+

lastDATE

0.47+

20QUANTITY

0.46+

Q2QUANTITY

0.31+

ORCORGANIZATION

0.27+

Starburst Panel Q1


 

>>In 2011, early Facebook employee and Cloudera co-founder Jeff Ocker famously said the best minds of my generation are thinking about how to get people to click on ads. And that sucks. Let's face it more than a decade later organizations continue to be frustrated with how difficult it is to get value from data and build a truly agile data driven enterprise. What does that even mean? You ask? Well, it means that everyone in the organization has the data they need when they need it. In a context that's relevant to advance the mission of an organization. Now that could mean cutting costs could mean increasing profits, driving productivity, saving lives, accelerating drug discovery, making better diagnoses, solving, supply chain problems, predicting weather disasters, simplifying processes, and thousands of other examples where data can completely transform people's lives beyond manipulating internet users to behave a certain way. We've heard the prognostications about the possibilities of data before and in fairness we've made progress, but the hard truth is the original promises of master data management, enterprise data, warehouses, data, Mars, data hubs, and yes, even data lakes were broken and left us wanting for more welcome to the data doesn't lie, or does it a series of conversations produced by the cube and made possible by Starburst data. >>I'm your host, Dave Lanta and joining me today are three industry experts. Justin Borgman is this co-founder and CEO of Starburst. Richard Jarvis is the CTO at EMI health and Theresa tongue is cloud first technologist at Accenture. Today we're gonna have a candid discussion that will expose the unfulfilled and yes, broken promises of a data past we'll expose data lies, big lies, little lies, white lies, and hidden truths. And we'll challenge, age old data conventions and bust some data myths. We're debating questions like is the demise of a single source of truth. Inevitable will the data warehouse ever have feature parody with the data lake or vice versa is the so-called modern data stack simply centralization in the cloud, AKA the old guards model in new cloud close. How can organizations rethink their data architectures and regimes to realize the true promises of data can and will and open ecosystem deliver on these promises in our lifetimes, we're spanning much of the Western world today. Richard is in the UK. Teresa is on the west coast and Justin is in Massachusetts with me. I'm in the cube studios about 30 miles outside of Boston folks. Welcome to the program. Thanks for coming on. Thanks for having us. Let's get right into it. You're very welcome. Now here's the first lie. The most effective data architecture is one that is centralized with a team of data specialists serving various lines of business. What do you think Justin? >>Yeah, definitely a lie. My first startup was a company called hit adapt, which was an early SQL engine for IDU that was acquired by Teradata. And when I got to Teradata, of course, Terada is the pioneer of that central enterprise data warehouse model. One of the things that I found fascinating was that not one of their customers had actually lived up to that vision of centralizing all of their data into one place. They all had data silos. They all had data in different systems. They had data on-prem data in the cloud. You know, those companies were acquiring other companies and inheriting their data architecture. So, you know, despite being the industry leader for 40 years, not one of their customers truly had everything in one place. So I think definitely history has proven that to be a lie. >>So Richard, from a practitioner's point of view, you know, what, what are your thoughts? I mean, there, there's a lot of pressure to cut cost, keep things centralized, you know, serve the business as best as possible from that standpoint. What, what is your experience, Joe? >>Yeah, I mean, I think I would echo Justin's experience really that we, as a business have grown up through acquisition, through storing data in different places sometimes to do information governance in different ways to store data in, in a platform that's close to data experts, people who really understand healthcare data from pharmacies or from, from doctors. And so, although if you were starting from a Greenfield site and you were building something brand new, you might be able to centralize all the data and all of the tooling and teams in one place. The reality is that that businesses just don't grow up like that. And, and it's just really impossible to get that academic perfection of, of storing everything in one place. >>Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, you know? Right. But you actually did have to have a single version of the truth for certain financial data, but really for those, some of those other use cases, I, I mentioned, I, I do feel like the industry has kinda let us down. What's your take on this? Where does it make sense to have that sort of centralized approach versus where does it make sense to maybe decentralized? >>I, I think you gotta have centralized governance, right? So from the central team, for things like swans Oxley, for things like security, for certain very core data sets, having a centralized set of roles, responsibilities to really QA, right. To serve as a design authority for your entire data estate, just like you might with security, but how it's implemented has to be distributed. Otherwise you're not gonna be able to scale. Right? So being able to have different parts of the business really make the right data investments for their needs. And then ultimately you're gonna collaborate with your partners. So partners that are not within the company, right. External partners, we're gonna see a lot more data sharing and model creation. And so you're definitely going to be decentralized. >>So, you know, Justin, you guys last, geez, I think it was about a year ago, had a session on, on data mesh. It was a great program. You invited JAK, Dani, of course, she's the creator of the data mesh. And her one of our fundamental premises is that you've got this hyper specialized team that you've gotta go through. And if you want anything, but at the same time, these, these individuals actually become a bottleneck, even though they're some of the most talented people in the organization. So I guess question for you, Richard, how do you deal with that? Do you, do you organize so that there are a few sort of rock stars that, that, you know, build cubes and, and the like, and, and, and, or have you had any success in sort of decentralizing with, you know, your, your constituencies, that data model? >>Yeah. So, so we absolutely have got rockstar, data scientists and data guardians. If you like people who understand what it means to use this data, particularly as the data that we use at emos is very private it's healthcare information. And some of the, the rules and regulations around using the data are very complex and, and strict. So we have to have people who understand the usage of the data, then people who understand how to build models, how to process the data effectively. And you can think of them like consultants to the wider business, because a pharmacist might not understand how to structure a SQL query, but they do understand how they want to process medication information to improve patient lives. And so that becomes a, a consulting type experience from a, a set of rock stars to help a, a more decentralized business who needs to, to understand the data and to generate some valuable output. >>Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, I got a centralized team and that's the most cost effective way to serve the business. Otherwise I got, I got duplication. What do you say to that? >>Well, I, I would argue it's probably not the most cost effective and, and the reason being really twofold. I think, first of all, when you are deploying a enterprise data warehouse model, the, the data warehouse itself is very expensive, generally speaking. And so you're putting all of your most valuable data in the hands of one vendor who now has tremendous leverage over you, you know, for many, many years to come, I think that's the story of Oracle or Terra data or other proprietary database systems. But the other aspect I think is that the reality is those central data warehouse teams is as much as they are experts in the technology. They don't necessarily understand the data itself. And this is one of the core tenets of data mash that that jam writes about is this idea of the domain owners actually know the data the best. >>And so by, you know, not only acknowledging that data is generally decentralized and to your earlier point about, so Oxley, maybe saving the data warehouse, I would argue maybe GDPR and data sovereignty will destroy it because data has to be decentralized for, for those laws to be compliant. But I think the reality is, you know, the data mesh model basically says, data's decentralized, and we're gonna turn that into an asset rather than a liability. And we're gonna turn that into an asset by empowering the people that know the data, the best to participate in the process of, you know, curating and creating data products for, for consumption. So I think when you think about it, that way, you're going to get higher quality data and faster time to insight, which is ultimately going to drive more revenue for your business and reduce costs. So I think that that's the way I see the two, the two models comparing and con contrasting. >>So do you think the demise of the data warehouse is inevitable? I mean, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing infrastructure. Maybe they're gonna build on top of it, but the, what does that mean? Does that mean the ed w just becomes, you know, less and less valuable over time, or it's maybe just isolated to specific use cases. What's your take on that? >>Listen, I still would love all my data within a data warehouse would love it. Mastered would love it owned by essential team. Right? I think that's still what I would love to have. That's just not the reality, right? The investment to actually migrate and keep that up to date. I would say it's a losing battle. Like we've been trying to do it for a long time. Nobody has the budgets and then data changes, right? There's gonna be a new technology. That's gonna emerge that we're gonna wanna tap into. There's gonna be not enough investment to bring all the legacy, but still very useful systems into that centralized view. So you keep the data warehouse. I think it's a very, very valuable, very high performance tool for what it's there for, but you could have this, you know, new mesh layer that still takes advantage of the things. I mentioned, the data products in the systems that are meaningful today and the data products that actually might span a number of systems. Maybe either those that either source systems, the domains that know it best, or the consumer based systems and products that need to be packaged in a way that be really meaningful for that end user, right? Each of those are useful for a different part of the business and making sure that the mesh actually allows you to lose all of them. >>So, Richard, let me ask you, you take, take Gemma's principles back to those. You got, you know, the domain ownership and, and, and data as product. Okay, great. Sounds good. But it creates what I would argue or two, you know, challenges self-serve infrastructure let's park that for a second. And then in your industry, one of the high, most regulated, most sensitive computational governance, how do you automate and ensure federated governance in that mesh model that Theresa was just talking about? >>Well, it absolutely depends on some of the tooling and processes that you put in place around those tools to be, to centralize the security and the governance of the data. And, and I think, although a data warehouse makes that very simple, cause it's a single tool, it's not impossible with some of the data mesh technologies that are available. And so what we've done at EMI is we have a single security layer that sits on top of our data mesh, which means that no matter which user is accessing, which data source, we go through a well audited well understood security layer. That means that we know exactly who's got access to which data field, which data tables. And then everything that they do is, is audited in a very kind of standard way, regardless of the underlying data storage technology. So for me, although storing the data in one place might not be possible understanding where your source of truth is and securing that in a common way is still a valuable approach and you can do it without having to bring all that data into a single bucket so that it's all in one place. >>And, and so having done that and investing quite heavily in making that possible has paid dividends in terms of giving wider access to the platform and ensuring that only data that's available under GDPR and other regulations is being used by, by the data users. >>Yeah. So Justin mean Democrat, we always talk about data democratization and you know, up until recently, they really haven't been line of sight as to how to get there. But do you have anything to add to this because you're essentially taking, you know, doing analytic queries and with data, that's all dispersed all over the, how are you seeing your customers handle this, this challenge? >>Yeah, I mean, I think data products is a really interesting aspect of the answer to that. It allows you to, again, leverage the data domain owners, people know the data, the best to, to create, you know, data as a product ultimately to be consumed. And we try to represent that in our product as effectively, almost eCommerce, like experience where you go and discover and look for the data products that have been created in your organization. And then you can start to consume them as, as you'd like. And so really trying to build on that notion of, you know, data democratization and self-service, and making it very easy to discover and, and start to use with whatever BI tool you, you may like, or even just running, you know, SQL queries yourself. >>Okay. G guys grab a sip of water. After the short break, we'll be back to debate whether proprietary or open platforms are the best path to the future of data excellence. Keep it right there.

Published Date : Aug 2 2022

SUMMARY :

famously said the best minds of my generation are thinking about how to get people to Teresa is on the west coast and Justin is in Massachusetts with me. So, you know, despite being the industry leader for 40 years, not one of their customers truly had So Richard, from a practitioner's point of view, you know, what, what are your thoughts? you might be able to centralize all the data and all of the tooling and teams in one place. Y you know, Theresa, I feel like Sarbanes Oxley kinda saved the data warehouse, I, I think you gotta have centralized governance, right? of rock stars that, that, you know, build cubes and, and the like, And you can think of them like consultants Justin, what do you say to a, to a customer or prospect that says, look, Justin, I'm gonna, you know, for many, many years to come, I think that's the story of Oracle or Terra data or other proprietary But I think the reality is, you know, the data mesh model basically says, I mean, you know, there Theresa you work with a lot of clients, they're not just gonna rip and replace their existing you know, new mesh layer that still takes advantage of the things. But it creates what I would argue or two, you know, Well, it absolutely depends on some of the tooling and processes that you put in place around And, and so having done that and investing quite heavily in making that possible But do you have anything to add to this because you're essentially taking, you know, the best to, to create, you know, data as a product ultimately to be consumed. open platforms are the best path to the future of

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave LantaPERSON

0.99+

DaniPERSON

0.99+

RichardPERSON

0.99+

Justin BorgmanPERSON

0.99+

JustinPERSON

0.99+

Jeff OckerPERSON

0.99+

TheresaPERSON

0.99+

Richard JarvisPERSON

0.99+

TeresaPERSON

0.99+

MassachusettsLOCATION

0.99+

TeradataORGANIZATION

0.99+

40 yearsQUANTITY

0.99+

OracleORGANIZATION

0.99+

UKLOCATION

0.99+

twoQUANTITY

0.99+

JoePERSON

0.99+

GDPRTITLE

0.99+

JAKPERSON

0.99+

2011DATE

0.99+

StarburstORGANIZATION

0.99+

BostonLOCATION

0.99+

thousandsQUANTITY

0.99+

two modelsQUANTITY

0.99+

EMIORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

GemmaPERSON

0.99+

TeradaORGANIZATION

0.99+

AccentureORGANIZATION

0.99+

EachQUANTITY

0.99+

first lieQUANTITY

0.99+

todayDATE

0.99+

first startupQUANTITY

0.98+

ClouderaORGANIZATION

0.98+

TodayDATE

0.98+

SQLTITLE

0.98+

first technologistQUANTITY

0.97+

one placeQUANTITY

0.97+

DemocratORGANIZATION

0.97+

singleQUANTITY

0.97+

about 30 milesQUANTITY

0.97+

oneQUANTITY

0.96+

three industry expertsQUANTITY

0.95+

more than a decade laterDATE

0.94+

OneQUANTITY

0.94+

hit adaptORGANIZATION

0.94+

Terra dataORGANIZATION

0.93+

GreenfieldLOCATION

0.92+

single sourceQUANTITY

0.91+

single toolQUANTITY

0.91+

OxleyPERSON

0.91+

one vendorQUANTITY

0.9+

single bucketQUANTITY

0.9+

single versionQUANTITY

0.88+

about a year agoDATE

0.85+

Theresa tonguePERSON

0.83+

emosORGANIZATION

0.82+

MarsORGANIZATION

0.8+

swans OxleyPERSON

0.77+

IDUTITLE

0.69+

firstQUANTITY

0.59+

a secondQUANTITY

0.55+

Sarbanes OxleyORGANIZATION

0.53+

MasteredPERSON

0.45+

Q1QUANTITY

0.37+

Intermission 1 | DockerCon 2021


 

>>Hey, everyone. I want to welcome you back. This is our intermission. And let me tell you what a morning we've had for those of you that don't know. I'm, Hayma Ganapati, I'm in product marketing at Docker. And I would just want to quote, actually someone who was in one of the chat rooms and this, I think encapsulates exactly how I feel today, because this is my first Docker con and the quote was from. And he said, I feel like a kid in an ice cream store where I don't know which flavor to choose. I want to go to all of the sessions and I got to tell you that's how I felt. And, you know, um, I want to just do some specific call-ups. Um, first of all, Dana way to keep it real in your interview. I love the cube interview. If you miss that, um, it was really great. >>She talks a lot about, uh, CI CD pipeline and you know, what to do with GoodHub. It was great. Um, I also want to say that I was, uh, slipping back and forth between the community rooms and way to go Brazil obrigado until all of the people who participate in the Brazil room, we had about 250 plus people in that room. And the, the chat window was just going crazy and in the French community room, Vive left hall. So if you've a uncle funny, uh, we had about 150 plus people in that room. So I just want to say that, you know, we've been seeing a lot of participation and I just want to thank everyone for attending and for participating on people have been so kind in the chat rooms, we just want to remind you to stay kind, you know, presenters put a lot of effort into their presentations, so just, you know, offer some positive and supportive critique to them. >>And the other thing I want to mention is all of the countries that we're seeing, all of the participation. So I'm just going to call out a few. We have people from the Netherlands, from Canada, from South Africa, Akron, Ohio, Belgium, Austria, yeah, Ecuador, New Zealand. And he cut up Westchester. Like, I mean, it just goes list goes on and on and on. And I think this really speaks to the power of Docker community. And it's a real testimony to how people from all over the world are in love with Docker technology and are excited to be here. And so I just wanted to thank everyone again and want to remind you that we want to leverage the power of community. And we have a fundraising campaign going on to help, uh, people who are affected by COVID. And you know, some of our big communities, especially in India and Brazil are, have been really affected by COVID. >>So we're asking you to contribute and we'd really like you to participate. Um, we have, uh, the, the link you can see here, Docker donates, you can tweet about it and would love to see the numbers go up for those donations, because, you know, I've personally been affected, had some family members pass away from COVID in India, and I'm sure other people may have stories that firsthand or secondhand. So please do that and let's show what the power of Docker community can do. And before I hand over to, to Peter, I'm just going to read out some of the tweets we've been getting, okay, this Brett and Peter, these are great. Uh, one of the, one of the tweets said dev environments is one of the most exciting features in the past few years. Super excited to try this out. Great, great, great tweet. Yeah. >>I agree to, um, another loving the content that was not your tweets. You can, you can slip me the 20 bucks later. Um, there's another tweet that says loving the content from hashtag Docker con so far fascinating use cases and interesting progress and future directions love that. And then there's another one I'm trying to find it here. Uh, I've been waiting for this so long Docker builds now work on Intel and M one. So keep those tweets coming. We love getting this kind of feedback and we love reading the chat room. So, um, Peter, you know, I attended your, your panel and I love that we were talking about a security and that moving, moving it left. So how did that go for you? >>Uh, it was, it was, uh, it was extremely fun. And for those that are, uh, I think my parents might be watching, so they probably watched it and were like, w this is the most boring thing I've ever seen, but, um, you know, you get a bunch of geeks and, uh, Brett has told me I should use geek instead of nerd, but I, I liked, uh, geek. So you get a bunch of geeks talking about security and coding and, um, what, what, what containers actually are, what vulnerabilities are. Yeah, it was, it was extremely fun. The panel was fantastic. They were very engaging the chat. I mean, I couldn't keep up with the chat. Right. It would just kept flying by, uh, luckily I had a helper to pull off questions, but, um, yeah, it's super exciting. You can, I know we're all remote, but you can just feel that energy, right. It was, it was great. It was great. Yeah. Yeah, for sure. It's super >>Connected. I felt that with your panel to Brett as well, sorry to talk over you there, but yeah. How did, how did it go for you? I, there was a lot of engagement in your session. >>Uh, ditto, like it was just, uh, there was so many questions. We only got to get a fraction of them. I tried to pick themes because, uh, when you talk about continuous testing and integration and all the things that take a part of that, um, you, you end up with lots of, well, what I like is the discussion around opinions, because so much of these pipelines from code on your machine, into production and everything in between, it's really, uh, it's a culture. It turns out to be the description of your culture and how you all perceive testing, how you, what you value in testing. And so that really started to come out as a theme, um, throughout that show. And we, we ran at a time. I was also watching Peters and it was fantastic, but like you think an hour is enough time to cover a topic, but it's just tipping tip of the iceberg kind of stuff. So I think it was super helpful. I learned some things, um, I really enjoyed watching Peters and, uh, yeah, can't wait for the next one. There's >>More than that. And likewise, great. I mean, I know, I know we're w maybe we pat chose it, but it, it was, it was super exciting to watch your panel. They were very Nikos, one of my favorite people in the world, uh, a fellow Austinite, but, um, yeah, I love that too. How you, uh, you were talking about opinions right. And playing off each other. It's, it's always interesting to hear smart people, uh, how they think, right. Yeah. I learned from how they think, right. Yeah. A hundred percent. >>So, all right. So we're, we're, um, what's next? Like, we, we gotta keep this thing going, so I've got to remember that. >>I want to, so I want to talk a bit about some of the panels that are, or the sessions that are coming up and just want to remind people that happened this afternoon. I'm all about use cases. You know, I was a developer for many decades, and it's great to hear how other developers are using the tools. But, uh, as a developer, I always wanted to know how are, what are the end user applications? And so we have two exciting sessions at 1:00 PM. We have sneak and red ventures, and they're going to be talking about how they used Docker containers. The title of the, uh, uh, session is great. An ounce of prevention, curing, insecure, container images. So check that out. And we also have another one at one 30 with Massimo, from AWS and Dexter Legaspi from Sirius XM. And they're going to be talking about a real world application using Docker containers. So I really want you to, to encourage you to attend those. >>Yeah. Um, can I say one really quick? Cause I'm Sue and a shout out to Eric Smalling. He's giving the red ventures talk with our partners. He's awesome. Go check out his, uh, but I'm really excited about Matt. Jarvis's sneak talk around. Uh, I think we might've talked about it earlier. My container image has 500 vulnerabilities vulnerabilities now what, right. I mean, I think as developers, as we're coming into this and dev ops and everybody right. You scan and then you see all these vulnerabilities just shoot by. And you're like, well, what do I do? So Matt, Matt will be addressing that. And he is fantastic. I can go on. There's a bunch of them. >>Yeah. There's a whole bunch of coming up and right up after this, I'm on a live stream with a bunch of panels on get ops. And then after that, Peter will be back. And so stay tuned and thanks for watching during the intermission. And we'll see you soon. >>I'm also leading the women in tech panel attend that. Don't forget to do that. >>Absolutely. Yep. All right. Ciao. Ciao >>For me like my first, oh, I get it about Docker was when I used a SQL server container on my neck book for the first time >>Being able to install Docker desktop, which was the first thing that I did and be able to build this without worrying about any of my software versions that I currently had on my machine. It was >>Awesome. One of the things, because I love the most about Docker is because I write books and I do video training courses to help a lot of people take their first steps with Docker and containers and to get a connection with those people and for them to come back to me and say, do you know what this is so cool, so easy, and it's going to change both my job. And, but also my organization, my team, all of that kind of stuff, change the experience that our customers have with our applications and what our business really puts a smile on my face. If >>You want to use containers, then Docker is the first toys, especially with tools like the mark Docker, compose, you can, uh, easily do your day-to-day job as a developer, or even if you're an ops person, then there are the books of the cloud and other things. So yeah, the idea is that we can go the simplicity one simple task, uh, to, uh, Daugherty mate and make that reuse as many times. Uh, that is one of the cool things I like about my >>Favorite part about Docker is using it as a developer tool. I using Docker desktop, really easy to install, really easy to run. >>Every time I come back to DACA, I love the simplicity of the way that it works, especially on things like security, which I find frustrating and hard. It's just done so seamlessly. And so my favorite thing about DACA is not just that it changed the world in the way that we develop in and ship and build applications and put that. It's just so easy that even the guy, like, I think >>It really is all about finding that aha moment, that hook where Docker really makes sense to you because once you have that moment, then all of a sudden, you, you know, you are on your way to being a Docker power user. >>We need for people to understand this technology better before they can, uh, actually dive deep into that. And Docker makes it easier to explain things, to explain the concept of containers, to explain how containers will work, how you can split your environments, how you can, uh, standardize all your pipelines and so on. It's important that we also take the time to help other people. And I think it's very important that we also give back and that's part of the motto of open sources. How do we give back to other people and how we help other people learn? And I think that's what I'm really passionate about. This whole thing is continuing, uh, giving back to the community. I just >>Hope and has fun at Docker con. And I know that there's a lot of great speakers coming and I will be watching the talks, even though they're happening at 3:00 AM and in my local time zone, um, I'm pretty excited to watch and, uh, hopefully watch more than later on streaming or YouTube or wherever they're going to be. So I hope everyone has fun and learn something and yeah, I don't see how you couldn't have fun.

Published Date : May 28 2021

SUMMARY :

I want to welcome you back. She talks a lot about, uh, CI CD pipeline and you know, what to do with GoodHub. And I think this really speaks to So we're asking you to contribute and we'd really like you to participate. I agree to, um, another loving the content that was not your tweets. thing I've ever seen, but, um, you know, you get a bunch of geeks and, I felt that with your panel to Brett as well, sorry to talk over you there, And so that really started to come out as a theme, um, throughout that show. And likewise, great. So we're, we're, um, what's next? So I really want you to, to encourage you to attend those. You scan and then you see all these vulnerabilities just shoot by. And we'll see you soon. I'm also leading the women in tech panel attend that. Being able to install Docker desktop, which was the first thing that I did and be able to to get a connection with those people and for them to come back to me and say, do you know what this the mark Docker, compose, you can, uh, easily do your day-to-day job as a developer, really easy to install, really easy to run. It's just so easy that even the guy, like, I think really makes sense to you because once you have that moment, And I think it's very important that we also give back and that's part of the motto of open sources. And I know that there's a lot of great speakers coming and I

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Eric SmallingPERSON

0.99+

PeterPERSON

0.99+

BrettPERSON

0.99+

Hayma GanapatiPERSON

0.99+

EcuadorLOCATION

0.99+

MattPERSON

0.99+

IndiaLOCATION

0.99+

BelgiumLOCATION

0.99+

CanadaLOCATION

0.99+

AWSORGANIZATION

0.99+

South AfricaLOCATION

0.99+

DACATITLE

0.99+

OhioLOCATION

0.99+

DanaPERSON

0.99+

AustriaLOCATION

0.99+

New ZealandLOCATION

0.99+

AkronLOCATION

0.99+

BrazilLOCATION

0.99+

1:00 PMDATE

0.99+

DockerORGANIZATION

0.99+

JarvisPERSON

0.99+

firstQUANTITY

0.99+

SuePERSON

0.99+

PetersTITLE

0.99+

DockerConEVENT

0.99+

3:00 AMDATE

0.99+

first stepsQUANTITY

0.99+

IntelORGANIZATION

0.99+

GoodHubORGANIZATION

0.99+

500 vulnerabilitiesQUANTITY

0.99+

first thingQUANTITY

0.98+

first toysQUANTITY

0.98+

NetherlandsLOCATION

0.98+

DockerTITLE

0.98+

first timeQUANTITY

0.98+

an hourQUANTITY

0.97+

bothQUANTITY

0.97+

MassimoPERSON

0.97+

SQLTITLE

0.97+

about 250 plus peopleQUANTITY

0.95+

YouTubeORGANIZATION

0.95+

about 150 plus peopleQUANTITY

0.95+

todayDATE

0.94+

DaughertyPERSON

0.93+

oneQUANTITY

0.93+

two excitingQUANTITY

0.92+

this afternoonDATE

0.91+

WestchesterLOCATION

0.9+

Sirius XMORGANIZATION

0.89+

20 bucksDATE

0.87+

hundred percentQUANTITY

0.83+

one simple taskQUANTITY

0.83+

NikosPERSON

0.81+

past few yearsDATE

0.74+

Dexter LegaspiPERSON

0.71+

One of the thingsQUANTITY

0.7+

2021DATE

0.66+

CIORGANIZATION

0.63+

MORGANIZATION

0.63+

FrenchLOCATION

0.52+

COVIDEVENT

0.49+

Gavin Jackson, UiPath | UiPath FORWARD III 2019


 

you live from Las Vegas it's the cube covering you I pat forward America's 2019 brought to you by uipath welcome back everyone to the cubes live coverage of UI path forward here at the Bellagio in Las Vegas Nevada I'm your host Rebecca night co-hosting alongside Dave Volante we are joined by Gavin Jackson he is the senior vice president and managing director amia at uipath thanks so much for coming you are brand spanking new to brands thanking you AWS for four years yeah joined UI paths in September yeah I want to start this conversation by having you talk a little bit about what what appealed to you about UI path and what more do you want to make the leap after four years at AWS yeah so I had the privilege to be west of really having a really close proximity to enterprise customers and getting the opportunity to listen to what they really wanted when they were talking about their digital transformation journeys and as it turns out the sort of cloud first in the automation first eras if you will are operating models at to two sides of the same coin if you think about what the that the cloud proposition has been over the last number of years it's really been about sort of reducing or eliminating the undifferentiated heavy lifting so that builders can build and then that turned into an operating model principle and it became sort of cloud first it's the same thing for the automation world you know we are reducing and eliminating the undifferentiated heavy lifting of Tata a product of business processes and tasks and everything else whether they're complex tasks or simple tasks removing that so that builders can build and business people can innovate and given them the freedom to do what they need to do as business owners think about AWS we obviously follow them very closely yeah anybody but it strikes you didn't thank you such are filters yeah what's the analog so what I think we again I would say that we are we are providing tools so the builders could build but at the same time our our products that works across the entire business stack whether that is sort of automation first as an operating principle across all businesses or whether it's across a business persona whether it's a CFO or somebody in accounts or a salesperson or whatever might be we're building tools that take the mundane tasks away from those users so that they have the freedom to go and serve their customers or or innovate within finance or do the do the job that they really love doing and that's really important for the business it turns out there's not a lot of value and a lot of the work that people do every day so if we can remove some of that then innovation will have an exponential curve of progress and that's what we're focused on today yes yeah again there are similarities there so if I understand the you're shifting one date asked allowing people freeing them up to do so that they can have a strategic impact in their business yes yeah yeah I think it is so if you look at even the technology paradigms and how cloud and AWS evolved and then also the layer on how uipath is evolving in the same way so you have computing and compute power started really with the mainframe and went to distributed servers and then to virtual machines and then from virtual machines it went to hosted virtual machines in the cloud and then from then it went to containers and now we're in this world of server lists we're in the cloud right so effectively the logic lives in server lists and the infrastructure sort of disappears and that provides massive scale in the automation world you started off with big monolithic processes you then had sort of network processes with software and data in the middle of all of that networked RPA really came in as an early sort of tool to help automate a lot of that a lot of processes and now in the realms of sort of automation as a function where in the end like the end game really is where automation is the application and the the applications themselves the data sources the processes really disappear so that the best done analogy I can come up with a metaphor acting um up with is I'm a Marvel fan I'm a geeky kind of Marvel fan of my favorite character is his Iron Man or Tony Stark and more specifically the Jarvis AI so what's happening all the time with with Tony Stark in the Jarvis a is he's interacting with his AI user interface all the time and what's happening in the background is that Java she's working with probably you know a hundred different applications and a hundred different data sources and everything else and rather than having you know a human go and do what the integration work that robots are doing that for him and it's just coming back as a as an outcome yeah I'm gonna keep pushing on this yeah similarities and differences because where it seems to break down is where our PA is focusing on the citizen developer the the end-user I'm afraid of AWS I won't go near it I see that console I call it my techies hey you know AWS is you know you got to be you know pretty technical to actually leverage it at the same time I'm thinking well maybe not maybe my builders are building things that I can touch but help us square that circle yeah so I think you the world is trending towards as much automation as possible so if it can be automated or if you can reduce the the burden to get to innovation I think you know technology is moving that way even in coding I think the transit we're seeing whether it's AWS or anyone else is low to no code and so we we occupy a world within the RPA space or the intelligent automation space where we're providing tools for people that don't need a requirement or or a skill set to code and they can still manufacture a few world their own automations and particularly with a release that we're just announcing today which is Studio X it really kind of reduces the friction from a business user where's zero understanding of how to code to build their own automations whether it's kind of recording a process or just dragging and dropping different components into a process even like even I could do that and that's saying something I can tell you yes exactly yeah this idea of democratizing the the automation the building that you said yeah very much so what will this mean I mean what what does what does that bode for the future of how work gets done because that is at the core of what you're doing is typically understanding how and where work gets done or the bottlenecks where the challenges and how can our PA fix this so I think ultimately like a lot of technologies it's really about the the exponential curve of productivity and whether you're looking at a national level a global level a company level a human level every level productivity has declined really over the last number of years and technology hasn't done a great job to improve that and you can say that some technologies have done a good job again I'd use a TBS is a good job in terms of the proliferation or the how prolific you can get more code out and more more progress there but overall productivity has declined so our sort of view of the world is if you can democratize automation if you can use or add a digital workforce to your to your to your teams then you'll have an exponential curve of productivity which a human level is important company level is important a national level is important and probably at global level is important you know you guys might be right place right time as well yeah because I remember you know all the spending in the 80s said receive growth everywhere except the Nobel prize-winning economist Robert Solow yeah [Laughter] [Music] you guys are hitting it right at the right time yeah you be able to take credit for a lot of it but yeah your thoughts on that in terms of productivity depending yeah I think it is pent up I think that is where where we're at right now and it's ready to be unleashed and I think that these technologies are are the technologies that will unleash it I think really what's happened over the last number of decades probably is that the six trillion dollar IT industry they exist today has largely kind of increased productivity or performance of other technologies it hasn't really increased output so whether it's sort of you know the core networking when Cisco started core networking there was a big increase I would imagine in connectivity and outputs then the technologies that were laid on top of that maybe less so and it was just really kind of putting bad band-aids on problems so it was really technology solving technology problems rather than technology solving human output problems and so I think that this is now the most tangible technology category that really is turning technology into value and productivity for technology really unlocking a lot of value one of the things that your former boss Jeff Bezos said was bet on dreamy businesses that have unlimited upside these these dreamy businesses customers love them they grow to very large sizes they have strong returns on capital and they can endure for decades I wonder if you could put you iPad in that context of a dreamy business what does he know right I mean nobody right I mean so and this is one of the reasons I was attracted by the way to DUI path because I think I think that the robots themselves if you can just kind of look at the subcategory of the robot I think it's on a similar curve to how Gordon Moore was talking about the Intel microprocessor in 1965 and the exponential curve of progress I think we were on that similar curve so when I sort of project five years from now I just think that the amount the robots will be able to do the cognitive kind of capabilities it will be able to do are just phenomenal so and customers customers give us feedback all the time about to two things they love and they value what we do the value is important because it's very empirical for the first time they can actually deploy a technology and see almost an immediate return on their technology whether it's a point technology solving one process or a group of processes they can see an immediate empirical return the other thing that I like to measure I quite like is that they value it so they think they love it they love and value it so they love it meaning it actually induces an emotion so when you when you watch the robots in action and they watch something that has been holding your team back or there's been stifling productivity or whatever it is people get giddy about it it's quite fascinating to see comment about Gordon Moore and Ty that's a digital transformation when I think of digital transformation I think of data yeah what's the difference in a business in a digital business it's how they use data yeah they put data at the core and four years we march to the cadence of Moore's law and that's changed its that that's not what the innovation the engine is today it's it's machine intelligence it's data and it's cloud for scale where do you guys fit I mean obviously AI is a piece of that but but maybe you could add some color to where our PA fits in that equation so I think that's an important point because there's a lot of miscommunication I think about really what it means when you talk about digital transformation and what it means to be digitally transformed and really to see transformed you're really talking about a category of customers which are large more institutional enterprises and governments because they have something to transform what they're transforming into is more of a digital native sort of set of attributes more insurgent mindsets and these companies are to your point they're very data hungry they harvest as much data as they can from from value from data they're very customer centric they focus on the customer experience they use other people's resources oh the cloud being one great example of that and the missing point from what you said is they automate everything they've to meet it so part of the digital transformation journey is if it can be automated it will be automated and anything that's new will be born automated so let me ask a follow-up on that is there a cultural difference in amia versus what you're seeing in North America in terms of the receptivity to automation I mean there are certain parts of of Europe which are you know more protective of jobs do you see a cultural difference or are they kind of I mean we do see even some resistance here but when you talk to customers they're like no it's it's wonderful I love it what are you seeing in Europe so I don't I don't see much of a cultural difference there and I see don't I don't see yet I haven't seen any feedback yes Peres I'm very new still but I haven't seen anybody talk about really that this technology is a technology to take jobs out I think most people see this technology as a way of getting better performance out of humans you know pivoting them towards more so I would say like in some markets in my in my in my prior life in in many prior lives I would say that there's some countries like France for example that would have been a little bit more stayed within their approach to new technologies and adoption not so with regards to automation they see this as a real and game productivity increase thank you I think that's true for people who have tasted it yeah but I do think there's still some reticence in the ranks until they actually experience it that's why we'll talk to some customers about it they'll have bought a Thon's and just a yeah to educate people and what's possible to let them try to build their own robots and then people then the light bulbs go off that it's taking away the aggravations the frustrations the mundi the drudgery and then you said people get giddy about those things you don't have to do that yeah but then the question is also so so what creative things are you doing now so how are you spending your time what are you doing differently that makes your job more interesting more compelling yeah and and and I think that's the real question - so what is the okay yes receiving some money and people aren't having to do those mundane tasks but then what are what is the value add that the employees are now bringing to the table yeah so in actually sit and it takes made the right point as well in terms of the mechanism for doing that is the the part of the battle here is to spark the imagination just like anything really just let you like it back in the Amazon wild it's all of our spark in the imagination if you can if you can imagine it you can build it it's the same thing really with within our world now is figuring out with customers what think what tasks do they do that they hate doing either a user level or a downstream level what are the things that they really want to do that they need our help to harvest and so we do the same sort the same sort of things that we would have done with AWS where we did lots of hackathons and you bought lots of technology partners in with us and we would sort of building all of this we do exactly the same thing with the RP a space it's exactly the same this is really important because creativity is going to become an increasingly important because if productivity goes up it means you can do the same amount of work with less people so it is going to impact jobs and people are gonna have to be comfortable to get out of their comfort zone and become creative and find ways to apply these technologies to really advance but you know drive value to their organizations and actually I look at this as well as a long term technology whereas a long term technology is something that's important for my children I've three and they're still very young so twelve ten and six but eventually they will go into the workplace with these skills embedded they will just know the how you get work done is you have your robot do a whole load of tasks for you here and your your job is to build and to be creative and to harvest data and to manipulate data and and serve customers and focus on the customer experience that's really what it's all about the real brain works I've been a pleasure having you on the show at uipath thank you so much appreciate it i'm rebecca night for j4 day Volante please stay tuned for more from the cubes live coverage of uipath coming up in just a little bit

Published Date : Oct 15 2019

**Summary and Sentiment Analysis are not been shown because of improper transcript**

ENTITIES

EntityCategoryConfidence
Jeff BezosPERSON

0.99+

Dave VolantePERSON

0.99+

Gavin JacksonPERSON

0.99+

EuropeLOCATION

0.99+

Gordon MoorePERSON

0.99+

1965DATE

0.99+

AWSORGANIZATION

0.99+

MoorePERSON

0.99+

threeQUANTITY

0.99+

Robert SolowPERSON

0.99+

Las VegasLOCATION

0.99+

sixQUANTITY

0.99+

North AmericaLOCATION

0.99+

SeptemberDATE

0.99+

iPadCOMMERCIAL_ITEM

0.99+

six trillion dollarQUANTITY

0.99+

Tony StarkPERSON

0.99+

CiscoORGANIZATION

0.99+

uipathORGANIZATION

0.99+

two sidesQUANTITY

0.99+

four yearsQUANTITY

0.99+

JavaTITLE

0.98+

a hundred different data sourcesQUANTITY

0.98+

first timeQUANTITY

0.98+

oneQUANTITY

0.98+

two thingsQUANTITY

0.97+

IntelORGANIZATION

0.97+

Studio XTITLE

0.97+

MarvelORGANIZATION

0.97+

TyPERSON

0.96+

PeresPERSON

0.95+

2019DATE

0.95+

firstQUANTITY

0.95+

UI pathTITLE

0.95+

four yearsQUANTITY

0.95+

a hundred different applicationsQUANTITY

0.94+

JarvisPERSON

0.94+

todayDATE

0.94+

UiPathORGANIZATION

0.94+

Iron ManPERSON

0.94+

Nobel prizeTITLE

0.93+

AmericaLOCATION

0.93+

decadesQUANTITY

0.92+

five yearsQUANTITY

0.91+

twelve tenQUANTITY

0.91+

AmazonORGANIZATION

0.9+

one of the reasonsQUANTITY

0.88+

Las Vegas NevadaLOCATION

0.87+

FORWARD IIITITLE

0.86+

one dateQUANTITY

0.85+

TBSORGANIZATION

0.85+

80sDATE

0.83+

lots of hackathonsQUANTITY

0.83+

Rebecca nightPERSON

0.82+

FranceLOCATION

0.79+

zeroQUANTITY

0.78+

every dayQUANTITY

0.78+

BellagioLOCATION

0.77+

a lot of theQUANTITY

0.74+

amiaPERSON

0.72+

UiPathTITLE

0.7+

last number of decadesDATE

0.69+

UI pathsTITLE

0.66+

TataORGANIZATION

0.63+

technologyQUANTITY

0.59+

lotsQUANTITY

0.58+

uipathTITLE

0.58+

thingsQUANTITY

0.55+

ThonORGANIZATION

0.5+

yearsQUANTITY

0.46+

j4EVENT

0.4+

VolanteORGANIZATION

0.32+

Cameron Mirza, University of Bahrain | AWSPS Summit Bahrain 2019


 

>> from Bahrain. It's the Q covering AWS Public sector Bahrain, brought to you by Amazon Web service, is, >> But we are here. The Cube in Bahrain, Middle East for Amazon Web service is some of our second year were cloud computing and their region of couple availability zones are up and running. Big news with Amazon got our next guest. Here's Cameron Years as head of strategy at the University of By Rain. You guys big news announcing a degree bachelor's degree in cloud computing? Yeah, a certificate one year that is gonna rapidly put new talent in the market. Congratulations. Thank you. Thank you. >> Thank you so much. We're really excited by this announcement today on Dhe. What's exciting about it is Ah, first of all, it's the first cloud computing degree in the Middle East on the other. The other element to this is that the the students suits from any background. Any discipline can get a really good understanding about cloud technology for the certification because the challenges we face in the region right now are we don't have enough skilled tech talent on we don't have enough skill talent to fill the jobs are available in the region. This is not just a regional thing is you know this is a global issue on universities. Have Thio adapt, be a bit more forward thinking live in the future. And we feel really optimistic with our partnership with Amazon today that we can actually fulfill the needs off public sector employers, entrepreneurs, governments throughout the region. And that's the exciting thing >> for us. I mean, let's just take a minute to explain the two components. One's a four year degree, one when you just give a little quick DT on ongoing questions. >> So I need a four year back to the program is gonna be delivered in a very different wave in the traditional academic program is gonna be heavily integrated with the needs of employers, so employees are gonna be really involved in curriculum design. We like them to be part of a teacher faculty as well. The way that the program will be delivered will be very much in a kind of project based way. So it's about developing not just knowledge, but the skills, competency values mindset required to be successful in the 21st century. That's exciting. Think about it, and of course, you know, looking at some of the detail behind the curriculum you're looking at networking, security, machine learning, artificial intelligence, big data. So the fact that this cloud base is actually just a small component to what it opens up in terms of broader skill sets >> I mean, one of the things that we always comment here on the Cube as we cover Amazons reinvent their big annual conference. And the joke is how many more announcement's gonna make this year a tsunami of new things coming. So certainly it's tough to keep up. Many people say that, but for the young people in school, this is relevant stuff. This is like pathway to success. Yeah, job making some cash, making some money, get that's what the purpose of education is. >> Well, I think I think there's a couple of That's a great point. The first thing is, education systems now need to live in the future. Living in a current or in many cases, the past is no acceptable. So it means it means taking some sort of calculated risk. But we're very clear in terms of the direction of travel with regard to technology in the future, jobs The reality is today. But 2/3 of the world's population already needs re Skilling. Those are the challenges we face today. Young people are purpose driven. They know where the where jobs are gonna be. They want to work for themselves. You know, they understand far better than anyone else where the way the future is unraveling do they >> understand how relevant this is? I mean, that's pretty obvious. We're in the industry. Yeah, we kind of obviously known you've been part of you are getting that This is wave. This wave is not gonna end for a while. This is gonna be a great upward migration for opportunity. You know, it's still learning on the young kids part. >> I think I think I think sometimes in education we do a disservice to young people. They're so well informed they understand the market, the trends, the way the technology shaping the future on reality is that what student learns in year one of the university, 50% and acknowledge will be obsolete by the time they graduate. So the focus is no just around giving him a degree. This is also about Skillet Re Skilling and upscaling. People have graduated people in the workforce. So this is a far wider opportunity, even just young people. Well, >> I'll tell you, one thing that gets my attention is that this reminds me of theeighties glider science because I got a degree. I was a freshman. 1983 was just at the beginning of the operating systems movement. Lennox was even around yet Units was just emerging on the scene and was interesting what we learned as building blocks with operating systems and that becoming obsolete in the sense that we don't use it anymore. But coding still happen. So this is had scaled to it with Amazon. You got okay. Easy to industry. Yeah. Now you got He's mentioned machine learning at Lambda Functions server lists. Yep. I'm so much more stuff there for a variety of jobs. >> I think this is just the tip of the iceberg. And I think for us, the way that education is evolving is that we we really believe that education will be more modular, as you say, credentials based, um lifelong on the channel. So some of it will be hands on. Some would be through other channels on competency base, and I think that's the thing. I think competency for us is about the kind of mobilization of the knowledge, your skills, the values attributes. And that's the bit it's gonna add. Value Thio economies throughout >> the world. So had a strategy. You gotta look at the chessboard in the future. You mentioned I live in the future. Yeah. What are some of the feedback you've gotten as you talk to folks in the industry when you roll this out? Um, doing some press interviews? I know you've had some feedback. What's the what's the general sentiment right now? >> Really excited. I think that we talk to employees all the time. We talked to sm easy. You talk to big players like Amazon. I think that in the in the region, I think when we talk about the scale of disruption, I think well, the way we talk about it in U. S. Or Europe is very different to the way we talk about it. I think the Middle East region, like Mellie developing parts of the world still playing catch up on old there. But what you'll find is once they've caught up, the adoption rates go through the roof and then that's that's the challenge for us. Because you know what? We see the uptake. Now we see the update every year growing and growing. And now the next challenge is moving into government, moving into the private sector on upscaling and re Skilling, though. So we're just at the start of this kind of huge opportunity. John and I see it being, you know, exponentially over the next five years. You >> know, it's interesting. I live in San Francisco, Bay Area and Silicon Valley. Invalid. We'll tow you. See what Berkeley's doing. Stand up for you. If you look at Berkeley in particular, number one classes are the data science class and the CS intro. Yeah, I mean, they're kind of hybrids, basically, is all cloudy? Do anything with coding. It's gonna be cloud based, right? Um, and seal, who's the deputy Group CEO? Banky, ABC. I just interviewed earlier today. He said, Aye, aye. He thinks is the biggest thing that's gonna happen. So it's not just racking and stacking standing up infrastructure with Amazon, although great to learn that it will be nerds. Geeks do that. There's a huge machine learning a I field. Yeah, I think that's gonna be something. Is head of strategy. You gotta keep your eye on the prize. They're absolutely What's your view on that? How do you see that happening? >> I think you're right. I think only CD of recently released some doctor to say that over 20% of jobs will be automated as a result of their arrive in the next few years. I think our role is to prepare young people regardless of what they're studying. Fool. Aye, aye. On the impact of machine learning. So I'll give an example. Medicine. You can make a diagnosis now for a patient diagnosis in a fraction of a second compared to what we used to be able to buy using I. Now the reality is that although I all I can give you that information you as a patient, one a robot to give you that diagnosis, right? So our job, I think, is to look at the skills that will define what defines us as human beings away from robots. And that's empathy. That's the stuff around building, building connections around team, working around collaboration. And actually those are the things the education systems of a designed not to deliver. So our job now is by embracing these types of new program is it is. It is to start to work on those softer skills on Prepare this generation of shooting for the for the A. I will that we're moving into >> camera, and I was so excited for your opportunity. Computer science cloud >> all kind, bundle >> together and software is powering this new job. As we say, it's the keys to the kingdom. In this case, it could be the keys to the kingdom. >> Well, I think for us as the national university on for many Ah, not just Bahrain. But for many developing an emerging countries around the world, this is far greater than just technology. Or create Jarvis's about sovereignty. Because if you look at many countries, they import talent. They have to import hardware, software, computers and things imported. This is a great opportunity to help create a workforce but actually flips it on its head. Becomes the innovators, becomes the job creators. So that's the exciting thing for us. It really is >> a generational accident. This is an opportunity for the younger generation to literally take the keys to the kingdom. Absolutely absolutely thanks so much for coming. Thank you. Thank you. Telling cube coverage here by rain Middle East AWS Summit. I'm John Feehery Stables for more coverage after this short break.

Published Date : Sep 15 2019

SUMMARY :

from Bahrain. It's the Q covering AWS the University of By Rain. the challenges we face in the region right now are we don't have enough skilled tech talent on I mean, let's just take a minute to explain the two components. So the fact that this cloud base is actually just a small component to what it opens I mean, one of the things that we always comment here on the Cube as we cover Amazons reinvent their big annual Those are the challenges we face today. You know, it's still learning on the young kids part. I think I think I think sometimes in education we do a disservice to young people. in the sense that we don't use it anymore. And I think for us, the way that education is evolving is that we we You gotta look at the chessboard in the future. the way we talk about it. data science class and the CS intro. I. Now the reality is that although I all I can give you that information you camera, and I was so excited for your opportunity. In this case, it could be the keys to the kingdom. So that's the exciting thing take the keys to the kingdom.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
AmazonORGANIZATION

0.99+

JohnPERSON

0.99+

Silicon ValleyLOCATION

0.99+

21st centuryDATE

0.99+

U. S.LOCATION

0.99+

AWSORGANIZATION

0.99+

Cameron MirzaPERSON

0.99+

Middle EastLOCATION

0.99+

50%QUANTITY

0.99+

BahrainLOCATION

0.99+

second yearQUANTITY

0.99+

1983DATE

0.99+

San FranciscoLOCATION

0.99+

BerkeleyORGANIZATION

0.99+

four yearQUANTITY

0.99+

one yearQUANTITY

0.99+

two componentsQUANTITY

0.99+

JarvisPERSON

0.99+

University of BahrainORGANIZATION

0.99+

oneQUANTITY

0.99+

University of By RainORGANIZATION

0.99+

AmazonsORGANIZATION

0.99+

Amazon WebORGANIZATION

0.98+

John Feehery StablesPERSON

0.98+

Bay AreaLOCATION

0.98+

todayDATE

0.98+

2/3QUANTITY

0.98+

BankyORGANIZATION

0.98+

EuropeLOCATION

0.97+

over 20%QUANTITY

0.96+

first thingQUANTITY

0.96+

AWSPS SummitEVENT

0.95+

this yearDATE

0.94+

Middle East regionLOCATION

0.91+

fourDATE

0.9+

CubeCOMMERCIAL_ITEM

0.89+

year oneQUANTITY

0.88+

first cloud computingQUANTITY

0.87+

ABCORGANIZATION

0.86+

LambdaORGANIZATION

0.83+

Cameron YearsPERSON

0.82+

OneQUANTITY

0.81+

ThioPERSON

0.81+

EastEVENT

0.81+

one thingQUANTITY

0.76+

firstQUANTITY

0.72+

next five yearsDATE

0.71+

AWS SummitEVENT

0.71+

worldQUANTITY

0.7+

waveEVENT

0.7+

MiddleLOCATION

0.68+

CubeORGANIZATION

0.66+

earlier todayDATE

0.66+

yearsDATE

0.65+

yearQUANTITY

0.65+

couple availability zonesQUANTITY

0.63+

jobsQUANTITY

0.6+

GroupORGANIZATION

0.59+

DheORGANIZATION

0.56+

LennoxORGANIZATION

0.54+

2019EVENT

0.53+

secondQUANTITY

0.52+

MellieORGANIZATION

0.51+

Jim Lundy, Aragon Research | Enterprise Connect 2019


 

>> Live, from Orlando, Florida. It's theCUBE! Covering Enterprise Connect 2019. Brought to you by Five9. >> Welcome back to Orlando at Enterprise Connect 2019, I'm Lisa Martin with Stu Miniman. It may sound like we're at a party, this is the buzz of the event, this is day one, and we have had a great day so far of talking with lots of guests. We're welcoming back to theCUBE an alumni, Jim Lundy, see applause for you, Jim, CEO of Aragon Research, welcome back to theCUBE. >> Thank you, great to be here. [Lisa] - That was cute, by the way, so I hope we get some credit for that. >> Yeah, yeah, very cute. >> So Jim, you have been coming to Enterprise Connect since before it was even branded Enterprise Connect, back when it was VoiceCon. Tell us a little bit about your observations about the evolution, not only of the events, but also of all the collaboration and communication tools that consumers now are expecting and demanding of businesses. >> So, I think my first event was called VoiceCon in '07, and then it was all about phones. There was no software here. There was no video. There was no messaging. There was certainly no AI. And there were a lot of the players were not here, they were not in business then. So, if you actually look at some of the bigger players here today, they did not exist in 2007. So you look at the advent of Cloud, that's powered a whole new generation of services and opportunities, and it's great for buyers because there's so much more choice. So, VoiceCon almost died and they rebranded it but they've had to expand their focus. There's still a lot of voice focused stuff, but as you can see it's really shifted, we think it's shifting to communications and collaboration, we think contact center, particularly Cloud, is hot. We've got through overall Tam for communication, collaboration, contact center, by 2024, about 120 billion dollars, which makes it bigger than Enterprise secured. >> Yeah, we just had a great type-in with Blair Pleasant, and said, I'm a new channel, absolutely is where it is, but voice is still the number one preferred channel, when you talk about context center, there's lots of ways you can get in touch, but when something's wrong, I want to pick up my device and talk to a human eventually, so yeah, Cloud, and AI, and everything else, but there's still people in this center of everything going on here. >> Well, I think one of the things for contact center in particular you mentioned is the power of Cloud. So you look at some of the players here like we're in the Five9 booth, they've grown because of their Cloud focus, and Cloud is a lot of what's powering everybody here. And buyers want flexibility, so I think that's one of the big things that's changed, is there's still a lot of On Premise, and hybrid Cloud, but the power and the demand for 'I want to deploy something fast, and maybe I'm not even that big of a shop,' Cloud gives me that flexibility. >> When I look at the market as a whole, there's all those arguments about it's private Cloud, public Cloud, hybrid Cloud, multi Cloud, but if we think of Cloud as an operational model, and not a place, I want speed, I want to be able to update to my latest thing, whether that's for security or the cool new feature, and if I'm not Cloud, or Cloud-like, then I probably install something and what I do now and what I do a few years from now looks pretty close to what I did when I installed it. No? Does that resonate in this phase? >> Yeah, yeah. I think there's a couple things, also there's the operational nature of do I want to be in the server update business? Some people do, because of the nature of their business, but a lot of people don't. So then I can focus on the client experience, providing better journeys, and I think that's up the game. I think there's an awful lot of competition in this market because, really because of Cloud, but On Premise or private Cloud is not a bad word, and like I said, I think the bigger play is to be able to do a combination of things and meet the needs of the customer. The only thing I would say about the show is there's a lot of feature wars at this show and needs to be maybe a little more focused on what the customer needs versus hey, my box is better than your box. >> On that front, in terms of focusing on the customer experience, we talk a lot about that, there's a lot of the messaging and branding around the shows you were just pointing out, but something that is always interesting is where does a company balance the customer experience with the agent experience, because the customer experience is directly related to the agents being in power. >> Oh, totally! Well, you got to really do both and do both well. If the agent can't do their job, then the customer is not going to have a good experience. I do think that overall, there's been a pretty good focus on the agent, because that's where it kind of all started, and if you really look at contact center, it's really a heavy-duty application. You've got to be able to do all those things to service the inbound calls or inbound messages, and you're right, there is a lot of focus on the customer, because in some cases there is so much focus on the agent, well, we took the calls even though a lot of the calls, 10% might've gone to voicemail? Sometimes? Well, we serviced it, so. Little unknown fact is that in a lot of enterprises, marketing and the contact center group never talk. Interesting opportunity. >> Yeah, Jim, it's interesting, you talked about in tech we often get to that feature battle. Battle by power point or by product stack and oh, I've got 147 features and they only have 125 features, when you look at most customers they only know how to use three of the features they've got on there. So what differentiates from a customer standpoint, how do they choose, how do they make sure that they get something that is going to help their overall customer experience, and help their products and their marketing? >> Well, a couple things. First of all, you're right, they don't care as much about 'I've got this feature, you don't', they want to know can the provider take care of me if I buy from them? Are they reputable? Do other people, are they happy with the service? We do a lot of vender evaluations, we call them Aragon research globes and we usually spend six months working on understanding where the vender is this year, and we talk to references and things like that. So I think that sometimes when you, they read a report and they get some insight, they still want to talk to somebody versus just reading a peer review on somebody's consumer website, and really get that insight, so I think that's one lens and I think the other lens is that the smarter players are doing those things where they can provide really high touch support, I'd probably say Five9's pretty good at that, because contact center is really, really complicated, you just don't turn them on sometimes, there's things you have to do to make them work, and I think overall in this space, there are some products you can buy, maybe not contact center where you can spin them up and turn them, configure phones and go, I've actually deployed some of them, and there's some that would be such a nightmare, like who in the world would ever buy this product? So, I think it really varies a gambit and again, sometimes that doesn't always come out with an online review and again, sometimes the buyer, still buyer beware, in a lot of cases, some of the things you read online are not true. >> One of the things we were chatting with a number of the Five9 executs about today is that they have a five billion recorded customer conversations, tremendous potential there to really glean actionable insights about retaining that customer, increasing their CLV, but there's also the concern of data privacy and security in sharing, when you're talking with customers that might have this massive pull of data from which they can really expand their business and become competitive, where is the security and the privacy concerns there? >> It's a good question. There's a lot of focus on GDPR in Europe, there's a lot of focus in California on that, even though there's not been talked about in California. The rest of the US is kind of behind a little bit what Europe has done, but here's the thing. They've got ways to mass sensitive data in a recording like credit card data, that's pretty standard stuff, the big thing is data residency. I want my data in a certain country, Canadians do not want their data resident in the United States, Europeans don't either. Germans don't want their data resident in Belgium, so there's a big sensitivity in Europe about that, and even in fact, Microsoft's even gotten in trouble in Germany over that last year, because they eliminated a relationship with Doy to Telecom, sometimes you can kind of go overboard on that, but however, what I would say though is, some of the big Cloud companies have done this, brought this problem onto themselves, where they have not respected data privacy, there's even a bill now on facial recognition, because of some of the things that have gone on like IBM disclosed, they're doing something, so it is still an issue, it's always going to be an issue, I do think that there needs to be more protect, but here's the question. Who owns your data? Who owns your face, or my face? I don't think that because I upload a photo that I should give my rights away. I think we're going to catch up on that, I do think for the B-to-B though, a lot of these companies, first of all, they are certified, they have Cloud certifications, they definitely do certain things relative to privacy, and so they have to pass a lot of tests that are certified by an auditor, so I think there's a lot of things that most of the B-to-B buyers are not going to have to worry about with a lot of the people here, it's more of the personal side of things, the personal Cloud, Facebook, but usually not the kind of stuff you're dealing with here. >> So, Jim, when I look at the overall contact center market, the Cloud portion of that is still relatively small, if I saw right somewhere, 10, 15%, but it's been growing at a steady clip, where are we in their adoption, is there a plateau that it will hit that, is it take a third of a market, half the market, what do you see happening? >> I would say, we're on a journey and you're right, there is still a small part, which means the large address will market, not that much different than unified communications where it's mainly On Premise, going Cloud. We've got contact center going about 24 billion, and we think a lot of that will be eventually converted to a Cloud, except for maybe the ultra, ultra large call centers, and I think just like email migration 10 years, I've covered that, 10 years ago it was all On Premise. Today it's the opposite. It's like 90-10. So I think that eventually is going to start to happen. >> It's interesting, a lot of that was Microsoft really turned the lever, Microsoft on email, and Microsoft is like, we're going sass, you are going sass if you use Office, you are going Office 365. So I'm curious, is there a lever like that from a licensing standpoint or from a vender standpoint, that would push contact center? >> If you look at the contact center market, we've got it, growth rates around 9% overall, but then you've got people like Five9 that are growing 31%, alright? So if you starting looking at that, why is a Cloud company growing that much when the overall market, well because there's demand. They want the flexibility of Cloud, they don't want to run the servers and upgrade the servers, and I think that they've learned lessons from that, and you're right, Microsoft did do that, but Google forced them to do that. So I think that, are fast growing companies like Five9 forcing some of the bigger players to go more Cloud? And I can say absolutely yes, that a lot of the bigger players are looking over their shoulders saying, and they bought Cloud contact center players so they can keep up with some of the young startups, and Five9's not young, but they would still be considered young in the relative terms of this event. >> I'm curious, Jim, when you're talking with venders and the Aragon research that you do, companies of different sizes, whether they're born in the Cloud or they're legacy companies, where does cultural transformation come into this conversation about evolving a contact center such that an agent is empowered with the right content to deliver it through the right channel, to make a decision that really positively impacts the customer? I can imagine multiple generations, multiple countries, cultural transformation is hard. >> It is a big issue, I think there's more awareness on both the culture of the agent and the culture of the buyer, and I think there's more stuff going on relative to sentiment, sentiment analysis. I do think that's a bigger issue, I think there's more time being spent on training, the better digital companies are investing tons of money in training, so I think there's more awareness relative to cultural differences, cultural nuances, and being more sensitive to maybe things that they would say sorry, can't help you with that, since they've been trained to be maybe more sensitive, they're going to be more understanding when they're actually on a call. >> So, Jim, in your research, where's the white space? Where's the real opportunity for growth and transformation, we've had some discussions here, it's early days in AI's, at AI, or is it not the technology, is it the cultural changes, that Lisa brings up, where are some of impediments and room for growth in the industry? >> So we do think that the enterprise will become more intelligent, and that the providers are going to lead that charge, where instead of you say to AI, we call it intelligent contact center, and we think that there's going to be more of a demand for automation, and that there will be more assistance that might take care of a customer's problem before it ever gets to a human. I do think that we're not going to, that's going to be something that's never going to go away, it's just that they're going to get smarter and more supportive. We have helped clients deploy chat bots for help desk internally for customer facing help desk, I think it's still early here, that people have them, but they're more rules based than AI based. AI's coming in the next two years but there's no doubt that is going to be one of the drivers, and by the way, sometimes people be like, is this the problem we were having, is this the question you have? Yes. Here's this answer, and it's the right answer, the correct answer, that's what people really want, they want the instant gratification, we all kind of grew up, we were used to that with our phones, I need the answer, and I do think that I would probably say the demand for Cloud is going to out-strip everything, so if somebody that's an On Premise provider doesn't have a Cloud option, then I would be worried about them. But I do think AI is not going to go away, we don't think it's going to be an AI or nothing, it's going to be basically intelligent digital assistance, it can answer questions intelligently and have a conversation with you, there's some tools that do that today, but most of them are very basic question and answer, they're not high-end, it can't be like Jarvis on Iron Man, where yes, yes, Mr. Spark, I will do that for you, they're not quite there yet, but the movies glamify that whole thing. Some people expect, well, why doesn't it talk back to me? >> Any last questions, Jim, are there any industries that you see is going to be early adopters to start creating and actually deploying the intelligent contact center? >> Well, let's put it this way. Every client we've talked to in survey work said we wish we had more intelligence in our contact center. I think they're a little scared that they want to make sure they do it right, but if you do it and deploy it and test it, you'd be amazed it's for some of the basic Q&A, how rockstar stuff that is, but sometimes people rush too quickly and deploy it when it's not quite ready. I think a lot of the providers here, including Five9, are going to try to do AI the right way, and not try to rush it, but I would also say this. There's an awful lot of fud about AI, and most of it's not true. >> Lisa, final, final question for Jim here, since John Ferger's not here to ask it, Five9's gone through a lot of changes here, brought in some pretty high-profile executives, any commentary on our host here? >> Look, I knew Rowan and Jonathan Rosenberg at Cisco, they had a rockstar team there, they've even, since they've joined here brought more talent in, and so, the Five9 people I knew have been blown away by the level of talent that has come in, and I think that's just going to help them continue to grow. The question is, when did they declare how big they're going to be? And that's what we're looking for them to do. >> To be continued, Jim, thanks so much for joining Stu and me on theCUBE this afternoon. >> Thank you very much. >> For Stu Miniman, I'm Lisa Martin, you're watching theCUBE. (light beat music)

Published Date : Mar 19 2019

SUMMARY :

Brought to you by Five9. of the event, this is day one, and we have had a great day [Lisa] - That was cute, by the way, so I hope we get but also of all the collaboration and communication So, if you actually look at some of the bigger players when you talk about context center, there's lots of ways of the big things that's changed, is there's still a lot When I look at the market as a whole, there's all I think the bigger play is to be able to do a combination the messaging and branding around the shows you were just on the agent, because that's where it kind of all started, of the features they've got on there. in a lot of cases, some of the things you read online of the B-to-B buyers are not going to have to worry about with So I think that eventually is going to start to happen. It's interesting, a lot of that was Microsoft really forcing some of the bigger players to go more Cloud? that really positively impacts the customer? that they would say sorry, can't help you with that, But I do think AI is not going to go away, we don't think it's I think they're a little scared that they want to make sure come in, and I think that's just going to help them Stu and me on theCUBE this afternoon. For Stu Miniman, I'm Lisa Martin, you're watching theCUBE.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JimPERSON

0.99+

Jim LundyPERSON

0.99+

Lisa MartinPERSON

0.99+

2007DATE

0.99+

GoogleORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

Stu MinimanPERSON

0.99+

Jonathan RosenbergPERSON

0.99+

BelgiumLOCATION

0.99+

EuropeLOCATION

0.99+

10QUANTITY

0.99+

CaliforniaLOCATION

0.99+

147 featuresQUANTITY

0.99+

United StatesLOCATION

0.99+

125 featuresQUANTITY

0.99+

five billionQUANTITY

0.99+

31%QUANTITY

0.99+

six monthsQUANTITY

0.99+

Five9ORGANIZATION

0.99+

LisaPERSON

0.99+

CiscoORGANIZATION

0.99+

John FergerPERSON

0.99+

10%QUANTITY

0.99+

JarvisPERSON

0.99+

IBMORGANIZATION

0.99+

GermanyLOCATION

0.99+

StuPERSON

0.99+

Orlando, FloridaLOCATION

0.99+

Aragon ResearchORGANIZATION

0.99+

OrlandoLOCATION

0.99+

AragonORGANIZATION

0.99+

bothQUANTITY

0.99+

last yearDATE

0.99+

RowanPERSON

0.99+

TodayDATE

0.99+

first eventQUANTITY

0.99+

around 9%QUANTITY

0.99+

OneQUANTITY

0.99+

2024DATE

0.99+

SparkPERSON

0.98+

10 yearsQUANTITY

0.98+

Five9 peopleQUANTITY

0.98+

Office 365TITLE

0.98+

FacebookORGANIZATION

0.98+

about 120 billion dollarsQUANTITY

0.98+

OfficeTITLE

0.98+

CloudTITLE

0.98+

todayDATE

0.98+

VoiceConEVENT

0.97+

10 years agoDATE

0.97+

FirstQUANTITY

0.96+

Iron ManTITLE

0.96+

USLOCATION

0.96+

Enterprise Connect 2019EVENT

0.96+

this yearDATE

0.95+

theCUBEORGANIZATION

0.93+

about 24 billionQUANTITY

0.93+

'07DATE

0.92+

oneQUANTITY

0.92+

GDPRTITLE

0.91+

CEOPERSON

0.89+

this afternoonDATE

0.88+

Blair PleasantORGANIZATION

0.85+

tons of moneyQUANTITY

0.85+

Ken Yeung, Tech Reporter | Samsung Developer Conference 2017


 

>> Announcer: Live from San Francisco it's TheCUBE covering Samsung Developer Conference 2017. Brought to you by Samsung. (digital music) >> Hey welcome back and we're live here in San Francisco this is TheCUBE's exclusive coverage Samsung Developer Conference #SDC2017, I'm John Furrier co-founder of SiliconANGLE Media Coast My next guest is Ken Yeoung tech reporter here inside TheCUBE. I've known Ken for almost 10 years now plus been in the Silicon Valley beat scene covering technology, communities, and all the cutting edge tech but also some of the old established companies. Great to see you. >> Likewise, thanks for having me. >> So tech reporter, let's have a little reporter session here because reporting here at Samsung, to me, is my first developer conference with Samsung. I stopped going to the Apple World Developer Conference when it became too much of a circus around, you know, close to a couple of years before Steve Jobs died. >> Right. >> Now this whole scene well we will have to talk to Steve Gall when we get down there but here, my first one, my reports an awakening I get the TV thing but I'm like IoT that's my world. >> Ken: Oh really? >> I want to see more IoT >> Ken: Yeah. >> So it's good to see Samsung coming into the cloud and owning that. So, that's exciting for me. What do you see as a report that you could file? >> You know, so it's funny because I actually did write a post this morning after watching the keynote yesterday. While I was at VentureBeat a few months ago I reported on Bixby's launch when it came out with the Galaxy S8 and when I heard about what that was it was kind of interesting. That was one of the biggest selling points for me to switch over from my iPhone. And when I tried it out it was interesting. I was kind of wondering how it would stand up against Google Assistant because both of them are installed on the same device. But now as you see with Bixby 2.0 and now with the SmartThings you start to see Samsung's vision. Right now it's on a mobile, it's just very piecemeal. But now when you tackle it on with the TVs, with the fridges, monitors, ovens and everything like that it becomes your entire home. It becomes your Jarvis. You don't actually have to spend 150 bucks or 200 bucks on an Alexa-enabled device or Google Home that most people may not be totally familiar with. But if you have a TV you're familiar with it. >> Obviously you mentioned Jarvis. That's reference to the old sitcom and when Mark Zuckerberg tried his Jarvis project which was, you know, wire his home from scratch. Although a science project, you talk about real utility. I mean so we're getting down to the consumerization so let's take that to the next level. >> Ken: Right. >> If you look at the trends in Silicon Valley it's certainly in the tech industry block, chain and ICOs are really hot. Mission point offerings. That's based on utility right? So, utility-based ICOs, so communities using gamification. Game apps, utility. Samsung, SmartThings. Using their intelligence to not just be the next Amazon. >> Right >> The commerce cloud company, they're just trying to be a better Samsung. >> Ken: Exactly. >> Which they've had some problems in the past and we've heard from analysts here Patrick Morgan was on, pointed out... Illustrated the point. They're a stovepipe company. And with Bixby 2.0 they're like breaking down the silos. We had the execs on here saying that's their goal. >> Ken: Exactly. Yeah if you look on here everything has been siloed. You look at a lot of tech companies now and you don't get to see their grand vision. Everyone has this proto-program when they start these companies and when they expand then you start to see everything come together. Like for example, whether it's Square, whether it's Apple, whether it's Google or Facebook, right? And Samsung, a storied history, right, they've been around for ages with a lot of great technology and they've got their hands in different parts. But from a consumer standpoint you're like likelihood of you having a Samsung device in your home is probably pretty good and so why not just expand that leverage that technology. Right now tech is all about AI. You start to see a lot of the AI stars get acquired or heavily funded and heavily invested. >> Really The Cube is AI, we're AI machine right here. Right here is the bot, analyst report. People are AI watching. But I mean what the hell is AI? AI is machine learning, using software, >> Data collection. >> Nailed it. >> And personalization. And you look at I interviewed a Samsung executive at CAS last year this January, and he was telling me about the three parts. It has to be personal, it has to be contextual and it has to be conversational in terms of AI. What you saw yesterday during the keynote and what executives and the companies have been repeatedly saying is that's what Bixby is. And you could kind of say that's similar to what Google has with Google Assistant you can see that with Alexa but it's still very... Those technologies are very silent. >> What were those three things again? Personal, >> Personable, contextual, and conversational. >> That is awesome, in fact, that connects with what Amy Joe Kim, CEO of ShuffleBrain. She took it from a different angle; she's building these game apps but she's becoming more of a product development. Because it's not just build a game like a Zynga game or you know, something on a mobile phone. She's bringing gaming systems. Her thesis was people are now part of the game. Now those are my words but, she's essentially saying the game system includes data from your friends. >> Right. >> The game might suck but my friends are still there. So there's still some social equity in there. You're bringing it over to the contextual personal, this is the new magic for app developers. Is this leading to AR? >> Oh absolutely. >> I mean we're talking about ... This is the convergence of the new formulas for successful app development. >> Right, I mean we were talking about earlier what is AI and I mentioned all about data and it's absolutely true. Your home is collecting so much data about you that it's going to offer that personal response. So you're talking about is this going to lead to AR? Absolutely, so whatever data it has about your home you might bring your phone out as you go shopping or whatnot. You might be out sight-seeing and have your camera out. And it might bring back some memories, right or might display a photo from your photo album or something. So there's a lot of interesting ties that could come into it and obviously Samsung's camera on their phones are one of the top ones on the market. So there's potential for it, yeah. >> Sorry Ken, I've got to ask you. So looking at the bigger picture now let's look outside of Samsung. We can look at some tell signs here Google on stage clearly not grand-standing but doing their thing. Android, you know, AR core, starting to see that Google DNA. Now they've got tensor flow and a lot of goodness happening in the cloud with Sam Ramji over there kicking ass at Google doing a great job. Okay, they're the big three, some people call it the big seven I call it the big three. It's Amazon, Microsoft, Google. Everyone else is fighting for four, five, six. Depending on who you want to talk to. But those are the three, what I call, native clouds. Ones that are going to be whole-saleing resource. Amazon is not Google, Amazon has no Android. They dropped their phones. Microsoft, Joe Belfiore said hey I'm done with phones they tapped out. So essentially Microsoft taps out of device. They've still got the Xbox. Amazon tapping out of phones. They've got commerce. They've got web service. They've got entertainment. This is going to be interesting. What's your take? >> Well interesting is an under-statement there. I mean, you look at what the ... Amazon, right now, is basically running the show when it comes to virtual assistant or voice-powered assistance. Alexa, Amazon launched a bunch of Alexa products recently and then soon after, I believe it was the last month, Google launches a whole bunch of Google home devices as well. But what's interesting is that both of those companies are targeting... Have a different approach to what Samsung is, right? Remember Samsung's with Bixby 2.0 is all about consolidating the home, right? In my post I coined that it was basically their fight to unite the internet of things kind of thing. But, you know, when it comes to Alexa with Amazon and Google they're targeting not only the smaller integrations with maybe like August or SmartLocks or thermostats and whatnot but they're also going after retailers and businesses. So how many skills can you have on Alexa? How many, what are they called, actions can you have on Google Home? They're going after businesses. >> Well this is the edge of the network so the reason why, again coming back full-circle, I was very critical on day one yesterday. I was kind of like, data IoT that's our wheelhouse in TheCUBE. Not a lot of messaging around that because I don't think Samsung is ready yet and nor should they be given their evolution. But in Amazon's world >> I think they're ... The way they played it yesterday was pretty good a little humble, like they didn't set that expectation like oh my god this is going to >> They didn't dismiss it but they were basically not highlighting it right. >> Well they did enough. They did enough to entice you to tease it but like, look, they have a long way to go to kind of unite it. SmartThings has been around for a while so they've been kind of building it behind the scenes. Now this is like hey now we're going to slap on AI. It's similar to ... >> What do you hear from developers? I've been hearing some chirping here about AI it's got to be standardized and not sure. >> Oh, absolutely. I think a lot of developers will probably want to see hey if I'm going to build... If I want to leverage AI and kind of consolidate I want to be able to have it to maximize my input maximize my reach. Like I don't want to have to build one action here one service skill here. Whatever Samsung's going to call for Bixby. You know I want to make it that one thing. But Samsung's whole modernization that's going to be interesting in terms of your marketplace. How does that play out? You know, Amazon has recently started to monetize or start to incentivize, as it were, developers. And Google if they're not already doing that will probably has plenty of experience in doing that. With Android and now they can do that with Google. >> So I've got to ask you about Facebook. Facebook has been rumored to have a phone coming but I mean Facebook's >> Ken: They tried that once. >> They're Licking their wounds right now. I mean the love on Facebook is not high. Fake news, platform inconsistencies. >> Ken: Ad issues. >> Moves fast, breaks stuff. Zuck is hurting. It's hurting Zuck. Certainly the Russian stuff. I think, first of all, it's really not Facebook's fault. They never claimed to be some original content machine. They just got taken advantage of through bad arbitrage. >> It's gets it to some scale. >> People are not happy with Facebook right now so it's hard for them to choose a phone. >> Well, you're right. There are rumors that they were going to introduce the phone again after... We all remember Facebook Home which was, you know, we won't talk about that anymore. But I think there was talk about them doing a speaker some sort of video thing. I think they were calling it... I believe it's called Project Aloha. I believe Business ETC. and TechCrunch have reported on that extensively. That is going to compete with what Amazon's going. So everyone is going after Amazon, right. So I think don't discount Samsung on this part I think they are going to be I don't want to call them the dark horse but you know, people are kind of ignoring them right now. >> Well if Samsung actually aligned with Amazon that would be very because they'd have their foot in both camps. Google and Amazon. Just play Switzerland and win on both sides. >> Samsung, I think Samsung >> That might be a vital strategy. Kinesis if the customers wanted to do that. Google can provide some cloud for them, don't know how they feel about that. >> Yeah I mean Samsung will definitely be... I think has the appeal with their history they can go after the bigger retailers. The bigger manufacturers to leverage them because there's some stability as opposed to well I'm not going to give access to my data to Amazon you look at Amazon now as Amazon's one of the probably the de facto leader in that space. You see people teaming up with Google to compete against them. You know, there's a anti-Amazony type of alliance out there. >> Well I would say there's a jealousy factor. >> Ken: True, true. >> But a lot of the fud going out there... I saw Matt Asay's article in InfoWorld... And it was over the top basically saying that Amazon's not giving back an open source. I challenged Andy Jesse two years ago on that and Matt's behind the times. Matt you've got to get with the program you're a little bit hardcore pushed there. But I think he's echoing the fear of the community. Amazon's definitely doing open source first of all but the same thing goes for Ali Baba. I asked the founder of Ali Baba cloud last week when I was in China. You guys are taking open source what are you giving back and it was off the record comment and he was like, you know, they want to give back. So, just all kinds of political and or incumbent positions on open source, that to me is going to be the game-changer. Linux foundation, Hipatchi is growing, exponential growth in open source over the next five to ten years. Just in terms of lines of code shipped. >> Right. >> Linux foundation's shown those numbers and 10% of that code is going to be new. 90% of the code's going to be re-used and so forth. >> Ken: Oh absolutely. I mean you're going to need to have a lot of open source in order for this eco-system to really flourish. To build it on your own and build it proprietary it basically locks it down. Didn't Sony deal with that when they were doing, like, they're own memory cards for cameras and stuff and now their cameras are using SD cards now. So you're starting to see, I think, a lot of companies will need to be supportive of open source. In tech you start to see people boasting that, you know, we are doing this in open source. Or you know, Facebook constantly announces hey we are releasing this into open source. LinkedIn will do that. Any company that you talk to will... >> Except Apple. Apple does some open source. >> Apple does some open source, yeah. >> But they're a closed system and they are cool about it. They're up front it. Okay final question, bottom line, Samsung Developer Conference 2017 what should people know that didn't make it or are watching this, what should they know about what they missed and what Samsung's doing, what they need to do better. >> You know I think what really took the two-day conference is basically Bixby. You look at all the sessions; all about Bixby. SmartThings, sure they consolidated everything into the SmartThings cloud, great. But you know SmartThings has been around for a while and I'm interested to see how well they've been doing. I wish they released a little bit more numbers on those. But Bixby it was kind of an interesting 10 million users on them after three months launching in the US which is very is a pretty good number but they still have a bit of a ways to go and they're constantly making improvements which is a very good, good, good thing as well. >> Ken Yeoung, a friend of TheCUBE, tech reporter formerly with VentureBeat now onto his next thing what are you going to do? Take some time off? >> Take some time off, continue writing about what I see and who knows where that takes me. >> Yeah and it's good to get decompressed, you know, log off for a week or so. I went to China I was kind of off Facebook for a week. It felt great. >> Yeah. (laughs) >> No more political posts. One more Colin Kaepernick kneeling down during the national anthem or one more anti-Trump post I'm going to... It was just disaster and then the whole #MeToo thing hit and oh my god it was just so much hate. A lot of good things happening though in the world and it's good to see you writing out there. It's TheCUBE, I'm John Furrier, live in San Francisco, Samsung Developer Conference exclusive Cube coverage live here we'll be right back with more day two coverage of two days. We'll be right back.

Published Date : Oct 19 2017

SUMMARY :

Brought to you by Samsung. and all the cutting edge tech but also I stopped going to the Apple World Developer Conference I get the TV thing but I'm like IoT So it's good to see Samsung coming into the cloud But now when you tackle it on with the TVs, so let's take that to the next level. Using their intelligence to not just be the next Amazon. The commerce cloud company, they're just trying to be We had the execs on here saying that's their goal. and when they expand then you But I mean what the hell is AI? and it has to be conversational in terms of AI. or you know, something on a mobile phone. You're bringing it over to the contextual personal, This is the convergence of the new formulas for Your home is collecting so much data about you that This is going to be interesting. I mean, you look at what the ... Not a lot of messaging around that because I don't think like oh my god this is going to They didn't dismiss it but they were They did enough to entice you it's got to be standardized and not sure. that's going to be interesting in terms of your marketplace. So I've got to ask you about Facebook. I mean the love on Facebook is not high. They never claimed to be some original content machine. so it's hard for them to choose a phone. I think they are going to be Google and Amazon. Kinesis if the customers wanted to do that. I think has the appeal with their history they can go in open source over the next five to ten years. and 10% of that code is going to be new. in order for this eco-system to really flourish. Apple does some open source. and what Samsung's doing, and I'm interested to see how well they've been doing. and who knows where that takes me. Yeah and it's good to get decompressed, you know, and it's good to see you writing out there.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Ken YeoungPERSON

0.99+

AmazonORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

Joe BelfiorePERSON

0.99+

GoogleORGANIZATION

0.99+

Patrick MorganPERSON

0.99+

Ken YeungPERSON

0.99+

FacebookORGANIZATION

0.99+

SamsungORGANIZATION

0.99+

Steve JobsPERSON

0.99+

TechCrunchORGANIZATION

0.99+

Matt AsayPERSON

0.99+

Steve GallPERSON

0.99+

Sam RamjiPERSON

0.99+

AppleORGANIZATION

0.99+

Andy JessePERSON

0.99+

LinkedInORGANIZATION

0.99+

Colin KaepernickPERSON

0.99+

Amy Joe KimPERSON

0.99+

USLOCATION

0.99+

iPhoneCOMMERCIAL_ITEM

0.99+

Mark ZuckerbergPERSON

0.99+

KenPERSON

0.99+

150 bucksQUANTITY

0.99+

last yearDATE

0.99+

John FurrierPERSON

0.99+

SonyORGANIZATION

0.99+

200 bucksQUANTITY

0.99+

10%QUANTITY

0.99+

Silicon ValleyLOCATION

0.99+

ZyngaORGANIZATION

0.99+

ChinaLOCATION

0.99+

Galaxy S8COMMERCIAL_ITEM

0.99+

San FranciscoLOCATION

0.99+

VentureBeatORGANIZATION

0.99+

a weekQUANTITY

0.99+

last weekDATE

0.99+

Tal Klein, The Punch Escrow | VMworld 2017


 

>> Narrator: Live from Las Vegas, it's the Cube, covering VMWorld 2017. Brought to you by VMWare and its ecosystem partners. (bright music) >> Hi, I'm Stu Miniman with the Cube, here with my guest host, Justin Warren. Happy to have a returning Cube alum, but in a different role then we had. It's been a few years. Tal Klein, who is the author of The Punch Escrow. >> Au-tor, please. No, I'm just kidding. (laughing) Tal, thanks so much for joining us. It's great for you to be able to find time to hang out with the tech geeks rather than all the Hollywood people that you've been with recently. (laughing) >> You guys are more interesting. (laughing) >> Well thank you for saying that. So last time we interviewed you, you were working for a sizable tech company. You were talking about things like, you know, virtualization, everything like that. Your Twitter handle's VirtualTal. So how does a guy like that become not only an author but an author that's been optioned for a movie, which those of us that, you know, are geeks and everything are looking at, as a matter of fact, Pac Elsiger this morning said, "we are seeing science fiction become science fact." >> That's right. >> Stu: So tell us a little of the journey. >> Yeah, cool, I hope you read the book. (laughing) I don't know, the journey is really about marketing, right? Cause a lot of times when we talk about virtual, like, in fact last time I was on the Cube, we were talking about the idea that desktops could be virtual. Cause back then it was still this, you know, almost hypothetical notion, like could desktops be virtual, and so today, you know, so much of our life is virtual. So much of the things that we do are not actually direct. I was watching this great video by Apple's new augmented reality product, where you sit in the restaurant and you look at it with your iPad, and it's your plate, and you can just shift the menu items, and you see the menu items on your plate in the context of the restaurant and your seat and the person you're sitting across from. So I think the future is now. >> Yeah, it reminds of, you know, the movie Wall-E, the animated one. We're all going to be sitting in chairs with our devices or Ready Player One, you know, very popular sci-fi book that's being done by Speilberg, I believe. >> Yes, yeah, very exciting. >> Tell us a little bit about your book, you know, we talked, when I was younger and used to read a lot of sci-fi, it was like, what stuff had they done 50 years ago that now's reality, and what stuff had they predicted, like, you know, we're going to go away from currency and go digital currency, and it's like we're almost there. But we still don't have flying cars. >> Yeah, we're, I mean, the main problem with flying cars is that we need pilots. And I think actually we're very close to flying cars, cause once we have self-driving vehicles and we no longer need to worry about it being a person behind the joystick, then we're in really good shape. That's really the issue, you know, the problem with flying cars is that we are so incompetent at driving and or flying. That's not our core competency, so let's just put things that do understand how to make those things happen and eliminate us from the equation. >> Everything is a people problem. >> Yeah, so when I wrote the book, Punch Escrow, Punch Escrow, (laughing) when I wrote the book, I really thought about all the things that I read growing up in science fiction, you know, things like teleportation, things like nanotechnology, things like digital currency, you know, how do we make those, how do we present those in a viable way that doesn't seem too science fictiony. Like one of the things I really get when people read the book is it feels really near-future, even though it's set like 100 plus years in the future, all the concepts in it feel very pragmatic or within reach, you know? >> Yeah, absolutely. It's interesting, we look at, you know, what things happen in a couple of years and what things take a long time. So artificial intelligence, machine learning, it's not like these are new concepts, you know? I read a great book by, you know, it was Isaacson, The Innovators. You go back to like Aida Lovelace, and the idea of what a machine or computer would be able to do. So 100 years from now, what's real, what's not real? We still all have jobs or something? >> We have jobs but different. Remember, I don't know if you're a historian, but back in the industrial age, there was a whole bunch of people screaming doom and gloom. In fact, if we go way back to the age of the Luddites, who just hated machines of any kind. I think that in general, we don't like, you know, we're scared of change. So I do think a lot of the jobs that exist today are going to be done by machines or code. That doesn't mean the jobs are going away. It means jobs are changing. A lot of the jobs that people have today didn't exist in the industrial age. So I think that we have to accept that we are going to be pragmatic enough to accept the fact that humans will continue to evolve as the infrastructure powering our world evolves, you know? We talk about living in the age of the quantified self, right? There's a whole bunch that we don't understand how to do yet. For example, I can think of a whole industry that tethers my FitBit to my nutrition. You know, like there's so much opportunity that for us to say, oh that's going to be the end of jobs, or the end of innovation or the end of capitalism, is insane. I think this just ushers in a whole new age of opportunity. And that's me, I'm just an optimist that way, you know. >> So the Luddites did famously try to destroy the machines. But the thing is, the Luddites weren't wrong. They did lose their jobs. So what about the people whose jobs are replaced, as you say net new, there's a net new number of jobs. But specific individuals, like people who manufacture cars for example, lose their jobs because a robot can do that job safer and better and faster than a human can do it. So what do we do with those humans? Because how do we get people to have new jobs and retrain themselves? >> I address some of these notions in the book. For example, one of the weird things that we're suffering from is the lack of welders in society today, cause welding has become this weird thing that we don't think we need people for, so people don't really get trained up in it because, you know, machines do a lot of welding but there's actually specialty welding that machines can't do. So I think the people who are really good at the things that they do will continue to have careers. I think their careers will become more niche. Therefore they'll be able to create, to demand a higher wage for it because almost like a carpenter, you know, a specialist carpenter will be able to earn a much higher wage today by having fewer customers who want really custom carpentry versus things that can be carved up by a machine. So I think what we end up seeing is that it's not that those jobs go away. It's they become more specialized. People still want Rolls Royces. People still want McLarens. Those are not done by machines. Those are hand-made, you know? >> That's an interesting point, so the value of something being hand-made becomes, instead of it being a worse product, it's actually- >> Tal: That's a big concept in the book. >> Oh okay, right. >> A big concept in the book is that we place a lot of value on the uniqueness of an object. And that parlays in multiple ways. So one of the examples that I use in the book is the value of a Big Mac actually coming from McDonald's. Like, you can make a Big Mac. We know the recipe for a Big Mac. But there is a weird sort of nacent value to getting a Big Mac from McDonald's. It's something in our brain that clicks that tethers it to an originality. Diamonds, another really good example. Or you know, we know there's synthetic diamonds. We still want the ones that get mined in the cave. Why? We don't know. Right, they're just special. >> Because De Beers still has really good marketing. (laughing) >> So I think there's- >> That's interesting, so the concept of uniqueness, which again comes to scarcity and so on. As an author, someone who is no doubt, signed a lot of his book, that means that that book is unique because it's signed by the author, unlike something which is mass produced and there is hopefully thousands and thousands of copies that you sell. >> Going into this, I actually thought about that a lot. And that's why I've created like multiple editions of the book. So like the first 500 people who pre-ordered it, they get like a special edition of the book that's like stamped and all this kind of stuff. I even used different pens. (laughs) I appreciate that because I'm also a collector. I collect music, I collect books. And you know, so I see those aspects in myself. So I know what I value about them, you know? >> And the crossover between music and books is interesting. So as someone who has a musical background, I know that there's a lot of musicians who'll come out with special editions, and you know, because this is an age where we can download it. You can download the book. Do you think there is something, is there something that is intrinsic to having a physical object in a virtual world? >> I think to our generation, yes. I'm not so sure about millennials, when they grow up. But there are, for example, I'm going to see U2 next week, I'm very lucky to see that. But part of the U2 buying experience, to get access to the presale, you need to be part of their fan club. To be a part of their fan club, you need to get, you get like a whole bunch of limited edition posters, limited edition vinyl, and all this kind of stuff. So there's an experience. It's no longer just about going to see U2 at a concert. There's like the entire package of you being a special U2 fan. And they surround it with uniqueness. It's not necessarily limited, but there's an enhanced experience that can't just be, it's not just about you having a ticket to a single concert. >> Justin: Yeah, okay. >> I'm curious, the genre, if you'd call it, is hard science fiction. >> Yes. >> The challenge with that is, you know, what is an extension of what we're doing, and what is fiction? And people probably poke at that. Have you had any interesting experience, things like that? I mean, I've listened to a lot of stuff like Andy Weir, like let the community give feedback before he created the final The Martian. (laughing) But so yeah, what's it like, cause we can, the geeks can be really harsh. >> Yes, I've learned from my Reddit experience that, so what's really funny about it is the first draft of this novel was hard as nails. It was crazy. And my publisher read it, and it would have made all the hard science fiction guys super happy. My publisher read it, he was like, you've written a really great hard science fiction book, and all five people who read it are going to love it. (laughing) You know, but like, I came here with my buddy Danny. He couldn't even get through the first three pages of it. He's like, he wanted to read it. So part of working through the editorial process is saying, look, I care a lot about the science because one of my deep goals is to write a STEM-oriented book that gets people excited about technology and present the future as not a dystopian place. And so I wanted the science to be there and have a sort of gravity to the narrative. But yeah, it's tough. I worked with a physicist, a biologist, a geneticist, an anthropologist, and a lawyer. (laughs) Just to try to figure out, how do we carve out, you know, what does the future look like, what does the evolution of each individual sciences, we talked about the mosquitoes, right? You know, we're already doing a lot of crazy stuff with mosquitoes. We're modifying them so that the males mate with females that carry the Zika virus, you know, give birth to offspring that never reach maturity. I mean, this is just crazy, it's science fiction. And now that they're working on modifying female mosquitoes into vaccine carriers instead of disease carriers. I mean, this is science fiction, right? Like who believes this stuff? It's crazy. >> Christopher is amazing. >> Yeah, I've loved, there's been a bunch of movies recently that have kind of helped to educate on STEM some, you know, Martian got a lot of people excited, you know, Hidden Figures, the one that I could being my kids that are teenagers now into it and they get excited, oh, science is great. So the movie, how much will you be involved? You know, what can you share about that experience, too, so far? >> It's been, it's very surreal. That's the word is use to describe it, the honest, god's honest truth, I mean. I've been very lucky in that my representation in Hollywood is this rock-solid guy called Howie Sanders. And he's this bigger-than-life Hollywood agent guy. He's hooked me up, we've made a lot of business decisions that we're focused less on the money and more on the team, which is nice to be, like when you're in your 40s and you're more financially settled, you're not in the kind of situation where you might be in your 20s and just going to sign the first deal that people give you. So we really focused on hooking up with like the director, James Bovin is, you know, he's the guy who co-created Flight of the Concords. He did the Muppets movie, you know, Alice Through the Looking Glass. Really professional guy but also really understands the tone of the book, which is like humorous, you know, kind of sarcastic. It's not just about the technology. It's also about the characters. Same thing with the production team. The two producers, Mandeville Productions, I was just talking to Todd Lieberman, and we're talking about just what is augmented reality, like how does it look like on the screen? So I'm not- >> It's not going to look like Blade Runner is what I'm hearing. >> (laughs) I don't know. It's going to look real. I imagine, I don't know, they're going to make whatever movie they're going to make, but their perspective, one of the things we talked about is keeping the movie very grounded. Like you know, one of the big questions they ask first going into it is before we even had any sort of movie discussions is like is this more of like a Looper, Gattica, or District Nine, or is it more like The Fifth Element, you know, I mean, is it like, do you want it to be this sort of grounded movie that feels authentic and real and near future or do you want this to be like completely alien and weird and out of it. And the story is more grounded. So I think a lot, hopefully what we display on the screen will not feel that far away from reality. >> Okay, yeah. >> You do marketing in your day job. >> I do. >> I'm curious as you look at this, kind of the balance of educating, reaching a broad audience, you have passion for STEM, what's your thoughts around that? Is it, I worry there's so much general, like television or things like that, when I see the science stuff, it like makes me groan. Because you know, it's like I don't understand that. >> I am the worst, because I got a security background too, so that's the one I get scrambled on. The war, I mean, like. >> Wait, thank goodness I updated my firewall settings because I saved the world from terrorists. >> Hang on, we're breaking through the first firewall. Now we're through the second firewall. (laughing) Now we're going through the third firewall, like 15 firewalls. And let me upload the virus, like all that stuff. It's difficult for me. I think that, you know, hopefully, there's also a group in Hollywood called the Hollywood Science and Entertainment Exchange. And they're a group of scientists who work with film makers on, you know, reigning things in. And film makers don't usually take all their advice, i.e. Interstellar, (laughing) but you know, I think (laughing) in many cases there's some really good ideas that come to play into it that hopefully bring up, like I think Jarvis for example, in Iron Man or the Avengers is a really cool implementation of what the future of AI systems might be like. And I know they used the Hollywood Science Exchange to figure out how is that going to work? And I think the marketing aspect is, you know, the reason I came up with the idea for this book is because my CEO of a company I used to work for, he had this whole conversation about teleportation, like teleportation was impossible. And he's like, it's not because the science, yes, the science is a problem right now, but we'll get over it. The main issue is that nobody would ever step foot into a device that vaporizes them and then printed them out somewhere else. And I said, well that's great, cause that's a marketing problem. (laughing) >> Yeah, you're dead every time you do it. But it's the same you, I can't tell the difference. >> Well, you say you're dead, I'm saying you're just moving. (laughing) >> Artificial intelligence, you know, kind of a big gap between the hype to where we need to go. What's your thoughts on that space in general? >> I think that we have, it's a great question because I feel like that's a term that gets thrown around a lot, and I think as a result it's becoming watered down. So you've this sort of artificial intelligence that comes with like, you know, Google building an app that can beat the world's best Go player, which is a really, really difficult puzzle. The problem is, that app can do one thing, and that's play Go. You put in it a chess game, and it's like I don't know what's going on. >> It's a very specialized kind of intelligence, yeah. >> Now with Open AI, you know, they just had some pretty interesting implementations where they actually played video games with a real live competition and won. Again, you know, but without the smack talk, which really I think would add a lot. Now you got to get an AI to smack talk. So I think the problem is we haven't figured out a really good way of creating a general purpose AI. And there's a lot of parallels to the evolution of computing in general because if you look at how computers were before we had general purpose operating systems like Unix, every computer was built to do a very, very specific function, and that's kind of what AI is right now. So we're still waiting to have a sort of general purpose AI that can do a lot of specialized activities. >> Even most robots are still very single-purpose today. >> That's the fundamental problem. But you're seeing the Cambridge guys are working on sort of the bipedal robot that can do lots of things. And Siri's getting better, Cortana's getting better, Watson's getting better, but we're not there. We still need to find a really good way of integrating deep knowledge with general purpose conversational AI. Cause that's really what you need to like, Stu, what do you need? Here, let me give it to you, you know? >> Do you draw a distinction between AI that's able to simply sort of react as a fairly complex machine or something that can create new things and add something? >> That's in the book as well. So the fundamental thing that I don't think we get around even in the future is giving computers the ability to actually come up with new ideas. There's actually a career, the main job of the protagonist in the book, his job is a salter. And his job is to salt AI algorithms to introduce entropy so they can come up with new ideas. >> Okay, interesting. >> So based off the sort of chaos theory. >> Like chaos monkey, right? >> Yeah. And that's really what you're trying to do is like, okay, react to things that are happening because you can't just come up with them on their own. There's a whole, I don't want to bore you, but there's a whole bunch of stuff in the book about how that works. >> It's like hand-carving ideas that are then mass produced by machines. >> Yeah, I don't know if you guys are going to have Simon Crosby on here, he's kind of like an expert on that. He was the Dean of Kings College, which is where Turing came from. So he really knows a lot about that. He's got a lot of strong ideas about it. But I learned a lot from him in that regard. There's a lot of like, the snarky spirit of Simon Crosby lives on in my book somewhere. But he's just funny cause he's, coming from that field, he immediately sees a lot of BS right off the bat, whenever anybody's presenting. He's got like the ability to just cut through it. Because he understands what it would actually take to make that happen, you know? So I tried to preserve some of that in the book. >> That is refreshing in the tech industry. >> So Tal, I need to let you, you know, wrap this up. Give us a plug for the book, tell us, when are we going to be able to see this on the big screen? >> I don't know about the big screen, but the Punch Escrow is now available. You can get it on Amazon, Barnes and Noble, anywhere books are sold. It's been optioned by Lionsgate. The director attached to it is James Bovin, production team is Mandeville Productions. I'm very excited about it. Go check it out. It's a pretty quick read, reads like a technothriller. It's not too hard. And it's fun for the whole family. I think one of the coolest things about it is that the feedback I've been getting has been that it really is appealing to everybody. I've got mother-in-laws reading it, you know, it's pretty cool. Initially I sold it, my initial audience is like us, but it's kind of cool, like, Stu will finish the book, he'll give it to, you know, wife, daughter, anything, and they're really digging it. So it's kind of fun. >> Justin: Thanks a lot. >> Tal Klein, really appreciate you coming. Congratulations on the book, we look forward to the movie. Maybe, you know, we'll get the Cube involved down the road. (laughing) >> And we're giving away 75 copies of it here at Lakeside booth, if you guys want to come. >> Tal Klein, author of The Punch Escrow, also CMO of Lakeside, who is here in the thing. But yeah, (laughing) a lot of stuff. Justin and I will be back with more coverage here from VMWorld 2017. You're watching the Cube. (bright music)

Published Date : Aug 28 2017

SUMMARY :

Brought to you by VMWare but in a different role then we had. It's great for you to be able to find time (laughing) You were talking about things like, you know, So much of the things that we do are with our devices or Ready Player One, you know, you know, we talked, when I was younger you know, the problem with flying cars is that things like digital currency, you know, It's interesting, we look at, you know, of jobs, or the end of innovation So the Luddites did famously try because, you know, machines do a lot of welding So one of the examples that I use in the book (laughing) of copies that you sell. So I know what I value about them, you know? and you know, because this is an age of you being a special U2 fan. I'm curious, the genre, if you'd call it, The challenge with that is, you know, is the first draft of this novel was hard as nails. So the movie, how much will you be involved? He did the Muppets movie, you know, It's not going to look like Blade Runner Like you know, one of the big questions Because you know, it's like I don't understand that. I am the worst, because I got a security background too, because I saved the world from terrorists. I think that, you know, But it's the same you, I can't tell the difference. Well, you say you're dead, Artificial intelligence, you know, that comes with like, you know, Google building an app Now with Open AI, you know, Cause that's really what you need to like, So the fundamental thing that I don't think because you can't just come up with them on their own. that are then mass produced by machines. He's got like the ability to just cut through it. So Tal, I need to let you, you know, wrap this up. is that the feedback I've been getting has been Maybe, you know, we'll get the Cube involved down the road. at Lakeside booth, if you guys want to come. Justin and I will be back with more coverage here

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Todd LiebermanPERSON

0.99+

Justin WarrenPERSON

0.99+

Tal KleinPERSON

0.99+

James BovinPERSON

0.99+

JustinPERSON

0.99+

Alice Through the Looking GlassTITLE

0.99+

Andy WeirPERSON

0.99+

SpeilbergPERSON

0.99+

DannyPERSON

0.99+

75 copiesQUANTITY

0.99+

Howie SandersPERSON

0.99+

SiriTITLE

0.99+

Barnes and NobleORGANIZATION

0.99+

Flight of the ConcordsTITLE

0.99+

Hollywood Science ExchangeORGANIZATION

0.99+

VMWareORGANIZATION

0.99+

JarvisPERSON

0.99+

Hollywood Science and Entertainment ExchangeORGANIZATION

0.99+

Stu MinimanPERSON

0.99+

Pac ElsigerPERSON

0.99+

The Punch EscrowTITLE

0.99+

LionsgateORGANIZATION

0.99+

iPadCOMMERCIAL_ITEM

0.99+

CortanaTITLE

0.99+

ChristopherPERSON

0.99+

Simon CrosbyPERSON

0.99+

Wall-ETITLE

0.99+

TuringPERSON

0.99+

AppleORGANIZATION

0.99+

next weekDATE

0.99+

AmazonORGANIZATION

0.99+

District NineTITLE

0.99+

first draftQUANTITY

0.99+

two producersQUANTITY

0.99+

second firewallQUANTITY

0.99+

oneQUANTITY

0.99+

15 firewallsQUANTITY

0.99+

Mandeville ProductionsORGANIZATION

0.99+

third firewallQUANTITY

0.99+

TalPERSON

0.99+

five peopleQUANTITY

0.99+

GoogleORGANIZATION

0.99+

Ready Player OneTITLE

0.98+

first firewallQUANTITY

0.98+

Blade RunnerTITLE

0.98+

firstQUANTITY

0.98+

20sQUANTITY

0.98+

Iron ManTITLE

0.98+

first 500 peopleQUANTITY

0.98+

first three pagesQUANTITY

0.98+

The Fifth ElementTITLE

0.98+

todayDATE

0.98+

StuPERSON

0.98+

LooperTITLE

0.98+

40sQUANTITY

0.97+

GatticaTITLE

0.97+

McDonald'sORGANIZATION

0.97+

The MartianTITLE

0.97+

TwitterORGANIZATION

0.97+

IsaacsonPERSON

0.97+

50 years agoDATE

0.96+

MartianTITLE

0.96+

100 plus yearsQUANTITY

0.96+

VMWorld 2017EVENT

0.95+

GoTITLE

0.95+

UnixTITLE

0.94+

CubePERSON

0.94+

single concertQUANTITY

0.94+

HollywoodLOCATION

0.93+

one thingQUANTITY

0.93+

Kings CollegeORGANIZATION

0.92+

CubeCOMMERCIAL_ITEM

0.92+

U2ORGANIZATION

0.92+

McLarensORGANIZATION

0.92+

first dealQUANTITY

0.91+

VMworld 2017EVENT

0.9+

Key Pillars of a Modern Analytics & Monitoring Strategy for Hybrid Cloud


 

>> Good morning, everyone. My name is Sudip Datta. I head up product management for Infrastructure Management and Analytics at CA Technologies. Today I am going to talk about the key pillars for modern analytics and monitoring for hybrid cloud. So before we get started, let's set the context. Let's take a stock of where we are today. Today in terms of digital business, software is driving business. Software is the backbone, is the driving force for most of the business services. Whether you are a financial institution or a hospitality service or a health care service or even a restaurant service pizza, you are front-ended by software. And therefore the user experience is of paramount importance. Just to give you some factoids. Eighty-three percent of U.S. consumers say that the brand that, the frontal software portal is more important than the product itself. And the companies are reciprocating by putting a lot of emphasis on user experience, as you see in the second factoid. The third factoid, it's even more interesting that 53% of the users of a mobile app actually abandon the app if the app doesn't load within a specified time. So we all understand now the importance of user experience in today's business. So what's happening to the infrastructure underneath that's hosting these applications? The infrastructure itself is evolving, right? How? First of all, as we all know there is a huge movement, a huge shift towards cloud. Customers are adopting cloud for reasons of economy, agility and efficiency. And whether you are running on cloud or on prem, the architecture itself is getting more and more dynamic. On the server side we hear about server-less computing. More and more enterprises are adopting containers, could be Dockers or other containers. And on the networking side we see an adoption of software-defined networking. The logical overlay on top of the physical underlay is abstracting the network. While we see a huge shift, a movement towards cloud, it is also true that customers are also retaining some of their assets on prem, and that's why we talk about hybrid cloud. Hybrid cloud is a reality, and it's going to be a reality for the foreseeable future. Take for example a bank that has its systems of engagement on public cloud, and systems of records on prem deeply nested within their DNC. So the transaction, the end-to-end transaction has to traverse multiple clouds. Similarly we talk to customers who run their production tier one application on prem, while tier two and tier three desktop applications run on public cloud. So that's the reality. Multi-cloud dynamic environment is a reality of today. While that's a reality, they pose a serious challenge for IT operations. What are the challenges? Because of multiple clouds, because of assets spanning multiple data centers, multiple clouds, there are blind spots getting created. IT ops is often blindsided on things that are happening on the other side of the firewall. And as a result what's happening is they're late to react, and often they react to problems much later than their customers find it, and that's an embarrassment. The other thing that's happening is because of the dynamic nature of the cloud, things are ephemeral, things are dynamic, things come and go, assets come and go, IT ops is often in the business of keeping pace with these changes. They are reacting to these changes. They are trying to keep pace with these changes, and silo'd tools are not the way to go. They are trying to keep up with these changes, but they are failing in doing so. And as a result we see poor user experience, low productivity, capacity problems and delayed time to market. Now what's the solution? What is the solution to all these problems? So what we are recommending is a four-pronged solution, what we represent as four pillars. The first pillar is about dynamic policy-based configuration and discovery. The second one is unification of the monitoring and analytics. The third one is contextual intelligence, and the fourth one is integration and collaboration. Let's go through them one by one. First of all, in terms of dynamic policy-based configuration, why is it important? I was talking to a VP of IT last week, and he commented that the time to deploy the monitoring for an application is longer than the time to deploy the application itself, and that's a shame. That's a real shame because in today's world application needs to be monitored straight out of the box. This is compounded by the fact that once you deploy the application, the application today is dynamic, as I said, the cloud assets are dynamic. The topology changes, and monitoring tools need to keep pace with that changing topology. So we need automated discovery. We need API driven discovery, and we need policy-based monitoring for large scale standardization. And last but not the least, the policies need to be based on dynamic baselines. The age, the era of static thresholds is long over because static thresholds lead to false alerts, resulting in higher opics for IT, and IT personnel absolutely, absolutely want to move away from it. Unified monitoring and analytics. This morning I stumbled upon a Lincoln white paper which said 20 tools you need for your hybrid monitoring, and I was absolutely dumbfounded. Twenty tools? I mean, that's a conversation non-starter. So how do we rationalize the tools, minimize the silos, and bring them under single pane of glass, or at least minimal panes for glass for monitoring? So IT admins can have a coherent view of servers, storage, network and applications through a single pane of glass? And why is that important? It's important because it results in lesser blame game. Because of silo'd tools what happens is admins are often fighting with each other, blaming each other. Server admins think that it's a storage problem. The storage admin thinks it's a database problem, and they are pointing to each other, right? So the tools, the management tools should be a point of collaboration, not a point of contention. Talking about blame game, one area that often gets ignored is the area of fault management and monitoring. Why is it important? And I will give a specific example. Let's say you have 100 VMs, and all those VMs become unreachable as a result of router being down. The root cause of the problem therefore are not the VMs, but the router. So instead of generating 101 alarms, the management tool needs to be smart enough to generate one single alarm. And that's why fault management and root cause analysis is of paramount importance. It suppresses unnecessary noise and results in lesser blaming. Contextual intelligence. Now when we talk about the cloud administrator, the cloud admin, the cloud admin in the past were living in the cocoon of their hybrid infrastructure. They were managing the hybrid infrastructure, but in today's world to have an end-to-end visibility of the digital chain, they need to integrate with application performance management tools, APM, as well as what lies underneath, which is the network, so that they have an end-to-end visibility of what's happening in the whole digital chain. But that's not all. They also need what we call is the context of the application. I will give you a specific example. For example, if the server runs out of memory when a lot of end users log into the system, or run out of capacity when a particular marketing promotion is running, then the context really is the business that leads to a saturation in IT. So what you need is to capture all the data, whether they come from logs, whether they come from alarms, capacity events as well as business events, into a single analytics platform and perform analytics on top of it. And then augment it with machine learning and pattern recognition capabilities so that it will not only perform root cause analysis for what happened in the past, but you're also able to anticipate, predict and prevent future problems. The fourth pillar is collaboration and integration. IT ops in today's world doesn't and shouldn't run in a silo. IT ops need to interact with dev ops. Within dev ops developers need to interact with QA. Storage admins need to collaborate with server admins, database admins and various other admins. So the tools need to encourage and provide a platform for collaboration. Similarly IT tools, IT management tools should not run standalone. They need to integrate with other tools. For example, if you want monitoring straight out of the box, the monitoring needs to integrate with provisioning processes. The monitoring downstream needs to integrate with ticketing systems. So integration with other tools, whether third party or custom developed, whatever it is, it's very, very important. Having said that, having laid what the solution should be, what the prescription should be, how is CA Technologies gearing up for it? In CA we have the industry's most comprehensive, the richest portfolio of infrastructure management tools, which is capable of managing all forms of infrastructure, traditional, private cloud, public cloud. Just to give you an example, in private cloud we support the traditional VMs as well as hyper converged infrastructure like Nutanix. We support Docker and other forms of containers. In public cloud we support the monitoring of infrastructure as a service, platform as a service, software as a service. We support all the popular clouds, AWS, Azure, Office 365 on Azure, as well as Salesforce.com. In terms of network, out net ops tools manage the latest and greatest SDN and SD-WAN, the VMware SDN, the open stack SDN, in terms of SD-WAN Cisco, Viptella. If you are a hybrid cloud customer, then you are no longer blindsided on things that are happening on the cloud side because we integrate with tools like Ixia. And once we monitor all these tools, we provide value on top of it. First of all, we monitor not only performance, but also packet, flow, all the net ops attributes. Then on top of that we provide predictive insights and learning. And because of our presence in the application performance management space, we integrate with APM to provide application to infrastructure correlation. Finally our monitoring is integrally linked with our operational intelligence platform. So in CA we have an operational intelligence platform built around CA Jarvis technology, which is based on open source technology, Elastic Logstash and Kibana, supplemented by Hadoop and Spark. And what we are doing is we are ingesting data from our monitoring tools into this data lake to provide value added insights and intelligence. When we talk about big data we talk about the three Vs, the variety, the volume and the velocity of data. But there is a fourth V that we often ignore. That's the veracity of the data, the truthfulness of data. CA being a leader in monitoring space, we have been in the business of collecting and monitoring data for ages, and what we are doing is we are ingesting these data into the platform and provided value added analytics on top of it. If you can read the slide, it's also an open framework we have the APIs from for ingesting data from third-party sources as well. For example, if you have your business data, your business sentiment data, and if you want to correlate that with IT metrics, how your IT is keeping up with your business cycles, you can do that as well. Now some of the applications that we are building, and this product is in beta as you see, are correlation between the various events, IT events and business events, network events and server events. Contextual log analytics. The operative word is contextual. There are a plethora of tools in the market that perform log analytics, but log analytics in the context of a problem when you really need it is of paramount importance. Predictive capacity analytics. Again, capacity analytics is not only about trending, right? It's about what if analysis. What will happen to your infrastructure? Or can your infrastructure sustain the pressure if your business grows by 2X, for example? That kind of what if analysis we should be able to do. And finally machine learning, we are working on it. Out of box machine learning algorithm to make sure that problems are not only corrected after the fact, but we can predict problems. We can prevent the problems in future. So for those who may be listening to this might be wondering where do we start? If you are already a CA customer, you are familiar with CA tools, but if you're not, what's the starting point? So I would recommend the starting point is CA Unified Infrastructure Manager, which is the market leading tool for hybrid cloud management. And it's not a hollow claim that we are making, right? It has been testified, it has been blessed by customers and analysts alike. And you can see it was voted the cloud monitoring software of the year 2016 by a third party. And here are some of the customer experiences. NMSP, they were able to achieve 15% productivity improvement as a result of adopting UIM. A healthcare provider, their meantime to repair, MTTR, went down by 40% as a result of UIM. And a telecom provider, they had a faster adoption to cloud as a result of UIM, the reason being UIM gave them for the first time a single pane of glass to manage their on prem and cloud environments, which has been a detriment for them for adopting cloud. And once they were able to achieve that, they were able to switch onto cloud much, much faster. Finally, the infrastructure management capabilities that I talked about is now being delivered as a turnkey solution, as a SAS solution, which we call digital experience insights. And I strongly, strongly encourage you to try UIM via CA digital experience insights, and here is the URL. You can go and sign up for the trial. With that, thank you.

Published Date : Aug 22 2017

SUMMARY :

And on the networking side we see an adoption of

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
101 alarmsQUANTITY

0.99+

100 VMsQUANTITY

0.99+

53%QUANTITY

0.99+

20 toolsQUANTITY

0.99+

Twenty toolsQUANTITY

0.99+

15%QUANTITY

0.99+

Eighty-three percentQUANTITY

0.99+

second factoidQUANTITY

0.99+

fourth VQUANTITY

0.99+

40%QUANTITY

0.99+

CALOCATION

0.99+

third factoidQUANTITY

0.99+

fourth pillarQUANTITY

0.99+

first pillarQUANTITY

0.99+

2XQUANTITY

0.99+

last weekDATE

0.99+

CA TechnologiesORGANIZATION

0.99+

TodayDATE

0.99+

AWSORGANIZATION

0.99+

CiscoORGANIZATION

0.99+

NMSPORGANIZATION

0.99+

four pillarsQUANTITY

0.98+

2016DATE

0.98+

third oneQUANTITY

0.98+

first timeQUANTITY

0.98+

Sudip DattaPERSON

0.98+

fourth oneQUANTITY

0.98+

HadoopORGANIZATION

0.98+

todayDATE

0.98+

FirstQUANTITY

0.97+

Office 365TITLE

0.97+

one single alarmQUANTITY

0.97+

second oneQUANTITY

0.97+

Elastic LogstashORGANIZATION

0.96+

AzureTITLE

0.96+

UIMORGANIZATION

0.95+

single paneQUANTITY

0.95+

LincolnORGANIZATION

0.95+

U.S.LOCATION

0.95+

KibanaORGANIZATION

0.95+

This morningDATE

0.95+

three VsQUANTITY

0.93+

one areaQUANTITY

0.87+

oneQUANTITY

0.86+

ViptellaORGANIZATION

0.84+

VMwareTITLE

0.82+

NutanixORGANIZATION

0.81+

single analyticsQUANTITY

0.8+

SparkORGANIZATION

0.75+

four-prongedQUANTITY

0.69+

Salesforce.comORGANIZATION

0.67+

DockerTITLE

0.67+

tier threeQUANTITY

0.62+

CAORGANIZATION

0.61+

IxiaTITLE

0.6+

tier twoQUANTITY

0.57+

JarvisORGANIZATION

0.56+

APMORGANIZATION

0.54+

premORGANIZATION

0.53+

tier oneQUANTITY

0.53+