Image Title

Search Results for fourth CDO gig:

Phil Kippen, Snowflake, Dave Whittington, AT&T & Roddy Tranum, AT&T | | MWC Barcelona 2023


 

(gentle music) >> Narrator: "TheCUBE's" live coverage is made possible by funding from Dell Technologies, creating technologies that drive human progress. (upbeat music) >> Hello everybody, welcome back to day four of "theCUBE's" coverage of MWC '23. We're here live at the Fira in Barcelona. Wall-to-wall coverage, John Furrier is in our Palo Alto studio, banging out all the news. Really, the whole week we've been talking about the disaggregation of the telco network, the new opportunities in telco. We're really excited to have AT&T and Snowflake here. Dave Whittington is the AVP, at the Chief Data Office at AT&T. Roddy Tranum is the Assistant Vice President, for Channel Performance Data and Tools at AT&T. And Phil Kippen, the Global Head Of Industry-Telecom at Snowflake, Snowflake's new telecom business. Snowflake just announced earnings last night. Typical Scarpelli, they beat earnings, very conservative guidance, stocks down today, but we like Snowflake long term, they're on that path to 10 billion. Guys, welcome to "theCUBE." Thanks so much >> Phil: Thank you. >> for coming on. >> Dave and Roddy: Thanks Dave. >> Dave, let's start with you. The data culture inside of telco, We've had this, we've been talking all week about this monolithic system. Super reliable. You guys did a great job during the pandemic. Everything shifting to landlines. We didn't even notice, you guys didn't miss a beat. Saved us. But the data culture's changing inside telco. Explain that. >> Well, absolutely. So, first of all IoT and edge processing is bringing forth new and exciting opportunities all the time. So, we're bridging the world between a lot of the OSS stuff that we can do with edge processing. But bringing that back, and now we're talking about working, and I would say traditionally, we talk data warehouse. Data warehouse and big data are now becoming a single mesh, all right? And the use cases and the way you can use those, especially I'm taking that edge data and bringing it back over, now I'm running AI and ML models on it, and I'm pushing back to the edge, and I'm combining that with my relational data. So that mesh there is making all the difference. We're getting new use cases that we can do with that. And it's just, and the volume of data is immense. >> Now, I love ChatGPT, but I'm hoping your data models are more accurate than ChatGPT. I never know. Sometimes it's really good, sometimes it's really bad. But enterprise, you got to be clean with your AI, don't you? >> Not only you have to be clean, you have to monitor it for bias and be ethical about it. We're really good about that. First of all with AT&T, our brand is Platinum. We take care of that. So, we may not be as cutting-edge risk takers as others, but when we go to market with an AI or an ML or a product, it's solid. >> Well hey, as telcos go, you guys are leaning into the Cloud. So I mean, that's a good starting point. Roddy, explain your role. You got an interesting title, Channel Performance Data and Tools, what's that all about? >> So literally anything with our consumer, retail, concenters' channels, all of our channels, from a data perspective and metrics perspective, what it takes to run reps, agents, all the way to leadership levels, scorecards, how you rank in the business, how you're driving the business, from sales, service, customer experience, all that data infrastructure with our great partners on the CDO side, as well as Snowflake, that comes from my team. >> And that's traditionally been done in a, I don't mean the pejorative, but we're talking about legacy, monolithic, sort of data warehouse technologies. >> Absolutely. >> We have a love-hate relationship with them. It's what we had. It's what we used, right? And now that's evolving. And you guys are leaning into the Cloud. >> Dramatic evolution. And what Snowflake's enabled for us is impeccable. We've talked about having, people have dreamed of one data warehouse for the longest time and everything in one system. Really, this is the only way that becomes a reality. The more you get in Snowflake, we can have golden source data, and instead of duplicating that 50 times across AT&T, it's in one place, we just share it, everybody leverages it, and now it's not duplicated, and the process efficiency is just incredible. >> But it really hinges on that separation of storage and compute. And we talk about the monolithic warehouse, and one of the nightmares I've lived with, is having a monolithic warehouse. And let's just go with some of my primary, traditional customers, sales, marketing and finance. They are leveraging BSS OSS data all the time. For me to coordinate a deployment, I have to make sure that each one of these units can take an outage, if it's going to be a long deployment. With the separation of storage, compute, they own their own compute cluster. So I can move faster for these people. 'Cause if finance, I can implement his code without impacting finance or marketing. This brings in CI/CD to more reality. It brings us faster to market with more features. So if he wants to implement a new comp plan for the field reps, or we're reacting to the marketplace, where one of our competitors has done something, we can do that in days, versus waiting weeks or months. >> And we've reported on this a lot. This is the brilliance of Snowflake's founders, that whole separation >> Yep. >> from compute and data. I like Dave, that you're starting with sort of the business flexibility, 'cause there's a cost element of this too. You can dial down, you can turn off compute, and then of course the whole world said, "Hey, that's a good idea." And a VC started throwing money at Amazon, but Redshift said, "Oh, we can do that too, sort of, can't turn off the compute." But I want to ask you Phil, so, >> Sure. >> it looks from my vantage point, like you're taking your Data Cloud message which was originally separate compute from storage simplification, now data sharing, automated governance, security, ultimately the marketplace. >> Phil: Right. >> Taking that same model, break down the silos into telecom, right? It's that same, >> Mm-hmm. >> sorry to use the term playbook, Frank Slootman tells me he doesn't use playbooks, but he's not a pattern matcher, but he's a situational CEO, he says. But the situation in telco calls for that type of strategy. So explain what you guys are doing in telco. >> I think there's, so, what we're launching, we launched last week, and it really was three components, right? So we had our platform as you mentioned, >> Dave: Mm-hmm. >> and that platform is being utilized by a number of different companies today. We also are adding, for telecom very specifically, we're adding capabilities in marketplace, so that service providers can not only use some of the data and apps that are in marketplace, but as well service providers can go and sell applications or sell data that they had built. And then as well, we're adding our ecosystem, it's telecom-specific. So, we're bringing partners in, technology partners, and consulting and services partners, that are very much focused on telecoms and what they do internally, but also helping them monetize new services. >> Okay, so it's not just sort of generic Snowflake into telco? You have specific value there. >> We're purposing the platform specifically for- >> Are you a telco guy? >> I am. You are, okay. >> Total telco guy absolutely. >> So there you go. You see that Snowflake is actually an interesting organizational structure, 'cause you're going after verticals, which is kind of rare for a company of your sort of inventory, I'll say, >> Absolutely. >> I don't mean that as a negative. (Dave laughs) So Dave, take us through the data journey at AT&T. It's a long history. You don't have to go back to the 1800s, but- (Dave laughs) >> Thank you for pointing out, we're a 149-year-old company. So, Jesse James was one of the original customers, (Dave laughs) and we have no longer got his data. So, I'll go back. I've been 17 years singular AT&T, and I've watched it through the whole journey of, where the monolithics were growing, when the consolidation of small, wireless carriers, and we went through that boom. And then we've gone through mergers and acquisitions. But, Hadoop came out, and it was going to solve all world hunger. And we had all the aspects of, we're going to monetize and do AI and ML, and some of the things we learned with Hadoop was, we had this monolithic warehouse, we had this file-based-structured Hadoop, but we really didn't know how to bring this all together. And we were bringing items over to the relational, and we were taking the relational and bringing it over to the warehouse, and trying to, and it was a struggle. Let's just go there. And I don't think we were the only company to struggle with that, but we learned a lot. And so now as tech is finally emerging, with the cloud, companies like Snowflake, and others that can handle that, where we can create, we were discussing earlier, but it becomes more of a conducive mesh that's interoperable. So now we're able to simplify that environment. And the cloud is a big thing on that. 'Cause you could not do this on-prem with on-prem technologies. It would be just too cost prohibitive, and too heavy of lifting, going back and forth, and managing the data. The simplicity the cloud brings with a smaller set of tools, and I'll say in the data space specifically, really allows us, maybe not a single instance of data for all use cases, but a greatly reduced ecosystem. And when you simplify your ecosystem, you simplify speed to market and data management. >> So I'm going to ask you, I know it's kind of internal organizational plumbing, but it'll inform my next question. So, Dave, you're with the Chief Data Office, and Roddy, you're kind of, you all serve in the business, but you're really serving the, you're closer to those guys, they're banging on your door for- >> Absolutely. I try to keep the 130,000 users who may or may not have issues sometimes with our data and metrics, away from Dave. And he just gets a call from me. >> And he only calls when he has a problem. He's never wished me happy birthday. (Dave and Phil laugh) >> So the reason I asked that is because, you describe Dave, some of the Hadoop days, and again love-hate with that, but we had hyper-specialized roles. We still do. You've got data engineers, data scientists, data analysts, and you've got this sort of this pipeline, and it had to be this sequential pipeline. I know Snowflake and others have come to simplify that. My question to you is, how is that those roles, how are those roles changing? How is data getting closer to the business? Everybody talks about democratizing business. Are you doing that? What's a real use example? >> From our perspective, those roles, a lot of those roles on my team for years, because we're all about efficiency, >> Dave: Mm-hmm. >> we cut across those areas, and always have cut across those areas. So now we're into a space where things have been simplified, data processes and copying, we've gone from 40 data processes down to five steps now. We've gone from five steps to one step. We've gone from days, now take hours, hours to minutes, minutes to seconds. Literally we're seeing that time in and time out with Snowflake. So these resources that have spent all their time on data engineering and moving data around, are now freed up more on what they have skills for and always have, the data analytics area of the business, and driving the business forward, and new metrics and new analysis. That's some of the great operational value that we've seen here. As this simplification happens, it frees up brain power. >> So, you're pumping data from the OSS, the BSS, the OKRs everywhere >> Everywhere. >> into Snowflake? >> Scheduling systems, you name it. If you can think of what drives our retail and centers and online, all that data, scheduling system, chat data, call center data, call detail data, all of that enters into this common infrastructure to manage the business on a day in and day out basis. >> How are the roles and the skill sets changing? 'Cause you're doing a lot less ETL, you're doing a lot less moving of data around. There were guys that were probably really good at that. I used to joke in the, when I was in the storage world, like if your job is bandaging lungs, you need to look for a new job, right? So, and they did and people move on. So, are you able to sort of redeploy those assets, and those people, those human resources? >> These folks are highly skilled. And we were talking about earlier, SQL hasn't gone away. Relational databases are not going away. And that's one thing that's made this migration excellent, they're just transitioning their skills. Experts in legacy systems are now rapidly becoming experts on the Snowflake side. And it has not been that hard a transition. There are certainly nuances, things that don't operate as well in the cloud environment that we have to learn and optimize. But we're making that transition. >> Dave: So just, >> Please. >> within the Chief Data Office we have a couple of missions, and Roddy is a great partner and an example of how it works. We try to bring the data for democratization, so that we have one interface, now hopefully know we just have a logical connection back to these Snowflake instances that we connect. But we're providing that governance and cleansing, and if there's a business rule at the enterprise level, we provide it. But the goal at CDO is to make sure that business units like Roddy or marketing or finance, that they can come to a platform that's reliable, robust, and self-service. I don't want to be in his way. So I feel like I'm providing a sub-level of platform, that he can come to and anybody can come to, and utilize, that they're not having to go back and undo what's in Salesforce, or ServiceNow, or in our billers. So, I'm sort of that layer. And then making sure that that ecosystem is robust enough for him to use. >> And that self-service infrastructure is predominantly through the Azure Cloud, correct? >> Dave: Absolutely. >> And you work on other clouds, but it's predominantly through Azure? >> We're predominantly in Azure, yeah. >> Dave: That's the first-party citizen? >> Yeah. >> Okay, I like to think in terms sometimes of data products, and I know you've mentioned upfront, you're Gold standard or Platinum standard, you're very careful about personal information. >> Dave: Yeah. >> So you're not trying to sell, I'm an AT&T customer, you're not trying to sell my data, and make money off of my data. So the value prop and the business case for Snowflake is it's simpler. You do things faster, you're in the cloud, lower cost, et cetera. But I presume you're also in the business, AT&T, of making offers and creating packages for customers. I look at those as data products, 'cause it's not a, I mean, yeah, there's a physical phone, but there's data products behind it. So- >> It ultimately is, but not everybody always sees it that way. Data reporting often can be an afterthought. And we're making it more on the forefront now. >> Yeah, so I like to think in terms of data products, I mean even if the financial services business, it's a data business. So, if we can think about that sort of metaphor, do you see yourselves as data product builders? Do you have that, do you think about building products in that regard? >> Within the Chief Data Office, we have a data product team, >> Mm-hmm. >> and by the way, I wouldn't be disingenuous if I said, oh, we're very mature in this, but no, it's where we're going, and it's somewhat of a journey, but I've got a peer, and their whole job is to go from, especially as we migrate from cloud, if Roddy or some other group was using tables three, four and five and joining them together, it's like, "Well look, this is an offer for data product, so let's combine these and put it up in the cloud, and here's the offer data set product, or here's the opportunity data product," and it's a journey. We're on the way, but we have dedicated staff and time to do this. >> I think one of the hardest parts about that is the organizational aspects of it. Like who owns the data now, right? It used to be owned by the techies, and increasingly the business lines want to have access, you're providing self-service. So there's a discussion about, "Okay, what is a data product? Who's responsible for that data product? Is it in my P&L or your P&L? Somebody's got to sign up for that number." So, it sounds like those discussions are taking place. >> They are. And, we feel like we're more the, and CDO at least, we feel more, we're like the guardians, and the shepherds, but not the owners. I mean, we have a role in it all, but he owns his metrics. >> Yeah, and even from our perspective, we see ourselves as an enabler of making whatever AT&T wants to make happen in terms of the key products and officers' trade-in offers, trade-in programs, all that requires this data infrastructure, and managing reps and agents, and what they do from a channel performance perspective. We still ourselves see ourselves as key enablers of that. And we've got to be flexible, and respond quickly to the business. >> I always had empathy for the data engineer, and he or she had to service all these different lines of business with no business context. >> Yeah. >> Like the business knows good data from bad data, and then they just pound that poor individual, and they're like, "Okay, I'm doing my best. It's just ones and zeros to me." So, it sounds like that's, you're on that path. >> Yeah absolutely, and I think, we do have refined, getting more and more refined owners of, since Snowflake enables these golden source data, everybody sees me and my organization, channel performance data, go to Roddy's team, we have a great team, and we go to Dave in terms of making it all happen from a data infrastructure perspective. So we, do have a lot more refined, "This is where you go for the golden source, this is where it is, this is who owns it. If you want to launch this product and services, and you want to manage reps with it, that's the place you-" >> It's a strong story. So Chief Data Office doesn't own the data per se, but it's your responsibility to provide the self-service infrastructure, and make sure it's governed properly, and in as automated way as possible. >> Well, yeah, absolutely. And let me tell you more, everybody talks about single version of the truth, one instance of the data, but there's context to that, that we are taking, trying to take advantage of that as we do data products is, what's the use case here? So we may have an entity of Roddy as a prospective customer, and we may have a entity of Roddy as a customer, high-value customer over here, which may have a different set of mix of data and all, but as a data product, we can then create those for those specific use cases. Still point to the same data, but build it in different constructs. One for marketing, one for sales, one for finance. By the way, that's where your data engineers are struggling. >> Yeah, yeah, of course. So how do I serve all these folks, and really have the context-common story in telco, >> Absolutely. >> or are these guys ahead of the curve a little bit? Or where would you put them? >> I think they're definitely moving a lot faster than the industry is generally. I think the enabling technologies, like for instance, having that single copy of data that everybody sees, a single pane of glass, right, that's definitely something that everybody wants to get to. Not many people are there. I think, what AT&T's doing, is most definitely a little bit further ahead than the industry generally. And I think the successes that are coming out of that, and the learning experiences are starting to generate momentum within AT&T. So I think, it's not just about the product, and having a product now that gives you a single copy of data. It's about the experiences, right? And now, how the teams are getting trained, domains like network engineering for instance. They typically haven't been a part of data discussions, because they've got a lot of data, but they're focused on the infrastructure. >> Mm. >> So, by going ahead and deploying this platform, for platform's purpose, right, and the business value, that's one thing, but also to start bringing, getting that experience, and bringing new experience in to help other groups that traditionally hadn't been data-centric, that's also a huge step ahead, right? So you need to enable those groups. >> A big complaint of course we hear at MWC from carriers is, "The over-the-top guys are killing us. They're riding on our networks, et cetera, et cetera. They have all the data, they have all the client relationships." Do you see your client relationships changing as a result of sort of your data culture evolving? >> Yes, I'm not sure I can- >> It's a loaded question, I know. >> Yeah, and then I, so, we want to start embedding as much into our network on the proprietary value that we have, so we can start getting into that OTT play, us as any other carrier, we have distinct advantages of what we can do at the edge, and we just need to start exploiting those. But you know, 'cause whether it's location or whatnot, so we got to eat into that. Historically, the network is where we make our money in, and we stack the services on top of it. It used to be *69. >> Dave: Yeah. >> If anybody remembers that. >> Dave: Yeah, of course. (Dave laughs) >> But you know, it was stacked on top of our network. Then we stack another product on top of it. It'll be in the edge where we start providing distinct values to other partners as we- >> I mean, it's a great business that you're in. I mean, if they're really good at connectivity. >> Dave: Yeah. >> And so, it sounds like it's still to be determined >> Dave: Yeah. >> where you can go with this. You have to be super careful with private and for personal information. >> Dave: Yep. >> Yeah, but the opportunities are enormous. >> There's a lot. >> Yeah, particularly at the edge, looking at, private networks are just an amazing opportunity. Factories and name it, hospital, remote hospitals, remote locations. I mean- >> Dave: Connected cars. >> Connected cars are really interesting, right? I mean, if you start communicating car to car, and actually drive that, (Dave laughs) I mean that's, now we're getting to visit Xen Fault Tolerance people. This is it. >> Dave: That's not, let's hold the traffic. >> Doesn't scare me as much as we actually learn. (all laugh) >> So how's the show been for you guys? >> Dave: Awesome. >> What're your big takeaways from- >> Tremendous experience. I mean, someone who doesn't go outside the United States much, I'm a homebody. The whole experience, the whole trip, city, Mobile World Congress, the technologies that are out here, it's been a blast. >> Anything, top two things you learned, advice you'd give to others, your colleagues out in general? >> In general, we talked a lot about technologies today, and we talked a lot about data, but I'm going to tell you what, the accelerator that you cannot change, is the relationship that we have. So when the tech and the business can work together toward a common goal, and it's a partnership, you get things done. So, I don't know how many CDOs or CIOs or CEOs are out there, but this connection is what accelerates and makes it work. >> And that is our audience Dave. I mean, it's all about that alignment. So guys, I really appreciate you coming in and sharing your story in "theCUBE." Great stuff. >> Thank you. >> Thanks a lot. >> All right, thanks everybody. Thank you for watching. I'll be right back with Dave Nicholson. Day four SiliconANGLE's coverage of MWC '23. You're watching "theCUBE." (gentle music)

Published Date : Mar 2 2023

SUMMARY :

that drive human progress. And Phil Kippen, the Global But the data culture's of the OSS stuff that we But enterprise, you got to be So, we may not be as cutting-edge Channel Performance Data and all the way to leadership I don't mean the pejorative, And you guys are leaning into the Cloud. and the process efficiency and one of the nightmares I've lived with, This is the brilliance of the business flexibility, like you're taking your Data Cloud message But the situation in telco and that platform is being utilized You have specific value there. I am. So there you go. I don't mean that as a negative. and some of the things we and Roddy, you're kind of, And he just gets a call from me. (Dave and Phil laugh) and it had to be this sequential pipeline. and always have, the data all of that enters into How are the roles and in the cloud environment that But the goal at CDO is to and I know you've mentioned upfront, So the value prop and the on the forefront now. I mean even if the and by the way, I wouldn't and increasingly the business and the shepherds, but not the owners. and respond quickly to the business. and he or she had to service Like the business knows and we go to Dave in terms doesn't own the data per se, and we may have a entity and really have the and having a product now that gives you and the business value, that's one thing, They have all the data, on the proprietary value that we have, Dave: Yeah, of course. It'll be in the edge business that you're in. You have to be super careful Yeah, but the particularly at the edge, and actually drive that, let's hold the traffic. much as we actually learn. the whole trip, city, is the relationship that we have. and sharing your story in "theCUBE." Thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Dave WhittingtonPERSON

0.99+

Frank SlootmanPERSON

0.99+

RoddyPERSON

0.99+

AmazonORGANIZATION

0.99+

PhilPERSON

0.99+

Phil KippenPERSON

0.99+

AT&TORGANIZATION

0.99+

Jesse JamesPERSON

0.99+

AT&T.ORGANIZATION

0.99+

five stepsQUANTITY

0.99+

Dave NicholsonPERSON

0.99+

John FurrierPERSON

0.99+

50 timesQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

Roddy TranumPERSON

0.99+

10 billionQUANTITY

0.99+

one stepQUANTITY

0.99+

17 yearsQUANTITY

0.99+

130,000 usersQUANTITY

0.99+

United StatesLOCATION

0.99+

1800sDATE

0.99+

last weekDATE

0.99+

BarcelonaLOCATION

0.99+

Palo AltoLOCATION

0.99+

Dell TechnologiesORGANIZATION

0.99+

last nightDATE

0.99+

MWC '23EVENT

0.98+

telcoORGANIZATION

0.98+

one systemQUANTITY

0.98+

oneQUANTITY

0.98+

40 data processesQUANTITY

0.98+

todayDATE

0.98+

one placeQUANTITY

0.97+

P&LORGANIZATION

0.97+

telcosORGANIZATION

0.97+

CDOORGANIZATION

0.97+

149-year-oldQUANTITY

0.97+

fiveQUANTITY

0.97+

singleQUANTITY

0.96+

three componentsQUANTITY

0.96+

OneQUANTITY

0.96+

Kirk Haslbeck, Collibra, Data Citizens 22


 

(atmospheric music) >> Welcome to theCUBE Coverage of Data Citizens 2022 Collibra's Customer event. My name is Dave Vellante. With us is Kirk Haslbeck, who's the Vice President of Data Quality of Collibra. Kirk, good to see you, welcome. >> Thanks for having me, Dave. Excited to be here. >> You bet. Okay, we're going to discuss data quality, observability. It's a hot trend right now. You founded a data quality company, OwlDQ, and it was acquired by Collibra last year. Congratulations. And now you lead data quality at Collibra. So we're hearing a lot about data quality right now. Why is it such a priority? Take us through your thoughts on that. >> Yeah, absolutely. It's definitely exciting times for data quality which you're right, has been around for a long time. So why now? And why is it so much more exciting than it used to be? I think it's a bit stale, but we all know that companies use more data than ever before, and the variety has changed and the volume has grown. And while I think that remains true there are a couple other hidden factors at play that everyone's so interested in as to why this is becoming so important now. And I guess you could kind of break this down simply and think about if Dave you and I were going to build a new healthcare application and monitor the heartbeat of individuals, imagine if we get that wrong, what the ramifications could be, what those incidents would look like. Or maybe better yet, we try to build a new trading algorithm with a crossover strategy where the 50 day crosses the 10 day average. And imagine if the data underlying the inputs to that is incorrect. We will probably have major financial ramifications in that sense. So, kind of starts there, where everybody's realizing that we're all data companies, and if we are using bad data we're likely making incorrect business decisions. But I think there's kind of two other things at play. I bought a car not too long ago and my dad called and said, "How many cylinders does it have?" And I realized in that moment, I might have failed him cause I didn't know. And I used to ask those types of questions about any lock breaks and cylinders, and if it's manual or automatic. And I realized, I now just buy a car that I hope works. And it's so complicated with all the computer chips. I really don't know that much about it. And that's what's happening with data. We're just loading so much of it. And it's so complex that the way companies consume them in the IT function is that they bring in a lot of data and then they syndicate it out to the business. And it turns out that the individuals loading and consuming all of this data for the company actually may not know that much about the data itself and that's not even their job anymore. So, we'll talk more about that in a minute, but that's really what's setting the foreground for this observability play and why everybody's so interested. It's because we're becoming less close to the intricacies of the data and we just expect it to always be there and be correct. >> You know, the other thing too about data quality, and for years we did the MIT, CDO, IQ event. We didn't do it last year at COVID, messed everything up. But the observation I would make there, your thoughts is, data quality used to be information quality, used to be this back office function, and then it became sort of front office with financial services, and government and healthcare, these highly regulated industries. And then the whole chief data officer thing happened and people were realizing, well they sort of flipped the bit from sort of a data as a risk to data as an asset. And now as we say, we're going to talk about observability. And so it's really become front and center, just the whole quality issue because data's so fundamental, hasn't it? >> Yeah, absolutely. I mean, let's imagine we pull up our phones right now and I go to my favorite stock ticker app, and I check out the Nasdaq market cap. I really have no idea if that's the correct number. I know it's a number, it looks large, it's in a numeric field. And that's kind of what's going on. There's so many numbers and they're coming from all of these different sources, and data providers, and they're getting consumed and passed along. But there isn't really a way to tactically put controls on every number and metric across every field we plan to monitor, but with the scale that we've achieved in early days, even before Collibra. And what's been so exciting is, we have these types of observation techniques, these data monitors that can actually track past performance of every field at scale. And why that's so interesting, and why I think the CDO is listening right intently nowadays to this topic is, so maybe we could surface all of these problems with the right solution of data observability and with the right scale, and then just be alerted on breaking trends. So we're sort of shifting away from this world of must write a condition and then when that condition breaks that was always known as a break record. But what about breaking trends and root cause analysis? And is it possible to do that with less human intervention? And so I think most people are seeing now that it's going to have to be a software tool and a computer system. It's not ever going to be based on one or two domain experts anymore. >> So how does data observability relate to data quality? Are they sort of two sides of the same coin? Are they cousins? What's your perspective on that? >> Yeah, it's super interesting. It's an emerging market. So the language is changing, a lot of the topic and areas changing. The way that I like to say it or break it down because the lingo is constantly moving, as a target on the space is really breaking records versus breaking trends. And I could write a condition when this thing happens it's wrong, and when it doesn't it's correct. Or I could look for a trend and I'll give you a good example. Everybody's talking about fresh data and stale data, and why would that matter? Well, if your data never arrived, or only part of it arrived, or didn't arrive on time, it's likely stale, and there will not be a condition that you could write that would show you all the good and the bads. That was kind of your traditional approach of data quality break records. But your modern day approach is you lost a significant portion of your data, or it did not arrive on time to make that decision accurately on time. And that's a hidden concern. Some people call this freshness, we call it stale data. But it all points to the same idea of the thing that you're observing may not be a data quality condition anymore. It may be a breakdown in the data pipeline. And with thousands of data pipelines in play for every company out there, there's more than a couple of these happening every day. >> So what's the Collibra angle on all this stuff? Made the acquisition, you got data quality, observability coming together. You guys have a lot of expertise in this area, but you hear providence of data. You just talked about stale data, the whole trend toward realtime. How is Collibra approaching the problem and what's unique about your approach? >> Well I think where we're fortunate is with our background. Myself and team, we sort of lived this problem for a long time in the Wall Street days about a decade ago. And we saw it from many different angles. And what we came up with, before it was called data observability or reliability, was basically the underpinnings of that. So we're a little bit ahead of the curve there when most people evaluate our solution. It's more advanced than some of the observation techniques that currently exist. But we've also always covered data quality and we believe that people want to know more, they need more insights. And they want to see break records and breaking trends together, so they can correlate the root cause. And we hear that all the time. "I have so many things going wrong just show me the big picture. Help me find the thing that if I were to fix it today would make the most impact." So we're really focused on root cause analysis, business impact, connecting it with lineage and catalog metadata. And as that grows you can actually achieve total data governance. At this point with the acquisition of what was a Lineage company years ago, and then my company OwlDQ, now Collibra Data Quality. Collibra may be the best positioned for total data governance and intelligence in the space. >> Well, you mentioned financial services a couple of times and some examples, remember the flash crash in 2010. Nobody had any idea what that was. They would just say, "Oh, it's a glitch." So they didn't understand the root cause of it. So this is a really interesting topic to me. So we know at Data Citizens 22 that you're announcing, you got to announce new products, right? It is your yearly event. What's new? Give us a sense as to what products are coming out but specifically around data quality and observability. >> Absolutely. There's this, there's always a next thing on the forefront. And the one right now is these hyperscalers in the cloud. So you have databases like Snowflake and BigQuery, and Databricks, Delta Lake and SQL Pushdown. And ultimately what that means is a lot of people are storing in loading data even faster in a SaaS like model. And we've started to hook into these databases, and while we've always worked with the same databases in the past they're supported today. We're doing something called Native Database pushdown, where the entire compute and data activity happens in the database. And why that is so interesting and powerful now? Is everyone's concerned with something called Egress. Did my data that I've spent all this time and money with my security team securing ever leave my hands, did it ever leave my secure VPC as they call it? And with these native integrations that we're building and about to unveil here as kind of a sneak peak for next week at Data Citizens, we're now doing all compute and data operations in databases like Snowflake. And what that means is with no install and no configuration you could log into the Collibra data quality app and have all of your data quality running inside the database that you've probably already picked as your go forward team selection secured database of choice. So we're really excited about that. And I think if you look at the whole landscape of network cost, egress cost, data storage and compute, what people are realizing is it's extremely efficient to do it in the way that we're about to release here next week. >> So this is interesting because what you just described, you mentioned Snowflake, you mentioned Google, oh actually you mentioned yeah, Databricks. You know, Snowflake has the data cloud. If you put everything in the data cloud, okay, you're cool. But then Google's got the open data cloud. If you heard, Google next. And now Databricks doesn't call it the data cloud, but they have like the open source data cloud. So you have all these different approaches and there's really no way, up until now I'm hearing, to really understand the relationships between all those and have confidence across, it's like yamarket AMI, you should just be a note on the mesh. I don't care if it's a data warehouse or a data lake, or where it comes from, but it's a point on that mesh and I need tooling to be able to have confidence that my data is governed and has the proper lineage, providence. And that's what you're bringing to the table. Is that right? Did I get that right? >> Yeah, that's right. And it's, for us, it's not that we haven't been working with those great cloud databases, but it's the fact that we can send them the instructions now we can send them the operating ability to crunch all of the calculations, the governance, the quality, and get the answers. And what that's doing, it's basically zero network cost, zero egress cost, zero latency of time. And so when you were to log into BigQuery tomorrow using our tool, or say Snowflake for example, you have instant data quality metrics, instant profiling, instant lineage in access, privacy controls, things of that nature that just become less onerous. What we're seeing is there's so much technology out there just like all of the major brands that you mentioned but how do we make it easier? The future is about less clicks, faster time to value, faster scale, and eventually lower cost. And we think that this positions us to be the leader there. >> I love this example because, we've got talks about well the cloud guys you're going to own the world. And of course now we're seeing that the ecosystem is finding so much white space to add value connect across cloud. Sometimes we call it super cloud and so, or inter clouding. Alright, Kirk, give us your final thoughts on the trends that we've talked about and data Citizens 22. >> Absolutely. Well I think, one big trend is discovery and classification. Seeing that across the board, people used to know it was a zip code and nowadays with the amount of data that's out there they want to know where everything is, where their sensitive data is, if it's redundant, tell me everything inside of three to five seconds. And with that comes, they want to know in all of these hyperscale databases how fast they can get controls and insights out of their tools. So I think we're going to see more one click solutions, more SaaS based solutions, and solutions that hopefully prove faster time to value on all of these modern cloud platforms. >> Excellent. All right, Kirk Haslbeck, thanks so much for coming on theCUBE and previewing Data Citizens 22. Appreciate it. >> Thanks for having me, Dave. >> You're welcome. All right. And thank you for watching. Keep it right there for more coverage from theCUBE. (atmospheric music)

Published Date : Nov 2 2022

SUMMARY :

Kirk, good to see you, welcome. Excited to be here. And now you lead data quality at Collibra. And it's so complex that the And now as we say, we're going and I check out the Nasdaq market cap. of the thing that you're observing and what's unique about your approach? ahead of the curve there and some examples, And the one right now is these and has the proper lineage, providence. and get the answers. And of course now we're and solutions that hopefully and previewing Data Citizens 22. And thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

DavePERSON

0.99+

CollibraORGANIZATION

0.99+

2010DATE

0.99+

Kirk HaslbeckPERSON

0.99+

oneQUANTITY

0.99+

OwlDQORGANIZATION

0.99+

KirkPERSON

0.99+

50 dayQUANTITY

0.99+

GoogleORGANIZATION

0.99+

10 dayQUANTITY

0.99+

DatabricksORGANIZATION

0.99+

two sidesQUANTITY

0.99+

last yearDATE

0.99+

Collibra Data QualityORGANIZATION

0.99+

next weekDATE

0.99+

Data CitizensORGANIZATION

0.99+

tomorrowDATE

0.98+

two other thingsQUANTITY

0.98+

BigQueryTITLE

0.98+

five secondsQUANTITY

0.98+

one clickQUANTITY

0.97+

todayDATE

0.97+

CollibraTITLE

0.96+

Wall StreetLOCATION

0.96+

SQL PushdownTITLE

0.94+

Data Citizens 22ORGANIZATION

0.93+

COVIDORGANIZATION

0.93+

SnowflakeTITLE

0.91+

NasdaqORGANIZATION

0.9+

Data Citizens 22ORGANIZATION

0.89+

Delta LakeTITLE

0.89+

EgressORGANIZATION

0.89+

MITEVENT

0.89+

more than a coupleQUANTITY

0.87+

a decade agoDATE

0.85+

zeroQUANTITY

0.84+

CitizensORGANIZATION

0.83+

Data Citizens 2022 CollibraEVENT

0.83+

yearsDATE

0.81+

thousands of dataQUANTITY

0.8+

Data Citizens 22TITLE

0.78+

two domain expertsQUANTITY

0.77+

SnowflakeORGANIZATION

0.76+

IQEVENT

0.76+

coupleQUANTITY

0.75+

CollibraPERSON

0.75+

theCUBEORGANIZATION

0.71+

many numbersQUANTITY

0.7+

Vice PresidentPERSON

0.68+

LineageORGANIZATION

0.66+

DatabricksTITLE

0.64+

too long agoDATE

0.62+

threeQUANTITY

0.6+

DataORGANIZATION

0.57+

CDOEVENT

0.53+

minuteQUANTITY

0.53+

CDOTITLE

0.53+

numberQUANTITY

0.51+

AMIORGANIZATION

0.44+

QualityPERSON

0.43+

Collibra Data Citizens 22


 

>>Collibra is a company that was founded in 2008 right before the so-called modern big data era kicked into high gear. The company was one of the first to focus its business on data governance. Now, historically, data governance and data quality initiatives, they were back office functions and they were largely confined to regulatory regulated industries that had to comply with public policy mandates. But as the cloud went mainstream, the tech giants showed us how valuable data could become and the value proposition for data quality and trust. It evolved from primarily a compliance driven issue to becoming a lynchpin of competitive advantage. But data in the decade of the 2010s was largely about getting the technology to work. You had these highly centralized technical teams that were formed and they had hyper specialized skills to develop data architectures and processes to serve the myriad data needs of organizations. >>And it resulted in a lot of frustration with data initiatives for most organizations that didn't have the resources of the cloud guys and the social media giants to really attack their data problems and turn data into gold. This is why today for example, this quite a bit of momentum to rethinking monolithic data architectures. You see, you hear about initiatives like data mesh and the idea of data as a product. They're gaining traction as a way to better serve the the data needs of decentralized business Uni users, you hear a lot about data democratization. So these decentralization efforts around data, they're great, but they create a new set of problems. Specifically, how do you deliver like a self-service infrastructure to business users and domain experts? Now the cloud is definitely helping with that, but also how do you automate governance? This becomes especially tricky as protecting data privacy has become more and more important. >>In other words, while it's enticing to experiment and run fast and loose with data initiatives kinda like the Wild West, to find new veins of gold, it has to be done responsibly. As such, the idea of data governance has had to evolve to become more automated. And intelligence governance and data lineage is still fundamental to ensuring trust as data. It moves like water through an organization. No one is gonna use data that isn't trusted. Metadata has become increasingly important for data discovery and data classification. As data flows through an organization, the continuously ability to check for data flaws and automating that data quality, they become a functional requirement of any modern data management platform. And finally, data privacy has become a critical adjacency to cyber security. So you can see how data governance has evolved into a much richer set of capabilities than it was 10 or 15 years ago. >>Hello and welcome to the Cube's coverage of Data Citizens made possible by Calibra, a leader in so-called Data intelligence and the host of Data Citizens 2022, which is taking place in San Diego. My name is Dave Ante and I'm one of the hosts of our program, which is running in parallel to data citizens. Now at the Cube we like to say we extract the signal from the noise, and over the, the next couple of days, we're gonna feature some of the themes from the keynote speakers at Data Citizens and we'll hear from several of the executives. Felix Von Dala, who is the co-founder and CEO of Collibra, will join us along with one of the other founders of Collibra, Stan Christians, who's gonna join my colleague Lisa Martin. I'm gonna also sit down with Laura Sellers, she's the Chief Product Officer at Collibra. We'll talk about some of the, the announcements and innovations they're making at the event, and then we'll dig in further to data quality with Kirk Hasselbeck. >>He's the vice president of Data quality at Collibra. He's an amazingly smart dude who founded Owl dq, a company that he sold to Col to Collibra last year. Now many companies, they didn't make it through the Hado era, you know, they missed the industry waves and they became Driftwood. Collibra, on the other hand, has evolved its business. They've leveraged the cloud, expanded its product portfolio, and leaned in heavily to some major partnerships with cloud providers, as well as receiving a strategic investment from Snowflake earlier this year. So it's a really interesting story that we're thrilled to be sharing with you. Thanks for watching and I hope you enjoy the program. >>Last year, the Cube Covered Data Citizens Collibra's customer event. And the premise that we put forth prior to that event was that despite all the innovation that's gone on over the last decade or more with data, you know, starting with the Hado movement, we had data lakes, we'd spark the ascendancy of programming languages like Python, the introduction of frameworks like TensorFlow, the rise of ai, low code, no code, et cetera. Businesses still find it's too difficult to get more value from their data initiatives. And we said at the time, you know, maybe it's time to rethink data innovation. While a lot of the effort has been focused on, you know, more efficiently storing and processing data, perhaps more energy needs to go into thinking about the people and the process side of the equation, meaning making it easier for domain experts to both gain insights for data, trust the data, and begin to use that data in new ways, fueling data, products, monetization and insights data citizens 2022 is back and we're pleased to have Felix Van Dema, who is the founder and CEO of Collibra. He's on the cube or excited to have you, Felix. Good to see you again. >>Likewise Dave. Thanks for having me again. >>You bet. All right, we're gonna get the update from Felix on the current data landscape, how he sees it, why data intelligence is more important now than ever and get current on what Collibra has been up to over the past year and what's changed since Data Citizens 2021. And we may even touch on some of the product news. So Felix, we're living in a very different world today with businesses and consumers. They're struggling with things like supply chains, uncertain economic trends, and we're not just snapping back to the 2010s. That's clear, and that's really true as well in the world of data. So what's different in your mind, in the data landscape of the 2020s from the previous decade, and what challenges does that bring for your customers? >>Yeah, absolutely. And, and I think you said it well, Dave, and and the intro that that rising complexity and fragmentation in the broader data landscape, that hasn't gotten any better over the last couple of years. When when we talk to our customers, that level of fragmentation, the complexity, how do we find data that we can trust, that we know we can use has only gotten kinda more, more difficult. So that trend that's continuing, I think what is changing is that trend has become much more acute. Well, the other thing we've seen over the last couple of years is that the level of scrutiny that organizations are under respect to data, as data becomes more mission critical, as data becomes more impactful than important, the level of scrutiny with respect to privacy, security, regulatory compliance, as only increasing as well, which again, is really difficult in this environment of continuous innovation, continuous change, continuous growing complexity and fragmentation. >>So it's become much more acute. And, and to your earlier point, we do live in a different world and and the the past couple of years we could probably just kind of brute for it, right? We could focus on, on the top line. There was enough kind of investments to be, to be had. I think nowadays organizations are focused or are, are, are, are, are, are in a very different environment where there's much more focus on cost control, productivity, efficiency, How do we truly get value from that data? So again, I think it just another incentive for organization to now truly look at data and to scale it data, not just from a a technology and infrastructure perspective, but how do you actually scale data from an organizational perspective, right? You said at the the people and process, how do we do that at scale? And that's only, only only becoming much more important. And we do believe that the, the economic environment that we find ourselves in today is gonna be catalyst for organizations to really dig out more seriously if, if, if, if you will, than they maybe have in the have in the best. >>You know, I don't know when you guys founded Collibra, if, if you had a sense as to how complicated it was gonna get, but you've been on a mission to really address these problems from the beginning. How would you describe your, your, your mission and what are you doing to address these challenges? >>Yeah, absolutely. We, we started Colli in 2008. So in some sense and the, the last kind of financial crisis, and that was really the, the start of Colli where we found product market fit, working with large finance institutions to help them cope with the increasing compliance requirements that they were faced with because of the, of the financial crisis and kind of here we are again in a very different environment, of course 15 years, almost 15 years later. But data only becoming more important. But our mission to deliver trusted data for every user, every use case and across every source, frankly, has only become more important. So what has been an incredible journey over the last 14, 15 years, I think we're still relatively early in our mission to again, be able to provide everyone, and that's why we call it data citizens. We truly believe that everyone in the organization should be able to use trusted data in an easy, easy matter. That mission is is only becoming more important, more relevant. We definitely have a lot more work ahead of us because we are still relatively early in that, in that journey. >>Well, that's interesting because, you know, in my observation it takes seven to 10 years to actually build a company and then the fact that you're still in the early days is kind of interesting. I mean, you, Collibra's had a good 12 months or so since we last spoke at Data Citizens. Give us the latest update on your business. What do people need to know about your, your current momentum? >>Yeah, absolutely. Again, there's, there's a lot of tail organizations that are only maturing the data practices and we've seen it kind of transform or, or, or influence a lot of our business growth that we've seen, broader adoption of the platform. We work at some of the largest organizations in the world where it's Adobe, Heineken, Bank of America, and many more. We have now over 600 enterprise customers, all industry leaders and every single vertical. So it's, it's really exciting to see that and continue to partner with those organizations. On the partnership side, again, a lot of momentum in the org in, in the, in the markets with some of the cloud partners like Google, Amazon, Snowflake, data bricks and, and others, right? As those kind of new modern data infrastructures, modern data architectures that are definitely all moving to the cloud, a great opportunity for us, our partners and of course our customers to help them kind of transition to the cloud even faster. >>And so we see a lot of excitement and momentum there within an acquisition about 18 months ago around data quality, data observability, which we believe is an enormous opportunity. Of course, data quality isn't new, but I think there's a lot of reasons why we're so excited about quality and observability now. One is around leveraging ai, machine learning, again to drive more automation. And the second is that those data pipelines that are now being created in the cloud, in these modern data architecture arch architectures, they've become mission critical. They've become real time. And so monitoring, observing those data pipelines continuously has become absolutely critical so that they're really excited about about that as well. And on the organizational side, I'm sure you've heard a term around kind of data mesh, something that's gaining a lot of momentum, rightfully so. It's really the type of governance that we always believe. Then federated focused on domains, giving a lot of ownership to different teams. I think that's the way to scale data organizations. And so that aligns really well with our vision and, and from a product perspective, we've seen a lot of momentum with our customers there as well. >>Yeah, you know, a couple things there. I mean, the acquisition of i l dq, you know, Kirk Hasselbeck and, and their team, it's interesting, you know, the whole data quality used to be this back office function and, and really confined to highly regulated industries. It's come to the front office, it's top of mind for chief data officers, data mesh. You mentioned you guys are a connective tissue for all these different nodes on the data mesh. That's key. And of course we see you at all the shows. You're, you're a critical part of many ecosystems and you're developing your own ecosystem. So let's chat a little bit about the, the products. We're gonna go deeper in into products later on at, at Data Citizens 22, but we know you're debuting some, some new innovations, you know, whether it's, you know, the, the the under the covers in security, sort of making data more accessible for people just dealing with workflows and processes as you talked about earlier. Tell us a little bit about what you're introducing. >>Yeah, absolutely. We're super excited, a ton of innovation. And if we think about the big theme and like, like I said, we're still relatively early in this, in this journey towards kind of that mission of data intelligence that really bolts and compelling mission, either customers are still start, are just starting on that, on that journey. We wanna make it as easy as possible for the, for our organization to actually get started because we know that's important that they do. And for our organization and customers that have been with us for some time, there's still a tremendous amount of opportunity to kind of expand the platform further. And again, to make it easier for really to, to accomplish that mission and vision around that data citizen that everyone has access to trustworthy data in a very easy, easy way. So that's really the theme of a lot of the innovation that we're driving. >>A lot of kind of ease of adoption, ease of use, but also then how do we make sure that lio becomes this kind of mission critical enterprise platform from a security performance architecture scale supportability that we're truly able to deliver that kind of an enterprise mission critical platform. And so that's the big theme from an innovation perspective, From a product perspective, a lot of new innovation that we're really excited about. A couple of highlights. One is around data marketplace. Again, a lot of our customers have plans in that direction, how to make it easy. How do we make, how do we make available to true kind of shopping experience that anybody in your organization can, in a very easy search first way, find the right data product, find the right dataset, that data can then consume usage analytics. How do you, how do we help organizations drive adoption, tell them where they're working really well and where they have opportunities homepages again to, to make things easy for, for people, for anyone in your organization to kind of get started with ppia, you mentioned workflow designer, again, we have a very powerful enterprise platform. >>One of our key differentiators is the ability to really drive a lot of automation through workflows. And now we provided a new low code, no code kind of workflow designer experience. So, so really customers can take it to the next level. There's a lot more new product around K Bear Protect, which in partnership with Snowflake, which has been a strategic investor in kib, focused on how do we make access governance easier? How do we, how do we, how are we able to make sure that as you move to the cloud, things like access management, masking around sensitive data, PII data is managed as much more effective, effective rate, really excited about that product. There's more around data quality. Again, how do we, how do we get that deployed as easily and quickly and widely as we can? Moving that to the cloud has been a big part of our strategy. >>So we launch more data quality cloud product as well as making use of those, those native compute capabilities in platforms like Snowflake, Data, Bricks, Google, Amazon, and others. And so we are bettering a capability, a capability that we call push down. So actually pushing down the computer and data quality, the monitoring into the underlying platform, which again, from a scale performance and ease of use perspective is gonna make a massive difference. And then more broadly, we, we talked a little bit about the ecosystem. Again, integrations, we talk about being able to connect to every source. Integrations are absolutely critical and we're really excited to deliver new integrations with Snowflake, Azure and Google Cloud storage as well. So there's a lot coming out. The, the team has been work at work really hard and we are really, really excited about what we are coming, what we're bringing to markets. >>Yeah, a lot going on there. I wonder if you could give us your, your closing thoughts. I mean, you, you talked about, you know, the marketplace, you know, you think about data mesh, you think of data as product, one of the key principles you think about monetization. This is really different than what we've been used to in data, which is just getting the technology to work has been been so hard. So how do you see sort of the future and, you know, give us the, your closing thoughts please? >>Yeah, absolutely. And I, and I think we we're really at this pivotal moment, and I think you said it well. We, we all know the constraint and the challenges with data, how to actually do data at scale. And while we've seen a ton of innovation on the infrastructure side, we fundamentally believe that just getting a faster database is important, but it's not gonna fully solve the challenges and truly kind of deliver on the opportunity. And that's why now is really the time to deliver this data intelligence vision, this data intelligence platform. We are still early, making it as easy as we can. It's kind of, of our, it's our mission. And so I'm really, really excited to see what we, what we are gonna, how the marks gonna evolve over the next, next few quarters and years. I think the trend is clearly there when we talk about data mesh, this kind of federated approach folks on data products is just another signal that we believe that a lot of our organization are now at the time. >>The understanding need to go beyond just the technology. I really, really think about how do we actually scale data as a business function, just like we've done with it, with, with hr, with, with sales and marketing, with finance. That's how we need to think about data. I think now is the time given the economic environment that we are in much more focus on control, much more focused on productivity efficiency and now's the time. We need to look beyond just the technology and infrastructure to think of how to scale data, how to manage data at scale. >>Yeah, it's a new era. The next 10 years of data won't be like the last, as I always say. Felix, thanks so much and good luck in, in San Diego. I know you're gonna crush it out there. >>Thank you Dave. >>Yeah, it's a great spot for an in-person event and, and of course the content post event is gonna be available@collibra.com and you can of course catch the cube coverage@thecube.net and all the news@siliconangle.com. This is Dave Valante for the cube, your leader in enterprise and emerging tech coverage. >>Hi, I'm Jay from Collibra's Data Office. Today I want to talk to you about Collibra's data intelligence cloud. We often say Collibra is a single system of engagement for all of your data. Now, when I say data, I mean data in the broadest sense of the word, including reference and metadata. Think of metrics, reports, APIs, systems, policies, and even business processes that produce or consume data. Now, the beauty of this platform is that it ensures all of your users have an easy way to find, understand, trust, and access data. But how do you get started? Well, here are seven steps to help you get going. One, start with the data. What's data intelligence? Without data leverage the Collibra data catalog to automatically profile and classify your enterprise data wherever that data lives, databases, data lakes or data warehouses, whether on the cloud or on premise. >>Two, you'll then wanna organize the data and you'll do that with data communities. This can be by department, find a business or functional team, however your organization organizes work and accountability. And for that you'll establish community owners, communities, make it easy for people to navigate through the platform, find the data and will help create a sense of belonging for users. An important and related side note here, we find it's typical in many organizations that data is thought of is just an asset and IT and data offices are viewed as the owners of it and who are really the central teams performing analytics as a service provider to the enterprise. We believe data is more than an asset, it's a true product that can be converted to value. And that also means establishing business ownership of data where that strategy and ROI come together with subject matter expertise. >>Okay, three. Next, back to those communities there, the data owners should explain and define their data, not just the tables and columns, but also the related business terms, metrics and KPIs. These objects we call these assets are typically organized into business glossaries and data dictionaries. I definitely recommend starting with the topics that are most important to the business. Four, those steps that enable you and your users to have some fun with it. Linking everything together builds your knowledge graph and also known as a metadata graph by linking or relating these assets together. For example, a data set to a KPI to a report now enables your users to see what we call the lineage diagram that visualizes where the data in your dashboards actually came from and what the data means and who's responsible for it. Speaking of which, here's five. Leverage the calibra trusted business reporting solution on the marketplace, which comes with workflows for those owners to certify their reports, KPIs, and data sets. >>This helps them force their trust in their data. Six, easy to navigate dashboards or landing pages right in your platform for your company's business processes are the most effective way for everyone to better understand and take action on data. Here's a pro tip, use the dashboard design kit on the marketplace to help you build compelling dashboards. Finally, seven, promote the value of this to your users and be sure to schedule enablement office hours and new employee onboarding sessions to get folks excited about what you've built and implemented. Better yet, invite all of those community and data owners to these sessions so that they can show off the value that they've created. Those are my seven tips to get going with Collibra. I hope these have been useful. For more information, be sure to visit collibra.com. >>Welcome to the Cube's coverage of Data Citizens 2022 Collibra's customer event. My name is Dave Valante. With us is Kirk Hasselbeck, who's the vice president of Data Quality of Collibra Kirk, good to see you. Welcome. >>Thanks for having me, Dave. Excited to be here. >>You bet. Okay, we're gonna discuss data quality observability. It's a hot trend right now. You founded a data quality company, OWL dq, and it was acquired by Collibra last year. Congratulations. And now you lead data quality at Collibra. So we're hearing a lot about data quality right now. Why is it such a priority? Take us through your thoughts on that. >>Yeah, absolutely. It's, it's definitely exciting times for data quality, which you're right, has been around for a long time. So why now and why is it so much more exciting than it used to be? I think it's a bit stale, but we all know that companies use more data than ever before and the variety has changed and the volume has grown. And, and while I think that remains true, there are a couple other hidden factors at play that everyone's so interested in as, as to why this is becoming so important now. And, and I guess you could kind of break this down simply and think about if Dave, you and I were gonna build, you know, a new healthcare application and monitor the heartbeat of individuals, imagine if we get that wrong, you know, what the ramifications could be, what, what those incidents would look like, or maybe better yet, we try to build a, a new trading algorithm with a crossover strategy where the 50 day crosses the, the 10 day average. >>And imagine if the data underlying the inputs to that is incorrect. We will probably have major financial ramifications in that sense. So, you know, it kind of starts there where everybody's realizing that we're all data companies and if we are using bad data, we're likely making incorrect business decisions. But I think there's kind of two other things at play. You know, I, I bought a car not too long ago and my dad called and said, How many cylinders does it have? And I realized in that moment, you know, I might have failed him because, cause I didn't know. And, and I used to ask those types of questions about any lock brakes and cylinders and, and you know, if it's manual or, or automatic and, and I realized I now just buy a car that I hope works. And it's so complicated with all the computer chips, I, I really don't know that much about it. >>And, and that's what's happening with data. We're just loading so much of it. And it's so complex that the way companies consume them in the IT function is that they bring in a lot of data and then they syndicate it out to the business. And it turns out that the, the individuals loading and consuming all of this data for the company actually may not know that much about the data itself, and that's not even their job anymore. So we'll talk more about that in a minute, but that's really what's setting the foreground for this observability play and why everybody's so interested. It, it's because we're becoming less close to the intricacies of the data and we just expect it to always be there and be correct. >>You know, the other thing too about data quality, and for years we did the MIT CDO IQ event, we didn't do it last year, Covid messed everything up. But the observation I would make there thoughts is, is it data quality? Used to be information quality used to be this back office function, and then it became sort of front office with financial services and government and healthcare, these highly regulated industries. And then the whole chief data officer thing happened and people were realizing, well, they sort of flipped the bit from sort of a data as a, a risk to data as a, as an asset. And now as we say, we're gonna talk about observability. And so it's really become front and center just the whole quality issue because data's so fundamental, hasn't it? >>Yeah, absolutely. I mean, let's imagine we pull up our phones right now and I go to my, my favorite stock ticker app and I check out the NASDAQ market cap. I really have no idea if that's the correct number. I know it's a number, it looks large, it's in a numeric field. And, and that's kind of what's going on. There's, there's so many numbers and they're coming from all of these different sources and data providers and they're getting consumed and passed along. But there isn't really a way to tactically put controls on every number and metric across every field we plan to monitor, but with the scale that we've achieved in early days, even before calibra. And what's been so exciting is we have these types of observation techniques, these data monitors that can actually track past performance of every field at scale. And why that's so interesting and why I think the CDO is, is listening right intently nowadays to this topic is, so maybe we could surface all of these problems with the right solution of data observability and with the right scale and then just be alerted on breaking trends. So we're sort of shifting away from this world of must write a condition and then when that condition breaks, that was always known as a break record. But what about breaking trends and root cause analysis? And is it possible to do that, you know, with less human intervention? And so I think most people are seeing now that it's going to have to be a software tool and a computer system. It's, it's not ever going to be based on one or two domain experts anymore. >>So, So how does data observability relate to data quality? Are they sort of two sides of the same coin? Are they, are they cousins? What's your perspective on that? >>Yeah, it's, it's super interesting. It's an emerging market. So the language is changing a lot of the topic and areas changing the way that I like to say it or break it down because the, the lingo is constantly moving is, you know, as a target on this space is really breaking records versus breaking trends. And I could write a condition when this thing happens, it's wrong and when it doesn't it's correct. Or I could look for a trend and I'll give you a good example. You know, everybody's talking about fresh data and stale data and, and why would that matter? Well, if your data never arrived or only part of it arrived or didn't arrive on time, it's likely stale and there will not be a condition that you could write that would show you all the good in the bads. That was kind of your, your traditional approach of data quality break records. But your modern day approach is you lost a significant portion of your data, or it did not arrive on time to make that decision accurately on time. And that's a hidden concern. Some people call this freshness, we call it stale data, but it all points to the same idea of the thing that you're observing may not be a data quality condition anymore. It may be a breakdown in the data pipeline. And with thousands of data pipelines in play for every company out there there, there's more than a couple of these happening every day. >>So what's the Collibra angle on all this stuff made the acquisition, you got data quality observability coming together, you guys have a lot of expertise in, in this area, but you hear providence of data, you just talked about, you know, stale data, you know, the, the whole trend toward real time. How is Calibra approaching the problem and what's unique about your approach? >>Well, I think where we're fortunate is with our background, myself and team, we sort of lived this problem for a long time, you know, in, in the Wall Street days about a decade ago. And we saw it from many different angles. And what we came up with before it was called data observability or reliability was basically the, the underpinnings of that. So we're a little bit ahead of the curve there when most people evaluate our solution, it's more advanced than some of the observation techniques that that currently exist. But we've also always covered data quality and we believe that people want to know more, they need more insights, and they want to see break records and breaking trends together so they can correlate the root cause. And we hear that all the time. I have so many things going wrong, just show me the big picture, help me find the thing that if I were to fix it today would make the most impact. So we're really focused on root cause analysis, business impact, connecting it with lineage and catalog metadata. And as that grows, you can actually achieve total data governance at this point with the acquisition of what was a Lineage company years ago, and then my company Ldq now Collibra, Data quality Collibra may be the best positioned for total data governance and intelligence in the space. >>Well, you mentioned financial services a couple of times and some examples, remember the flash crash in 2010. Nobody had any idea what that was, you know, they just said, Oh, it's a glitch, you know, so they didn't understand the root cause of it. So this is a really interesting topic to me. So we know at Data Citizens 22 that you're announcing, you gotta announce new products, right? You're yearly event what's, what's new. Give us a sense as to what products are coming out, but specifically around data quality and observability. >>Absolutely. There's this, you know, there's always a next thing on the forefront. And the one right now is these hyperscalers in the cloud. So you have databases like Snowflake and Big Query and Data Bricks is Delta Lake and SQL Pushdown. And ultimately what that means is a lot of people are storing in loading data even faster in a SaaS like model. And we've started to hook in to these databases. And while we've always worked with the the same databases in the past, they're supported today we're doing something called Native Database pushdown, where the entire compute and data activity happens in the database. And why that is so interesting and powerful now is everyone's concerned with something called Egress. Did your, my data that I've spent all this time and money with my security team securing ever leave my hands, did it ever leave my secure VPC as they call it? >>And with these native integrations that we're building and about to unveil, here's kind of a sneak peek for, for next week at Data Citizens. We're now doing all compute and data operations in databases like Snowflake. And what that means is with no install and no configuration, you could log into the Collibra data quality app and have all of your data quality running inside the database that you've probably already picked as your your go forward team selection secured database of choice. So we're really excited about that. And I think if you look at the whole landscape of network cost, egress, cost, data storage and compute, what people are realizing is it's extremely efficient to do it in the way that we're about to release here next week. >>So this is interesting because what you just described, you know, you mentioned Snowflake, you mentioned Google, Oh actually you mentioned yeah, data bricks. You know, Snowflake has the data cloud. If you put everything in the data cloud, okay, you're cool, but then Google's got the open data cloud. If you heard, you know, Google next and now data bricks doesn't call it the data cloud, but they have like the open source data cloud. So you have all these different approaches and there's really no way up until now I'm, I'm hearing to, to really understand the relationships between all those and have confidence across, you know, it's like Jak Dani, you should just be a note on the mesh. And I don't care if it's a data warehouse or a data lake or where it comes from, but it's a point on that mesh and I need tooling to be able to have confidence that my data is governed and has the proper lineage, providence. And, and, and that's what you're bringing to the table, Is that right? Did I get that right? >>Yeah, that's right. And it's, for us, it's, it's not that we haven't been working with those great cloud databases, but it's the fact that we can send them the instructions now, we can send them the, the operating ability to crunch all of the calculations, the governance, the quality, and get the answers. And what that's doing, it's basically zero network costs, zero egress cost, zero latency of time. And so when you were to log into Big Query tomorrow using our tool or like, or say Snowflake for example, you have instant data quality metrics, instant profiling, instant lineage and access privacy controls, things of that nature that just become less onerous. What we're seeing is there's so much technology out there, just like all of the major brands that you mentioned, but how do we make it easier? The future is about less clicks, faster time to value, faster scale, and eventually lower cost. And, and we think that this positions us to be the leader there. >>I love this example because, you know, Barry talks about, wow, the cloud guys are gonna own the world and, and of course now we're seeing that the ecosystem is finding so much white space to add value, connect across cloud. Sometimes we call it super cloud and so, or inter clouding. All right, Kirk, give us your, your final thoughts and on on the trends that we've talked about and Data Citizens 22. >>Absolutely. Well, I think, you know, one big trend is discovery and classification. Seeing that across the board, people used to know it was a zip code and nowadays with the amount of data that's out there, they wanna know where everything is, where their sensitive data is. If it's redundant, tell me everything inside of three to five seconds. And with that comes, they want to know in all of these hyperscale databases how fast they can get controls and insights out of their tools. So I think we're gonna see more one click solutions, more SAS based solutions and solutions that hopefully prove faster time to value on, on all of these modern cloud platforms. >>Excellent. All right, Kurt Hasselbeck, thanks so much for coming on the Cube and previewing Data Citizens 22. Appreciate it. >>Thanks for having me, Dave. >>You're welcome. Right, and thank you for watching. Keep it right there for more coverage from the Cube. Welcome to the Cube's virtual Coverage of Data Citizens 2022. My name is Dave Valante and I'm here with Laura Sellers, who's the Chief Product Officer at Collibra, the host of Data Citizens. Laura, welcome. Good to see you. >>Thank you. Nice to be here. >>Yeah, your keynote at Data Citizens this year focused on, you know, your mission to drive ease of use and scale. Now when I think about historically fast access to the right data at the right time in a form that's really easily consumable, it's been kind of challenging, especially for business users. Can can you explain to our audience why this matters so much and what's actually different today in the data ecosystem to make this a reality? >>Yeah, definitely. So I think what we really need and what I hear from customers every single day is that we need a new approach to data management and our product teams. What inspired me to come to Calibra a little bit a over a year ago was really the fact that they're very focused on bringing trusted data to more users across more sources for more use cases. And so as we look at what we're announcing with these innovations of ease of use and scale, it's really about making teams more productive in getting started with and the ability to manage data across the entire organization. So we've been very focused on richer experiences, a broader ecosystem of partners, as well as a platform that delivers performance, scale and security that our users and teams need and demand. So as we look at, Oh, go ahead. >>I was gonna say, you know, when I look back at like the last 10 years, it was all about getting the technology to work and it was just so complicated. But, but please carry on. I'd love to hear more about this. >>Yeah, I, I really, you know, Collibra is a system of engagement for data and we really are working on bringing that entire system of engagement to life for everyone to leverage here and now. So what we're announcing from our ease of use side of the world is first our data marketplace. This is the ability for all users to discover and access data quickly and easily shop for it, if you will. The next thing that we're also introducing is the new homepage. It's really about the ability to drive adoption and have users find data more quickly. And then the two more areas of the ease of use side of the world is our world of usage analytics. And one of the big pushes and passions we have at Collibra is to help with this data driven culture that all companies are trying to create. And also helping with data literacy, with something like usage analytics, it's really about driving adoption of the CLE platform, understanding what's working, who's accessing it, what's not. And then finally we're also introducing what's called workflow designer. And we love our workflows at Libra, it's a big differentiator to be able to automate business processes. The designer is really about a way for more people to be able to create those workflows, collaborate on those workflow flows, as well as people to be able to easily interact with them. So a lot of exciting things when it comes to ease of use to make it easier for all users to find data. >>Y yes, there's definitely a lot to unpack there. I I, you know, you mentioned this idea of, of of, of shopping for the data. That's interesting to me. Why this analogy, metaphor or analogy, I always get those confused. I let's go with analogy. Why is it so important to data consumers? >>I think when you look at the world of data, and I talked about this system of engagement, it's really about making it more accessible to the masses. And what users are used to is a shopping experience like your Amazon, if you will. And so having a consumer grade experience where users can quickly go in and find the data, trust that data, understand where the data's coming from, and then be able to quickly access it, is the idea of being able to shop for it, just making it as simple as possible and really speeding the time to value for any of the business analysts, data analysts out there. >>Yeah, I think when you, you, you see a lot of discussion about rethinking data architectures, putting data in the hands of the users and business people, decentralized data and of course that's awesome. I love that. But of course then you have to have self-service infrastructure and you have to have governance. And those are really challenging. And I think so many organizations, they're facing adoption challenges, you know, when it comes to enabling teams generally, especially domain experts to adopt new data technologies, you know, like the, the tech comes fast and furious. You got all these open source projects and get really confusing. Of course it risks security, governance and all that good stuff. You got all this jargon. So where do you see, you know, the friction in adopting new data technologies? What's your point of view and how can organizations overcome these challenges? >>You're, you're dead on. There's so much technology and there's so much to stay on top of, which is part of the friction, right? It's just being able to stay ahead of, of and understand all the technologies that are coming. You also look at as there's so many more sources of data and people are migrating data to the cloud and they're migrating to new sources. Where the friction comes is really that ability to understand where the data came from, where it's moving to, and then also to be able to put the access controls on top of it. So people are only getting access to the data that they should be getting access to. So one of the other things we're announcing with, with all of the innovations that are coming is what we're doing around performance and scale. So with all of the data movement, with all of the data that's out there, the first thing we're launching in the world of performance and scale is our world of data quality. >>It's something that Collibra has been working on for the past year and a half, but we're launching the ability to have data quality in the cloud. So it's currently an on-premise offering, but we'll now be able to carry that over into the cloud for us to manage that way. We're also introducing the ability to push down data quality into Snowflake. So this is, again, one of those challenges is making sure that that data that you have is d is is high quality as you move forward. And so really another, we're just reducing friction. You already have Snowflake stood up. It's not another machine for you to manage, it's just push down capabilities into Snowflake to be able to track that quality. Another thing that we're launching with that is what we call Collibra Protect. And this is that ability for users to be able to ingest metadata, understand where the PII data is, and then set policies up on top of it. So very quickly be able to set policies and have them enforced at the data level. So anybody in the organization is only getting access to the data they should have access to. >>Here's Topica data quality is interesting. It's something that I've followed for a number of years. It used to be a back office function, you know, and really confined only to highly regulated industries like financial services and healthcare and government. You know, you look back over a decade ago, you didn't have this worry about personal information, g gdpr, and, you know, California Consumer Privacy Act all becomes, becomes so much important. The cloud is really changed things in terms of performance and scale and of course partnering for, for, with Snowflake it's all about sharing data and monetization, anything but a back office function. So it was kind of smart that you guys were early on and of course attracting them and as a, as an investor as well was very strong validation. What can you tell us about the nature of the relationship with Snowflake and specifically inter interested in sort of joint engineering or, and product innovation efforts, you know, beyond the standard go to market stuff? >>Definitely. So you mentioned there were a strategic investor in Calibra about a year ago. A little less than that I guess. We've been working with them though for over a year really tightly with their product and engineering teams to make sure that Collibra is adding real value. Our unified platform is touching pieces of our unified platform or touching all pieces of Snowflake. And when I say that, what I mean is we're first, you know, able to ingest data with Snowflake, which, which has always existed. We're able to profile and classify that data we're announcing with Calibra Protect this week that you're now able to create those policies on top of Snowflake and have them enforce. So again, people can get more value out of their snowflake more quickly as far as time to value with, with our policies for all business users to be able to create. >>We're also announcing Snowflake Lineage 2.0. So this is the ability to take stored procedures in Snowflake and understand the lineage of where did the data come from, how was it transformed with within Snowflake as well as the data quality. Pushdown, as I mentioned, data quality, you brought it up. It is a new, it is a, a big industry push and you know, one of the things I think Gartner mentioned is people are losing up to $15 million without having great data quality. So this push down capability for Snowflake really is again, a big ease of use push for us at Collibra of that ability to, to push it into snowflake, take advantage of the data, the data source, and the engine that already lives there and get the right and make sure you have the right quality. >>I mean, the nice thing about Snowflake, if you play in the Snowflake sandbox, you, you, you, you can get sort of a, you know, high degree of confidence that the data sharing can be done in a safe way. Bringing, you know, Collibra into the, into the story allows me to have that data quality and, and that governance that I, that I need. You know, we've said many times on the cube that one of the notable differences in cloud this decade versus last decade, I mean ob there are obvious differences just in terms of scale and scope, but it's shaping up to be about the strength of the ecosystems. That's really a hallmark of these big cloud players. I mean they're, it's a key factor for innovating, accelerating product delivery, filling gaps in, in the hyperscale offerings cuz you got more stack, you know, mature stack capabilities and you know, it creates this flywheel momentum as we often say. But, so my question is, how do you work with the hyperscalers? Like whether it's AWS or Google, whomever, and what do you see as your role and what's the Collibra sweet spot? >>Yeah, definitely. So, you know, one of the things I mentioned early on is the broader ecosystem of partners is what it's all about. And so we have that strong partnership with Snowflake. We also are doing more with Google around, you know, GCP and kbra protect there, but also tighter data plex integration. So similar to what you've seen with our strategic moves around Snowflake and, and really covering the broad ecosystem of what Collibra can do on top of that data source. We're extending that to the world of Google as well and the world of data plex. We also have great partners in SI's Infosys is somebody we spoke with at the conference who's done a lot of great work with Levi's as they're really important to help people with their whole data strategy and driving that data driven culture and, and Collibra being the core of it. >>Hi Laura, we're gonna, we're gonna end it there, but I wonder if you could kind of put a bow on, you know, this year, the event your, your perspectives. So just give us your closing thoughts. >>Yeah, definitely. So I, I wanna say this is one of the biggest releases Collibra's ever had. Definitely the biggest one since I've been with the company a little over a year. We have all these great new product innovations coming to really drive the ease of use to make data more valuable for users everywhere and, and companies everywhere. And so it's all about everybody being able to easily find, understand, and trust and get access to that data going forward. >>Well congratulations on all the pro progress. It was great to have you on the cube first time I believe, and really appreciate you, you taking the time with us. >>Yes, thank you for your time. >>You're very welcome. Okay, you're watching the coverage of Data Citizens 2022 on the cube, your leader in enterprise and emerging tech coverage. >>So data modernization oftentimes means moving some of your storage and computer to the cloud where you get the benefit of scale and security and so on. But ultimately it doesn't take away the silos that you have. We have more locations, more tools and more processes with which we try to get value from this data. To do that at scale in an organization, people involved in this process, they have to understand each other. So you need to unite those people across those tools, processes, and systems with a shared language. When I say customer, do you understand the same thing as you hearing customer? Are we counting them in the same way so that shared language unites us and that gives the opportunity for the organization as a whole to get the maximum value out of their data assets and then they can democratize data so everyone can properly use that shared language to find, understand, and trust the data asset that's available. >>And that's where Collibra comes in. We provide a centralized system of engagement that works across all of those locations and combines all of those different user types across the whole business. At Collibra, we say United by data and that also means that we're united by data with our customers. So here is some data about some of our customers. There was the case of an online do it yourself platform who grew their revenue almost three times from a marketing campaign that provided the right product in the right hands of the right people. In other case that comes to mind is from a financial services organization who saved over 800 K every year because they were able to reuse the same data in different kinds of reports and before there was spread out over different tools and processes and silos, and now the platform brought them together so they realized, oh, we're actually using the same data, let's find a way to make this more efficient. And the last example that comes to mind is that of a large home loan, home mortgage, mortgage loan provider where they have a very complex landscape, a very complex architecture legacy in the cloud, et cetera. And they're using our software, they're using our platform to unite all the people and those processes and tools to get a common view of data to manage their compliance at scale. >>Hey everyone, I'm Lisa Martin covering Data Citizens 22, brought to you by Collibra. This next conversation is gonna focus on the importance of data culture. One of our Cube alumni is back, Stan Christians is Collibra's co-founder and it's Chief Data citizens. Stan, it's great to have you back on the cube. >>Hey Lisa, nice to be. >>So we're gonna be talking about the importance of data culture, data intelligence, maturity, all those great things. When we think about the data revolution that every business is going through, you know, it's so much more than technology innovation. It also really re requires cultural transformation, community transformation. Those are challenging for customers to undertake. Talk to us about what you mean by data citizenship and the role that creating a data culture plays in that journey. >>Right. So as you know, our event is called Data Citizens because we believe that in the end, a data citizen is anyone who uses data to do their job. And we believe that today's organizations, you have a lot of people, most of the employees in an organization are somehow gonna to be a data citizen, right? So you need to make sure that these people are aware of it. You need that. People have skills and competencies to do with data what necessary and that's on, all right? So what does it mean to have a good data culture? It means that if you're building a beautiful dashboard to try and convince your boss, we need to make this decision that your boss is also open to and able to interpret, you know, the data presented in dashboard to actually make that decision and take that action. Right? >>And once you have that why to the organization, that's when you have a good data culture. Now that's continuous effort for most organizations because they're always moving, somehow they're hiring new people and it has to be continuous effort because we've seen that on the hand. Organizations continue challenged their data sources and where all the data is flowing, right? Which in itself creates a lot of risk. But also on the other set hand of the equation, you have the benefit. You know, you might look at regulatory drivers like, we have to do this, right? But it's, it's much better right now to consider the competitive drivers, for example, and we did an IDC study earlier this year, quite interesting. I can recommend anyone to it. And one of the conclusions they found as they surveyed over a thousand people across organizations worldwide is that the ones who are higher in maturity. >>So the, the organizations that really look at data as an asset, look at data as a product and actively try to be better at it, don't have three times as good a business outcome as the ones who are lower on the maturity scale, right? So you can say, ok, I'm doing this, you know, data culture for everyone, awakening them up as data citizens. I'm doing this for competitive reasons, I'm doing this re reasons you're trying to bring both of those together and the ones that get data intelligence right, are successful and competitive. That's, and that's what we're seeing out there in the market. >>Absolutely. We know that just generally stand right, the organizations that are, are really creating a, a data culture and enabling everybody within the organization to become data citizens are, We know that in theory they're more competitive, they're more successful. But the IDC study that you just mentioned demonstrates they're three times more successful and competitive than their peers. Talk about how Collibra advises customers to create that community, that culture of data when it might be challenging for an organization to adapt culturally. >>Of course, of course it's difficult for an organization to adapt but it's also necessary, as you just said, imagine that, you know, you're a modern day organization, laptops, what have you, you're not using those, right? Or you know, you're delivering them throughout organization, but not enabling your colleagues to actually do something with that asset. Same thing as through with data today, right? If you're not properly using the data asset and competitors are, they're gonna to get more advantage. So as to how you get this done, establish this. There's angles to look at, Lisa. So one angle is obviously the leadership whereby whoever is the boss of data in the organization, you typically have multiple bosses there, like achieve data officers. Sometimes there's, there's multiple, but they may have a different title, right? So I'm just gonna summarize it as a data leader for a second. >>So whoever that is, they need to make sure that there's a clear vision, a clear strategy for data. And that strategy needs to include the monetization aspect. How are you going to get value from data? Yes. Now that's one part because then you can leadership in the organization and also the business value. And that's important. Cause those people, their job in essence really is to make everyone in the organization think about data as an asset. And I think that's the second part of the equation of getting that right, is it's not enough to just have that leadership out there, but you also have to get the hearts and minds of the data champions across the organization. You, I really have to win them over. And if you have those two combined and obviously a good technology to, you know, connect those people and have them execute on their responsibilities such as a data intelligence platform like s then the in place to really start upgrading that culture inch by inch if you'll, >>Yes, I like that. The recipe for success. So you are the co-founder of Collibra. You've worn many different hats along this journey. Now you're building Collibra's own data office. I like how before we went live, we were talking about Calibra is drinking its own champagne. I always loved to hear stories about that. You're speaking at Data Citizens 2022. Talk to us about how you are building a data culture within Collibra and what maybe some of the specific projects are that Collibra's data office is working on. >>Yes, and it is indeed data citizens. There are a ton of speaks here, are very excited. You know, we have Barb from m MIT speaking about data monetization. We have Dilla at the last minute. So really exciting agen agenda. Can't wait to get back out there essentially. So over the years at, we've doing this since two and eight, so a good years and I think we have another decade of work ahead in the market, just to be very clear. Data is here to stick around as are we. And myself, you know, when you start a company, we were for people in a, if you, so everybody's wearing all sorts of hat at time. But over the years I've run, you know, presales that sales partnerships, product cetera. And as our company got a little bit biggish, we're now thousand two. Something like people in the company. >>I believe systems and processes become a lot important. So we said you CBRA isn't the size our customers we're getting there in of organization structure, process systems, et cetera. So we said it's really time for us to put our money where is and to our own data office, which is what we were seeing customers', organizations worldwide. And they organizations have HR units, they have a finance unit and over time they'll all have a department if you'll, that is responsible somehow for the data. So we said, ok, let's try to set an examples that other people can take away with it, right? Can take away from it. So we set up a data strategy, we started building data products, took care of the data infrastructure. That's sort of good stuff. And in doing all of that, ISA exactly as you said, we said, okay, we need to also use our product and our own practices and from that use, learn how we can make the product better, learn how we make, can make the practice better and share that learning with all the, and on, on the Monday mornings, we sometimes refer to eating our dog foods on Friday evenings. >>We referred to that drinking our own champagne. I like it. So we, we had a, we had the driver to do this. You know, there's a clear business reason. So we involved, we included that in the data strategy and that's a little bit of our origin. Now how, how do we organize this? We have three pillars, and by no means is this a template that everyone should, this is just the organization that works at our company, but it can serve as an inspiration. So we have a pillar, which is data science. The data product builders, if you'll or the people who help the business build data products. We have the data engineers who help keep the lights on for that data platform to make sure that the products, the data products can run, the data can flow and you know, the quality can be checked. >>And then we have a data intelligence or data governance builders where we have those data governance, data intelligence stakeholders who help the business as a sort of data partner to the business stakeholders. So that's how we've organized it. And then we started following the CBRA approach, which is, well, what are the challenges that our business stakeholders have in hr, finance, sales, marketing all over? And how can data help overcome those challenges? And from those use cases, we then just started to build a map and started execution use of the use case. And a important ones are very simple. We them with our, our customers as well, people talking about the cata, right? The catalog for the data scientists to know what's in their data lake, for example, and for the people in and privacy. So they have their process registry and they can see how the data flows. >>So that's a starting place and that turns into a marketplace so that if new analysts and data citizens join kbra, they immediately have a place to go to, to look at, see, ok, what data is out there for me as an analyst or a data scientist or whatever to do my job, right? So they can immediately get access data. And another one that we is around trusted business. We're seeing that since, you know, self-service BI allowed everyone to make beautiful dashboards, you know, pie, pie charts. I always, my pet pee is the pie chart because I love buy and you shouldn't always be using pie charts. But essentially there's become proliferation of those reports. And now executives don't really know, okay, should I trust this report or that report the reporting on the same thing. But the numbers seem different, right? So that's why we have trusted this reporting. So we know if a, the dashboard, a data product essentially is built, we not that all the right steps are being followed and that whoever is consuming that can be quite confident in the result either, Right. And that silver browser, right? Absolutely >>Decay. >>Exactly. Yes, >>Absolutely. Talk a little bit about some of the, the key performance indicators that you're using to measure the success of the data office. What are some of those KPIs? >>KPIs and measuring is a big topic in the, in the data chief data officer profession, I would say, and again, it always varies with to your organization, but there's a few that we use that might be of interest. Use those pillars, right? And we have metrics across those pillars. So for example, a pillar on the data engineering side is gonna be more related to that uptime, right? Are the, is the data platform up and running? Are the data products up and running? Is the quality in them good enough? Is it going up? Is it going down? What's the usage? But also, and especially if you're in the cloud and if consumption's a big thing, you have metrics around cost, for example, right? So that's one set of examples. Another one is around the data sciences and products. Are people using them? Are they getting value from it? >>Can we calculate that value in ay perspective, right? Yeah. So that we can to the rest of the business continue to say we're tracking all those numbers and those numbers indicate that value is generated and how much value estimated in that region. And then you have some data intelligence, data governance metrics, which is, for example, you have a number of domains in a data mesh. People talk about being the owner of a data domain, for example, like product or, or customer. So how many of those domains do you have covered? How many of them are already part of the program? How many of them have owners assigned? How well are these owners organized, executing on their responsibilities? How many tickets are open closed? How many data products are built according to process? And so and so forth. So these are an set of examples of, of KPIs. There's a, there's a lot more, but hopefully those can already inspire the audience. >>Absolutely. So we've, we've talked about the rise cheap data offices, it's only accelerating. You mentioned this is like a 10 year journey. So if you were to look into a crystal ball, what do you see in terms of the maturation of data offices over the next decade? >>So we, we've seen indeed the, the role sort of grow up, I think in, in thousand 10 there may have been like 10 achieve data officers or something. Gartner has exact numbers on them, but then they grew, you know, industries and the number is estimated to be about 20,000 right now. Wow. And they evolved in a sort of stack of competencies, defensive data strategy, because the first chief data officers were more regulatory driven, offensive data strategy support for the digital program. And now all about data products, right? So as a data leader, you now need all of those competences and need to include them in, in your strategy. >>How is that going to evolve for the next couple of years? I wish I had one of those balls, right? But essentially I think for the next couple of years there's gonna be a lot of people, you know, still moving along with those four levels of the stack. A lot of people I see are still in version one and version two of the chief data. So you'll see over the years that's gonna evolve more digital and more data products. So for next years, my, my prediction is it's all products because it's an immediate link between data and, and the essentially, right? Right. So that's gonna be important and quite likely a new, some new things will be added on, which nobody can predict yet. But we'll see those pop up in a few years. I think there's gonna be a continued challenge for the chief officer role to become a real executive role as opposed to, you know, somebody who claims that they're executive, but then they're not, right? >>So the real reporting level into the board, into the CEO for example, will continue to be a challenging point. But the ones who do get that done will be the ones that are successful and the ones who get that will the ones that do it on the basis of data monetization, right? Connecting value to the data and making that value clear to all the data citizens in the organization, right? And in that sense, they'll need to have both, you know, technical audiences and non-technical audiences aligned of course. And they'll need to focus on adoption. Again, it's not enough to just have your data office be involved in this. It's really important that you're waking up data citizens across the organization and you make everyone in the organization think about data as an asset. >>Absolutely. Because there's so much value that can be extracted. Organizations really strategically build that data office and democratize access across all those data citizens. Stan, this is an exciting arena. We're definitely gonna keep our eyes on this. Sounds like a lot of evolution and maturation coming from the data office perspective. From the data citizen perspective. And as the data show that you mentioned in that IDC study, you mentioned Gartner as well, organizations have so much more likelihood of being successful and being competitive. So we're gonna watch this space. Stan, thank you so much for joining me on the cube at Data Citizens 22. We appreciate it. >>Thanks for having me over >>From Data Citizens 22, I'm Lisa Martin, you're watching The Cube, the leader in live tech coverage. >>Okay, this concludes our coverage of Data Citizens 2022, brought to you by Collibra. Remember, all these videos are available on demand@thecube.net. And don't forget to check out silicon angle.com for all the news and wiki bod.com for our weekly breaking analysis series where we cover many data topics and share survey research from our partner ETR Enterprise Technology Research. If you want more information on the products announced at Data Citizens, go to collibra.com. There are tons of resources there. You'll find analyst reports, product demos. It's really worthwhile to check those out. Thanks for watching our program and digging into Data Citizens 2022 on the Cube, your leader in enterprise and emerging tech coverage. We'll see you soon.

Published Date : Nov 2 2022

SUMMARY :

largely about getting the technology to work. Now the cloud is definitely helping with that, but also how do you automate governance? So you can see how data governance has evolved into to say we extract the signal from the noise, and over the, the next couple of days, we're gonna feature some of the So it's a really interesting story that we're thrilled to be sharing And we said at the time, you know, maybe it's time to rethink data innovation. 2020s from the previous decade, and what challenges does that bring for your customers? as data becomes more impactful than important, the level of scrutiny with respect to privacy, So again, I think it just another incentive for organization to now truly look at data You know, I don't know when you guys founded Collibra, if, if you had a sense as to how complicated the last kind of financial crisis, and that was really the, the start of Colli where we found product market Well, that's interesting because, you know, in my observation it takes seven to 10 years to actually build a again, a lot of momentum in the org in, in the, in the markets with some of the cloud partners And the second is that those data pipelines that are now being created in the cloud, I mean, the acquisition of i l dq, you know, So that's really the theme of a lot of the innovation that we're driving. And so that's the big theme from an innovation perspective, One of our key differentiators is the ability to really drive a lot of automation through workflows. So actually pushing down the computer and data quality, one of the key principles you think about monetization. And I, and I think we we're really at this pivotal moment, and I think you said it well. We need to look beyond just the I know you're gonna crush it out there. This is Dave Valante for the cube, your leader in enterprise and Without data leverage the Collibra data catalog to automatically And for that you'll establish community owners, a data set to a KPI to a report now enables your users to see what Finally, seven, promote the value of this to your users and Welcome to the Cube's coverage of Data Citizens 2022 Collibra's customer event. And now you lead data quality at Collibra. imagine if we get that wrong, you know, what the ramifications could be, And I realized in that moment, you know, I might have failed him because, cause I didn't know. And it's so complex that the way companies consume them in the IT function is And so it's really become front and center just the whole quality issue because data's so fundamental, nowadays to this topic is, so maybe we could surface all of these problems with So the language is changing a you know, stale data, you know, the, the whole trend toward real time. we sort of lived this problem for a long time, you know, in, in the Wall Street days about a decade you know, they just said, Oh, it's a glitch, you know, so they didn't understand the root cause of it. And the one right now is these hyperscalers in the cloud. And I think if you look at the whole So this is interesting because what you just described, you know, you mentioned Snowflake, And so when you were to log into Big Query tomorrow using our I love this example because, you know, Barry talks about, wow, the cloud guys are gonna own the world and, Seeing that across the board, people used to know it was a zip code and nowadays Appreciate it. Right, and thank you for watching. Nice to be here. Can can you explain to our audience why the ability to manage data across the entire organization. I was gonna say, you know, when I look back at like the last 10 years, it was all about getting the technology to work and it And one of the big pushes and passions we have at Collibra is to help with I I, you know, you mentioned this idea of, and really speeding the time to value for any of the business analysts, So where do you see, you know, the friction in adopting new data technologies? So one of the other things we're announcing with, with all of the innovations that are coming is So anybody in the organization is only getting access to the data they should have access to. So it was kind of smart that you guys were early on and We're able to profile and classify that data we're announcing with Calibra Protect this week that and get the right and make sure you have the right quality. I mean, the nice thing about Snowflake, if you play in the Snowflake sandbox, you, you, you, you can get sort of a, We also are doing more with Google around, you know, GCP and kbra protect there, you know, this year, the event your, your perspectives. And so it's all about everybody being able to easily It was great to have you on the cube first time I believe, cube, your leader in enterprise and emerging tech coverage. the cloud where you get the benefit of scale and security and so on. And the last example that comes to mind is that of a large home loan, home mortgage, Stan, it's great to have you back on the cube. Talk to us about what you mean by data citizenship and the And we believe that today's organizations, you have a lot of people, And one of the conclusions they found as they So you can say, ok, I'm doing this, you know, data culture for everyone, awakening them But the IDC study that you just mentioned demonstrates they're three times So as to how you get this done, establish this. part of the equation of getting that right, is it's not enough to just have that leadership out Talk to us about how you are building a data culture within Collibra and But over the years I've run, you know, So we said you the data products can run, the data can flow and you know, the quality can be checked. The catalog for the data scientists to know what's in their data lake, and data citizens join kbra, they immediately have a place to go to, Yes, success of the data office. So for example, a pillar on the data engineering side is gonna be more related So how many of those domains do you have covered? to look into a crystal ball, what do you see in terms of the maturation industries and the number is estimated to be about 20,000 right now. How is that going to evolve for the next couple of years? And in that sense, they'll need to have both, you know, technical audiences and non-technical audiences And as the data show that you mentioned in that IDC study, the leader in live tech coverage. Okay, this concludes our coverage of Data Citizens 2022, brought to you by Collibra.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
LauraPERSON

0.99+

Lisa MartinPERSON

0.99+

DavePERSON

0.99+

AmazonORGANIZATION

0.99+

HeinekenORGANIZATION

0.99+

Dave ValantePERSON

0.99+

Laura SellersPERSON

0.99+

2008DATE

0.99+

CollibraORGANIZATION

0.99+

AdobeORGANIZATION

0.99+

Felix Von DalaPERSON

0.99+

GoogleORGANIZATION

0.99+

Felix Van DemaPERSON

0.99+

sevenQUANTITY

0.99+

Stan ChristiansPERSON

0.99+

2010DATE

0.99+

LisaPERSON

0.99+

San DiegoLOCATION

0.99+

JayPERSON

0.99+

50 dayQUANTITY

0.99+

FelixPERSON

0.99+

oneQUANTITY

0.99+

Kurt HasselbeckPERSON

0.99+

Bank of AmericaORGANIZATION

0.99+

10 yearQUANTITY

0.99+

California Consumer Privacy ActTITLE

0.99+

10 dayQUANTITY

0.99+

SixQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

Dave AntePERSON

0.99+

Last yearDATE

0.99+

demand@thecube.netOTHER

0.99+

ETR Enterprise Technology ResearchORGANIZATION

0.99+

BarryPERSON

0.99+

GartnerORGANIZATION

0.99+

one partQUANTITY

0.99+

PythonTITLE

0.99+

2010sDATE

0.99+

2020sDATE

0.99+

CalibraLOCATION

0.99+

last yearDATE

0.99+

twoQUANTITY

0.99+

CalibraORGANIZATION

0.99+

K Bear ProtectORGANIZATION

0.99+

two sidesQUANTITY

0.99+

Kirk HasselbeckPERSON

0.99+

12 monthsQUANTITY

0.99+

tomorrowDATE

0.99+

AWSORGANIZATION

0.99+

BarbPERSON

0.99+

StanPERSON

0.99+

Data CitizensORGANIZATION

0.99+

Kirk Haslbeck, Collibra | Data Citizens '22


 

(bright upbeat music) >> Welcome to theCUBE's Coverage of Data Citizens 2022 Collibra's Customer event. My name is Dave Vellante. With us is Kirk Hasselbeck, who's the Vice President of Data Quality of Collibra. Kirk, good to see you. Welcome. >> Thanks for having me, Dave. Excited to be here. >> You bet. Okay, we're going to discuss data quality, observability. It's a hot trend right now. You founded a data quality company, OwlDQ and it was acquired by Collibra last year. Congratulations! And now you lead data quality at Collibra. So we're hearing a lot about data quality right now. Why is it such a priority? Take us through your thoughts on that. >> Yeah, absolutely. It's definitely exciting times for data quality which you're right, has been around for a long time. So why now, and why is it so much more exciting than it used to be? I think it's a bit stale, but we all know that companies use more data than ever before and the variety has changed and the volume has grown. And while I think that remains true, there are a couple other hidden factors at play that everyone's so interested in as to why this is becoming so important now. And I guess you could kind of break this down simply and think about if Dave, you and I were going to build, you know a new healthcare application and monitor the heartbeat of individuals, imagine if we get that wrong, what the ramifications could be? What those incidents would look like? Or maybe better yet, we try to build a new trading algorithm with a crossover strategy where the 50 day crosses the 10 day average. And imagine if the data underlying the inputs to that is incorrect. We'll probably have major financial ramifications in that sense. So, it kind of starts there where everybody's realizing that we're all data companies and if we are using bad data, we're likely making incorrect business decisions. But I think there's kind of two other things at play. I bought a car not too long ago and my dad called and said, "How many cylinders does it have?" And I realized in that moment, I might have failed him because 'cause I didn't know. And I used to ask those types of questions about any lock brakes and cylinders and if it's manual or automatic and I realized I now just buy a car that I hope works. And it's so complicated with all the computer chips. I really don't know that much about it. And that's what's happening with data. We're just loading so much of it. And it's so complex that the way companies consume them in the IT function is that they bring in a lot of data and then they syndicate it out to the business. And it turns out that the individuals loading and consuming all of this data for the company actually may not know that much about the data itself and that's not even their job anymore. So, we'll talk more about that in a minute but that's really what's setting the foreground for this observability play and why everybody's so interested, it's because we're becoming less close to the intricacies of the data and we just expect it to always be there and be correct. >> You know, the other thing too about data quality and for years we did the MIT CDOIQ event we didn't do it last year at COVID, messed everything up. But the observation I would make there love thoughts is it data quality used to be information quality used to be this back office function, and then it became sort of front office with financial services and government and healthcare, these highly regulated industries. And then the whole chief data officer thing happened and people were realizing, well, they sort of flipped the bit from sort of a data as a a risk to data as an asset. And now, as we say, we're going to talk about observability. And so it's really become front and center, just the whole quality issue because data's fundamental, hasn't it? >> Yeah, absolutely. I mean, let's imagine we pull up our phones right now and I go to my favorite stock ticker app and I check out the NASDAQ market cap. I really have no idea if that's the correct number. I know it's a number, it looks large, it's in a numeric field. And that's kind of what's going on. There's so many numbers and they're coming from all of these different sources and data providers and they're getting consumed and passed along. But there isn't really a way to tactically put controls on every number and metric across every field we plan to monitor. But with the scale that we've achieved in early days, even before Collibra. And what's been so exciting is we have these types of observation techniques, these data monitors that can actually track past performance of every field at scale. And why that's so interesting and why I think the CDO is listening right intently nowadays to this topic is so maybe we could surface all of these problems with the right solution of data observability and with the right scale and then just be alerted on breaking trends. So we're sort of shifting away from this world of must write a condition and then when that condition breaks, that was always known as a break record. But what about breaking trends and root cause analysis? And is it possible to do that, with less human intervention? And so I think most people are seeing now that it's going to have to be a software tool and a computer system. It's not ever going to be based on one or two domain experts anymore. >> So, how does data observability relate to data quality? Are they sort of two sides of the same coin? Are they cousins? What's your perspective on that? >> Yeah, it's super interesting. It's an emerging market. So the language is changing a lot of the topic and areas changing the way that I like to say it or break it down because the lingo is constantly moving as a target on this space is really breaking records versus breaking trends. And I could write a condition when this thing happens it's wrong and when it doesn't, it's correct. Or I could look for a trend and I'll give you a good example. Everybody's talking about fresh data and stale data and why would that matter? Well, if your data never arrived or only part of it arrived or didn't arrive on time, it's likely stale and there will not be a condition that you could write that would show you all the good and the bads. That was kind of your traditional approach of data quality break records. But your modern day approach is you lost a significant portion of your data, or it did not arrive on time to make that decision accurately on time. And that's a hidden concern. Some people call this freshness, we call it stale data but it all points to the same idea of the thing that you're observing may not be a data quality condition anymore. It may be a breakdown in the data pipeline. And with thousands of data pipelines in play for every company out there there, there's more than a couple of these happening every day. >> So what's the Collibra angle on all this stuff made the acquisition you got data quality observability coming together, you guys have a lot of expertise in this area but you hear providence of data you just talked about stale data, the whole trend toward real time. How is Collibra approaching the problem and what's unique about your approach? >> Well, I think where we're fortunate is with our background, myself and team we sort of lived this problem for a long time in the Wall Street days about a decade ago. And we saw it from many different angles. And what we came up with before it was called data observability or reliability was basically the underpinnings of that. So we're a little bit ahead of the curve there when most people evaluate our solution. It's more advanced than some of the observation techniques that currently exist. But we've also always covered data quality and we believe that people want to know more, they need more insights and they want to see break records and breaking trends together so they can correlate the root cause. And we hear that all the time. I have so many things going wrong just show me the big picture. Help me find the thing that if I were to fix it today would make the most impact. So we're really focused on root cause analysis, business impact connecting it with lineage and catalog, metadata. And as that grows, you can actually achieve total data governance. At this point, with the acquisition of what was a lineage company years ago and then my company OwlDQ, now Collibra Data Quality, Collibra may be the best positioned for total data governance and intelligence in the space. >> Well, you mentioned financial services a couple of times and some examples, remember the flash crash in 2010. Nobody had any idea what that was, they just said, "Oh, it's a glitch." So they didn't understand the root cause of it. So this is a really interesting topic to me. So we know at Data Citizens '22 that you're announcing you got to announce new products, right? Your yearly event, what's new? Give us a sense as to what products are coming out but specifically around data quality and observability. >> Absolutely. There's always a next thing on the forefront. And the one right now is these hyperscalers in the cloud. So you have databases like Snowflake and Big Query and Data Bricks, Delta Lake and SQL Pushdown. And ultimately what that means is a lot of people are storing in loading data even faster in a salike model. And we've started to hook in to these databases. And while we've always worked with the same databases in the past they're supported today we're doing something called Native Database pushdown, where the entire compute and data activity happens in the database. And why that is so interesting and powerful now is everyone's concerned with something called Egress. Did my data that I've spent all this time and money with my security team securing ever leave my hands? Did it ever leave my secure VPC as they call it? And with these native integrations that we're building and about to unveil here as kind of a sneak peek for next week at Data Citizens, we're now doing all compute and data operations in databases like Snowflake. And what that means is with no install and no configuration you could log into the Collibra Data Quality app and have all of your data quality running inside the database that you've probably already picked as your your go forward team selection secured database of choice. So we're really excited about that. And I think if you look at the whole landscape of network cost, egress cost, data storage and compute, what people are realizing is it's extremely efficient to do it in the way that we're about to release here next week. >> So this is interesting because what you just described you mentioned Snowflake, you mentioned Google, oh actually you mentioned yeah, the Data Bricks. Snowflake has the data cloud. If you put everything in the data cloud, okay, you're cool but then Google's got the open data cloud. If you heard Google Nest and now Data Bricks doesn't call it the data cloud but they have like the open source data cloud. So you have all these different approaches and there's really no way up until now I'm hearing to really understand the relationships between all those and have confidence across, it's like (indistinct) you should just be a note on the mesh. And I don't care if it's a data warehouse or a data lake or where it comes from, but it's a point on that mesh and I need tooling to be able to have confidence that my data is governed and has the proper lineage, providence. And that's what you're bringing to the table. Is that right? Did I get that right? >> Yeah, that's right. And for us, it's not that we haven't been working with those great cloud databases, but it's the fact that we can send them the instructions now we can send them the operating ability to crunch all of the calculations, the governance, the quality and get the answers. And what that's doing, it's basically zero network cost, zero egress cost, zero latency of time. And so when you were to log into Big BigQuery tomorrow using our tool or let or say Snowflake, for example, you have instant data quality metrics, instant profiling, instant lineage and access privacy controls things of that nature that just become less onerous. What we're seeing is there's so much technology out there just like all of the major brands that you mentioned but how do we make it easier? The future is about less clicks, faster time to value faster scale, and eventually lower cost. And we think that this positions us to be the leader there. >> I love this example because every talks about wow the cloud guys are going to own the world and of course now we're seeing that the ecosystem is finding so much white space to add value, connect across cloud. Sometimes we call it super cloud and so, or inter clouding. Alright, Kirk, give us your final thoughts and on the trends that we've talked about and Data Citizens '22. >> Absolutely. Well I think, one big trend is discovery and classification. Seeing that across the board people used to know it was a zip code and nowadays with the amount of data that's out there, they want to know where everything is where their sensitive data is. If it's redundant, tell me everything inside of three to five seconds. And with that comes, they want to know in all of these hyperscale databases, how fast they can get controls and insights out of their tools. So I think we're going to see more one click solutions, more SAS-based solutions and solutions that hopefully prove faster time to value on all of these modern cloud platforms. >> Excellent, all right. Kurt Hasselbeck, thanks so much for coming on theCUBE and previewing Data Citizens '22. Appreciate it. >> Thanks for having me, Dave. >> You're welcome. All right, and thank you for watching. Keep it right there for more coverage from theCUBE.

Published Date : Oct 24 2022

SUMMARY :

Kirk, good to see you. Excited to be here. and it was acquired by Collibra last year. And it's so complex that the And now, as we say, we're going and I check out the NASDAQ market cap. and areas changing the and what's unique about your approach? of the curve there when most and some examples, remember and data activity happens in the database. and has the proper lineage, providence. and get the answers. and on the trends that we've talked about and solutions that hopefully and previewing Data Citizens '22. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

DavePERSON

0.99+

CollibraORGANIZATION

0.99+

Kurt HasselbeckPERSON

0.99+

2010DATE

0.99+

oneQUANTITY

0.99+

Kirk HasselbeckPERSON

0.99+

50 dayQUANTITY

0.99+

KirkPERSON

0.99+

10 dayQUANTITY

0.99+

OwlDQORGANIZATION

0.99+

Kirk HaslbeckPERSON

0.99+

next weekDATE

0.99+

GoogleORGANIZATION

0.99+

last yearDATE

0.99+

two sidesQUANTITY

0.99+

thousandsQUANTITY

0.99+

NASDAQORGANIZATION

0.99+

SnowflakeTITLE

0.99+

Data CitizensORGANIZATION

0.99+

Data BricksORGANIZATION

0.99+

two other thingsQUANTITY

0.98+

one clickQUANTITY

0.98+

tomorrowDATE

0.98+

todayDATE

0.98+

five secondsQUANTITY

0.97+

two domainQUANTITY

0.94+

Collibra Data QualityTITLE

0.92+

MIT CDOIQEVENT

0.9+

Data Citizens '22TITLE

0.9+

EgressORGANIZATION

0.89+

Delta LakeTITLE

0.89+

threeQUANTITY

0.86+

zeroQUANTITY

0.85+

Big QueryTITLE

0.85+

about a decade agoDATE

0.85+

SQL PushdownTITLE

0.83+

Data Citizens 2022 CollibraEVENT

0.82+

Big BigQueryTITLE

0.81+

more than a coupleQUANTITY

0.79+

coupleQUANTITY

0.78+

one bigQUANTITY

0.77+

Collibra Data QualityORGANIZATION

0.75+

CollibraOTHER

0.75+

Google NestORGANIZATION

0.75+

Data Citizens '22ORGANIZATION

0.74+

zero latencyQUANTITY

0.72+

SASORGANIZATION

0.71+

SnowflakeORGANIZATION

0.69+

COVIDORGANIZATION

0.69+

years agoDATE

0.68+

Wall StreetLOCATION

0.66+

theCUBEORGANIZATION

0.66+

many numbersQUANTITY

0.63+

CollibraPERSON

0.63+

timesQUANTITY

0.61+

DataORGANIZATION

0.61+

too longDATE

0.6+

Vice PresidentPERSON

0.57+

dataQUANTITY

0.56+

CDOTITLE

0.52+

BricksTITLE

0.48+

Todd Foley, Lydonia Technologie & Devika Saharya, MongoDB | UiPath Forward 5


 

(intro upbeat music) >> TheCUBE presents UiPath Forward5, Brought to you by UiPath. >> Welcome to day two of Forward5 UiPath Customer Conference. You're watching theCUBE. My name is Dave Vellante. My co-host is David Nicholson. Yesterday, Dave, we heard about the extension into an enterprise platform. We heard about, from the two CEOs, a new go-to-market strategy. We heard from a lot of customers how they're implementing UiPath generally and automation, specifically, scaling, hyper-automation, and all the buzzwords you hear. Todd Foley is the CDO and CSO of Lydonia Technologies and Devika Saharya is the director of ERP and RPA at MongoDB. Folks, welcome to theCUBE. Thanks for taking time out of your busy day and coming on. >> Thank you Dave. >> Thank you so much. >> So let's start with the roles. So Devika, ERP and RPA. >> Yes. >> It's like peanut butter and jelly, or how do those things relate? What's your, what's your role? >> Absolutely. So I started at Mongo as an ERP manager, and you know, as we were growing, the one thing that came out of, you know, the every year goals for the company, one big goal that came out was how we have to scale. There are so many barriers to scale. How can we become a billion dollar company? What do we need to do? And when we started drilling down into, you know, different areas, we figured it out that people do a lot of stuff manually. It's like comparing sheets, you know, copying data from one place to the other, and so on and so forth. So one thing that we realized was we definitely need some kind of automation. At that time, we didn't know about automation, but we did our own market research and here we are. >> Let's automate. Yeah, right. (Devika laughs) Sounds easy. All right, thank you. Todd, CDO, Chief Data or Chief Dig, and CSO, I'm assuming Chief Data? >> Chief Data. >> And the Chief Information Security Officer. Tell us about Lydonia and also your role. >> Sure, Lydonia, we started just over three years ago. We looked at the RPA market. We saw great opportunity, but we also saw a challenge. We saw that a lot of people had deployed RPA but weren't getting the promised, you know, immediate ROI, rapid deployment that was out there. And when we looked at it, we saw that it really wasn't a technical challenge. Sometimes it was how technology was applied, but there were a lot of things that people were doing in their process and how they were treating RPA, often as if it were traditional technology that slowed them down. So we built our practice, our company, around the idea of being able to help people scale very quickly and drive that faster. And we're finding now with the RPA being pretty ubiquitous, that it's the one thing that's in the greatest demand among our clients. >> Okay, so you're the implementation partner for Mongo, is that right? >> We are. >> Okay, so relatively new. Very new actually, but a specialist. Why'd you choose Lydonia? >> So, that's an interesting question. When we came last year to UiPath Forward, we were looking for, you know, the right kind of people who can, you know, put us on track. We had the technology, we had everything in place, we did the POC, everybody liked it, but we didn't know how to, you know, basically go in that direction. We were missing that direction. And then we, you know, we were doing our homework here, we found, we accidentally stumbled with Lydonia, and I had follow up conversations with Todd, and they were just so tapered. I knew exactly what Todd was explaining me, and we knew we are, we are in safe hands. >> So, where did you start? >> So we, the first thing that we did was a POC for the finance side of business. And right after that POC, we realized that, you know, how much time people were actually investing manually, like things that were done in three to four days was turning into a 30 minute process. And that gave us, you know, the idea that we should start drilling down into different departments and try to find where there are, you know, areas where we can improve. And we did all of that. And then we met with Todd, and Todd explained that how his Reignite process works. So we took Reignite as our first step and, you know, took it from there. We chose one department, we worked with them. We had about 10 processes highlighted, thanks to Todd, he worked with them, and he literally drilled and nailed it down that what we need to do. And as of today, all those 10 are automated. >> Wow. Okay. >> Todd, does this interaction between Lydonia and MongoDB, as a customer, apply equally in the field when you're going out and talking to clients that might be running MongoDB, they might be customers of MongoDB, they may have financial applications that are backended with MongoDB, is there a synergy there that you've been able to gain? >> I think there is. I think there's one thing that's kind of unique about RPA, and that the traditional questions around integration and applicability aren't as important when you have a platform that can work with anything that people can use. I think also, you know, when we look at what we typically do with people, some of the things we see at Mongo are very common use cases you know, across all of our clients. So I, there's definitely the ability for us to take things we've done and have clients get leverage out of them. At the same time, the platform itself is, makes it different than a traditional model where, you know if somebody has worked in a particular area or built an automation for a particular application, there's some kind of utility to do it faster for another client. What we find is that that's not really the case. And that oftentimes we'll compete with people who use different tool sets than UiPath who have that kind of value story around having done it before, we come in and we do it twice as fast as they could. >> So you've, you're a veteran of complex integrations. >> Oh yeah. (Todd laughs) >> I know that from our paths have crossed in the past. So you're saying that in this world of RPA, that this tool set like UiPath as a platform, we've been talking a lot about the difference between being a tool set and being a platform. >> Right. >> That this platform can sort of hover above things without that same layer of complexity, or level of complexity, that you've experienced in the past. Because that speaks to the idea that UiPath, as a platform, is going to work moving forward in a big way. >> Exactly, right. I think we've seen for years and years that regardless of the type of development environment you're using, a developer's value sometimes is based on what reusable libraries they've created, what they have to cut and paste from their old code to be able to do things faster. The challenge with that is it has to be maintained, when things change, they've got to update those libraries. It's a value prop that's very high touch. With UiPath, they've created the ultimate in reusability. The platform, especially since they acquired cloud elements and built all of those API integrations into their platform. The platform maintains the reusability and the libraries in such a way where they're drag and drop from a development standpoint and you don't have to maintain them. It's the ultimate expression of reusability as a platform. >> Yeah, cloud elements, API automation, obviously a key pick by UiPath. Devika, what's the scale of your operation today? Like how many bots and where do you see it going? >> Yes. So we, we started with one bot. Last year we experimented a lot that, you know, we were just trying to make our footprint in the company, trying to understand that, you know, people understand what RPA is, what UiPath is. Initially we got a lot of pushback. We got a pushback from our security team as well, because they could not understand, you know, that what UiPath is and how secure it is. And we had to explain them that how we would host it over AWS, how we will work, how we will not save passwords, et cetera. When we did all of that and they got comfort, we started picking, you know, very small processes around to show, you know, people the capability of RPA and UiPath per se. When we did that, people started just coming with bigger processes, and one specific team that I can think of came that we do, you know, fuzzy logic in Excel, and we do it twice a week, but it takes a lot of time. We automated it, they run it daily, every single day, two times now. And the exponential growth that we saw just with that one automation was mind boggling. I couldn't believe that, you know. We were tracking our insights and we were like, oh my God, what happened? It just blew out of proportion. >> Okay. So then did you need more bots? Are you still running one bot, or? >> Nope. Now at the moment we have nine. >> Okay. >> And we are still looking to grow. >> Okay. So the initial friction, you said there was some, you know, concern, it was primarily security or were there others, people afraid they're going to lose their jobs? Was there any of that? >> There was no risk of losing the job. The major, you know, pushback was, one was from security, the other one was from different system owners because a lot of people were not sure why we want UI access, or why we want API access, and why are we accessing their systems? What type of information we are trying to gather out of their systems. Are we writing into their system? Because a lot of people have issues when we start saying that we will write or override data. So most of the processes that we are working around are either writing, comparing, and reading and comparing, and if it is writing, we take special permission that this is what we are going to do. >> So what did you have to do to get through the security mottle, a AWS SOC 2 report, did you have to show them the UiPath pen test? >> Absolutely. >> Did you have to change any of your processes? What was that sort of punch list like? >> Everything. >> Yeah. >> So we had to start from pen test. We had to start, we had to explain that UiPath is in the process of, you know, acquiring SOC. We also explained that how things are hosted on AWS. We had to, you know, bring our consultants in who explained that how on, on AWS, this will be a very secured way of doing things. And when we did our first process, which was actually for the auditors, which is, you know, interesting. >> Yeah. >> What we did was we did segregation of duties, which I think is very important in every field and every sphere we work in. So for example, the the writeup that we were building for auditors, we made sure that it is approved by a physical or a human, you know, and not everything is done by the bot. The biggest piece of the puzzle was writing, you know, because it was taking a lot of time. People were going into different systems, gathering information, putting it on Excel, and then you know, comparing and submitting it to PWC. >> When you say write, you mean any update to a system of record? >> Correct. >> Required some scrutiny? >> Some scrutiny, yes, yes. >> Okay, initially by a human until there was comfort level and then it's like these bots know what they're doing. >> Correct, correct. >> Okay. And now you're a NetSuite customer, correct? >> Yes. >> That's your ERP? >> That's right. >> Now we were talking about Oracle is going to acquire OCR capabilities. Will that, and we've been talking, Dave and I, a week about, okay well ServiceNow has, you know, RPA, and Salesforce, and SAP, et cetera. How will that affect your thinking about adopting UiPath? >> I don't think it should matter because I think all these systems kind of coexist in a bigger ecosystem, you know, and I also feel that all these systems have their own plus points and minus points. Not one system in, per se, can do everything within a company. So it could be that, for example, NetSuite might be very strong for financials in the space we are in, but not extremely good around sales and marketing. So for that company chose Salesforce. So you know, you have those smaller smaller multiple systems that build into a bigger ecosystem, right. And I think the other piece of the puzzle is that UiPath helps bridge that gap between these systems. You know, it could happen that certain things can get integrated, certain things cannot because of the nature of business, the nature of work that the teams are trying to do. And I think UiPath is leveraging that gap, you know, and putting, you know, those strings together. >> As you scale - >> Mm hmm. >> How will, and Todd I presume you're going to assist in this process, but how will you decide what processes to prioritize, and is that a process driven decision? Is it data led? Both? If so, what kind of data? Can you describe how you guys are going to approach that? >> Yep. Todd, would you like to take that first before I start? >> Sure, yeah. >> Maybe some best practices and then we can maybe get specific to Mongo. >> Absolutely. Our guidance is always that it should be a business decision, right? And it should be data driven, based on a business defined metric around the business case for that particular automation. Our guidance to customers is don't automate it unless you know why you're automating it, and what the value is. We see sometimes there are challenges with people being able to articulate the business case for an automation, and it can almost always be resolved by having that business case be the first step, and qualifying and identifying an automation candidate. >> And how does that apply to Mongo? Do you, where are you thinking about scaling, in your opinion? >> It's interesting because, you know, initially we thought that we will, you know, explore one area in MongoDB. And the other thing that we did was we did road shows. So because we had to create some awareness in the company that we have UiPath there's something called bots. There's something called, you know, automation that we can do, so we created a presentation with small demos inside it and, you know, circulated it within the company. Different departments tried to explain what we can achieve. And based off of that, you know, we came up with a laundry list of all the automations that different departments needed. And out of that, you know, we started doing the business case, the value, you know, trying to come up with complexity, effort. We did a full estimation matrix and based off of that we came, okay, these are the top 20 that we should build first. And as soon as we built those top 20, we saw a skyrocket, you know, growth and - >> And you're looking for hard dollars, right? >> Yes, yes. Absolutely. >> Okay, just to be clear. >> Devika, I think Mongo also is great at taking a data driven approach to looking at their program. Do you want to share how you do that? >> Yes, absolutely. So one thing that we were very sure was we have to talk in terms of numbers because that's the only solid way to see growth. And what we did was, you know, we got insights, we started doing full metrics in terms of dollar saved, hour saved, and we are trying to track how every process is impacting, you know, in the grand scheme of things. Like say for example, for finance, are we shortening the close cycle in any shape or form by doing these two or three automations that we are doing? And I'm happy to report that we have really shortened our close cycle from where we started. >> Your quarter end or month end close. >> Correct, yes. >> Daily? You at the daily close yet, (all laugh) or the "John Chambers"? >> Drive everyone nuts. First I have to say, I could feel the audience sort of smiling as they see, as they hear from MongoDB, disruptor of legacy databases being cautious in their internal approach to change. As everyone else is. >> Exactly, yeah. >> But Todd, just sort of, double clicking on this idea of kind of stove pipes of capabilities in the RPA space. I mean OCR, being added to NetSuite, I'm not sure if that's the greatest example, but the point is Lydonia will work with all of those technologies to synthesize something. Is that correct? Or are you a UiPath only? >> Both. So we exclusively use UiPath with our customers. We don't use other RPA platforms. >> Okay. >> And we don't because, not because we can't, but because we don't believe that anything else is going to be as quick or as effective. Also, it's the only platform that is as broad and comprehensive as it needs to be to deliver outcomes to our customers. We have partnerships with other companies that have gaps where UiPath isn't currently playing, but the number of companies and the number of gaps has shrunk down to almost nothing these days. And we're well placed as UiPath continues to grow their platform to take advantage of that and leverage that to deliver outcomes to customers. >> It was a great story of starting small, being careful. >> Yes. >> And prudent, from a security standpoint, especially as a public company. And then it sounds like there's virtually unlimited opportunity. >> Yes, absolutely, absolutely. >> For you guys. Great story, thank you very much for sharing it. Appreciate it. >> Thank you. >> All right, good luck. All right, thank you for watching. Keep it right there. Dave Nicholson and Dave Vellante will be back from UiPath Forward5 from the Venetian in Las Vegas. Be right back. (upbeat music playing)

Published Date : Sep 30 2022

SUMMARY :

Brought to you by UiPath. and all the buzzwords you hear. So Devika, ERP and RPA. that came out of, you know, the every year All right, thank you. And the Chief Information that it's the one thing Why'd you choose Lydonia? we were looking for, you And that gave us, you know, and that the traditional So you've, you're a veteran Oh yeah. have crossed in the past. Because that speaks to and you don't have to maintain them. where do you see it going? that we do, you know, So then did you need more bots? Now at the moment we have nine. So the initial friction, you that we will write or override data. We had to start, we had and then you know, comparing and then it's like these bots know And now you're a NetSuite ServiceNow has, you know, leveraging that gap, you know, Todd, would you like to take and then we can maybe unless you know why you're automating it, that we will, you know, Yes, yes. Do you want to share how you do that? automations that we are doing? I could feel the audience capabilities in the RPA space. So we exclusively use and leverage that to deliver It was a great story of And then it sounds like there's Great story, thank you All right, thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
ToddPERSON

0.99+

David NicholsonPERSON

0.99+

Todd FoleyPERSON

0.99+

Dave VellantePERSON

0.99+

Dave NicholsonPERSON

0.99+

twoQUANTITY

0.99+

DavePERSON

0.99+

30 minuteQUANTITY

0.99+

OracleORGANIZATION

0.99+

Last yearDATE

0.99+

two timesQUANTITY

0.99+

Lydonia TechnologiesORGANIZATION

0.99+

ExcelTITLE

0.99+

threeQUANTITY

0.99+

nineQUANTITY

0.99+

John ChambersPERSON

0.99+

MongoORGANIZATION

0.99+

last yearDATE

0.99+

BothQUANTITY

0.99+

Las VegasLOCATION

0.99+

first stepQUANTITY

0.99+

AWSORGANIZATION

0.99+

one departmentQUANTITY

0.99+

PWCORGANIZATION

0.99+

todayDATE

0.99+

10QUANTITY

0.99+

FirstQUANTITY

0.99+

UiPathTITLE

0.99+

Devika SaharyaPERSON

0.99+

one botQUANTITY

0.98+

NetSuiteTITLE

0.98+

first processQUANTITY

0.98+

four daysQUANTITY

0.98+

LydoniaORGANIZATION

0.98+

YesterdayDATE

0.98+

DevikaPERSON

0.98+

UiPathORGANIZATION

0.98+

twice a weekQUANTITY

0.98+

MongoDBTITLE

0.98+

twiceQUANTITY

0.97+

three automationsQUANTITY

0.97+

firstQUANTITY

0.96+

two CEOsQUANTITY

0.95+

SOC 2TITLE

0.95+

SalesforceORGANIZATION

0.95+

one big goalQUANTITY

0.95+

MongoDBORGANIZATION

0.94+

Lydonia TechnologieORGANIZATION

0.94+

about 10 processesQUANTITY

0.93+

DigPERSON

0.93+

ServiceNowORGANIZATION

0.93+

Forward5EVENT

0.92+

one systemQUANTITY

0.92+

top 20QUANTITY

0.92+

oneQUANTITY

0.91+

day twoQUANTITY

0.91+

one specific teamQUANTITY

0.91+

Jon Loyens, data.world | Snowflake Summit 2022


 

>>Good morning, everyone. Welcome back to the Cube's coverage of snowflake summit 22 live from Caesar's forum in Las Vegas. Lisa Martin, here with Dave Valante. This is day three of our coverage. We've had an amazing, amazing time. Great conversations talking with snowflake executives, partners, customers. We're gonna be digging into data mesh with data.world. Please welcome John loins, the chief product officer. Great to have you on the program, John, >>Thank you so much for, for having me here. I mean, the summit, like you said, has been incredible, so many great people, so such a good time, really, really nice to be back in person with folks. >>It is fabulous to be back in person. The fact that we're on day four for, for them. And this is the, the solution showcase is as packed as it is at 10 11 in the morning. Yeah. Is saying something >>Yeah. Usually >>Chopping at the bit to hear what they're doing and innovate. >>Absolutely. Usually those last days of conferences, everybody starts getting a little tired, but we're not seeing that at all here, especially >>In Vegas. This is impressive. Talk to the audience a little bit about data.world, what you guys do and talk about the snowflake relationship. >>Absolutely data.world is the only true cloud native enterprise data catalog. We've been an incredible snowflake partner and Snowflake's been an incredible partner to us really since 2018. When we became the first data catalog in the snowflake partner connect experience, you know, snowflake and the data cloud make it so possible. And it's changed so much in terms of being able to, you know, very easily transition data into the cloud to break down those silos and to have a platform that enables folks to be incredibly agile with data from an engineering and infrastructure standpoint, data out world is able to provide a layer of discovery and governance that matches that agility and the ability for a lot of different stakeholders to really participate in the process of data management and data governance. >>So data mesh basically Jamma, Dani lays out the first of all, the, the fault domains of existing data and big data initiatives. And she boils it down to the fact that it's just this monolithic architecture with hyper specialized teams that you have to go through and it just slows everything down and it doesn't scale. They don't have domain context. So she came up with four principles if I may, yep. Domain ownership. So push it out to the businesses. They have the context they should own the data. The second is data as product. We're certainly hearing a lot about that today this week. The third is that. So that makes it sounds good. Push out the, the data great, but it creates two problems. Self-serve infrastructure. Okay. But her premises infrastructure should be an operational detail. And then the fourth is computational governance. So you talked about data CA where do you fit in those four principles? >>You know, honestly, we are able to help teams realize the data mesh architecture. And we know that data mesh is really, it's, it's both a process in a culture change, but then when you want to enact a process in a culture change like this, you also need to select the appropriate tools to match the culture that you're trying to build the process in the architecture that you're trying to build. And the data world data catalog can really help along all four of those axes. When you start thinking first about, let's say like, let's take the first one, you know, data as a product, right? We even like very meta of us from metadata management platform at the end of the day. But very meta of us. When you talk about data as a product, we track adoption and usage of all your data assets within your organization and provide program teams and, you know, offices of the CDO with incredible evented analytics, very detailed that gives them the right audit trail that enables them to direct very scarce data engineering, data architecture resources, to make sure that their data assets are getting adopted and used properly. >>On the, on the domain driven side, we are entirely knowledge graph and open standards based enabling those different domains. We have, you know, incredible joint snowflake customers like Prologis. And we chatted a lot about this in our session here yesterday, where, because of our knowledge graph underpinnings, because of the flexibility of our metadata model, it enables those domains to actually model their assets uniquely from, from group to group, without having to, to relaunch or run different environments. Like you can do that all within one day catalog platform without having to have separate environments for each of those domains, federated governance. Again, the amount of like data exhaust that we create that really enables ambient governance and participatory governance as well. We call it agile data governance, really the adoption of agile and open principles applied to governance to make it more inclusive and transparent. And we provide that in a way that Confederate across those means and make it consistent. >>Okay. So you facilitate across that whole spectrum of, of principles. And so what in the, in the early examples of data mesh that I've studied and actually collaborated with, like with JPMC, who I don't think is who's not using your data catalog, but hello, fresh who may or may not be, but I mean, there, there are numbers and I wanna get to that. But what they've done is they've enabled the domains to spin up their own, whatever data lakes, data, warehouses, data hubs, at least in, in concept, most of 'em are data lakes on AWS, but still in concept, they wanna be inclusive and they've created a master data catalog. And then each domain has its sub catalogue, which feeds into the master and that's how they get consistency and governance and everything else is, is that the right way to think about it? And or do you have a different spin on that? >>Yeah, I, I, you know, I have a slightly different spin on it. I think organizationally it's the right way to think about it. And in absence of a catalog that can truly have multiple federated metadata models, multiple graphs in one platform, I, that is really kind of the, the, the only way to do it, right with data.world. You don't have to do that. You can have one platform, one environment, one instance of data.world that spans all of your domains, enable them to operate independently and then federate across. So >>You just answered my question as to why I should use data.world versus Amazon glue. >>Oh, absolutely. >>And that's a, that's awesome that you've done now. How have you done that? What, what's your secret >>Sauce? The, the secret sauce era is really an all credit to our CTO. One of my closest friends who was a true student of knowledge graph practices and principles, and really felt that the right way to manage metadata and knowledge about the data analytics ecosystem that companies were building was through federated linked data, right? So we use standards and we've built a, a, an open and extensible metadata model that we call costs that really takes the best parts of existing open standards in the semantics space. Things like schema.org, DCA, Dublin core brings them together and models out the most typical enterprise data assets providing you with an ontology that's ready to go. But because of the graph nature of what we do is instantly accessible without having to rebuild environments, without having to do a lot of management against it. It's, it's really quite something. And it's something all of our customers are, are very impressed with and, and, and, and, you know, are getting a lot of leverage out of, >>And, and we have a lot of time today, so we're not gonna shortchange this topic. So one last question, then I'll shut up and let you jump in. This is an open standard. It's not open source. >>No, it's an open built on open standards, built on open standards. We also fundamentally believe in extensibility and openness. We do not want to vertically like lock you into our platform. So everything that we have is API driven API available. Your metadata belongs to you. If you need to export your graph, you know, instantly available in open machine readable formats. That's really, we come from the open data community. That was a lot of the founding of data.world. We, we worked a lot in with the open data community and we, we fundamentally believe in that. And that's enabled a lot of our customers as well to truly take data.world and not have it be a data catalog application, but really an entire metadata management platform and extend it even further into their enterprise to, to really catalog all of their assets, but also to build incredible integrations to things like corporate search, you know, having data assets show up in corporate Wiki search, along with all the, the descriptive metadata that people need has been incredibly powerful and an incredible extension of our platform that I'm so happy to see our customers in. >>So leasing. So it's not exclusive to, to snowflake. It's not exclusive to AWS. You can bring it anywhere. Azure GCP, >>Anytime. Yeah. You know where we are, where we love snowflake, look, we're at the snowflake summit. And we've always had a great relationship with snowflake though, and really leaned in there because we really believe Snowflake's principles, particularly around cloud and being cloud native and the operating advantages that it affords companies that that's really aligned with what we do. And so snowflake was really the first of the cloud data catalogs that we ultimately or say the cloud data warehouses that we integrated with and to see them transition to building really out the data cloud has been awesome. >>Talk about how data world and snowflake enable companies like per lodges to be data companies. These days, every company has to be a data company, but they, they have to be able to do so quickly to be competitive and to, to really win. How do you help them if we like up level the conversation to really impacting the overall business? >>That's a great question, especially right now, everybody knows. And pro is a great example. They're a logistics and supply chain company at the end of the day. And we know how important logistics and supply chain is nowadays and for them and for a lot of our customers. I think one of the advantages of having a data catalog is the ability to build trust, transparency and inclusivity into their data analytics practice by adopting agile principles, by adopting a data mesh, you're able to extend your data analytics practice to a much broader set of stakeholders and to involve them in the process while the work is getting done. One of the greatest things about agile software development, when it became a thing in the early two thousands was how inclusive it was. And that inclusivity led to a much faster ROI on software projects. And we see the same thing happening in data analytics, people, you know, we have amazing data scientists and data analysts coming up with these insights that could be business changing that could make their company significantly more resilient, especially in the face of economic uncertainty. >>But if you have to sit there and argue with your business stakeholders about the validity of the data, about the, the techniques that were used to do the analysis, and it takes you three months to get people to trust what you've done, that opportunity's passed. So how do we shorten those cycles? How do we bring them closer? And that's, that's really a huge benefit that like Prologis has, has, has realized just tightening that cycle time, building trust, building inclusion, and making sure ultimately humans learn by doing, and if you can be inclusive, it, even, it even increases things like that. We all want to, to, to, to help cuz Lord knows the world needs it. Things like data literacy. Yeah. Right. >>So data.world can inform me as to where on the spectrum of data quality, my data set lives. So I can say, okay, this is usable, shareable, you know, exactly of gold standard versus fix this. Right. Okay. Yep. >>Yep. >>That's yeah. Okay. And you could do that with one data catalog, not a bunch of >>Yeah. And trust trust is really a multifaceted and multi multi-angle idea, right? It's not just necessarily data quality or data observability. And we have incredible partnerships in that space, like our partnership with, with Monte Carlo, where we can ingest all their like amazing observability information and display that in a really like a really consumable way in our data catalog. But it also includes things like the lineage who touch it, who is involved in the process of a, can I get a, a, a question answered quickly about this data? What's it been used for previously? And do I understand that it's so multifaceted that you have to be able to really model and present that in a way that's unique to any given organization, even unique within domains within a single organization. >>If you're not, that means to suggest you're a data quality. No, no supplier. Absolutely. But your partner with them and then that you become the, the master catalog. >>That's brilliant. I love it. Exactly. And you're >>You, you just raised your series C 15 million. >>We did. Yeah. So, you know, really lucky to have incredible investors like Goldman Sachs, who, who led our series C it really, I think, communicates the trust that they have in our vision and what we're doing and the impact that we can have on organization's ability to be agile and resilient around data analytics, >>Enabling customers to have that single source of truth is so critical. You talked about trust. That is absolutely. It's no joke. >>Absolutely. >>That is critical. And there's a tremendous amount of business impact, positive business impact that can come from that. What are some of the things that are next for data.world that we're gonna see? >>Oh, you know, I love this. We have such an incredibly innovative team. That's so dedicated to this space and the mission of what we're doing. We're out there trying to fundamentally change how people get data analytics work done together. One of the big reasons I founded the company is I, I really truly believe that data analytics needs to be a team sport. It needs to go from, you know, single player mode to team mode and everything that we've worked on in the last six years has leaned into that. Our architecture being cloud native, we do, we've done over a thousand releases a year that nobody has to manage. You don't have to worry about upgrading your environment. It's a lot of the same story that's made snowflake. So great. We are really excited to have announced in March on our own summit. And we're rolling this suite of features out over the course of the year, a new package of features that we call data.world Eureka, which is a suite of automations and, you know, knowledge driven functionality that really helps you leverage a knowledge graph to make decisions faster and to operationalize your data in, in the data ops way with significantly less effort, >>Big, big impact there. John, thank you so much for joining David, me unpacking what data world is doing. The data mesh, the opportunities that you're giving to customers and every industry. We appreciate your time and congratulations on the news and the funding. >>Ah, thank you. It's been a, a true pleasure. Thank you for having me on and, and I hope, I hope you guys enjoy the rest of, of the day and, and your other guests that you have. Thank you. >>We will. All right. For our guest and Dave ante, I'm Lisa Martin. You're watching the cubes third day of coverage of snowflake summit, 22 live from Vegas, Dave and I will be right back with our next guest. So stick around.

Published Date : Jun 16 2022

SUMMARY :

Great to have you on the program, John, I mean, the summit, like you said, has been incredible, It is fabulous to be back in person. Usually those last days of conferences, everybody starts getting a little tired, but we're not seeing that at all here, what you guys do and talk about the snowflake relationship. And it's changed so much in terms of being able to, you know, very easily transition And she boils it down to the fact that it's just this monolithic architecture with hyper specialized teams about, let's say like, let's take the first one, you know, data as a product, We have, you know, incredible joint snowflake customers like Prologis. governance and everything else is, is that the right way to think about it? And in absence of a catalog that can truly have multiple federated How have you done that? of knowledge graph practices and principles, and really felt that the right way to manage then I'll shut up and let you jump in. an incredible extension of our platform that I'm so happy to see our customers in. It's not exclusive to AWS. first of the cloud data catalogs that we ultimately or say the cloud data warehouses but they, they have to be able to do so quickly to be competitive and to, thing happening in data analytics, people, you know, we have amazing data scientists and data the data, about the, the techniques that were used to do the analysis, and it takes you three So I can say, okay, this is usable, shareable, you know, That's yeah. that you have to be able to really model and present that in a way that's unique to any then that you become the, the master catalog. And you're that we can have on organization's ability to be agile and resilient Enabling customers to have that single source of truth is so critical. What are some of the things that are next for data.world that we're gonna see? It needs to go from, you know, single player mode to team mode and everything The data mesh, the opportunities that you're giving to customers and every industry. and I hope, I hope you guys enjoy the rest of, of the day and, and your other guests that you have. So stick around.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

Lisa MartinPERSON

0.99+

Dave ValantePERSON

0.99+

DavePERSON

0.99+

JohnPERSON

0.99+

Jon LoyensPERSON

0.99+

Monte CarloORGANIZATION

0.99+

John loinsPERSON

0.99+

AmazonORGANIZATION

0.99+

MarchDATE

0.99+

Las VegasLOCATION

0.99+

VegasLOCATION

0.99+

Goldman SachsORGANIZATION

0.99+

yesterdayDATE

0.99+

three monthsQUANTITY

0.99+

AWSORGANIZATION

0.99+

one platformQUANTITY

0.99+

one dayQUANTITY

0.99+

thirdQUANTITY

0.99+

two problemsQUANTITY

0.99+

fourthQUANTITY

0.99+

OneQUANTITY

0.99+

2018DATE

0.99+

15 millionQUANTITY

0.98+

DaniPERSON

0.98+

secondQUANTITY

0.98+

firstQUANTITY

0.98+

third dayQUANTITY

0.98+

first oneQUANTITY

0.98+

SnowflakeORGANIZATION

0.98+

DCAORGANIZATION

0.98+

one last questionQUANTITY

0.98+

data.world.ORGANIZATION

0.97+

PrologisORGANIZATION

0.97+

JPMCORGANIZATION

0.97+

each domainQUANTITY

0.97+

today this weekDATE

0.97+

JammaPERSON

0.97+

bothQUANTITY

0.97+

first data catalogQUANTITY

0.95+

Snowflake Summit 2022EVENT

0.95+

eachQUANTITY

0.94+

todayDATE

0.94+

singleQUANTITY

0.94+

data.worldORGANIZATION

0.93+

day threeQUANTITY

0.93+

oneQUANTITY

0.93+

one instanceQUANTITY

0.92+

over a thousand releases a yearQUANTITY

0.92+

day fourQUANTITY

0.91+

SnowflakeTITLE

0.91+

fourQUANTITY

0.91+

10 11 in the morningDATE

0.9+

22QUANTITY

0.9+

one environmentQUANTITY

0.9+

single organizationQUANTITY

0.88+

four principlesQUANTITY

0.86+

agileTITLE

0.85+

last six yearsDATE

0.84+

one data catalogQUANTITY

0.84+

EurekaORGANIZATION

0.83+

Azure GCPTITLE

0.82+

CaesarPERSON

0.82+

series COTHER

0.8+

CubeORGANIZATION

0.8+

data.worldOTHER

0.78+

LordPERSON

0.75+

thousandsQUANTITY

0.74+

single sourceQUANTITY

0.74+

DublinORGANIZATION

0.73+

snowflake summit 22EVENT

0.7+

WikiTITLE

0.68+

schema.orgORGANIZATION

0.67+

early twoDATE

0.63+

CDOTITLE

0.48+

Ankit Goel, Aravind Jagannathan, & Atif Malik


 

>>From around the globe. It's the cube covering data citizens. 21 brought to you by Colibra >>Welcome to the cubes coverage of Collibra data citizens 21. I'm Lisa Martin. I have three guests with me here today. Colibra customer Freddie Mac, please welcome JAG chief data officer and vice president of single family data and decisions. Jog. Welcome to the cube. >>Thank you, Lisa. Look forward to be, >>Uh, excellent on Kiko LSU as well. Vice president data transformation and analytics solution on Kay. Good to have you on the program. >>Thank you, Lisa. Great to be here and >>A teeth Malik senior director from the single family division at Freddie Mac is here as well. A team welcome. So we have big congratulations in order. Uh, pretty Mac was just announced at data citizens as the winners of the Colibra excellence award for data program of the year. Congratulations on that. We're going to unpack that. Talk about what that means, but I'd love to get familiar with the 3d Jack. Start with you. Talk to me a little bit about your background, your current role as chief data officer. >>Appreciate it, Lisa, thank you for the opportunity to share our story. Uh, my name is Arvind calls me Jack. And as you said, I'm just single-family chief data officer at Freddie Mac, but those that don't know, Freddie Mac is a Garland sponsored entity that supports the U S housing finance system and single family deals with the residential side of the marketplace, as CDO are responsible for our managed content data lineage, data governance, business architecture, which Cleaver plays a integral role, uh, in, in depth, that function as well as, uh, support our shared assets across the enterprise and our data monetization efforts, data, product execution, decision modeling, as well as our business intelligence capabilities, including AI and ML for various use cases as a background, starting my career in New York and then moved to Boston and last 20 years of living in the Northern Virginia DC area and fortunate to have been responsible for business operations, as well as led and, um, executed large transformation efforts. That background has reinforced the power of data and how, how it's so critical to meeting our business objectives. Look forward to our dialogue today, Lisa, once again. >>Excellent. You have a great background and clearly not a dull moment in your job with Freddy, Matt. And tell me a little bit about your background, your role, what you're doing at Freddie >>Mac. Definitely. Um, hi everyone. I'm,, I'm vice president of data transformation and analytics solutions. And I worked for JAG. I'm responsible for many of the things he said, including leading our transformation to the cloud and migrating all our existing data assets front of that transformation journey. I'm also responsible for our business information and business data architecture, decision modeling, business intelligence, and some of the analytics and artificial intelligence. I started my career back in the day as a computer engineer, but I've always been in the financial industry up in New York. And now in the Northern Virginia area, I called myself that bridge between business and technology. And I would say, I think over the last six years with data found that perfect spot where business and technology actually come together to solve real problems and, and really lead, um, you know, businesses to the next stage of, so thank you Lisa for the opportunity today. Excellent. >>And we're going to unpack you call yourself the bridge between business and it that's always such an important bridge. We're going to talk about that in just a minute, but I want to get your background, tell our audience about you. >>Uh, I'm Alec Malek, I'm senior director of business, data architecture, data transformation, and Freddie Mac. Uh, I'm responsible for the overall business data architecture and transformation of the existing data onto the cloud data lake. Uh, my team is responsible for the Kleberg platform and the business analysts that are using and maintaining the data in Libra and also driving the data architecture in close collaboration with our engineering teams. My background is I'm a engineer at heart. I still do a lot of development. This is my first time as of crossing over onto the bridge onto business side of maintaining data and working with data teams. >>Jan, let's talk about digital transformation. Freddie Mac is a 50 year old and growing company. I always love talking with established businesses about digital transformation. It's pretty challenging. Talk to me about your initial plan and what some of the main challenges were that you were looking to solve. >>Uh, great question, Lisa, and, uh, it's definitely pertinent as you say, in our digital world or figuring out how we need to accomplish it. If I look at our data, modernization is it is a major program and, uh, effort, uh, in, in our, in our division, what started as a reducing cost or looking at an infrastructure play, moving from physical data assets to the cloud, as well as enhancing our resiliency as quickly morphed into meeting business demand and objectives, whether it be for sourcing, servicing or securitization of our loan products. So where are we as we think about creating this digital data marketplace, we are, we are basically forming, empowering a new data ecosystem, which Columbia is definitely playing a major role. It's more than just a cloud native data lake, but it's bringing in some of our current assets and capabilities into this new data landscape. >>So as we think about creating an information hub, part of the challenges, as you say, 50 years of having millions of loans and millions of data across multiple assets, it's frigging out that you still have to care and feed legacy while you're building the new highway and figuring out how you best have to transform and translate and move data and assets to this new platform. What we've been striving for is looking at what is the business demand or what is the business use case, and what's the value to help prioritize that transformation. Exciting part is, as you think about new uses of acquiring and distribution of data, as well as news new use cases for prescriptive and predictive analytics, the power of what we're building in our daily, this new data ecosystem, we're feeling comfortable, we'll meet the business demand, but as any CTO will tell you demand is always, uh, outpaces our capacity. And that's why we want to be very diligent in terms of our execution plan. So we're very excited as to what we've accomplished so far this year and looking forward as we offered a remainder year. And as you go into 2022. Excellent, >>Thanks JAG. Uh, two books go to you. As I mentioned in the intro of that Freddie Mac has won the Culebra excellence award for data program of the year. Again, congratulations on that, but I'd love to understand the Kleber center of excellence that you're building at Freddie Mac. First of all, define what a center of excellence is to Freddie Mac and then what you're specifically building. Yeah, sure. >>So the Cleaver center of excellence provides us the overall framework from a people and process standpoint to focus in on our use of Colibra and for adopting best practices. Uh, we can have teams that are focused just on developing best practices and implementing workflows and lineage within Collibra and implementing and adopting a number of different aspects of Libra. It provides the central hub of people being domain experts on the tool that can then be leveraged by different groups within the organization to maintain, uh, the tool. >>Put another follow on question a T for you. How does Freddie Mac define, uh, dated citizens as anybody in finance or sales or marketing or operations? What does that definition of data citizen? >>It's really everyone it's within the organization. They all consume data in different ways and we provide a way of governing data and for them to get a better understanding of data from Collibra itself. So it's really everyone within the organization that way. >>Excellent. Okay. Let's go over to you a big topic at data citizens. 21 is collaboration. That's probably a word that we used a ton in the last 15 plus months or so it was every business really pivoted quickly to figure out how do we best collaborate. But something that you talked about in your intro is being the bridge between business and it, I want to understand from your perspective, how can data teams help to drive improved collaboration between business and it, >>The collaboration between business and technology have been a key focus area for us over the last few years, we actually started an agile transformation journey two years ago that we called modern delivery. And that was about moving away from project teams to persistent product teams that brought business and technology together. And we've really been able to pioneer that in the data space within Freddie Mac, where we have now teams with product owners coming from the data team and then full stack ID developers with them creating these combined teams to meet the business needs. We found that bringing these teams together really remove the barriers that were there in the interaction and the employee satisfaction has been high. And like you said, over the last 16 months with the pandemic, we've actually seen the productivity stay same or even go up because the teams were all working together, they work as a unit and they all have the sense of ownership versus working on a project that has a finite end date to fail. So we've, um, you know, we've been really lucky with having started this two years ago. Well, and >>That's great. And congratulations about either maintaining productivity or having it go up during the last 16 months, which had been incredibly challenging. Jack. I want to ask you what does winning this award from Collibra what does this mean to you and your team and does this signify that you're really establishing a data first culture? >>Great question, Lisa again. Um, I think winning the award, uh, just from a team standpoint, it's a great honor. Uh, Kleber has been a fantastic partner. And when I think about the journey of going from spread sheets, right, that all of us had in the past to now having all our business class returns lineage, and really being at the forefront of our data monetization. So as we think about moving to the cloud Beliebers step in step with us in terms of our integral part of that holistic delivery model, when I ultimately, as a CDO, it's really the team's honor and effort, cause this has been a multi-year journey to get here. And it's great that Libra as a, as a partner has helped us achieve some of these goals, but also recognized, um, where we are in terms of, uh, as looking at data as a product and some of our, um, leading forefront and using that holistic delivery, uh, to, uh, to meet our business objectives. So overall poorly jazzed when, uh, we've been found that we wanted the data program here at Collibra and very honored, um, uh, to, to win this award. That's >>Where we got to bring back I'm jazzed. I liked that jug sticking with you, let's unpack a little bit, some of those positive results, those business outcomes that you've seen so far from the data program. What are those? >>Yeah. So again, if you were thinking about a traditional CDO model, what were the terms that would have been used few years ago? It was around governance and may have been viewed as an oversight. Um, maybe less talking, um, monetization of what it was, the business values that you needed to accomplish collectively. It's really those three building blocks managing content. You got to trust the source, but ultimately it's empowering the business. So the best success that I could say at Freddy, as you're moving to this digital world, it's really empowering the business to figure out the new capabilities and demand and objectives that we're meeting. We're not going to be able to transform the mortgage industry. We're not going to be able or any, any industry, if we're still stuck in old world thinking, and ultimately data is going to be the blood that has to enable those capabilities. >>So if you tell me the business best success, we're no longer talking a okay, I got my data governance, what do we have to do? It's all embedded together. And as I alluded to that partnership between business and it informing that data is a product where you now you're delivering capabilities holistically from program teams all across data. It's no longer an afterthought. As I said, a few minutes ago, you're able to then meet the demand what's current. And how do we want to think about going forward? So it's no longer buzzwords of digital data marketplace. What is the value of that? And that's what the success, I think if our group collectively working across the organization, it's just not one team it's across the organization. Um, and we have our partners, our operations, everyone from business owners, all swimming in the same direction with, and I would say critical management support. So top of the house, our, our head of business, my, my boss was the COO full supportive in terms of how we're trying to execute and I've makes us, um, it's critical because when there is a potential, trade-offs, we're all looking at it collectively as an organization, >>Right. And that's the best viewpoint to have is that sort of centralized unified vision. And as you say, JAG, the support from, from up top, uh, I'd see if I want to ask you, you establish the Culebra center of excellence. What are you focused on now? >>So we really focused in allowing our users to consume data and understand data and really democratizing data so that they can really get a better understanding of that. So that's a lot of our focus and engaging with Collibra and getting them to start to define things in Colibra law form. That's a lot of focus right now. >>Excellent. Want to stay with you one more question and take that I'm gonna ask to all of you, what are you most excited about a lot of success that you've talked about transforming a legacy institution? What are you most excited about and what are the next steps for the data program? Uh, teak what's are your thoughts? >>Yeah, so really modernizing onto, uh, onto a cloud data lake and allowing all of the users and, uh, Freddie Mac to consume data with the level of governance that we need around. It is a exciting proposition for me. >>What would you say is most exciting to you? >>I'm really looking forward to the opportunities that artificial intelligence has to offer, not just in the augmented analytics space, but in the overall data management life cycle. There's still a lot of things that are manual in the data management space. And, uh, I personally believe, uh, artificial intelligence has a huge role to play there. And Jackson >>Question to you, it seems like you have a really strong collaborative team. You have a very collaborative relationship with management and with Collibra, what are you excited about? What's coming down the pipe. >>So Lisa, if I look at it, you know, we sit back here June, 2021, where were we a year ago? And you think about a lot of the capabilities and some of the advancements that we may just in a year sitting virtually using that word jazzed or induced or feeling really great about. We made a lot of accomplishments. I'm excited what we're going to be doing for the next year. So there's other use cases, and I could talk about AIML and OCHA talks about, you know, our new ecosystem. Seeing those use cases come to fruition so that we're, we are contributing to value from a business standpoint. The organization is what really keeps me up. Uh, keeps me up at night. It gets me up in the morning and I'm really feeling dues for the entire division. Excellent. >>Well, thank you. I want to thank all three of you for joining me today. Talking about the successes that Freddie Mac has had transforming in partnership with Colibra again, congratulations on the Culebra excellence award for the data program. It's been a pleasure talking to all three of you. I'm Lisa Martin. You're watching the cubes coverage of Collibra data citizens 21.

Published Date : Jun 17 2021

SUMMARY :

21 brought to you by Colibra Welcome to the cubes coverage of Collibra data citizens 21. Good to have you on the program. but I'd love to get familiar with the 3d Jack. has reinforced the power of data and how, how it's so critical to And tell me a little bit about your background, your role, what you're doing at Freddie to solve real problems and, and really lead, um, you know, businesses to the next stage of, We're going to talk about that in just a minute, but I want to get your background, tell our audience about you. Uh, I'm responsible for the overall business data architecture and transformation Talk to me about your initial plan and what some of the main challenges were that Uh, great question, Lisa, and, uh, it's definitely pertinent as you say, building the new highway and figuring out how you best have to transform and translate As I mentioned in the intro of that Freddie Mac has won So the Cleaver center of excellence provides us the overall framework from a people What does that definition of data citizen? So it's really everyone within the organization is being the bridge between business and it, I want to understand from your perspective, over the last 16 months with the pandemic, we've actually seen the productivity this award from Collibra what does this mean to you and your team and the past to now having all our business class returns lineage, I liked that jug sticking with you, let's unpack a little bit, it's really empowering the business to figure out the new capabilities and demand and objectives that we're meeting. And as I alluded to And as you say, JAG, the support from, from up top, uh, I'd see if I want to ask you, So that's a lot of our focus and engaging with Collibra and getting them to Want to stay with you one more question and take that I'm gonna ask to all of you, what are you most excited all of the users and, uh, Freddie Mac to consume data with the I'm really looking forward to the opportunities that artificial intelligence has to offer, with Collibra, what are you excited about? So Lisa, if I look at it, you know, we sit back here June, 2021, where were we a year ago? congratulations on the Culebra excellence award for the data program.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Lisa MartinPERSON

0.99+

Atif MalikPERSON

0.99+

LisaPERSON

0.99+

Alec MalekPERSON

0.99+

June, 2021DATE

0.99+

BostonLOCATION

0.99+

Ankit GoelPERSON

0.99+

New YorkLOCATION

0.99+

JackPERSON

0.99+

Freddie MacORGANIZATION

0.99+

50 yearsQUANTITY

0.99+

ArvindPERSON

0.99+

Aravind JagannathanPERSON

0.99+

JAGPERSON

0.99+

CollibraORGANIZATION

0.99+

2022DATE

0.99+

KayPERSON

0.99+

JacksonPERSON

0.99+

two booksQUANTITY

0.99+

MattPERSON

0.99+

Northern Virginia DCLOCATION

0.99+

FreddieORGANIZATION

0.99+

Northern VirginiaLOCATION

0.99+

three guestsQUANTITY

0.99+

todayDATE

0.99+

next yearDATE

0.99+

two years agoDATE

0.99+

a year agoDATE

0.98+

ColibraTITLE

0.98+

first timeQUANTITY

0.98+

this yearDATE

0.97+

FreddyORGANIZATION

0.97+

pandemicEVENT

0.97+

OCHAORGANIZATION

0.97+

threeQUANTITY

0.97+

three building blocksQUANTITY

0.97+

KleberORGANIZATION

0.96+

CDOORGANIZATION

0.96+

FreddyPERSON

0.94+

last 16 monthsDATE

0.94+

MacORGANIZATION

0.94+

ColibraORGANIZATION

0.93+

one more questionQUANTITY

0.93+

FirstQUANTITY

0.93+

50 year oldQUANTITY

0.92+

KleberPERSON

0.91+

millions of dataQUANTITY

0.9+

millions of loansQUANTITY

0.9+

singleQUANTITY

0.89+

few years agoDATE

0.89+

AIMLORGANIZATION

0.86+

Culebra excellence awardTITLE

0.85+

CleaverPERSON

0.83+

one teamQUANTITY

0.83+

few minutes agoDATE

0.82+

Freddie MacORGANIZATION

0.81+

3dQUANTITY

0.81+

CulebraORGANIZATION

0.8+

LibraTITLE

0.8+

ULOCATION

0.8+

last six yearsDATE

0.78+

GarlandORGANIZATION

0.78+

ColumbiaLOCATION

0.74+

MalikPERSON

0.74+

KlebergORGANIZATION

0.73+

LibraORGANIZATION

0.72+

Cindi Howson, ThoughtSpot | Thought.Leaders Digital 2020


 

>>So we're going to take a hard pivot now and go from football to Ternopil Chernobyl. What went wrong? 1986, as the reactors were melting down, they had the data to say, this is going to be catastrophic. And yet the culture said, no, we're perfect. Hide it. Don't dare tell anyone which meant they went ahead and had celebrations in Kiev. Even though that increased the exposure, the additional thousands, getting cancer and 20,000 years before the ground around there and even be inhabited again, this is how powerful and detrimental a negative culture, a culture that is unable to confront the brutal facts that hides data. This is what we have to contend with, and this is why I want you to focus on having fostering a data driven culture. I don't want you to be a laggard. I want you to be a leader in using data to drive your digital transformation. >>So I'll talk about culture and technology. Isn't really two sides of the same coin, real world impacts, and then some best practices you can use to disrupt and innovate your culture. Now, oftentimes I would talk about culture and I talk about technology. And recently a CDO said to me, you know, Cindy, I actually think this is two sides of the same coin. One reflects the other. What do you think? Let me walk you through this. So let's take a laggard. What is the technology look like? Is it based on 1990s BI and reporting largely parameterized reports on premises, data, warehouses, or not even that operational reports at best one enterprise data warehouse, very slow moving and collaboration is only email. What does that culture tell you? Maybe there's a lack of leadership to change, to do the hard work that Sudheesh referred to, or is there also a culture of fear, afraid of failure, resistance to change complacency. >>And sometimes that complacency it's not because people are lazy. It's because they've been so beaten down every time a new idea is presented. It's like, no we're measured least cost to serve. So ticks and distrust there it's between business and it or individual stakeholders is the norm. So data is hoarded. Let's contrast that with a leader, a data and analytics leader, what is their technology look like? Augmented analytics search and AI driven insights, not on premises, but in the cloud and maybe multiple clouds. And the data is not in one place, but it's in a data Lake and in a data warehouse, a logical data warehouse, the collaboration is via newer methods, whether it's Slack or teams allowing for that real time decisioning or investigating a particular data point. So what is the culture in the leaders? It's transparent and trust. There is a trust that data will not be used to punish. >>There is an ability to confront the bad news. It's innovation, valuing innovation in pursuit of the company goals, whether it's the best fan experience and player safety in the NFL or best serving your customers. It's innovative and collaborative. There's none of this. Oh, well, I didn't invent that. I'm not going to look at that. There's still pride of ownership, but it's collaborating to get to a better place faster. And people feel empowered to present new ideas, to fail fast. And they're energized knowing that they're using the best technology and innovating at the pace that business requires. So data is democratized and democratized, not just for power users or analysts, but really at the point of impact what we like to call the new decision makers or really the frontline workers. So Harvard business review partnered with us to develop this study to say, just how important is this? >>They've been working at BI and analytics as an industry for more than 20 years. Why is it not at the front lines? Whether it's a doctor, a nurse, a coach, a supply chain manager, a warehouse manager, a financial services advisor, 87% said they would be more successful if frontline workers were empowered with data driven insights, but they recognize they need new technology to be able to do that. It's not about learning hard tools. The sad reality only 20% of organizations are actually doing this. These are the data-driven leaders. So this is the culture and technology. How did we get here? It's because state of the art keeps changing. So the first generation BI and analytics platforms were deployed on premises on small datasets, really just taking data out of ERP systems that were also on premises and state of the art was maybe getting a management report, an operational report over time, visual based data discovery vendors disrupted these traditional BI vendors, empowering now analysts to create visualizations with the flexibility on a desktop, sometimes larger data sometimes coming from a data warehouse, the current state of the art though, Gartner calls it augmented analytics at ThoughtSpot, we call it search and AI driven analytics. >>And this was pioneered for large scale data sets, whether it's on premises or leveraging the cloud data warehouses. And I think this is an important point. Oftentimes you, the data and analytics leaders will look at these two components separately, but you have to look at the BI and analytics tier in lockstep with your data architectures to really get to the granular insights and to leverage the capabilities of AI. Now, if you've never seen ThoughtSpot, I'll just show you what this looks like. Instead of somebody's hard coding of report, it's typing in search keywords and very robust keywords contains rank top bottom, getting to a visual visualization that then can be pinned to an existing Pinboard that might also contain insights generated by an AI engine. So it's easy enough for that new decision maker, the business user, the non analyst to create themselves modernizing the data and analytics portfolio is hard because the pace of change has accelerated. >>You used to be able to create an investment place. A bet for maybe 10 years, a few years ago, that time horizon was five years now, it's maybe three years and the time to maturity has also accelerated. So you have these different, the search and AI tier the data science, tier data preparation and virtualization. But I would also say equally important is the cloud data warehouse and pay attention to how well these analytics tools can unlock the value in these cloud data warehouses. So thoughts about was the first to market with search and AI driven insights, competitors have followed suit, but be careful if you look at products like power BI or SAP analytics cloud, they might demo well, but do they let you get to all the data without moving it in products like snowflake, Amazon Redshift, or, or Azure synapse or Google big query, they do not. >>They re require you to move it into a smaller in memory engine. So it's important how well these new products inter operate the pace of change. It's acceleration Gartner recently predicted that by 2020 to 65% of analytical queries will be generated using search or NLP or even AI. And that is roughly three times the prediction they had just a couple years ago. So let's talk about the real world impact of culture. And if you read any of my books or used any of the maturity models out there, whether the Gardner it score that I worked on, or the data warehousing Institute also has the maturity model. We talk about these five pillars to really become data driven. As Michelle spoke about it's focusing on the business outcomes, leveraging all the data, including new data sources, it's the talent, the people, the technology, and also the processes. >>And often when I would talk about the people in the talent, I would lump the culture as part of that. But in the last year, as I've traveled the world and done these digital events for thought leaders, you have told me now culture is absolutely so important. And so we've pulled it out as a separate pillar. And in fact, in polls that we've done in these events, look at how much more important culture is as a barrier to becoming data driven. It's three times as important as any of these other pillars. That's how critical it is. And let's take an example of where you can have great, but if you don't have the right culture, there's devastating impacts. And I will say I have been a loyal customer of Wells Fargo for more than 20 years, but look at what happened in the face of negative news with data, it said, Hey, we're not doing good cross selling customers do not have both a checking account and a credit card and a savings account and a mortgage. >>The opened fake accounts, basing billions in fines, change in leadership that even the CEO attributed to a toxic sales culture, and they're trying to fix this. But even recently there's been additional employee backlash saying the culture has not changed. Let's contrast that with some positive effects, samples, Medtronic, a worldwide company in 150 countries around the world. They may not be a household name to you, but if you have a loved one or yourself, you have a pacemaker spinal implant diabetes, you know, this brand and at the start of COVID when they knew their business would be slowing down, because hospitals would only be able to take care of COVID patients. They took the bold move of making their IP for ventilators publicly available. That is the power of a positive culture or Verizon, a major telecom organization looking at late payments of their customers. And even though the us federal government said, well, you can't turn them off. >>He said, we'll extend that even beyond the mandated guidelines and facing a slow down in the business because of the tough economy, he said, you know what? We will spend the time upskilling our people, giving them the time to learn more about the future of work, the skills and data and analytics for 20,000 of their employees, rather than furloughing them. That is the power of a positive culture. So how can you transform your culture to the best in class? I'll give you three suggestions, bring in a change agent, identify the relevance, or I like to call it with them and organize for collaboration. So the CDO, whatever your title is, chief analytics, officer chief, digital officer, you are the most important change agent. And this is where you will hear that. Oftentimes a change agent has to come from outside organization. So this is where, for example, in Europe, you have the CDO of just eat a takeout food delivery organization coming from the airline industry or in Australia, national Australian bank, taking a CDO within the same sector from TD bank going to NAB. >>So these change agents come in disrupt. It's a hard job. As one of you said to me, it often feels like Sisyphus. I make one step forward and I get knocked down again. I get pushed back. It is not for the faint of heart, but it's the most important part of your job. The other thing I'll talk about is with them, what is in it for me? And this is really about understanding the motivation, the relevance that data has for everyone on the frontline, as well as those analysts, as well as the executives. So if we're talking about players in the NFL, they want to perform better and they want to stay safe. That is why data matters to them. If we're talking about financial services, this may be a wealth management advisor, okay. We could say commissions, but it's really helping people have their dreams come true, whether it's putting their children through college or being able to retire without having to work multiple jobs still into your seventies or eighties for the teachers, teachers, you ask them about data. They'll say we don't, we don't need that. I care about the student. So if you can use data to help a student perform better, that is with them. And sometimes we spend so much time talking the technology, we forget what is the value we're trying to deliver with it? And we forget the impact on the people that it does require change. In fact, the Harvard business review study found that 44% said lack of change. Management is the biggest barrier to leveraging both new technology, but also being empowered to act on those data driven insights. >>The third point organize for collaboration. This does require diversity of thought, but also bringing the technology, the data and the business people together. Now there's not a single one size fits all model for data and analytics. At one point in time, even having a BICC a BI competency center was considered state of the art. Now for the biggest impact, what I recommend is that you have a federated model centralized for economies of scale. That could be the common data, but then in bed, these evangelists, these analysts of the future within every business unit, every functional domain. And as you see this top bar, all models are possible, but the hybrid model has the most impact the most leaders. So as we look ahead said to the months ahead to the year ahead and exciting time, because data is helping organizations better navigate a tough economy, lock in the customer loyalty. And I look forward to seeing how you foster that culture. That's collaborative with empathy and bring the best of technology, leveraging the cloud, all your data. So thank you for joining us at thoughtless.

Published Date : Oct 16 2020

SUMMARY :

and this is why I want you to focus on having fostering a CDO said to me, you know, Cindy, I actually think this And the data is not in one place, analysts, but really at the point of impact what Why is it not at the front lines? So it's easy enough for that new decision maker, the business user, So you have these different, the So let's talk about the real world impact of And let's take an example of where you can have great, in fines, change in leadership that even the CEO agent, identify the relevance, or I like to call it with them and organize Management is the biggest barrier to of technology, leveraging the cloud, all your data.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MichellePERSON

0.99+

VerizonORGANIZATION

0.99+

EuropeLOCATION

0.99+

AustraliaLOCATION

0.99+

Cindi HowsonPERSON

0.99+

Wells FargoORGANIZATION

0.99+

two sidesQUANTITY

0.99+

1986DATE

0.99+

GartnerORGANIZATION

0.99+

three yearsQUANTITY

0.99+

CindyPERSON

0.99+

1990sDATE

0.99+

87%QUANTITY

0.99+

20,000QUANTITY

0.99+

KievLOCATION

0.99+

MedtronicORGANIZATION

0.99+

44%QUANTITY

0.99+

TD bankORGANIZATION

0.99+

last yearDATE

0.99+

65%QUANTITY

0.99+

20%QUANTITY

0.99+

more than 20 yearsQUANTITY

0.99+

2020DATE

0.99+

five yearsQUANTITY

0.99+

150 countriesQUANTITY

0.99+

ThoughtSpotORGANIZATION

0.99+

COVIDOTHER

0.99+

bothQUANTITY

0.99+

20,000 yearsQUANTITY

0.99+

HarvardORGANIZATION

0.99+

third pointQUANTITY

0.99+

five pillarsQUANTITY

0.99+

one pointQUANTITY

0.99+

seventiesQUANTITY

0.98+

three suggestionsQUANTITY

0.98+

SudheeshPERSON

0.98+

more than 20 yearsQUANTITY

0.98+

first generationQUANTITY

0.98+

billionsQUANTITY

0.98+

SisyphusPERSON

0.98+

firstQUANTITY

0.98+

three timesQUANTITY

0.98+

10 yearsQUANTITY

0.98+

OneQUANTITY

0.98+

thousandsQUANTITY

0.97+

oneQUANTITY

0.97+

two componentsQUANTITY

0.97+

one placeQUANTITY

0.97+

AmazonORGANIZATION

0.96+

eightiesQUANTITY

0.96+

singleQUANTITY

0.96+

GoogleORGANIZATION

0.95+

couple years agoDATE

0.93+

few years agoDATE

0.89+

Azure synapseTITLE

0.86+

Ternopil ChernobylLOCATION

0.84+

NFLEVENT

0.78+

GardnerPERSON

0.75+

one stepQUANTITY

0.74+

CDOTITLE

0.66+

data warehousing InstituteORGANIZATION

0.65+

snowflakeTITLE

0.64+

SAPORGANIZATION

0.61+

NABLOCATION

0.61+

employeesQUANTITY

0.57+

SlackTITLE

0.53+

Thought.Leaders DigitalORGANIZATION

0.51+

RedshiftTITLE

0.5+

AustralianOTHER

0.48+

Ajay Vohora and Ved Sen | SmartData Marketplaces


 

>> Narrator: From around the globe, it's "theCUBE" with digital coverage of Smart Data Marketplaces brought to you by Io-Tahoe. >> We're back. We're talking about smart data and have been for several weeks now. Really it's all about injecting intelligence and automation into the data life cycle of the data pipeline. And today we're drilling into Smart Data Marketplaces, really trying to get to that self-serve, unified, trusted, secured, and compliant data models. And this is not trivial. And with me to talk about some of the nuances involved in actually getting there with folks that have experienced doing that. They'd send a series of digital evangelist with Tata Consultancy Services, TCS. And Ajay Vohora is back, he's the CEO of Io-Tahoe. Guys, great to see you, thanks so much for coming on. >> Good to see you, Dave. >> Hey Dave. >> Ajay, let's start with you. Let's set up the sort of smart data concept. What's that all about? What's your perspective? >> Yeah, so I mean, our way of thinking about this is you you've got data, it has latent value, and it's really about discovering what the properties of that data. Does it have value? Can you put that data to work? And the way we go about that with algorithms and machine learning, to generate signals in that data identified patterns, that means we can start to discover how can we apply that data to down stream? What value can we unlock for a customer and business? >> Well, so you've been on this, I mean, really like a laser, why? I mean, why this issue? Did you see a gap in the marketplace in terms of talking to customers and maybe you can help us understand the origin? >> Yeah, I think that the gap has always been there. They've been, it's become more apparent over recent times with big data. So the ability to manually work with volumes of data in petabytes is prohibitively complex and expensive. So you need the different routes, you need different set of tools and methods to do that. Metadata are data that you can understand about data. That's what we at Io-Tahoe focus on, discovering and generating that metadata. That ready, that analogy to automate those data ops processes. So the gap David, is being felt by a business owner prizes and all sectors, healthcare, telecoms, and putting that data to work. >> So Ved, Let's talk a little bit about your role. You work with a lot of customers. I see you as an individual in a company who's really trying to transform what is a very challenging industry. That's sort of ripe for transformation, but maybe you could give us your perspective on this, what kind of signals you're looking for from the data pipeline and we'll get into how you are helping transform healthcare? >> Thanks, David. You know I think this year has been one of those years where we've all realized about this idea of unknown unknowns, where something comes around the corner that you're completely not expecting. And that's really hard to plan for obviously. And I think what we need is the ability to find early signals and be able to act on things as soon as you can. Sometimes, and you know, the COVID-19 scenario of course, is hopefully once in a generation thing, but most businesses struggle with the idea that they may have the data there in their systems, but they still don't know which bit of that is really valuable and what are the signals they should be watching for. And I think the interesting thing here is the ability for us to extract from a massive data, the most critical and important signals. And I think that's where we want to focus on. >> And so, talk a little bit about healthcare in particular and sort of your role there, and maybe at a high level. How Tata and your eco-system are helping transform healthcare? >> So if you look at healthcare, you've got the bit where people need active intervention from a medical professional. And then you've got this larger body of people, typically elderly people who aren't unwell, but they have frailties. They have underlying conditions and they're very vulnerable, especially in the world that we're in now in the post-COVID-19 scenario. And what we were trying to look at is how do we keep people who are elderly, frail and vulnerable? How can we keep them safe in their own homes rather than moving to care homes, where there has been an incredibly high level of infection for things like COVID-19. So the world works better if you can keep people safe in their own homes, if you can see the slide we've got. We're also talking about a world where care is expensive. In most Western countries, especially in Western Europe, the number of elderly people is increasing as a percentage of the population, quite significantly, and resources just are not keeping up. We don't have enough people. We don't have enough funding to look after them effectively. And the care industry that used to do that job has been struggling of late. So it's kind of a perfect storm for the need for technology intervention there. And in that space, what we're saying is the data signal that we want to receive are exactly what as a relative, or a son or daughter you might want from a parent to say, "Everything's okay. "We know that today's been just like every other day "there are no anomalies in your daily living." If you could get the signals that might tell us that something's wrong, something not quite right. We don't need very complex diagnostics. We just need to know something's not quite right, that my dad hasn't woken up as has always at seven o'clock, but till nine o'clock there's no movement. Maybe he's a bit unwell. It's that kind of signal that if we can generate, can make a dramatic difference to how we can look out for these people, whether through professional carers or through family members. So what we're looking to do is to sensor-enable homes of vulnerable people so that those data signals can come through to us in a curated manner, in a way that protects privacy and security of the individual, but gives the right people, which is carers or chosen family members the access to the signals, which is alerts that might tell you there was too much movement at night, or the front door was been left open, things like that that would give you a reason to call him and check. Everybody has spoken to in this always has an example of an uncle or a relative or parent that they've looked after. And all they're looking for is a signal. Even stories like my father's neighbor calls me when he doesn't open his curtain by 11 o'clock, that actually, if you think about it is a data signal that something might be all right. And I think what we're trying to do with technology is create those kinds of data signals because ultimately, the healthcare system works much better if you can prevent rather than cure. So every dollar that you put into prevention saves maybe $3 to $5 downstream. The economic summit also are working our favor. >> And those signals give family members the confidence to act. Ajay, it is interesting to hear what Ved was talking about in terms of the unknowns, because when you think about the early days of the computer industry, there were a lot of knowns, the processes were known. It was like the technology was the big mystery. Now, I feel like it's flipped. We've certainly seen that with COVID. The technology is actually quite well understood and quite mature and reliable. One of the examples is automated data discovery, which is something that you guys have been been focused on at Io-Tahoe. Why is automated data discovery such an important component of a smart data life cycle? >> Yeah. I mean, if we look David at the schematic and this one moves from left to right where right at the outset with that latent data, the value is late because you don't know. Does it have? Can it be applied? Can that data be put to work or not? And the objective really is about driving some form of exchange or monetization of data. If you think about it in insurance or healthcare, you've got lots of different parties, providers, payers, patients, everybody's looking to make some kind of an exchange of information. The difficulty is in all of those organizations, that data sits within its own system. So data discovery, if we drill into the focus itself that, it's about understanding which data has value, classifying that data so that it can be applied and being able to tag it so that it can then be put to use it's the real enabler for DataOps. >> So maybe talk a little bit more about this. We're trying to get to self-service. It's something that we hear a lot about. You mentioned putting data to work. It seems to me that if the business can have access to that data and serve themselves, that's the way to put data to work. Do you have thoughts on that? >> Yeah, I mean, thinking back in terms of what IT and the IT function in a business could provide, there have been limitations around infrastructure, around scaling, around compute. Now that we're in an economy that is digital driven by API's your infrastructure, your data, your business rules, your intelligence, your models, all of those on the back of an API. So the options become limitless. How you can drive value and exchange that data. What that allows us to do is to be more creative, if we can understand what data has value for what use case. >> Ved, Let's talk a little bit about the US healthcare system. It's a good use case. I was recently at a chief data officer conference and listening to the CDO of Johns Hopkins, talk about the multiple different formats that they had to ingest to create that COVID map. They even had some PDFs, they had different definitions, and that's sort of underscored to me, the state of the US healthcare industry. I'm not as familiar with the UK and Europe generally, but I am familiar with the US healthcare system and the diversity that's there, the duplication of information and the like, maybe you could sort of summarize your perspectives and give us kind of the before and your vision of the after, if you will? >> The use of course, is particularly large and complex system. We all know that. We also know, I think there is some research that suggests that in the US the per-capita spend on healthcare is among the highest in the world. I think it's like 70%, and that compares to what just under 9%, which is going to be European, typical European figure. So it's almost double of that, but the outcomes are still vastly poor. When Ajay and I were talking earlier, I think we believe that there is a concept of a data friction. When you've got multiple players in an eco-system, trying to provide a single service as a patient, you're receiving a single health care service. There are probably a dozen up to 20 different organizations that have to collaborate to make sure you get that top of the line health care service. That kind of investment deserves. And what prevents it from happening very often is what we would call data friction, which is the ability to effectively share data. Something as simple as a healthcare record, which says, "This is Dave, this is Ved, this is Ajay." And when we go to hospital for anything, whatever happens, that healthcare record can capture all the information and tie to us as an individual. And if you go to a different hospital, then that record will follow you. This is how you would expect that to be implemented, but I think we're still on that journey. There are lots and lots of challenges. I've seen anecdotal data around people who suffered because they weren't carrying a card when they went into hospital, because that card has the critical elements of data, but in today's world, should you need to carry a piece of paper or can the entire thing be a digital data flow that can easily be, can certainly navigate through lack of paper and those kinds of things. So the vision that I think we need to be looking at is an effective data exchange or marketplace back with a kind of a backbone model where people agree and sign off a data standard, where each individual's data is always tied to the individual. So if you were to move States, if you would move providers, change insurance companies, none of that would impact your medical history, your data, and the ability to have the other care and medical professionals to access the data at the point of need and at the point of healthcare delivery. So I think that's the vision we're looking at, but as you rightly you said that there are enormous number of challenges, partly because of the history, of healthcare, I think it was technology enablement of healthcare started early. So there's a lot of legacy as well. So we shouldn't trivialize the challenges that the industry faces, but that I think is the way we want to go. >> Well, privacy is obviously a huge one, and a lot of the processes are built around non-digital processes and what you're describing as a flip for digital first. I mean, as a consumer, as a patient, I want an app for that. So I can see my own data. I can see price, price transparency, give access to people that I think need it. And that is a daunting task, isn't it? >> Absolutely. And I think the implicit idea and what you just said, which is very powerful is also on the app you want to control. >> Yes. >> And sometimes you want to be able to change access on data at that point. Right now, I'm at the hospital. I would like to access my data. And when I walk away or maybe three days later, I want to revoke that access. It's that level of control. And absolutely, it is by no means a trivial problem, but I think that's where you need the data automation tools. If you try to do any of this manually, we'd be here for another decade trying to solve this, but that's where tools like Io-Tahoe come in because to do this, a lot of the heavy lifting behind the scenes has to be automated. There has to be a machine churning that and presenting the simpler options. And I know you were talking about it just a little while ago Ajay. I was reminded of the example of a McDonald's or a Coke, because the sales store idea that you can go in and you can do your own ordering off a menu, or you can go in and select five different flavors from a Coke machine and choose your own particular blend of Coke. It's a very trivial example, but I think that's the word we want to get to with access of data as well. If it was that simple for consumers, for enterprise, business people, for doctors, then that's where we ultimately want to be able to arrive. But of course, to make something very simple for the end-user, somebody has to solve for complexity behind the scenes. >> So Ajay, it seems to me Ajay there're two major outcomes here. One is of course, the most important I guess, is patient outcomes, and the other is cost. I mean, they talked about the cost issues, we all, US especially understand the concerns about rising costs of healthcare. My question is this, how does a Smart Data Marketplace fit into achieving those two very important outcomes? >> When we think about how automation is enabling that, where we've got different data formats, the manual tasks are involved, duplication of information. The administrative overhead of that alone and the work, the rework, and the cycles of work that generates. That's really what we're trying to help with data is to eliminate that wasted effort. And with that wasted effort comes time and money to employ people to work through those siloed systems. So getting to the point where there is an exchange in a marketplace just as they would be for banking or insurance is really about automating the classification of data to make it available to a system that can pick it up through an API and to run a machine learning model and to manage a workflow, a process. >> Right, so you mentioned backing insurance, you're right. I mean, we've actually come a long way and just in terms of, know the customer and applying that to know the patient would be very powerful. I'm interested in what you guys are doing together, just in terms of your vision. Are you going to market together, kind of what you're seeing in terms of promoting or enabling this self-service, self-care. Maybe you could talk a little bit about Io-Tahoe and Tata, the intersection at the customer? >> Sure. I think we've been very impressed with the TCS vision of 4.0, how the re-imagining traditional industries, whether it's insurance, banking, healthcare, and bringing together automation, agile processes, robotics, AI, and once those enablers, technology may have brought together to re-imagine how those services can be delivered digitally. All of those are dependent on data. So we see that there's a really good fit here to enable understanding the legacy, the historic situation that has built up over time in an organization, a business and to help shine a light on what's meaningful in that to migrate to the cloud or to drive a digital twin, data science project. >> Ved, anything you can add to that? >> Sure. I mean, we do take the business 4.0 model quite seriously in terms of a lens with which you look at any industry, and what I talked about in healthcare was an example of that. And for us business 4.0, means a few very specific things. The technology that we use in today's verse should be agile, automated, intelligent, and cloud-based. These have become kind of hygiene factors now. On top of that, the businesses we build should be mass customized. They should be risk embracing. They should engage ecosystems, and they should strive for exponential value, not 10% growth year on year, but doubling, tripling every three, four years, because that's the competition that most businesses are facing today. And within that, the Tata group itself, is an extremely purpose-driven business. We really believe that we exist to serve communities, not just one specific set, i.e. shareholders, but the broader community in which we live and work. And I think this framework also allows us to apply that to things like healthcare, to education and to a whole vast range of areas where, everybody has a vision of using data science or doing really clever stuff at the gradients. But what becomes clear is, to do any of that, the first thing you need is a foundational piece. And as a foundation isn't right, then no matter how much you invest in the data science tools you won't get the answers you want. And the work we're doing with the Io-Tahoe really, for me, is particularly exciting because it sorts out that foundational piece. And at the end of it, to make all of this, again, I will repeat that, to make it simple and easy to use for the end user, whoever that is. And I realized that I'm probably the first person who's used fast food as a shining example for healthcare in this discussion, but you can make a lot of different examples. And today, if you press a button and start a car, that's simplicity, but someone has solved for that. And that's what we want to do with data as well. >> Yeah, that makes a lot of sense to me. We talk a lot about digital transformation and a digital business, and I would observe that a digital business puts data at the core. And you can certainly be the best example. There is, of course, Google is an all digital business, but take a company like Amazon, Who's got obviously a massive physical component to its business. Data is at the core. And that's exactly my takeaway from this discussion. Both of you are talking about putting data at the core, simplifying it, making sure that it's compliant, and healthcare it's taking longer, 'cause it's such a high risk industry, but it's clearly happening, COVID I guess, was an accelerant. Guys, Ajay, I'll start with you. Any final thoughts that you want to leave the audience with? _ Yeah, we're really pleased to be working with TCS. We've been able to explore how we're able to put dates to work in a range of different industries. Ved has mentioned healthcare, telecoms, banking and insurance are others. And the same impact they speak to whenever we see the exciting digital transformations that are being planned, being able to accelerate those, unlock the value from data is where we're having a purpose. And it's good that we can help patients in the healthcare sector, consumers in banking realize a better experience through having a more joined up marketplace with their data. >> Ved, you know what excites me about this conversation is that, as a patient or as a consumer, if I'm helping loved ones, I can go to the web and I can search, and I can find a myriad of possibilities. What you're envisioning here is really personalizing that with real time data. And that to me is a game changer. Your final thoughts? >> Thanks, David. I absolutely agree with you that the idea of data centricity and simplicity are absolutely forefront, but I think if we were to design an organization today, you might design it very differently to how most companies today are structured. And maybe Google and Amazon are probably better examples of that because you almost have to think of a business as having a data engine room at its core. A lot of businesses are trying to get to that stage, whereas what we call digital natives, are people who have started life with that premise. So I absolutely agree with you on that, but extending that a little bit. If you think of most industries as eco-systems that have to collaborate, then you've got multiple organizations who will also have to exchange data to achieve some shared outcomes. Whether you look at supply chains of automobile manufacturers or insurance companies or healthcares we've been talking about. So I think that's the next level of change we want to be able to make, which is to be able to do this at scale across organizations at industry level or in population scheme for healthcare. >> Yeah, Thank you for that. Go ahead Ajay. >> David that's where it comes back to again, the origination where we've come from in big data. The volume of data combined with the specificity of individualizing, personalizing a service around an individual amongst that massive data from different providers is where is exciting, that we're able to have an impact. >> Well, and you know Ajay, I'm glad you brought that up because in the early days of big data, there were only a handful of companies, the biggest financial institutions. Obviously, the internet giants who had all these engineers that were able to take advantage of it. But with companies like Io-Tahoe and others, and the investments that the industry has made in terms of providing the tools and simplifying that, especially with machine intelligence and AI and machine learning, these are becoming embedded into the tooling so that everybody can have access to them, small, medium, and large companies. That's really, to me, the exciting part of this new era that we're entering. >> Yeah, and we have placed those, take it down to the level of not-for-profits and smaller businesses that want to innovate and leapfrog into, to growing their digital delivery of their service. >> And I know a lot of time, but Ved, what you were saying about TCS's responsibility to society, I think is really, really important. Large companies like yours, I believe, and you clearly do as well, have a responsibility to society more than just a profit. And I think, Big Tech it's a better app in a lot of cases, but so thank you for that and thank you gentlemen for this great discussion. I really appreciate it. >> Thanks David. >> Thank you. >> All right, keep it right there. I'll be right back right after this short break. This is Dave Vellante for theCUBE. (calm music)

Published Date : Sep 17 2020

SUMMARY :

brought to you by Io-Tahoe. of the data pipeline. What's that all about? And the way we go about and putting that data to work. from the data pipeline the ability to find early and sort of your role there, the access to the signals, One of the examples is the value is late because you don't know. that's the way to put data to work. and the IT function in a and listening to the CDO of Johns Hopkins, and that compares to what and a lot of the processes are built also on the app you want behind the scenes has to be automated. One is of course, the of that alone and the work, that to know the patient in that to migrate to the cloud And at the end of it, to make all of this, Yeah, that makes a lot of sense to me. And that to me is a game changer. of that because you almost Yeah, Thank you for that. the origination where we've and the investments that the those, take it down to the level And I know a lot of time, This is Dave Vellante for theCUBE.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

Ajay VohoraPERSON

0.99+

Dave VellantePERSON

0.99+

$3QUANTITY

0.99+

AmazonORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

DavePERSON

0.99+

Tata Consultancy ServicesORGANIZATION

0.99+

Io-TahoeORGANIZATION

0.99+

70%QUANTITY

0.99+

$5QUANTITY

0.99+

AjayPERSON

0.99+

TCSORGANIZATION

0.99+

Western EuropeLOCATION

0.99+

COVID-19OTHER

0.99+

BothQUANTITY

0.99+

VedPERSON

0.99+

seven o'clockDATE

0.99+

11 o'clockDATE

0.99+

twoQUANTITY

0.99+

firstQUANTITY

0.99+

Ved SenPERSON

0.99+

USLOCATION

0.99+

todayDATE

0.98+

singleQUANTITY

0.98+

OneQUANTITY

0.98+

EuropeLOCATION

0.98+

three days laterDATE

0.98+

TataORGANIZATION

0.98+

four yearsQUANTITY

0.98+

CokeORGANIZATION

0.97+

five different flavorsQUANTITY

0.97+

single serviceQUANTITY

0.97+

UKLOCATION

0.97+

Tata groupORGANIZATION

0.95+

under 9%QUANTITY

0.95+

each individualQUANTITY

0.94+

doublingQUANTITY

0.94+

every threeQUANTITY

0.93+

a dozenQUANTITY

0.92+

up to 20 different organizationsQUANTITY

0.92+

WesternLOCATION

0.92+

COVIDTITLE

0.91+

McDonald'sORGANIZATION

0.91+

10% growthQUANTITY

0.9+

nine o'clockDATE

0.9+

oneQUANTITY

0.9+

this yearDATE

0.89+

Johns HopkinsORGANIZATION

0.89+

EuropeanOTHER

0.88+

doubleQUANTITY

0.87+

first personQUANTITY

0.85+

two majorQUANTITY

0.84+

one specific setQUANTITY

0.79+

Ajay Vohora 9 9 V1


 

>>from around the globe. It's the Cube with digital coverage of smart data. Marketplace is brought to You by Io Tahoe Digital transformation is really gone from buzzword to a mandate. Additional businesses, a data business. And for the last several months, we've been working with Iot Tahoe on an ongoing content. Serious, serious, focused on smart data and automation to drive better insights and outcomes, essentially putting data to work. And today we're gonna do a deeper dive on automating data Discovery. And one of the thought leaders in this space is a J ahora who is the CEO of Iot. Tahoe's once again joining Me A J Good to see you. Thanks for coming on. >>A great to be here, David. Thank you. >>So let's start by talking about some of the business realities. And what are the economics that air? That air driving, automated data Discovery? Why is that so important? >>Yeah, and on this one, David, it's It's a number of competing factors we've got. The reality is data which may be sensitive, so this control on three other elements are wanting to drive value from that data. So innovation, you can't really drive a lot of value without exchanging data. So the ability to exchange data and to manage those costs, overheads and data discovery is at the roots of managing that in an automated way to classify that data in sets and policies to put that automation in place. >>Yeah. Okay, look, we have a picture of this. We could bring it up, guys, because I want oh, A j help the audience. Understand? Unaware data Discovery fits in here. This is as we talked about this, a complicated situation for a lot of customers. They got a variety of different tools, and you really laid it out nicely here in this diagram. So take us through. Sort of where that he spits. >>Yeah. I mean, where at the right hand side, This exchange. You know, we're really now in a data driven economy that is, everything's connected through AP, eyes that we consume on mine free mobile relapse. And what's not a parent is the chain of activities and tasks that have to go into serving that data two and eight p. I. At the outset, there may be many legacy systems, technologies, platforms on premise and cloud hybrids. You name it. Andi across those silos. Getting to a unified view is the heavy lifting. I think we've seen Cem some great impacts that be I titles such as Power Bi I tableau looker on DSO on in Clear. Who had Andi there in our ecosystem on visualising Data and CEO's managers, people that are working in companies day to day get a lot of value from saying What's the was the real time activity? What was the trend over this month? First his last month. The tools to enable that you know, we here, Um, a lot of good things are work that we're doing with snowflake mongo db on the public cloud platforms gcpd as your, um, about enabling building those pay planes to feed into those analytics. But what often gets hidden is have you sauce that data that could be locked into a mainframe, a data warehouse? I ot data on DPA, though, that all of that together that is the reality of that is it's it's, um, it's a lot of heavy lifting It z hands on what that, um, can be time consuming on the issue There is that data may have value. It might have potential to have an impact on the on the top line for a business on outcomes for consumers. But you never any sure unless you you've done the investigation discovered it unified that Onda and be able to serve that through to other technologies. >>Guys have. You would bring that picture back up again because A. J, you made a point, and I wanna land on that for a second. There's a lot of manual curating. Ah, an example would be the data catalogue if they decide to complain all the time that they're manually wrangling data. So you're trying to inject automation in the cycle, and then the other piece that I want you to addresses the importance of AP eyes. You really can't do this without an architecture that allows you to connect things together. That sort of enables some of the automation. >>Yeah, I mean, I don't take that in two parts. They would be the AP eyes so virtual machines connected by AP eyes, um, business rules and business logic driven by AP eyes applications. So everything across the stack from infrastructure down to the network um, hardware is all connected through AP eyes and the work of serving data three to an MP I Building these pipelines is is often, um, miscalculated. Just how much manual effort that takes and that manual ever. We've got a nice list here of what we automate down at the bottom. Those tasks of indexing, labeling, mapping across different legacy systems. Um, all of that takes away from the job of a data scientist today to engineer it, looking to produce value monetize data on day two to help their business day to conceive us. >>Yes. So it's that top layer that the business sees, of course, is a lot of work that has to go went into achieving that. I want to talk about some of the key tech trends that you're seeing and one of the things that we talked about a lot of metadata at the importance of metadata. It can't be understated. What are some of the big trends that you're seeing metadata and others? >>Yeah, I'll summarize. It is five. There's trains now, look, a metadata more holistically across the enterprise, and that really makes sense from trying. Teoh look across different data silos on apply, um, a policy to manage that data. So that's the control piece. That's that lever the other side's on. Sometimes competing with that control around sense of data around managing the costs of data is innovation innovation, being able to speculate on experiment and trying things out where you don't really know what the outcome is. If you're a data scientist and engineer, you've got a hypothesis. And now, before you got that tension between control over data on innovation and driving value from it. So enterprise wide manage data management is really helping to enough. Where might that latent value be across that sets of data? The other piece is adaptive data governance. Those controls that that that stick from the data policemen on day to steer its where they're trying to protect the organization, protect the brand, protect consumers data is necessary. But in different use cases, you might want to nuance and apply a different policy to govern that data run of into the context where you may have data that is less sensitive. Um, that can me used for innovation. Andi. Adapting the style of governance to fit the context is another trend that we're seeing coming up here. A few others is where we're sitting quite extensively and working with automating data discovery. We're now breaking that down into what can we direct? What do we know is a business outcome is a known up front objective on direct that data discovery to towards that. And that means applying around with Dems run technology and our tools towards solving a known problem. The other one is autonomous data discovery. And that means, you know, trying to allow background processes do winds down what changes are happening with data over time flagging those anomalies. And the reason that's important is when you look over a length of time to see different spikes, different trends and activity that's really giving a day drops team the ability to to manage and calibrate how they're applying policies and controls today. There, in the last two David that we're seeing is this huge drive towards self service so reimagining how to play policy data governance into the hands off, um, a day to consumer inside a business or indeed, the consumer themselves. The South service, um, if their banking customer or healthcare customer and the policies and the controls and rules, making sure that those are all in place to adaptive Lee, um, serve those data marketplaces that, um when they're involved in creating, >>I want to ask you about the autonomous data discovering the adaptive data. Governance is the is the problem where addressing their one of quality. In other words, machines air better than humans are doing this. Is that one of scale that humans just don't don't scale that well, is it? Is it both? Can you add some color to that >>yet? Honestly, it's the same equation that existed 10 years ago, 20 years ago. It's It's being exacerbated, but it's that equation is how do I control both things that I need to protect? How do we enable innovation where it is going to deliver business value? Had to exchange data between a customer, somebody in my supply chains safely. And all of that was managing the fourth that leg, which is cost overheads. You know, there's no no can checkbook here. I've got a figure out. If only see io and CDO how I do all of this within a fixed budget so that those aspects have always been there. Now, with more choices. Infrastructure in the cloud, um, NPR driven applications own promise. And that is expanding the choices that a a business has and how they put mandated what it's also then creating a layer off management and data governance that really has to now, uh, manage those full wrath space control, innovation, exchange of data on the cost overhead. >>That that top layer of the first slide that we showed was all about business value. So I wonder if we could drill into the business impact a little bit. What do your customers seeing you know, specifically in terms of the impact of all this automation on their business? >>Yeah, so we've had some great results. I think view the biggest Have Bean helping customers move away from manually curating their data in their metadata. It used to be a time where for data quality initiatives or data governance initiative that be teams of people manually feeding a data Cavallo. And it's great to have the inventory of classified data to be out to understand single version of the trees. But in a having 10 15 people manually process that keep it up to date when it's moving feet. The reality of it is what's what's true about data today? and another few sources in a few months. Time to your business on start collaborating with new partners. Suddenly the landscape has changed. The amount of work is gonna But the, um, what we're finding is through automating creating that data discovery feeding a dent convoke that's releasing a lot more time for our CAS. Mr Spend on innovating and managing their data. A couple of others is around cell service data and medics moving the the choices of what data might have business value into the hands of business users and and data consumers to They're faster cycle times around generating insights. Um, we really helping that by automating the creation of those those data sets that are needed for that. And in the last piece, I'd have to say where we're seeing impacts. A more recently is in the exchange of data. There are a number of marketplaces out there who are now being compelled to become more digital to rewire their business processes. Andi. Everything from an r p a initiative. Teoh automation involving digital transformation is having, um, see iose Chief data officers Andi Enterprise architects rethink how do they how they re worthy pipelines? But they dated to feed that additional transformation. >>Yeah, to me, it comes down to monetization. Of course, that's for for profit in industry, from if nonprofits, for sure, the cost cutting or, in the case of healthcare, which we'll talk about in a moment. I mean, it's patient outcomes. But you know, the the job of ah, chief data officer has gone from your data quality and governance and compliance to really figuring out how data and be monetized, not necessarily selling the data, but how it contributes for the monetization of the company and then really understanding specifically for that organization how to apply that. And that is a big challenge. We chatted about it 10 years ago in the early days of a Duke. And then, you know, 1% of the companies had enough engineers to figure it out. But now the tooling is available, the technology is there and the the practices air there, and that really to me, is the bottom line. A. J is it says to show me the money. >>Absolutely. It's is definitely then six sing links is focusing in on the saying over here, that customer Onda, where we're helping there is dio go together. Those disparities siloed source of data to understand what are the needs of the patient of the broker of the if it's insurance? Ah, one of the needs of the supply chain manager If its manufacturing onda providing that 3 60 view of data, um is helping to see helping that individual unlock the value for the business. Eso data is providing the lens, provided you know which data it is that can God assist in doing that? >>And you know, you mentioned r p A. Before an r p A customer tell me she was a six Sigma expert and she told me we would never try to apply six segment to a business process. But with our P A. We can do so very cheaply. Well, what that means is lower costs means better employee satisfaction and, really importantly, better customer satisfaction and better customer outcomes. Let's talk about health care for a minute because it's a really important industry. It's one that is ripe for disruption on has really been up until recently, pretty slow. Teoh adopt ah, lot of the major technologies that have been made available, but come, what are you seeing in terms of this theme, we're using a putting data to work in health care. Specific. >>Yeah, I mean, healthcare's Havlat thrown at it. There's been a lot of change in terms of legislation recently. Um, particularly in the U. S. Market on in other economies, um, healthcare ease on a path to becoming more digital on. Part of that is around transparency of price, saying to be operating effectively as a health care marketplace, being out to have that price transparency, um, around what an elective procedure is going to cost before taking that that's that forward. It's super important to have an informed decision around there. So we look at the US, for example. We've seen that health care costs annually have risen to $4 trillion. But even with all of that on cost, we have health care consumers who are reluctant sometimes to take up health care if they even if they have symptoms on a lot of that is driven through, not knowing what they're opening themselves up to. Andi and I think David, if you are, I want to book, travel, holiday, maybe, or trip. We want to know what what we're in for what we're paying for outfront, but sometimes in how okay, that choice, the option might be their plan, but the cost that comes with it isn't so recent legislation in the US Is it certainly helpful to bring for that tryst price, transparency, the underlying issue there? There is the disparity. Different formats, types of data that being used from payers, patients, employers, different healthcare departments try and make that make that work. And when we're helping on that aspect in particular related to track price transparency is to help make that date of machine readable. So sometimes with with data, the beneficiary might be on a person. I've been a lot of cases now we're seeing the ability to have different systems, interact and exchange data in order to process the workflow. To generate online at lists of pricing from a provider that's been negotiated with a payer is, um, is really a neighboring factor. >>So, guys, I wonder if you bring up the next slide, which is kind of the Nirvana. So if you if you saw the previous slide that the middle there was all different shapes and presumably to disparage data, this is that this is the outcome that you want to get. Everything fits together nicely and you've got this open exchange. It's not opaque as it is today. It's not bubble gum band aids and duct tape, but but but described this sort of outcome the trying to achieve and maybe a little bit about what gonna take to get there. >>Yeah, that's a combination of a number of things. It's making sure that the data is machine readable. Um, making it available to AP eyes that could be our ph toes. We're working with technology companies that employ R P. A full health care. I'm specifically to manage that patient and pay a data. Teoh, bring that together in our data Discovery. What we're able to do is to classify that data on having made available to eight downstream tour technology or person to imply that that workflow to to the data. So this looks like nirvana. It looks like utopia. But it's, you know, the end objective of a journey that we can see in different economies there at different stages of maturity, in turning healthcare into a digital service, even so that you could consume it from when you live from home when telling medicine. Intellicast >>Yes, so And this is not just health care but you wanna achieve that self service doing data marketplace in virtually any industry you working with TCS, Tata Consultancy Services Toe Achieve this You know, if you are a company like Iota has toe have partnerships with organizations that have deep industry expertise Talk about your relationship with TCS and what you guys are doing specifically in this regard. >>Yeah, we've been working with TCS now for room for a long while. Andi will be announcing some of those initiatives here where we're now working together to reach their customers where they've got a a brilliant framework of business for that zero when there re imagining with their clients. Um, how their business cause can operate with ai with automation on, become more agile in digital. Um, our technology, the dreams of patients that we have in our portfolio being out to apply that at scale on the global scale across industries such as banking, insurance and health care is is really allowing us to see a bigger impact on consumer outcomes. Patient outcomes And the feedback from TCS is that we're really helping in those initiatives remove that friction. They talk a lot about data. Friction. Um, I think that's a polite term for the the image that we just saw with the disparity technologies that the legacy that has built up. So if we want to create a transformation, Um, having a partnership with TCS across Industries is giving us that that reach and that impacts on many different people's day to day jobs and knives. >>Let's talk a little bit about the cloud. It's It's a topic that we've hit on quite a bit here in this in this content Siri's. But But you know, the cloud companies, the big hyper scale should put everything into the cloud, right? But but customers are more circumspect than that. But at the same time, machine intelligence M. L. A. The cloud is a place to do a lot of that. That's where a lot of the innovation occurs. And so what are your thoughts on getting to the cloud? Ah, putting dated to work, if you will, with machine learning stuff you're doing with aws. What? You're fit there? >>Yeah, we we and David. We work with all of the cloud platforms. Mike stuffed as your G, c p IBM. Um, but we're expanding our partnership now with AWS Onda we really opening up the ability to work with their Greenfield accounts, where a lot of that data that technology is in their own data centers at the customer, and that's across banking, health care, manufacturing and insurance. And for good reason. A lot of companies have taken the time to see what works well for them, with the technologies that the cloud providers ah, are offered a offering in a lot of cases testing services or analytics using the cloud to move workloads to the cloud to drive Data Analytics is is a real game changer. So there's good reason to maintain a lot of systems on premise. If that makes sense from a cost from a liability point of view on the number of clients that we work with, that do have and we will keep their mainframe systems within kobo is is no surprise to us, but equally they want to tap into technologies that AWS have such a sage maker. The issue is as a chief data officer, I don't have the budget to me, everything to the cloud day one, I might want to show some results. First upfront to my business users Um, Onda worked closely with my chief marketing officer to look at what's happening in terms of customer trains and customer behavior. What are the customer outcomes? Patient outcomes and partner at comes I can achieve through analytics data signs. So I, working with AWS and with clients to manage that hybrid topology of some of that data being, uh, in the cloud being put to work with AWS age maker on night, I hope being used to identify where is the data that needs to bay amalgamated and curated to provide the data set for machine learning advanced and medics to have an impact for the business. >>So what are the critical attributes of what you're looking at to help customers decide what what to move and what to keep, if you will. >>Well, what one of the quickest outcomes that we help custom achieve is to buy that business blustery. You know that the items of data that means something to them across those different silos and pour all of that together into a unified view once they've got that for a data engineer working with a a business manager to think through how we want to create this application. There was the turn model, the loyalty or the propensity model that we want to put in place here. Um, how do we use predictive and medics to understand what needs are for a patient, that sort of innovation is what we're looking applying the tools such a sagemaker, uh, night to be west. So they do the the computation and to build those models to deliver the outcome is is across that value chain, and it goes back to the first picture that we put up. David, you know the outcome Is that a P I On the back of it, you've got the machine learning model that's been developed in That's always such as data breaks. But with Jupiter notebook, that data has to be sourced from somewhere. Somebody has to say that yet you've got permission to do what you're trying to do without falling foul of any compliance around data. Um, it'll goes back to discovering that data, classifying it, indexing it in an automated way to cut those timelines down two hours and days. >>Yeah, it's the it's the innovation part of your data portfolio, if you will, that you're gonna put into the cloud. Apply tools like sage maker and others. You told the jury. Whatever your favorite tool is, you don't care. The customer's gonna choose that and hear the cloud vendors. Maybe they want you to use their tool, but they're making their marketplaces available to everybody. But it's it's that innovation piece, the ones that you where you want to apply that self service data marketplace to and really drive. As I said before monetization. All right, give us your final thoughts. A. J bring us home. >>So final thoughts on this David is that at the moment we're seeing, um, a lot of value in helping customers discover that day the using automation automatically curating a data catalogue, and that unified view is then being put to work through our A B. I's having an open architecture to plug in whatever tool technology our clients have decided to use, and that open architecture is really feeding into the reality of what see Iose in Chief Data Officers of Managing, which is a hybrid on premise cloud approach. Do you suppose to breed Andi but business users wanting to use a particular technology to get their business outcome having the flexibility to do that no matter where you're dating. Sitting on Premise on Cloud is where self service comes in that self service. You of what data I can plug together, Dr Exchange. Monetizing that data is where we're starting to see some real traction. Um, with customers now accelerating becoming more digital, uh, to serve their own customers, >>we really have seen a cultural mind shift going from sort of complacency. And obviously, cove, it has accelerated this. But the combination of that cultural shift the cloud machine intelligence tools give give me a lot of hope that the promises of big data will ultimately be lived up to ah, in this next next 10 years. So a J ahora thanks so much for coming back on the Cube. You're you're a great guest. And ah, appreciate your insights. >>Appreciate, David. See you next time. >>All right? And keep it right there. Very right back. Right after this short break

Published Date : Sep 9 2020

SUMMARY :

And for the last several months, we've been working with Iot Tahoe on an ongoing content. A great to be here, David. So let's start by talking about some of the business realities. So the ability to exchange and you really laid it out nicely here in this diagram. tasks that have to go into serving that data two and eight p. addresses the importance of AP eyes. So everything across the stack from infrastructure down to the network um, What are some of the big trends that you're the costs of data is innovation innovation, being able to speculate Governance is the is and data governance that really has to now, uh, manage those full wrath space control, the impact of all this automation on their business? And in the last piece, I'd have to say where we're seeing in the case of healthcare, which we'll talk about in a moment. Eso data is providing the lens, provided you know Teoh adopt ah, lot of the major technologies that have been made available, that choice, the option might be their plan, but the cost that comes with it isn't the previous slide that the middle there was all different shapes and presumably to disparage into a digital service, even so that you could consume it from Yes, so And this is not just health care but you wanna achieve that self service the image that we just saw with the disparity technologies that the legacy Ah, putting dated to work, if you will, with machine learning stuff A lot of companies have taken the time to see what works well for them, to move and what to keep, if you will. You know that the items of data that means something to The customer's gonna choose that and hear the cloud vendors. the flexibility to do that no matter where you're dating. that cultural shift the cloud machine intelligence tools give give me a lot of hope See you next time. And keep it right there.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

Ajay VohoraPERSON

0.99+

AWSORGANIZATION

0.99+

IBMORGANIZATION

0.99+

TCSORGANIZATION

0.99+

MikePERSON

0.99+

AndiPERSON

0.99+

IotaORGANIZATION

0.99+

Iot TahoeORGANIZATION

0.99+

IotORGANIZATION

0.99+

$4 trillionQUANTITY

0.99+

SiriTITLE

0.99+

first pictureQUANTITY

0.99+

fiveQUANTITY

0.99+

two partsQUANTITY

0.99+

two hoursQUANTITY

0.99+

USLOCATION

0.99+

bothQUANTITY

0.99+

Tata Consultancy ServicesORGANIZATION

0.99+

10 years agoDATE

0.99+

first slideQUANTITY

0.99+

1%QUANTITY

0.99+

both thingsQUANTITY

0.98+

FirstQUANTITY

0.98+

oneQUANTITY

0.98+

SpendPERSON

0.98+

fourthQUANTITY

0.97+

U. S.LOCATION

0.97+

IosePERSON

0.97+

todayDATE

0.97+

six sing linksQUANTITY

0.96+

A. JPERSON

0.96+

20 years agoDATE

0.96+

three other elementsQUANTITY

0.96+

day twoQUANTITY

0.95+

twoQUANTITY

0.95+

last monthDATE

0.94+

DukeORGANIZATION

0.93+

zeroQUANTITY

0.91+

covePERSON

0.9+

AndiORGANIZATION

0.9+

OndaORGANIZATION

0.88+

10 15 peopleQUANTITY

0.87+

single versionQUANTITY

0.86+

this monthDATE

0.83+

CDOTITLE

0.83+

eightQUANTITY

0.81+

TahoeORGANIZATION

0.79+

eight downstreamQUANTITY

0.77+

AWS OndaORGANIZATION

0.77+

Io TahoeORGANIZATION

0.77+

next next 10 yearsDATE

0.76+

dayQUANTITY

0.73+

CubeCOMMERCIAL_ITEM

0.72+

six segmentQUANTITY

0.71+

last several monthsDATE

0.71+

threeQUANTITY

0.68+

EsoORGANIZATION

0.6+

Have BeanORGANIZATION

0.6+

3 60 viewQUANTITY

0.58+

JORGANIZATION

0.55+

GreenfieldORGANIZATION

0.49+

ahoraPERSON

0.48+

JPERSON

0.48+

SouthORGANIZATION

0.47+

six SigmaQUANTITY

0.46+

Ajay Vohora & Ved Sen V1 FOR REVIEW


 

>> Narrator: From around the globe, it's "theCUBE" with digital coverage of Smart Data Marketplaces brought to you by Io-Tahoe. >> We're back. We're talking about smart data and have been for several weeks now. Really it's all about injecting intelligence and automation into the data life cycle of the data pipeline. And today we're drilling into Smart Data Marketplaces, really trying to get to that self-serve, unified, trusted, secured, and compliant data models. And this is not trivial. And with me to talk about some of the nuances involved in actually getting there with folks that have experienced doing that. They'd send a series of digital evangelist with Tata Consultancy Services, TCS. And Ajay Vohora is back, he's the CEO of Io-Tahoe. Guys, great to see you, thanks so much for coming on. >> Good to see you, Dave. >> Hey Dave. >> Ajay, let's start with you. Let's set up the sort of smart data concept. What's that all about? What's your perspective? >> Yeah, so I mean, our way of thinking about this is you you've got data, it has latent value, and it's really about discovering what the properties of that data. Does it have value? Can you put that data to work? And the way we go about that with algorithms and machine learning, to generate signals in that data identified patterns, that means we can start to discover how can we apply that data to down stream? What value can we unlock for a customer and business? >> Well, so you've been on this, I mean, really like a laser, why? I mean, why this issue? Did you see a gap in the marketplace in terms of talking to customers and maybe you can help us understand the origin? >> Yeah, I think that the gap has always been there. They've been, it's become more apparent over recent times with big data. So the ability to manually work with volumes of data in petabytes is prohibitively complex and expensive. So you need the different routes, you need different set of tools and methods to do that. Metadata are data that you can understand about data. That's what we at Io-Tahoe focus on, discovering and generating that metadata. That ready, that analogy to automate those data ops processes. So the gap David, is being felt by a business owner prizes and all sectors, healthcare, telecoms, and putting that data to work. >> So Ved, Let's talk a little bit about your role. You work with a lot of customers. I see you as an individual in a company who's really trying to transform what is a very challenging industry. That's sort of ripe for transformation, but maybe you could give us your perspective on this, what kind of signals you're looking for from the data pipeline and we'll get into how you are helping transform healthcare? >> Thanks, David. You know I think this year has been one of those years where we've all realized about this idea of unknown unknowns, where something comes around the corner that you're completely not expecting. And that's really hard to plan for obviously. And I think what we need is the ability to find early signals and be able to act on things as soon as you can. Sometimes, and you know, the COVID-19 scenario of course, is hopefully once in a generation thing, but most businesses struggle with the idea that they may have the data there in their systems, but they still don't know which bit of that is really valuable and what are the signals they should be watching for. And I think the interesting thing here is the ability for us to extract from a massive data, the most critical and important signals. And I think that's where we want to focus on. >> And so, talk a little bit about healthcare in particular and sort of your role there, and maybe at a high level. How Tata and your eco-system are helping transform healthcare? >> So if you look at healthcare, you've got the bit where people need active intervention from a medical professional. And then you've got this larger body of people, typically elderly people who aren't unwell, but they have frailties. They have underlying conditions and they're very vulnerable, especially in the world that we're in now in the post-COVID-19 scenario. And what we were trying to look at is how do we keep people who are elderly, frail and vulnerable? How can we keep them safe in their own homes rather than moving to care homes, where there has been an incredibly high level of infection for things like COVID-19. So the world works better if you can keep people safe in their own homes, if you can see the slide we've got. We're also talking about a world where care is expensive. In most Western countries, especially in Western Europe, the number of elderly people is increasing as a percentage of the population, quite significantly, and resources just are not keeping up. We don't have enough people. We don't have enough funding to look after them effectively. And the care industry that used to do that job has been struggling of late. So it's kind of a perfect storm for the need for technology intervention there. And in that space, what we're saying is the data signal that we want to receive are exactly what as a relative, or a son or daughter you might want from a parent to say, "Everything's okay. "We know that today's been just like every other day "there are no anomalies in your daily living." If you could get the signals that might tell us that something's wrong, something not quite right. We don't need very complex diagnostics. We just need to know something's not quite right, that my dad hasn't woken up as has always at seven o'clock, but till nine o'clock there's no movement. Maybe he's a bit unwell. It's that kind of signal that if we can generate, can make a dramatic difference to how we can look out for these people, whether through professional carers or through family members. So what we're looking to do is to sensor-enable homes of vulnerable people so that those data signals can come through to us in a curated manner, in a way that protects privacy and security of the individual, but gives the right people, which is carers or chosen family members the access to the signals, which is alerts that might tell you there was too much movement at night, or the front door was been left open, things like that that would give you a reason to call him and check. Everybody has spoken to in this always has an example of an uncle or a relative or parent that they've looked after. And all they're looking for is a signal. Even stories like my father's neighbor calls me when he doesn't open his curtain by 11 o'clock, that actually, if you think about it is a data signal that something might be all right. And I think what we're trying to do with technology is create those kinds of data signals because ultimately, the healthcare system works much better if you can prevent rather than cure. So every dollar that you put into prevention saves maybe $3 to $5 downstream. The economic summit also are working our favor. >> And those signals give family members the confidence to act. Ajay, it is interesting to hear what Ved was talking about in terms of the unknowns, because when you think about the early days of the computer industry, there were a lot of knowns, the processes were known. It was like the technology was the big mystery. Now, I feel like it's flipped. We've certainly seen that with COVID. The technology is actually quite well understood and quite mature and reliable. One of the examples is automated data discovery, which is something that you guys have been been focused on at Io-Tahoe. Why is automated data discovery such an important component of a smart data life cycle? >> Yeah. I mean, if we look David at the schematic and this one moves from left to right where right at the outset with that latent data, the value is late because you don't know. Does it have? Can it be applied? Can that data be put to work or not? And the objective really is about driving some form of exchange or monetization of data. If you think about it in insurance or healthcare, you've got lots of different parties, providers, payers, patients, everybody's looking to make some kind of an exchange of information. The difficulty is in all of those organizations, that data sits within its own system. So data discovery, if we drill into the focus itself that, it's about understanding which data has value, classifying that data so that it can be applied and being able to tag it so that it can then be put to use it's the real enabler for that per day drops. >> So maybe talk a little bit more about this. We're trying to get to self-service. It's something that we hear a lot about. You mentioned putting data to work. It seems to me that if the business can have access to that data and serve themselves, that's the way to put data to work. Do you have thoughts on that? >> Yeah, I mean, thinking back in terms of what IT and the IT function in a business could provide, there have been limitations around infrastructure, around scaling, around compute. Now that we're in an economy that is digital driven by API's your infrastructure, your data, your business rules, your intelligence, your models, all of those on the back of an API. So the options become limitless. How you can drive value and exchange that data. What that allows us to do is to be more creative, if we can understand what data has value for what use case. >> Ved, Let's talk a little bit about the US healthcare system. It's a good use case. I was recently at a chief data officer conference and listening to the CDO of Johns Hopkins, talk about the multiple different formats that they had to ingest to create that COVID map. They even had some PDFs, they had different definitions, and that's sort of underscored to me, the state of the US healthcare industry. I'm not as familiar with the UK and Europe generally, but I am familiar with the US healthcare system and the diversity that's there, the duplication of information and the like, maybe you could sort of summarize your perspectives and give us kind of the before and your vision of the after, if you will? >> The use of course, is particularly large and complex system. We all know that. We also know, I think there is some research that suggests that in the US the per-capita spend on healthcare is among the highest in the world. I think it's like 70%, and that compares to what just under 9%, which is going to be European, typical European figure. So it's almost double of that, but the outcomes are still vastly poor. When Ajay and I were talking earlier, I think we believe that there is a concept of a data friction. When you've got multiple players in an eco-system, trying to provide a single service as a patient, you're receiving a single health care service. There are probably a dozen up to 20 different organizations that have to collaborate to make sure you get that top of the line health care service. That kind of investment deserves. And what prevents it from happening very often is what we would call data friction, which is the ability to effectively share data. Something as simple as a healthcare record, which says, "This is Dave, this is Ved, this is Ajay." And when we go to hospital for anything, whatever happens, that healthcare record can capture all the information and tie to us as an individual. And if you go to a different hospital, then that record will follow you. This is how you would expect that to be implemented, but I think we're still on that journey. There are lots and lots of challenges. I've seen anecdotal data around people who suffered because they weren't carrying a card when they went into hospital, because that card has the critical elements of data, but in today's world, should you need to carry a piece of paper or can the entire thing be a digital data flow that can easily be, can certainly navigate through lack of paper and those kinds of things. So the vision that I think we need to be looking at is an effective data exchange or marketplace back with a kind of a backbone model where people agree and sign off a data standard, where each individual's data is always tied to the individual. So if you were to move States, if you would move providers, change insurance companies, none of that would impact your medical history, your data, and the ability to have the other care and medical professionals to access the data at the point of need and at the point of healthcare delivery. So I think that's the vision we're looking at, but as you rightly you said that there are enormous number of challenges, partly because of the history, of healthcare, I think it was technology enablement of healthcare started early. So there's a lot of legacy as well. So we shouldn't trivialize the challenges that the industry faces, but that I think is the way we want to go. >> Well, privacy is obviously a huge one, and a lot of the processes are built around non-digital processes and what you're describing as a flip for digital first. I mean, as a consumer, as a patient, I want an app for that. So I can see my own data. I can see price, price transparency, give access to people that I think need it. And that is a daunting task, isn't it? >> Absolutely. And I think the implicit idea and what you just said, which is very powerful is also on the app you want to control. >> Yes. >> And sometimes you want to be able to change access on data at that point. Right now, I'm at the hospital. I would like to access my data. And when I walk away or maybe three days later, I want to revoke that access. It's that level of control. And absolutely, it is by no means a trivial problem, but I think that's where you need the data automation tools. If you try to do any of this manually, we'd be here for another decade trying to solve this, but that's where tools like Io-Tahoe come in because to do this, a lot of the heavy lifting behind the scenes has to be automated. There has to be a machine churning that and presenting the simpler options. And I know you were talking about it just a little while ago Ajay. I was reminded of the example of a McDonald's or a Coke, because the sales store idea that you can go in and you can do your own ordering off a menu, or you can go in and select five different flavors from a Coke machine and choose your own particular blend of Coke. It's a very trivial example, but I think that's the word we want to get to with access of data as well. If it was that simple for consumers, for enterprise, business people, for doctors, then that's where we ultimately want to be able to arrive. But of course, to make something very simple for the end-user, somebody has to solve for complexity behind the scenes. >> So Ajay, it seems to me Ajay there're two major outcomes here. One is of course, the most important I guess, is patient outcomes, and the other is cost. I mean, they talked about the cost issues, we all, US especially understand the concerns about rising costs of healthcare. My question is this, how does a Smart Data Marketplace fit into achieving those two very important outcomes? >> When we think about how automation is enabling that, where we've got different data formats, the manual tasks are involved, duplication of information. The administrative overhead of that alone and the work, the rework, and the cycles of work that generates. That's really what we're trying to help with data is to eliminate that wasted effort. And with that wasted effort comes time and money to employ people to work through those siloed systems. So getting to the point where there is an exchange in a marketplace just as they would be for banking or insurance is really about automating the classification of data to make it available to a system that can pick it up through an API and to run a machine learning model and to manage a workflow, a process. >> Right, so you mentioned backing insurance, you're right. I mean, we've actually come a long way and just in terms of, know the customer and applying that to know the patient would be very powerful. I'm interested in what you guys are doing together, just in terms of your vision. Are you going to market together, kind of what you're seeing in terms of promoting or enabling this self-service, self-care. Maybe you could talk a little bit about Io-Tahoe and Tata, the intersection at the customer? >> Sure. I think we've been very impressed with the TCS vision of 4.0, how the re-imagining traditional industries, whether it's insurance, banking, healthcare, and bringing together automation, agile processes, robotics, AI, and once those enablers, technology may have brought together to re-imagine how those services can be delivered digitally. All of those are dependent on data. So we see that there's a really good fit here to enable understanding the legacy, the historic situation that has built up over time in an organization, a business and to help shine a light on what's meaningful in that to migrate to the cloud or to drive a digital twin, data science project. >> Ved, anything you can add to that? >> Sure. I mean, we do take the business 4.0 model quite seriously in terms of a lens with which you look at any industry, and what I talked about in healthcare was an example of that. And for us business 4.0, means a few very specific things. The technology that we use in today's verse should be agile, automated, intelligent, and cloud-based. These have become kind of hygiene factors now. On top of that, the businesses we build should be mass customized. They should be risk embracing. They should engage ecosystems, and they should strive for exponential value, not 10% growth year on year, but doubling, tripling every three, four years, because that's the competition that most businesses are facing today. And within that, the Tata group itself, is an extremely purpose-driven business. We really believe that we exist to serve communities, not just one specific set, i.e. shareholders, but the broader community in which we live and work. And I think this framework also allows us to apply that to things like healthcare, to education and to a whole vast range of areas where, everybody has a vision of using data science or doing really clever stuff at the gradients. But what becomes clear is, to do any of that, the first thing you need is a foundational piece. And as a foundation isn't right, then no matter how much you invest in the data science tools you won't get the answers you want. And the work we're doing with the Io-Tahoe really, for me, is particularly exciting because it sorts out that foundational piece. And at the end of it, to make all of this, again, I will repeat that, to make it simple and easy to use for the end user, whoever that is. And I realized that I'm probably the first person who's used fast food as a shining example for healthcare in this discussion, but you can make a lot of different examples. And today, if you press a button and start a car, that's simplicity, but someone has solved for that. And that's what we want to do with data as well. >> Yeah, that makes a lot of sense to me. We talk a lot about digital transformation and a digital business, and I would observe that a digital business puts data at the core. And you can certainly be the best example. There is, of course, Google is an all digital business, but take a company like Amazon, Who's got obviously a massive physical component to its business. Data is at the core. And that's exactly my takeaway from this discussion. Both of you are talking about putting data at the core, simplifying it, making sure that it's compliant, and healthcare it's taking longer, 'cause it's such a high risk industry, but it's clearly happening, COVID I guess, was an accelerant. Guys, Ajay, I'll start with you. Any final thoughts that you want to leave the audience with? _ Yeah, we're really pleased to be working with TCS. We've been able to explore how we're able to put dates to work in a range of different industries. Ved has mentioned healthcare, telecoms, banking and insurance are others. And the same impact they speak to whenever we see the exciting digital transformations that are being planned, being able to accelerate those, unlock the value from data is where we're having a purpose. And it's good that we can help patients in the healthcare sector, consumers in banking realize a better experience through having a more joined up marketplace with their data. >> Ved, you know what excites me about this conversation is that, as a patient or as a consumer, if I'm helping loved ones, I can go to the web and I can search, and I can find a myriad of possibilities. What you're envisioning here is really personalizing that with real time data. And that to me is a game changer. Your final thoughts? >> Thanks, David. I absolutely agree with you that the idea of data centricity and simplicity are absolutely forefront, but I think if we were to design an organization today, you might design it very differently to how most companies today are structured. And maybe Google and Amazon are probably better examples of that because you almost have to think of a business as having a data engine room at its core. A lot of businesses are trying to get to that stage, whereas what we call digital natives, are people who have started life with that premise. So I absolutely agree with you on that, but extending that a little bit. If you think of most industries as eco-systems that have to collaborate, then you've got multiple organizations who will also have to exchange data to achieve some shared outcomes. Whether you look at supply chains of automobile manufacturers or insurance companies or healthcares we've been talking about. So I think that's the next level of change we want to be able to make, which is to be able to do this at scale across organizations at industry level or in population scheme for healthcare. >> Yeah, Thank you for that. Go ahead Ajay. >> David that's where it comes back to again, the origination where we've come from in big data. The volume of data combined with the specificity of individualizing, personalizing a service around an individual amongst that massive data from different providers is where is exciting, that we're able to have an impact. >> Well, and you know Ajay, I'm glad you brought that up because in the early days of big data, there were only a handful of companies, the biggest financial institutions. Obviously, the internet giants who had all these engineers that were able to take advantage of it. But with companies like Io-Tahoe and others, and the investments that the industry has made in terms of providing the tools and simplifying that, especially with machine intelligence and AI and machine learning, these are becoming embedded into the tooling so that everybody can have access to them, small, medium, and large companies. That's really, to me, the exciting part of this new era that we're entering. >> Yeah, and we have placed those, take it down to the level of not-for-profits and smaller businesses that want to innovate and leapfrog into, to growing their digital delivery of their service. >> And I know a lot of time, but Ved, what you were saying about TCS's responsibility to society, I think is really, really important. Large companies like yours, I believe, and you clearly do as well, have a responsibility to society more than just a profit. And I think, Big Tech it's a better app in a lot of cases, but so thank you for that and thank you gentlemen for this great discussion. I really appreciate it. >> Thanks David. >> Thank you. >> All right, keep it right there. I'll be right back right after this short break. This is Dave Vellante for theCUBE. (calm music)

Published Date : Sep 8 2020

SUMMARY :

brought to you by Io-Tahoe. of the data pipeline. What's that all about? And the way we go about and putting that data to work. from the data pipeline the ability to find early and sort of your role there, the access to the signals, One of the examples is the value is late because you don't know. that's the way to put data to work. and the IT function in a and listening to the CDO of Johns Hopkins, and that compares to what and a lot of the processes are built also on the app you want behind the scenes has to be automated. One is of course, the of that alone and the work, that to know the patient in that to migrate to the cloud And at the end of it, to make all of this, Yeah, that makes a lot of sense to me. And that to me is a game changer. of that because you almost Yeah, Thank you for that. the origination where we've and the investments that the those, take it down to the level And I know a lot of time, This is Dave Vellante for theCUBE.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavidPERSON

0.99+

Ajay VohoraPERSON

0.99+

AmazonORGANIZATION

0.99+

$3QUANTITY

0.99+

Dave VellantePERSON

0.99+

GoogleORGANIZATION

0.99+

DavePERSON

0.99+

Tata Consultancy ServicesORGANIZATION

0.99+

Io-TahoeORGANIZATION

0.99+

$5QUANTITY

0.99+

AjayPERSON

0.99+

70%QUANTITY

0.99+

twoQUANTITY

0.99+

TCSORGANIZATION

0.99+

Western EuropeLOCATION

0.99+

COVID-19OTHER

0.99+

seven o'clockDATE

0.99+

VedPERSON

0.99+

11 o'clockDATE

0.99+

USLOCATION

0.99+

BothQUANTITY

0.99+

oneQUANTITY

0.99+

OneQUANTITY

0.99+

TataORGANIZATION

0.99+

todayDATE

0.98+

EuropeLOCATION

0.98+

three days laterDATE

0.98+

Tata groupORGANIZATION

0.98+

CokeORGANIZATION

0.98+

Ved SenPERSON

0.98+

five different flavorsQUANTITY

0.98+

four yearsQUANTITY

0.97+

under 9%QUANTITY

0.96+

single serviceQUANTITY

0.96+

UKLOCATION

0.96+

McDonald'sORGANIZATION

0.95+

a dozenQUANTITY

0.93+

WesternLOCATION

0.92+

doublingQUANTITY

0.91+

Johns HopkinsORGANIZATION

0.91+

first personQUANTITY

0.91+

up to 20 different organizationsQUANTITY

0.9+

nine o'clockDATE

0.9+

COVIDTITLE

0.89+

this yearDATE

0.89+

setQUANTITY

0.89+

10% growthQUANTITY

0.88+

firstQUANTITY

0.87+

doubleQUANTITY

0.85+

first thingQUANTITY

0.85+

EuropeanOTHER

0.85+

single healthQUANTITY

0.84+

each individualQUANTITY

0.83+

every threeQUANTITY

0.79+

two majorQUANTITY

0.76+

Eileen Vidrine, US Air Force | MIT CDOIQ 2020


 

>> Announcer: From around the globe, it's theCube with digital coverage of MIT, Chief Data Officer and Information Quality Symposium brought to you by Silicon Angle Media. >> Hi, I'm Stu Miniman and this is the seventh year of theCubes coverage of the MIT, Chief Data Officer and Information Quality Symposium. We love getting to talk to these chief data officers and the people in this ecosystem, the importance of data, driving data-driven cultures, and really happy to welcome to the program, first time guests Eileen Vitrine, Eileen is the Chief Data Officer for the United States Air Force, Eileen, thank you so much for joining us. >> Thank you Stu really excited about being here today. >> All right, so the United States Air Force, I believe had it first CDO office in 2017, you were put in the CDO role in June of 2018. If you could, bring us back, give us how that was formed inside the Air force and how you came to be in that role. >> Well, Stu I like to say that we are a startup organization and a really mature organization, so it's really about culture change and it began by bringing a group of amazing citizen airman reservists back to the Air Force to bring their skills from industry and bring them into the Air Force. So, I like to say that we're a total force because we have active and reservists working with civilians on a daily basis and one of the first things we did in June was we stood up a data lab, that's based in the Jones building on Andrews Air Force Base. And there, we actually take small use cases that have enterprise focus, and we really try to dig deep to try to drive data insights, to inform senior leaders across the department on really important, what I would call enterprise focused challenges, it's pretty exciting. >> Yeah, it's been fascinating when we've dug into this ecosystem, of course while the data itself is very sensitive and I'm sure for the Air Force, there are some very highest level of security, the practices that are done as to how to leverage data, the line between public and private blurs, because you have people that have come from industry that go into government and people that are from government that have leveraged their experiences there. So, if you could give us a little bit of your background and what it is that your charter has been and what you're looking to build out, as you mentioned that culture of change. >> Well, I like to say I began my data leadership journey as an active duty soldier in the army, and I was originally a transportation officer, today we would use the title condition based maintenance, but back then, it was really about running the numbers so that I could optimize my truck fleet on the road each and every day, so that my soldiers were driving safely. Data has always been part of my leadership journey and so I like to say that one of our challenges is really to make sure that data is part of every airmans core DNA, so that they're using the right data at the right level to drive insights, whether it's tactical, operational or strategic. And so it's really about empowering each and every airman, which I think is pretty exciting. >> There's so many pieces of that data, you talk about data quality, there's obviously the data life cycle. I know your presentation that you're given here at the CDO, IQ talks about the data platform that your team has built, could you explain that? What are the key tenants and what maybe differentiates it from what other organizations might have done? >> So, when we first took the challenge to build our data lab, we really wanted to really come up. Our goal was to have a cross domain solution where we could solve data problems at the appropriate classification level. And so we built the VAULT data platform, VAULT stands for visible, accessible, understandable, linked, and trustworthy. And if you look at the DOD data strategy, they will also add the tenants of interoperability and secure. So, the first steps that we have really focused on is making data visible and accessible to airmen, to empower them, to drive insights from available data to solve their problems. So, it's really about that data empowerment, we like to use the hashtag built by airmen because it's really about each and every airman being part of the solution. And I think it's really an exciting time to be in the Air Force because any airman can solve a really hard challenge and it can very quickly wrap it up rapidly, escalate up with great velocity to senior leadership, to be an enterprise solution. >> Is there some basic training that goes on from a data standpoint? For any of those that have lived in data, oftentimes you can get lost in numbers, you have to have context, you need to understand how do I separate good from bad data, or when is data still valid? So, how does someone in the Air Force get some of that beta data competency? >> Well, we have taken a multitenant approach because each and every airman has different needs. So, we have quite a few pathfinders across the Air Force today, to help what I call, upscale our total force. And so I developed a partnership with the Air Force Institute of Technology and they now have a online graduate level data science certificate program. So, individuals studying at AFIT or remotely have the opportunity to really focus on building up their data touchpoints. Just recently, we have been working on a pathfinder to allow our data officers to get their ICCP Federal Data Sector Governance Certificate Program. So, we've been running what I would call short boot camps to prep data officers to be ready for that. And I think the one that I'm most excited about is that this year, this fall, new cadets at the U.S Air Force Academy will be able to have an undergraduate degree in data science and so it's not about a one prong approach, it's about having short courses as well as academe solutions to up skill our total force moving forward. >> Well, information absolutely is such an important differentiator(laughs) in general business and absolutely the military aspects are there. You mentioned the DOD talks about interoperability in their platform, can you speak a little bit to how you make sure that data is secure? Yet, I'm sure there's opportunities for other organizations, for there to be collaboration between them. >> Well, I like to say, that we don't fight alone. So, I work on a daily basis with my peers, Tom Cecila at the Department of Navy and Greg Garcia at the Department of Army, as well as Mr. David Berg in the DOD level. It's really important that we have an integrated approach moving forward and in the DOD we partner with our security experts, so it's not about us doing security individually, it's really about, in the Air Force we use a term called digital air force, and it's about optimizing and building a trusted partnership with our CIO colleagues, as well as our chief management colleagues because it's really about that trusted partnership to make sure that we're working collaboratively across the enterprise and whatever we do in the department, we also have to reach across our services so that we're all working together. >> Eileen, I'm curious if there's been much impact from the global pandemic. When I talk to enterprise companies, that they had to rapidly make sure that while they needed to protect data, when it was in their four walls and maybe for VPN, now everyone is accessing data, much more work from home and the like. I have to imagine some of those security measures you've already taken, but have there anything along those lines or anything else that this shift in where people are, and a little bit more dispersed has impacted your work? >> Well, the story that I like to say is, that this has given us velocity. So, prior to COVID, we built our VAULT data platform as a multitenancy platform that is also cross-domain solution, so it allows people to develop and do their problem solving in an appropriate classification level. And it allows us to connect or pushup if we need to into higher classification levels. The other thing that it has helped us really work smart because we do as much as we can in that unclassified environment and then using our cloud based solution in our gateways, it allows us to bring people in at a very scheduled component so that we maximize, or we optimize their time on site. And so I really think that it's really given us great velocity because it has really allowed people to work on the right problem set, on the right class of patient level at a specific time. And plus the other pieces, we look at what we're doing is that the problem set that we've had has really allowed people to become more data focused. I think that it's personal for folks moving forward, so it has increased understanding in terms of the need for data insights, as we move forward to drive decision making. It's not that data makes the decision, but it's using the insight to make the decision. >> And one of the interesting conversations we've been having about how to get to those data insights is the use of things like machine learning, artificial intelligence, anything you can share about, how you're looking at that journey, where you are along that discovery. >> Well, I love to say that in order to do AI and machine learning, you have to have great volumes of high quality data. And so really step one was visible, accessible data, but we in the Department of the Air Force stood up an accelerator at MIT. And so we have a group of amazing airmen that are actually working with MIT on a daily basis to solve some of those, what I would call opportunities for us to move forward. My office collaborates with them on a consistent basis, because they're doing additional use cases in that academic environment, which I'm pretty excited about because I think it gives us access to some of the smartest minds. >> All right, Eileen also I understand it's your first year doing the event. Unfortunately, we don't get, all come together in Cambridge, walking those hallways and being able to listen to some of those conversations and follow up is something we've very much enjoyed over the years. What excites you about being interact with your peers and participating in the event this year? >> Well, I really think it's about helping each other leverage the amazing lessons learned. I think that if we look collaboratively, both across industry and in the federal sector, there have been amazing lessons learned and it gives us a great forum for us to really share and leverage those lessons learned as we move forward so that we're not hitting the reboot button, but we actually are starting faster. So, it comes back to the velocity component, it all helps us go faster and at a higher quality level and I think that's really exciting. >> So, final question I have for you, we've talked for years about digital transformation, we've really said that having that data strategy and that culture of leveraging data is one of the most critical pieces of having gone through that transformation. For people that are maybe early on their journey, any advice that you'd give them, having worked through a couple of years of this and the experience you've had with your peers. >> I think that the first thing is that you have to really start with a blank slate and really look at the art of the possible. Don't think about what you've always done, think about where you want to go because there are many different paths to get there. And if you look at what the target goal is, it's really about making sure that you do that backward tracking to get to that goal. And the other piece that I tell my colleagues is celebrate the wins. My team of airmen, they are amazing, it's an honor to serve them and the reality is that they are doing great things and sometimes you want more. And it's really important to celebrate the victories because it's a very long journey and we keep moving the goalposts because we're always striving for excellence. >> Absolutely, it is always a journey that we're on, it's not about the destination. Eileen, thank you so much for sharing all that you've learned and glad you could participate. >> Thank you, STU, I appreciate being included today. Have a great day. >> Thanks and thank you for watching theCube. I'm Stu Miniman stay tuned for more from the MIT, CDO IQ event. (lively upbeat music)

Published Date : Sep 3 2020

SUMMARY :

brought to you by Silicon Angle Media. and the people in this ecosystem, Thank you Stu really All right, so the of the first things we did sure for the Air Force, at the right level to drive at the CDO, IQ talks to build our data lab, we have the opportunity to and absolutely the It's really important that we that they had to rapidly make Well, the story that I like to say is, And one of the interesting that in order to do AI and participating in the event this year? in the federal sector, is one of the most critical and really look at the art it's not about the destination. Have a great day. from the MIT, CDO IQ event.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MichaelPERSON

0.99+

EileenPERSON

0.99+

ClairePERSON

0.99+

Tom CecilaPERSON

0.99+

Lisa MartinPERSON

0.99+

David BergPERSON

0.99+

2017DATE

0.99+

Greg GarciaPERSON

0.99+

June of 2018DATE

0.99+

Jonathan RosenbergPERSON

0.99+

Michael RosePERSON

0.99+

JuneDATE

0.99+

Eileen VitrinePERSON

0.99+

BlairPERSON

0.99+

U.S Air Force AcademyORGANIZATION

0.99+

MITORGANIZATION

0.99+

WednesdayDATE

0.99+

five minutesQUANTITY

0.99+

Omni ChannelORGANIZATION

0.99+

five billionQUANTITY

0.99+

Air Force Institute of TechnologyORGANIZATION

0.99+

two yearsQUANTITY

0.99+

CambridgeLOCATION

0.99+

ThursdayDATE

0.99+

Orlando, FloridaLOCATION

0.99+

Silicon Angle MediaORGANIZATION

0.99+

United States Air ForceORGANIZATION

0.99+

Eileen VidrinePERSON

0.99+

RyanPERSON

0.99+

GoogleORGANIZATION

0.99+

BlakePERSON

0.99+

Stu MinimanPERSON

0.99+

Blair PleasantPERSON

0.99+

BC StrategiesORGANIZATION

0.99+

Department of NavyORGANIZATION

0.99+

next yearDATE

0.99+

StuPERSON

0.99+

firstQUANTITY

0.99+

ConfusionORGANIZATION

0.99+

todayDATE

0.99+

five o'ClockDATE

0.99+

YouTubeORGANIZATION

0.99+

seventh yearQUANTITY

0.99+

twenty years agoDATE

0.99+

first yearQUANTITY

0.99+

twentyQUANTITY

0.99+

decades agoDATE

0.99+

last yearDATE

0.99+

JonesLOCATION

0.98+

Andrews Air Force BaseLOCATION

0.98+

this yearDATE

0.98+

United States Air ForceORGANIZATION

0.98+

last yearDATE

0.98+

fiveQUANTITY

0.98+

five nine mugsQUANTITY

0.98+

first thingQUANTITY

0.98+

first thingQUANTITY

0.98+

first stepsQUANTITY

0.98+

this fallDATE

0.98+

DODORGANIZATION

0.98+

Enterprise ConnectORGANIZATION

0.97+

Department of the Air ForceORGANIZATION

0.97+

AFITORGANIZATION

0.97+

bothQUANTITY

0.97+

oneQUANTITY

0.97+

Department of ArmyORGANIZATION

0.97+

first timeQUANTITY

0.97+

eachQUANTITY

0.96+

CDO IQEVENT

0.96+

OneQUANTITY

0.95+

TwitterORGANIZATION

0.95+

Krishna Cheriath, Bristol Myers Squibb | MITCDOIQ 2020


 

>> From the Cube Studios in Palo Alto in Boston, connecting with thought leaders all around the world, this is a Cube Conversation. >> Hi everyone, this is Dave Vellante and welcome back to the Cube's coverage of the MIT CDOIQ. God, we've been covering this show since probably 2013, really trying to understand the intersection of data and organizations and data quality and how that's evolved over time. And with me to discuss these issues is Krishna Cheriath, who's the Vice President and Chief Data Officer, Bristol-Myers Squibb. Krishna, great to see you, thanks so much for coming on. >> Thank you so much Dave for the invite, I'm looking forward to it. >> Yeah first of all, how are things in your part of the world? You're in New Jersey, I'm also on the East coast, how you guys making out? >> Yeah, I think these are unprecedented times all around the globe and whether it is from a company perspective or a personal standpoint, it is how do you manage your life, how do you manage your work in these unprecedented COVID-19 times has been a very interesting challenge. And to me, what is most amazing has been, I've seen humanity rise up and so to our company has sort of snap to be able to manage our work so that the important medicines that have to be delivered to our patients are delivered on time. So really proud about how we have done as a company and of course, personally, it has been an interesting journey with my kids from college, remote learning, wife working from home. So I'm very lucky and blessed to be safe and healthy at this time. So hopefully the people listening to this conversation are finding that they are able to manage through their lives as well. >> Obviously Bristol-Myers Squibb, very, very strong business. You guys just recently announced your quarter. There's a biologics facility near me in Devon's, Massachusetts, I drive by it all the time, it's a beautiful facility actually. But extremely broad portfolio, obviously some COVID impact, but you're managing through that very, very well, if I understand it correctly, you're taking a collaborative approach to a COVID vaccine, you're now bringing people physically back to work, you've been very planful about that. My question is from your standpoint, what role did you play in that whole COVID response and what role did data play? >> Yeah, I think it's a two part as you rightly pointed out, the Bristol-Myers Squibb, we have been an active partner on the the overall scientific ecosystem supporting many different targets that is, from many different companies I think. Across biopharmaceuticals, there's been a healthy convergence of scientific innovation to see how can we solve this together. And Bristol-Myers Squibb have been an active participant as our CEO, as well as our Chief Medical Officer and Head of Research have articulated publicly. Within the company itself, from a data and technology standpoint, data and digital is core to the response from a company standpoint to the COVID-19, how do we ensure that our work continues when the entire global workforce pivots to a kind of a remote setting. So that really calls on the digital infrastructure to rise to the challenge, to enable a complete global workforce. And I mean workforce, it is not just employees of the company but the all of the third-party partners and others that we work with, the whole ecosystem needs to work. And I think our digital infrastructure has proven to be extremely resilient than that. From a data perspective, I think it is twofold. One is how does the core book of business of data continue to drive forward to make sure that our companies key priorities are being advanced. Secondarily, we've been partnering with a research and development organization as well as medical organization to look at what kind of real world data insights can really help in answering the many questions around COVID-19. So I think it is twofold. Main summary; one is, how do we ensure that the data and digital infrastructure of the company continues to operate in a way that allows us to progress the company's mission even during a time when globally, we have been switched to a remote working force, except for some essential staff from lab and manufacturing standpoint. And secondarily is how do we look at the real-world evidence as well as the scientific data to be a good partner with other companies to look at progressing the societal innovations needed for this. >> I think it's a really prudent approach because let's face it, sometimes one shot all vaccine can be like playing roulette. So you guys are both managing your risk and just as I say, financially, a very, very successful company in a sound approach. I want to ask you about your organization. We've interviewed many, many Chief Data Officers over the years, and there seems to be some fuzziness as to the organizational structure. It's very clear with you, you report in to the CIO, you came out of a technical bag, you have a technical degree but you also of course have a business degree. So you're dangerous from that standpoint. You got both sides which is critical, I would think in your role, but let's start with the organizational reporting structure. How did that come about and what are the benefits of reporting into the CIO? >> I think the Genesis for that as Bristol-Myers Squibb and when I say Bristol-Myers Squibb, the new Bristol-Myers Squibb is a combination of Heritage Bristol-Myers Squibb and Heritage Celgene after the Celgene acquisition last November. So in the Heritage Bristol-Myers Squibb acquisition, we came to a conclusion that in order for BMS to be able to fully capitalize on our scientific innovation potential as well as to drive data-driven decisions across the company, having a robust data agenda is key. Now the question is, how do you progress that? Historically, we had approached a very decentralized mechanism that made a different data constituencies. We didn't have a formal role of a Chief Data Officer up until 2018 or so. So coming from that realization that we need to have an effective data agenda to drive forward the necessary data-driven innovations from an analytic standpoint. And equally importantly, from optimizing our execution, we came to conclusion that we need an enterprise-level data organization, we need to have a first among equals if you will, to be mandated by the CEO, his leadership team, to be the kind of an orchestrator of a data agenda for the company, because data agenda cannot be done individually by a singular CDO. It has to be done in partnership with many stakeholders, business, technology, analytics, et cetera. So from that came this notion that we need an enterprise-wide data organization. So we started there. So for awhile, I would joke around that I had all of the accountabilities of the CDO without the lofty title. So this journey started around 2016, where we create an enterprise-wide data organization. And we made a very conscious choice of separating the data organization from analytics. And the reason we did that is when we look at the bowl of Bristol-Myers Squibb, analytics for example, is core and part of our scientific discovery process, research, our clinical development, all of them have deep data science and analytic embedded in it. But we also have other analytics whether it is part of our sales and marketing, whether it is part of our finance and our enabling functions they catch all across global procurement et cetera. So the world of analytics is very broad. BMS did a separation between the world of analytics and from the world of data. Analytics at BMS is in two modes. There is a central analytics organization called Business Insights and Analytics that drive most of the enterprise-level analytics. But then we have embedded analytics in our business areas, which is research and development, manufacturing and supply chain, et cetera, to drive what needs to be closer to the business idea. And the reason for separating that out and having a separate data organization is that none of these analytic aspirations or the business aspirations from data will be met if the world of data is, you don't have the right level of data available, the velocity of data is not appropriate for the use cases, the quality of data is not great or the control of the data. So that we are using the data for the right intent, meeting the compliance and regulatory expectations around the data is met. So that's why we separated out that data world from the analytics world, which is a little bit of a unique construct for us compared to what we see generally in the world of CDOs. And from that standpoint, then the decision was taken to make that report for global CIO. At Bristol-Myers Squibb, they have a very strong CIO organization and IT organization. When I say strong, it is from this lens standpoint. A, it is centralized, we have centralized the budget as well as we have centralized the execution across the enterprise. And the CDO reporting to the CIO with that data-specific agenda, has a lot of value in being able to connect the world of data with the world of technology. So at BMS, their Chief Data Officer organization is a combination of traditional CDO-type accountabilities like data risk management, data governance, data stewardship, but also all of the related technologies around master data management, data lake, data and analytic engineering and a nascent AI data and technology lab. So that construct allows us to be a true enterprise horizontal, supporting analytics, whether it is done in a central analytics organization or embedded analytics teams in the business area, but also equally importantly, focus on the world of data from operational execution standpoint, how do we optimize data to drive operational effectiveness? So that's the construct that we have where CDO reports to the CIO, data organization separated from analytics to really focus around the availability but also the quality and control of data. And the last nuance that is that at BMS, the Chief Data Officer organization is also accountable to be the Data Protection Office. So we orchestrate and facilitate all privacy-related actions across because that allows us to make sure that all personal data that is collected, managed and consumed, meets all of the various privacy standards across the world, as well as our own commitments as a company from across from compliance principles standpoint. >> So that makes a lot of sense to me and thank you for that description. You're not getting in the way of R&D and the scientists, they know data science, they don't need really your help. I mean, they need to innovate at their own pace, but the balance of the business really does need your innovation, and that's really where it seems like you're focused. You mentioned master data management, data lakes, data engineering, et cetera. So your responsibility is for that enterprise data lifecycle to support the business side of things, and I wonder if you could talk a little bit about that and how that's evolved. I mean a lot has changed from the old days of data warehouse and cumbersome ETL and you mentioned, as you say data lakes, many of those have been challenging, expensive, slow, but now we're entering this era of cloud, real-time, a lot of machine intelligence, and I wonder if you could talk about the changes there and how you're looking at and thinking about the data lifecycle and accelerating the time to insights. >> Yeah, I think the way we think about it, we as an organization in our strategy and tactics, think of this as a data supply chain. The supply chain of data to drive business value whether it is through insights and analytics or through operation execution. When you think about it from that standpoint, then we need to get many elements of that into an effective stage. This could be the technologies that is part of that data supply chain, you reference some of them, the master data management platforms, data lake platforms, the analytics and reporting capabilities and business intelligence capabilities that plug into a data backbone, which is that I would say the technology, swim lane that needs to get right. Along with that, what we also need to get right for that effective data supply chain is that data layer. That is, how do you make sure that there is the right data navigation capability, probably you make sure that we have the right ontology mapping and the understanding around the data. How do we have data navigation? It is something that we have invested very heavily in. So imagine a new employee joining BMS, any organization our size has a pretty wide technology ecosystem and data ecosystem. How do you navigate that, how do we find the data? Data discovery has been a key focus for us. So for an effective data supply chain, then we knew that and we have instituted our roadmap to make sure that we have a robust technology orchestration of it, but equally important is an effective data operations orchestration. Both needs to go hand in hand for us to be able to make sure that that supply chain is effective from a business use case and analytic use standpoint. So that has led us on a journey from a cloud perspective, since you refer that in your question, is we have invested very heavily to move from very disparate set of data ecosystems to a more converse cloud-based data backbone. That has been a big focus at the BMS since 2016, whether it is from a research and development standpoint or from commercialization, it is our word for the sales and marketing or manufacturing and supply chain and HR, et cetera. How do we create a converged data backbone that allows us to use that data as a resource to drive many different consumption patterns? Because when you imagine an enterprise of our size, we have many different consumers of the data. So those consumers have different consumption needs. You have deep data science population who just needs access to the data and they have data science platforms but they are at once programmers as well, to the other end of the spectrum where executives need pre-packaged KPIs. So the effective orchestration of the data ecosystem at BMS through a data supply chain and the data backbone, there's a couple of things for us. One, it drives productivity of our data consumers, the scientific researchers, analytic community or other operational staff. And second, in a world where we need to make sure that the data consumption appalls ethical standards as well as privacy and other regulatory expectations, we are able to build it into our system and process the necessary controls to make sure that the consumption and the use of data meets our highest trust advancements standards. >> That makes a lot of sense. I mean, converging your data like that, people always talk about stove pipes. I know it's kind of a bromide but it's true, and allows you to sort of inject consistent policies. What about automation? How has that affected your data pipeline recently and on your journey with things like data classification and the like? >> I think in pursuing a broad data automation journey, one of the things that we did was to operate at two different speed points. In a historically, the data organizations have been bundled with long-running data infrastructure programs. By the time you complete them, their business context have moved on and the organization leaders are also exhausted from having to wait from these massive programs to reach its full potential. So what we did very intentionally from our data automation journey is to organize ourselves in two speed dimensions. First, a concept called Rapid Data Lab. The idea is that recognizing the reality that the data is not well automated and orchestrated today, we need a SWAT team of data engineers, data SMEs to partner with consumers of data to make sure that we can make effective data supply chain decisions here and now, and enable the business to answer questions of today. Simultaneously in a longer time horizon, we need to do the necessary work of moving the data automation to a better footprint. So enterprise data lake investments, where we built services based on, we had chosen AWS as the cloud backbone for data. So how do we use the AWS services? How do we wrap around it with the necessary capabilities so that we have a consistent reference and technical architecture to drive the many different function journeys? So we organized ourselves into speed dimensions; the Rapid Data Lab teams focus around partnering with the consumers of data to help them with data automation needs here and now, and then a secondary team focused around the convergence of data into a better cloud-based data backbone. So that allowed us to one, make an impact here and now and deliver value from data to the dismiss here and now. Secondly, we also learned a lot from actually partnering with consumers of data on what needs to get adjusted over a period of time in our automation journey. >> It makes sense, I mean again, that whole notion of converged data, putting data at the core of your business, you brought up AWS, I wonder if I could ask you a question. You don't have to comment on specific vendors, but there's a conversation we have in our community. You have AWS huge platform, tons of partners, a lot of innovation going on and you see innovation in areas like the cloud data warehouse or data science tooling, et cetera, all components of that data pipeline. As well, you have AWS with its own tooling around there. So a question we often have in the community is will technologists and technology buyers go for kind of best of breed and cobble together different services or would they prefer to have sort of the convenience of a bundled service from an AWS or a Microsoft or Google, or maybe they even go best of breeds for all cloud. Can you comment on that, what's your thinking? >> I think, especially for organizations, our size and breadth, having a converged to convenient, all of the above from a single provider does not seem practical and feasible, because a couple of reasons. One, the heterogeneity of the data, the heterogeneity of consumption of the data and we are yet to find a single stack provider who can meet all of the different needs. So I am more in the best of breed camp with a few caveats, a hybrid best of breed, if you will. It is important to have a converged the data backbone for the enterprise. And so whether you invest in a singular cloud or private cloud or a combination, you need to have a clear intention strategy around where are you going to host the data and how is the data is going to be organized. But you could have a lot more flexibility in the consumption of data. So once you have the data converged into, in our case, we converged on AWS-based backbone. We allow many different consumptions of the data, because I think the analytic and insights layer, data science community within R&D is different from a data science community in the supply chain context, we have business intelligence needs, we have a catered needs and then there are other data needs that needs to be funneled into software as service platforms like the sales forces of the world, to be able to drive operational execution as well. So when you look at it from that context, having a hybrid model of best of breed, whether you have a lot more convergence from a data backbone standpoint, but then allow for best of breed from an analytic and consumption of data is more where my heart and my brain is. >> I know a lot of companies would be excited to hear that answer, but I love it because it fosters competition and innovation. I wish I could talk for you forever, but you made me think of another question which is around self-serve. On your journey, are you at the point where you can deliver self-serve to the lines of business? Is that something that you're trying to get to? >> Yeah, I think it does. The self-serve is an absolutely important point because I think the traditional boundaries of what you consider the classical IT versus a classical business is great. I think there is an important gray area in the middle where you have a deep citizen data scientist in the business community who really needs to be able to have access to the data and I have advanced data science and programming skills. So self-serve is important but in that, companies need to be very intentional and very conscious of making sure that you're allowing that self-serve in a safe containment sock. Because at the end of the day, whether it is a cyber risk or data risk or technology risk, it's all real. So we need to have a balanced approach between promoting whether you call it data democratization or whether you call it self-serve, but you need to balance that with making sure that you're meeting the right risk mitigation strategy standpoint. So that's how then our focus is to say, how do we promote self-serve for the communities that they need self-serve, where they have deeper levels of access? How do we set up the right safe zones for those which may be the appropriate mitigation from a cyber risk or data risk or technology risk. >> Security pieces, again, you keep bringing up topics that I could talk to you forever on, but I heard on TV the other night, I heard somebody talking about how COVID has affected, because of remote access, affected security. And it's like hey, give everybody access. That was sort of the initial knee-jerk response, but the example they gave as well, if your parents go out of town and the kid has a party, you may have some people show up that you don't want to show up. And so, same issue with remote working, work from home. Clearly you guys have had to pivot to support that, but where does the security organization fit? Does that report separate alongside the CIO? Does it report into the CIO? Are they sort of peers of yours, how does that all work? >> Yeah, I think at Bristol-Myers Squibb, we have a Chief Information Security Officer who is a peer of mine, who also reports to the global CIO. The CDO and the CSO are effective partners and are two sides of the coin and trying to advance a total risk mitigation strategy, whether it is from a cyber risk standpoint, which is the focus of the Chief Information Security Officer and whether it is the general data consumption risk. And that is the focus from a Chief Data Officer in the capacities that I have. And together, those are two sides of a coin that the CIO needs to be accountable for. So I think that's how we have orchestrated it, because I think it is important in these worlds where you want to be able to drive data-driven innovation but you want to be able to do that in a way that doesn't open the company to unwanted risk exposures as well. And that is always a delicate balancing act, because if you index too much on risk and then high levels of security and control, then you could lose productivity. But if you index too much on productivity, collaboration and open access and data, it opens up the company for risks. So it is a delicate balance within the two. >> Increasingly, we're seeing that reporting structure evolve and coalesce, I think it makes a lot of sense. I felt like at some point you had too many seats at the executive leadership table, too many kind of competing agendas. And now your structure, the CIO is obviously a very important position. I'm sure has a seat at the leadership table, but also has the responsibility for managing that sort of data as an asset versus a liability which my view, has always been sort of the role of the Head of Information. I want to ask you, I want to hit the Escape key a little bit and ask you about data as a resource. You hear a lot of people talk about data is the new oil. We often say data is more valuable than oil because you can use it, it doesn't follow the laws of scarcity. You could use data in infinite number of places. You can only put oil in your car or your house. How do you think about data as a resource today and going forward? >> Yeah, I think the data as the new oil paradigm in my opinion, was an unhealthy, and it prompts different types of conversations around that. I think for certain companies, data is indeed an asset. If you're a company that is focused on information products and data products and that is core of your business, then of course there's monetization of data and then data as an asset, just like any other assets on the company's balance sheet. But for many enterprises to further their mission, I think considering data as a resource, I think is a better focus. So as a vital resource for the company, you need to make sure that there is an appropriate caring and feeding for it, there is an appropriate management of the resource and an appropriate evolution of the resource. So that's how I would like to consider it, it is a personal end of one perspective, that data as a resource that can power the mission of the company, the new products and services, I think that's a good, healthy way to look at it. At the center of it though, a lot of strategies, whether people talk about a digital strategy, whether the people talk about data strategy, what is important is a company to have a pool north star around what is the core mission of the company and what is the core strategy of the company. For Bristol-Myers Squibb, we are about transforming patients' lives through science. And we think about digital and data as key value levers and drivers of that strategy. So digital for the sake of digital or data strategy for the sake of data strategy is meaningless in my opinion. We are focused on making sure that how do we make sure that data and digital is an accelerant and has a value lever for the company's mission and company strategy. So that's why thinking about data as a resource, as a key resource for our scientific researchers or a key resource for our manufacturing team or a key resource for our sales and marketing, allows us to think about the actions and the strategies and tactics we need to deploy to make that effective. >> Yeah, that makes a lot of sense, you're constantly using that North star as your guideline and how data contributes to that mission. Krishna Cheriath, thanks so much for coming on the Cube and supporting the MIT Chief Data Officer community, it was a really pleasure having you. >> Thank you so much for Dave, hopefully you and the audience is safe and healthy during these times. >> Thank you for that and thank you for watching everybody. This is Vellante for the Cube's coverage of the MIT CDOIQ Conference 2020 gone virtual. Keep it right there, we'll right back right after this short break. (lively upbeat music)

Published Date : Sep 3 2020

SUMMARY :

leaders all around the world, coverage of the MIT CDOIQ. I'm looking forward to it. so that the important medicines I drive by it all the time, and digital infrastructure of the company of reporting into the CIO? So that's the construct that we have and accelerating the time to insights. and the data backbone, and allows you to sort of and enable the business to in areas like the cloud data warehouse and how is the data is to the lines of business? in the business community that I could talk to you forever on, that the CIO needs to be accountable for. about data is the new oil. that can power the mission of the company, and supporting the MIT Chief and healthy during these times. of the MIT CDOIQ Conference

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Dave VellantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Bristol-Myers SquibbORGANIZATION

0.99+

New JerseyLOCATION

0.99+

AWSORGANIZATION

0.99+

DevonLOCATION

0.99+

Palo AltoLOCATION

0.99+

Rapid Data LabORGANIZATION

0.99+

2013DATE

0.99+

Krishna CheriathPERSON

0.99+

two sidesQUANTITY

0.99+

twoQUANTITY

0.99+

COVID-19OTHER

0.99+

CelgeneORGANIZATION

0.99+

FirstQUANTITY

0.99+

CubeORGANIZATION

0.99+

KrishnaPERSON

0.99+

Heritage Bristol-Myers SquibbORGANIZATION

0.99+

2018DATE

0.99+

both sidesQUANTITY

0.99+

BothQUANTITY

0.98+

BostonLOCATION

0.98+

2016DATE

0.98+

CDOTITLE

0.98+

two modesQUANTITY

0.98+

COVIDOTHER

0.98+

firstQUANTITY

0.98+

Bristol-Myers SquibbORGANIZATION

0.98+

last NovemberDATE

0.98+

Data Protection OfficeORGANIZATION

0.98+

OneQUANTITY

0.98+

two partQUANTITY

0.98+

SecondlyQUANTITY

0.98+

secondQUANTITY

0.98+

MITORGANIZATION

0.98+

bothQUANTITY

0.98+

MIT CDOIQ Conference 2020EVENT

0.97+

Heritage CelgeneORGANIZATION

0.97+

oneQUANTITY

0.97+

COVID-19 timesOTHER

0.96+

todayDATE

0.96+

BMSORGANIZATION

0.96+

single providerQUANTITY

0.95+

single stackQUANTITY

0.93+

Bristol Myers SquibbPERSON

0.93+

one shotQUANTITY

0.92+

Cube StudiosORGANIZATION

0.9+

one perspectiveQUANTITY

0.9+

Bristol-MyersORGANIZATION

0.9+

Business InsightsORGANIZATION

0.89+

two speedQUANTITY

0.89+

twofoldQUANTITY

0.84+

secondaryQUANTITY

0.8+

SecondarilyQUANTITY

0.77+

MIT CDOIQORGANIZATION

0.76+

MassachusettsLOCATION

0.75+

MITCDOIQ 2020EVENT

0.74+

VellantePERSON

0.72+

DataPERSON

0.71+

Chief Data OfficerPERSON

0.61+

Inderpal Bhandari, IBM | MIT CDOIQ 2020


 

>>from around the globe If the cube with digital coverage of M I t. Chief data officer and Information quality symposium brought to you by Silicon Angle Media >>Hello, everyone. This is Day Volonte and welcome back to our continuing coverage of the M I t. Chief Data Officer CDO I Q event Interpol Bhandari is here. He's a leading voice in the CDO community and a longtime Cubillan Interpol. Great to see you. Thanks for coming on for this. Especially >>program. My pleasure. >>So when you you and I first met, you laid out what I thought was, you know, one of the most cogent frameworks to understand what a CDO is job was where the priority should be. And one of those was really understanding how, how, how data contributes to the monetization of station aligning with lines of business, a number of other things. And that was several years ago. A lot of change since then. You know, we've been doing this conference since probably twenty thirteen and back then, you know, Hadoop was coming on strong. A lot of CEOs didn't want to go near the technology that's beginning to change. CDOs and cto Zehr becoming much more aligned at the hip. The reporting organizations have changed. But I love your perspective on what you've observed as changing in the CDO roll over the last half decade or so. >>Well, did you know that I became chief data officer in two thousand six? December two thousand and six And I have done this job four times four major overnight have created of the organization from scratch each time. Now, in December of two thousand six, when I became chief data officer, there were only four. Chief Data Officer, uh, boom and I was the first in health care, and there were three, three others, you know, one of the Internet one and credit guns one and banking. And I think I'm the only one actually left standing still doing this job. That's a good thing or a bad thing. But like, you know, it certainly has allowed me to love the craft and then also scripted down to the level that, you know, I actually do think of it purely as a craft. That is. I know, going into a mutual what I'm gonna do. They were on the central second. No, the interesting things that have unfolded. Obviously, the professions taken off There are literally thousands off chief data officers now, and there are plenty off changes. I think the main change, but the job is it's, I think, a little less daunting in terms off convincing the senior leadership that it's need it because I think the awareness at the CEO level is much, much, much better than what it waas in two thousand six. Across the world. Now, having said that, I think it is still only awareness and don't think that there's really a deep understanding of those levels. And so there's a lot off infusion, which is why you will. You kind of think this is my period. But you saw all these professions take off with C titles, right? Chief Data officer, chief analytics officer, chief digital officer and chief technology officer. See, I off course is being there for a long time. And but I think these newer see positions. They're all very, very related, and they all kind of went to the same need which had to do with enterprise transformation, digital transformation, that enterprises chief digital officer, that's another and and people were all trying to essentially feel the elephants and they could only see part of it at the senior levels, and they came up with which have a role you know, seemed most meaningful to them. But really, all of us are trying to do the same job, which is to accelerate digital transformation in the enterprise. Your comment about you kind of see that the seat eels and sea deals now, uh, partnering up much more than in the past, and I think that's in available the major driving force full. That is, in my view, anyway. It's is artificial intelligence as people try to infuse artificial intelligence. Well, then it's very technical field. Still, it's not something that you know you can just hand over to somebody who has the business jobs, but not the deep technical chops to pull that off. And so, in the case off chief data officers that do have the technical jobs, you'll see them also pretty much heading up the I effort in total and you know, as I do for the IBM case, will be building the Data and AI Enablement internal platform for for IBM. But I think in other cases you you've got Chief date officers who are coming in from a different angle. You know, they built Marghera but the CTO now, because they have to. Otherwise you cannot get a I infused into the organization. >>So there were a lot of other priorities, obviously certainly digital transformation. We've been talking about it for years, but still in many organisations, there was a sense of, well, not on my watch, maybe a sense of complacency or maybe just other priorities. Cove. It obviously has changed that now one hundred percent of the companies that we talked to are really putting this digital transformation on the front burner. So how has that changed the role of CDO? Has it just been interpolate an acceleration of that reality, or has it also somewhat altered the swim lanes? >>I think I think it's It's It's Bolt actually, so I have a way of looking at this in my mind, the CDO role. But if you look at it from a business perspective, they're looking for three things. The CEO is looking for three things from the CDO. One is you know this person is going to help with the revenue off the company by enabling the production of new products, new products of resulting in new revenue and so forth. That's kind of one aspect of the monetization. Another aspect is the CEO is going to help with the efficiency within the organization by making data a lot more accessible, as well as enabling insights that reduce into and cycle time for major processes. And so that's another way that they have monitor. And the last one is a risk reduction that they're going to reduce the risk, you know, as regulations. And as you have cybersecurity exposure on incidents that you know just keep keep accelerating as well. You're gonna have to also step in and help with that. So every CDO, the way their senior leadership looks at them is some mix off three. And in some cases, one has given more importance than the other, and so far, but that's how they are essentially looking at it now. I think what digital transformation has done is it's managed to accelerate, accelerate all three off these outcomes because you need to attend to all three as you move forward. But I think that the individual balance that's struck for individuals reveals really depends on their ah, their company, their situation, who their peers are, who is actually leading the transformation and so >>forth, you know, in the value pie. A lot of the early activity around CDO sort of emanated from the quality portions of the organization. It was sort of a compliance waited roll, not necessarily when you started your own journey here. Obviously been focused on monetization how data contributes to that. But But you saw that generally, organizations, even if they didn't have a CDO, they had this sort of back office alliance thing that has totally changed the the in the value equation. It's really much more about insights, as you mentioned. So one of the big changes we've seen in the organization is that data pipeline you mentioned and and cycle time. And I'd like to dig into that a little bit because you and I have talked about this. This is one of the ways that a chief data officer and the related organizations can add the most value reduction in that cycle time. That's really where the business value comes from. So I wonder if we could talk about that a little bit and how that the constituents in the stakeholders in that in that life cycle across that data pipeline have changed. >>That's a very good question. Very insightful questions. So if you look at ah, company like idea, you know, my role in totally within IBM is to enable Ibn itself to become an AI enterprise. So infuse a on into all our major business processes. You know, things like our supply chain lead to cash well, process, you know, our finance processes like accounts receivable and procurement that soulful every major process that you can think off is using Watson mouth. So that's the That's the That's the vision that's essentially what we've implemented. And that's how we are using that now as a showcase for clients and customers. One of the things that be realized is the data and Ai enablement spots off business. You know, the work that I do also has processes. Now that's the pipeline you refer to. You know, we're setting up the data pipeline. We're setting up the machine learning pipeline, deep learning blank like we're always setting up these pipelines, And so now you have the opportunity to actually turn the so called EI ladder on its head because the Islander has to do with a first You collected data, then you curated. You make sure that it's high quality, etcetera, etcetera, fit for EI. And then eventually you get to applying, you know, ai and then infusing it into business processes. And so far, But once you recognize that the very first the earliest creases of work with the data those themselves are essentially processes. You can infuse AI into those processes, and that's what's made the cycle time reduction. And although things that I'm talking about possible because it just makes it much, much easier for somebody to then implement ai within a lot enterprise, I mean, AI requires specialized knowledge. There are pieces of a I like deep learning, but there are, you know, typically a company's gonna have, like a handful of people who even understand what that is, how to apply it. You know how models drift when they need to be refreshed, etcetera, etcetera, and so that's difficult. You can't possibly expect every business process, every business area to have that expertise, and so you've then got to rely on some core group which is going to enable them to do so. But that group can't do it manually because I get otherwise. That doesn't scale again. So then you come down to these pipelines and you've got to actually infuse AI into these data and ai enablement processes so that it becomes much, much easier to scale across another. >>Some of the CEOs, maybe they don't have the reporting structure that you do, or or maybe it's more of a far flung organization. Not that IBM is not far flung, but they may not have the ability to sort of inject AI. Maybe they can advocate for it. Do you see that as a challenge for some CEOs? And how do they so to get through that, what's what's the way in which they should be working with their constituents across the organization to successfully infuse ai? >>Yeah, that's it's. In fact, you get a very good point. I mean, when I joined IBM, one of the first observations I made and I in fact made it to a senior leadership, is that I didn't think that from a business standpoint, people really understood what a I met. So when we talked about a cognitive enterprise on the I enterprise a zaydi em. You know, our clients don't really understand what that meant, which is why it became really important to enable IBM itself to be any I enterprise. You know that. That's my data strategy. Your you kind of alluded to the fact that I have this approach. There are these five steps, while the very first step is to come up with the data strategy that enables a business strategy that the company's on. And in my case, it was, Hey, I'm going to enable the company because it wants to become a cloud and cognitive company. I'm going to enable that. And so we essentially are data strategy became one off making IBM. It's something I enterprise, but the reason for doing that the reason why that was so important was because then we could use it as a showcase for clients and customers. And so But I'm talking with our clients and customers. That's my role. I'm really the only role I'm playing is what I call an experiential selling there. I'm saying, Forget about you know, the fact that we're selling this particular product or that particular product that you got GPU servers. We've got you know what's an open scale or whatever? It doesn't really matter. Why don't you come and see what we've done internally at scale? And then we'll also lay out for you all the different pain points that we have to work through using our products so that you can kind of make the same case when you when you when you apply it internally and same common with regard to the benefit, you know the cycle, time reduction, some of the cycle time reductions that we've seen in my process is itself, you know, like this. Think about metadata business metadata generating that is so difficult. And it's again, something that's critical if you want to scale your data because you know you can't really have a good catalogue of data if you don't have good business, meditate. Eso. Anybody looking at what's in your catalog won't understand what it is. They won't be able to use it etcetera. And so we've essentially automated business metadata generation using AI and the cycle time reduction that was like ninety five percent, you know, haven't actually argue. It's more than that, because in the past, most people would not. For many many data sets, the pragmatic approach would be. Don't even bother with the business matter data. Then it becomes just put somewhere in the are, you know, data architecture somewhere in your data leg or whatever, you have data warehouse, and then it becomes the data swamp because nobody understands it now with regard to our experience applying AI, infusing it across all our major business processes are average cycle time reduction is seventy percent, so just a tremendous amount of gains are there. But to your point, unless you're able to point to some application at scale within the enterprise, you know that's meaningful for the enterprise, Which is kind of what the what the role I play in terms of bringing it forward to our clients and customers. It's harder to argue. I'll make a case or investment into A I would then be enterprise without actually being able to point to those types of use cases that have been scaled where you can demonstrate the value. So that's extremely important part of the equation. To make sure that that happens on a regular basis with our clients and customers, I will say that you know your point is vomited a lot off. Our clients and customers come back and say, Tell me when they're having a conversation. I was having a conversation just last week with major major financial service of all nations, and I got the same point saying, If you're coming out of regulation, how do I convince my leadership about the value of a I and you know, I basically responded. He asked me about the scale use cases You can show that. But perhaps the biggest point that you can make as a CDO after the senior readership is can we afford to be left up? That is the I think the biggest, you know, point that the leadership has to appreciate. Can you afford to be left up? >>I want to come back to this notion of seventy percent on average, the cycle time reduction. That's astounding. And I want to make sure people understand the potential impacts. And, I would say suspected many CEOs, if not most understand sort of system thinking. It's obviously something that you're big on but often times within organisations. You might see them trying to optimize one little portion of the data lifecycle and you know having. Okay, hey, celebrate that success. But unless you can take that systems view and reduce that overall cycle time, that's really where the business value is. And I guess my we're real question around. This is Every organization has some kind of Northstar, many about profit, and you can increase revenue are cut costs, and you can do that with data. It might be saving lives, but ultimately to drive this data culture, you've got to get people thinking about getting insights that help you with that North Star, that mission of the company, but then taking a systems view and that's seventy percent cycle time reduction is just the enormous business value that that drives, I think, sometimes gets lost on people. And these air telephone numbers in the business case aren't >>yes, No, absolutely. It's, you know, there's just a tremendous amount of potential on, and it's it's not an easy, easy thing to do by any means. So we've been always very transparent about the Dave. As you know, we put forward this this blueprint right, the cognitive enterprise blueprint, how you get to it, and I kind of have these four major pillars for the blueprint. There's obviously does this data and you're getting the data ready for the consummation that you want to do but also things like training data sets. How do you kind of run hundreds of thousands of experiments on a regular basis, which kind of review to the other pillar, which is techology? But then the last two pillars are business process, change and the culture organizational culture, you know, managing organizational considerations, that culture. If you don't keep all four in lockstep, the transformation is usually not successful at an end to end level, then it becomes much more what you pointed out, which is you have kind of point solutions and the role, you know, the CEO role doesn't make the kind of strategic impact that otherwise it could do so and this also comes back to some of the only appointee of you to do. If you think about how do you keep those four pillars and lock sync? It means you've gotta have the data leader. You also gotta have the technology, and in some cases they might be the same people. Hey, just for the moment, sake of argument, let's say they're all different people and many, many times. They are so the data leader of the technology of you and the operations leaders because the other ones own the business processes as well as the organizational years. You know, they've got it all worked together to make it an effective conservation. And so the organization structure that you talked about that in some cases my peers may not have that. You know, that's that. That is true. If the if the senior leadership is not thinking overall digital transformation, it's going to be difficult for them to them go out that >>you've also seen that culturally, historically, when it comes to data and analytics, a lot of times that the lines of business you know their their first response is to attack the quality of the data because the data may not support their agenda. So there's this idea of a data culture on, and I want to ask you how self serve fits into that. I mean, to the degree that the business feels as though they actually have some kind of ownership in the data, and it's largely, you know, their responsibility as opposed to a lot of the finger pointing that has historically gone on. Whether it's been decision support or enterprise data, warehousing or even, you know, Data Lakes. They've sort of failed toe live up to that. That promise, particularly from a cultural standpoint, it and so I wonder, How have you guys done in that regard? How did you get there? Many Any other observations you could make in that regard? >>Yeah. So, you know, I think culture is probably the hardest nut to crack all of those four pillars that I back up and you've got You've got to address that, Uh, not, you know, not just stop down, but also bottom up as well. As you know, period. Appear I'll give you some some examples based on our experience, that idea. So the way my organization is set up is there is a obviously a technology on the other. People who are doing all the data engineering were kind of laying out the foundational technical elements or the transformation. You know, the the AI enabled one be planning networks, and so so that are those people. And then there is another senior leader who reports directly to me, and his organization is all around adoptions. He's responsible for essentially taking what's available in the technology and then working with the business areas to move forward and make this make and infuse. A. I do the processes that the business and he is looking. It's done in a bottom upwards, deliberately set up, designed it to be bottom up. So what I mean by that is the team on my side is fully empowered to move forward. Why did they find a like minded team on the other side and go ahead and do it? They don't have to come back for funding they don't have, You know, they just go ahead and do it. They're basically empowered to do that. And that particular set up enabled enabled us in a couple of years to have one hundred thousand internal users on our Central data and AI enabled platform. And when I mean hundred thousand users, I mean users who were using it on a monthly basis. We company, you know, So if you haven't used it in a month, we won't come. So there it's over one hundred thousand, even very rapidly to that. That's kind of the enterprise wide storm. That's kind of the bottom up direction. The top down direction Waas the strategic element that I talked with you about what I said, Hey, be our data strategy is going to be to create, make IBM itself into any I enterprise and then use that as a showcase for plants and customers That kind of and be reiterated back. And I worked the senior leadership on that view all the time talking to customers, the central and our senior leaders. And so that's kind of the air cover to do this, you know, that mix gives you, gives you that possibility. I think from a peer to peer standpoint, but you get to these lot scale and to end processes, and that there, a couple of ways I worked that one way is we've kind of looked at our enterprise data and said, Okay, therefore, major pillars off data that we want to go after data, tomato plants, data about our offerings, data about financial data, that s and then our work full student and then within that there are obviously some pillars, like some sales data that comes in and, you know, been workforce. You could have contractors. Was his employees a center But I think for the moment, about these four major pillars off data. And so let me map that to end to end large business processes within the company. You know, the really large ones, like Enterprise Performance Management, into a or lead to cash generation into and risk insides across our full supply chain and to and things like that. And we've kind of tied these four major data pillars to those major into and processes Well, well, yes, that there's a mechanism they're obviously in terms off facilitating, and to some extent one might argue, even forcing some interaction between teams that are the way they talk. But it also brings me and my peers much closer together when you set it up that way. And that means, you know, people from the HR side people from the operation side, the data side technology side, all coming together to really move things forward. So all three tracks being hit very, very hard to move the culture fall. >>Am I also correct that you have, uh, chief data officers that reporting to you whether it's a matrix or direct within the division's? Is that right? >>Yeah, so? So I mean, you know, for in terms off our structure, as you know, way our global company, we're also far flung company. We have many different products in business units and so forth. And so, uh, one of the things that I realized early on waas we are going to need data officers, each of those business units and the business units. There's obviously the enterprise objective. And, you know, you could think of the enterprise objectives in terms of some examples based on what I said in the past, which is so enterprise objective would be We've gotta have a data foundation by essentially making data along these four pillars. I talked about clients offerings, etcetera, you know, very accessible self service. You have mentioned south, so thank you. This is where the South seven speaks. Comes it right. So you can you can get at that data quickly and appropriately, right? You want to make sure that the access control, all that stuff is designed out and you're able to change your policies and you'd swap manual. But, you know, those things got implemented very rapidly and quickly. And so you've got you've got that piece off off the off the puzzle due to go after. And then I think the other aspect off off. This is, though, when you recognize that every business unit also has its own objectives and they are looking at some of those things somewhat differently. So I'll give you an example. We've got data any our product units. Now, those CEOs right there, concern is going to be a lot more around the products themselves And how were monetizing those box and so they're not per se concerned with, You know, how you reduce the enter and cycle time off IBM in total supply chain so that this is my point. So they but they're gonna have substantial considerations and objectives that they want to accomplish. And so I recognize that early on, and we came up with this notion off a data officer council and I helped staff the council s. So this is why that's the Matrix to reporting that we talked about. But I selected some of the key Blair's that we have in those units, and I also made sure they were funded by the unit. So they report into the units because their paycheck is actually determined. Pilot unit and which makes them than aligned with the objectives off the unit, but also obviously part of my central approach so that I can disseminate it out to the organization. It comes in very, very handy when you are trying to do things across the company as well. So when we you know GDP our way, we have to get the company ready for Judy PR, I would say that this mechanism became a key key aspect of what enabled us to move forward and do it rapidly. Trouble them >>be because you had the structure that perhaps the lines of business weren't. Maybe is concerned about GDP are, but you had to be concerned with it overall. And this allowed you to sort of hiding their importance, >>right? Because think of in the case of Jeannie PR, they have to be a company wide policy and implementation, right? And if he did not have that structure already in place, it would have made it that much harder. Do you get that uniformity and consistency across the company, right, You know, So you will have to in the weapon that structure, but we already have it because way said Hey, this is around for data. We're gonna have these types of considerations that they are. And so we have this thing regular. You know, this man network that meat meets regularly every month, actually, and you know, when things like GDP are much more frequently than that, >>right? So that makes sense. We're out of time. But I wonder if we could just close if you could address the M I t CDO audience that probably this is the largest audience, Believe or not, now that it's that's virtual definitely expanded the audience, but it's still a very elite group. And the reason why I was so pleased that you agreed to do this is because you've got one of the more complex organizations out there and you've succeeded. And, ah, a lot of the hard, hard work. So what? What message would you leave the M I t CDO audience Interpol? >>So I would say that you know, it's it's this particular professional. Receiving a profession is, uh, if I have to pick one trait of let me pick two traits, I think what is your A change agent? So you have to be really comfortable with change things are going to change, the organization is going to look to you to make those changes. And so that's what aspect off your job, you know, may or may not be part of me immediately. But the those particular set of skills and characteristics and something that you know, one has to, uh one has to develop or time, And I think the other thing I would say is it's a continuous looming jaw. So you continue sexism and things keep changing around you and changing rapidly. And, you know, if you just even think just in terms off the subject areas, I mean this Syria today you've got to understand technology. Obviously, you've gotta understand data you've got to understand in a I and data science. You've got to understand cybersecurity. You've gotta understand the regulatory framework, and you've got to keep all that in mind, and you've got to distill it down to certain trends. That's that's happening, right? I mean, so this is an example of that is that there's a trend towards more regulation around privacy and also in terms off individual ownership of data, which is very different from what's before the that's kind of weather. Bucket's going and so you've got to be on top off all those things. And so the you know, the characteristic of being a continual learner, I think is a is a key aspect off this job. One other thing I would add. And this is All Star Coleman nineteen, you know, prik over nineteen in terms of those four pillars that we talked about, you know, which had to do with the data technology, business process and organization and culture. From a CDO perspective, the data and technology will obviously from consent, I would say most covert nineteen most the civil unrest. And so far, you know, the other two aspects are going to be critical as we move forward. And so the people aspect of the job has never bean, you know, more important down it's today, right? That's something that I find myself regularly doing the stalking at all levels of the organization, one on a one, which is something that we never really did before. But now we find time to do it so obviously is doable. I don't think it's just it's a change that's here to stay, and it ships >>well to your to your point about change if you were in your comfort zone before twenty twenty two things years certainly taking you out of it into Parliament. All right, thanks so much for coming back in. The Cuban addressing the M I t CDO audience really appreciate it. >>Thank you for having me. That my pleasant >>You're very welcome. And thank you for watching everybody. This is Dave a lot. They will be right back after this short >>break. You're watching the queue.

Published Date : Sep 3 2020

SUMMARY :

to you by Silicon Angle Media Great to see you. So when you you and I first met, you laid out what I thought was, you know, one of the most cogent frameworks and they came up with which have a role you know, seemed most meaningful to them. So how has that changed the role of CDO? And the last one is a risk reduction that they're going to reduce the risk, you know, So one of the big changes we've seen in the organization is that data pipeline you mentioned and and Now that's the pipeline you refer that you do, or or maybe it's more of a far flung organization. That is the I think the biggest, you know, and you know having. and the role, you know, the CEO role doesn't make the kind of strategic impact and it's largely, you know, their responsibility as opposed to a lot of the finger pointing that has historically gone And that means, you know, people from the HR side people from the operation side, So I mean, you know, for in terms off our structure, as you know, And this allowed you to sort of hiding their importance, and consistency across the company, right, You know, So you will have to in the weapon that structure, And the reason why I was so pleased that you agreed to do this is because you've got one And so the you know, the characteristic of being a two things years certainly taking you out of it into Parliament. Thank you for having me. And thank you for watching everybody. You're watching the queue.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
IBMORGANIZATION

0.99+

seventy percentQUANTITY

0.99+

DecemberDATE

0.99+

Inderpal BhandariPERSON

0.99+

seventy percentQUANTITY

0.99+

threeQUANTITY

0.99+

first stepQUANTITY

0.99+

five stepsQUANTITY

0.99+

ninety five percentQUANTITY

0.99+

two thousandQUANTITY

0.99+

Silicon Angle MediaORGANIZATION

0.99+

hundred thousand usersQUANTITY

0.99+

last weekDATE

0.99+

DavePERSON

0.99+

thousandsQUANTITY

0.99+

one hundred thousandQUANTITY

0.99+

OneQUANTITY

0.99+

one hundred percentQUANTITY

0.99+

fourQUANTITY

0.99+

firstQUANTITY

0.99+

oneQUANTITY

0.99+

two traitsQUANTITY

0.98+

eachQUANTITY

0.98+

NorthstarORGANIZATION

0.98+

two aspectsQUANTITY

0.98+

todayDATE

0.98+

four pillarsQUANTITY

0.97+

first responseQUANTITY

0.97+

North StarORGANIZATION

0.97+

SyriaLOCATION

0.97+

three thingsQUANTITY

0.97+

secondQUANTITY

0.96+

over one hundred thousandQUANTITY

0.95+

several years agoDATE

0.95+

one traitQUANTITY

0.94+

sixQUANTITY

0.93+

yearsQUANTITY

0.93+

nineteenQUANTITY

0.93+

one wayQUANTITY

0.93+

four major pillarsQUANTITY

0.92+

last half decadeDATE

0.92+

IbnORGANIZATION

0.92+

InterpolPERSON

0.91+

BhandariPERSON

0.91+

first observationsQUANTITY

0.91+

each timeQUANTITY

0.9+

MITORGANIZATION

0.9+

hundreds of thousands of experimentsQUANTITY

0.89+

CDOTITLE

0.89+

two pillarsQUANTITY

0.87+

a monthQUANTITY

0.86+

one aspectQUANTITY

0.86+

twenty thirteenDATE

0.85+

JeanniePERSON

0.84+

two thingsQUANTITY

0.83+

four pillarsQUANTITY

0.82+

2020DATE

0.8+

Tallapragada and Hartman for review


 

>>from around the globe. It's >>the Cube with digital coverage of >>AWS Public Sector Partner Awards >>brought to you by >>Amazon Web services. Everyone, welcome to this cube coverage of AWS Public Sector Partner Awards program. I'm John Furrow, your host of the Cube with two great guests here. Travis Department director of analytics and Weather at Max. Our technologies and VJ teleplay Gotta Who's the chief? Modeling and data a simulation branch at Noah. Tell us about the success of this. What's the big deal? Take us through the award and why Max are what you guys do. >>Yeah, so Macs are is an organization. Does a lot of different activities unearth intelligence as well as space? We have about 4000 employees around the world. One side of the economy works on space infrastructure, actually building satellites on all the infrastructure that's going to help us get us back to the moon and things like that. And then on the other side we have a north of intelligence group, which is where, I said, and we leverage remote sensing information for science information to help people better understand how, how and what they do might impact the Earth or have the earth, and it's activities might impact their business mission. Our operation. So what we wanted to set out to do was help people better understand how weather could impact their mission, business or operations. And a big element of that was doing it with speed. Ah, so we we knew? No. I had capabilities running America weather prediction models and very traditional on Prem. Big, beefy ah, high performance compute supercomputers. But we wanted to do it in The cloud we want to do is AWS is a key part. So we collaborated with B. J and Noah and his team is there to help pull that off. They gave this access public domain information, but they showed us the right places to look. We've had some of the research scientists talking, and after pretty short effort, it didn't take a lot of time. We were able to pull something off that a lot of people didn't think was possible. I'm we got pretty excited. Once we saw some of the outcome >>Travis to be, Jay was just mentioning the relationship. Can you talk about the relationship together because this is not your classic Amazon partner client relationship that you have. You guys have been partnering together V. J and your team with AWS. Talk about the relationship and that and how Amazon plays because it's a unique partnership plane in more detail at specific relationship. >>Yeah, with Max or in AWS. You know, our partnership has gone back A number of years on Macs are being a fairly large organization. There's lots of different activities. I think Max Star was the first client of AWS Snowmobile, where they have the big tractor trailer back up to a data center, load all the data in and then take it to an AWS data center. We were the first users of that because we had over 100 petabytes of satellite imagery and archive that just moving across the Internet would probably still be going. Um, so the snowmobile is a good success story for us, but just with >>the >>amount of data that we have, the amount of data we collect every day and all the analytics that we're running on it, whether it's in an HPC environment or, you know, the scalable Ai ml were able to scale out that architecture scale out that compute the much easier, dynamic and really cost effective way with AWS, because when we don't need to use the machines, we turn them off. We don't have a big data center sitting somewhere. We have to have security, have all the overhead costs of just keeping the lights on. Literally. AWS allows us to run our organization and a much more efficient way. Um and Noah, you know, they're They're seeing some of that same success story that we're seeing as far as how they can use the cloud for accelerating research, accelerating how the advancement of numerical weather prediction from the United States can benefit from cloud from cloud architecture, cloud computer, things like that. And I think a lot of the stuff that we've done here, Max our with our HPC HPC solution in the cloud. It's something that's pretty interesting to know, and it's it's a good opportunity for us to continue our collaboration. >>If I could drill down on that solution architecture for a minute. How did you guys set up the services, and what lessons did you learn from that process? >>We're still learning. It was probably the the short answer, but it all started with our people. Uh, you know, we have some really strong engineers, really strong data scientists that fundamentally have a background in meteorology or atmospheric science, you know? So they understand the physics. So you know why the wind blows is the way it doesn't. Why Cloud's doing clouds to do, Um, but we also having a key strategic partnership with AWS. We really have to tap into some of their subject matter experts. And we really put those people together, you know, and come up with new solutions, new innovative ideas, stuff that people hadn't tried before. We're able to steer a little bit of AWS is product roadmap for is what we were trying to do and how their current technology might not have been able to support it. But by interacting with us gave them some ideas as far as what the tech had to move towards. And then that's that's what allowed us to move pretty quick fashion. Um, you know, it's it's neat stuff technology, but it really comes down to the people. Um, and I feel very honored and privileged to work with both great people here. Attacks are as well as aws, um, as well as being able to collaborate with your great teams. That power, it's been a lot of fun. Well, >>Travis gonna create example? I think it's a template that could be applied to many other areas, certainly even beyond. You've got large scale, multi scale situation there. Congratulations. Final question. What does it mean to be an award winner for AWS Partner Awards as part of the show? You're the best in show for HPC. What's it like? What's the feeling? Give us a quick side from the field? >>Yeah. I mean, I don't know if there's really a lot of good words that kind of sum it up. It's Ah, I shared the news with the team last night, and you know, there are a lot of a lot of good responses that came from a lot of people think it's cool. And at the end of the day, a lot of people on our team, you know, took a hobby or a passion of weather and turned it into a career. Ah, and being acknowledged and recognized by groups like AWS for best solution in a particular thing. Um, I think we take a lot of that to heart. And, ah, we're very honored and proud of what we were able to do and proud that other people recognize the need stuff that we're doing well, >>Certainly taking advantage. The cloud, which is large scale. But you you're on a great wave. You've got a great area. I mean, whether you talk about whether it's exciting, it's dynamic. It's always changing. It's big data. It's large scale. So you get a lot of problems to solve in a lot of impact to get it right. So congratulations on ECs. >>Thank you very much. Great mission. Thank you. >>Love what you do love to follow up again. Maybe do another interview and talk about the impact of weather and all the HPC kind of down the road. But, Travis, thank you very much. >>Thank you. Appreciate it. >>Good to see you. >>Thank you. Good to be here. >>So Noah, National Oceanic Atmospheric Administration, National Weather Center, National Center for Environmental Predictions, Environmental Modeling Center year. That's your organization? You guys are competing to be best in the world. Tell us what you guys do at a high level. Then we'll jump into some of the successes. >>So the national Weather Service is responsible for providing weather forecast to save lives and property and improve the economy of the nation. And that's part of that. That the national weather services responsible for providing data and also the forecasts to the public and the industry and be responsible for providing the guidance on how they create the forecasts. So we are at the Environmental Modeling Center, uh, the nation's finest institute in advancing our numerical weather prediction modelling development, and you play it off all the data that's available from the world to initialize our models and provide the future state of the atmosphere from hours all the way to seasons and years. That's that's the kind of a range of products that we don't lock and provide are our key for managing the emergency services and patch it management and mitigation and also improving the nation's economy by preparing well in advance for the future events. And it's it's a science based organization, and we have ah well class scientists working in this organization. I manage about 170 of them at the moment of modeling center. They're all PhDs from various disciplines, mostly from meteorology, atmospheric sciences, oceanography, land surface modelling space weather, all weather related areas and the mathematics and computer science. And we are at the stage where we are probably the most. Uh huh. Most developed, uh, advanced modelling center that we use almost all possible computational resources available in the world. So this is a really computational in terms of user data, user computer seems off. Uh, all the power that we can get and we have a 3.5 petaflop machine that we use to provide these weather forecasts, and they provide the services every hour. For some sense is like the CDO rather our rates for every three hours for hurricanes and for every six hours for the regular, Rather like the participation, uh, the temperature forecast. So all the data that you see coming out from either the public media, our department agencies, they are originated in our center and disseminated in various forms. I think no one is the only center in the world that provides all this information for your past. So it is, ah, public service organization and we riding on a visa with society. >>We'll I love your title, Chief modeling and data, a simulation title branch of a lot of these organizations. This >>is >>whether it's ever critical. I want to get your thoughts cause we were talking before we came on about how the Hurricane Katrina was something that really kind of forcing you to rethink things. Whether it is an evolving system, it's always changing. Either the catastrophe or something happens. Were you trying to proactive predicting, say, whether it's a fire season in California, all kinds of things going on that's not It's always hard to get a certain prediction. You have big job. It's a lot of data you need. Horsepower need computing. You need to stand up. Some HPC take us through like like the thinking around the organization. And what was The impact is that you see, because whether does have that impact. >>So traditionally, you know, as you mentioned, there are radius weather phenomenon that you describe like the five rather the Americans, every presentation, the flooding. So we developed solutions for individual weather phenomena, and, uh, we have grown in that direction by developing separate solutions for separate problems. And very soon it became obvious that we cannot manage all these independent modeling systems to provide the best possible forecasts. So the thinking has to be changed. And then there is Another big problem is that there's a lot of research going out in the community like the academic institutes, the universities, other government labs. There are several people working in these areas, and all their work is not necessarily a coordinated, uh, development activity that we cannot take advantage. And they have no incentive for people to come and contribute towards the mission that we are engaged in. So that actually prompted to change the direction of thinking. And as you mentioned, Hurricane Katrina was an eye opener. We had the best forecasts, but the dissemination of that information waas not probably accurate enough, and also there is a lot of room for improvement in predicting these catastrophic events. How are >>you guys using AWS? Because HPC high performance computing I mean you can't ask for more resources in the massive cloud that is Amazon. How is that help to you? Can you take a minute to explain, but walk us through? >>What? >>Aws? There >>are a few example. Second site. But before then, I would like to really appreciate a Travis Hartman from Max. Are you know who is probably the only private sector partner that we had in the beginning. And now we're expanding on. That s so we were able to share our community. Cores with Max are and without how they were able to establish this and drive modeling system as it is done in operations that Noah and they were able to reproduce operational forecast using the cloud resources. And then they went ahead and did even more by scaling the modeling systems is that it can run even faster and quicker them are what insert no operations can do. So that gives us one example of how the cloud can be used. You know, the same forecast that we produce, ah, globally, which will take about eight minutes per day. And, uh, Max I was able to do it much faster, like 50% improvement and in the efficiency of the colors. And now the one piece of this is that the improvements that matter are other collaborators are using, or cords that they're putting into the system are coming back to us. So we take advantage of that, improving the efficiency in operations. So this is that this is like a win win situation for both, uh, who are participating in the R and D on who are using it in operations, and on top of it, you can create multiple configurations of this model in various instances on the cloud when you can run it more efficiently and you can create an ensemble of solutions that can be captured toe individual needs. And the one additional thing I want to mention about User Cloud is, is that you know, this is like when you have a need, you can search the compute you can. Instead she 8000 sub simulations to test a new innovation. For instance, you don't need to wait for the resources to be done in a sequential manner. Instead, you can ramp up the production off these apartments in no kind and without Don't worry about. Of course, the cost is the fact that we need to worry about, but otherwise the capacity is there. The facilities are reacting to take advantage of the cloud solutions. If I'm a >>computer scientist person, I'm working on a project. Now I have all this goodness in the cloud, how's morale been and what's the reaction been like from from people doing the work. Because usually the bottleneck has been like I gotta provision resource. I gotta send a procurement request for some servers or I want to really push some load. And right now, I got a critical juncture. I mean, it's got a push morale up a bit, and you talk about the impact to the psychology of the people in your organization. >>Um, I haven't. I have two answers to this question. One from a scientist perspective like me. You know, I was not a computer scientist from the beginning, but I became a software engineer, kind of because I have to work with these software and hardware stuff more more on solving the computational problems than the critical problems. So people like us who have invested their careers in improving the science, they were not care whether it's ah, uh hbc on premise Cloud, what will be delighted to have, uh, resources available alleviate that they can drive. But on the other hand, the computer computational engineers are software engineers who are entering into this field. I think they are probably the most excited because of these emerging opportunities. And so there is a kind of a friction between the scientific and the computational aspects off personnel, I would say. But that difference is slowly raising on and we are working together as never before. So the collective moral is very high to take advantage of these resources and opportunities. I think way of making the we're going in the right direction. >>It's so much faster. I mean, in the old days, you write a paper, you got to get some traction. Gonna do a pilot now It's like you run an experiment, get it out there. VJ I'm very impressed with the organization. Love to do a follow up with you. I love the impact that you're doing certainly in the weather impact society from forecasting disasters and giving people the ability to look at supply chain, whether it's providing for potentially a fire season or water shortage or anything going on there. But also it's a template. You're exceeding a new kind of waiting to innovate with community with large scale, multi scale data points. So congratulations and >>thank you. >>Thank you very much. I'm John Furrier here part of AWS partner Awards program. Best HPC solution. Great. Great Example. Great use case. Great conversation. Thanks for watching two great interviews. Here is part of AWS Public Sector Partner Awards program. I'm John Furrier. The best in show for HPC Solutions. China's Hartman Max, our technologies and Vijay tell Apartado at Noah. Two great guests. Thanks for watching. Yeah, Yeah, yeah, yeah, yeah, yeah

Published Date : Jul 31 2020

SUMMARY :

from around the globe. What's the big deal? We have about 4000 employees around the world. Talk about the relationship and that and how Amazon plays because it's a unique partnership plane of satellite imagery and archive that just moving across the Internet would probably still be going. that compute the much easier, dynamic and really cost effective way with set up the services, and what lessons did you learn from that process? And we really put those people together, you know, and come up with new solutions, You're the best in show for HPC. And at the end of the day, a lot of people on our team, you know, I mean, whether you talk about whether it's exciting, it's dynamic. Thank you very much. Maybe do another interview and talk about the impact Thank you. Good to be here. what you guys do at a high level. So all the data that you see coming out from branch of a lot of these organizations. And what was The impact is that you see, So the thinking has to be changed. Can you take a minute to explain, but walk us through? You know, the same forecast that we produce, it's got a push morale up a bit, and you talk about the impact to the psychology of the people in your organization. So the collective moral is very high to I mean, in the old days, you write a paper, you got to get some traction. Thank you very much.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JayPERSON

0.99+

AWSORGANIZATION

0.99+

John FurrowPERSON

0.99+

National Oceanic Atmospheric AdministrationORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

TravisPERSON

0.99+

50%QUANTITY

0.99+

National Weather CenterORGANIZATION

0.99+

John FurrierPERSON

0.99+

VijayPERSON

0.99+

National Center for Environmental PredictionsORGANIZATION

0.99+

CaliforniaLOCATION

0.99+

earthLOCATION

0.99+

TallapragadaPERSON

0.99+

EarthLOCATION

0.99+

NoahPERSON

0.99+

fiveQUANTITY

0.99+

over 100 petabytesQUANTITY

0.99+

Hurricane KatrinaEVENT

0.99+

bothQUANTITY

0.99+

Environmental Modeling CenterORGANIZATION

0.99+

United StatesLOCATION

0.99+

OneQUANTITY

0.99+

VJPERSON

0.99+

first usersQUANTITY

0.98+

B. JPERSON

0.98+

two answersQUANTITY

0.98+

Hurricane KatrinaEVENT

0.98+

One sideQUANTITY

0.98+

last nightDATE

0.98+

HartmanPERSON

0.98+

oneQUANTITY

0.98+

Second siteQUANTITY

0.98+

AmericaLOCATION

0.97+

ChinaLOCATION

0.97+

about 4000 employeesQUANTITY

0.97+

Hartman MaxPERSON

0.97+

Two great guestsQUANTITY

0.97+

one pieceQUANTITY

0.97+

MaxPERSON

0.97+

V. JPERSON

0.96+

first clientQUANTITY

0.96+

two great guestsQUANTITY

0.96+

one exampleQUANTITY

0.95+

ApartadoPERSON

0.94+

national Weather ServiceORGANIZATION

0.93+

Travis HartmanPERSON

0.93+

MacsORGANIZATION

0.93+

NoahORGANIZATION

0.93+

HPC SolutionsORGANIZATION

0.91+

AWS Public Sector Partner AwardsEVENT

0.89+

about eight minutes per dayQUANTITY

0.89+

AmericansPERSON

0.89+

CubeCOMMERCIAL_ITEM

0.89+

every six hoursQUANTITY

0.88+

every three hoursQUANTITY

0.84+

Environmental Modeling Center yearORGANIZATION

0.84+

snowmobileORGANIZATION

0.83+

8000 sub simulationsQUANTITY

0.81+

HPCORGANIZATION

0.81+

two great interviewsQUANTITY

0.81+

MacsCOMMERCIAL_ITEM

0.8+

3.5 petaflop machineQUANTITY

0.8+

Amazon WebORGANIZATION

0.8+

about 170 ofQUANTITY

0.79+

AWS Partner AwardsEVENT

0.73+

peopleQUANTITY

0.73+

Max StarPERSON

0.65+

national weather servicesORGANIZATION

0.64+

Public Sector Partner AwardsTITLE

0.62+

Max.ORGANIZATION

0.61+

thingQUANTITY

0.58+

moonLOCATION

0.55+

Amit Walia, Informatica | CUBE Conversation, May 2020


 

>> Presenter: From theCUBE Studios in Palo Alto and Boston, connecting with Dot leaders all around the world. This is a CUBE conversation. >> Everyone welcome to theCUBE studio here in Palo Alto. I'm John Furrier, host of theCUBE. We're here with our quarantine crew. We've been here for three months quarantining but we're getting the stories out. We're talking to all of our favorite guests and most important stories in technologies here remotely and we have a great conversation in store for you today with Amit Walia CEO of Informatica. Cube alumni, frequent guest of theCUBE, now, the CEO of Informatica. Amit, great to see you. Thank you for coming on this CUBE conversation. >> Good to see you John. It's different to be doing this like this versus being in the studio with you but I'm glad that we could leverage technology to still talk to each other. >> You're usually right here, right next to me, but I'm glad to get you remotely at least and I really appreciate you. You always have some great commentary and insights. And Amit, before we get into the real meaty stuff that I'd love around the data, I want to get your thoughts on this COVID-19 crisis. It's a new reality, it's highlighted as we've been reporting on SiliconANGLE for the past few months. The at scale problems that people are facing but it's also an opportunity. People are sheltered in place, there's a lot of anxiety on what their work environment is going to look like but the world still runs. Your thoughts on the current crisis and how you're looking at it, how you're navigating it as a leader. >> No doubt, it is a very unique situation we all live in. We've never all faced something like this. So I think first of all, I'll begin by expressing my prayers for anyone out there who has been impacted by it and of course, a huge round of thank you to all the heroes out there at the front lines. The healthcare workers, the doctors, the nurses (mumbles) so we can't forget that. These are very unique situations but as you said, let's not forget that this is a health crisis first and then it becomes an economic crisis. And then, as you said there is a tremendous amount of disruption and (mumbles) I think all of them will go through some phases and I think you can see already while there is disruption in front of us, you see the digital contents of organizations who are ready for that have definitely faced it lot better but as obviously the ones that have been somewhat in the previous generations, let's just say business models or technologies models are struggling through it. So there is a lot data chain. I think they're still learning. We're absolutely still learning and we will continue to learn til the end of this year and we'll come out very different for the next decade for sure. >> If anyone who's watching goes to YouTube on the SiliconANGLE CUBE and look at your videos over the years, we've been talking about big data and these transformational things. It's been an inside the industry kind of discussion. Board room for your clients and your business and Informatica but I think this is now showing the world this digital transformation. The future has been pulled forward faster than people have been expecting it and innovation strategy has been on paper, maybe some execution but now I think it's apparent to everyone that the innovation strategy needs to start now because of this business model impact, the economic crisis is exposed. The scale of opportunities and challenges, there will be winners and losers and projects still need to get done or reset or reinvented to come out of this with growth. So this is going to be the number one conversation. What are your thoughts around this? >> No, so I've talked to hundreds of customers across the globe and we see the same thing. In fact actually, in some ways as we went through this, something very profound dawned on me. We, John, talked about digital transformation for the last few years and clearly digital transformation will accelerate but as I was talking to customers, I came to this realization that we actually haven't digitally transformed. To be honest, what happened in the last three to four years is that it was more digital modernization. A few apps got tweaked, a few front-ends got tweaked but if you realize, it was more digital modernization, not transformation because in my opinion, there are four aspects to digital transformation. You think of new products and services, you think of new models of engaging with your customers, you think of absolutely new operating models and you think of fundamentally new business models. That's a whole rewrite of an organization, which is not just creating a new application out there, fundamental end to end transformation. My belief is, our belief is that, now starts a whole new era of transformation, digital transformation. We've just gone through digital modernization. >> Well, that's a great point and the business model impacts create... And in times of these inflection points, and again, you're a student of history in the tech industry, PC revolution, TCP IP. These are big points in time. They're not transitions. The big players tend to win the transitions. When you have a transformation, it's a Cambrian explosion of new kinds of capabilities. This is really, I would agree with your point but I think it's going to be a Cambrian explosion because the business model forcing function is there. How do you see it play, 'cause you're in the middle of all this, 'cause you guys are the control plane for data in the industry as a company. You enable these new apps. Could you share your-- >> So, we see a lot of that and I think the way to think about it, I think first of all, you said it right. This is a step function changing orbit. This is a whole new... You get to a new curve, you go to a different model. It's a whole new equation you're hiking for the curve you're going to be on. It's not just changing the gradient of the curve you've been on, this is going to be a whole journey. And when we think of the new world of digital transformation, there are four elements that are taught. First of all, it has to be strategic. It has to be Board, CEO, executive topped down, fundamentally across the whole organization, across every function of an organization. Second one you talked about scale. I believe this is all about innovating at scale. It's not about, hey, let me go put a new application in some far plans of my business. You've got to innovate at scale, end to end change does not happen in bits and pieces. Third one, this is cloud native, absolutely cloud native. If there was any minuscule of doubt, this is taking it away. Cloud nativity is the fundamental differentiator and the last but not the least is digital natives, which is where everybody wants to go become a digitally transformed company that are data-led. You got to make data-led decisions. So for competence, strategic mindset, innovation at scale cloud nativity and being data-led is going to define digital transformation. >> I think that encapsulates absolutely innovation strategy. I agree with you 100%, that's really insightful. I want to also get your thoughts on some things that you're talking about and you have always had some really kind of high level conversations around this and theCUBE has been a very social organization. We'd love to be that social construct between companies and audiences but you use a term, the digital transformation, the soul of digital transformation is data 4.0. This idea of having a soul is interesting because the apps all have personalization built in. You have CLAIRE, you've been doing CLAIRE AI for a while. So this idea of social organizations, a soul is kind of an interesting piece of metadata you're putting out in the messaging. What do you mean by that? How can digital transmission have a soul? >> I think we talked about it a lot and I think it just came to me that, look at the end of the day, any transformation is so fundamental to anything that anybody does and I think if you think about, you can go to a fundamental transformation that is just qualitative, it's qualitative and quantitative. It's about a human body, it's about a human body transforming itself and then something doesn't have a soul, John, it does not have life. It cannot truly move to the next paradigm. So I believe that, any transformation has to have a soul and the digital world is all about data. So obviously, we believe that we're walking into a data-for-data world where, as I said, the four pillars of digital transformation would be data-led and I believe data is the soul of that transformation and data itself is moving into a new paradigm. You've heard us talk about 1.0, 2.0, 3.0, and this is the new world of 4.0, a data 4.0 which basically is all about cloud nativity, intelligent automation, AI powered, focusing on data, trust in data ethics and operations and innovation at scale. When you bring these elements together, then that enables digital transformation to happen on the shoulders of data 4.0, which in my opinion, is the soul of digital transformation. >> All right, so just rewind on data 4.0 for a minute. Pretend I'm a CIO, I'm super busy. I don't have time to read up about it. Give me the bottom line, what is data 4.0? Describe it to me in basic terms, is it just an advancement, acceleration? What's the quick elevator pitch on 4.0, data 4.0? >> Very simple?. We're all walking into a world where we're going to be digital. Digital means that we're basically going to be creating tons of data. By the way, and data is everywhere. It's not just within the four walls of us. It's basically what I call transaction and interaction and with the scale and volume of data increasing, the complexity of it increasing. We want to make decisions. I say, tomorrow's decision, today and with data that is available to us yesterday, so I can be better at that decision. So we need intelligence, we need automation, we need flexibility, which is where AI comes in. These are all very fundamental rewrites of the technology stack to enable a fundamental business transformation. So in that world, data is front and center and you look at the amount of data we are going to collect, the whole concept of data ethics and data trust become very important, not just Goodwill governance, governance is important but data privacy, data trust becomes very important. Then we're going to do things like contact tracing, it's very important for the society but the ethics, trust and privacy of what you and I will give to the government is going to become very much important. So to me, that world that we go in, every enterprise has to think data first, data led, build an infrastructure to support the business in that context and then, as I said, then the soul, which is data will give life to digital transformation. >> That's awesome. Love the personalization and the soul angle on it. I always believe that you guys had that intelligent automation fabric and to me, you said earlier, cloud native is apparent to everyone now. I think out of all this crisis, I think the one thing that's not going to be debated anymore is that cloud native is the operating model. I think that's pretty much a done deal at this point. So having this horizontally scalable data, you know I've been on this rant for years. I think that's the killer app. I think having horizontally scalable data is going to enable a lot, souls and more life. So I got to ask you the real, the billion dollar question. I'm a customer of yours or prospect or a large enterprise. I'm seeing what's happening at scale, provisioning of VPNs for 100% employees at home, except for the most needed workers. I now see all the things I need to either process, I need to cancel and projects that double down on. I still got to go out and build my competitive advantage. I still have to run my business. So I need to really start deploying right out of the gate data centric, data first, virtual first, whatever you want to call it, the new reality first, this inflection point. What do I do? What is the things that you see as projects or playbook recipes that people could implement? >> First of all, we see a very fundamental reevaluation of the entire business model. In fact, we have this term that we're using now that we have to think of business has a business 360 and if I think about it in this new world, that the businesses that stood the test is that had basically what I call, a digital supply chain or in a very digital scalable way of interacting with their customers, being able to engage with their customers. A digital fabric often making sure that they can bring their product and services to the customers very quickly or in some cases, if they were creating new products and services, they had the ability for a whole new supply chain to reach that end customer. And of course, a business model that is flexible so they dont obviously, they can cater to the needs of their customers. So in all of these worlds, customers are a building digital, scalable data platforms and when I say platforms, it's not about some monolithic platform. These are, as you and I have talked about, very modular microservices based platform that reside on what we call metadata. Data has to be the soul of the digital enterprise. Metadata is the nervous system, that makes it all work. That's the left brain, right brain, that makes it all work, which is where we put AI on top. AI that works for the customers and then they leverage it but AI applied to that metadata allows them to be very flexible, nimble and make these decisions very rapidly, whether they are doing analytics for tomorrow's offering to be brought in front of a customer or understanding the customer better to give them something that appeals to them in changing times or to protect the customer's data or to provide governance on top of it. Anything that you would like to do has to ride on top of what I call a, AI led metadata driven platform that can scale horizontally. >> Okay, so I got to go to the next level on this, which is, okay, you got me on that. I hear what you're saying, I agree, great. But I got to put my developers to work and I got insight, I got analytics teams, I got competencies but Amit, my complexities don't go away. I still got compliance at scale, I got governance at scale but I also got, now my developers not just to get analytical insight, there's great dashboards and there's great analytic data out there, you guys do a good job there. I got to get my developers coding so I can get that agility of the data into the apps for visualization in the app or having a key ingredient of the software. How do I do that? What's your answer to that one? >> So, that's a critic use case. If you think about it, for a developer, one of the biggest challenge for analytics project is how do I bring all the data that is in sites across the enterprise so then I can put it in any kind of visualization analytics tool and things are happening at scale. An enterprise is spread across the globe. It's so many different data sources available everywhere. Again, what we've done is that as a part of the data platform when you focus upon the metadata, that allows you to go to one place where you can have full access to all of the data assets that are available across (mumbles). Do you remember at theCUBE years ago, we unveiled the launch of our enterprise data catalog, which as I said, was the Google for enterprise data through metadata. Now, developers don't have to go start wasting their time, trying to find whether data has (mumbles), through the catalog that CLAIRE is in-built, they have access to it. They can start putting that to work and figuring out how do I take different kinds of data? How do I put it in some data times tool? Through which we have the in-built integrations. Do what I call the valuable last mile work, which is where the intelligence is needed from them versus spend their energy trying to figure out where good data, clean data, all kinds of data sets. We have eliminated all of that complexity with the help of metadata data platform, CLAIRE, to let the developers do what I call value-added productive work. >> Amit, final question for you. I know you talk to customers a lot, you're always on the road, you got a great product background, that's where you came from, good mix understanding of the business but now your customers and prospects are trynna put the fires out. The big room that... No one's going to talk about their kitchen appliances when the house is burning down and in some cases on the business model side or if it's a growth strategy, they're going to put all their energies where the action is. So getting mind share with them is going to be very difficult. How are you as a leader and how is Informatica getting in front of these folks and saying, "Look, I know things are tough "but we're an important supplier for you." How do you differentiate? How are you going to get that mind share? What are some of those conversations? 'Cause this is really the psychology of the marketplace right now, the buyer and the customer. >> Well, first of all, obviously we had to adapt to reach our customers in a different way because, virtually based just like you and I are chatting right now and to be candid, our teams were fantastic in being able to do it. We've actually already had multiple pretty big sides of it. In fact, the first week before we started (mumbles), we had set up the MDM and Data Governance Summit up in New York and we expected thousands of customers to come there, ask them (mumbles) virtual and we did it virtually and we had three times more people attend the virtual event. It was much easier for people who attended from the confines of their living room. So we'd gone 100% virtual and good news is, that our customers are heavily engaged. We've actually had more participation of customers coming and attending our events. We've had obviously our customers speaking, talking about how they've created value. In light of that next week, we have the big event which we're calling, CLAIREview named after ClAIRE AI engine. It's basically a beautiful net-filled tech experience. We'll have a keynote, we'll have seasons and episodes, people can do bite-sized viewing at their own leisure. We'll talk about all kinds of transformation. In fact, we have Scott Guthrie who runs all of Azure and Cloud at Microsoft as a part of my Keynote. We have two great customers, CDO at XXL and a CEO of GDR nonprofit that does (mumbles) on diabetes work talk about the data journeys. We have Martin Byer from Gardner. So we've been able to pivot and our customers are heavily engaged because data is a P-zero or a P-one activity for them to invest in. So we haven't seen any drop-off in customer engagement with us and we've been very blessed that we have a very loyal and a very high retention rate customer base. >> Well, I would expect that being the center of the value proposition, where we've always said data has been. One more final question since this just popped in my head. You and I have been talking about the edge for years. Certainly now the edge is exposed, we all know what the edge is, it's working at home. It's the human, it's me, it's my IOT devices. More than ever, the edge is now the new perimeter. It's the edge and now the edges is there. There's something that you've been talking a while. This is another part of data fabric that's important. Your view on this new edge that's now visualized by everybody, realized this immersion. What's your thoughts on the edge? >> Oh, I think the edge is real now. You and me chatted about that almost four years ago and I (mumbles). Look, think of it this way. Think of how security is going to change. There's no more data center to which we route our traffic anymore. It's sitting over there somewhere where no human beings is going to have access. People are connecting to all kinds of cloud application directly from their offices or living rooms or their cultures and the world of security has to change in that context. And people are more going to be more, enterprise (mumbles) are more worried about, hey, how do I make sure that that data centric, privacy and security is there in my device and that connects to the third party cloud vendors versus I can't transfer traffic to mine, everything to my VPN. So the edge is going to become a lot more compute intensive as well as it will require a lot of the elements that are, to be honest, used to be data center centric. We have to lighten them and bring them to the edge so enterprises can feel assured and working because at the end of the day, they have to run a business by the standards that an enterprise is held to. So you will see a ton of innovation, by the way, robotics. Robotics is going to make edge even more interesting in live view. So I see the next couple of years, heavy IOP edge computing, just like the clients that are modeled to mainframe that the PC became like a mainframe in terms of compute capacity. I guarantee at the desktop, compute capacity will go down to the edge and we're going to see that happen in the next five years or so. >> The edge is the new data centers. I always say, it's the land is the way, the way is the land. Amit, great to see you and thanks for sharing and I'm sorry, we can't do it in person but this has been like a fireside chat meets CUBE interview, remote. Thanks for spending the time and sharing your insights and we've always had great interviews at your events, virtual again, this year. We're going to spread it out over time, good call. Thanks for coming on, I appreciate it. >> Thanks, John, take care. >> Okay, Amit, CEO of Informatica, always great to get the conversation updates from him on the industry and what Informatica, as at the center of the value proposition data 4.0. This is really the new transformation, not transition, data science, data, data engineering, all happening. theCUBE with our remote interviews, bringing you all the coverage here from our Palo Alto studios, I'm John Furrier. Thanks for watching. (gentle music)

Published Date : May 27 2020

SUMMARY :

all around the world. Amit, great to see you. Good to see you John. but I'm glad to get you remotely at least and of course, a huge round of thank you So this is going to be the the last three to four years and the business model impacts create... and being data-led is going to and audiences but you use a term, and I think it just came to me that, I don't have time to read up about it. is going to become very much important. and to me, you said earlier, that the businesses that stood the test so I can get that agility of the data They can start putting that to work is going to be very difficult. and to be candid, our teams were fantastic is now the new perimeter. and that connects to the Amit, great to see you This is really the new transformation,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
AmitPERSON

0.99+

JohnPERSON

0.99+

John FurrierPERSON

0.99+

InformaticaORGANIZATION

0.99+

Scott GuthriePERSON

0.99+

New YorkLOCATION

0.99+

Amit WaliaPERSON

0.99+

Palo AltoLOCATION

0.99+

GDRORGANIZATION

0.99+

BostonLOCATION

0.99+

100%QUANTITY

0.99+

May 2020DATE

0.99+

yesterdayDATE

0.99+

three monthsQUANTITY

0.99+

tomorrowDATE

0.99+

todayDATE

0.99+

theCUBEORGANIZATION

0.99+

Martin ByerPERSON

0.99+

GoogleORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

YouTubeORGANIZATION

0.99+

CubeORGANIZATION

0.99+

next decadeDATE

0.98+

CLAIREPERSON

0.98+

MDMEVENT

0.98+

hundreds of customersQUANTITY

0.98+

XXLORGANIZATION

0.98+

theCUBE StudiosORGANIZATION

0.97+

Data Governance SummitEVENT

0.97+

next weekDATE

0.97+

this yearDATE

0.96+

Third oneQUANTITY

0.96+

CUBEORGANIZATION

0.95+

billion dollarQUANTITY

0.95+

Second oneQUANTITY

0.94+

thousands of customersQUANTITY

0.94+

four yearsQUANTITY

0.93+

One more final questionQUANTITY

0.93+

one placeQUANTITY

0.91+

FirstQUANTITY

0.91+

first weekDATE

0.91+

four years agoDATE

0.91+

oneQUANTITY

0.91+

one thingQUANTITY

0.9+

AzureTITLE

0.9+

next five yearsDATE

0.89+

firstQUANTITY

0.89+

next couple of yearsDATE

0.89+

GardnerORGANIZATION

0.88+

4.0QUANTITY

0.88+

end of this yearDATE

0.85+

years agoDATE

0.85+

four aspectsQUANTITY

0.81+

two great customersQUANTITY

0.8+

360QUANTITY

0.8+

yearsQUANTITY

0.79+

four elementsQUANTITY

0.78+

CLAIREviewORGANIZATION

0.78+

past few monthsDATE

0.77+

three times more peopleQUANTITY

0.74+

CloudTITLE

0.71+

four pillarsQUANTITY

0.71+

SiliconANGLEORGANIZATION

0.67+

CDOORGANIZATION

0.64+

ClAIREORGANIZATION

0.64+

Glenn Fitzgerald, Fujitsu | SUSECON Digital '20


 

>> Narrator: From around the globe, it's the CUBE with coverage of SUSECON Digital, brought to you by SUSE. >> Hi, and welcome back. I'm Stu Miniman, and this is the CUBE's coverage of SUSECON Digital '20. Happy to welcome to the program Glenn Fitzgerald, he is the Chief Data Officer for Fujitsu Products in Europe, coming to me from across the pond. Ah, Glenn, great to see you, so thanks so much for joining us. >> Hi Stu, thanks, very glad to be here. >> All right, so, first of all, you know, Fujitsu Products Europe, Chief Data Officer, give us a little bit, your role and responsibility inside Fujitsu. >> Of course, the Fujitsu Products Europe is as the name suggests, that part of the Fujitsu Corporation that is dedicated to delivering our products out through the European geography. Fujitsu's product sets runs the full range of ITC components from... tablets to PCs to servers to big storage devices to networks, which is to integrated systems and the software stacks that sit on top of them. It's a wide profile, yeah. And my role has been to be the Chief Technology Officer for that organization for several years. Recently, we have as an organization adopted a new approach to take to the marketplace. And that has necessitated a slight change in my role to one that's more focused on enabling customers to get value out of their data and their data repositories and the correlation of that data to generate business value. A long description, Stu, but I think necessary. >> Yeah, no, super important, Glenn. One thing we've actually been saying for more than a year on the CUBE now, is when you have that discussion of digital transformation, one of the things that differentiates companies before they've gone digital and if they are truly to call themself, you know, have gone through this transformation, is they need to be data-driven, you know. Data needs to be how they're making their decisions. It was definitely a key theme that we heard from SUSE in the keynote. So maybe talk a little bit about how digital transformation and the partnership with SUSE fits into your world. >> Absolutely. So, in terms of the transformation of our business and the changes that we're trying to make to it, as a product organization, traditionally our relationships with our customers is kind of transactional. You know, we sell stuff and they buy stuff. And that relationship with customers is increasingly less viable. It's increasingly challenged. And I think it's challenged by the many things that have happened in the marketplace. It's a sign of a maturing industry. So, you have the Cloud and you have the ISVs who are providing compute power and storage capacity and network capability to our customers in a different way. They're delivering it on the click of a button on an internet browser. Now, that's suitable for some customers in some situations, it isn't suitable for others, but it's definitely here to stay and it's definitely going to change the way the marketplace works, and it has. So we've recognized inside our organization that we need to leverage some of the capabilities that exists inside the Fujitsu services organizations. Fujitsu is a large company. It also has very significant manage services capabilities, we deliver to huge customers all across Europe in terms of German government, British government, a lot of the big manufacturing industrials in Europe and a lot of the travel and insurance financial sectors. So leveraging some of that to take a more consultant-led approach to our marketplace, to our customers. So what we want to do with them is take them through the story of data transformation. And as you said, and I quite agree, the marketplace is becoming increasingly data-driven. You've only got to look at some of the well-known examples, and I'm not going to rehearse them again because everybody's heard them and knows who they are. But, every organization, however large or small, has to derive business advantage and discrimination from its data. Otherwise, they'll go the way of... I hate to say it, the High Street. You can see, in this recent pandemic, the COVID-19 stuff. I don't know what it's like in the US, but absolutely in the UK and in Europe, those retailers that have been able to provide a online presence have survived, and some of them have thrived. And those retailers that haven't been able to provide that presence aren't here anymore. And that's just, it's a current and rather violent example of this change of how to manage data and get the best value out of it. Now, in order to take that to our customers, the Fujitsu Product team needs to change some of its capabilities, it needs to introduce some of those consulting capabilities into its portfolio, which we do. It needs to work with some of our partners to deliver the capabilities either as an installation or a service and SUSE are one of our prime partners in that sense. Both in terms of delivering the computing platform standards, the SUSE Data Hub, I believe it's changed its name now. The SUSE Data Hub as I know it, is core to our offerings in this space. We have just launched in Germany, for example, a manufacturing optimization application which runs off the SUSE infrastructure and uses the SAP database and database management systems above that to deliver things like predictive maintenance and just-in-time parts delivery, and in-factory automated routine of little robots carrying the bits to the right place. And that's an example of something that was led by a consulting activity between Fujitsu and our customer, in this case, a large manufacturer. We recognized during that consultancy that some of the stuff we needed to do to deliver the solution, that would deliver the data-derived business benefit the customer needed, was not in our immediate scope. We got some of our larger partners, SUSE and SAP in this case, involved in it, and they outcome has been happy for everybody. There are some lessons in all that. The Fujitsu is still learning, if I'm frank, like how to price it. When you have consultant-led activities that are generating very great benefit for your client, it's not too great for the supplier to still be charging that just on consultant day-rate. That can lead you to not getting the value out of what you're providing to your client. So there's lessons there. There's lessons in how to interact between ourselves and some of our services partners and clients. And making sure that the optimum route to market is delivered. But that essentially, Stu, is the story. It's a change from a transactional approach to a consultant-led approach, and the generation of a large ecosystem of partners, like SUSE, like SAP, with the capabilities to build stuff with us and deliver business outcomes to clients, not a stack of tin. >> Excellent. So, Glenn, what about kind of emerging requirements, what you're hearing from your customers, you know, AI is an area that we heard quite a bit in the keynote from SUSE. Where do you see that fitting into the entire discussion? Obviously, the key, leverage of data, when you talk about AI. >> Absolutely, and to talk about that in two ways. The first way, the first issue with that is exactly the point you make, Stu, around data. So, AI, which is not artificial and not intelligent, it's just maths. It's statistical mathematics acting upon a large set of data. And if you have a large set of the right data, it can produce fantastic results for the client. But without that data, it is a relatively meaningless exercise. Once that data are assembled, we're beginning to see very significant results produced by the application of new networking, the machine learning. To technology-based, data-derived solutions for our clients, and there are many examples. I'll give you just one or two. We are working with a large financial institution in the city of London that wants to produce, basically, an artificial knowledge base that will perform the task of insurance underwriting. Don't ask me how that works, I'm not a financial guy. But apparently, insurance underwriting is a relatively mechanical task. You have a set of actuarial tables, you have a set of risks, you compare one with the other and produce a premium. We're working with them on that. There is a lot in the manufacturing space, and a surprising amount in healthcare. One of the most personally rewarding examples I've been involved with was the delivery of intelligent heart monitoring to clients with pacemakers. So, the pacemaker is made intelligent and it dumps to a Bluetooth-connected device in the patient's home, and that uploads to an AI-based knowledge system in the Cloud, and the Cloud says, "Sit down, you're going to have a heart attack." And the important element of that is that it says, "Sit down, you're going to have a heart attack" before you've had the heart attack, so you don't have one. A really fantastic example of human-centric interest. So, I think, as a separate subject, AI is largely of academic interest. But as a component of a data-driven solution for a customer, it's rapidly emerging as an important element in our armory, as indeed some other technologies. Like data annealing, and like data analytics, and to a slightly lesser extent at the moment, but I think it will come, blockchain. >> Excellent. So, Glenn, one of the things we always like talking about when we talk to a CDO is how are companies getting along with their data strategy? And I think back four or five years ago when we were first hearing about CDO as a role, it was, you know, the CDO, where do they fit compared to the CIO, what is the changing role of the CIO? So, like you were saying about some of these things, data often can be an afterthought or not necessarily connected, but just as we were saying, data needs to be a critical piece of how companies plan. You gave a great example of medical, obviously. You know, the data can really help transform lives in that environment. So, bring us inside what you're hearing from customers, how are they structuring, and are they really being, I guess, data-driven is one of the terms that I... >> That's a very good question. And the answer is yes to everything. So, one of the most difficult things to estimate, if you're going in to a customer with a client, especially if it's a client that you don't know very well, is exactly what their point of reference is going to be, what their comfort with some of these things is. As a result, we at Fujitsu invested a good deal of effort in going out to our client base and asking them the necessary questions to generate a thing we call the Data Maturity Model. Now the Data Maturity Model is not a new concept, it's a very solid and sound concept, it's been around for a long time. I think what we're trying to do is bring more rigor to that with a very large sample base of our customers. And the model is what you'd expect. There are five levels within it from at Level 1, what is data? To Level 5, where data is continually monitored, continually exploited, and continually developed as part of the business that the organization delivers. So there's a spectrum. In my experience, slightly controversially perhaps, the state of organizations on that Maturity Index varies with geography. And I think it's something to do with acceptance of risk, I think it's something to do with security concerns and liabilities. It's my observation that in the Anglophone world, in England and in the US certainly, there is a higher average awareness of the importance of data and the need to derive business benefit from it than there is, for example, in the Germanophone world, where there are more concerns around security and more regulations around security. They're quite constraining. And as a result, where organizations are a bit more traditional and a bit less aware of the value to be derived from the data. So, people, organizations hit everywhere on this scope, this plane of awareness of data and its potential. But it's definitely the case that the average is always going up. >> Yeah-. >> You only have to look at some of the public stocks, under the stuff in the public domain, to observe that that's happening all the time. >> Yeah, Glenn, I'm curious with the global pandemic happening, are you seeing any impact on that? I've heard some anecdotal data that you talk about some of the companies that are, you know, might not be interested in doing Cloud adoption because they're concerned about security, and all of a sudden realizing they need to take advantage of certain solutions. Or if you look at something like the tracking and tracing, obviously, people are rightfully concerned about personal information and having rights infringed upon. So, what will, in your opinion, are you seeing any early indications as to what this impact will be on how we think about data? >> I think there, again, there are two different dimensions. There's a Darwinian element in the attitudes towards commercial data. As we said right at the start of the conversation, in the current environment, you can see large retailers disappearing at a rate of knots because they haven't been data-aware and data-adopting. That lesson is not lost on other retailers. So, retailers are beginning to do things that in the past they wouldn't have done because of that sort of security concern, but also because of concerns about things like function and performance... and the sheer security that you have in owning your own stuff and therefore being certain of its ownership by you and your retention of the IPR involved. So there is definitely a slackening of that concern and a faster adoption of data exploitation technologies in the commercial sector. In the domestic sector, I think it's very mixed. And again, extremely geography different. In the UK, we have, if I could just talk about my own country for a second, we have this trial of a smartphone COVID-19 tracking app going on on the Isle of Wight. The British media is full to brimming with discussions of the implications of that upon individual liberty, of whether or not it's the nanny state gone mad, of whether or not we should all be not cooperating with it and catching the damn disease anyway because it's a step too far. In Germany, they just implemented it. And everybody went, "Right." (makes click sound) So there are all these different cultural adoptions of these things. But always and forever the trend is upwards. Similar debates around video surveillance technology. So you've got the pressure of security and protecting the public, against intrusion and violation of individual rights. And that debate has got to the stage now where there have been some pilots for threat detection based on video surveillance in the UK that have been stopped. Not so much in Germany. In the US, I don't know, but I guess, you're even more Libertarian than we Brits are, so it's probably more the other way. But with all of these discussions of differences, of culture and nation and area and geography, the trend is definitely upward. So, however the British people resolve that stress, you have to have a tracking app if you want to beat this disease. And that will happen in due course. >> Excellent. Well, Glenn, I'll give you the final takeaway, SUSECON '20, talk about the importance of the Fujitsu and SUSE partnership. >> I think it's a growing part of the base of an ecosystem that's required for all organizations like Fujitsu, like SUSE, that want to reach out and deliver solutions to our customers' business problems, which is after all, what we're here for and what we're all about. Because let's face it. In any sizable organization, the data landscape is unbelievably complicated. You have different formats of data, in RBDs, in unstructured file store, in whatever floats around employees' devices, on social media, for God's sake. Getting all of that out, understanding its relationship to infrastructure, understanding its relation through infrastructure, through application stacks, and service delivery, and then being able to transform that into new applications and new service paradigms that deliver the business benefits that our customers are looking for, is an incredibly complex act. And no one organization is going to be able to do it on its own. So I see the future as one of these growing ecosystems of people that work together some of the time, compete some of the time. Are in what we might call a frenemy relationship. Because we all have to work together to deliver what the customers need. Fujitsu is working with SUSE and our other partners at the forefront of that trying to build economic and commercial and technical partnerships. And I'm sure that will continue through SUSECON '20 and into the future. >> All right, well, Glenn Fitzgerald, thank you so much for joining us. Really appreciate the updates. >> I've enjoyed it. Thank you for having me. >> All right, much more coverage from SUSECON '20 Digital. I'm Stu Miniman and thank you for watching the CUBE. (upbeat music)

Published Date : May 20 2020

SUMMARY :

it's the CUBE with coverage he is the Chief Data Officer and responsibility inside Fujitsu. and the correlation of that and the partnership with and a lot of the travel and in the keynote from SUSE. and the Cloud says, "Sit down, is one of the terms that I... and the need to derive look at some of the public stocks, the tracking and tracing, obviously, and the sheer security that you have of the Fujitsu and SUSE partnership. that deliver the business benefits Really appreciate the updates. Thank you for having me. I'm Stu Miniman and thank

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
FujitsuORGANIZATION

0.99+

GlennPERSON

0.99+

EuropeLOCATION

0.99+

GermanyLOCATION

0.99+

Glenn FitzgeraldPERSON

0.99+

EnglandLOCATION

0.99+

StuPERSON

0.99+

UKLOCATION

0.99+

USLOCATION

0.99+

LondonLOCATION

0.99+

Stu MinimanPERSON

0.99+

oneQUANTITY

0.99+

SUSEORGANIZATION

0.99+

five levelsQUANTITY

0.99+

first wayQUANTITY

0.99+

twoQUANTITY

0.99+

Fujitsu ProductsORGANIZATION

0.99+

SUSETITLE

0.99+

two waysQUANTITY

0.99+

first issueQUANTITY

0.99+

BothQUANTITY

0.98+

German governmentORGANIZATION

0.98+

firstQUANTITY

0.97+

five years agoDATE

0.97+

fourDATE

0.97+

SUSE Data HubTITLE

0.96+

Fujitsu CorporationORGANIZATION

0.96+

British governmentORGANIZATION

0.96+

more than a yearQUANTITY

0.96+

OneQUANTITY

0.95+

two different dimensionsQUANTITY

0.95+

CUBEORGANIZATION

0.95+

COVID-19OTHER

0.95+

SUSECON '20 DigitalORGANIZATION

0.94+

IPRORGANIZATION

0.91+

Fujitsu Products EuropeORGANIZATION

0.89+

BritishOTHER

0.83+

pandemicEVENT

0.83+

SUSECONORGANIZATION

0.81+

AnglophoneOTHER

0.8+

SUSECON Digital '20ORGANIZATION

0.78+

Level 1QUANTITY

0.77+

CDOTITLE

0.76+

Level 5QUANTITY

0.75+

SUSECON '20TITLE

0.75+

SAPORGANIZATION

0.72+

GermanophoneLOCATION

0.68+

SUSECON DigitalTITLE

0.67+

Ritika Gunnar, IBM | IBM Think 2020


 

>>Yeah, >>from the Cube Studios in Palo Alto and Boston. It's the Cube covering IBM. Think brought to you by IBM. >>Everybody, this is Dave Vellante of the Cube. Welcome back. The continuous coverage that we're running here of the IBM Think Digital 2020 Experience. I'm with Radica Gunnar, who is a longtime Cube alum. She's the vice president for Data and AI. Expert labs and learning Radica. Always a pleasure. I wish we were seeing each other face to face in San Francisco. But, you know, we have to make the best. >>Always a pleasure to be with you, Dave. >>So, listen, um, we last saw each other in Miami Attain IBM data event. You hear a lot of firsts in the industry. You hear about Cloud? First, you hear about data. First hear about AI first. I'm really interested in how you see AI first coming customers. They want to operationalize ai. They want to be data first. They see cloud, you know, is basic infrastructure to get there, but ultimately they want insights out of data. And that's where AI comes in. What's your point of view on this? >>I think any client that's really trying to establish how to be able to develop a AI factory in their organization so that they're embedding AI across the most pervasive problems that they have in their order. They need to be able to start first with the data. That's why we have the AI ladder, where we really think the foundation is about how clients organized there to collect their data, organize their data, analyze it, infuse it in the most important applications and, of course, use that whole capability to be able to modernize what they're doing. So we all know to be able to have good ai, you need a good foundational information, architecture and the US A lot of the first steps we have with our clients is really starting with data doing an analysis of where are you with the data maturity? Once you have that, it becomes easier to start applying AI and then to scale AI across the business. >>So unpack that a little bit and talk about some of the critical factors and the ingredients that are really necessary to be successful. What are you seeing with customers? >>Well, to be successful with, a lot of these AI projects have mentioned. It starts with the data, and when we come to those kind of characteristics, you would often think that the most important thing is the technology. It's not that is a myth. It's not the reality. What we found is some of the most important things start with really understanding and having a sponsor who understands the importance of the AI capabilities that you're trying to be able to drive through business. So do you have the right hunger and curiosity of across your organization from top to bottom to really embark on a lot of these AI project? So that's cultural element. I would say that you have to be able to have that in beds within it, like the skills capabilities that you need to be able to have, not just by having the right data scientists or the right data engineers, but by having every person who is going to be able to touch these new applications and to use these new applications, understand how AI is going to impact them, and then it's really about the process. You know, I always talk about AI is not a thing. It's an ingredient that makes everything else better, and that means that you have to be able to change your processes. Those same applications that had Dev ops process is to be able to put it in production. Need to really consider what it means to have something that's ever changing, like AI as part of that which is also really critical. So I think about it as it is a foundation in the data, the cultural changes that you need to have from top to bottom of the organization, which includes the skills and then the process components that need to be able to change. >>Do you really talking about like Dev ops for AI data ops, I think is a term that's gonna gaining popularity of you guys have applied some of that in internally. Is that right? >>Yeah, it's about the operations of the AI life cycle in, and how you can automate as much of that is possible by AI. They're as much as possible, and that's where a lot of our investments in the Data and AI space are going into. How do you use AI for AI to be able to automate that whole AI life site that you need to be able to have in it? Absolutely >>So I've been talking a lot of C. XO CEO CEOs. We've held some C so and CEO roundtables with our data partner ET are. And one of the things that's that's clear is they're accelerating certain things as a result of code 19. There's certainly much more receptive to cloud. Of course, the first thing you heard from them was a pivot to work from home infrastructure. Many folks weren't ready, so okay, but the other thing that they've said is even in some hard hit industries, we've essentially shut down all spending, with the exception of very, very critical things, including, interestingly, our digital transformation. And so they're still on that journey. They realized the strategic imperative. Uh, and they don't want to lose out. In fact, they want to come out of this stronger AI is a critical part of that. So I'm wondering what you've seen specifically with respect to the pandemic and customers, how they're approaching ai, whether or not you see it accelerating or sort of on the same track. What are you seeing out there with clients? >>You know, this is where, um in pandemics In areas where, you know, we face a lot of uncertainty. I am so proud to be an IBM. Er, um, we actually put out offer when the pandemic started in a March timeframe. Teoh Many of our organizations and communities out there to be able to use our AI technologies to be able to help citizens really understand how Kobe 19 was gonna affect them. What are the symptoms? Where can I get tested? Will there be school tomorrow? We've helped hundreds of organizations, and not only in the public sector in the healthcare sector, across every sector be able to use AI capabilities. Like what we have with Watson assistant to be able to understand how code in 19 is impacting their constituents. As I mentioned, we have hundreds of them. So one example was Children's health care of Atlanta, where they wanted to be able to create an assistant to be able to help parents really understand what symptoms are and how to handle diagnosis is so. We have been leveraging a lot of AI technologies, especially right now, to be able to help, um, not just citizens and other organizations in the public and healthcare sector, but even in the consumer sector, really understand how they can use AI to be able to engage with their constituents a lot more closely. That's one of the areas where we have done quite a bit of work, and we're seeing AI actually being used at a much more rapid rate than ever >>before. Well, I'm excited about this because, you know, we were talking about the recovery, What there's a recovery look like is it v shaped? Nobody really expects that anymore. But maybe a U shaped. But the big concern people have, you know, this w shape recovery. And I'm hopeful that machine intelligence and data can be used to just help us really understand the risks. Uh, and then also getting out good quality information. I think it's critical. Different parts of the country in the world are gonna open at different rates. We're gonna learn from those experiences, and we need to do this in near real time. I mean, things change. Certainly there for a while they were changing daily. They kind of still are. You know, maybe we're on a slower. Maybe it's three or four times a week now, but that pace of change is critical and, you know, machine machines and the only way to keep up with that wonder if you could comment. >>Well, machines are the only way to keep, and not only that, but you want to be able to have the most up to date relevant information that's able to be communicated to the masses and ways that they can actually consume that data. And that's one of the things that AI and one of the assistant technologies that we have right now are able to do. You can continually update and train them such that they can continually engage with that end consumer and that end user and be able to give them the answers they want. And you're absolutely right, Dave. In this world, the answers change every single day and that kind of workload, um, and and the man you can't leave that alone to human laborers. Even human human labors need an assistant to be able to help them answer, because it's hard for them to keep up with what the latest information is. So using AI to be able to do that, it's absolutely critical, >>and I want to stress that I said machines you can't do without machines. And I believe that, but machines or a tool for humans to ultimately make the decisions in a crisis like this because, you see, I mean, I know we have a global audience, but here in the United States, you got you have 50 different governors making decisions about when and how certainly the federal government putting down guidelines. But the governor of Georgia is going to come back differently than the governor of New York, Different from the governor of California. They're gonna make different decisions, and they need data. And AI and Machine intelligence will inform that ultimately their public policy is going to be dictated by a combination of things which obviously includes, you know, machine intelligence. >>Absolutely. I think we're seeing that, by the way, I think many of those governors have made different decisions at different points, and therefore their constituents need to really have a place to be able to understand that as well. >>You know, you're right. I mean, the citizens ultimately have to make the decision while the governor said sick, safe to go out. You know, I'm gonna do some of my own research and you know, just like if you're if you're investing in the stock market, you got to do your own research. It's your health and you have to decide. And to the extent that firms like IBM can provide that data, I think it's critical. Where does the cloud fit in all this? I mentioned the cloud before. I mean, it seems to be critical infrastructure to get information that will talk about >>all of the capabilities that we have. They run on the IBM cloud, and I think this is where you know, when you have data that needs to be secured and needs to be trusted. And you need these AI capabilities. A lot of the solutions that I talked about, the hundreds of implementations that we have done over the past just six weeks. If you kind of take a look at 6 to 8 weeks, all of that on the IBM Public cloud, and so cloud is the thing that facilitates that it facilitates it in a way where it is secure. It is trusted, and it has the AI capabilities that augmented >>critical. There's learning in your title. Where do people go toe? Learn more How can you help them learn about AI And I think it started or keep going? >>Well, you know, we think about a lot of these technologies as it isn't just about the technology. It is about the expertise and the methodologies that we bring to bear. You know, when you talk about data and AI, you want to be able to blend the technology with expertise. Which is why are my title is expert labs that come directly from the labs and we take our learnings through thousands of different clients that we have interacted with, working with the technologies in the lab, understanding those outcomes and use cases and helping our clients be successful with their data and AI projects. So we that's what we do That's our mission. Love doing that every day. >>Well, I think this is important, because I mean, ah company, an organization the size of IBM, a lot of different parts of that organization. So I would I would advise our audience the challenge IBM and say, Okay, you've got that expertise. How are you applying that expertise internally? I mean, I've talked into public Sorry about how you know the data. Science is being applied within IBM. How that's then being brought out to the customers. So you've actually you've got a Petri dish inside this massive organization and it sounds like, you know, through the, you know, the expert labs. And so the Learning Center's you're sort of more than willing to and aggressively actually sharing that with clients. >>Yeah, I think it's important for us to not only eat our own dog food, so you're right. Interpol, The CDO Office Depot office we absolutely use our own technology is to be able to drive the insights we need for our large organization and through the learnings that we have, not only from ourselves but from other clients. We should help clients, our clients and our communities and organizations progress their use of their data and their AI. We really firmly believe this is the only way. Not only these organizations will progress that society as a whole breast, that we feel like it's part of our mission, part of our duty to make sure that it isn't just a discussion on the technology. It is about helping our clients and the community get to the outcomes that they need to using ai. >>Well, guy, I'm glad you invoke the dog food ing because, you know, we use that terminology a lot. A lot of people marketing people stepped back and said, No, no, it's sipping our champagne. Well, to get the champagne takes a lot of work, and the grapes at the early stages don't taste that pain I have to go through. And so that's why I think it's a sort of an honest metaphor, but critical your you've been a friend of the Cube, but we've been on this data journey together for many, many years. Really appreciate you coming on back on the Cube and sharing with the think audience. Great to see you stay safe. And hopefully we'll see you face to face soon. >>All right. Thank you. >>Alright. Take care, my friend. And thank you for watching everybody. This is Dave Volante for the Cube. You're watching IBM think 2020. The digital version of think we'll be right back after this short break. >>Yeah, yeah, yeah.

Published Date : May 7 2020

SUMMARY :

Think brought to you by IBM. you know, we have to make the best. They see cloud, you know, is basic infrastructure to get there, know to be able to have good ai, you need a good foundational information, that are really necessary to be successful. and that means that you have to be able to change your processes. gonna gaining popularity of you guys have applied some of that in internally. to be able to automate that whole AI life site that you need to be able to have in it? Of course, the first thing you heard from them and communities out there to be able to use our AI technologies to be able But the big concern people have, you know, this w shape recovery. Well, machines are the only way to keep, and not only that, but you want to be able to have the most up to date relevant But the governor of Georgia is going to come back differently than the governor of at different points, and therefore their constituents need to really have a place to be able to understand that I mean, it seems to be critical infrastructure to get information that will and I think this is where you know, when you have data that needs to be secured and needs to be Learn more How can you help them learn about It is about the expertise and the methodologies that we bring to bear. and it sounds like, you know, through the, you know, the expert labs. It is about helping our clients and the community get to the outcomes that they need to Great to see you stay safe. And thank you for watching everybody.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
IBMORGANIZATION

0.99+

San FranciscoLOCATION

0.99+

DavePERSON

0.99+

Dave VellantePERSON

0.99+

Dave VolantePERSON

0.99+

Palo AltoLOCATION

0.99+

Ritika GunnarPERSON

0.99+

hundredsQUANTITY

0.99+

BostonLOCATION

0.99+

United StatesLOCATION

0.99+

MiamiLOCATION

0.99+

threeQUANTITY

0.99+

6QUANTITY

0.99+

tomorrowDATE

0.99+

thousandsQUANTITY

0.99+

FirstQUANTITY

0.99+

AtlantaLOCATION

0.99+

MarchDATE

0.99+

50 different governorsQUANTITY

0.99+

CDOORGANIZATION

0.99+

8 weeksQUANTITY

0.99+

InterpolORGANIZATION

0.99+

code 19OTHER

0.99+

Radica GunnarPERSON

0.98+

oneQUANTITY

0.97+

first stepsQUANTITY

0.97+

firstQUANTITY

0.96+

New YorkLOCATION

0.94+

pandemicEVENT

0.94+

hundreds of organizationsQUANTITY

0.94+

USLOCATION

0.93+

firstsQUANTITY

0.9+

Cube StudiosORGANIZATION

0.88+

four times a weekQUANTITY

0.87+

CubeCOMMERCIAL_ITEM

0.87+

ETORGANIZATION

0.86+

ThinkCOMMERCIAL_ITEM

0.85+

one exampleQUANTITY

0.84+

six weeksQUANTITY

0.84+

Dev opsTITLE

0.82+

IBM Think Digital 2020 ExperienceEVENT

0.82+

CaliforniaLOCATION

0.81+

WatsonTITLE

0.8+

first thingQUANTITY

0.8+

pandemicsEVENT

0.77+

GeorgiaLOCATION

0.74+

governorPERSON

0.74+

C. XOPERSON

0.7+

RadicaPERSON

0.7+

single dayQUANTITY

0.69+

Kobe 19COMMERCIAL_ITEM

0.59+

think 2020TITLE

0.56+

implementationsQUANTITY

0.54+

DepotORGANIZATION

0.49+

CubeORGANIZATION

0.4+

2020TITLE

0.32+

19TITLE

0.31+

Aliye 1 1 w dave crowdchat v2


 

>> Hi everybody, this is Dave Velante with the CUBE. And when we talk to practitioners about data and AI they have troubles infusing AI into their data pipeline and automating that data pipeline. So we're bringing together the community, brought to you by IBM to really understand how successful organizations are operationalizing the data pipeline and with me to talk about that is Aliye Ozcan. Aliye, hello, introduce yourself. Tell us about who you are. >> Hi Dave, how are you doing? Yes, my name is Aliye Ozcan I'm the Data Operations Data ops Global Marketing Leader at IBM. >> So I'm very excited about this project. Go to crowdchat.net/dataops, add it to your calendar and check it out. So we have practitioners, Aliye from Harley Davidson, Standard Bank, Associated Bank. What are we going to learn from them? >> What we are going to learn from them is the data experiences. What are the data challenges that they are going through? What are the data bottlenecks that they had? And especially in these challenging times right now. The industry is going through this challenging time. We are all going through this. How the foundation that they invested. Is now helping them to pivot quickly to market demands, the new market demands fast. That is fascinating to see, and I'm very excited having individual conversations with those experts and bringing those stories to the audience here. >> Awesome, and we also have Inderpal Bhandari from the CDO office at IBM, so go to crowdchat.net/dataops, add it to your calendar, we'll see you in the crowd chat.

Published Date : May 6 2020

SUMMARY :

are operationalizing the data pipeline I'm the Data Operations Data ops What are we going to learn from them? What are the data challenges add it to your calendar, we'll

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Dave VelantePERSON

0.99+

Standard BankORGANIZATION

0.99+

IBMORGANIZATION

0.99+

Associated BankORGANIZATION

0.99+

Inderpal BhandariPERSON

0.99+

Harley DavidsonORGANIZATION

0.99+

AliyePERSON

0.99+

crowdchat.net/dataopsOTHER

0.99+

Aliye OzcanPERSON

0.99+

Aliye 1PERSON

0.86+

CUBEORGANIZATION

0.85+

crowdchatTITLE

0.67+

Data opsORGANIZATION

0.61+

CDOORGANIZATION

0.53+

Seth Dobrin, IBM | IBM Data and AI Forum


 

>>live from Miami, Florida It's the Q covering. IBM is data in a I forum brought to you by IBM. >>Welcome back to the port of Miami, everybody. We're here at the Intercontinental Hotel. You're watching the Cube? The leader and I live tech covered set. Daubert is here. He's the vice president of data and I and a I and the chief data officer of cloud and cognitive software. And I'd be upset too. Good to see you again. >>Good. See, Dave, thanks for having me >>here. The data in a I form hashtag data. I I It's amazing here. 1700 people. Everybody's gonna hands on appetite for learning. Yeah. What do you see out in the marketplace? You know what's new since we last talked. >>Well, so I think if you look at some of the things that are really need in the marketplace, it's really been around filling the skill shortage. And how do you operationalize and and industrialize? You're a I. And so there's been a real need for things ways to get more productivity out of your data. Scientists not necessarily replace them. But how do you get more productivity? And we just released a few months ago, something called Auto A I, which really is, is probably the only tool out there that automates the end end pipeline automates 80% of the work on the Indian pipeline, but isn't a black box. It actually kicks out code. So your data scientists can then take it, optimize it further and understand it, and really feel more comfortable about it. >>He's got a eye for a eyes. That's >>exactly what is a eye for an eye. >>So how's that work? So you're applying machine intelligence Two data to make? Aye. Aye, more productive pick algorithms. Best fit. >>Yeah, So it does. Basically, you feed it your data and it identifies the features that are important. It does feature engineering for you. It does model selection for you. It does hyper parameter tuning and optimization, and it does deployment and also met monitors for bias. >>So what's the date of scientists do? >>Data scientist takes the code out the back end. And really, there's some tweaks that you know, the model, maybe the auto. Aye, aye. Maybe not. Get it perfect, Um, and really customize it for the business and the needs of the business. that the that the auto A I so they not understand >>the data scientist, then can can he or she can apply it in a way that is unique to their business that essentially becomes their I p. It's not like generic. Aye, aye for everybody. It's it's customized by And that's where data science to complain that I have the time to do this. Wrangling data >>exactly. And it was built in a combination from IBM Research since a great assets at IBM Research plus some cattle masters at work here at IBM that really designed and optimize the algorithm selection and things like that. And then at the keynote today, uh, wonderment Thompson was up there talking, and this is probably one of the most impactful use cases of auto. Aye, aye to date. And it was also, you know, my former team, the data science elite team, was engaged, but wonderment Thompson had this problem where they had, like, 17,000 features in their data sets, and what they wanted to do was they wanted to be able to have a custom solution for their customers. And so every time they get a customer that have to have a data scientist that would sit down and figure out what the right features and how the engineer for this customer. It was an intractable problem for them. You know, the person from wonderment Thompson have prevented presented today said he's been trying to solve this problem for eight years. Auto Way I, plus the data science elite team solve the form in two months, and after that two months, it went right into production. So in this case, oughta way. I isn't doing the whole pipeline. It's helping them identify the features and engineering the features that are important and giving them a head start on the model. >>What's the, uh, what's the acquisition bottle for all the way as a It's a license software product. Is it assassin part >>of Cloudpack for data, and it's available on IBM Cloud. So it's on IBM Cloud. You can use it paper use so you get a license as part of watching studio on IBM Cloud. If you invest in Cloudpack for data, it could be a perpetual license or committed term license, which essentially assassin, >>it's essentially a feature at dawn of Cloudpack for data. >>It's part of Cloudpack per day and you're >>saying it can be usage based. So that's key. >>Consumption based hot pack for data is all consumption based, >>so people want to use a eye for competitive advantage. I said by my open that you know, we're not marching to the cadence of Moore's Law in this industry anymore. It's a combination of data and then cloud for scale. So so people want competitive advantage. You've talked about some things that folks are doing to gain that competitive advantage. But the same time we heard from Rob Thomas that only about 4 to 10% penetration for a I. What? What are the key blockers that you see and how you're knocking them >>down? Well, I think there's. There's a number of key blockers, so one is of access to data, right? Cos have tons of data, but being able to even know what data is, they're being able to pull it all together and being able to do it in a way that is compliant with regulation because you got you can't do a I in a vacuum. You have to do it in the context of ever increasing regulation like GDP R and C, C, P A and all these other regulator privacy regulations that are popping up. So so that's that's really too so access to data and regulation can be blockers. The 2nd 1 or the 3rd 1 is really access to appropriate skills, which we talked a little bit about. Andi, how do you retrain, or how do you up skill, the talent you have? And then how do you actually bring in new talent that can execute what you want on then? Sometimes in some cos it's a lack of strategy with appropriate measurement, right? So what is your A II strategy, and how are you gonna measure success? And you and I have talked about this on Cuban on Cube before, where it's gotta measure your success in dollars and cents right cost savings, net new revenue. That's really all your CFO is care about. That's how you have to be able to measure and monitor your success. >>Yes. Oh, it's so that's that Last one is probably were where most organizations start. Let's prioritize the use cases of the give us the best bang for the buck, and then business guys probably get really excited and say Okay, let's go. But to up to truly operationalize that you gotta worry about these other things. You know, the compliance issues and you gotta have the skill sets. Yeah, it's a scale. >>And sometimes that's actually the first thing you said is sometimes a mistake. So focusing on the one that's got the most bang for the buck is not necessarily the best place to start for a couple of reasons. So one is you may not have the right data. It may not be available. It may not be governed properly. Number one, number two the business that you're building it for, may not be ready to consume it right. They may not be either bought in or the processes need to change so much or something like that, that it's not gonna get used. And you can build the best a I in the world. If it doesn't get used, it creates zero value, right? And so you really want to focus on for the first couple of projects? What are the one that we can deliver the best value, not Sarah, the most value, but the best value in the shortest amount of time and ensure that it gets into production because especially when you're starting off, if you don't show adoption, people are gonna lose interest. >>What are you >>seeing in terms of experimentation now in the customer base? You know, when you talk to buyers and you talk about, you know, you look at the I T. Spending service. People are concerned about tariffs. The trade will hurt the 2020 election. They're being a little bit cautious. But in the last two or three years have been a lot of experimentation going on. And a big part of that is a I and machine learning. What are you seeing in terms of that experimentation turning into actually production project that we can learn from and maybe do some new experiments? >>Yeah, and I think it depends on how you're doing the experiments. There's, I think there's kind of academic experimentation where you have data science, Sistine Data science teams that come work on cool stuff that may or may not have business value and may or may not be implemented right. They just kind of latch on. The business isn't really involved. They latch on, they do projects, and that's I think that's actually bad experimentation if you let it that run your program. The good experimentation is when you start identity having a strategy. You identify the use cases you want to go after and you experiment by leveraging, agile to deliver these methodologies. You deliver value in two weeks prints, and you can start delivering value quickly. You know, in the case of wonderment, Thompson again 88 weeks, four sprints. They got value. That was an experiment, right? That was an experiment because it was done. Agile methodologies using good coding practices using good, you know, kind of design up front practices. They were able to take that and put it right into production. If you're doing experimentation, you have to rewrite your code at the end. And it's a waste of time >>T to your earlier point. The moon shots are oftentimes could be too risky. And if you blow it on a moon shot, it could set you back years. So you got to be careful. Pick your spots, picked ones that maybe representative, but our lower maybe, maybe lower risk. Apply agile methodologies, get a quick return, learn, develop those skills, and then then build up to the moon ship >>or you break that moon shot down its consumable pieces. Right, Because the moon shot may take you two years to get to. But maybe there are sub components of that moon shot that you could deliver in 34 months and you start delivering knows, and you work up to the moon shot. >>I always like to ask the dog food in people. And I said, like that. Call it sipping your own champagne. What do you guys done internally? When we first met, it was and I think, a snowy day in Boston, right at the spark. Some it years ago. And you did a big career switch, and it's obviously working out for you, But But what are some of the things? And you were in part, brought in to help IBM internally as well as Interpol Help IBM really become data driven internally? Yeah. How has that gone? What have you learned? And how are you taking that to customers? >>Yeah, so I was hired three years ago now believe it was that long toe lead. Our internal transformation over the last couple of years, I got I don't want to say distracted there were really important business things I need to focus on, like gpr and helping our customers get up and running with with data science, and I build a data science elite team. So as of a couple months ago, I'm back, you know, almost entirely focused on her internal transformation. And, you know, it's really about making sure that we use data and a I to make appropriate decisions on DSO. Now we have. You know, we have an app on her phone that leverages Cognos analytics, where at any point, Ginny Rometty or Rob Thomas or Arvin Krishna can pull up and look in what we call E P M. Which is enterprise performance management and understand where the business is, right? What what do we do in third quarter, which just wrapped up what was what's the pipeline for fourth quarter? And it's at your fingertips. We're working on revamping our planning cycle. So today planning has been done in Excel. We're leveraging Planning Analytics, which is a great planning and scenario planning tool that with the tip of a button, really let a click of a button really let you understand how your business can perform in the future and what things need to do to get it perform. We're also looking across all of cloud and cognitive software, which data and A I sits in and within each business unit and cloud and cognitive software. The sales teams do a great job of cross sell upsell. But there's a huge opportunity of how do we cross sell up sell across the five different businesses that live inside of cloud and cognitive software. So did an aye aye hybrid cloud integration, IBM Cloud cognitive Applications and IBM Security. There's a lot of potential interplay that our customers do across there and providing a I that helps the sales people understand when they can create more value. Excuse me for our customers. >>It's interesting. This is the 10th year of doing the Cube, and when we first started, it was sort of the beginning of the the big data craze, and a lot of people said, Oh, okay, here's the disruption, crossing the chasm. Innovator's dilemma. All that old stuff going away, all the new stuff coming in. But you mentioned Cognos on mobile, and that's this is the thing we learned is that the key ingredients to data strategies. Comprised the existing systems. Yes. Throw those out. Those of the systems of record that were the single version of the truth, if you will, that people trusted you, go back to trust and all this other stuff built up around it. Which kind of created dissidents. Yeah. And so it sounds like one of the initiatives that you you're an IBM I've been working on is really bringing in the new pieces, modernizing sort of the existing so that you've got sort of consistent data sets that people could work. And one of the >>capabilities that really has enabled this transformation in the last six months for us internally and for our clients inside a cloud pack for data, we have this capability called IBM data virtualization, which we have all these independent sources of truth to stomach, you know? And then we have all these other data sources that may or may not be as trusted, but to be able to bring them together literally. With the click of a button, you drop your data sources in the Aye. Aye, within data. Virtualization actually identifies keys across the different things so you can link your data. You look at it, you check it, and it really enables you to do this at scale. And all you need to do is say, pointed out the data. Here's the I. P. Address of where the data lives, and it will bring that in and help you connect it. >>So you mentioned variances in data quality and consumer of the data has to have trust in that data. Can you use machine intelligence and a I to sort of give you a data confidence meter, if you will. Yeah. So there's two things >>that we use for data confidence. I call it dodging this factor, right. Understanding what the dodging this factor is of the data. So we definitely leverage. Aye. Aye. So a I If you have a date, a dictionary and you have metadata, the I can understand eight equality. And it can also look at what your data stewards do, and it can do some of the remediation of the data quality issues. But we all in Watson Knowledge catalog, which again is an in cloudpack for data. We also have the ability to vote up and vote down data. So as much as the team is using data internally. If there's a data set that had a you know, we had a hive data quality score, but it wasn't really valuable. It'll get voted down, and it will help. When you search for data in the system, it will sort it kind of like you do a search on the Internet and it'll it'll down rank that one, depending on how many down votes they got. >>So it's a wisdom of the crowd type of. >>It's a crowd sourcing combined with the I >>as that, in your experience at all, changed the dynamics of politics within organizations. In other words, I'm sure we've all been a lot of meetings where somebody puts foursome data. And if the most senior person in the room doesn't like the data, it doesn't like the implication he or she will attack the data source, and then the meeting's over and it might not necessarily be the best decision for the organization. So So I think it's maybe >>not the up, voting down voting that does that, but it's things like the E PM tool that I said we have here. You know there is a single source of truth for our finance data. It's on everyone's phone. Who needs access to it? Right? When you have a conversation about how the company or the division or the business unit is performing financially, it comes from E. P M. Whether it's in the Cognos app or whether it's in a dashboard, a separate dashboard and Cognos or is being fed into an aye aye, that we're building. This is the source of truth. Similarly, for product data, our individual products before me it comes from here's so the conversation at the senior senior meetings are no longer your data is different from my data. I don't believe it. You've eliminated that conversation. This is the data. This is the only data. Now you can have a conversation about what's really important >>in adult conversation. Okay, Now what are we going to do? It? It's >>not a bickering about my data versus your data. >>So what's next for you on? You know, you're you've been pulled in a lot of different places again. You started at IBM as an internal transformation change agent. You got pulled into a lot of customer situations because yeah, you know, you're doing so. Sales guys want to drag you along and help facilitate activity with clients. What's new? What's what's next for you. >>So really, you know, I've only been refocused on the internal transformation for a couple months now. So really extending IBM struck our cloud and cognitive software a data and a I strategy and starting to quickly implement some of these products, just like project. So, like, just like I just said, you know, we're starting project without even knowing what the prioritized list is. Intuitively, this one's important. The team's going to start working on it, and one of them is an aye aye project, which is around cross sell upsell that I mentioned across the portfolio and the other one we just got done talking about how in the senior leadership meeting for Claude Incognito software, how do we all work from a Cognos dashboard instead of Excel data data that's been exported put into Excel? The challenge with that is not that people don't trust the data. It's that if there's a question you can't drill down. So if there's a question about an Excel document or a power point that's up there, you will get back next meeting in a month or in two weeks, we'll have an e mail conversation about it. If it's presented in a really live dashboard, you can drill down and you can actually answer questions in real time. The value of that is immense, because now you as a leadership team, you can make a decision at that point and decide what direction you're going to do. Based on data, >>I said last time I have one more questions. You're CDO but you're a polymath on. So my question is, what should people look for in a chief data officer? What sort of the characteristics in the attributes, given your >>experience, that's kind of a loaded question, because there is. There is no good job, single job description for a chief date officer. I think there's a good solid set of skill sets, the fine for a cheap date officer and actually, as part of the chief data officer summits that you you know, you guys attend. We had were having sessions with the chief date officers, kind of defining a curriculum for cheap date officers with our clients so that we can help build the chief. That officer in the future. But if you look a quality so cheap, date officer is also a chief disruption officer. So it needs to be someone who is really good at and really good at driving change and really good at disrupting processes and getting people excited about it changes hard. People don't like change. How do you do? You need someone who can get people excited about change. So that's one thing. On depending on what industry you're in, it's got to be. It could be if you're in financial or heavy regulated industry, you want someone that understands governance. And that's kind of what Gardner and other analysts call a defensive CDO very governance Focus. And then you also have some CDOs, which I I fit into this bucket, which is, um, or offensive CDO, which is how do you create value from data? How do you caught save money? How do you create net new revenue? How do you create new business models, leveraging data and a I? And now there's kind of 1/3 type of CDO emerging, which is CDO not as a cost center but a studio as a p N l. How do you generate revenue for the business directly from your CDO office. >>I like that framework, right? >>I can't take credit for it. That's Gartner. >>Its governance, they call it. We say he called defensive and offensive. And then first time I met Interpol. He said, Look, you start with how does data affect the monetization of my organization? And that means making money or saving money. Seth, thanks so much for coming on. The Cube is great to see you >>again. Thanks for having me >>again. All right, Keep it right to everybody. We'll be back at the IBM data in a I form from Miami. You're watching the Cube?

Published Date : Oct 22 2019

SUMMARY :

IBM is data in a I forum brought to you by IBM. Good to see you again. What do you see out in the marketplace? And how do you operationalize and and industrialize? He's got a eye for a eyes. So how's that work? Basically, you feed it your data and it identifies the features that are important. And really, there's some tweaks that you know, the data scientist, then can can he or she can apply it in a way that is unique And it was also, you know, my former team, the data science elite team, was engaged, Is it assassin part You can use it paper use so you get a license as part of watching studio on IBM Cloud. So that's key. What are the key blockers that you see and how you're knocking them the talent you have? You know, the compliance issues and you gotta have the skill sets. And sometimes that's actually the first thing you said is sometimes a mistake. You know, when you talk to buyers and you talk You identify the use cases you want to go after and you experiment by leveraging, And if you blow it on a moon shot, it could set you back years. Right, Because the moon shot may take you two years to And how are you taking that to customers? with the tip of a button, really let a click of a button really let you understand how your business And so it sounds like one of the initiatives that you With the click of a button, you drop your data sources in the Aye. to sort of give you a data confidence meter, if you will. So a I If you have a date, a dictionary and you have And if the most senior person in the room doesn't like the data, so the conversation at the senior senior meetings are no longer your data is different Okay, Now what are we going to do? a lot of customer situations because yeah, you know, you're doing so. So really, you know, I've only been refocused on the internal transformation for What sort of the characteristics in the attributes, given your And then you also have some CDOs, which I I I can't take credit for it. The Cube is great to see you Thanks for having me We'll be back at the IBM data in a I form from Miami.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
SethPERSON

0.99+

Arvin KrishnaPERSON

0.99+

IBMORGANIZATION

0.99+

DaubertPERSON

0.99+

BostonLOCATION

0.99+

Rob ThomasPERSON

0.99+

DavePERSON

0.99+

Ginny RomettyPERSON

0.99+

Seth DobrinPERSON

0.99+

IBM ResearchORGANIZATION

0.99+

two yearsQUANTITY

0.99+

MiamiLOCATION

0.99+

ExcelTITLE

0.99+

eight yearsQUANTITY

0.99+

88 weeksQUANTITY

0.99+

Rob ThomasPERSON

0.99+

GardnerPERSON

0.99+

SarahPERSON

0.99+

Miami, FloridaLOCATION

0.99+

34 monthsQUANTITY

0.99+

17,000 featuresQUANTITY

0.99+

two thingsQUANTITY

0.99+

10th yearQUANTITY

0.99+

two weeksQUANTITY

0.99+

1700 peopleQUANTITY

0.99+

GartnerORGANIZATION

0.99+

CognosTITLE

0.99+

three years agoDATE

0.99+

two monthsQUANTITY

0.99+

first timeQUANTITY

0.98+

oneQUANTITY

0.98+

todayDATE

0.98+

each businessQUANTITY

0.97+

first coupleQUANTITY

0.97+

InterpolORGANIZATION

0.96+

about 4QUANTITY

0.96+

ThompsonPERSON

0.96+

third quarterDATE

0.96+

five different businessesQUANTITY

0.95+

Two dataQUANTITY

0.95+

Intercontinental HotelORGANIZATION

0.94+

IBM DataORGANIZATION

0.94+

firstQUANTITY

0.93+

single jobQUANTITY

0.93+

first thingQUANTITY

0.92+

CognosORGANIZATION

0.91+

last couple of yearsDATE

0.91+

single sourceQUANTITY

0.89+

few months agoDATE

0.89+

one more questionsQUANTITY

0.89+

couple months agoDATE

0.88+

CloudpackTITLE

0.87+

single versionQUANTITY

0.87+

CubeCOMMERCIAL_ITEM

0.86+

80% ofQUANTITY

0.85+

last six monthsDATE

0.84+

Claude IncognitoORGANIZATION

0.84+

agileTITLE

0.84+

10%QUANTITY

0.84+

yearsDATE

0.84+

MooreORGANIZATION

0.82+

zeroQUANTITY

0.81+

three yearsQUANTITY

0.8+

2020 electionEVENT

0.8+

E PMTITLE

0.79+

four sprintsQUANTITY

0.79+

WatsonORGANIZATION

0.77+

2nd 1QUANTITY

0.75+

Cristina Pirola, Generali Assicurazioni & Leyla Delic, Coca Cola İçecek | UiPath FORWARD III 2019


 

>>Live from Las Vegas. It's the cube covering UI path forward Americas 2019. Brought to you by UI path. Hello everyone and welcome >>do the cubes live coverage of UI path forward. I'm your host, Rebecca Knight, co-hosting alongside of Dave Volante. We are joined by Layla Delage. She is the chief information and digital officer at Coca-Cola. ECEK thanks so much for coming on the show. Thank you. Great to be here. Very exciting. And also Christina Perala, she is the group RPA lead at Generali. Thank you so much for coming into, for inviting me. Thank you. So I want to hear from you both about what, what your industry is and what your role is. Level. Let's start with you. Okay, great. Um, so we are, um, one of the Rogers bottlers within the Coca-Cola system. Uh, we produce, distribute and sell Coca Cola company products. The operating around 10 countries are middle East and central Asia and parts of middle East, Pakistan, Syria and Turkey. They are actually born out of Turkey and that's where our central offices, um, we've operate with 26 plants, around 8,500 employees. >>Uh, we serve a consumer base of 400 million and we have around close to 1 billion, uh, customers. Uh, and we continue to invest in the countries where we operate. And my role is to film and my role is all things digital within this community. So leading technologists, leading technology, all things digital. Yes. So Christina, tell us about Generali. Generalia. Sikora Zuni is a leading insurance company as the presidency. Enough 50 countries worldwide and more than a 70,000 employees that were wider. So it's a bigger company, not only for insurance. And my role with the internet rally group is to leader the LPA program. So I'm inside of the group that I in digital. So am I inside this group, I'm very focused on smart process automation. So RPA plus AI, because a has a, we already know all I loudly, LPA without a AI is announcer nowadays. So we have to keep on talking about AI, machine learning algorithms to enrich, uh, uh, the capabilities of basic robotic sell, hand reach, also the Antwerp and automation of processes. You're the CIO and the CDO. Yes. Yes. That's unique. First of all, there's one that's unique too. It's even more unique than a woman has both roles. So what's the reason behind it? So, um, there's definitely a reason behind it. I joined the Coca Cola >>system about a year ago, so I'm just a over a year in the company. The reason actually I wanted to make sure that we highlight the CIO and CTO CDO role together is, um, I want to advocate for all the it organizations to transform and really get into the digital world and get into the world of advanced technologies, become strategic business partners. Get out of the kitchen, I call it kitchen kitchen, it, you know, get out of the managing of data centers or cloud and um, just the core foundational systems and applications. Get into the advanced technology, understand the business, gain business acumen and deliver solutions based on business needs. So to highlight that, I want to make sure that I hold the role of both and I'm able to be advocate of both worlds. Cause digital without it support is not able to accomplish what they need to accomplish and it needs to get into more of the digital space. And Christina, as the RPA, you write bots, you evangelize the organization. >>Um, mostly the second. So in generally we have a, a very, uh, so, uh, sort of ivory the organization. So for something we are very decentralized, for example, for the developing of robots or the deploying for the action, the operational stuff and so on. Uh, but uh, for some stuff like a guidelines, uh, uh, risk framework to ensure that robots can do their work in the right way with notice to all for the business processes, uh, for this stuff before guidelines, framework, best practice sharing. We are a central centralized, we, we try to be centralized. So, uh, my role is to try to collect is to collect and not try and super lat, uh, best practices and share with you in the companies chair, uh, um, the best use cases. And, uh, also tried to gather what are the main concerns, what are the difficulties in order to a facilitator and to boost smarter process automation of the option. So >>Laila, you are up on the main stage this morning. You, I Pat highlighted Coca Cola itchy as a, as a customer that is embraced automation, embrace the UI pass solution. So tell us a little bit about the challenges you are facing and then why you chose I a UI path. So as I joined the company, uh, I introduced a very strong digital strategy that required a lot of change and it's within a company that has been very successfully operating all these years and doing pretty much know what to do very well. And all of a sudden with digital we are starting to disrupt the, are trying to say, Hey, we've got to change the way, do some of the things. Um, so belief in digital and belief that it can really bring efficiency and outcomes was very important. And I needed a quick win. I needed to have a technology or a solution or an outcome that I would generate very quickly and show to the whole organization that this can be done and we can do this as Coca-Cola. TJ. >>So that was, that was RPA, that was our PA for this fascinates me because you're an incumbent business, been around for a long time. you're a bottler and distributor, right? So yeah, processes are around the bottling plants and the distribution system. Yes. And now you're transforming into a digital business. Yes. I'll put data at your core. Totally not start his daytime customer. Okay. So describe the difference between the traditional business and what it looks like when you've transformed, particularly from a data perspective. And then I want to understand what role RPA plays. So we are definitely a very data rich company, however, to call ourselves data rich and to call it a strategic asset, I first need to capture and control my data and I have to treat it like a strategic asset. So that is a huge transformation. The second, once you treat it as an asset, how do you generate more insights? >>And I call this augmenting the gut feeling. I have an amazing gut feeling in the company. How do I augment that with data and provide our, this is partners and then our customers and our suppliers and some of the information. And then obviously future maturity level is, you know, shared economy and data monetization, et cetera. So that's how I describe within the company. And then assets, other assets like our plants and coolers cooler, we call it cooler, you know, where do you actually see all our products? They are called, they are visible and they are available, but they are also in that set where I can turn them into a digital cooler and I can do so much more with the cooler that standing. And I recently, in one of our leadership meetings I said we have as many coolers as the um, population on the fishy Island, which is close to 1 million. >>So just imagine in this new world, in this digital era, everything that you can do by just having a cooler, 1 million coolers present out there on the street, I can serve the consumers, I can serve customers with very different information. So that's kind of what I mean by turning the business into a digital business. So that's an awesome story. By the way, how does RPA fit into that vision? RPA is everywhere in division. So I said when I started the journey, uh, any digital journey has some Muslim battles for me. There are four must win battles. I need to get certain things right in it, in the, and that was one, one of the Mustin battles was alteration. So we have to create efficiency, we have to optimize, we have to streamline. And we said automation first. Um, and we started with, I call it robotics and automation. >>And I agree with what you said, Christina. It's more than just robots. It's actually a strategic application. It could be a good old ERP. It's the RPA, it's AI, it's all the other technologies that are out there that they bring the two of them brings. So how do you create this end to end solution using all the trends, technologies to create optimization? Uh, our goal was how do we get back to our customer much faster. We had so many customer facing processes and they're going to be there forever. They are a very customer centric customer into company obviously. So how do I get back to my customer faster? How do I make my employees just happy? They were working on so many things would be until midnight over time during weekends. How do I take that away from them? So we called it lifting the weight of the shoulders and giving you a new capabilities. So again, augmentation and then giving them that space. So we had uh, three of my employees upskilled and reskilled themselves. They became a developers in the robotics space, a couple of fire functional, um, colleagues are now reskilling themselves because now they have the time to reskill. More importantly, they have the time to actually leverage their expertise and they are so much more motivated. The engagement, the employee engagement is increasing. So that's how we are positioning RPA. Pristina ICU >>nodding a lot, your head too. A lot of what Layla is saying. I'm wondering if you can talk to about any best practices that have emerged as you've implemented RPA at Generali to what you've learned. Yes, for sure. Um, we have a lot of processes automated, uh, all around the group. Uh, but we are not, we have not reached our maximum or, uh, benefits, uh, gaining. So what we need to do right now is to try to boost the smart process automation, uh, via analyzing the issue around value, Cena. So each business area of the value chain because currently we have countries that has, that have a different level of maturity. So, so some countries are at the very beginning and we have to help them with best practice sharings with a huge case, successful use cases. And we are, uh, we have a lot of help from parts into, in this because locally and who I Potter as a, a very strong presence and is very powerful in doing that. >>And, uh, now, uh, our next mouth are very focused on try to, um, uh, deep dive, the vertical, our area of the issue around value chain and identify which are the processes inside them are best to automated. Uh, uh, Basinger. Uh, these activities are not so you, I part, we'd, his experience has created a heat mapper, value chain Heath mapper. And so it's given up as some advice where to focus our strengths, our hand energy in automating. And I think that this is a very huge, uh, uh, support that you are UI parties given us. So it's not just a matter of, okay, let's start, uh, uh, do some, uh, process assessment in order to identify which processes are the best candidates to be automated. But, uh, we have, uh, how our back, uh, us. So we, we are, uh, we have the backing of UI pass saying it's better to do that and automate in depth, uh, processes of that, but Oh, the value chain. So we are starting a program to do that with all the countries or the vertical area of the country. So, and I think that this could really bring a, uh, high benefits and can, uh, uh, drive us to, uh, really having a scaling up in using a smart process, automation and UI. But you a bot ecosystem not only are, so >>one of the nice things about RPA is you can take the software robots and apply them to an existing process. A lot of times changing processes and a lot of times almost always changing processes is painful. However, we've talked to some customers that have said by applying RPA to our business, it's exposed some really bad processes. Have you experienced that and can you maybe share that experience with it? Absolutely. So for us, one of the initial, um, robots, we applied to a customer facing process. It was our field team trying to get back to our customer with a, with some information. And we realize that the, um, the cycle time was very long. And the reason is there are four functions involved in answering the question and seven different applications are being touched all the way from XL to ERP to CRM. So what we did obviously bringing a strategic solution to fix the cycle time and reduce that to streamline the process was going to take us long. So RPA was great help. We reduced the cycle time by putting a robot and we were able to get back to ours, priests, sales team in the field in matter of minutes. What used to take hours was now being responded to in minutes. Now that doesn't mean that process is perfect, but that's our next step. So we created value for our customer and our sales team within the field, um, before, you know, streamlining and going into a bigger initiatives. So then you could share Christina. >>Yes. Uh, so, um, it is necessary to automate something that could be automated. So, uh, it is necessarily to out optimize the process before automating it, but sometimes it's better to automate it as Caesar because, uh, also the not optimize the process can bring value if ultimated. So let me share an example. If you, for example, have to migrate some data obviously is a one shot, uh, uh, activity. But with the robot you can do it in a very short, well sharp timer. Maybe it's not the best, uh, process to be automated, but that could be useful as well. So it's always a matter of understanding the costs and the benefits. Uh, and sometimes, uh, FBA is very quickly, is very quick to be implemented and can be, can have a, also a lot of savings instead of integrating instead of doing more complex things. >>And then other things, uh, that it's important to take into account is that, uh, uh, after having a automating goal, all the low hanging fruits and so the processes with a low cost, uh, uh, low complexity and high benefits, uh, then it starts to facer when it's necessary to understand how to the end to end processes. Because, uh, it happens, uh, in, uh, some of our countries that, uh, the second phase is very difficult because, uh, the situation is that you have very, um, a lot of very fermented processes. And so before automating it is necessary to apply operational efficiency methodology, lean six Sigma, rare business process for engineering and then automate it. So it's a longer trip. And our Amer as group head office in general is to give these kinds of methodologies and best practices for all kinds of level of maturity in our countries. So finally, w what is the customer is the employee response then in terms of how you're talking a lot about streamlining, getting rid of these tedious tasks that took forever, how, how our employees reacting to the implementation. >>So we, um, we actually launched the, uh, announce announced RPA robotics and automation with a Hekaton in our company. And we invited 40 colleagues from various functions and two and everybody from the business was there and they participated actually in gathering ideas and prioritizing what matters most to the company. And we looked at customer, we looked at compliance, we look to the employee and we actually with during the hackathon you iPad team helped us to go live with one of the robots. They were mesmerized. They couldn't believe that this could happen. I think that's where we kind of engaged them and now going forward everyone who generate the idea was part of the building of the robots so they continue to be engaged to me allowed them to name the robots so they start naming and once the robots were alive yet literally had some of our teams who are dancing from happiness and I think that said it all. That was the strongest voice of our business partner and we published that video. So our business partners became our advocates and that's really our how we born the robotic and automation within CCI. We have so many advocates right now they are coming to us. Our business partners are coming to us with more use cases and they are actually, they are sharing with rest of the system within Coca-Cola and with the group that we are part of locally in Turkey, they are sharing their stories. So now we have a hype going on in the system. >>Yes. And in generally, um, at the beginning, uh, we face some fears in our employees fears of losing their job, but fear is not be able to use this kind of technology. Uh, but, uh, also with the help of HR because I, Charlie is, uh, driving a huge program of upskilling and reskilling of people. Uh, nowadays, uh, also hand user are very happy to use robotics, uh, because, uh, uh, when they realize that they can really help in their activities, in their very boring and not useful activities, they are very happy to enjoy this, this program. But it is so, uh, it, it was a trip, a journey with the employees to make them understand that it's not something that, uh, is affecting their job. So, at least in generally group, we are, we are programming, uh, these, uh, uh, or employees, uh, journey in order to make them, uh, uh, to have more, uh, uh, awareness about robotics and not be scared about it. Layla and Christina, thank you both so much for coming on the cube. It was wonderful. Thank you very much for you. I'm Rebecca Knight for Dave Volante. Please stay tuned for more of the cubes live coverage of UI path forward.

Published Date : Oct 15 2019

SUMMARY :

Brought to you by UI path. So I want to hear from you both about what, what your industry is and what your role is. So we have to keep on talking about AI, And Christina, as the RPA, you write So in generally we have a, So as I joined the company, uh, I introduced a So describe the difference between the traditional in one of our leadership meetings I said we have as many coolers as the So we have to create efficiency, So that's how we are positioning RPA. the very beginning and we have to help them with best practice sharings with a huge So we are starting So we created value for our customer and our sales team within the field, Uh, and sometimes, uh, FBA is very quickly, the end to end processes. So now we have a hype going on in the system. the beginning, uh, we face some fears in our employees fears

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Rebecca KnightPERSON

0.99+

ChristinaPERSON

0.99+

Christina PeralaPERSON

0.99+

LaylaPERSON

0.99+

Layla DelagePERSON

0.99+

TurkeyLOCATION

0.99+

Coca-ColaORGANIZATION

0.99+

Cristina PirolaPERSON

0.99+

LailaPERSON

0.99+

Coca ColaORGANIZATION

0.99+

SyriaLOCATION

0.99+

26 plantsQUANTITY

0.99+

twoQUANTITY

0.99+

Las VegasLOCATION

0.99+

Dave VolantePERSON

0.99+

threeQUANTITY

0.99+

Leyla DelicPERSON

0.99+

40 colleaguesQUANTITY

0.99+

CharliePERSON

0.99+

second phaseQUANTITY

0.99+

PakistanLOCATION

0.99+

GeneraliORGANIZATION

0.99+

one shotQUANTITY

0.99+

seven different applicationsQUANTITY

0.99+

bothQUANTITY

0.99+

secondQUANTITY

0.99+

GeneraliaORGANIZATION

0.99+

Generali AssicurazioniORGANIZATION

0.99+

both rolesQUANTITY

0.99+

firstQUANTITY

0.98+

both worldsQUANTITY

0.98+

iPadCOMMERCIAL_ITEM

0.98+

oneQUANTITY

0.98+

around 8,500 employeesQUANTITY

0.98+

CCIORGANIZATION

0.98+

FirstQUANTITY

0.97+

1 million coolersQUANTITY

0.97+

400 millionQUANTITY

0.97+

central AsiaLOCATION

0.97+

more than a 70,000 employeesQUANTITY

0.96+

RogersORGANIZATION

0.96+

around 10 countriesQUANTITY

0.95+

each businessQUANTITY

0.95+

PristinaPERSON

0.94+

middle EastLOCATION

0.94+

2019DATE

0.94+

Coca Cola İçecekORGANIZATION

0.93+

four functionsQUANTITY

0.92+

fishy IslandLOCATION

0.87+

aroundQUANTITY

0.86+

CaesarPERSON

0.85+

this morningDATE

0.85+

BasingerPERSON

0.84+

four must win battlesQUANTITY

0.84+

AmerORGANIZATION

0.82+

ECEKORGANIZATION

0.82+

HekatonORGANIZATION

0.81+

over a yearQUANTITY

0.8+

about a year agoDATE

0.78+

AntwerpLOCATION

0.78+

1 millionQUANTITY

0.77+

Enough 50 countriesQUANTITY

0.76+

close to 1 billionQUANTITY

0.74+

Sikora ZuniORGANIZATION

0.68+

UiPathORGANIZATION

0.67+

CenaPERSON

0.67+

MuslimOTHER

0.65+

FORWARD IIITITLE

0.63+

CDOORGANIZATION

0.62+

RPATITLE

0.56+

Show Wrap | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's three Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back. We're here to wrap up the M I T. Chief data officer officer, information quality. It's hashtag m i t CDO conference. You're watching the Cube. I'm David Dante, and Paul Gill is my co host. This is two days of coverage. We're wrapping up eyes. Our analysis of what's going on here, Paul, Let me let me kick it off. When we first started here, we talked about that are open. It was way saw the chief data officer role emerged from the back office, the information quality role. When in 2013 the CEO's that we talked to when we asked them what was their scope. We heard things like, Oh, it's very wide. Involves analytics, data science. Some CEOs even said Oh, yes, security is actually part of our purview because all the cyber data so very, very wide scope. Even in some cases, some of the digital initiatives were sort of being claimed. The studios were staking their claim. The reality was the CDO also emerged out of highly regulated industries financialservices healthcare government. And it really was this kind of wonky back office role. And so that's what my compliance, that's what it's become again. We're seeing that CEOs largely you're not involved in a lot of the emerging. Aye, aye initiatives. That's what we heard, sort of anecdotally talking to various folks At the same time. I feel as though the CDO role has been more fossilized than it was before. We used to ask, Is this role going to be around anymore? We had C I. Ose tell us that the CEO Rose was going to disappear, so you had both ends of the spectrum. But I feel as though that whatever it's called CDO Data's our chief analytics off officer, head of data, you know, analytics and governance. That role is here to stay, at least for for a fair amount of time and increasingly, issues of privacy and governance. And at least the periphery of security are gonna be supported by that CD a role. So that's kind of takeaway Number one. Let me get your thoughts. >> I think there's a maturity process going on here. What we saw really in 2016 through 2018 was, ah, sort of a celebration of the arrival of the CDO. And we're here, you know, we've got we've got power now we've got an agenda. And that was I mean, that was a natural outcome of all this growth and 90% of organizations putting sea Dios in place. I think what you're seeing now is a realization that Oh, my God, this is a mess. You know what I heard? This year was a lot less of this sort of crowing about the ascendance of sea Dios and Maura about We've got a big integration problem of big data cleansing problem, and we've got to get our hands down to the nitty gritty. And when you talk about, as you said, we had in here so much this year about strategic initiatives, about about artificial intelligence, about getting involved in digital business or customer experience transformation. What we heard this year was about cleaning up data, finding the data that you've got organizing it, applying meditator, too. It is getting in shape to do something with it. There's nothing wrong with that. I just think it's part of the natural maturation process. Organizations now have to go through Tiu to the dirty process of cleaning up this data before they can get to the next stage, which was a couple of three years out for most of >> the second. Big theme, of course. We heard this from the former head of analytics. That G s K on the opening keynote is the traditional methods have failed the the Enterprise Data Warehouse, and we've actually studied this a lot. You know, my analogy is often you snake swallowing a basketball, having to build cubes. E D W practitioners would always used to call it chasing the chips until we come up with a new chip. Oh, we need that because we gotta run faster because it's taking us hours and hours, weeks days to run these analytics. So that really was not an agile. It was a rear view mirror looking thing. And Sarbanes Oxley saved the E. D. W. Business because reporting became part of compliance thing perspective. The master data management piece we've heard. Do you consistently? We heard Mike Stone Breaker, who's obviously a technology visionary, was right on. It doesn't scale through this notion of duping. Everything just doesn't work and manually creating rules. It's just it's just not the right approach. This we also heard the top down data data enterprise data model doesn't works too complicated, can operationalize it. So what they do, they kick the can to governance. The Duke was kind of a sidecar, their big data that failed to live up to its promises. And so it's It's a big question as to whether or not a I will bring that level of automation we heard from KPMG. Certainly, Mike Stone breaker again said way heard this, uh, a cz well, from Andy Palmer. They're using technology toe automate and scale that big number one data science problem, which is? They spend all their time wrangling data. We'll see if that if that actually lives up >> to his probable is something we did here today from several of our guests. Was about the promise of machine learning to automate this day to clean up process and as ah Mark Ramsay kick off the conference saying that all of these efforts to standardize data have failed in the past. This does look, He then showed how how G s K had used some of the tools that were represented here using machine learning to actually clean up the data at G S. K. So there is. And I heard today a lot of optimism from the people we talked to about the capability of Chris, for example, talking about the capability of machine learning to bring some order to solve this scale scale problem Because really organizing data creating enterprise data models is a scale problem, and the only way you can solve that it's with with automation, Mike Stone breaker is right on top of that. So there was optimism at this event. There was kind of an ooh, kind of, ah, a dismay at seeing all the data problems they have to clean up, but also promised that tools are on the way that could do that. >> Yeah, The reason I'm an optimist about this role is because data such a hard problem. And while there is a feeling of wow, this is really a challenge. There's a lot of smart people here who are up for the challenge and have the d n a for it. So the role, that whole 360 thing. We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, which is really bringing machine intelligence to the table. We haven't heard that as much at this event. It's now front and center. It's just another example of a I injecting itself into virtually every aspect every corner of the industry. And again, I often jokes. Same wine, new bottle. Our industry has a habit of doing that, but it's cyclical, but it is. But we seem to be making consistent progress. >> And the machine learning, I thought was interesting. Several very guest spoke to machine learning being applied to the plumbing projects right now to cleaning up data. Those are really self contained projects. You can manage those you can. You can determine out test outcomes. You can vet the quality of the of the algorithms. It's not like you're putting machine learning out there in front of the customer where it could potentially do some real damage. There. They're vetting their burning in machine, learning in a environment that they control. >> Right, So So, Amy, Two solid days here. I think that this this conference has really grown when we first started here is about 130 people, I think. And now it was 500 registrants. This'd year. I think 600 is the sort of the goal for next year. Moving venues. The Cube has been covering this all but one year since 2013. Hope to continue to do that. Paul was great working with you. Um, always great work. I hope we can, uh we could do more together. We heard the verdict is bringing back its conference. You put that together. So we had column. Mahoney, um, had the vertical rock stars on which was fun. Com Mahoney, Mike Stone breaker uh, Andy Palmer and Chris Lynch all kind of weighed in, which was great to get their perspectives kind of the days of MPP and how that's evolved improving on traditional relational database. And and now you're Stone breaker. Applying all these m i. Same thing with that scale with Chris Lynch. So it's fun to tow. Watch those guys all Boston based East Coast folks some news. We just saw the news hit President Trump holding up jet icon contractors is we've talked about. We've been following that story very closely and I've got some concerns over that. It's I think it's largely because he doesn't like Bezos in The Washington Post Post. Exactly. You know, here's this you know, America first. The Pentagon says they need this to be competitive with China >> and a I. >> There's maybe some you know, where there's smoke. There's fire there, so >> it's more important to stick in >> the eye. That's what it seems like. So we're watching that story very closely. I think it's I think it's a bad move for the executive branch to be involved in those type of decisions. But you know what I know? Well, anyway, Paul awesome working with you guys. Thanks. And to appreciate you flying out, Sal. Good job, Alex Mike. Great. Already wrapping up. So thank you for watching. Go to silicon angle dot com for all the news. Youtube dot com slash silicon angles where we house our playlist. But the cube dot net is the main site where we have all the events. It will show you what's coming up next. We've got a bunch of stuff going on straight through the summer. And then, of course, VM World is the big kickoff for the fall season. Goto wicked bond dot com for all the research. We're out. Thanks for watching Dave. A lot day for Paul Gillon will see you next time.

Published Date : Aug 1 2019

SUMMARY :

Brought to you by in 2013 the CEO's that we talked to when we asked them what was their scope. And that was I mean, And Sarbanes Oxley saved the E. data models is a scale problem, and the only way you can solve that it's with with automation, We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, And the machine learning, I thought was interesting. We just saw the news hit President Trump holding up jet icon contractors There's maybe some you know, where there's smoke. And to appreciate you flying out, Sal.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Andy PalmerPERSON

0.99+

David DantePERSON

0.99+

Chris LynchPERSON

0.99+

ChrisPERSON

0.99+

2013DATE

0.99+

PaulPERSON

0.99+

Paul GillPERSON

0.99+

Mike StonePERSON

0.99+

2016DATE

0.99+

Paul GillonPERSON

0.99+

Mike Stone BreakerPERSON

0.99+

Silicon Angle MediaORGANIZATION

0.99+

2018DATE

0.99+

RosePERSON

0.99+

Alex MikePERSON

0.99+

BezosPERSON

0.99+

G s KORGANIZATION

0.99+

MahoneyPERSON

0.99+

BostonLOCATION

0.99+

KPMGORGANIZATION

0.99+

90%QUANTITY

0.99+

SalPERSON

0.99+

third pieceQUANTITY

0.99+

DavePERSON

0.99+

500 registrantsQUANTITY

0.99+

two daysQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

todayDATE

0.99+

next yearDATE

0.99+

Mark RamsayPERSON

0.99+

360QUANTITY

0.99+

this yearDATE

0.99+

MauraPERSON

0.99+

G S. K.ORGANIZATION

0.98+

YoutubeORGANIZATION

0.98+

AmyPERSON

0.98+

PentagonORGANIZATION

0.98+

C I. OsePERSON

0.98+

Sarbanes OxleyPERSON

0.97+

firstQUANTITY

0.97+

This yearDATE

0.96+

one yearQUANTITY

0.96+

Mike Stone breakerPERSON

0.95+

Enterprise Data WarehouseORGANIZATION

0.95+

DiosPERSON

0.94+

Two solid daysQUANTITY

0.94+

secondQUANTITY

0.94+

three yearsQUANTITY

0.92+

about 130 peopleQUANTITY

0.91+

600QUANTITY

0.9+

DukeORGANIZATION

0.89+

VM WorldEVENT

0.88+

dot comORGANIZATION

0.85+

ChinaORGANIZATION

0.84+

E. D. W.ORGANIZATION

0.83+

CubeORGANIZATION

0.8+

MITORGANIZATION

0.77+

East CoastLOCATION

0.75+

M I T.PERSON

0.75+

2019DATE

0.74+

President TrumpPERSON

0.71+

both endsQUANTITY

0.71+

threeQUANTITY

0.68+

M I T.EVENT

0.64+

cube dot netORGANIZATION

0.59+

ChiefPERSON

0.58+

The Washington Post PostTITLE

0.57+

AmericaORGANIZATION

0.56+

Goto wickedORGANIZATION

0.54+

CEOPERSON

0.54+

coupleQUANTITY

0.54+

CDOORGANIZATION

0.45+

StonePERSON

0.43+

CDOIQTITLE

0.24+

Bob Parr & Sreekar Krishna, KPMG US | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to Cambridge, Massachusetts. Everybody watching the Cuban leader live tech coverage. We here covering the M I t CDO conference M I t CEO Day to wrapping up. Bob Parr is here. He's a partner in principle at KPMG, and he's joined by Streetcar Krishna, who is the managing director of data science. Aye, aye. And innovation at KPMG. Gents, welcome to the Cube. Thank >> thank you. Let's start with your >> roles. So, Bob, where do you focus >> my focus? Ah, within KPMG, we've got three main business lines audit tax, an advisory. And so I'm the advisory chief date officer. So I'm more focused on how we use data competitively in the market. More the offense side of our focus. So, you know, how do we make sure that our teams have the data they need to deliver value? Uh, much as possible working concert with the enterprise? CDO uh, who's more focused on our infrastructure, Our standards, security, privacy and those >> you've focused on making KPMG better A >> supposed exactly clients. OK, >> I also have a second hat, and I also serve financial service is si Dios as well. So Okay, so >> get her out of a dual role. I got sales guys in >> streetcar. What was your role? >> Yeah, You know, I focus a lot on data science, artificial intelligence and overall innovation s o my reaction. I actually represent a centre of >> excellence within KPMG that focuses on the I machine learning natural language processing. And I work with Bob's Division to actually advance the data site off the store because all the eye needs data. And without data, there's no algorithms, So we're focusing a lot on How do we use a I to make data Better think about their equality. Think about data lineage. Think about all of the problems that data has. How can we make it better using algorithms? And I focused a lot on that working with Bob, But no, it's it's customers and internal. I mean, you know, I were a horizontal within the form, So we help customers. We help internal, we focus a lot on the market. >> So, Bob, you mentioned used data offensively. So 10 12 years ago, it was data was a liability. You had to get rid of it. Keep it no longer than you had to, because you're gonna get soon. So email archives came in and obviously thinks flipped after the big data. But so what do you What are you seeing in terms of that shift from From the defense data to the offensive? >> Yeah, and it's it's really you know, when you think about it and let me define sort of offense versus defense. Who on the defense side, historically, that's where most of CEOs have played. That's risk regulatory reporting, privacy, um, even litigation support those types of activities today. Uh, and really, until about a year and 1/2 ago, we really saw most CEOs still really anchored in that I run a forum with a number of studios and financial service is, and every year we get them together and asked him the same set of questions. This was the first year where they said that you know what my primary focus now is. Growth. It's bringing efficiency is trying to generate value on the offensive side. It's not like the regulatory work's going away, certainly in the face of some of the pending privacy regulation. But you know, it's It's a sign that the volume of use cases as the investments in their digital transformations are starting to kick out, as well as the volumes of data that are available. The raw material that's available to them in terms of third party data in terms of the the just the general volumes that that exist that are streaming into the organization and the overall literacy in the business units are creating this, this massive demand. And so they're having to >> respond because of getting a handle on the data they're actually finding. Word is, they're categorizing it there, there, >> yeah, organizing that. That is still still a challenge. Um, I think it's better with when you have a very narrow scope of critical data elements going back to the structure data that we're talking it with the regulatory reporting when you start to get into the three offense, the generating value, getting the customer experience, you know, really exploring. You know that side of it. There's there's a ton of new muscle that has to be built new muscle in terms of data quality, new muscle in terms of um, really more scalable operating model. I think that's a big issue right now with Si Dios is, you know, we've got ah, we're used to that limited swath of CDs and they've got Stewardship Network. That's very labor intensive. A lot of manual processes still, um, and and they have some good basic technology, but it's a lot of its rules based. And when you do you think about those how that constraints going to scale when you have all of this demand. You know, when you look at the customer experience analytics that they want to do when you look at, you know, just a I applied to things like operations. The demand on the focus there is is is gonna start to create a fundamental shift >> this week are one of things that I >> have scene, and maybe it's just my small observation space. But I wonder, if you could comment Is that seems like many CBO's air not directly involved in the aye aye initiatives. Clearly, the chief digital officer is involved, but the CDO zehr kind of, you know, in the background still, you see that? >> That's a fantastic question, and I think this is where we're seeing some off the cutting it change that is happening in the industry. And when Barbara presenter idea that we can often civilly look at data, this is what it is that studios for a long time have become more reactive in their roles. And that is that is starting to come forefront now. So a lot of institutions were working with are asking What's the next generation Roll off a CDO and why are they in the background and why are they not in the foreground? And this is when you become more often they were proactive with data and the digital officers are obviously focused on, you know, the transformation that has to happen. But the studios are their backbone in order to make the transformation. Really. And if the CDO started, think about their data as an asset did as a product did us a service. The judicial officers are right there because those are the real, you know, like the data data they're living so CDO can really become from my back office to really become a business line. We've >> seen taking the reins in machine learning in machine learning projects and cos you work with. Who >> was driving that? Yeah. Great question. So we are seeing, like, you know, different. I would put them in buckets, right? There is no one mortal fits all. We're seeing different generations within the company's. Some off. The ones were just testing out the market. There's two keeping it in their technology space in their back office. Take idea and, you know, in in forward I d let me call them where they are starting to experiment with this. But you see, the mature organizations on the other end of the spectrum, they are integrating action, learning and a I right into the business line because they want to see ex souls having the technology right by their side so they can lead leverage. Aye, aye. And machine learning spot right for the business right there. And that is where we're seeing know some of the new models. Come on. >> I think the big shift from a CDO perspective is using a i to prep data for a That's that's fundamentally where you know, where the data science was distributed. Some of that data science has to come back and free the integration for equality for data prepping because you've got all this data third party and other from customer streaming into the organization. And you know, the work that you're doing around, um, anomaly detection is it transcends developing the rules, doing the profiling, doing the rules. You know, the very manual, the very labor intensive process you've got to get away from that >> is used in order for this to be scale goes and a I to figure out which out goes to apply t >> clean to prepare the data toe, see what algorithms we can use. So it's basically what we're calling a eye for data rather than just data leading into a I. So it's I mean, you know, you developed a technology for one off our clients and pretty large financial service. They were getting closer, like 1,000,000,000 data points every day. And there was no way manually, you could go through the same quality controls and all of those processes. So we automated it through algorithms, and these algorithms are learning the behavior of data as they flow into the organization, and they're able to proactively tell their problems are starting very much. And this is the new face that we see in in the industry, you cannot scale the traditional data governance using manual processes, we have to go to the next generation where a i natural language processing and think about on structure data, right? I mean, that is, like 90% off. The organization is unstructured data, and we have not talked about data quality. We have not talked about data governance. For a lot of these sources of information, now is the time. Hey, I can do it. >> And I think that raised a great question. If you look at unstructured and a lot of the data sources, as you start to take more of an offensive stance will be unstructured. And the data quality, what it means to apply data quality isn't the the profiling and the rules generation the way you would with standard data. So the teams, the skills that CEOs have in their organizations, have to change. You have to start to, and, you know, it's a great example where, you know, you guys were ingesting documents and there was handwriting all over the documents, you know, and >> yeah, you know, you're a great example, Bob. Like you no way would ask the client, like, you know, is this document gonna scanned into the system so my algorithm can run and they're like, Yeah, everything is good. I mean, the deal is there, but when you then start scanning it, you realize there's handwriting and the information is in the handwriting. So all the algorithms breakdown now >> tribal knowledge striving Exactly. >> Exactly. So that's what we're seeing. You know, if I if we talk about the digital transformation in data in the city organization, it is this idea dart. Nothing is left unseen. Some algorithm or some technology, has seen everything that is coming into. The organization has has has a para 500. So you can tell you where the problems are. And this is what algorithms do. This scale beautifully. >> So the data quality approaches are evolving, sort of changing. So rather than heavy, heavy emphasis on masking or duplication and things like that, you would traditionally think of participating the difficult not that that goes away. But it's got to evolve to use machine >> intelligence. Exactly what kind of >> skill sets people need thio achieve that Is it Is it the same people or do we need to retrain them or bring in new skills. >> Yeah, great question. And I can talk from the inspector off. Where is disrupting every industry now that we know, right? But we knew when you look at what skills are >> required, all of the eye, including natural language processing, machine learning, still require human in the loop. And >> that is the training that goes in there. And who do you who are the >> people who have that knowledge? It is the business analyst. It's the data analyst who are the knowledge betters the C suite and the studios. They are able to make decisions. But the day today is still with the data analyst. >> Those s Emmys. Those sm >> means So we have to obscure them to really start >> interacting with these new technologies where they are the leaders, rather than just waiting for answers to come through. And >> when that happens now being as a data scientist, my job is easy because they're Siamese, are there? I deploy the technology. They're semi's trained algorithms on a regular basis. Then it is a fully fungible model which is evolving with the business. And no longer am I spending time re architect ing my rules. And like my, you know, what are the masking capabilities I need to have? It is evolving us. >> Does that change the >> number one problem that you hear from data scientists, which is the 80% of the time >> spent on wrangling cleaning data 10 15 20% run into sm. He's being concerned that they're gonna be replaced by the machine. Their training. >> I actually see them being really enabled now where they're spending 80% of the time doing boring job off, looking at data. Now they're spending 90% of their time looking at the elements future creative in which requires human intelligence to say, Hey, this is different because off X, >> y and Z so let's let's go out. It sounds like a lot of what machine learning is being used for now in your domain is clean things up its plumbing. It's basic foundation work. So go out. Three years after all that work has been done and the data is clean. Where are your clients talking about going next with machine learning? Bob, did you want? >> I mean, it's a whole. It varies by by industry, obviously, but, um but it covers the gamut from, you know, and it's generally tied to what's driving their strategies. So if you look at a financial service is organization as an example today, you're gonna have, you know, really a I driving a lot of the behind the scenes on the customer experience. It's, you know, today with your credit card company. It's behind the scenes doing fraud detection. You know, that's that's going to continue. So it's take the critical functions that were more data. It makes better models that, you know, that that's just going to explode. And I think they're really you can look across all the functions, from finance to to marketing to operations. I mean, it's it's gonna be pervasive across, you know all of that. >> So if I may, I don't top award. While Bob was saying, I think what's gonna what What our clients are asking is, how can I exhilarate the decision making? Because at the end of the day on Lee, all our leaders are focused on making decisions, and all of this data science is leading up to their decision, and today you see like you know what you brought up, like 80% of the time is wasted in cleaning the data. So only 20% time was spent in riel experimentation and analytics. So your decision making time was reduced to 20% off the effort that I put in the pipeline. What if now I can make it 80% of the time? They're I put in the pipeline, better decisions are gonna come on the train. So when I go into a meeting and I'm saying like, Hey, can you show me what happened in this particular region or in this particular part of the country? Previously, it would have been like, Oh, can you come back in two weeks? I will have the data ready, and I will tell you the answer. But in two weeks, the business has ran away and the CDO know or the C Street doesn't require the same answer. But where we're headed as as the data quality improves, you can get to really time questions and decisions. >> So decision, sport, business, intelligence. Well, we're getting better. Isn't interesting to me. Six months to build a cube, we'd still still not good enough. Moving too fast. As the saying goes, data is plentiful. Insights aren't Yes, you know, in your view, well, machine intelligence. Finally, close that gap. Get us closer to real time decision >> making. It will eventually. But there's there's so much that we need to. Our industry needs to understand first, and it really ingrained. And, you know, today there is still a fundamental trust issues with a I you know, it's we've done a lot of work >> watch Black box or a part of >> it. Part of it. I think you know, the research we've done. And some of this is nine countries, 2400 senior executives. And we asked some, ah, a lot of questions around their data and trusted analytics, and 92% of them came back with. They have some fundamental trust issues with their data and their analytics and and they feel like there's reputational risk material reputational risk. This isn't getting one little number wrong on one of the >> reports about some more of an >> issue, you know, we also do a CEO study, and we've done this many years in a row going back to 2017. We started asked them okay, making a lot of companies their data driven right. When it comes to >> what they say they're doing well, They say they're day driven. That's the >> point. At the end of the day, they making strategic decisions where you have an insight that's not intuitive. Do you trust your gut? Go with the analytics back then. You know, 67% said they go with their gut, So okay, this is 2017. This industry's moving quickly. There's tons and tons of investment. Look at it. 2018 go down. No, went up 78%. So it's not aware this issue there is something We're fundamentally wrong and you hit it on. It's a part of its black box, and part of it's the date equality and part of its bias. And there's there's all of these things flowing around it. And so when we dug into that, we said, Well, okay, if that exists, how are we going to help organizations get their arms around this issue and start digging into that that trust issue and really it's the front part is, is exactly what we're talking about in terms of data quality, both structured more traditional approaches and unstructured, using the handwriting example in those types of techniques. But then you get into the models themselves, and it's, you know, the critical thing she had to worry about is, you know, lineage. So from an integrity perspective, where's the data coming from? Whether the sources for the change controls on some of that, they need to look at explain ability, gain at the black box part where you can you tell me the inferences decisions are those documented. And this is important for this me, the human in the loop to get confidence in the algorithm as well as you know, that executive group. So they understand there's a structure set of processes around >> Moneyball. Problem is actually pretty confined. It's pretty straightforward. Dono 32 teams are throwing minor leagues, but the data models pretty consistent through the problem with organizations is I didn't know data model is consistent with the organization you mentioned, Risk Bob. The >> other problem is organizational inertia. If they don't trust it, what is it? What is a P and l manage to do when he or she wants to preserve? Yeah, you know, their exit position. They attacked the data. You know, I don't believe that well, which which is >> a fundamental point, which is culture. Yes. I mean, you can you can have all the data, science and all the governance that you want. But if you don't work culture in parallel with all this, it's it's not gonna stick. And and that's, I think the lot of the leading organisations, they're starting to really dig into this. We hear a lot of it literacy. We hear a lot about, you know, top down support. What does that really mean? It means, you know, senior executives are placing bats around and linking demonstrably linking the data and the role of data days an asset into their strategies and then messaging it out and being specific around the types of investments that are going to reinforce that business strategy. So that's absolutely critical. And then literacy absolutely fundamental is well, because it's not just the executives and the data scientists that have to get this. It's the guy in ops that you're trying to get you. They need to understand, you know, not only tools, but it's less about the tools. But it's the techniques, so it's not. The approach is being used, are more transparent and and that you know they're starting to also understand, you know, the issues of privacy and data usage rights. That's that's also something that we can't leave it the curb. With all this >> innovation, it's also believing that there's an imperative. I mean, there's a lot of for all the talk about digital transformation hear it everywhere. Everybody's trying to get digital, right? But there's still a lot of complacency in the organization in the lines of business in operation to save. We're actually doing really well. You know, we're in financial service is health care really hasn't been disrupted. This is Oh, it's coming, it's coming. But there's still a lot of I'll be retired by then or hanging. Actually, it's >> also it's also the fact that, you know, like in the previous generation, like, you know, if I had to go to a shopping, I would go into a shop and if I wanted by an insurance product, I would call my insurance agent. But today the New world, it's just a top off my screen. I have to go from Amazon, so some other some other app, and this is really this is what is happening to all of our kind. Previously that they start their customers, pocketed them in different experience. Buckets. It's not anymore that's real in front of them. So if you don't get into their digital transformation, a customer is not going to discount you by saying, Oh, you're not Amazon. So I'm not going to expect that you're still on my phone and you're only two types of here, so you have to become really digital >> little surprises that you said you see the next. The next stage is being decision support rather than customer experience, because we hear that for CEOs, customer experience is top of mind right now. >> No natural profile. There are two differences, right? One is external facing is absolutely the customer internal facing. It's absolutely the decision making, because that's how they're separating. The internal were, says the external, and you know most of the meetings that we goto Customer insight is the first place where analytics is starting where data is being cleaned up. Their questions are being asked about. Can I master my customer records? Can I do a good master off my vendor list? That is where they start. But all of that leads to good decision making to support the customers. So it's like that external towards internal view well, back >> to the offense versus defense and the shift. I mean, it absolutely is on the offense side. So it is with the customer, and that's a more directly to the business strategy. So it's get That's the area that's getting the money, the support and people feel like it's they're making an impact with it there. When it's it's down here in some admin area, it's below the water line, and, you know, even though it's important and it flows up here, it doesn't get the VIN visibility. So >> that's great conversation. You coming on? You got to leave it there. Thank you for watching right back with our next guest, Dave Lot. Paul Gillen from M I t CDO I Q Right back. You're watching the Cube

Published Date : Aug 1 2019

SUMMARY :

Brought to you by We here covering the M I t CDO conference M I t CEO Day to wrapping Let's start with your So, Bob, where do you focus And so I'm the advisory chief date officer. I also have a second hat, and I also serve financial service is si Dios as well. I got sales guys in What was your role? Yeah, You know, I focus a lot on data science, artificial intelligence and I mean, you know, I were a horizontal within the form, So we help customers. seeing in terms of that shift from From the defense data to the offensive? Yeah, and it's it's really you know, when you think about it and let me define sort of offense versus respond because of getting a handle on the data they're actually finding. getting the customer experience, you know, really exploring. if you could comment Is that seems like many CBO's air not directly involved in And this is when you become more often they were proactive with data and the digital officers seen taking the reins in machine learning in machine learning projects and cos you work with. So we are seeing, like, you know, different. And you know, the work that you're doing around, um, anomaly detection is So it's I mean, you know, you developed a technology for one off our clients and pretty and the rules generation the way you would with standard data. I mean, the deal is there, but when you then start scanning it, So you can tell you where the problems are. So the data quality approaches are evolving, Exactly what kind of do we need to retrain them or bring in new skills. And I can talk from the inspector off. machine learning, still require human in the loop. And who do you who are the But the day today is still with the data Those s Emmys. And And like my, you know, what are the masking capabilities I need to have? He's being concerned that they're gonna be replaced by the machine. 80% of the time doing boring job off, looking at data. the data is clean. And I think they're really you and all of this data science is leading up to their decision, and today you see like you know what you brought Insights aren't Yes, you know, fundamental trust issues with a I you know, it's we've done a lot of work I think you know, the research we've done. issue, you know, we also do a CEO study, and we've done this many years That's the in the algorithm as well as you know, that executive group. is I didn't know data model is consistent with the organization you mentioned, Yeah, you know, science and all the governance that you want. the organization in the lines of business in operation to save. also it's also the fact that, you know, like in the previous generation, little surprises that you said you see the next. The internal were, says the external, and you know most of the meetings it's below the water line, and, you know, even though it's important and it flows up here, Thank you for

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
BarbaraPERSON

0.99+

KPMGORGANIZATION

0.99+

BobPERSON

0.99+

20%QUANTITY

0.99+

AmazonORGANIZATION

0.99+

90%QUANTITY

0.99+

80%QUANTITY

0.99+

Bob ParrPERSON

0.99+

2017DATE

0.99+

Silicon Angle MediaORGANIZATION

0.99+

Dave LotPERSON

0.99+

2018DATE

0.99+

67%QUANTITY

0.99+

nine countriesQUANTITY

0.99+

92%QUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

2400 senior executivesQUANTITY

0.99+

Six monthsQUANTITY

0.99+

three offenseQUANTITY

0.99+

firstQUANTITY

0.99+

Paul GillenPERSON

0.99+

LeePERSON

0.99+

todayDATE

0.99+

78%QUANTITY

0.99+

Sreekar KrishnaPERSON

0.99+

two typesQUANTITY

0.99+

OneQUANTITY

0.98+

32 teamsQUANTITY

0.98+

second hatQUANTITY

0.98+

Three yearsQUANTITY

0.98+

two differencesQUANTITY

0.98+

10DATE

0.98+

bothQUANTITY

0.97+

twoQUANTITY

0.97+

two weeksQUANTITY

0.97+

this weekDATE

0.96+

oneQUANTITY

0.95+

M I t CDOEVENT

0.95+

C StreetORGANIZATION

0.93+

M I t CEO DayEVENT

0.93+

Streetcar KrishnaPERSON

0.92+

about a year andDATE

0.91+

2019DATE

0.9+

CubanOTHER

0.9+

CBOORGANIZATION

0.88+

first yearQUANTITY

0.88+

Si DiosORGANIZATION

0.87+

12 years agoDATE

0.86+

10QUANTITY

0.84+

RiskPERSON

0.81+

1,000,000,000 data pointsQUANTITY

0.8+

CDOTITLE

0.8+

ParrPERSON

0.79+

CubeORGANIZATION

0.79+

1/2 agoDATE

0.78+

CDOORGANIZATION

0.78+

tons andQUANTITY

0.76+

dualQUANTITY

0.72+

15QUANTITY

0.71+

DonoORGANIZATION

0.7+

one little numberQUANTITY

0.69+

MITORGANIZATION

0.67+

threeQUANTITY

0.64+

500OTHER

0.63+

boxTITLE

0.61+

M I T.EVENT

0.6+

Cube BobORGANIZATION

0.59+

Robert Abate, Global IDS | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (futuristic music) >> Welcome back to Cambridge, Massachusetts everybody. You're watching theCUBE, the leader in live tech coverage. We go out to the events and we extract the signal from the noise. This is day two, we're sort of wrapping up the Chief Data Officer event. It's MIT CDOIQ, it started as an information quality event and with the ascendancy of big data the CDO emerged and really took center stage here. And it's interesting to know that it's kind of come full circle back to information quality. People are realizing all this data we have, you know the old saying, garbage in, garbage out. So the information quality worlds and this chief data officer world have really come colliding together. Robert Abate is here, he's the Vice President and CDO of Global IDS and also the co-chair of next year's, the 14th annual MIT CDOIQ. Robert, thanks for coming on. >> Oh, well thank you. >> Now you're a CDO by background, give us a little history of your career. >> Sure, sure. Well I started out with an Electrical Engineering degree and went into applications development. By 2000, I was leading the Ralph Lauren's IT, and I realized when Ralph Lauren hired me, he was getting ready to go public. And his problem was he had hired eight different accounting firms to do eight different divisions. And each of those eight divisions were reporting a number, but the big number didn't add up, so he couldn't go public. So he searched the industry to find somebody who could figure out the problem. Now I was, at the time, working in applications and had built this system called Service Oriented Architectures, a way of integrating applications. And I said, "Well I don't know if I could solve the problem, "but I'll give it a shot." And what I did was, just by taking each silo as it's own problem, which was what EID Accounting Firm had done, I was able to figure out that one of Ralph Lauren's policies was if you buy a garment, you can return it anytime, anywhere, forever, however long you own it. And he didn't think about that, but what that meant is somebody could go to a Bloomingdale's, buy a garment and then go to his outlet store and return it. Well, the cross channels were different systems. So the outlet stores were his own business, retail was a different business, there was a completely different, each one had their own AS/400, their own data. So what I quickly learned was, the problem wasn't the systems, the problem was the data. And it took me about two months to figure it out and he offered me a job, he said well, I was a consultant at the time, he says, "I'm offering you a job, you're going to run my IT." >> Great user experience but hard to count. >> (laughs) Hard to count. So that's when I, probably 1999 was when that happened. I went into data and started researching-- >> Sorry, so how long did it take you to figure that out? You said a couple of months? >> A couple of months, I think it was about two months. >> 'Cause jeez, it took Oracle what, 10 years to build Fusion with SOA? That's pretty good. (laughs) >> This was a little bit of luck. When we started integrating the applications we learned that the messages that we were sending back and forth didn't match, and we said, "Well that's impossible, it can't not match." But what didn't match was it was coming from one channel and being returned in another channel, and the returns showed here didn't balance with the returns on this side. So it was a data problem. >> So a forensics showdown. So what did you do after? >> After that I went into ICICI Bank which was a large bank in India who was trying to integrate their systems, and again, this was a data problem. But they heard me giving a talk at a conference on how SOA had solved the data challenge, and they said, "We're a bank with a wholesale, a retail, "and other divisions, "and we can't integrate the systems, can you?" I said, "Well yeah, I'd build a website "and make them web services and now what'll happen is "each of those'll kind of communicate." And I was at ICICI Bank for about six months in Mumbai, and finished that which was a success, came back and started consulting because now a lot of companies were really interested in this concept of Service Oriented Architectures. Back then when we first published on it, myself, Peter Aiken, and a gentleman named Joseph Burke published on it in 1996. The publisher didn't accept the book, it was a really interesting thing. We wrote the book called, "Services Based Architectures: A Way to Integrate Systems." And the way Wiley & Sons, or most publishers work is, they'll have three industry experts read your book and if they don't think what you're saying has any value, they, forget about it. So one guy said this is brilliant, one guy says, "These guys don't know what they're talking about," and the third guy says, "I don't even think what they're talking about is feasible." So they decided not to publish. Four years later it came back and said, "We want to publish the book," and Peter said, "You know what, they lost their chance." We were ahead of them by four years, they didn't understand the technology. So that was kind of cool. So from there I went into consulting, eventually took a position as the Head of Enterprise and Director of Enterprise Information Architecture with Walmart. And Walmart, as you know, is a huge entity, almost the size of the federal government. So to build an architecture that integrates Walmart would've been a challenge, a behemoth challenge, and I took it on with a phenomenal team. >> And when was this, like what timeframe? >> This was 2010, and by the end of 2010 we had presented an architecture to the CIO and the rest of the organization, and they came back to me about a week later and said, "Look, everybody agrees what you did was brilliant, "but nobody knows how to implement it. "So we're taking you away, "you're no longer Director of Information Architecture, "you're now Director of Enterprise Information Management. "Build it. "Prove that what you say you could do, you could do." So we built something called the Data CAFE, and CAFE was an acronym, it stood for: Collaborative Analytics Facility for the Enterprise. What we did was we took data from one of the divisions, because you didn't want to take on the whole beast, boil the ocean. We picked Sam's Club and we worked with their CFO, and because we had information about customers we were able to build a room with seven 80 inch monitors that surrounded anyone in the room. And in the center was the Cisco telecommunications so you could be a part of a meeting. >> The TelePresence. >> TelePresence. And we built one room in one facility, and one room in another facility, and we labeled the monitors, one red, one blue, one green, and we said, "There's got to be a way where we can build "data science so it's interactive, so somebody, "an executive could walk into the room, "touch the screen, and drill into features. "And in another room "the features would be changing simultaneously." And that's what we built. The room was brought up on Black Friday of 2013, and we were able to see the trends of sales on the East Coast that we quickly, the executives in the room, and these are the CEO of Walmart and the heads of Sam's Club and the like, they were able to change the distribution in the Mountain Time Zone and west time zones because of the sales on the East Coast gave them the idea, well these things are going to sell, and these things aren't. And they saw a tremendous increase in productivity. We received the 2014, my team received the 2014 Walmart Innovation Project of the Year. >> And that's no slouch. Walmart has always been heavily data-oriented. I don't know if it's urban legend or not, but the famous story in the '80s of the beer and the diapers, right? Walmart would position beer next to diapers, why would they do that? Well the father goes in to buy the diapers for the baby, picks up a six pack while he's on the way, so they just move those proximate to each other. (laughs) >> In terms of data, Walmart really learned that there's an advantage to understanding how to place items in places that, a path that you might take in a store, and knowing that path, they actually have a term for it, I believe it's called, I'm sorry, I forgot the name but it's-- >> Selling more stuff. (laughs) >> Yeah, it's selling more stuff. It's the way you position items on a shelf. And Walmart had the brilliance, or at least I thought it was brilliant, that they would make their vendors the data champion. So the vendor, let's say Procter & Gamble's a vendor, and they sell this one product the most. They would then be the champion for that aisle. Oh, it's called planogramming. So the planogramming, the way the shelves were organized, would be set up by Procter & Gamble for that entire area, working with all their other vendors. And so Walmart would give the data to them and say, "You do it." And what I was purporting was, well, we shouldn't just be giving the data away, we should be using that data. And that was the advent of that. From there I moved to Kimberly-Clark, I became Global Director of Enterprise Data Management and Analytics. Their challenge was they had different teams, there were four different instances of SAP around the globe. One for Latin America, one for North America called the Enterprise Edition, one for EMEA, Europe, Middle East, and Africa, and one for Asia-Pacific. Well when you have four different instances of SAP, that means your master data doesn't exist because the same thing that happens in this facility is different here. And every company faces this challenge. If they implement more than one of a system the specialty fields get used by different companies in different ways. >> The gold standard, the gold version. >> The golden version. So I built a team by bringing together all the different international teams, and created one team that was able to integrate best practices and standards around data governance, data quality. Built BI teams for each of the regions, and then a data science and advanced analytics team. >> Wow, so okay, so that makes you uniquely qualified to coach here at the conference. >> Oh, I don't know about that. (laughs) There are some real, there are some geniuses here. >> No but, I say that because these are your peeps. >> Yes, they are, they are. >> And so, you're a practitioner, this conference is all about practitioners talking to practitioners, it's content-heavy, There's not a lot of fluff. Lunches aren't sponsored, there's no lanyard sponsor and it's not like, you know, there's very subtle sponsor desks, you have to have sponsors 'cause otherwise the conference's not enabled, and you've got costs associated with it. But it's a very intimate event and I think you guys want to keep it that way. >> And I really believe you're dead-on. When you go to most industry conferences, the industry conferences, the sponsors, you know, change the format or are heavily into the format. Here you have industry thought leaders from all over the globe. CDOs of major Fortune 500 companies who are working with their peers and exchanging ideas. I've had conversations with a number of CDOs and the thought leadership at this conference, I've never seen this type of thought leadership in any conference. >> Yeah, I mean the percentage of presentations by practitioners, even when there's a vendor name, they have a practitioner, you know, internal practitioner presenting so it's 99.9% which is why people attend. We're moving venues next year, I understand. Just did a little tour of the new venue, so, going to be able to accommodate more attendees, so that's great. >> Yeah it is. >> So what are your objectives in thinking ahead a year from now? >> Well, you know, I'm taking over from my current peer, Dr. Arka Mukherjee, who just did a phenomenal job of finding speakers. People who are in the industry, who are presenting challenges, and allowing others to interact. So I hope could do a similar thing which is, find with my peers people who have real world challenges, bring them to the forum so they can be debated. On top of that, there are some amazing, you know, technology change is just so fast. One of the areas like big data I remember only five years ago the chart of big data vendors maybe had 50 people on it, now you would need the table to put all the vendors. >> Who's not a data vendor, you know? >> Who's not a data vendor? (laughs) So I would think the best thing we could do is, is find, just get all the CDOs and CDO-types into a room, and let us debate and talk about these points and issues. I've seen just some tremendous interactions, great questions, people giving advice to others. I've learned a lot here. >> And how about long term, where do you see this going? How many CDOs are there in the world, do you know? Is that a number that's known? >> That's a really interesting point because, you know, only five years ago there weren't that many CDOs to be called. And then Gartner four years ago or so put out an article saying, "Every company really should have a CDO." Not just for the purpose of advancing your data, and to Doug Laney's point that data is being monetized, there's a need to have someone responsible for information 'cause we're in the Information Age. And a CIO really is focused on infrastructure, making sure I've got my PCs, making sure I've got a LAN, I've got websites. The focus on data has really, because of the Information Age, has turned data into an asset. So organizations realize, if you utilize that asset, let me reverse this, if you don't use data as an asset, you will be out of business. I heard a quote, I don't know if it's true, "Only 10 years ago, 250 of the Fortune 10 no longer exists." >> Yeah, something like that, the turnover's amazing. >> Many of those companies were companies that decided not to make the change to be data-enabled, to make data decision processing. Companies still use data warehouses, they're always going to use them, and a warehouse is a rear-view mirror, it tells you what happened last week, last month, last year. But today's businesses work forward-looking. And just like driving a car, it'd be really hard to drive your car through a rear-view mirror. So what companies are doing today are saying, "Okay, let's start looking at this as forward-looking, "a prescriptive and predictive analytics, "rather than just what happened in the past." I'll give you an example. In a major company that is a supplier of consumer products, they were leading in the industry and their sales started to drop, and they didn't know why. Well, with a data science team, we were able to determine by pulling in data from the CDC, now these are sources that only 20 years ago nobody ever used to bring in data in the enterprise, now 60% of your data is external. So we brought in data from the CDC, we brought in data on maternal births from the national government, we brought in data from the Census Bureau, we brought in data from sources of advertising and targeted marketing towards mothers. Pulled all that data together and said, "Why are diaper sales down?" Well they were targeting the large regions of the country and putting ads in TV stations in New York and California, big population centers. Birth rates in population centers have declined. Birth rates in certain other regions, like the south, and the Bible Belt, if I can call it that, have increased. So by changing the marketing, their product sales went up. >> Advertising to Texas. >> Well, you know, and that brings to one of the points, I heard a lecture today about ethics. We made it a point at Walmart that if you ran a query that reduced a result to less than five people, we wouldn't allow you to see the result. Because, think about it, I could say, "What is my neighbor buying? "What are you buying?" So there's an ethical component to this as well. But that, you know, data is not political. Data is not chauvinistic. It doesn't discriminate, it just gives you facts. It's the interpretation of that that is hard CDOs, because we have to say to someone, "Look, this is the fact, and your 25 years "of experience in the business, "granted, is tremendous and it's needed, "but the facts are saying this, "and that would mean that the business "would have to change its direction." And it's hard for people to do, so it requires that. >> So whether it's called the chief data officer, whatever the data czar rubric is, the head of analytics, there's obviously the data quality component there whatever that is, this is the conference for, as I called them, your peeps, for that role in the organization. People often ask, "Will that role be around?" I think it's clear, it's solidifying. Yes, you see the chief digital officer emerging and there's a lot of tailwinds there, but the information quality component, the data architecture component, it's here to stay. And this is the premiere conference, the premiere event, that I know of anyway. There are a couple of others, perhaps, but it's great to see all the success. When I first came here in 2013 there were probably about 130 folks here. Today, I think there were 500 people registered almost. Next year, I think 600 is kind of the target, and I think it's very reasonable with the new space. So congratulations on all the success, and thank you for stepping up to the co-chair role, I really appreciate it. >> Well, let me tell you I thank you guys. You provide a voice at these IT conferences that we really need, and that is the ability to get the message out. That people do think and care, the industry is not thoughtless and heartless. With all the data breaches and everything going on there's a lot of fear, fear, loathing, and anticipation. But having your voice, kind of like ESPN and a sports show, gives the technology community, which is getting larger and larger by the day, a voice and we need that so, thank you. >> Well thank you, Robert. We appreciate that, it was great to have you on. Appreciate the time. >> Great to be here, thank you. >> All right, and thank you for watching. We'll be right back with out next guest as we wrap up day two of MIT CDOIQ. You're watching theCUBE. (futuristic music)

Published Date : Aug 1 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and also the co-chair of next year's, give us a little history of your career. So he searched the industry to find somebody (laughs) Hard to count. 10 years to build Fusion with SOA? and the returns showed here So what did you do after? and the third guy says, And in the center was the Cisco telecommunications and the heads of Sam's Club and the like, Well the father goes in to buy the diapers for the baby, (laughs) So the planogramming, the way the shelves were organized, and created one team that was able to integrate so that makes you uniquely qualified to coach here There are some real, there are some geniuses here. and it's not like, you know, the industry conferences, the sponsors, you know, Yeah, I mean the percentage of presentations by One of the areas like big data I remember just get all the CDOs and CDO-types into a room, because of the Information Age, and the Bible Belt, if I can call it that, have increased. It's the interpretation of that that is hard CDOs, the data architecture component, it's here to stay. and that is the ability to get the message out. We appreciate that, it was great to have you on. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
PeterPERSON

0.99+

WalmartORGANIZATION

0.99+

Peter AikenPERSON

0.99+

Robert AbatePERSON

0.99+

RobertPERSON

0.99+

Procter & GambleORGANIZATION

0.99+

CiscoORGANIZATION

0.99+

IndiaLOCATION

0.99+

MumbaiLOCATION

0.99+

Census BureauORGANIZATION

0.99+

2010DATE

0.99+

1996DATE

0.99+

New YorkLOCATION

0.99+

last weekDATE

0.99+

last yearDATE

0.99+

last monthDATE

0.99+

60%QUANTITY

0.99+

BloomingdaleORGANIZATION

0.99+

Next yearDATE

0.99+

1999DATE

0.99+

TexasLOCATION

0.99+

25 yearsQUANTITY

0.99+

10 yearsQUANTITY

0.99+

one roomQUANTITY

0.99+

2014DATE

0.99+

2013DATE

0.99+

Doug LaneyPERSON

0.99+

Sam's ClubORGANIZATION

0.99+

ICICI BankORGANIZATION

0.99+

99.9%QUANTITY

0.99+

Wiley & SonsORGANIZATION

0.99+

50 peopleQUANTITY

0.99+

Arka MukherjeePERSON

0.99+

next yearDATE

0.99+

JosPERSON

0.99+

TodayDATE

0.99+

third guyQUANTITY

0.99+

2000DATE

0.99+

todayDATE

0.99+

oneQUANTITY

0.99+

500 peopleQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

one channelQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

OracleORGANIZATION

0.99+

eachQUANTITY

0.99+

OneQUANTITY

0.99+

CDCORGANIZATION

0.99+

less than five peopleQUANTITY

0.99+

Ralph LaurenORGANIZATION

0.99+

one guyQUANTITY

0.99+

six packQUANTITY

0.99+

ESPNORGANIZATION

0.99+

four years agoDATE

0.98+

AfricaLOCATION

0.98+

SOATITLE

0.98+

five years agoDATE

0.98+

CaliforniaLOCATION

0.98+

GartnerORGANIZATION

0.98+

three industry expertsQUANTITY

0.98+

Global IDSORGANIZATION

0.98+

Four years laterDATE

0.98+

600QUANTITY

0.98+

20 years agoDATE

0.98+

East CoastLOCATION

0.98+

250QUANTITY

0.98+

Middle EastLOCATION

0.98+

four yearsQUANTITY

0.98+

one teamQUANTITY

0.97+

monthsQUANTITY

0.97+

firstQUANTITY

0.97+

about two monthsQUANTITY

0.97+

Latin AmericaLOCATION

0.97+

Michael Stonebraker, TAMR | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to Cambridge, Massachusetts. Everybody, You're watching the Cube, the leader in live tech coverage, and we're covering the M I t CDO conference M I t. CDO. My name is David Monty in here with my co host, Paul Galen. Mike Stone breakers here. The legend is founder CTO of Of Tamer, as well as many other companies. Inventor Michael. Thanks for coming back in the Cube. Good to see again. Nice to be here. So this is kind of ah, repeat pattern for all of us. We kind of gather here in August that the CDO conference You're always the highlight of the show. You gave a talk this week on the top 10. Big data mistakes. You and I are one of the few. You were the few people who still use the term big data. I happen to like it. Sad that it's out of vogue already, but people associated with the doo doop it's kind of waning, but regardless, so welcome. How'd the talk go? What were you talking about. >> So I talked to a lot of people who were doing analytics. We're doing operation Offer operational day of data at scale, and they always make most of them make a collection of bad mistakes. And so the talk waas a litany of the blunders that I've seen people make, and so the audience could relate to the blunders about most. Most of the enterprise is represented. Make a bunch of the blunders. So I think no. One blunder is not planning on moving most everything to the cloud. >> So that's interesting, because a lot of people would would would love to debate that, but and I would imagine you probably could have done this 10 years ago in a lot of the blunders would be the same, but that's one that wouldn't have been there. But so I tend to agree. I was one of the two hands that went up this morning, and vocalist talk when he asked, Is the cloud cheaper for us? It is anyway. But so what? Why should everybody move everything? The cloud aren't there laws of physics, laws of economics, laws of the land that suggest maybe you >> shouldn't? Well, I guess 22 things and then a comment. First thing is James Hamilton, who's no techies. Techie works for Amazon. We know James. So he claims that he could stand up a server for 25% of your cost. I have no reason to disbelieve him. That number has been pretty constant for a few years, so his cost is 1/4 of your cost. Sooner or later, prices are gonna reflect costs as there's a race to the bottom of cloud servers. So >> So can I just stop you there for a second? Because you're some other date on that. All you have to do is look at a W S is operating margin and you'll see how profitable they are. They have software like economics. Now we're deploying servers. So sorry to interrupt, but so carry. So >> anyway, sooner or later, they're gonna have their gonna be wildly cheaper than you are. The second, then yet is from Dave DeWitt, whose database wizard. And here's the current technology that that Microsoft Azure is using. As of 18 months ago, it's shipping containers and parking lots, chilled water in power in Internet, Ian otherwise sealed roof and walls optional. So if you're doing raised flooring in Cambridge versus I'm doing shipping containers in the Columbia River Valley, who's gonna be a lot cheaper? And so you know the economies of scale? I mean, that, uh, big, big cloud guys are building data centers as fast as they can, using the cheapest technology around. You put up the data center every 10 years on dhe. You do it on raised flooring in Cambridge. So sooner or later, the cloud guys are gonna be a lot cheaper. And the only thing that isn't gonna the only thing that will change that equation is For example, my lab is up the street with Frank Gehry building, and we have we have an I t i t department who runs servers in Cambridge. Uh, and they claim they're cheaper than the cloud. And they don't pay rent for square footage and they don't pay for electricity. So yeah, if if think externalities, If there are no externalities, the cloud is assuredly going to be cheaper. And then the other thing is that most everybody tonight that I talk thio including me, has very skewed resource demands. So in the cloud finding three servers, except for the last day of the month on the last day of the month. I need 20 servers. I just do it. If I'm doing on Prem, I've got a provision for peak load. And so again, I'm just way more expensive. So I think sooner or later these combinations of effects was going to send everybody to the cloud for most everything, >> and my point about the operating margins is difference in price and cost. I think James Hamilton's right on it. If he If you look at the actual cost of deploying, it's even lower than the price with the market allows them to their growing at 40 plus percent a year and a 35 $40,000,000,000 run rate company sooner, Sooner or >> later, it's gonna be a race to the lot of you >> and the only guys are gonna win. You have guys have the best cost structure. A >> couple other highlights from your talk. >> Sure, I think 2nd 2nd thing like Thio Thio, no stress is that machine learning is going to be a game is going to be a game changer for essentially everybody. And not only is it going to be autonomous vehicles. It's gonna be automatic. Check out. It's going to be drone delivery of most everything. Uh, and so you can, either. And it's gonna affect essentially everybody gonna concert of, say, categorically. Any job that is easy to understand is going to get automated. And I think that's it's gonna be majorly impactful to most everybody. So if you're in Enterprise, you have two choices. You can be a disrupt or or you could be a disruptive. And so you can either be a taxi company or you can be you over, and it's gonna be a I machine learning that's going going to be determined which side of that equation you're on. So I was a big blunder that I see people not taking ml incredibly seriously. >> Do you see that? In fact, everyone I talked who seems to be bought in that this is we've got to get on the bandwagon. Yeah, >> I'm just pointing out the obvious. Yeah, yeah, I think, But one that's not quite so obvious you're is a lot of a lot of people I talked to say, uh, I'm on top of data science. I've hired a group of of 10 data scientists, and they're doing great. And when I talked, one vignette that's kind of fun is I talked to a data scientist from iRobot, which is the guys that have the vacuum cleaner that runs around your living room. So, uh, she said, I spend 90% of my time locating the data. I want to analyze getting my hands on it and cleaning it, leaving the 10% to do data science job for which I was hired. Of the 10% I spend 90% fixing the data cleaning errors in my data so that my models work. So she spends 99% of her time on what you call data preparation 1% of her time doing the job for which he was hired. So data science is not about data science. It's about data integration, data cleaning, data, discovery. >> But your new latest venture, >> so tamer does that sort of stuff. And so that's But that's the rial data science problem. And a lot of people don't realize that yet, And, uh, you know they will. I >> want to ask you because you've been involved in this by my count and starting up at least a dozen companies. Um, 99 Okay, It's a lot. >> It's not overstated. You estimated high fall. How do you How >> do you >> decide what challenge to move on? Because they're really not. You're not solving the same problems. You're You're moving on to new problems. How do you decide? What's the next thing that interests you? Enough to actually start a company. Okay, >> that's really easy. You know, I'm on the faculty of M i t. My job is to think of news new ship and investigate it, and I come up. No, I'm paid to come up with new ideas, some of which have commercial value, some of which don't and the ones that have commercial value, like, commercialized on. So it's whatever I'm doing at the time on. And that's why all the things I've commercialized, you're different >> s so going back to tamer data integration platform is a lot of companies out there claim to do it day to get integration right now. What did you see? What? That was the deficit in the market that you could address. >> Okay, great question. So there's the traditional data. Integration is extract transforming load systems and so called Master Data management systems brought to you by IBM in from Attica. Talent that class of folks. So a dirty little secret is that that technology does not scale Okay, in the following sense that it's all well, e t l doesn't scale for a different reason with an m d l e t l doesn't scale because e t. L is based on the premise that somebody really smart comes up with a global data model For all the data sources you want put together. You then send a human out to interview each business unit to figure out exactly what data they've got and then how to transform it into the global data model. How to load it into your data warehouse. That's very human intensive. And it doesn't scale because it's so human intensive. So I've never talked to a data warehouse operator who who says I integrate the average I talk to says they they integrate less than 10 data sources. Some people 20. If you twist my arm hard, I'll give you 50. So a Here. Here's a real world problem, which is Toyota Motor Europe. I want you right now. They have a distributor in Spain, another distributor in France. They have a country by country distributor, sometimes canton by Canton. Distribute distribution. So if you buy a Toyota and Spain and move to France, Toyota develops amnesia. The French French guys know nothing about you. So they've got 250 separate customer databases with 40,000,000 total records in 50 languages. And they're in the process of integrating that. It was single customer database so that they can Duke custom. They could do the customer service we expect when you cross cross and you boundary. I've never seen an e t l system capable of dealing with that kind of scale. E t l dozen scale to this level of problem. >> So how do you solve that problem? >> I'll tell you that they're a tamer customer. I'll tell you all about it. Let me first tell you why MGM doesn't scare. >> Okay. Great. >> So e t l says I now have all your data in one place in the same format, but now you've got following problems. You've got a d duplicated because if if I if I bought it, I bought a Toyota in Spain, I bought another Toyota in France. I'm both databases. So if you want to avoid double counting customers, you got a dupe. Uh, you know, got Duke 30,000,000 records. And so MGM says Okay, you write some rules. It's a rule based technology. So you write a rule. That's so, for example, my favorite example of a rule. I don't know if you guys like to downhill downhill skiing, All right? I love downhill skiing. So ski areas, Aaron, all kinds of public databases assemble those all together. Now you gotta figure out which ones are the same the same ski area, and they're called different names in different addresses and so forth. However, a vertical drop from bottom to the top is the same. Chances are they're the same ski area. So that's a rule that says how to how to put how to put data together in clusters. And so I now have a cluster for mount sanity, and I have a problem which is, uh, one address says something rather another address as something else. Which one is right or both? Right, so now you want. Now you have a gold. Let's call the golden Record problem to basically decide which, which, which data elements among a variety that maybe all associated with the same entity are in fact correct. So again, MDM, that's a rule's a rule based system. So it's a rule based technology and rule systems don't scale the best example I can give you for why Rules systems don't scale. His tamer has another customer. General Electric probably heard of them, and G wanted to do spend analytics, and so they had 20,000,000 spend transactions. Frank the year before last and spend transaction is I paid $12 to take a cab from here here to the airport, and I charged it to cost center X Y Z 20,000,000 of those so G has a pre built classification system for spend, so they have parts and underneath parts or computers underneath computers and memory and so forth. So pre existing preexisting class classifications for spend they want to simply classified 20,000,000 spent transactions into this pre existing hierarchy. So the traditional technology is, well, let's write some rules. So G wrote 500 rules, which is about the most any single human I can get there, their arms around so that classified 2,000,000 of the 20,000,000 transactions. You've now got 18 to go and another 500 rules is not going to give you 2,000,000 more. It's gonna give you love diminishing returns, right? So you have to write a huge number of rules and no one can possibly understand. So the technology simply doesn't scale, right? So in the case of G, uh, they had tamer health. Um, solve this. Solved this classification problem. Tamer used their 2,000,000 rule based, uh, tag records as training data. They used an ML model, then work off the training data classifies remaining 18,000,000. So the answer is machine learning. If you don't use machine learning, you're absolutely toast. So the answer to MDM the answer to MGM doesn't scale. You've got to use them. L The answer to each yell doesn't scale. You gotta You're putting together disparate records can. The answer is ml So you've got to replace humans by machine learning. And so that's that seems, at least in this conference, that seems to be resonating, which is people are understanding that at scale tradition, traditional data integration, technology's just don't work >> well and you got you got a great shot out on yesterday from the former G S K Mark Grams, a leader Mark Ramsay. Exactly. Guys. And how they solve their problem. He basically laid it out. BTW didn't work and GM didn't work, All right. I mean, kick it, kick the can top down data modelling, didn't work, kicked the candid governance That's not going to solve the problem. And But Tamer did, along with some other tooling. Obviously, of course, >> the Well, the other thing is No. One technology. There's no silver bullet here. It's going to be a bunch of technologies working together, right? Mark Ramsay is a great example. He used his stream sets and a bunch of other a bunch of other startup technology operating together and that traditional guys >> Okay, we're good >> question. I want to show we have time. >> So with traditional vendors by and large or 10 years behind the times, And if you want cutting edge stuff, you've got to go to start ups. >> I want to jump. It's a different topic, but I know that you in the past were critic of know of the no sequel movement, and no sequel isn't going away. It seems to be a uh uh, it seems to be actually gaining steam right now. What what are the flaws in no sequel? It has your opinion changed >> all? No. So so no sequel originally meant no sequel. Don't use it then. Then the marketing message changed to not only sequel, So sequel is fine, but no sequel does others. >> Now it's all sequel, right? >> And my point of view is now. No sequel means not yet sequel because high level language, high level data languages, air good. Mongo is inventing one Cassandra's inventing one. Those unless you squint, look like sequel. And so I think the answer is no sequel. Guys are drifting towards sequel. Meanwhile, Jason is That's a great idea. If you've got your regular data sequel, guys were saying, Sure, let's have Jason is the data type, and I think the only place where this a fair amount of argument is schema later versus schema first, and I pretty much think schema later is a bad idea because schema later really means you're creating a data swamp exactly on. So if you >> have to fix it and then you get a feel of >> salary, so you're storing employees and salaries. So, Paul salaries recorded as dollars per month. Uh, Dave, salary is in euros per week with a lunch allowance minds. So if you if you don't, If you don't deal with irregularities up front on data that you care about, you're gonna create a mess. >> No scheme on right. Was convenient of larger store, a lot of data cheaply. But then what? Hard to get value out of it created. >> So So I think the I'm not opposed to scheme later. As long as you realize that you were kicking the can down the road and you're just you're just going to give your successor a big mess. >> Yeah, right. Michael, we gotta jump. But thank you so much. Sure appreciate it. All right. Keep it right there, everybody. We'll be back with our next guest right into the short break. You watching the cue from M i t cdo Ike, you right back

Published Date : Aug 1 2019

SUMMARY :

Brought to you by We kind of gather here in August that the CDO conference You're always the highlight of the so the audience could relate to the blunders about most. physics, laws of economics, laws of the land that suggest maybe you So he claims that So can I just stop you there for a second? And so you know the and my point about the operating margins is difference in price and cost. You have guys have the best cost structure. And so you can either be a taxi company got to get on the bandwagon. leaving the 10% to do data science job for which I was hired. But that's the rial data science problem. want to ask you because you've been involved in this by my count and starting up at least a dozen companies. How do you How You're You're moving on to new problems. No, I'm paid to come up with new ideas, s so going back to tamer data integration platform is a lot of companies out there claim to do and so called Master Data management systems brought to you by IBM I'll tell you that they're a tamer customer. So the answer to MDM the I mean, kick it, kick the can top down data modelling, It's going to be a bunch of technologies working together, I want to show we have time. and large or 10 years behind the times, And if you want cutting edge It's a different topic, but I know that you in the past were critic of know of the no sequel movement, No. So so no sequel originally meant no So if you So if you if Hard to get value out of it created. So So I think the I'm not opposed to scheme later. But thank you so much.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MichaelPERSON

0.99+

JamesPERSON

0.99+

Mark RamsayPERSON

0.99+

James HamiltonPERSON

0.99+

Paul GalenPERSON

0.99+

Dave DeWittPERSON

0.99+

ToyotaORGANIZATION

0.99+

David MontyPERSON

0.99+

General ElectricORGANIZATION

0.99+

2,000,000QUANTITY

0.99+

FranceLOCATION

0.99+

AmazonORGANIZATION

0.99+

20,000,000QUANTITY

0.99+

10%QUANTITY

0.99+

Michael StonebrakerPERSON

0.99+

CambridgeLOCATION

0.99+

IBMORGANIZATION

0.99+

50QUANTITY

0.99+

$12QUANTITY

0.99+

SpainLOCATION

0.99+

18,000,000QUANTITY

0.99+

25%QUANTITY

0.99+

20 serversQUANTITY

0.99+

90%QUANTITY

0.99+

Columbia River ValleyLOCATION

0.99+

99%QUANTITY

0.99+

18QUANTITY

0.99+

AaronPERSON

0.99+

DavePERSON

0.99+

AugustDATE

0.99+

Silicon Angle MediaORGANIZATION

0.99+

three serversQUANTITY

0.99+

35 $40,000,000,000QUANTITY

0.99+

50 languagesQUANTITY

0.99+

500 rulesQUANTITY

0.99+

22 thingsQUANTITY

0.99+

10 data scientistsQUANTITY

0.99+

Mike StonePERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

MGMORGANIZATION

0.99+

less than 10 data sourcesQUANTITY

0.99+

IanPERSON

0.99+

PaulPERSON

0.99+

1%QUANTITY

0.99+

bothQUANTITY

0.99+

Toyota Motor EuropeORGANIZATION

0.99+

Of TamerORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

oneQUANTITY

0.99+

singleQUANTITY

0.99+

AtticaORGANIZATION

0.99+

10 years agoDATE

0.99+

yesterdayDATE

0.99+

iRobotORGANIZATION

0.99+

Mark GramsPERSON

0.99+

TAMRPERSON

0.99+

10 yearsQUANTITY

0.99+

20QUANTITY

0.98+

1/4QUANTITY

0.98+

250 separate customer databasesQUANTITY

0.98+

CassandraPERSON

0.98+

First thingQUANTITY

0.98+

30,000,000 recordsQUANTITY

0.98+

both databasesQUANTITY

0.98+

18 months agoDATE

0.98+

firstQUANTITY

0.98+

M I t CDOEVENT

0.98+

One blunderQUANTITY

0.98+

TamerPERSON

0.98+

one placeQUANTITY

0.98+

secondQUANTITY

0.97+

two choicesQUANTITY

0.97+

tonightDATE

0.97+

each business unitQUANTITY

0.97+

Thio ThioPERSON

0.97+

two handsQUANTITY

0.96+

this weekDATE

0.96+

FrankPERSON

0.95+

DukeORGANIZATION

0.95+