Image Title

Search Results for chief data officer:

Gene Kolker, IBM & Seth Dobrin, Monsanto - IBM Chief Data Officer Strategy Summit 2016 - #IBMCDO


 

>> live from Boston, Massachusetts. It's the Cube covering IBM Chief Data Officer Strategy Summit brought to you by IBM. Now, here are your hosts. Day Volante and Stew Minimum. >> Welcome back to Boston, everybody. This is the Cube, the worldwide leader in live tech coverage. Stillman and I have pleased to have Jean Kolker on a Cuba lem. Uh, he's IBM vice president and chief data officer of the Global Technology Services division. And Seth Dobrin who's the Director of Digital Strategies. That Monsanto. You may have seen them in the news lately. Gentlemen. Welcome to the Cube, Jean. Welcome back. Good to see you guys again. Thanks. Thank you. So let's start with the customer. Seth, Let's, uh, tell us about what you're doing here, and then we'll get into your role. >> Yes. So, you know, the CDO summit has been going on for a couple of years now, and I've been lucky enoughto be participating for a couple of a year and 1/2 or so, Um, and you know, really, the nice thing about the summit is is the interaction with piers, um, and the interaction and networking with people who are facing similar challenges from a similar perspective. >> Yes, kind of a relatively new Roland topic, one that's evolved, Gene. We talked about this before, but now you've come from industry into, ah, non regulated environment. Now what's happened like >> so I think the deal is that way. We're developing some approaches, and we get in some successes in regulated environment. Right? And now I feel with And we were being client off IBM for years, right? Using their technology's approaches. Right? So and now I feel it's time for me personally to move on something different and tried to serve our power. I mean, IBM clients respected off in this striking from healthcare, but their approaches, you know, and what IBM can do for clients go across the different industries, right? And doing it. That skill that's very beneficial, I think, for >> clients. So Monsanto obviously guys do a lot of stuff in the physical world. Yeah, you're the head of digital strategy. So what does that entail? What is Monte Santo doing for digital? >> Yes, so, you know, for as head of digital strategies for Monsanto, really? My role is to number one. Help Monsanto internally reposition itself so that we behave and act like a digital companies, so leveraging data and analytics and also the cultural shifts associated with being more digital, which is that whole kind like you start out this conversation with the whole customer first approach. So what is the real impact toe? What we're doing to our customers on driving that and then based on on those things, how can we create new business opportunities for us as a company? Um, and how can we even create new adjacent markets or new revenues in adjacent areas based on technologies and things we already have existing within the company? >> It was the scope of analytics, customer engagement of digital experiences, all of the above, so that the scope is >> really looking at our portfolio across the gamut on DH, seeing how we can better serve our customers and society leveraging what we're doing today. So it's really leveraging the re use factor of the whole digital concept. Right? So we have analytics for geospatial, right? Big part of agriculture is geospatial. Are there other adjacent areas that we could apply some of that technology? Some of that learning? Can we monetize those data? We monetize the the outputs of those models based on that, Or is there just a whole new way of doing business as a company? Because we're in this digital era >> this way? Talked about a lot of the companies that have CEOs today are highly regulated. What are you learning from them? What's what's different? Kind of a new organization. You know, it might be an opportunity for you that they don't have. And, you know, do you have a CDO yet or is that something you're planning on having? >> Yes, So we don't have a CDO We do have someone acts as an essential. he's a defacto CEO, he has all of the data organizations on his team. Um, it's very recent for Monsanto, Um, and and so I think, you know, in terms of from the regular, what can we learn from, you know, there there are. It's about half financial people have non financial people, are half heavily regulated industries, and I think, you know, on the surface you would. You would think that, you know, there was not a lot of overlap, but I think the level of rigor that needs to go into governance in a financial institution that same thought process. Khun really be used as a way Teo really enable Maur R and D. Mohr you know, growth centered companies to be able to use data more broadly and so thinking of governance not as as a roadblock or inhibitor, but really thinking about governance is an enabler. How does it enable us to be more agile as it enable us to beam or innovative? Right? If if people in the company there's data that people could get access to by unknown process of known condition, right, good, bad, ugly. As long as people know they can do things more quickly because the data is there, it's available. It's curated. And if they shouldn't have access it under their current situation, what do they need to do to be able to access that data? Right. So if I would need If I'm a data scientist and I want to access data about my customers, what can I can't? What can and can't I do with that data? Number one doesn't have to be DEA Nana Mayes, right? Or if I want to access in, it's current form. What steps do I need to go through? What types of approval do I need to do to do to access that data? So it's really about removing roadblocks through governance instead of putting him in place. >> Gina, I'm curious. You know, we've been digging into you know, IBM has a very multifaceted role here. You know how much of this is platforms? How much of it is? You know, education and services. How much of it is, you know, being part of the data that your your customers you're using? >> Uh so I think actually, that different approaches to this issues. My take is basically we need Teo. I think that with even cognitive here, right and data is new natural resource worldwide, right? So data service, cognitive za za service. I think this is where you know IBM is coming from. And the BM is, you know, tradition. It was not like that, but it's under a lot of transformation as we speak. A lot of new people coming in a lot off innovation happening as we speak along. This line's off new times because cognitive with something, really you right, and it's just getting started. Data's a service is really new. It's just getting started. So there's a lot to do. And I think my role specifically global technology services is you know, ah, largest by having your union that IBM, you're 30 plus 1,000,000,000 answered You okay? And we support a lot of different industries basically going across all different types of industries how to transition from offerings to new business offerings, service, integrated services. I think that's the key for us. >> Just curious, you know? Where's Monsanto with kind of the adoption of cognitive, You know what? Where are you in that journey? >> Um, so we are actually a fairly advanced in the journey In terms of using analytics. I wouldn't say that we're using cognitive per se. Um, we do use a lot of machine learning. We have some applications that on the back end run on a I So some form of artificial or formal artificial intelligence, that machine learning. Um, we haven't really gotten into what, you know, what? IBM defined his cognitive in terms of systems that you can interact with in a natural, normal course of doing voice on DH that you spend a whole lot of time constantly teaching. But we do use like I said, artificial intelligence. >> Jean I'm interested in the organizational aspects. So we have Inderpal on before. He's the global CDO, your divisional CDO you've got a matrix into your leadership within the Global Services division as well as into the chief date officer for all of IBM. Okay, Sounds sounds reasonable. He laid out for us a really excellent sort of set of a framework, if you will. This is interval. Yeah, I understand your data strategy. Identify your data store says, make those data sources trusted. And then those air sequential activities. And in parallel, uh, you have to partner with line of business. And then you got to get into the human resource planning and development piece that has to start right away. So that's the framework. Sensible framework. A lot of thought, I'm sure, went into it and a lot of depth and meaning behind it. How does that framework translate into the division? Is it's sort of a plug and play and or is there their divisional goals that are create dissonance? Can you >> basically, you know, I'm only 100 plus days in my journey with an IBM right? But I can feel that the global technology services is transforming itself into integrated services business. Okay, so it's thiss framework you just described is very applicable to this, right? So basically what we're trying to do, we're trying to become I mean, it was the case before for many industries, for many of our clients. But we I want to transform ourselves into trusted broker. So what they need to do and this framework help is helping tremendously, because again, there's things we can do in concert, you know, one after another, right to control other and things we can do in parallel. So we trying those things to be put on the agenda for our global technology services, okay. And and this is new for them in some respects. But some respects it's kind of what they were doing before, but with new emphasis on data's A service cognitive as a service, you know, major thing for one of the major things for global technology services delivery. So cognitive delivery. That's kind of new type off business offerings which we need to work on how to make it truly, you know, once a sense, you know, automated another sense, you know, cognitive and deliver to our clients some you value and on value compared to what was done up until recently. What >> do you mean by cognitive delivery? Explained that. >> Yeah, so basically in in plain English. So what's right now happening? Usually when you have a large systems  computer IT system, which are basically supporting lot of in this is a lot of organizations corporations, right? You know, it's really done like this. So it's people run technology assistant, okay? And you know what Of decisions off course being made by people, But some of the decisions can be, you know, simple decisions. Right? Decisions, which can be automated, can standardize, normalize can be done now by technology, okay and people going to be used for more complex decisions, right? It's basically you're going toe. It turned from people around technology assisted toa technology to technology around people assisted. OK, that's very different. Very proposition, right? So, again, it's not about eliminating jobs, it's very different. It's taken off, you know, routine and automata ble part off the business right to technology and given options and, you know, basically options to choose for more complex decision making to people. That's kind of I would say approach. >> It's about scale and the scale to, of course, IBM. When when Gerstner made the decision, Tio so organized as a services company, IBM came became a global leader, if not the global leader but a services business. Hard to scale. You could scare with bodies, and the bigger it gets, the more complicated it gets, the more expensive it gets. So you saying, If I understand correctly, the IBM is using cognitive and software essentially to scale its services business where possible, assisted by humans. >> So that's exactly the deal. So and this is very different. Very proposition, toe say, compared what was happening recently or earlier? Always. You know other. You know, players. We're not building your shiny and much more powerful and cognitive, you know, empowered mouse trap. No, we're trying to become trusted broker, OK, and how to do that at scale. That's an open, interesting question, but we think that this transition from you know people around technology assisted Teo technology around people assisted. That's the way to go. >> So what does that mean to you? How does that resonate? >> Yeah, you know, I think it brings up a good point actually, you know, if you think of the whole litany of the scope of of analytics, you have everything from kind of describing what happened in the past All that to cognitive. Um, and I think you need to I understand the power of each of those and what they shouldn't should be used for. A lot of people talk. You talk. People talk a lot about predictive analytics, right? And when you hear predictive analytics, that's really where you start doing things that fully automate processes that really enable you to replace decisions that people make right, I think. But those air mohr transactional type decisions, right? More binary type decisions. As you get into things where you can apply binary or I'm sorry, you can apply cognitive. You're moving away from those mohr binary decisions. There's more transactional decisions, and you're moving mohr towards a situation where, yes, the system, the silicon brain right, is giving you some advice on the types of decisions that you should make, based on the amount of information that it could absorb that you can't even fathom absorbing. But they're still needs really some human judgment involved, right? Some some understanding of the contacts outside of what? The computer, Khun Gay. And I think that's really where something like cognitive comes in. And so you talk about, you know, in this in this move to have, you know, computer run, human assisted right. There's a whole lot of descriptive and predictive and even prescriptive analytics that are going on before you get to that cognitive decision but enables the people to make more value added decisions, right? So really enabling the people to truly add value toe. What the data and the analytics have said instead of thinking about it, is replacing people because you're never going to replace you. Never gonna replace people. You know, I think I've heard people at some of these conferences talking about, Well, no cognitive and a I is going to get rid of data scientist. I don't I don't buy that. I think it's really gonna enable data scientist to do more valuable, more incredible things >> than they could do today way. Talked about this a lot to do. I mean, machines, through the course of history, have always replaced human tasks, right, and it's all about you know, what's next for the human and I mean, you know, with physical labor, you know, driving stakes or whatever it is. You know, we've seen that. But now, for the first time ever, you're seeing cognitive, cognitive assisted, you know, functions come into play and it's it's new. It's a new innovation curve. It's not Moore's law anymore. That's driving innovation. It's how we interact with systems and cognitive systems one >> tonight. And I think, you know, I think you hit on a good point there when you said in driving innovation, you know, I've run, you know, large scale, automated process is where the goal was to reduce the number of people involved. And those were like you said, physical task that people are doing we're talking about here is replacing intellectual tasks, right or not replacing but freeing up the intellectual capacity that is going into solving intellectual tasks to enable that capacity to focus on more innovative things, right? We can teach a computer, Teo, explain ah, an area to us or give us some advice on something. I don't know that in the next 10 years, we're gonna be able to teach a computer to innovate, and we can free up the smart minds today that are focusing on How do we make a decision? Two. How do we be more innovative in leveraging this decision and applying this decision? That's a huge win, and it's not about replacing that person. It's about freeing their time up to do more valuable things. >> Yes, sure. So, for example, from my previous experience writing healthcare So physicians, right now you know, basically, it's basically impossible for human individuals, right to keep up with spaced of changes and innovations happening in health care and and by medical areas. Right? So in a few years it looks like there was some numbers that estimate that in three days you're going to, you know, have much more information for several years produced during three days. What was done by several years prior to that point. So it's basically becomes inhuman to keep up with all these innovations, right? Because of that decision is going to be not, you know, optimal decisions. So what we'd like to be doing right toe empower individuals make this decision more, you know, correctly, it was alternatives, right? That's about empowering people. It's not about just taken, which is can be done through this process is all this information and get in the routine stuff out of their plate, which is completely full. >> There was a stat. I think it was last year at IBM Insight. Exact numbers, but it's something like a physician would have to read 1,500 periodic ALS a week just to keep up with the new data innovations. I mean, that's virtually impossible. That something that you're obviously pointing, pointing Watson that, I mean, But there are mundane examples, right? So you go to the airport now, you don't need a person that the agent to give you. Ah, boarding pass. It's on your phone already. You get there. Okay, so that's that's That's a mundane example we're talking about set significantly more complicated things. And so what's The gate is the gate. Creativity is it is an education, you know, because these are step functions in value creation. >> You know, I think that's ah, what? The gate is a question I haven't really thought too much about. You know, when I approach it, you know the thinking Mohr from you know, not so much. What's the gate? But where? Where can this ad the most value um So maybe maybe I have thought about it. And the gate is value, um, and and its value both in terms of, you know, like the physician example where, you know, physicians, looking at images. And I mean, I don't even know what the error rate is when someone evaluates and memory or something. And I probably don't want Oh, right. So, getting some advice there, the value may not be monetary, but to me, it's a lot more than monetary, right. If I'm a patient on DH, there's a lot of examples like that. And other places, you know, that are in various industries. That I think that's that's the gate >> is why the value you just hit on you because you are a heat seeking value missile inside of your organisation. What? So what skill sets do you have? Where did you come from? That you have this capability? Was your experience, your education, your fortitude, >> While the answer's yes, tell all of them. Um, you know, I'm a scientist by training my backgrounds in statistical genetics. Um, and I've kind of worked through the business. I came up through the RND organization with him on Santo over the last. Almost exactly 10 years now, Andi, I've had lots of opportunities to leverage. Um, you know, Data and analytics have changed how the company operates on. I'm lucky because I'm in a company right now. That is extremely science driven, right? Monsanto is a science based company. And so being in a company like that, you don't face to your question about financial industry. I don't think you face the same barriers and Monsanto about using data and analytics in the same way you may in a financial types that you've got company >> within my experience. 50% of diagnosis being proven incorrect. Okay, so 50% 05 0/2 summation. You go to your physician twice. Once you on average, you get in wrong diagnosis. We don't know which one, by the way. Definitely need some someone. Garrett A cz Individuals as humans, we do need some help. Us cognitive, and it goes across different industries. Right, technologist? So if your server is down, you know you shouldn't worry about it because there is like system, you know, Abbas system enough, right? So think about how you can do that scale, and then, you know start imagined future, which going to be very empowering. >> So I used to get a second opinion, and now the opinion comprises thousands, millions, maybe tens of millions of opinions. Is that right? >> It's a try exactly and scale ofthe data accumulation, which you're going to help us to solve. This problem is enormous. So we need to keep up with that scale, you know, and do it properly exactly for business. Very proposition. >> Let's talk about the role of the CDO and where you see that evolving how it relates to the role of the CIA. We've had this conversation frequently, but is I'm wondering if the narratives changing right? Because it was. It's been fuzzy when we first met a couple years ago that that was still a hot topic. When I first started covering this. This this topic, it was really fuzzy. Has it come in two more clarity lately in terms of the role of the CDO versus the CIA over the CTO, its chief digital officer, we starting to see these roles? Are they more than just sort of buzzwords or grey? You know, areas. >> I think there's some clarity happening already. So, for example, there is much more acceptance for cheap date. Office of Chief Analytics Officer Teo, Chief Digital officer. Right, in addition to CEO. So basically station similar to what was with Serious 20 plus years ago and CEO Row in one sentence from my viewpoint would be How you going using leverage in it. Empower your business. Very proposition with CDO is the same was data how using data leverage and data, your date and your client's data. You, Khun, bring new value to your clients and businesses. That's kind ofthe I would say differential >> last word, you know, And you think you know I'm not a CDO. But if you think about the concept of establishing a role like that, I think I think the name is great because that what it demonstrates is support from leadership, that this is important. And I think even if you don't have the name in the organization like it, like in Monsanto, you know, we still have that executive management level support to the data and analytics, our first class citizens and their important, and we're going to run our business that way. I think that's really what's important is are you able to build the culture that enable you to leverage the maximum capability Data and analytics. That's really what matters. >> All right, We'll leave it there. Seth Gene, thank you very much for coming that you really appreciate your time. Thank you. Alright. Keep it right there, Buddy Stew and I'll be back. This is the IBM Chief Data Officer Summit. We're live from Boston right back.

Published Date : Oct 4 2016

SUMMARY :

IBM Chief Data Officer Strategy Summit brought to you by IBM. Good to see you guys again. be participating for a couple of a year and 1/2 or so, Um, and you know, Yes, kind of a relatively new Roland topic, one that's evolved, approaches, you know, and what IBM can do for clients go across the different industries, So Monsanto obviously guys do a lot of stuff in the physical world. the cultural shifts associated with being more digital, which is that whole kind like you start out this So it's really leveraging the re use factor of the whole digital concept. And, you know, do you have a CDO I think, you know, in terms of from the regular, what can we learn from, you know, there there are. How much of it is, you know, being part of the data that your your customers And the BM is, you know, tradition. Um, we haven't really gotten into what, you know, what? And in parallel, uh, you have to partner with line of business. because again, there's things we can do in concert, you know, one after another, do you mean by cognitive delivery? and given options and, you know, basically options to choose for more complex decision So you saying, If I understand correctly, the IBM is using cognitive and software That's an open, interesting question, but we think that this transition from you know people you know, in this in this move to have, you know, computer run, know, what's next for the human and I mean, you know, with physical labor, And I think, you know, I think you hit on a good point there when you said in driving innovation, decision is going to be not, you know, optimal decisions. So you go to the airport now, you don't need a person that the agent to give you. of, you know, like the physician example where, you know, physicians, is why the value you just hit on you because you are a heat seeking value missile inside of your organisation. I don't think you face the same barriers and Monsanto about using data and analytics in the same way you may So think about how you can do that scale, So I used to get a second opinion, and now the opinion comprises thousands, So we need to keep up with that scale, you know, Let's talk about the role of the CDO and where you So basically station similar to what was with Serious And I think even if you don't have the name in the organization like it, like in Monsanto, Seth Gene, thank you very much for coming that you really appreciate your time.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MonsantoORGANIZATION

0.99+

GinaPERSON

0.99+

IBMORGANIZATION

0.99+

Seth DobrinPERSON

0.99+

SethPERSON

0.99+

Jean KolkerPERSON

0.99+

CIAORGANIZATION

0.99+

Gene KolkerPERSON

0.99+

thousandsQUANTITY

0.99+

BostonLOCATION

0.99+

50%QUANTITY

0.99+

JeanPERSON

0.99+

three daysQUANTITY

0.99+

Seth GenePERSON

0.99+

StillmanPERSON

0.99+

Boston, MassachusettsLOCATION

0.99+

TeoPERSON

0.99+

AndiPERSON

0.99+

Khun GayPERSON

0.99+

last yearDATE

0.99+

D. MohrPERSON

0.99+

todayDATE

0.99+

second opinionQUANTITY

0.99+

one sentenceQUANTITY

0.99+

Nana MayesPERSON

0.99+

Buddy StewPERSON

0.99+

tonightDATE

0.99+

twiceQUANTITY

0.99+

bothQUANTITY

0.98+

100 plus daysQUANTITY

0.98+

IBM InsightORGANIZATION

0.98+

firstQUANTITY

0.98+

CubaLOCATION

0.98+

GenePERSON

0.97+

tens of millionsQUANTITY

0.97+

eachQUANTITY

0.97+

Monte SantoORGANIZATION

0.97+

EnglishOTHER

0.97+

MoorePERSON

0.96+

KhunPERSON

0.96+

first timeQUANTITY

0.96+

Global Technology ServicesORGANIZATION

0.96+

10 yearsQUANTITY

0.96+

WatsonPERSON

0.95+

RNDORGANIZATION

0.95+

GerstnerPERSON

0.95+

CDOEVENT

0.95+

millionsQUANTITY

0.95+

Maur RPERSON

0.94+

first approachQUANTITY

0.94+

IBM Chief Data Officer SummitEVENT

0.93+

twoQUANTITY

0.93+

Global ServicesORGANIZATION

0.93+

20 plus years agoDATE

0.92+

SantoORGANIZATION

0.92+

1,000,000,000QUANTITY

0.9+

50QUANTITY

0.88+

SeriousORGANIZATION

0.88+

30 plusQUANTITY

0.87+

CubeORGANIZATION

0.86+

DEAORGANIZATION

0.85+

1/2QUANTITY

0.85+

InderpalPERSON

0.84+

1,500 periodic ALS a weekQUANTITY

0.84+

Garrett APERSON

0.84+

next 10 yearsDATE

0.84+

#IBMCDOEVENT

0.84+

Bob Picciano & Inderpal Bhandari, IBM, - IBM Chief Data Officer Strategy Summit - #IBMCDO - #theCUBE


 

>> live from Boston, Massachusetts. It's the Cube covering IBM Chief Data Officer Strategy Summit brought to you by IBM. Now here are your hosts. Day villain Day >> and stew Minimum. We're back. Welcome to Boston, Everybody. This is the IBM Chief Data Officer Summit. This is the Cube, the worldwide leader in live tech coverage. Inderpal. Bhandari is here. He's the newly appointed chief data officer at IBM. He's joined, but joined by Bob Picciano who is the senior vice president of IBM Analytics Group. Bob. Great to see again Inderpal. Welcome. Thank you. Thank you. So good event, Bob, Let's start with you. Um, you guys have been on the chief data officer kicked for several years now. You ahead of the curve. What, are you trying to achieve it? That this event? Yes. So, >> Dave, thanks again for having us here. And thanks for being here is well, tto help your audience share in what we're doing here. We've always appreciated that your commitment to help in the the masses understand all the important pulses that are going on the industry. What we're doing here is we're really moderating form between chief date officers on. We started this really on the curve. As you said 2014, where the conference was pretty small, there were some people who were actually examining the role, thinking about becoming a chief did officer. We probably had a few formal cheap date officers we're talking about, you know, maybe 100 or so people who are participating in the very 1st 1 Now you can see it's not, You know, it's it's grown much larger. We have hundreds of people, and we're doing it multiple times a year in multiple cities. But what we're really doing is bringing together a moderated form, Um, and it's a privilege to be able to do this. Uh, this is not about selling anything to anybody. This is about exchanging ideas, understanding. You know what, the challenges of the role of the opportunities which changing about the role, what's changing about the market and the landscape, what new risks might be on the horizon? What new opportunities might be on the horizon on we you know, we really liketo listen very closely to what's going on so we can, you know, maybe build better approach is to help their mother. That's through the services we provide or whether that's through the cloud capabilities were offering or whether that's new products and services that need to be developed. And so it gives us a great understanding. And we're really fortunate to have our chief data officer here, Interpol, who's doing a great job in IBM and in helping us on our mission around really becoming a cognitive enterprise and making analytics and insight on data really be central to that transformation. >> So, Dr Bhandari, new, uh, new to the chief date officer role, not nude. IBM. You worked here and came back. I was first exposed to roll maybe 45 years ago with the chief Data officer event. OK, so you come in is the chief data officer in December. Where do you start? >> So, you know, I've had the fortune of being in this role for a long time. I was one of the earliest created, the role for healthcare in two thousand six. Then I have honed that roll over three different Steve Data officer appointments at health care companies. And now I'm at IBM. So I do have, you know, I do view with the job as a craft. So it's a practitioner job and there's a craft to it. And do I answer your question? There are five things that you have to do to get moving on the job, and three of those have to be non sequentially and to must be done and powerful but everything else. So the five alarm. The first thing is you've got to develop a data strategy and data strategy is around, is focused around having an understanding ofthe how the company monetize is or plans to monetize itself. You know, what is the strategic monetization part of the company? Not so much how it monetize is data. But what is it trying to do? How is it going to make money in the future? So in the case of IBM, it's all around cognition. It's around enabling customers to become cognitive businesses. So my data strategy or our data strategy, I should say, is focused on enabling cognition becoming a cauldron of enterprise. You know, we've now realized that impacto prerequisite for cognition. So that's the data strategy piece. And that's the very first thing that needs to be done because once you understand that, then you understand what data is critical for the company, so you don't boil the ocean instead, what you do is you begin to govern exactly what's necessary and make sure it's fit for purpose. And then you can also create trusted data sources around those critical data assets that are critical for the for the monetization strategy of the company's. Those three have to go in sequence because if you don't know what you can do to adequately kind of three, and they're also significant pitfalls if you don't follow that sequence because you can end up pointing the ocean and the other two activities that must be done concurrently. One is in terms ofthe establishing deep partnerships with the other areas of the company the key business units, the key functional units because that's how you end up understanding what that data strategy ought to be. You know, if you don't have that knowledge of the company by making that effort that due diligence, that it's very difficult to get the data strategy right, so you've got to establish those partnerships and then the 5th 1 is because this is a space where you do require very significant talent. You have to start developing that talent and that all the organizational capability right from day one. >> So, Bob, you said that, uh, data is the new middle manager. You can't have an effective middle manager come unless you at least have some framework that was just described. >> Yeah, absolutely. So, you know, when Interpol talks about that fourth initiative about the engagement with the business units and making sure that we're in alignment on how the company's monetizing its value to its clients, his involvement with our team goes way beyond how he thinks about what date it is that we're collecting in the products that you're offering and what we might understand about our customers or about the marketplace. His involvement goes also into how we're curating the right user experience for who we want to win power with our products and offerings. Sometimes that's the role of the chief date officer. Sometimes that's the role of a data engineer. Sometimes it's the role of a data scientist. You mentioned data becoming the new middle management middle manager. We think the citizen analyst is ushering in that from from their seat, But we also need to be able to, from a perspective, to help them eliminate the long tail and and get transparency, the information. And sometimes it's the application developer. So we, uh, we collaborate on a very frequent basis, where, when we think about offering new capabilities to those roles, well, what's the data implication of that? What's the governance implication of that? How do we make it a seamless experience? So as people start to move down the path of igniting all of the innovation across those roles, there is a continuum to the information to using To be able to do that, how it's serving the enterprise, how it leads to that transformation to be a cognitive enterprise on DH. That's a very, very close collaboration >> we're moving from. You said you talked the process era to what I just inserted to an insight era. Yeah, um, and I have a question around that I'm not sure exactly how to formulate it, but maybe you can help. In the process, era technology was unknown. The process was very well, Don't know. Well known, but technology was mysterious. But with IBM and said help today it seems as though process is unknown. The technology's pretty known look at what uber airbnb you're doing the grabbing different technologies and putting them together. But the process is his new first of all, is that a reasonable observation? And if so, what does that mean for chief data officers? >> So the process is, you know, is new in the sense that in terms ofthe making it a cognitive process, it's going to end up being new, right? So the memorization that you >> never done it before, but it's never been done before, right >> in that sense. But it's different from process automation in the past. This is much more about knowledge, being able to scale knowledge, not just, you know, across one process, but across all the process cities that make up a company. And so in there. That goes also to the comment about data being the middle manager. I mean, if you've essentially got the ability to scale and manage knowledge, not just data but knowledge in terms of the insights that the people who are working these processes are coming up in conjunction with these data and intelligent capabilities, that that that that that of the hub right, it's the intelligence system that's had the Hubble this that's enabling all that so that That's really what leads Teo leads to the so called civilization >> way had dates to another >> important aspect of this is the process is dramatically different in the sense that it's ongoing. It's it's continuous, right, the process and your intimacy with uber and the trust that you're developing. A brand doesn't start and stop with one transaction and actually, you know branches into many different things. So your expectations, a CZ that relationships have all changed. So what they need to understand about you, what they need to protect about you, how they need to protect you in their transformation, the richness of their service needs to continue to evolve. So how they perform that task on the abundance of information they have available to perform that task. But the difficulty of being able to really consume it and make use of it is is a change. The other thing is, it's a lot more conversational, right? So the process isn't a deterministic set of steps that someone at a desk can really formulate in a business rule or a static process. It's conversationally changes. It needs to be dis ambiguity, and it needs to introduce new information during the process of disintegration. And that really, really calls upon the capabilities of a cognitive system that is rich and its ability to understand and interact with natural language to potentially introduce other sources of rich information. Because you might take a picture about what you're experiencing and all those things change that that notion from process to the conversational element. >> Dr. Bhandari, you've got an interesting role. Companies like IBM I think about the Theo with the CDO. Not only do you have your internal role, but you're also you know, a model for people going out there. You come too. Events like this. You're trying to help people in the role you've been a CDO. It's, um, health care organization to tell Yu know what's different about being kind of internal role of IBM. What kind of things? IBM Obviously, you know, strong technology culture, But tell us a little bit inside. You've learned what anything surprise you. You know, in your time that you've been doing it. >> Oh, you know, over the course ofthe time that I've been doing the roll across four different organizations, >> I guess specifically at IBM. But what's different there? >> You know, I mean IBM, for one thing, is a the The environment has tremendous scale. And if you're essentially talking about taking cognition to the enterprise, that gives us a tremendous A desperate to try out all the capabilities that were basically offering to our to our customers and to home that in the context of our own enterprise, you know, to build our own cognitive enterprise. And that's the journey that way, sharing with our with our customers and so forth. So that's that's different in in in in it. That wasn't the case in the previous previous rules that I had. And I think the other aspect that's different is the complexity of the organisation. This is a large global organization that wasn't true off the previous roles as well. They were Muchmore, not America century, you know, organizations. And so there's a There's an aspect there that also then that's complexity of the role in terms ofthe having to deal with different countries, different languages, different regulations, it just becomes much more complex. >> You first became a CDO in two thousand six, You said two thousand six, which was the same year as the Federal Rules of Civil Procedure came out and the emails became smoking guns. And then it was data viewed as a liability, and now it's completely viewed as an asset. But traditionally the CDO role was financial services and health care and government and highly regulated businesses. And it's clearly now seeping into new industries. What's driving that? Is that that value? >> Well, it is. I mean, it's, I think, that understanding that. You know, there's a tremendous natural resource in in the information in the data. But there is, you know, very much you know, union Yang around that notion of being responsible. I mean, one of the things that we're very proud of is the type of trust that we established over 105 year journey with our clients in the types of interactions we have with one another, the level of intimacy that we have in their business and very foundation away, that we serve them on. So we can never, ever do anything to compromise that you know. So the focus on really providing the ability to do the necessary governance and to do the necessary data providence and lineage in cyber security while not stifling innovation and being able to push into the next horizon. Interpol mentioned the fact that IBM, in and of itself, we think of ourselves as a laboratory, a laboratory for cognitive information innovation, a laboratory for design and innovation, which is so necessary in the digital era. And I think we've done a really good job in the spaces, but we're constantly pushing the envelope. A good example of that is blockchain, a technology that you know sometimes people think about and nefarious circumstances about, You know, what it meant to the ability to launch a Silk Road or something of that nature. We looked at the innovation understanding quite a lot about it being one of the core interview innovators around it, and saw great promise in being able to transform the way people thought about, you know, clearing multiparty transactions and applied it to our own IBM credit organization To think about a very transparent hyper ledger, we could bring those multiple parties together. People could have transparency and the transactions have a great deal of access into that space, and in a very, very rapid amount of time, we're able to take our very sizable IBM credit organization and implement that hyper ledger. Also, while thinking about the data regulation, the data government's implications. I think that's a really >> That's absolutely right. I mean, I think you know, Bob mentioned the example about the IBM credit organizer Asian, but there is. There are implications far beyond that. Their applications far beyond that in the data space. You know, it affords us now the opportunity to bring together identity management. You know, the profiles that people create from data of security aspects and essentially combined all of these aspects into what will then really become a trusted source ofthe data. You know, by trusted by me, I don't mean internally, but trusted by the consumers off the data. The subject's off the data because you'll be able to do that much in a way that's absolutely appropriate, not just fit for business purpose, but also very, very respectful of the consent on DH. Those aspects the privacy aspect ofthe data. So Blockchain really is a critical technology. >> Hype alleges a great example. We're IBM edge this week. >> You're gonna be a world of Watson. >> We will be a world Watson. We had the CEO of ever ledger on and they basically brought 1,000,000 diamonds and bringing transparency for the diamond industry. It's it's fraught with, with fraud and theft and counterfeiting and >> helping preserve integrity, the industry and eliminating the blood diamonds. And they right. >> It's fascinating to see how you know this bitcoin. You know, when so many people disparaged it is a currency, but not just the currency. You know, you guys IBM saw that early on and obviously participated in the open source. Be, You know, the old saying follow the money with us is like follow the data. So if I understand correctly, your job, a CDO is to sort of super charge of the business lines with the data strategy. And then, Bob, you're job is the line of business managers the supercharge your customers, businesses with the data strategy. Is that right? Is that the right value >> chain? I think you nailed it. Yeah, that's >> one of the things people are struggling with these days is, you know, if they can get their own data in house, then they've also gotta deal with third party. That industry did everything like that. IBM's role in that data chain is really interesting. You talked this morning about kind of the Weather Channel and kind of the data play there. Yeah, you know what? What's IBM is rolling. They're going forward. >> It's one of the most exciting things. I think about how we've evolved our strategy. And, you know, we're very fortunate to have Jimmy at the helm. Who really understands, You know, that transformational landscape on DH, how partnerships really change the ability to innovate for the companies we serve on? It was very obvious in understanding our client's problems that while they had a wealth of information that we were dealing with internally, there was great promise and being able to introduce these outside signals. If you will insights from other sources of data, Sometimes I call them vectors of information that could really transform the way they were thinking about solving their customer problem. So, you know, why wouldn't you ever want to understand that customers sentiment about your brand or about the product or service? And as a consequence to that, you know, capabilities that are there on Twitter or we chat or line are essential to that, depending on where your brand is operating in your branch, probably operating in a multinational space anyway, so you have to listen to all those signals and they're all in multiple language and sentiment is very, very bespoke. It's a different language, so you have to apply sophisticated machine learning. We've invented new algorithms to understand how to glean the signal at all that white noise. You use the weather example as well. You know, we think about the economic impact of climate atmosphere, whether on business and its profound. It's 1/2 trillion dollars, you know, in each calendar year that are, you know, lost information, lost assets, lost opportunity, misplaced inventory, you know, un delivered inventory. And we think we can do a better job of helping our clients take the weather excuses out of business in a variety of different industries. And so we've focused our initiatives on that information integration, governance, understanding new analytics toe to introduce those outside signals directly in the heart and want to place it on the desk of the chief data officer of those who are innovating around information and data. >> My my joke last Columbus. If they was Dell's buying DMC, IBM is buying the weather company. What does What does that say? My question is Interpol. When when Emma happens. And Bob, when you go out and purchase companies that are data driven, what role does the chief data officer play in both em in a pre and post. >> So, you know, I think the one that there being a cop, just gonna touch on a couple of points that Bob Major and I'll address your question directly as well. Uh, in terms of the role of the chief data officer, I think you're giving me that question before how that's he walled. The one very interesting thing that's happening now with what IBM is doing is previously the chief data officer. All at least with regard to the data, Not so much the strategy, but the data itself was internal focused. You know, you kind of worried about the data you had in house or the data you're bringing in now you've gotta worry as much about the exogenous status and because, you know, that's so That's one way that that role has changed considerably and is changing and evolving, and it's creating new opportunities for us. The other is again. In the past, the chief state officer all was around creating a warehouse for analytics and separated out from the operational processes. That's changing, too, because now we've got to transform these processes themselves. So that's, you know, that's that's another expanded role to come back to. Acquisitions emanate. I mean, I view that as essentially another process that, you know, company has. And so the chief data officer role is pretty key in terms of enabling that world in terms ofthe data, but also in terms ofthe giving, you know, guidance and advice. If, for instance, the acquisition isn't that problem itself, then you know, then we would be more closely involved. But if it's beyond that in terms of being able to get the right data, do that process as well as then once you've acquired the company in being able to integrate back the critical data assets those out of the key aspect, it's an ongoing role. >> So you've got the simplest level. You've got data sources and all the things associated with that. And then you've got your algorithms and your machine learning, and we're moving beyond sort of do tow cut costs into this new era. But so hot Oh cos adjudicate. And I guess you got to do both. You've got to get new data sources and you've got to improve this continuous process. By that you talked about how do you guide your customers as to where they put their resource? No. And that's >> really Davis. You have, you know, touching out again. That's really the benefit of this sort of a forum. In this sort of a conference, it's sharing the best practices of how the top experts in the world are really wrestling with that and identifying. I think you know Interpol's framework. What do you do sequentially to build the disciplines, to build a solid corn foundation, to make the connections that are lined with the business strategy? And then what do you do concurrently along that model to continue to operate? And how do you How do you manage and make sure your stakeholders understand what's being done? What they need to continue to do to evolve the innovation and come join us here and we'll go through that in detail. But, you know, he deposited a greatjob sharing his framers of success, and I think in the other room, other CEOs are doing that now. >> Yeah, I just wanted to quickly add to Bob's comment. The framework that I described right? It has a check and balance built into it because if you are all about governance, then the Sirio role becomes very defensive in nature. It's all about making sure you within the hour, you know, within the guard rails and so forth. But you're not really moving forward in a strategic way to help the company. And and that's why you know, setting it up by driving it from the strategy don't just makes it easier to strike that plus >> clerical and more about innovation here. We talked about the D and CDO today meaning data, but really, I think about it is being a great crucible for for disruption in information because you've disruption off. I called the Chief Disruption Office under Sheriff you >> incident in Data's digitalis data. So there's that piece of Ava's Well, we have to go. I don't want to go. So that way one last question for each of you. So Interpol, uh, thinking about and you just kind of just touched on it. He's not just playing defense, you know, thinking more offense this role. Where do you want to take it. What do your you know, sort of mid term, long term goals with this role? >> It's the specific role in IBM or just in general specifically. Well, I think in the case of I B M, we have the data strategy pretty well defined. Now it's all about being able to enable a cognitive enterprise. And so in, You know, in my mind and 2 to 3 years, we'll have completely established how that ought to be done, you know, as a prescription. And we'll also have our clients essentially sharing in that in that journey so that they can go off and create cognitive enterprises themselves. So that's pretty well set. You know, I have a pretty short window to three years to make that make that happen, And I think it's it's doable. And I think it will be, you know, just just a tremendous transformation. >> Well, we're excited to be to be watching and documenting that Bob, I have to ask you a world of washing coming up. New name for new conference. We're trying to get Pepper on, trying to get Jimmy on. Say, what should we expect? Maybe could. Although it was >> coming, and I think this year we're sort of blowing the roof off on literally were getting so big that we had to move the venue. It is very much still in its core that multiple practitioner, that multiple industry event that you experienced with insight, right? So whether or not you're thinking about this and the auspices of managing your traditional environments and what you need to do to bring them into the future and how you tie these things together, that's there for you. All those great industry tracks around the product agendas and what's coming out are are there. But the level of inspiration and involvement around this cognitive innovation space is going to be front and center. We're joined by Ginny Rometty herself, who's going to be very special. Key note. We have, I think, an unprecedented lineup of industry leaders who were going to come and talk about disruption and about disruption in the cognitive era on then. And as always, the most valuable thing is the journeys that our clients are partners sharing with us about how we're leading this inflection point transformation, the industry. So I'm very much excited to see their and I hope that your audience joins us as well. >> Great. We'll Interpol. Congratulations on the new roll. Thank you. Get a couple could plug, block post out of your comments today, so I really appreciate that, Bob. Always a pleasure. Thanks so much for having us here. Really? Appreciate. >> Thanks for having us. >> Alright. Keep right, everybody, this is the Cube will be back. This is the IBM Chief Data Officer Summit. We're live from Boston. You're back. My name is Dave Volante on DH. I'm along.

Published Date : Sep 23 2016

SUMMARY :

IBM Chief Data Officer Strategy Summit brought to you by IBM. You ahead of the curve. on we you know, we really liketo listen very closely to what's going on so we can, OK, so you come in is the chief data officer in December. And that's the very first thing that needs to be done because once you understand that, So, Bob, you said that, uh, data is the new middle manager. of igniting all of the innovation across those roles, there is a continuum to the information to using You said you talked the process era to what I just inserted to an insight that that that that that of the hub right, it's the intelligence system that's had the Hubble this that's on the abundance of information they have available to perform that task. IBM Obviously, you know, strong technology culture, I guess specifically at IBM. home that in the context of our own enterprise, you know, to build our own cognitive enterprise. Rules of Civil Procedure came out and the emails became smoking guns. So the focus on really providing the ability to do the necessary governance I mean, I think you know, Bob mentioned the example We're IBM edge this week. We had the CEO of ever ledger on and they basically helping preserve integrity, the industry and eliminating the blood diamonds. Be, You know, the old saying follow the money with us is like follow the data. I think you nailed it. one of the things people are struggling with these days is, you know, if they can get their own data in house, And as a consequence to that, you know, capabilities that are there And Bob, when you go out and purchase companies that are data driven, much about the exogenous status and because, you know, that's so That's one way that that role has changed By that you talked about how do you guide your customers as to where they put their resource? And how do you How do you manage and make sure your stakeholders understand And and that's why you know, setting it up by driving it from the strategy I called the Chief Disruption Office under Sheriff you you know, thinking more offense this role. And I think it will be, you know, just just a tremendous transformation. Well, we're excited to be to be watching and documenting that Bob, I have to ask you a world that multiple industry event that you experienced with insight, right? Congratulations on the new roll. This is the IBM Chief Data Officer Summit.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
IBMORGANIZATION

0.99+

BhandariPERSON

0.99+

Dave VolantePERSON

0.99+

BobPERSON

0.99+

Bob PiccianoPERSON

0.99+

Ginny RomettyPERSON

0.99+

BostonLOCATION

0.99+

DecemberDATE

0.99+

Inderpal BhandariPERSON

0.99+

InderpalPERSON

0.99+

2014DATE

0.99+

2QUANTITY

0.99+

DellORGANIZATION

0.99+

uberORGANIZATION

0.99+

Bob MajorPERSON

0.99+

JimmyPERSON

0.99+

DavePERSON

0.99+

IBM Analytics GroupORGANIZATION

0.99+

threeQUANTITY

0.99+

Boston, MassachusettsLOCATION

0.99+

DMCORGANIZATION

0.99+

100QUANTITY

0.99+

three yearsQUANTITY

0.99+

EmmaPERSON

0.99+

oneQUANTITY

0.99+

3 yearsQUANTITY

0.99+

two activitiesQUANTITY

0.99+

bothQUANTITY

0.98+

five thingsQUANTITY

0.98+

two thousandQUANTITY

0.98+

todayDATE

0.98+

first thingQUANTITY

0.98+

over 105 yearQUANTITY

0.98+

InterpolORGANIZATION

0.98+

eachQUANTITY

0.98+

firstQUANTITY

0.98+

this yearDATE

0.97+

fourth initiativeQUANTITY

0.97+

five alarmQUANTITY

0.97+

ColumbusLOCATION

0.97+

1,000,000 diamondsQUANTITY

0.97+

this weekDATE

0.96+

hundreds of peopleQUANTITY

0.96+

OneQUANTITY

0.96+

TwitterORGANIZATION

0.96+

1/2 trillion dollarsQUANTITY

0.96+

45 years agoDATE

0.96+

WatsonORGANIZATION

0.96+

one processQUANTITY

0.96+

one transactionQUANTITY

0.95+

PepperPERSON

0.95+

DavisPERSON

0.95+

Steve DataPERSON

0.95+

each calendar yearQUANTITY

0.94+

Caitlin Lepech & Dave Schubmehl - IBM Chief Data Officer Strategy Summit - #IBMCDO - #theCUBE


 

>> live from Boston, Massachusetts. >> It's the Cube >> covering IBM Chief Data Officer Strategy Summit brought to you by IBM. Now, here are your hosts. Day villain Day and >> stew minimum. Welcome back to Boston, everybody. This is the IBM Chief Data Officer Summit. And this is the Cube, the worldwide leader in live tech coverage. Caitlin Lepic is here. She's an executive within the chief data officer office at IBM. And she's joined by Dave Shoot Mel, who's a research director at, uh D. C. And he covers cognitive systems and content analytics. Folks, welcome to the Cube. Good to see you. Thank you. Can't. Then we'll start with you. You were You kicked off the morning and I referenced the Forbes article or CDOs. Miracle workers. That's great. I hadn't read that article. You put up their scanned it very quickly, but you set up the event. It started yesterday afternoon at noon. You're going through, uh, this afternoon? What's it all about? This is evolved. Since, what, 2014 >> it has, um, we started our first CDO summit back in 2014. And at that time, we estimated there were maybe 200 or so CDOs worldwide, give or take and we had 30, 30 people at our first event. and we joked that we had one small corner of the conference room and we were really quite excited to start the event in 30 2014. And we've really grown. So this year we have about 170 folks joining us, 70 of which are CEOs, more acting, the studios in the organization. And so we've really been able to grow the community over the last two years and are really excited to see to see how we can continue to do that moving forward. >> And IBM has always had a big presence at the conference that we've covered the CDO event. So that's nice that you can leverage that community and continue to cultivate it. Didn't want to ask you, so it used that we were talking when we first met this morning. It used to be dated was such a wonky topic, you know, data was data value. People would try to put a value on data, and but it was just a really kind of boring but important topic. Now it's front and center with cognitive with analytics. What are you seeing in the marketplace. >> Yeah, I think. Well, what we're seeing in the market is this emphasis on predictive applications, predictive analytics, cognitive applications, artificial intelligence of deep learning. All of those those types of applications are derived and really run by data. So unless you have really good authoritative data to actually make these models work, you know, the systems aren't going to be effective. So we're seeing an emerging marketplace in both people looking at how they can leverage their first party data, which, you know, IBM is really talking about what you know, Bob Picciotto talked about this morning. But also, we're seeing thie emergency of a second party and third party data market to help build these models out even further so that I think that's what we're really seeing is the combination of the third party data along with the first party data really being the instrument for building these kind of predictive models, you know, they're going to take us hopefully, you know, far into the future. >> Okay, so, Caitlin square the circle for us. So the CDO roll generally is not perceived. Is it technology role? Correct. Yet as Davis to saying, we're talking about machine learning cognitive. Aye, aye. These air like heavy technical topics. So how does the miracle worker deal with all this stuff generally? And how does IBM deal with it inside the CDO office? Specifically? >> Sure. So it is. It's a very good point, you know, Traditionally, Seo's really have a business background, and we find that the most successful CDO sit in the business organization. So they report somewhere in a line of business. Um, and there are certainly some that have a technical background, but far more come from business background and sit in the business. I can't tell you how we are setting up our studio office at IBM. Um, so are new. And our first global chief date officer joined in December of last year. Interpol Bhandari, um and I started working for him shortly thereafter, and the way he's setting up his office is really three pillars. So first and foremost, we focused on the data engineering data sign. So getting that team in place next, it's information, governance and policy. How are we going to govern access, manage, work with data, both data that we own within our organization as well as the long list of of external data sources that that we bring in and then third is the business integration filler. So the idea is CDOs are going to be most successful when they deliver those data Science data engineering. Um, they manage and govern the data, but they pull it through the business, so ensuring that were really, you know, grounded in business unit and doing this. And so those there are three primary pillars at this point. So prior >> to formalizing the CDO role at I b m e mean remnants of these roles existed. There was a date, equality, you know, function. There was certainly governance in policy, and somebody was responsible to integrate between, you know, from the i t. To the applications, tow the business. Were those part of I t where they sort of, you know, by committee and and how did you bring all those pieces together? That couldn't have been trivial, >> and I would say it's filling. It's still going filling ongoing process. But absolutely, I would say they typically resided within particular business units, um, and so certainly have mature functions within the unit. But when we're looking for enterprise wide answers to questions about certain customers, certain business opportunities. That's where I think the role the studio really comes in and what we're What we're doing now is we are partnering very closely with business units. One example is IBM analytic. Seen it. So we're here with Bob Luciano and other business units to ensure that, as they provide us, you know, their data were able to create the single trusted source of data across the organization across the enterprise. And so I agree with you, I think, ah, lot of those capabilities and functions quite mature, they, you know, existed within units. And now it's about pulling that up to the enterprise level and then our next step. The next vision is starting to make that cognitive and starting to add some of those capabilities in particular data science, engineering, the deep learning on starting to move toward cognitive. >> Dave, I think Caitlin brought up something really interesting. We've been digging into the last couple of years is you know, there's that governance peace, but a lot of CEOs are put into that role with a mandate for innovation on. That's something that you know a lot of times it has been accused of not being all that innovative. Is that what you're seeing? You know what? Because some of the kind of is it project based or, you know, best initiatives that air driving forward with CEOs. I think what we're seeing is that enterprises they're beginning to recognize that it's not just enough to be a manufacturer. It's not just enough to be a retail organization. You need to be the one of the best one of the top two or the top three. And the only way to get to that top two or top three is to have that innovation that you're talking about and that innovation relies on having accurate data for decision making. It also relies on having accurate data for operations. So we're seeing a lot of organizations that are really, you know, looking at how data and predictive models and innovation all become part of the operational fabric of a company. Uh, you know, and if you think about the companies that are there, you know, just beating it together. You know Amazon, for example. I mean, Amazon is a completely data driven company. When you get your recommendations for, you know what to buy, or that's all coming from the data when they set up these logistics centers where they're, you know, shipping the latest supplies. They're doing that because they know where their customers are. You know, they have all this data, so they're they're integrating data into their day to day decision making. And I think that's what we're seeing, You know, throughout industry is this this idea of integrating decision data into the decision making process and elevating it? And I think that's why the CDO rule has become so much more important over the last 2 to 3 years. >> We heard this morning at 88% percent of data is dark data. Papa Geno talked about that. So thinking about the CEOs scope roll agenda, you've got data sources. You've gotto identify those. You gotta deal with data quality and then Dave, with some of the things you've been talking about, you've got predictive models that out of the box they may not be the best predictive models in the world. You've got iterated them. So how does an organization, because not every organizations like Amazon with virtually unlimited resource is capital? How does an organization balance What are you seeing in terms of getting new data sources? Refining those data source is putting my emphasis on the data vs refining and calibrating the predictive models. How organizations balancing that Maybe we start with how IBM is doing. It's what you're seeing in the field. >> So So I would say, from what we're doing from a setting up the chief data office role, we've taken a step back to say, What's the company's monitor monetization strategy? Not how your mind monetizing data. How are how are you? What's your strategy? Moving forward, Um, for Mance station. And so with IBM we've talked about it is moved to enabling cognition throughout the enterprise. And so we've really talked about taking all of your standard business processes, whether they be procurement HR finance and infusing those with cognitive and figuring out how to make those smarter. We talking examples with contracts, for example. Every organization has a lot of contracts, and right now it's, you know, quite a manual process to go through and try and discern the sorts of information you need to make better decisions and optimize the contract process. And so the idea is, you start with that strategy for us. IBM, it's cognitive. And that then dictates what sort of data sources you need. Because that's the problem you're trying to solve in the opportunity you're chasing down. And so then we talk about Okay, we've got some of that data currently residing today internally, typically in silos, typically in business units, you know, some different databases. And then what? What are longer term vision is, is we want to build the intelligence that pulls in that internal data and then really does pull in the external data that we've that we've all talked about. You know, the social data, the sentiment analysis, analysis, the weather. You know, all of that sort of external data to help us. Ultimately, in our value proposition, our mission is, you know, data driven enablement cognition. So helps us achieve our our strategy there. >> Thank you, Dad, to that. Yeah, >> I mean, I think I mean, you could take a number of examples. I mean, there's there's ah, uh, small insurance company in Florida, for example. Uh, and what they've done is they have organized their emergency situation, their emergency processing to be able to deal with tweets and to be able to deal with, you know, SMS messages and things like that. They're using sentiment analysis. They're using Tex analytics to identify where problems are occurring when hurricane happens. So they're what they're doing is they're they're organizing that kind of data and >> there and there were >> relatively small insurance company. And a lot of this is being done to the cloud, but they're basically getting that kind of sentiment analysis being ableto interpret that and add that to their decision making process. About where should I land a person? Where should I land? You know, an insurance adjuster and agent, you know, based on the tweets, that air coming in rather than than just the phone calls that air coming into the into the organization, you know? So that's a That's a simple example. And you were talking about Not everybody has the resources of an Amazon, but, you know, certainly small insurance companies, small manufacturers, small retail organizations, you, Khun get started by, you know, analyzing your You know what people are saying about you. You know, what are people saying about me on Twitter? What are people saying about me on Facebook? You know how can I use that to improve my customer service? Uh, you know, we're seeing ah whole range of solutions coming out, and and IBM actually has a broad range of solutions for things like that. But, you know, they're not the only points out there. There's there's a lot of folks do it that kind of thing, you know, in terms of the dark data analysis and barely providing that, you know, as part of the solution to help people make better decisions. >> So the answers to the questions both You're doing both new sources of data and trying to improve the the the analytics and the models. But it's a balancing act, and you could come back to the E. R. A. Y question. It sounds like IBM strategies to supercharge your existing businesses by infusing them with new data and new insights. Is >> that correctly? I would say that is correct. >> Okay, where is in many cases, the R A. Y of analytics projects that date have been a reduction on investment? You know, I'm going to move stuff from my traditional W two. A dupe is cheaper, and we feels like Dave, we're entering a new wave now maybe could talk about that a little bit. >> Yeah. I mean, I think I think there's a desk in the traditional way of measuring ROI. And I think what people are trying to do now is look at how you mentioned disruption, for example. You know what I think? Disruption is a huge opportunity. How can I increase my sales? How can I increase my revenue? How can I find new customers, you know, through these mechanisms? And I think that's what we're starting to see in the organization. And we're starting to see start ups that are dedicated to providing this level of disruption and helping address new markets. You know, by using these kinds of technologies, uh, in in new and interesting ways. I mean, everybody uses the airbnb example. Everybody uses uber example. You know that these are people who don't own cars. They don't know what hotel rooms. But, you know, they provide analytics to disrupt the hotel industry and disrupt the taxi industry. It's not just limited to those two industries. It's, you know, virtually everything you know. And I think that's what we're starting to see is this height of, uh, virtual disruption based on the dark data, uh, that people can actually begin to analyze >> within IBM. Uh, the chief data officer reports to whom. >> So the way we've set up in our organization is our CBO reports to our senior vice president of transformation and operations, who then reports to our CEO our recommendation as we talked with clients. I mean, we see this as a CEO level reporting relationship, and and oftentimes we advocate, you know, for that is where we're talking with customers and clients. It fits nicely in our organization within transformation operations, because this line is really responsible for transforming IBM. And so they're really charged with a number of initiatives throughout the organization to have better skills alignment with some of the new opportunities. To really improve process is to bring new folks on board s. So it made sense to fit within, uh, organization that the mandate is really transformation of the company of the >> and the CDO was a peer of the CIA. Is that right? Yes. >> Yes, that's right. That's right. Um, and then in our organization, the role of split and that we have a chief data officer as well as a chief analytics officer. Um, but, you know, we often see one person serving both of those roles as well. So that's kind of, you know, depend on the organizational structure of the company. >> So you can't run the business. So to grow the business, which I guess is the P and L manager's role and transformed the business, which is where the CDO comes. >> Right? Right, right. Exactly. >> I can't give you the last word. Sort of Put a bumper sticker on this event. Where do you want to see it go? In the future? >> Yes. Eso last word. You know, we try Tio, we tried a couple new things. Uh, this this year we had our deep dive breakout sessions yesterday. And the feedback I've been hearing from folks is the opportunity to talk about certain topics they really care about. Is their governance or is innovation being able to talk? How do you get started in the 1st 90 days? What? What do you do first? You know, we we have sort of a five steps that we talk through around, you know, getting your data strategy and your plan together and how you execute against that. Um And I have to tell you, those topics continue to be of interest to our to our participants every year. So we're going to continue to have those, um, and I just I love to see the community grow. I saw the first Chief data officer University, you know, announced earlier this year. I did notice a lot of PR and media around. Role of studio is miracle workers, As you mentioned, doing a lot of great work. So, you know, we're really supportive. Were big supporters of the role we'll continue to host in person events. Uh, do virtual events continue to support studios? To be successful on our big plug is will be world of Watson. Eyes are big IBM Analytics event in October, last week of October in Vegas. So we certainly invite folks to join us. There >> will be, >> and he'll be there. Right? >> Get still, try to get Jimmy on. So, Jenny, if you're watching, talking to come on the Q. >> So we do a second interview >> and we'll see. We get Teo, And I saw Hillary Mason is going to be the oh so fantastic to see her so well. Excellent. Congratulations. on being ahead of the curve with the chief date officer can theme. And I really appreciate you coming to Cube, Dave. Thank you. Thank you. All right, Keep right there. Everybody stew and I were back with our next guest. We're live from the Chief Data Officers Summit. IBM sze event in Boston Right back. My name is Dave Volante on DH. I'm a longtime industry analysts.

Published Date : Sep 23 2016

SUMMARY :

covering IBM Chief Data Officer Strategy Summit brought to you by You put up their scanned it very quickly, but you set up the event. And at that time, we estimated there were maybe 200 or so CDOs worldwide, give or take and we had 30, 30 people at our first event. the studios in the organization. a wonky topic, you know, data was data value. data to actually make these models work, you know, the systems aren't going to be effective. So how does the miracle worker deal with all this stuff generally? so ensuring that were really, you know, grounded in business unit and doing this. and somebody was responsible to integrate between, you know, from the i t. units to ensure that, as they provide us, you know, their data were able to create the single that are really, you know, looking at how data and are you seeing in terms of getting new data sources? And so the idea is, you start with that Thank you, Dad, to that. to be able to deal with, you know, SMS messages and things like that. You know, an insurance adjuster and agent, you know, based on the tweets, that air coming in rather than than just So the answers to the questions both You're doing both new sources of data and trying to improve I would say that is correct. You know, I'm going to move stuff from my traditional W two. And I think what people are trying to do now is look at how you mentioned disruption, Uh, the chief data officer reports to whom. you know, for that is where we're talking with customers and clients. and the CDO was a peer of the CIA. So that's kind of, you know, depend on the organizational structure of So you can't run the business. Right? I can't give you the last word. I saw the first Chief data officer University, you know, announced earlier this and he'll be there. So, Jenny, if you're watching, talking to come on the Q. And I really appreciate you coming to Cube, Dave.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Caitlin LepicPERSON

0.99+

AmazonORGANIZATION

0.99+

CIAORGANIZATION

0.99+

Dave SchubmehlPERSON

0.99+

Dave VolantePERSON

0.99+

FloridaLOCATION

0.99+

IBMORGANIZATION

0.99+

Bob LucianoPERSON

0.99+

Bob PicciottoPERSON

0.99+

JennyPERSON

0.99+

Dave Shoot MelPERSON

0.99+

Caitlin LepechPERSON

0.99+

DavePERSON

0.99+

DavisPERSON

0.99+

BostonLOCATION

0.99+

uberORGANIZATION

0.99+

TeoPERSON

0.99+

70QUANTITY

0.99+

VegasLOCATION

0.99+

2014DATE

0.99+

JimmyPERSON

0.99+

Hillary MasonPERSON

0.99+

OctoberDATE

0.99+

Boston, MassachusettsLOCATION

0.99+

CaitlinPERSON

0.99+

firstQUANTITY

0.99+

bothQUANTITY

0.99+

200QUANTITY

0.99+

both peopleQUANTITY

0.99+

thirdQUANTITY

0.99+

first eventQUANTITY

0.99+

One exampleQUANTITY

0.99+

two industriesQUANTITY

0.99+

yesterdayDATE

0.99+

five stepsQUANTITY

0.99+

second interviewQUANTITY

0.98+

singleQUANTITY

0.98+

top threeQUANTITY

0.98+

December of last yearDATE

0.98+

this afternoonDATE

0.97+

1st 90 daysQUANTITY

0.97+

oneQUANTITY

0.97+

Chief Data Officers SummitEVENT

0.97+

IBM Chief Data OfficerEVENT

0.96+

one personQUANTITY

0.96+

this yearDATE

0.96+

last week of OctoberDATE

0.95+

about 170 folksQUANTITY

0.95+

both dataQUANTITY

0.95+

earlier this yearDATE

0.95+

top twoQUANTITY

0.95+

yesterday afternoon at noonDATE

0.94+

30 2014DATE

0.94+

CDOEVENT

0.94+

88% percentQUANTITY

0.93+

three primary pillarsQUANTITY

0.93+

first partyQUANTITY

0.9+

this morningDATE

0.89+

CDOORGANIZATION

0.89+

30,QUANTITY

0.88+

KhunORGANIZATION

0.88+

D. C.PERSON

0.88+

three pillarsQUANTITY

0.87+

one small cornerQUANTITY

0.87+

Rinesh Patel, Snowflake & Jack Berkowitz, ADP | Snowflake Summit 2022


 

(upbeat music) >> Welcome back to theCUBE's continuing coverage of Snowflake Summit 22 live from Caesars Forum in Las Vegas. I'm Lisa Martin with Dave Vellante. We've got a couple of guests joining us now. We're going to be talking about financial services. Rinesh Patel joins us, the Global Head of Financial Services for Snowflake, and Jack Berkowitz, Chief Data Officer at ADP. Guys, welcome to the program. >> Thanks, thanks for having us. >> Thanks for having us. >> Talk to us about what's going on in the financial services industry as a whole. Obviously, we've seen so much change in the last couple of years. What does the data experience look like for internal folks and of course, for those end user consumers and clients? >> So, one of the big things happening inside of the financial services industry is overcoming the COVID wait, right? A lot of banks, a lot of institutions like ours had a lot of stuff on-prem. And then the move to the Cloud allows us to have that flexibility to deal with it. And out of that is also all these new capabilities. So the machine learning revolution has really hit the services industry, right? And so it's affecting how our IT teams or our data teams are building applications. Also really affecting what the end consumers get out of them. And so there's all sorts of consumerization of the experience over the past couple of years much faster than we ever expected it to happen. >> Right, we have these expectations as consumers that bleed into our business lives that I can do transactions. It's going to be on the swipe in terms of checking authenticity, fraud detection, et cetera. And of course we don't want things to go back in terms of how brands are serving us. Talk about some of the things that you guys have put in place with Snowflake in the last couple of years, particularly at ADP. >> Yeah, so one of the big things that we've done, is, one of the things that we provide is compensation data. So we issue a thing called the National Employment Report that informs the world as to what's happening in the U.S. economy in terms of workers. And then we have compensation data on top of that. So the thing that we've been able to do with Snowflake is to lower the time that it takes us to process that and get that information out into the fingertips of people. And so people can use it to see what's changed in terms of with the worker changes, how much people are making. And they can get it very, very quickly. And we're able to do that with Snowflake now. Used to take us weeks, now it's in a matter of moments we can get that updated information out to people. >> Interesting. It helps with the talent war and- >> Helps in the talent war, helps people adjust, even where they're going to put supply chain in reaction to where people are migrating. We can have all of that inside of the Snowflake system and available almost instantaneously. >> You guys announced the Financial Data Cloud last year. What was that like? 'Cause I know we had Frank on early, he clearly was driving the verticalization of Snowflake if you will, which is kind of rare for a relatively new software company but what's that been like? Give us the update on where you're at and biggest vertical, right? >> Absolutely, it's been an exciting 12 months. We're a platform, but the journey and the vision is more. We're trying to bring together a fragmented ecosystem across financial services. The aim is really to bring together key customers, key data providers, key solution providers all across the different Clouds that exist to allow them to collaborate with data in a seamless way. To solve industry problems. To solve industry problems like ESG, to solve industry problems like quantitative research. And we're seeing a massive groundswell of customers coming to Snowflake, looking at the Financial Services Data Cloud now to actually solve business problems, business critical problems. That's really driving a lot of change in terms of how they operate, in terms of how they win customers, mitigate risk and so forth. >> Jack, I think, I feel like the only industry that's sometimes more complicated than security, is data. Maybe not, security's still maybe more fragmented- >> Well really the intersection of the two is a nightmare. >> And so as you look out on this ecosystem, how do you as the chief data officer, how do you and your organization, what process do you use to decide, okay, which of the, like a chef, which of these ingredients am I going to put together for my business. >> It's a great question, right? There's been explosion of companies. We kind of look at it in two ways. One is we want to make sure that the software and the data can interoperate because we don't want to be in the business of writing bridge code. So first thing is, is having the ecosystem so that the things are tested and can work together. The other area is, and it's important to us is understanding the risk profile of that company. We process about 20% of the U.S. payroll, another 25% of the taxes. And so there's a risk to us that we have an imperative to protect. So we're looking at those companies are they financed, what's their management team. What's the sales experience like, that's important to us. And so technology and the experience of the company coming together are super important to us. >> What's your purview as a chief data officer, I mean, a lot of CDOs that I know came out of the back office and it was a compliance or data quality. You come out of industry from a technology company. So you're sort of the modern... You're like the modern CDO. >> Thanks. Thanks. >> Dave: What's your role? >> I appreciate that. >> You know what I'm saying though? >> And for a while it was like, oh yeah, compliance. >> So I actually- >> And then all of a sudden, boom, big deal. >> Yeah, I really have two jobs. So I have that job with data governance but a lot of data security. But I also have a product development unit, a massive business in monetization of data or people analytics or these compensation benchmarks or helping people get mortgages. So providing that information, so that people can get their mortgage, or their bank loans, or all this other type of transactional data. *So it's both sides of that equation is my reading inside. >> You're responsible for building data products? >> That's right. >> Directly. >> That's right. I've got a massive team that builds data products. >> Okay. That's somewhat unique in your... >> I think it's where CDOs need to be. So we build data products. We build, and we assist as a hub to allow other business units to build analytics that help them either optimize their cost or increase their sales. And then we help with all that governance and communication, we don't want to divide it up. There's a continuum to it. >> And you're a peer of the CIO and the CISO? >> Yeah, exactly. They're my peers. I actually talk to them almost every day. So I've got the CIO as a peer. >> It's a team. >> I've got the security as a peer and we get things done together. >> Talk about the alignment with business. We've been talking a lot about alignment with the data folks, the business folks, the technical folks to identify the right solutions, to be able to govern data, to monetize it, to create data products. What does that... You mentioned a couple of your cohorts, but on the business side, who are some of those key folks? >> So we're like any other big, big organization. We have lots of different business units. So we work directly with either the operational team or the heads of those business units to divine analytic missions that they'll actually execute. And at the same time, we actually have a business unit that's all around data monetization. And so I work with them every single day. And so these business units will come together. I think the big thing for us is to define value and measure that value as we go. As long as we're measuring that value as we go, then we can continue to see improvements. And so, like I said, sometimes it's bottom line, sometimes it's top line, but we're involved. Data is actually a substrate of the company. It's not a side thing to the company. >> Yeah, you are. >> ADP. >> Yeah but if they say data first but you really are data first. >> Yeah. I mean, our CEO says- >> Data's your product. >> Data's our middle name. And it literally is. >> Well, so what do you do in the Snowflake financial services data Cloud? Are you monetizing? >> Yeah. >> What's the plan? >> Yeah, so we have clients. So part of our data monetization is actually providing aggregate and anonymized information that helps other clients make business decisions. So they'll take it into their analytics. So, supply chain optimization, where should we actually put the warehouses based on the population shifts? And so we're actually using the file distribution capabilities or the information distribution, no longer files, where we use Snowflake to actually be that data cloud for those clients. So the data just pops up for our other clients. >> I think the industry's existed a lot with the physical movement of data. When you physically move data, you also physically move the data management challenges. Where do you store it? How do you map it? How do you concord it? And ultimately data sharing is taking away that friction that exists. So it's easier to be able to make informed decisions with the data at hand across two counterparties. >> Yeah, and there's a benefit to us 'cause it lowers our friction. We can have a conversation and somebody can be... Obviously the contracts have to be signed, but once they get done, somebody's up and running on it within minutes. And where it used to be, as you were saying, the movement of data and loss of control, we never actually lose control of it. We know where it is. >> Or yeah, contracts signed, now you got to go through this long process of making sure everything's cool, or a lot of times it could slow down the sale. >> That's right. >> Let's see how that's going to... Let's do a little advanced work. Now you're working without a contract. Here, you can say, "Hey, we're in the Snowflake data cloud. It's governed, you're a part of the ecosystem." >> Yeah, and the ecosystem we announced, oh gee, I think it's probably almost a year and a half ago, a relationship with ICE, Intercontinental Exchange, where they're actually taking our information and their information and creating a new data product that they in turn sell. So you get this sort of combination. >> Absolutely. The ability to form partnerships and monetize data with your partners vastly increases as a consequence. >> Talk to us about the adoption of the financial services data cloud in the last what, maybe nine months or so, since it was announced? And also in terms of the its value proposition, how does the ADP use case articulate that? >> So, very much so. So in terms of momentum, we're a global organization, as you mentioned, we are verticalized. So we have increasingly more expertise and expertise experience now within financial services that allows us to really engage and accelerate our momentum with the top banks, with the biggest asset managers by AUM, insurance companies, sovereign wealth funds on Snowflake. And obviously those data providers and solution providers that we engage with. So the momentum's really there. We're really moving very, very fast in a great market because we've got great opportunity with the capabilities that we have. I mean, ADP is just one of many use cases that we're working with and collaborations that we're taking to market. So yeah, the opportunity to monetize data and help our partners monetize the data has vastly increased within this space. >> When you think about... Oh go ahead, please. >> Yeah I was just going to say, and from our perspective, as we were getting into this, Snowflake was with us on the journey. And that's been a big deal. >> So when you think about data privacy, governance, et cetera, and public policy, it seems like you have, obviously you got things going on in Europe, and you got California, you have other states, there's increasing in complexity. You guys probably love that. (Dave laughs) More data warehouses, but where are we at with that whole? >> It's a great question. Privacy is... We hold some of the most critical information about people because that's our job to help people get paid. And we respect that as sort of our prime agenda. Part of it deals with the technology. How do you monitor, how do you see, make sure that you comply with all these regulations, but a lot of it has to do with the basic ethics of why you're doing and what you're doing. So we have a data and AI ethics board that meets and reviews our use cases. Make sure not only are we doing things properly to the regulation, but are these the types of products, are these the types of opportunities that we as a company want to stand behind on behalf of the consumers? Our company's been around 75 years. We talk about ourselves as a national asset. We have a trust relationship. We want to ensure that that trust relationship is never violated. >> Are you in a position where you can influence public policy and create more standards or framework. >> We actually are, right. We issue something every month called the National Employment Report. It actually tells you what's happening in the U.S. economy. We also issue it in some overseas countries like France. Because of that, we work a lot with various groups. And we can help shape, either data policy, we're involved in understanding although we don't necessarily want to be out in the front, but we want to learn about what's happening with federal trade commission, EOC, because at the end of the day we serve people, I always joke ADP, it's my grandfather's ADP. Well, it was actually my grandfather's ADP. (Dave laughs) He was a small businessman, and he used a ADP all those years ago. So we want to be part of that conversation because we want to continue to earn that trust every day. >> Well, plus your observation space is pretty wide. >> And you've got context and perspective on that that you can bring. >> We move somewhere between two, two and a half trillion dollars a year through our systems. And so we understand what's happening in the economy. >> What are some of the, oh sorry. >> Can your National Employment Report combined with a little Snowflake magic tell us what the hell's going to happen with this economy? >> It's really interesting you say that. Yeah, we actually can. >> Okay. (panelists laugh) >> I think when you think about the amount of data that we are working with, the types of partners that we're working with, the opportunities are infinite. They really, really are. >> So it's either a magic eight ball or it's a crystal ball, but you have it. >> We think- >> We've just uncovered that here on theCUBE. >> We think we have great partners. We have great data. We have a set of industry problems out there that we're working, collaboration with the community to be able to solve. >> What are some of the upcoming use cases Rinesh, that excite you, that are coming up in financial services- >> Great question. >> That snowflake is just going to knock out of the park. >> So look, I think there's a set of here and now problems that the industry faces, ESG's a good one. If you think about ESG, it means many different things from business ethics, to diversity, to your carbon footprint and every asset manager has to make sure they have now some form of green strategy that reflects the values of their investors. And every bank is looking to put in place sustainable lending to help their corporate customers transition. That's a big data problem. And so we're very much at the center of helping those organizations support those informed investors and help those corporates transition to a more sustainable landscape. >> Let me give you an example on Snowflake, we launched capabilities about diversity benchmarks. The first time in the industry companies can understand for their industry, their size, their location what their diversity profile looks like and their org chart profile looks like to differentiate or at least to understand are they doing the right things inside the business. The ability for banks to understand that and everything else, it's a big deal. And that was built on Snowflake. >> I think it's massive, especially in the context of the question around regulation 'cause we're seeing more and more disclosure agreements come out where regulators are making sure that there's no greenwashing taking place. So when you have really strong sources of data that are standardized, that allow that investment process to ingest that data, it does allow for a better outcome for investors. >> Real data, I mean, that diversity example they don't have to rely on a survey. >> It's not a survey. >> Anecdotes. >> It's coming right out of the transactional systems and it's updated, whenever those paychecks are run, whether it's weekly, whether it's biweekly or monthly, all that information gets updated and it's available. >> So it sounds like ADP is a facilitator of a lot of companies ESG initiatives, at least in part? >> Well, we partner with companies all the time. We have over 900,000 clients and all of them are... We've never spoken to a client who's not concerned about their people. And that's just good business. And so, yeah we're involved in that and we'll see where it goes over time now. >> I think there's tremendous opportunity if you think about the data that the ADP have in terms of diversity, in terms of gender pay gap. Huge, huge opportunity to incorporate that, as I said into the ESG principles and criteria. >> Good, 'cause that definitely is what needs to be addressed. (Lisa laughs) Guys thank you so much for joining Dave and me on the program, talking about Snowflake ADP, what you're doing together, and the massive potential that you're helping unlock with the value of data. We appreciate your insights and your time. >> Thank you for having us. >> Dave: Thanks guys. >> Thank you so much. >> For our guests, and Dave Vellante, I'm Lisa Martin. You're watching theCUBE, live in Las Vegas at Snowflake Summit 22. Dave and I will be right back with our next guest. (upbeat music)

Published Date : Jun 15 2022

SUMMARY :

the Global Head of Financial in the last couple of years. inside of the financial services industry And of course we don't is, one of the things that we It helps with the talent war and- inside of the Snowflake system You guys announced the We're a platform, but the like the only industry Well really the intersection of the two And so as you look so that the things are I mean, a lot of CDOs that I know Thanks. And for a while it was And then all of a sudden, So I have that job with data governance that builds data products. That's somewhat unique in your... And then we help with all that governance So I've got the CIO I've got the security as a peer Talk about the alignment with business. and measure that value as we go. but you really are data first. I mean, our CEO says- And it literally is. So the data just pops up So it's easier to be able Obviously the contracts have to be signed, could slow down the sale. in the Snowflake data cloud. Yeah, and the ecosystem we announced, and monetize data with your partners and help our partners monetize the data When you think about... as we were getting into this, are we at with that whole? behalf of the consumers? where you can influence public policy the day we serve people, Well, plus your observation that you can bring. happening in the economy. It's really interesting you say that. Okay. about the amount of data or it's a crystal ball, but you have it. that here on theCUBE. We think we have great partners. going to knock out of the park. that the industry faces, ESG's a good one. And that was built on Snowflake. of the question around regulation they don't have to rely on a survey. the transactional systems companies all the time. about the data that the ADP and the massive potential Dave and I will be right

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Brian GilmorePERSON

0.99+

David BrownPERSON

0.99+

Tim YoakumPERSON

0.99+

Lisa MartinPERSON

0.99+

Dave VolantePERSON

0.99+

Dave VellantePERSON

0.99+

BrianPERSON

0.99+

DavePERSON

0.99+

Tim YokumPERSON

0.99+

StuPERSON

0.99+

Herain OberoiPERSON

0.99+

JohnPERSON

0.99+

Dave ValantePERSON

0.99+

Kamile TaoukPERSON

0.99+

John FourierPERSON

0.99+

Rinesh PatelPERSON

0.99+

Dave VellantePERSON

0.99+

Santana DasguptaPERSON

0.99+

EuropeLOCATION

0.99+

CanadaLOCATION

0.99+

BMWORGANIZATION

0.99+

CiscoORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

ICEORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

Jack BerkowitzPERSON

0.99+

AustraliaLOCATION

0.99+

NVIDIAORGANIZATION

0.99+

TelcoORGANIZATION

0.99+

VenkatPERSON

0.99+

MichaelPERSON

0.99+

CamillePERSON

0.99+

Andy JassyPERSON

0.99+

IBMORGANIZATION

0.99+

Venkat KrishnamachariPERSON

0.99+

DellORGANIZATION

0.99+

Don TapscottPERSON

0.99+

thousandsQUANTITY

0.99+

Palo AltoLOCATION

0.99+

Intercontinental ExchangeORGANIZATION

0.99+

Children's Cancer InstituteORGANIZATION

0.99+

Red HatORGANIZATION

0.99+

telcoORGANIZATION

0.99+

Sabrina YanPERSON

0.99+

TimPERSON

0.99+

SabrinaPERSON

0.99+

John FurrierPERSON

0.99+

GoogleORGANIZATION

0.99+

MontyCloudORGANIZATION

0.99+

AWSORGANIZATION

0.99+

LeoPERSON

0.99+

COVID-19OTHER

0.99+

Santa AnaLOCATION

0.99+

UKLOCATION

0.99+

TusharPERSON

0.99+

Las VegasLOCATION

0.99+

ValentePERSON

0.99+

JL ValentePERSON

0.99+

1,000QUANTITY

0.99+

Howard Levenson


 

>>AWS public sector summit here in person in Washington, D. C. For two days live. Finally a real event. I'm john for your host of the cube. Got a great guest Howard Levinson from data bricks, regional vice president and general manager of the federal team for data bricks. Uh Super unicorn. Is it a decade corn yet? It's uh, not yet public but welcome to the cube. >>I don't know what the next stage after unicorn is, but we're growing rapidly. >>Thank you. Our audience knows David bricks extremely well. Always been on the cube many times. Even back, we were covering them back when big data was big data. Now it's all data everything. So we watched your success. Congratulations. Thank you. Um, so there's no, you know, not a big bridge for us across to see you here at AWS public sector summit. Tell us what's going on inside the data bricks amazon relationship. >>Yeah. It's been a great relationship. You know, when the company got started some number of years ago we got a contract with the government to deliver the data brooks capability and they're classified cloud in amazon's classified cloud. So that was the start of a great federal relationship today. Virtually all of our businesses in AWS and we run in every single AWS environment from commercial cloud to Govcloud to secret top secret environments and we've got customers doing great things and experiencing great results from data bricks and amazon. >>The federal government's the classic, I call migration opportunity. Right? Because I mean, let's face it before the pandemic even five years ago, even 10 years ago. Glacier moving speed slow, slow and they had to get modernized with the pandemic forced really to do it. But you guys have already cleared the runway with your value problems. You've got lake house now you guys are really optimized for the cloud. >>Okay, hardcore. Yeah. We are, we only run in the cloud and we take advantage of every single go fast feature that amazon gives us. But you know john it's The Office of Management and Budget. Did a study a couple of years ago. I think there were 28,000 federal data centers, 28,000 federal data centers. Think about that for a minute and just think about like let's say in each one of those data centers you've got a handful of operational data stores of databases. The federal government is trying to take all of that data and make sense out of it. The first step to making sense out of it is bringing it all together, normalizing it. Fed aerating it and that's exactly what we do. And that's been a real win for our federal clients and it's been a real exciting opportunity to watch people succeed in that >>endeavour. We have another guest on. And she said those data center huggers tree huggers data center huggers, majority of term people won't let go. Yeah. So but they're slowly dying away and moving on to the cloud. So migrations huge. How are you guys migrating with your customers? Give us an example of how it's working. What are some of the use cases? >>So before I do that I want to tell you a quick story. I've I had the luxury of working with the Air Force Chief data officer Ailene vedrine and she is commonly quoted as saying just remember as as airmen it's not your data it's the Air Force's data. So people were data center huggers now their data huggers but all of that data belongs to the government at the end of the day. So how do we help in that? Well think about all this data sitting in all these operational data stores they're getting it's getting updated all the time. But you want to be able to Federated this data together and make some sense out of it. So for like an organization like uh us citizenship and immigration services they had I think 28 different data sources and they want to be able to pull that data basically in real time and bring it into a data lake. Well that means doing a change data capture off of those operational data stores transforming that data and normalizing it so that you can then enjoy it. And we've done that I think they're now up to 70 data sources that are continually ingested into their data lake. And from there they support thousands of users doing analysis and reports for the whole visa processing system for the United States, the whole naturalization environment And their efficiency has gone up I think by their metrics by 24 x. >>Yeah. I mean Sandy carter was just on the cube earlier. She's the Vice president partner ecosystem here at public sector. And I was coming to her that federal game has changed, it used to be hard to get into you know everybody and you navigate the trip wires and all the subtle hints and and the people who are friends and it was like cloak and dagger and so people were locked in on certain things databases and data because now has to be freely available. I know one of the things that you guys are passionate about and this is kind of hard core architectural thing is that you need horizontally scalable data to really make a I work right. Machine learning works when you have data. How far along are these guys in their thinking when you have a customer because we're seeing progress? How far along are we? >>Yeah, we still have a long way to go in the federal government. I mean, I tell everybody, I think the federal government's probably four or five years behind what data bricks top uh clients are doing. But there are clearly people in the federal government that have really ramped it up and are on a par were even exceeding some of the commercial clients, U. S. C. I. S CBP FBI or some of the clients that we work with that are pretty far ahead and I'll say I mentioned a lot about the operational data stores but there's all kinds of data that's coming in at U S. C. I. S. They do these naturalization interviews, those are captured in real text. So now you want to do natural language processing against them, make sure these interviews are of the highest quality control, We want to be able to predict which people are going to show up for interviews based on their geospatial location and the day of the week and other factors the weather perhaps. So they're using all of these data types uh imagery text and structure data all in the Lake House concept to make predictions about how they should run their >>business. So that's a really good point. I was talking with keith brooks earlier directive is development, go to market strategy for AWS public sector. He's been there from the beginning this the 10th year of Govcloud. Right, so we're kind of riffing but the jpl Nasa Jpl, they did production workloads out of the gate. Yeah. Full mission. So now fast forward today. Cloud Native really is available. So like how do you see the the agencies in the government handling Okay. Re platform and I get that but now to do the reef acting where you guys have the Lake House new things can happen with cloud Native technologies, what's the what's the what's the cross over point for that point. >>Yeah, I think our Lake House architecture is really a big breakthrough architecture. It used to be, people would take all of this data, they put it in a Hadoop data lake, they'd end up with a data swamp with really not good control or good data quality. And uh then they would take the data from the data swamp where the data lake and they curate it and go through an E. T. L. Process and put a second copy into their data warehouse. So now you have two copies of the data to governance models. Maybe two versions of the data. A lot to manage. A lot to control with our Lake House architecture. You can put all of that data in the data lake it with our delta format. It comes in a curated way. Uh there's a catalogue associated with the data. So you know what you've got. And now you can literally build an ephemeral data warehouse directly on top of that data and it exists only for the period of time that uh people need it. And so it's cloud Native. It's elastically scalable. It terminates when nobody's using it. We run the whole center for Medicaid Medicare services. The whole Medicaid repository for the United States runs in an ephemeral data warehouse built on Amazon S three. >>You know, that is a huge call out, I want to just unpack that for a second. What you just said to me puts the exclamation point on cloud value because it's not your grandfather's data warehouse, it's like okay we do data warehouse capability but we're using higher level cloud services, whether it's governance stuff for a I to actually make it work at scale for those environments. I mean that that to me is re factoring that's not re platform Ng. Just re platform that's re platform Ng in the cloud and then re factoring capability for on uh new >>advantages. It's really true. And now you know at CMS, they have one copy of the data so they do all of their reporting, they've got a lot of congressional reports that they need to do. But now they're leveraging that same data, not making a copy of it for uh the center for program integrity for fraud. And we know how many billions of dollars worth of fraud exist in the Medicaid system. And now we're applying artificial intelligence and machine learning on entity analytics to really get to the root of those problems. It's a game >>changer. And this is where the efficiency comes in at scale. Because you start to see, I mean we always talk on the cube about like how software is changed the old days you put on the shelf shelf where they called it. Uh that's our generation. And now you got the cloud, you didn't know if something is hot or not until the inventory is like we didn't sell through in the cloud. If you're not performing, you suck basically. So it's not working, >>it's an instant Mhm. >>Report card. So now when you go to the cloud, you think the data lake and uh the lake house what you guys do uh and others like snowflake and were optimized in the cloud, you can't deny it. And then when you compare it to like, okay, so I'm saving you millions and millions if you're just on one thing, never mind the top line opportunities. >>So so john you know, years ago people didn't believe the cloud was going to be what it is. Like pretty much today, the clouds inevitable. It's everywhere. I'm gonna make you another prediction. Um And you can say you heard it here first, the data warehouse is going away. The Lake house is clearly going to replace it. There's no need anymore for two separate copies, there's no need for a proprietary uh storage copy of your data and people want to be able to apply more than sequel to the data. Uh Data warehouses, just restrict. What about an ocean house? >>Yeah. Lake is kind of small. When you think about this lake, Michigan is pretty big now, I think it's I >>think it's going to go bigger than that. I think we're talking about Sky Computer, we've been a cloud computing, we're going to uh and we're going to do that because people aren't gonna put all of their data in one place, they're going to have, it spread across different amazon regions or or or amazon availability zones and you're going to want to share data and you know, we just introduced this delta sharing capability. I don't know if you're familiar with it but it allows you to share data without a sharing server directly from picking up basically the amazon, you RLS and sharing them with different organizations. So you're sharing in place. The data actually isn't moving. You've got great governance and great granularity of the data that you choose to share and data sharing is going to be the next uh >>next break. You know, I really loved the Lake House were fairly sing gateway. I totally see that. So I totally would align with that and say I bet with you on that one. The Sky net Skynet, the Sky computing. >>See you're taking it away man, >>I know Skynet got anything that was computing in the Sky is Skynet that's terminated So but that's real. I mean I think that's a concept where it's like, you know what services and functions does for servers, you don't have a data, >>you've got to be able to connect data, nobody lives in an island. You've got to be able to connect data and more data. We all know more data produces better results. So how do you get more data? You connect to more data sources, >>Howard great to have you on talk about the relationship real quick as we end up here with amazon, What are you guys doing together? How's the partnership? >>Yeah, I mean the partnership with amazon is amazing. We have, we work uh, I think probably 95% of our federal business is running in amazon's cloud today. As I mentioned, john we run across uh, AWS commercial AWS GovCloud secret environment. See to us and you know, we have better integration with amazon services than I'll say some of the amazon services if people want to integrate with glue or kinesis or Sagemaker, a red shift, we have complete integration with all of those and that's really, it's not just a partnership at the sales level. It's a partnership and integration at the engineering level. >>Well, I think I'm really impressed with you guys as a company. I think you're an example of the kind of business model that people might have been afraid of which is being in the cloud, you can have a moat, you have competitive advantage, you can build intellectual property >>and, and john don't forget, it's all based on open source, open data, like almost everything that we've done. We've made available to people, we get 30 million downloads of the data bricks technology just for people that want to use it for free. So no vendor lock in. I think that's really important to most of our federal clients into everybody. >>I've always said competitive advantage scale and choice. Right. That's a data bricks. Howard? Thanks for coming on the key, appreciate it. Thanks again. Alright. Cube coverage here in Washington from face to face physical event were on the ground. Of course, we're also streaming a digital for the hybrid event. This is the cubes coverage of a W. S. Public sector Summit will be right back after this short break.

Published Date : Sep 28 2021

SUMMARY :

to the cube. Um, so there's no, you know, So that was the start of a great federal relationship But you guys have already cleared the runway with your value problems. But you know john it's The How are you guys migrating with your customers? So before I do that I want to tell you a quick story. I know one of the things that you guys are passionate So now you want to do natural language processing against them, make sure these interviews are of the highest quality So like how do you see the So now you have two copies of the data to governance models. I mean that that to me is re factoring that's not re platform And now you know at CMS, they have one copy of the data talk on the cube about like how software is changed the old days you put on the shelf shelf where they called So now when you go to the cloud, you think the data lake and uh the lake So so john you know, years ago people didn't believe the cloud When you think about this lake, Michigan is pretty big now, I think it's I of the data that you choose to share and data sharing is going to be the next uh So I totally would align with that and say I bet with you on that one. I mean I think that's a concept where it's like, you know what services So how do you get more See to us and you know, we have better integration with amazon services Well, I think I'm really impressed with you guys as a company. I think that's really important to most of our federal clients into everybody. Thanks for coming on the key, appreciate it.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
amazonORGANIZATION

0.99+

Howard LevinsonPERSON

0.99+

WashingtonLOCATION

0.99+

SkynetORGANIZATION

0.99+

HowardPERSON

0.99+

AWSORGANIZATION

0.99+

two copiesQUANTITY

0.99+

Washington, D. C.LOCATION

0.99+

two daysQUANTITY

0.99+

30 millionQUANTITY

0.99+

two versionsQUANTITY

0.99+

keith brooksPERSON

0.99+

95%QUANTITY

0.99+

two separate copiesQUANTITY

0.99+

Howard LevensonPERSON

0.99+

millionsQUANTITY

0.99+

Ailene vedrinePERSON

0.99+

one copyQUANTITY

0.99+

fourQUANTITY

0.99+

SkyORGANIZATION

0.99+

10 years agoDATE

0.99+

five yearsQUANTITY

0.99+

first stepQUANTITY

0.99+

28 different data sourcesQUANTITY

0.99+

MichiganLOCATION

0.99+

AmazonORGANIZATION

0.99+

Sky ComputerORGANIZATION

0.98+

United StatesLOCATION

0.98+

28,000 federal data centersQUANTITY

0.98+

billions of dollarsQUANTITY

0.98+

28,000 federal data centersQUANTITY

0.98+

five years agoDATE

0.98+

second copyQUANTITY

0.98+

thousands of usersQUANTITY

0.98+

pandemicEVENT

0.98+

AWSEVENT

0.97+

todayDATE

0.97+

10th yearQUANTITY

0.97+

W. S. Public sector SummitEVENT

0.97+

Lake HouseLOCATION

0.97+

johnPERSON

0.96+

Air ForceORGANIZATION

0.96+

oneQUANTITY

0.96+

NasaORGANIZATION

0.96+

Sky netORGANIZATION

0.96+

each oneQUANTITY

0.96+

Medicaid MedicareORGANIZATION

0.95+

one thingQUANTITY

0.94+

24QUANTITY

0.94+

data bricksORGANIZATION

0.94+

U S. C. I. S.LOCATION

0.92+

up to 70 data sourcesQUANTITY

0.91+

Chief data officerPERSON

0.9+

firstQUANTITY

0.89+

GovcloudORGANIZATION

0.88+

Cloud NativeTITLE

0.88+

one placeQUANTITY

0.87+

GovCloudTITLE

0.87+

couple of years agoDATE

0.86+

Office of Management and BudgetORGANIZATION

0.85+

Sandy carterPERSON

0.84+

years agoDATE

0.83+

AWS public sector summitEVENT

0.83+

U. S. C. I. SORGANIZATION

0.81+

MedicaidORGANIZATION

0.79+

a minuteQUANTITY

0.77+

number of years agoDATE

0.77+

a secondQUANTITY

0.75+

center huggersORGANIZATION

0.72+

NgTITLE

0.71+

Thought.Leaders Digital 2020 | Japan


 

(speaks in foreign language) >> Narrator: Data is at the heart of transformation and the change every company needs to succeed, but it takes more than new technology. It's about teams, talent, and cultural change. Empowering everyone on the front lines to make decisions, all at the speed of digital. The transformation starts with you. It's time to lead the way, it's time for thought leaders. >> Welcome to Thought Leaders, a digital event brought to you by ThoughtSpot. My name is Dave Vellante. The purpose of this day is to bring industry leaders and experts together to really try and understand the important issues around digital transformation. We have an amazing lineup of speakers and our goal is to provide you with some best practices that you can bring back and apply to your organization. Look, data is plentiful, but insights are not. ThoughtSpot is disrupting analytics by using search and machine intelligence to simplify data analysis, and really empower anyone with fast access to relevant data. But in the last 150 days, we've had more questions than answers. Creating an organization that puts data and insights at their core, requires not only modern technology, but leadership, a mindset and a culture that people often refer to as data-driven. What does that mean? How can we equip our teams with data and fast access to quality information that can turn insights into action. And today, we're going to hear from experienced leaders, who are transforming their organizations with data, insights and creating digital-first cultures. But before we introduce our speakers, I'm joined today by two of my co-hosts from ThoughtSpot. First, Chief Data Strategy Officer for ThoughtSpot is Cindi Hausen. Cindi is an analytics and BI expert with 20 plus years experience and the author of Successful Business Intelligence Unlock The Value of BI and Big Data. Cindi was previously the lead analyst at Gartner for the data and analytics magic quadrant. And early last year, she joined ThoughtSpot to help CDOs and their teams understand how best to leverage analytics and AI for digital transformation. Cindi, great to see you, welcome to the show. >> Thank you, Dave. Nice to join you virtually. >> Now our second cohost and friend of theCUBE is ThoughtSpot CEO Sudheesh Nair. Hello Sudheesh, how are you doing today? >> I am well Dave, it's good to talk to you again. >> It's great to see you. Thanks so much for being here. Now Sudheesh, please share with us why this discussion is so important to your customers and of course, to our audience and what they're going to learn today? (gentle music) >> Thanks, Dave, I wish you were there to introduce me into every room that I walk into because you have such an amazing way of doing it. It makes me feel also good. Look, since we have all been cooped up in our homes, I know that the vendors like us, we have amped up our, you know, sort of effort to reach out to you with invites for events like this. So we are getting way more invites for events like this than ever before. So when we started planning for this, we had three clear goals that we wanted to accomplish. And our first one that when you finish this and walk away, we want to make sure that you don't feel like it was a waste of time. We want to make sure that we value your time, and this is going to be useful. Number two, we want to put you in touch with industry leaders and thought leaders, and generally good people that you want to hang around with long after this event is over. And number three, as we plan through this, you know, we are living through these difficult times, we want an event to be, this event to be more of an uplifting and inspiring event too. Now, the challenge is, how do you do that with the team being change agents? Because change and as much as we romanticize it, it is not one of those uplifting things that everyone wants to do or likes to do. The way I think of it, change is sort of like, if you've ever done bungee jumping. You know, it's like standing on the edges, waiting to make that one more step. You know, all you have to do is take that one step and gravity will do the rest, but that is the hardest step to take. Change requires a lot of courage and when we are talking about data and analytics, which is already like such a hard topic, not necessarily an uplifting and positive conversation, in most businesses it is somewhat scary. Change becomes all the more difficult. Ultimately change requires courage. Courage to to, first of all, challenge the status quo. People sometimes are afraid to challenge the status quo because they are thinking that, "You know, maybe I don't have the power to make the change that the company needs. Sometimes I feel like I don't have the skills." Sometimes they may feel that, I'm probably not the right person to do it. Or sometimes the lack of courage manifest itself as the inability to sort of break the silos that are formed within the organizations, when it comes to data and insights that you talked about. You know, there are people in the company, who are going to hog the data because they know how to manage the data, how to inquire and extract. They know how to speak data, they have the skills to do that, but they are not the group of people who have sort of the knowledge, the experience of the business to ask the right questions off the data. So there is this silo of people with the answers and there is a silo of people with the questions, and there is gap. These sort of silos are standing in the way of making that necessary change that we all I know the business needs, and the last change to sort of bring an external force sometimes. It could be a tool, it could be a platform, it could be a person, it could be a process, but sometimes no matter how big the company is or how small the company is. You may need to bring some external stimuli to start that domino of the positive changes that are necessary. The group of people that we have brought in, the four people, including Cindi, that you will hear from today are really good at practically telling you how to make that step, how to step off that edge, how to trust the rope that you will be safe and you're going to have fun. You will have that exhilarating feeling of jumping for a bungee jump. All four of them are exceptional, but my honor is to introduce Michelle and she's our first speaker. Michelle, I am very happy after watching her presentation and reading her bio, that there are no country vital worldwide competition for cool patents, because she will beat all of us because when her children were small, you know, they were probably into Harry Potter and Disney and she was managing a business and leading change there. And then as her kids grew up and got to that age, where they like football and NFL, guess what? She's the CIO of NFL. What a cool mom. I am extremely excited to see what she's going to talk about. I've seen the slides with a bunch of amazing pictures, I'm looking to see the context behind it. I'm very thrilled to make the acquaintance of Michelle. I'm looking forward to her talk next. Welcome Michelle. It's over to you. (gentle music) >> I'm delighted to be with you all today to talk about thought leadership. And I'm so excited that you asked me to join you because today I get to be a quarterback. I always wanted to be one. This is about as close as I'm ever going to get. So, I want to talk to you about quarterbacking our digital revolution using insights, data and of course, as you said, leadership. First, a little bit about myself, a little background. As I said, I always wanted to play football and this is something that I wanted to do since I was a child but when I grew up, girls didn't get to play football. I'm so happy that that's changing and girls are now doing all kinds of things that they didn't get to do before. Just this past weekend on an NFL field, we had a female coach on two sidelines and a female official on the field. I'm a lifelong fan and student of the game of football. I grew up in the South. You can tell from the accent and in the South football is like a religion and you pick sides. I chose Auburn University working in the athletic department, so I'm testament. Till you can start, a journey can be long. It took me many, many years to make it into professional sports. I graduated in 1987 and my little brother, well not actually not so little, he played offensive line for the Alabama Crimson Tide. And for those of you who know SEC football, you know this is a really big rivalry, and when you choose sides your family is divided. So it's kind of fun for me to always tell the story that my dad knew his kid would make it to the NFL, he just bet on the wrong one. My career has been about bringing people together for memorable moments at some of America's most iconic brands, delivering memories and amazing experiences that delight. From Universal Studios, Disney, to my current position as CIO of the NFL. In this job, I'm very privileged to have the opportunity to work with a team that gets to bring America's game to millions of people around the world. Often, I'm asked to talk about how to create amazing experiences for fans, guests or customers. But today, I really wanted to focus on something different and talk to you about being behind the scenes and backstage. Because behind every event, every game, every awesome moment, is execution. Precise, repeatable execution and most of my career has been behind the scenes doing just that. Assembling teams to execute these plans and the key way that companies operate at these exceptional levels is making good decisions, the right decisions, at the right time and based upon data. So that you can translate the data into intelligence and be a data-driven culture. Using data and intelligence is an important way that world-class companies do differentiate themselves, and it's the lifeblood of collaboration and innovation. Teams that are working on delivering these kind of world class experiences are often seeking out and leveraging next generation technologies and finding new ways to work. I've been fortunate to work across three decades of emerging experiences, which each required emerging technologies to execute. A little bit first about Disney. In '90s I was at Disney leading a project called Destination Disney, which it's a data project. It was a data project, but it was CRM before CRM was even cool and then certainly before anything like a data-driven culture was ever brought up. But way back then we were creating a digital backbone that enabled many technologies for the things that you see today. Like the MagicBand, Disney's Magical Express. My career at Disney began in finance, but Disney was very good about rotating you around. And it was during one of these rotations that I became very passionate about data. I kind of became a pain in the butt to the IT team asking for data, more and more data. And I learned that all of that valuable data was locked up in our systems. All of our point of sales systems, our reservation systems, our operation systems. And so I became a shadow IT person in marketing, ultimately, leading to moving into IT and I haven't looked back since. In the early 2000s, I was at Universal Studio's theme park as their CIO preparing for and launching the Wizarding World of Harry Potter. Bringing one of history's most memorable characters to life required many new technologies and a lot of data. Our data and technologies were embedded into the rides and attractions. I mean, how do you really think a wand selects you at a wand shop. As today at the NFL, I am constantly challenged to do leading edge technologies, using things like sensors, AI, machine learning and all new communication strategies, and using data to drive everything, from player performance, contracts, to where we build new stadiums and hold events. With this year being the most challenging, yet rewarding year in my career at the NFL. In the middle of a global pandemic, the way we are executing on our season is leveraging data from contact tracing devices joined with testing data. Talk about data actually enabling your business. Without it we wouldn't be having a season right now. I'm also on the board of directors of two public companies, where data and collaboration are paramount. First, RingCentral, it's a cloud based unified communications platform and collaboration with video message and phone, all-in-one solution in the cloud and Quotient Technologies, whose product is actually data. The tagline at Quotient is The Result in Knowing. I think that's really important because not all of us are data companies, where your product is actually data, but we should operate more like your product is data. I'd also like to talk to you about four areas of things to think about as thought leaders in your companies. First, just hit on it, is change. how to be a champion and a driver of change. Second, how to use data to drive performance for your company and measure performance of your company. Third, how companies now require intense collaboration to operate and finally, how much of this is accomplished through solid data-driven decisions. First, let's hit on change. I mean, it's evident today more than ever, that we are in an environment of extreme change. I mean, we've all been at this for years and as technologists we've known it, believed it, lived it. And thankfully, for the most part, knock on wood, we were prepared for it. But this year everyone's cheese was moved. All the people in the back rooms, IT, data architects and others were suddenly called to the forefront because a global pandemic has turned out to be the thing that is driving intense change in how people work and analyze their business. On March 13th, we closed our office at the NFL in the middle of preparing for one of our biggest events, our kickoff event, The 2020 Draft. We went from planning a large event in Las Vegas under the bright lights, red carpet stage, to smaller events in club facilities. And then ultimately, to one where everyone coaches, GMs, prospects and even our commissioner were at home in their basements and we only had a few weeks to figure it out. I found myself for the first time, being in the live broadcast event space. Talking about bungee jumping, this is really what it felt like. It was one in which no one felt comfortable because it had not been done before. But leading through this, I stepped up, but it was very scary, it was certainly very risky, but it ended up being also rewarding when we did it. And as a result of this, some things will change forever. Second, managing performance. I mean, data should inform how you're doing and how to get your company to perform at its level, highest level. As an example, the NFL has always measured performance, obviously, and it is one of the purest examples of how performance directly impacts outcome. I mean, you can see performance on the field, you can see points being scored and stats, and you immediately know that impact. Those with the best stats usually win the games. The NFL has always recorded stats. Since the beginning of time here at the NFL a little... This year is our 101st year and athlete's ultimate success as a player has also always been greatly impacted by his stats. But what has changed for us is both how much more we can measure and the immediacy with which it can be measured and I'm sure in your business it's the same. The amount of data you must have has got to have quadrupled recently. And how fast do you need it and how quickly you need to analyze it is so important. And it's very important to break the silos between the keys to the data and the use of the data. Our next generation stats platform is taking data to the next level. It's powered by Amazon Web Services and we gather this data, real-time from sensors that are on players' bodies. We gather it in real time, analyze it, display it online and on broadcast. And of course, it's used to prepare week to week in addition to what is a normal coaching plan would be. We can now analyze, visualize, route patterns, speed, match-ups, et cetera, so much faster than ever before. We're continuing to roll out sensors too, that will gather more and more information about a player's performance as it relates to their health and safety. The third trend is really, I think it's a big part of what we're feeling today and that is intense collaboration. And just for sort of historical purposes, it's important to think about, for those of you that are IT professionals and developers, you know, more than 10 years ago agile practices began sweeping companies. Where small teams would work together rapidly in a very flexible, adaptive and innovative way and it proved to be transformational. However today, of course that is no longer just small teams, the next big wave of change and we've seen it through this pandemic, is that it's the whole enterprise that must collaborate and be agile. If I look back on my career, when I was at Disney, we owned everything 100%. We made a decision, we implemented it. We were a collaborative culture but it was much easier to push change because you own the whole decision. If there was buy-in from the top down, you got the people from the bottom up to do it and you executed. At Universal, we were a joint venture. Our attractions and entertainment was licensed. Our hotels were owned and managed by other third parties, so influence and collaboration, and how to share across companies became very important. And now here I am at the NFL an even the bigger ecosystem. We have 32 clubs that are all separate businesses, 31 different stadiums that are owned by a variety of people. We have licensees, we have sponsors, we have broadcast partners. So it seems that as my career has evolved, centralized control has gotten less and less and has been replaced by intense collaboration, not only within your own company but across companies. The ability to work in a collaborative way across businesses and even other companies, that has been a big key to my success in my career. I believe this whole vertical integration and big top-down decision-making is going by the wayside in favor of ecosystems that require cooperation, yet competition to co-exist. I mean, the NFL is a great example of what we call co-oppetition, which is cooperation and competition. We're in competition with each other, but we cooperate to make the company the best it can be. And at the heart of these items really are data-driven decisions and culture. Data on its own isn't good enough. You must be able to turn it to insights. Partnerships between technology teams who usually hold the keys to the raw data and business units, who have the knowledge to build the right decision models is key. If you're not already involved in this linkage, you should be, data mining isn't new for sure. The availability of data is quadrupling and it's everywhere. How do you know what to even look at? How do you know where to begin? How do you know what questions to ask? It's by using the tools that are available for visualization and analytics and knitting together strategies of the company. So it begins with, first of all, making sure you do understand the strategy of the company. So in closing, just to wrap up a bit, many of you joined today, looking for thought leadership on how to be a change agent, a change champion, and how to lead through transformation. Some final thoughts are be brave and drive. Don't do the ride along program, it's very important to drive. Driving can be high risk, but it's also high reward. Embracing the uncertainty of what will happen is how you become brave. Get more and more comfortable with uncertainty, be calm and let data be your map on your journey. Thanks. >> Michelle, thank you so much. So you and I share a love of data and a love of football. You said you want to be the quarterback. I'm more an a line person. >> Well, then I can't do my job without you. >> Great and I'm getting the feeling now, you know, Sudheesh is talking about bungee jumping. My vote is when we're past this pandemic, we both take him to the Delaware Water Gap and we do the cliff jumping. >> Oh that sounds good, I'll watch your watch. >> Yeah, you'll watch, okay. So Michelle, you have so many stakeholders, when you're trying to prioritize the different voices you have the players, you have the owners, you have the league, as you mentioned, the broadcasters, your partners here and football mamas like myself. How do you prioritize when there are so many different stakeholders that you need to satisfy? >> I think balancing across stakeholders starts with aligning on a mission and if you spend a lot of time understanding where everyone's coming from, and you can find the common thread that ties them all together. You sort of do get them to naturally prioritize their work and I think that's very important. So for us at the NFL and even at Disney, it was our core values and our core purpose is so well known and when anything challenges that, we're able to sort of lay that out. But as a change agent, you have to be very empathetic, and I would say empathy is probably your strongest skill if you're a change agent and that means listening to every single stakeholder. Even when they're yelling at you, even when they're telling you your technology doesn't work and you know that it's user error, or even when someone is just emotional about what's happening to them and that they're not comfortable with it. So I think being empathetic, and having a mission, and understanding it is sort of how I prioritize and balance. >> Yeah, empathy, a very popular word this year. I can imagine those coaches and owners yelling, so thank you for your leadership here. So Michelle, I look forward to discussing this more with our other customers and disruptors joining us in a little bit. >> (gentle music) So we're going to take a hard pivot now and go from football to Chernobyl. Chernobyl, what went wrong? 1986, as the reactors were melting down, they had the data to say, "This is going to be catastrophic," and yet the culture said, "No, we're perfect, hide it. Don't dare tell anyone." Which meant they went ahead and had celebrations in Kiev. Even though that increased the exposure, additional thousands getting cancer and 20,000 years before the ground around there can even be inhabited again. This is how powerful and detrimental a negative culture, a culture that is unable to confront the brutal facts that hides data. This is what we have to contend with and this is why I want you to focus on having, fostering a data-driven culture. I don't want you to be a laggard. I want you to be a leader in using data to drive your digital transformation. So I'll talk about culture and technology, is it really two sides of the same coin? Real-world impacts and then some best practices you can use to disrupt and innovate your culture. Now, oftentimes I would talk about culture and I talk about technology. And recently a CDO said to me, "You know, Cindi, I actually think this is two sides of the same coin, one reflects the other." What do you think? Let me walk you through this. So let's take a laggard. What does the technology look like? Is it based on 1990s BI and reporting, largely parametrized reports, on-premises data warehouses, or not even that operational reports. At best one enterprise data warehouse, very slow moving and collaboration is only email. What does that culture tell you? Maybe there's a lack of leadership to change, to do the hard work that Sudheesh referred to, or is there also a culture of fear, afraid of failure, resistance to change, complacency. And sometimes that complacency, it's not because people are lazy. It's because they've been so beaten down every time a new idea is presented. It's like, "No, we're measured on least to serve." So politics and distrust, whether it's between business and IT or individual stakeholders is the norm, so data is hoarded. Let's contrast that with the leader, a data and analytics leader, what does their technology look like? Augmented analytics, search and AI driven insights, not on-premises but in the cloud and maybe multiple clouds. And the data is not in one place but it's in a data lake and in a data warehouse, a logical data warehouse. The collaboration is via newer methods, whether it's Slack or Teams, allowing for that real-time decisioning or investigating a particular data point. So what is the culture in the leaders? It's transparent and trust. There is a trust that data will not be used to punish, that there is an ability to confront the bad news. It's innovation, valuing innovation in pursuit of the company goals. Whether it's the best fan experience and player safety in the NFL or best serving your customers, it's innovative and collaborative. There's none of this, "Oh, well, I didn't invent that. I'm not going to look at that." There's still pride of ownership, but it's collaborating to get to a better place faster. And people feel empowered to present new ideas, to fail fast and they're energized knowing that they're using the best technology and innovating at the pace that business requires. So data is democratized and democratized, not just for power users or analysts, but really at the point of impact, what we like to call the new decision-makers or really the frontline workers. So Harvard Business Review partnered with us to develop this study to say, "Just how important is this? We've been working at BI and analytics as an industry for more than 20 years, why is it not at the front lines? Whether it's a doctor, a nurse, a coach, a supply chain manager, a warehouse manager, a financial services advisor." 87% said they would be more successful if frontline workers were empowered with data-driven insights, but they recognize they need new technology to be able to do that. It's not about learning hard tools. The sad reality only 20% of organizations are actually doing this. These are the data-driven leaders. So this is the culture and technology, how did we get here? It's because state-of-the-art keeps changing. So the first generation BI and analytics platforms were deployed on-premises, on small datasets, really just taking data out of ERP systems that were also on-premises and state-of-the-art was maybe getting a management report, an operational report. Over time, visual based data discovery vendors disrupted these traditional BI vendors, empowering now analysts to create visualizations with the flexibility on a desktop, sometimes larger data, sometimes coming from a data warehouse. The current state-of-the-art though, Gartner calls it augmented analytics. At ThoughtSpot, we call it search and AI driven analytics, and this was pioneered for large scale data sets, whether it's on-premises or leveraging the cloud data warehouses. And I think this is an important point, oftentimes you, the data and analytics leaders, will look at these two components separately. But you have to look at the BI and analytics tier in lock-step with your data architectures to really get to the granular insights and to leverage the capabilities of AI. Now, if you've never seen ThoughtSpot, I'll just show you what this looks like. Instead of somebody hard coding a report, it's typing in search keywords and very robust keywords contains rank, top, bottom, getting to a visual visualization that then can be pinned to an existing pin board that might also contain insights generated by an AI engine. So it's easy enough for that new decision maker, the business user, the non-analyst to create themselves. Modernizing the data and analytics portfolio is hard because the pace of change has accelerated. You used to be able to create an investment, place a bet for maybe 10 years. A few years ago, that time horizon was five years. Now, it's maybe three years and the time to maturity has also accelerated. So you have these different components, the search and AI tier, the data science tier, data preparation and virtualization but I would also say, equally important is the cloud data warehouse. And pay attention to how well these analytics tools can unlock the value in these cloud data warehouses. So ThoughtSpot was the first to market with search and AI driven insights. Competitors have followed suit, but be careful, if you look at products like Power BI or SAP analytics cloud, they might demo well, but do they let you get to all the data without moving it in products like Snowflake, Amazon Redshift, or Azure Synapse, or Google BigQuery, they do not. They require you to move it into a smaller in-memory engine. So it's important how well these new products inter-operate. The pace of change, its acceleration, Gartner recently predicted that by 2022, 65% of analytical queries will be generated using search or NLP or even AI and that is roughly three times the prediction they had just a couple of years ago. So let's talk about the real world impact of culture and if you've read any of my books or used any of the maturity models out there, whether the Gartner IT Score that I worked on or the Data Warehousing Institute also has a maturity model. We talk about these five pillars to really become data-driven. As Michelle spoke about, it's focusing on the business outcomes, leveraging all the data, including new data sources, it's the talent, the people, the technology and also the processes. And often when I would talk about the people in the talent, I would lump the culture as part of that. But in the last year, as I've traveled the world and done these digital events for thought leaders. You have told me now culture is absolutely so important, and so we've pulled it out as a separate pillar. And in fact, in polls that we've done in these events, look at how much more important culture is as a barrier to becoming data-driven. It's three times as important as any of these other pillars. That's how critical it is. And let's take an example of where you can have great data, but if you don't have the right culture, there's devastating impacts. And I will say I have been a loyal customer of Wells Fargo for more than 20 years, but look at what happened in the face of negative news with data. It said, "Hey, we're not doing good cross-selling, customers do not have both a checking account and a credit card and a savings account and a mortgage." They opened fake accounts facing billions in fines, change in leadership that even the CEO attributed to a toxic sales culture and they're trying to fix this, but even recently there's been additional employee backlash saying the culture has not changed. Let's contrast that with some positive examples. Medtronic, a worldwide company in 150 countries around the world. They may not be a household name to you, but if you have a loved one or yourself, you have a pacemaker, spinal implant, diabetes, you know this brand. And at the start of COVID when they knew their business would be slowing down, because hospitals would only be able to take care of COVID patients. They took the bold move of making their IP for ventilators publicly available. That is the power of a positive culture. Or Verizon, a major telecom organization looking at late payments of their customers and even though the U.S. Federal Government said, "Well, you can't turn them off." They said, "We'll extend that even beyond the mandated guidelines," and facing a slow down in the business because of the tough economy, They said, "You know what? We will spend the time upskilling our people, giving them the time to learn more about the future of work, the skills and data and analytics for 20,000 of their employees rather than furloughing them. That is the power of a positive culture. So how can you transform your culture to the best in class? I'll give you three suggestions. Bring in a change agent, identify the relevance or I like to call it WIIFM and organize for collaboration. So the CDO, whatever your title is, Chief Analytics Officer, Chief Digital Officer, you are the most important change agent. And this is where you will hear that oftentimes a change agent has to come from outside the organization. So this is where, for example, in Europe you have the CDO of Just Eat, a takeout food delivery organization coming from the airline industry or in Australia, National Australian Bank taking a CDO within the same sector from TD Bank going to NAB. So these change agents come in, disrupt. It's a hard job. As one of you said to me, it often feels like. I make one step forward and I get knocked down again, I get pushed back. It is not for the faint of heart, but it's the most important part of your job. The other thing I'll talk about is WIIFM What's In It For Me? And this is really about understanding the motivation, the relevance that data has for everyone on the frontline, as well as those analysts, as well as the executives. So, if we're talking about players in the NFL, they want to perform better and they want to stay safe. That is why data matters to them. If we're talking about financial services, this may be a wealth management advisor. Okay, we could say commissions, but it's really helping people have their dreams come true, whether it's putting their children through college or being able to retire without having to work multiple jobs still into your 70s or 80s. For the teachers, teachers you ask them about data. They'll say, "We don't need that, I care about the student." So if you can use data to help a student perform better, that is WIIFM and sometimes we spend so much time talking the technology, we forget, what is the value we're trying to deliver with this? And we forget the impact on the people that it does require change. In fact, the Harvard Business Review study found that 44% said lack of change management is the biggest barrier to leveraging both new technology, but also being empowered to act on those data-driven insights. The third point, organize for collaboration. This does require diversity of thought, but also bringing the technology, the data and the business people together. Now there's not a single one size fits all model for data and analytics. At one point in time, even having a BICC, a BI competency center was considered state of the art. Now for the biggest impact, what I recommend is that you have a federated model centralized for economies of scale. That could be the common data, but then embed these evangelists, these analysts of the future within every business unit, every functional domain. And as you see this top bar, all models are possible, but the hybrid model has the most impact, the most leaders. So as we look ahead to the months ahead, to the year ahead, an exciting time because data is helping organizations better navigate a tough economy, lock in the customer loyalty and I look forward to seeing how you foster that culture that's collaborative with empathy and bring the best of technology, leveraging the cloud, all your data. So thank you for joining us at Thought Leaders. And next, I'm pleased to introduce our first change agent, Tom Mazzaferro Chief Data Officer of Western Union and before joining Western Union, Tom made his Mark at HSBC and JP Morgan Chase spearheading digital innovation in technology, operations, risk compliance and retail banking. Tom, thank you so much for joining us today. (gentle music) >> Very happy to be here and looking forward to talking to all of you today. So as we look to move organizations to a data-driven capability into the future, there is a lot that needs to be done on the data side, but also how does data connect and enable different business teams and the technology teams into the future? As we look across our data ecosystems and our platforms, and how we modernize that to the cloud in the future, it all needs to basically work together, right? To really be able to drive an organization from a data standpoint, into the future. That includes being able to have the right information with the right quality of data, at the right time to drive informed business decisions, to drive the business forward. As part of that, we actually have partnered with ThoughtSpot to actually bring in the technology to help us drive that. As part of that partnership and it's how we've looked to integrate it into our overall business as a whole. We've looked at, how do we make sure that our business and our professional lives, right? Are enabled in the same ways as our personal lives. So for example, in your personal lives, when you want to go and find something out, what do you do? You go onto google.com or you go onto Bing or you go onto Yahoo and you search for what you want, search to find an answer. ThoughtSpot for us is the same thing, but in the business world. So using ThoughtSpot and other AI capability is it's allowed us to actually enable our overall business teams in our company to actually have our information at our fingertips. So rather than having to go and talk to someone, or an engineer to go pull information or pull data. We actually can have the end users or the business executives, right. Search for what they need, what they want, at the exact time that they actually need it, to go and drive the business forward. This is truly one of those transformational things that we've put in place. On top of that, we are on a journey to modernize our larger ecosystem as a whole. That includes modernizing our underlying data warehouses, our technology, our... The local environments and as we move that, we've actually picked two of our cloud providers going to AWS and to GCP. We've also adopted Snowflake to really drive and to organize our information and our data, then drive these new solutions and capabilities forward. So a big portion of it though is culture. So how do we engage with the business teams and bring the IT teams together, to really help to drive these holistic end-to-end solutions and capabilities, to really support the actual business into the future. That's one of the keys here, as we look to modernize and to really enhance our organizations to become data-driven. This is the key. If you can really start to provide answers to business questions before they're even being asked and to predict based upon different economic trends or different trends in your business, what decisions need to be made and actually provide those answers to the business teams before they're even asking for it. That is really becoming a data-driven organization and as part of that, it really then enables the business to act quickly and take advantage of opportunities as they come in based upon industries, based upon markets, based upon products, solutions or partnerships into the future. These are really some of the keys that become crucial as you move forward, right, into this new age, Especially with COVID. With COVID now taking place across the world, right? Many of these markets, many of these digital transformations are celebrating and are changing rapidly to accommodate and to support customers in these very difficult times. As part of that, you need to make sure you have the right underlying foundation, ecosystems and solutions to really drive those capabilities and those solutions forward. As we go through this journey, both in my career but also each of your careers into the future, right? It also needs to evolve, right? Technology has changed so drastically in the last 10 years, and that change is only accelerating. So as part of that, you have to make sure that you stay up to speed, up to date with new technology changes, both on the platform standpoint, tools, but also what do our customers want, what do our customers need and how do we then service them with our information, with our data, with our platform, and with our products and our services to meet those needs and to really support and service those customers into the future. This is all around becoming a more data-driven organization, such as how do you use your data to support your current business lines, but how do you actually use your information and your data to actually better support your customers, better support your business, better support your employees, your operations teams and so forth. And really creating that full integration in that ecosystem is really when you start to get large dividends from these investments into the future. With that being said, I hope you enjoyed the segment on how to become and how to drive a data-driven organization, and looking forward to talking to you again soon. Thank you. >> Tom, that was great. Thanks so much and now going to have to drag on you for a second. As a change agent you've come in, disrupted and how long have you been at Western Union? >> Only nine months, so just started this year, but there have been some great opportunities to integrate changes and we have a lot more to go, but we're really driving things forward in partnership with our business teams and our colleagues to support those customers going forward. >> Tom, thank you so much. That was wonderful. And now, I'm excited to introduce you to Gustavo Canton, a change agent that I've had the pleasure of working with meeting in Europe and he is a serial change agent. Most recently with Schneider Electric but even going back to Sam's Clubs. Gustavo, welcome. (gentle music) >> So, hey everyone, my name is Gustavo Canton and thank you so much, Cindi, for the intro. As you mentioned, doing transformations is, you know, a high reward situation. I have been part of many transformations and I have led many transformations. And, what I can tell you is that it's really hard to predict the future, but if you have a North Star and you know where you're going, the one thing that I want you to take away from this discussion today is that you need to be bold to evolve. And so, in today, I'm going to be talking about culture and data, and I'm going to break this down in four areas. How do we get started, barriers or opportunities as I see it, the value of AI and also, how you communicate. Especially now in the workforce of today with so many different generations, you need to make sure that you are communicating in ways that are non-traditional sometimes. And so, how do we get started? So, I think the answer to that is you have to start for you yourself as a leader and stay tuned. And by that, I mean, you need to understand, not only what is happening in your function or your field, but you have to be very in tune what is happening in society socioeconomically speaking, wellbeing. You know, the common example is a great example and for me personally, it's an opportunity because the number one core value that I have is wellbeing. I believe that for human potential for customers and communities to grow, wellbeing should be at the center of every decision. And as somebody mentioned, it's great to be, you know, stay in tune and have the skillset and the courage. But for me personally, to be honest, to have this courage is not about not being afraid. You're always afraid when you're making big changes and you're swimming upstream, but what gives me the courage is the empathy part. Like I think empathy is a huge component because every time I go into an organization or a function, I try to listen very attentively to the needs of the business and what the leaders are trying to do. But I do it thinking about the mission of, how do I make change for the bigger workforce or the bigger good despite the fact that this might have perhaps implication for my own self interest in my career. Right? Because you have to have that courage sometimes to make choices that are not well seen, politically speaking, but are the right thing to do and you have to push through it. So the bottom line for me is that, I don't think we're they're transforming fast enough. And the reality is, I speak with a lot of leaders and we have seen stories in the past and what they show is that, if you look at the four main barriers that are basically keeping us behind budget, inability to act, cultural issues, politics and lack of alignment, those are the top four. But the interesting thing is that as Cindi has mentioned, these topic about culture is actually gaining more and more traction. And in 2018, there was a story from HBR and it was about 45%. I believe today, it's about 55%, 60% of respondents say that this is the main area that we need to focus on. So again, for all those leaders and all the executives who understand and are aware that we need to transform, commit to the transformation and set a deadline to say, "Hey, in two years we're going to make this happen. What do we need to do, to empower and enable these change agents to make it happen? You need to make the tough choices. And so to me, when I speak about being bold is about making the right choices now. So, I'll give you examples of some of the roadblocks that I went through as I've been doing transformations, most recently, as Cindi mentioned in Schneider. There are three main areas, legacy mindset and what that means is that, we've been doing this in a specific way for a long time and here is how we have been successful. What worked in the past is not going to work now. The opportunity there is that there is a lot of leaders, who have a digital mindset and they're up and coming leaders that are perhaps not yet fully developed. We need to mentor those leaders and take bets on some of these talents, including young talent. We cannot be thinking in the past and just wait for people, you know, three to five years for them to develop because the world is going in a way that is super-fast. The second area and this is specifically to implementation of AI. It's very interesting to me because just the example that I have with ThoughtSpot, right? We went on implementation and a lot of the way the IT team functions or the leaders look at technology, they look at it from the prism of the prior or success criteria for the traditional BIs, and that's not going to work. Again, the opportunity here is that you need to redefine what success look like. In my case, I want the user experience of our workforce to be the same user experience you have at home. It's a very simple concept and so we need to think about, how do we gain that user experience with these augmented analytics tools and then work backwards to have the right talent, processes, and technology to enable that. And finally and obviously with COVID, a lot of pressure in organizations and companies to do more with less. And the solution that most leaders I see are taking is to just minimize costs sometimes and cut budget. We have to do the opposite. We have to actually invest on growth areas, but do it by business question. Don't do it by function. If you actually invest in these kind of solutions, if you actually invest on developing your talent and your leadership to see more digitally, if you actually invest on fixing your data platform, it's not just an incremental cost. It's actually this investment is going to offset all those hidden costs and inefficiencies that you have on your system, because people are doing a lot of work and working very hard but it's not efficient and it's not working in the way that you might want to work. So there is a lot of opportunity there and just to put in terms of perspective, there have been some studies in the past about, you know, how do we kind of measure the impact of data? And obviously, this is going to vary by organization maturity, there's going to be a lot of factors. I've been in companies who have very clean, good data to work with and I've been with companies that we have to start basically from scratch. So it all depends on your maturity level. But in this study, what I think is interesting is they try to put a tagline or a tag price to what is the cost of incomplete data. So in this case, it's about 10 times as much to complete a unit of work when you have data that is flawed as opposed to having perfect data. So let me put that just in perspective, just as an example, right? Imagine you are trying to do something and you have to do 100 things in a project, and each time you do something, it's going to cost you a dollar. So if you have perfect data, the total cost of that project might be $100. But now let's say you have 80% perfect data and 20% flawed data. By using this assumption that flawed data is 10 times as costly as perfect data, your total costs now becomes $280 as opposed to $100. This just for you to really think about as a CIO, CTO, you know CHRO, CEO, "Are we really paying attention and really closing the gaps that we have on our data infrastructure?" If we don't do that, it's hard sometimes to see the snowball effect or to measure the overall impact, but as you can tell, the price tag goes up very, very quickly. So now, if I were to say, how do I communicate this or how do I break through some of these challenges or some of these barriers, right? I think the key is, I am in analytics, I know statistics obviously and love modeling, and, you know, data and optimization theory, and all that stuff. That's what I came to analytics, but now as a leader and as a change agent, I need to speak about value and in this case, for example, for Schneider. There was this tagline, make the most of your energy. So the number one thing that they were asking from the analytics team was actually efficiency, which to me was very interesting. But once I understood that, I understood what kind of language to use, how to connect it to the overall strategy and basically, how to bring in the right leaders because you need to, you know, focus on the leaders that you're going to make the most progress, you know. Again, low effort, high value. You need to make sure you centralize all the data as you can, you need to bring in some kind of augmented analytics, you know, solution. And finally, you need to make it super-simple for the, you know, in this case, I was working with the HR teams and other areas, so they can have access to one portal. They don't have to be confused and looking for 10 different places to find information. I think if you can actually have those four foundational pillars, obviously under the guise of having a data-driven culture, that's when you can actually make the impact. So in our case, it was about three years total transformation, but it was two years for this component of augmented analytics. It took about two years to talk to, you know, IT, get leadership support, find the budgeting, you know, get everybody on board, make sure the success criteria was correct. And we call this initiative, the people analytics portal. It was actually launched in July of this year and we were very excited and the audience was very excited to do this. In this case, we did our pilot in North America for many, many, many factors but one thing that is really important is as you bring along your audience on this, you know. You're going from Excel, you know, in some cases or Tableu to other tools like, you know, ThoughtSpot. You need to really explain them what is the difference and how this tool can truly replace some of the spreadsheets or some of the views that you might have on these other kinds of tools. Again, Tableau, I think it's a really good tool. There are other many tools that you might have in your toolkit but in my case, personally, I feel that you need to have one portal. Going back to Cindi's points, that really truly enable the end user. And I feel that this is the right solution for us, right? And I will show you some of the findings that we had in the pilot in the last two months. So this was a huge victory and I will tell you why, because it took a lot of effort for us to get to this stage and like I said, it's been years for us to kind of lay the foundation, get the leadership, initiating culture so people can understand, why you truly need to invest on augmented analytics. And so, what I'm showing here is an example of how do we use basically, you know, a tool to capturing video, the qualitative findings that we had, plus the quantitative insights that we have. So in this case, our preliminary results based on our ambition for three main metrics. Hours saved, user experience and adoption. So for hours saved, our ambition was to have 10 hours per week for employee to save on average. User experience, our ambition was 4.5 and adoption 80%. In just two months, two months and a half of the pilot, we were able to achieve five hours per week per employee savings, a user experience for 4.3 out of five and adoption of 60%. Really, really amazing work. But again, it takes a lot of collaboration for us to get to the stage from IT, legal, communications, obviously the operations things and the users. In HR safety and other areas that might be basically stakeholders in this whole process. So just to summarize, this kind of effort takes a lot of energy. You are a change agent, you need to have courage to make this decision and understand that, I feel that in this day and age with all this disruption happening, we don't have a choice. We have to take the risk, right? And in this case, I feel a lot of satisfaction in how we were able to gain all these great resource for this organization and that give me the confident to know that the work has been done and we are now in a different stage for the organization. And so for me, it's just to say, thank you for everybody who has belief, obviously in our vision, everybody who has belief in, you know, the work that we were trying to do and to make the life of our, you know, workforce or customers and community better. As you can tell, there is a lot of effort, there is a lot of collaboration that is needed to do something like this. In the end, I feel very satisfied with the accomplishments of this transformation and I just want to tell for you, if you are going right now in a moment that you feel that you have to swim upstream, you know, work with mentors, work with people in the industry that can help you out and guide you on this kind of transformation. It's not easy to do, it's high effort, but it's well worth it. And with that said, I hope you are well and it's been a pleasure talking to you. Talk to you soon. Take care. >> Thank you, Gustavo. That was amazing. All right, let's go to the panel. (light music) Now I think we can all agree how valuable it is to hear from practitioners and I want to thank the panel for sharing their knowledge with the community. Now one common challenge that I heard you all talk about was bringing your leadership and your teams along on the journey with you. We talk about this all the time and it is critical to have support from the top. Why? Because it directs the middle and then it enables bottoms up innovation effects from the cultural transformation that you guys all talked about. It seems like another common theme we heard is that you all prioritize database decision making in your organizations. And you combine two of your most valuable assets to do that and create leverage, employees on the front lines, and of course the data. Now as as you rightly pointed out, Tom, the pandemic has accelerated the need for really leaning into this. You know, the old saying, if it ain't broke, don't fix it, well COVID has broken everything and it's great to hear from our experts, you know, how to move forward, so let's get right into it. So Gustavo, let's start with you. If I'm an aspiring change agent and let's say I'm a budding data leader, what do I need to start doing? What habits do I need to create for long-lasting success? >> I think curiosity is very important. You need to be, like I said, in tune to what is happening, not only in your specific field, like I have a passion for analytics, I've been doing it for 50 years plus, but I think you need to understand wellbeing of the areas across not only a specific business. As you know, I come from, you know, Sam's Club, Walmart retail. I've been in energy management, technology. So you have to try to push yourself and basically go out of your comfort zone. I mean, if you are staying in your comfort zone and you want to just continuous improvement, that's just going to take you so far. What you have to do is, and that's what I try to do, is I try to go into areas, businesses and transformations, that make me, you know, stretch and develop as a leader. That's what I'm looking to do, so I can help transform the functions, organizations, and do the change management, the essential mindset that's required for this kind of effort. >> Well, thank you for that. That is inspiring and Cindi you love data and the data is pretty clear that diversity is a good business, but I wonder if you can, you know, add your perspectives to this conversation? >> Yeah, so Michelle has a new fan here because she has found her voice. I'm still working on finding mine and it's interesting because I was raised by my dad, a single dad, so he did teach me how to work in a predominantly male environment, but why I think diversity matters more now than ever before and this is by gender, by race, by age, by just different ways of working and thinking, is because as we automate things with AI, if we do not have diverse teams looking at the data, and the models, and how they're applied, we risk having bias at scale. So this is why I think I don't care what type of minority you are, finding your voice, having a seat at the table and just believing in the impact of your work has never been more important and as Michelle said, more possible. >> Great perspectives, thank you. Tom, I want to go to you. So, I mean, I feel like everybody in our businesses is in some way, shape, or form become a COVID expert, but what's been the impact of the pandemic on your organization's digital transformation plans? >> We've seen a massive growth, actually, in our digital business over the last 12 months really, even acceleration, right, once COVID hit. We really saw that in the 200 countries and territories that we operate in today and service our customers in today, that there's been a huge need, right, to send money to support family, to support friends, and to support loved ones across the world. And as part of that we are very honored to be able to support those customers that, across all the centers today, but as part of the acceleration, we need to make sure that we have the right architecture and the right platforms to basically scale, right? To basically support and provide the right kind of security for our customers going forward. So as part of that, we did do some pivots and we did accelerate some of our plans on digital to help support that overall growth coming in and to support our customers going forward, because during these times, during this pandemic, right, this is the most important time and we need to support those that we love and those that we care about. And doing that some of those ways is actually by sending money to them, support them financially. And that's where really our products and our services come into play that, you know, and really support those families. So, it was really a great opportunity for us to really support and really bring some of our products to the next level and supporting our business going forward. >> Awesome, thank you. Now, I want to come back to Gustavo. Tom, I'd love for you to chime in too. Did you guys ever think like you were pushing the envelope too much in doing things with data or the technology that it was just maybe too bold, maybe you felt like at some point it was failing, or you're pushing your people too hard? Can you share that experience and how you got through it? >> Yeah, the way I look at it is, you know, again, whenever I go to an organization, I ask the question, "Hey, how fast you would like to conform?" And, you know, based on the agreements on the leadership and the vision that we want to take place, I take decisions and I collaborate in a specific way. Now, in the case of COVID, for example, right, it forces us to remove silos and collaborate in a faster way. So to me, it was an opportunity to actually integrate with other areas and drive decisions faster, but make no mistake about it, when you are doing a transformation, you are obviously trying to do things faster than sometimes people are comfortable doing, and you need to be okay with that. Sometimes you need to be okay with tension or you need to be okay, you know, debating points or making repetitive business cases until people connect with the decision because you understand and you are seeing that, "Hey, the CEO is making a one, two year, you know, efficiency goal. The only way for us to really do more with less is for us to continue this path. We can not just stay with the status quo, we need to find a way to accelerate the transformation." That's the way I see it. >> How about Utah, we were talking earlier with Sudheesh and Cindi about that bungee jumping moment. What can you share? >> Yeah, you know, I think you hit upon it. Right now, the pace of change will be the slowest pace that you see for the rest of your career. So as part of that, right, this is what I tell my team, is that you need to be, you need to feel comfortable being uncomfortable. Meaning that we have to be able to basically scale, right? Expand and support the ever changing needs in the marketplace and industry and our customers today, and that pace of change that's happening, right? And what customers are asking for and the competition in the marketplace, it's only going to accelerate. So as part of that, you know, as you look at how you're operating today in your current business model, right? Things are only going to get faster. So you have to plan and to align and to drive the actual transformation, so that you can scale even faster into the future. So it's part of that, that's what we're putting in place here, right? It's how do we create that underlying framework and foundation that allows the organization to basically continue to scale and evolve into the future? >> Yeah, we're definitely out of our comfort zones, but we're getting comfortable with it. So Cindi, last question, you've worked with hundreds of organizations and I got to believe that, you know, some of the advice you gave when you were at Gartner, which was pre-COVID, maybe sometimes clients didn't always act on it. You know, not my watch or for whatever, variety of reasons, but it's being forced on them now. But knowing what you know now that, you know, we're all in this isolation economy, how would you say that advice has changed? Has it changed? What's your number one action and recommendation today? >> Yeah, well first off, Tom, just freaked me out. What do you mean, this is the slowest ever? Even six months ago I was saying the pace of change in data and analytics is frenetic. So, but I think you're right, Tom, the business and the technology together is forcing this change. Now, Dave, to answer your question, I would say the one bit of advice, maybe I was a little more very aware of the power in politics and how to bring people along in a way that they are comfortable and now I think it's, you know what, you can't get comfortable. In fact, we know that the organizations that were already in the cloud have been able to respond and pivot faster. So, if you really want to survive, as Tom and Gustavo said, get used to being uncomfortable. The power and politics are going to happen, break the rules, get used to that and be bold. Do not be afraid to tell somebody they're wrong and they're not moving fast enough. I do think you have to do that with empathy, as Michelle said and Gustavo, I think that's one of the key words today besides the bungee jumping. So I want to know where Sudheesh is going to go bungee jumping. (all chuckling) >> Guys, fantastic discussion, really. Thanks again to all the panelists and the guests, it was really a pleasure speaking with you today. Really, virtually all of the leaders that I've spoken to in theCUBE program recently, they tell me that the pandemic is accelerating so many things. Whether it's new ways to work, we heard about new security models and obviously the need for cloud. I mean, all of these things are driving true enterprise-wide digital transformation, not just as I said before, lip service. You know, sometimes we minimize the importance and the challenge of building culture and in making this transformation possible. But when it's done right, the right culture is going to deliver tournament results. You know, what does that mean? Getting it right. Everybody's trying to get it right. My biggest takeaway today is it means making data part of the DNA of your organization. And that means making it accessible to the people in your organization that are empowered to make decisions, decisions that can drive new revenue, cut costs, speed access to critical care, whatever the mission is of your organization, data can create insights and informed decisions that drive value. Okay, let's bring back Sudheesh and wrap things up. Sudheesh, please bring us home. >> Thank you, thank you, Dave. Thank you, theCUBE team, and thanks goes to all of our customers and partners who joined us, and thanks to all of you for spending the time with us. I want to do three quick things and then close it off. The first thing is I want to summarize the key takeaways that I heard from all four of our distinguished speakers. First, Michelle, I will simply put it, she said it really well. That is be brave and drive, don't go for a drive alone. That is such an important point. Often times, you know the right thing that you have to do to make the positive change that you want to see happen, but you wait for someone else to do it, not just, why not you? Why don't you be the one making that change happen? That's the thing that I picked up from Michelle's talk. Cindi talked about finding, the importance of finding your voice. Taking that chair, whether it's available or not, and making sure that your ideas, your voice is heard and if it requires some force, then apply that force. Make sure your ideas are heard. Gustavo talked about the importance of building consensus, not going at things all alone sometimes. The importance of building the quorum, and that is critical because if you want the changes to last, you want to make sure that the organization is fully behind it. Tom, instead of a single takeaway, what I was inspired by is the fact that a company that is 170 years old, 170 years old, 200 companies and 200 countries they're operating in and they were able to make the change that is necessary through this difficult time in a matter of months. If they could do it, anyone could. The second thing I want to do is to leave you with a takeaway, that is I would like you to go to ThoughtSpot.com/nfl because our team has made an app for NFL on Snowflake. I think you will find this interesting now that you are inspired and excited because of Michelle's talk. And the last thing is, please go to ThoughtSpot.com/beyond. Our global user conference is happening in this December. We would love to have you join us, it's, again, virtual, you can join from anywhere. We are expecting anywhere from five to 10,000 people and we would love to have you join and see what we've been up to since last year. We have a lot of amazing things in store for you, our customers, our partners, our collaborators, they will be coming and sharing. We'll be sharing things that we have been working to release, something that will come out next year. And also some of the crazy ideas our engineers have been cooking up. All of those things will be available for you at ThoughtSpot Beyond. Thank you, thank you so much.

Published Date : Oct 10 2020

SUMMARY :

and the change every to you by ThoughtSpot. Nice to join you virtually. Hello Sudheesh, how are you doing today? good to talk to you again. is so important to your and the last change to sort of and talk to you about being So you and I share a love of do my job without you. Great and I'm getting the feeling now, Oh that sounds good, stakeholders that you need to satisfy? and you can find the common so thank you for your leadership here. and the time to maturity at the right time to drive to drag on you for a second. to support those customers going forward. but even going back to Sam's Clubs. in the way that you might want to work. and of course the data. that's just going to take you so far. but I wonder if you can, you know, and the models, and how they're applied, everybody in our businesses and to support loved and how you got through it? and the vision that we want to take place, What can you share? and to drive the actual transformation, to believe that, you know, I do think you have to the right culture is going to and thanks to all of you for

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
TomPERSON

0.99+

Dave VellantePERSON

0.99+

SudheeshPERSON

0.99+

MichellePERSON

0.99+

CindiPERSON

0.99+

VerizonORGANIZATION

0.99+

WalmartORGANIZATION

0.99+

DavePERSON

0.99+

AustraliaLOCATION

0.99+

TD BankORGANIZATION

0.99+

EuropeLOCATION

0.99+

1987DATE

0.99+

$100QUANTITY

0.99+

Tom MazzaferroPERSON

0.99+

GustavoPERSON

0.99+

MedtronicORGANIZATION

0.99+

Cindi HausenPERSON

0.99+

$280QUANTITY

0.99+

DisneyORGANIZATION

0.99+

Universal StudiosORGANIZATION

0.99+

2018DATE

0.99+

GartnerORGANIZATION

0.99+

Wells FargoORGANIZATION

0.99+

fiveQUANTITY

0.99+

HSBCORGANIZATION

0.99+

10 timesQUANTITY

0.99+

two sidesQUANTITY

0.99+

Las VegasLOCATION

0.99+

twoQUANTITY

0.99+

Sudheesh NairPERSON

0.99+

4.3QUANTITY

0.99+

20 plus yearsQUANTITY

0.99+

last yearDATE

0.99+

KievLOCATION

0.99+

ThoughtSpotORGANIZATION

0.99+

Schneider ElectricORGANIZATION

0.99+

two sidelinesQUANTITY

0.99+

60%QUANTITY

0.99+

101st yearQUANTITY

0.99+

March 13thDATE

0.99+

80%QUANTITY

0.99+

Alabama Crimson TideORGANIZATION

0.99+

threeQUANTITY

0.99+

20%QUANTITY

0.99+

ExcelTITLE

0.99+

4.5QUANTITY

0.99+

JP Morgan ChaseORGANIZATION

0.99+

Sam's ClubORGANIZATION

0.99+

1990sDATE

0.99+

20,000QUANTITY

0.99+

32 clubsQUANTITY

0.99+

North AmericaLOCATION

0.99+

50 yearsQUANTITY

0.99+

Eileen Vidrine, US Air Force | MIT CDOIQ 2020


 

>> Announcer: From around the globe, it's theCube with digital coverage of MIT, Chief Data Officer and Information Quality Symposium brought to you by Silicon Angle Media. >> Hi, I'm Stu Miniman and this is the seventh year of theCubes coverage of the MIT, Chief Data Officer and Information Quality Symposium. We love getting to talk to these chief data officers and the people in this ecosystem, the importance of data, driving data-driven cultures, and really happy to welcome to the program, first time guests Eileen Vitrine, Eileen is the Chief Data Officer for the United States Air Force, Eileen, thank you so much for joining us. >> Thank you Stu really excited about being here today. >> All right, so the United States Air Force, I believe had it first CDO office in 2017, you were put in the CDO role in June of 2018. If you could, bring us back, give us how that was formed inside the Air force and how you came to be in that role. >> Well, Stu I like to say that we are a startup organization and a really mature organization, so it's really about culture change and it began by bringing a group of amazing citizen airman reservists back to the Air Force to bring their skills from industry and bring them into the Air Force. So, I like to say that we're a total force because we have active and reservists working with civilians on a daily basis and one of the first things we did in June was we stood up a data lab, that's based in the Jones building on Andrews Air Force Base. And there, we actually take small use cases that have enterprise focus, and we really try to dig deep to try to drive data insights, to inform senior leaders across the department on really important, what I would call enterprise focused challenges, it's pretty exciting. >> Yeah, it's been fascinating when we've dug into this ecosystem, of course while the data itself is very sensitive and I'm sure for the Air Force, there are some very highest level of security, the practices that are done as to how to leverage data, the line between public and private blurs, because you have people that have come from industry that go into government and people that are from government that have leveraged their experiences there. So, if you could give us a little bit of your background and what it is that your charter has been and what you're looking to build out, as you mentioned that culture of change. >> Well, I like to say I began my data leadership journey as an active duty soldier in the army, and I was originally a transportation officer, today we would use the title condition based maintenance, but back then, it was really about running the numbers so that I could optimize my truck fleet on the road each and every day, so that my soldiers were driving safely. Data has always been part of my leadership journey and so I like to say that one of our challenges is really to make sure that data is part of every airmans core DNA, so that they're using the right data at the right level to drive insights, whether it's tactical, operational or strategic. And so it's really about empowering each and every airman, which I think is pretty exciting. >> There's so many pieces of that data, you talk about data quality, there's obviously the data life cycle. I know your presentation that you're given here at the CDO, IQ talks about the data platform that your team has built, could you explain that? What are the key tenants and what maybe differentiates it from what other organizations might have done? >> So, when we first took the challenge to build our data lab, we really wanted to really come up. Our goal was to have a cross domain solution where we could solve data problems at the appropriate classification level. And so we built the VAULT data platform, VAULT stands for visible, accessible, understandable, linked, and trustworthy. And if you look at the DOD data strategy, they will also add the tenants of interoperability and secure. So, the first steps that we have really focused on is making data visible and accessible to airmen, to empower them, to drive insights from available data to solve their problems. So, it's really about that data empowerment, we like to use the hashtag built by airmen because it's really about each and every airman being part of the solution. And I think it's really an exciting time to be in the Air Force because any airman can solve a really hard challenge and it can very quickly wrap it up rapidly, escalate up with great velocity to senior leadership, to be an enterprise solution. >> Is there some basic training that goes on from a data standpoint? For any of those that have lived in data, oftentimes you can get lost in numbers, you have to have context, you need to understand how do I separate good from bad data, or when is data still valid? So, how does someone in the Air Force get some of that beta data competency? >> Well, we have taken a multitenant approach because each and every airman has different needs. So, we have quite a few pathfinders across the Air Force today, to help what I call, upscale our total force. And so I developed a partnership with the Air Force Institute of Technology and they now have a online graduate level data science certificate program. So, individuals studying at AFIT or remotely have the opportunity to really focus on building up their data touchpoints. Just recently, we have been working on a pathfinder to allow our data officers to get their ICCP Federal Data Sector Governance Certificate Program. So, we've been running what I would call short boot camps to prep data officers to be ready for that. And I think the one that I'm most excited about is that this year, this fall, new cadets at the U.S Air Force Academy will be able to have an undergraduate degree in data science and so it's not about a one prong approach, it's about having short courses as well as academe solutions to up skill our total force moving forward. >> Well, information absolutely is such an important differentiator(laughs) in general business and absolutely the military aspects are there. You mentioned the DOD talks about interoperability in their platform, can you speak a little bit to how you make sure that data is secure? Yet, I'm sure there's opportunities for other organizations, for there to be collaboration between them. >> Well, I like to say, that we don't fight alone. So, I work on a daily basis with my peers, Tom Cecila at the Department of Navy and Greg Garcia at the Department of Army, as well as Mr. David Berg in the DOD level. It's really important that we have an integrated approach moving forward and in the DOD we partner with our security experts, so it's not about us doing security individually, it's really about, in the Air Force we use a term called digital air force, and it's about optimizing and building a trusted partnership with our CIO colleagues, as well as our chief management colleagues because it's really about that trusted partnership to make sure that we're working collaboratively across the enterprise and whatever we do in the department, we also have to reach across our services so that we're all working together. >> Eileen, I'm curious if there's been much impact from the global pandemic. When I talk to enterprise companies, that they had to rapidly make sure that while they needed to protect data, when it was in their four walls and maybe for VPN, now everyone is accessing data, much more work from home and the like. I have to imagine some of those security measures you've already taken, but have there anything along those lines or anything else that this shift in where people are, and a little bit more dispersed has impacted your work? >> Well, the story that I like to say is, that this has given us velocity. So, prior to COVID, we built our VAULT data platform as a multitenancy platform that is also cross-domain solution, so it allows people to develop and do their problem solving in an appropriate classification level. And it allows us to connect or pushup if we need to into higher classification levels. The other thing that it has helped us really work smart because we do as much as we can in that unclassified environment and then using our cloud based solution in our gateways, it allows us to bring people in at a very scheduled component so that we maximize, or we optimize their time on site. And so I really think that it's really given us great velocity because it has really allowed people to work on the right problem set, on the right class of patient level at a specific time. And plus the other pieces, we look at what we're doing is that the problem set that we've had has really allowed people to become more data focused. I think that it's personal for folks moving forward, so it has increased understanding in terms of the need for data insights, as we move forward to drive decision making. It's not that data makes the decision, but it's using the insight to make the decision. >> And one of the interesting conversations we've been having about how to get to those data insights is the use of things like machine learning, artificial intelligence, anything you can share about, how you're looking at that journey, where you are along that discovery. >> Well, I love to say that in order to do AI and machine learning, you have to have great volumes of high quality data. And so really step one was visible, accessible data, but we in the Department of the Air Force stood up an accelerator at MIT. And so we have a group of amazing airmen that are actually working with MIT on a daily basis to solve some of those, what I would call opportunities for us to move forward. My office collaborates with them on a consistent basis, because they're doing additional use cases in that academic environment, which I'm pretty excited about because I think it gives us access to some of the smartest minds. >> All right, Eileen also I understand it's your first year doing the event. Unfortunately, we don't get, all come together in Cambridge, walking those hallways and being able to listen to some of those conversations and follow up is something we've very much enjoyed over the years. What excites you about being interact with your peers and participating in the event this year? >> Well, I really think it's about helping each other leverage the amazing lessons learned. I think that if we look collaboratively, both across industry and in the federal sector, there have been amazing lessons learned and it gives us a great forum for us to really share and leverage those lessons learned as we move forward so that we're not hitting the reboot button, but we actually are starting faster. So, it comes back to the velocity component, it all helps us go faster and at a higher quality level and I think that's really exciting. >> So, final question I have for you, we've talked for years about digital transformation, we've really said that having that data strategy and that culture of leveraging data is one of the most critical pieces of having gone through that transformation. For people that are maybe early on their journey, any advice that you'd give them, having worked through a couple of years of this and the experience you've had with your peers. >> I think that the first thing is that you have to really start with a blank slate and really look at the art of the possible. Don't think about what you've always done, think about where you want to go because there are many different paths to get there. And if you look at what the target goal is, it's really about making sure that you do that backward tracking to get to that goal. And the other piece that I tell my colleagues is celebrate the wins. My team of airmen, they are amazing, it's an honor to serve them and the reality is that they are doing great things and sometimes you want more. And it's really important to celebrate the victories because it's a very long journey and we keep moving the goalposts because we're always striving for excellence. >> Absolutely, it is always a journey that we're on, it's not about the destination. Eileen, thank you so much for sharing all that you've learned and glad you could participate. >> Thank you, STU, I appreciate being included today. Have a great day. >> Thanks and thank you for watching theCube. I'm Stu Miniman stay tuned for more from the MIT, CDO IQ event. (lively upbeat music)

Published Date : Sep 3 2020

SUMMARY :

brought to you by Silicon Angle Media. and the people in this ecosystem, Thank you Stu really All right, so the of the first things we did sure for the Air Force, at the right level to drive at the CDO, IQ talks to build our data lab, we have the opportunity to and absolutely the It's really important that we that they had to rapidly make Well, the story that I like to say is, And one of the interesting that in order to do AI and participating in the event this year? in the federal sector, is one of the most critical and really look at the art it's not about the destination. Have a great day. from the MIT, CDO IQ event.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MichaelPERSON

0.99+

EileenPERSON

0.99+

ClairePERSON

0.99+

Tom CecilaPERSON

0.99+

Lisa MartinPERSON

0.99+

David BergPERSON

0.99+

2017DATE

0.99+

Greg GarciaPERSON

0.99+

June of 2018DATE

0.99+

Jonathan RosenbergPERSON

0.99+

Michael RosePERSON

0.99+

JuneDATE

0.99+

Eileen VitrinePERSON

0.99+

BlairPERSON

0.99+

U.S Air Force AcademyORGANIZATION

0.99+

MITORGANIZATION

0.99+

WednesdayDATE

0.99+

five minutesQUANTITY

0.99+

Omni ChannelORGANIZATION

0.99+

five billionQUANTITY

0.99+

Air Force Institute of TechnologyORGANIZATION

0.99+

two yearsQUANTITY

0.99+

CambridgeLOCATION

0.99+

ThursdayDATE

0.99+

Orlando, FloridaLOCATION

0.99+

Silicon Angle MediaORGANIZATION

0.99+

United States Air ForceORGANIZATION

0.99+

Eileen VidrinePERSON

0.99+

RyanPERSON

0.99+

GoogleORGANIZATION

0.99+

BlakePERSON

0.99+

Stu MinimanPERSON

0.99+

Blair PleasantPERSON

0.99+

BC StrategiesORGANIZATION

0.99+

Department of NavyORGANIZATION

0.99+

next yearDATE

0.99+

StuPERSON

0.99+

firstQUANTITY

0.99+

ConfusionORGANIZATION

0.99+

todayDATE

0.99+

five o'ClockDATE

0.99+

YouTubeORGANIZATION

0.99+

seventh yearQUANTITY

0.99+

twenty years agoDATE

0.99+

first yearQUANTITY

0.99+

twentyQUANTITY

0.99+

decades agoDATE

0.99+

last yearDATE

0.99+

JonesLOCATION

0.98+

Andrews Air Force BaseLOCATION

0.98+

this yearDATE

0.98+

United States Air ForceORGANIZATION

0.98+

last yearDATE

0.98+

fiveQUANTITY

0.98+

five nine mugsQUANTITY

0.98+

first thingQUANTITY

0.98+

first thingQUANTITY

0.98+

first stepsQUANTITY

0.98+

this fallDATE

0.98+

DODORGANIZATION

0.98+

Enterprise ConnectORGANIZATION

0.97+

Department of the Air ForceORGANIZATION

0.97+

AFITORGANIZATION

0.97+

bothQUANTITY

0.97+

oneQUANTITY

0.97+

Department of ArmyORGANIZATION

0.97+

first timeQUANTITY

0.97+

eachQUANTITY

0.96+

CDO IQEVENT

0.96+

OneQUANTITY

0.95+

TwitterORGANIZATION

0.95+

Doug Laney, Caserta | MIT CDOIQ 2020


 

>> Announcer: From around the globe, it's theCUBE with digital coverage of MIT Chief Data Officer and Information Quality symposium brought to you by SiliconANGLE Media. >> Hi everybody. This is Dave Vellante and welcome back to theCUBE's coverage of the MIT CDOIQ 2020 event. Of course, it's gone virtual. We wish we were all together in Cambridge. They were going to move into a new building this year for years they've done this event at the Tang Center, moving into a new facility, but unfortunately going to have to wait at least a year, we'll see, But we've got a great guest. Nonetheless, Doug Laney is here. He's a Business Value Strategist, the bestselling author, an analyst, consultant then a long time CUBE friend. Doug, great to see you again. Thanks so much for coming on. >> Dave, great to be with you again as well. So can I ask you? You have been an advocate for obviously measuring the value of data, the CDO role. I don't take this the wrong way, but I feel like the last 150 days have done more to accelerate people's attention on the importance of data and the value of data than all the great work that you've done. What do you think? (laughing) >> It's always great when organizations, actually take advantage of some of these concepts of data value. You may be speaking specifically about the situation with United Airlines and American Airlines, where they have basically collateralized their customer loyalty data, their customer loyalty programs to the tunes of several billion dollars each. And one of the things that's very interesting about that is that the third party valuations of their customer loyalty data, resulted in numbers that were larger than the companies themselves. So basically the value of their data, which is as we've discussed previously off balance sheet is more valuable than the market cap of those companies themselves, which is just incredibly fascinating. >> Well, and of course, all you have to do is look to the Trillionaire's Club. And now of course, Apple pushing two trillion to really see the value that the market places on data. But the other thing is of course, COVID, everybody talks about the COVID acceleration. How have you seen it impact the awareness of the importance of data, whether it applies to business resiliency or even new monetization models? If you're not digital, you can't do business. And digital is all about data. >> I think the major challenge that most organizations are seeing from a data and analytics perspective due to COVID is that their traditional trend based forecast models are broken. If you're a company that's only forecasting based on your own historical data and not taking into consideration, or even identifying what are the leading indicators of your business, then COVID and the economic shutdown have entirely broken those models. So it's raised the awareness of companies to say, "Hey, how can we predict our business now? We can't do it based on our own historical data. We need to look externally at what are those external, maybe global indicators or other kinds of markets that proceed our own forecasts or our own activity." And so the conversion from trend based forecast models to what we call driver based forecast models, isn't easy for a lot of organizations to do. And one of the more difficult parts is identifying what are those external data factors from suppliers, from customers, from partners, from competitors, from complimentary products and services that are leading indicators of your business. And then recasting those models and executing on them. >> And that's a great point. If you think about COVID and how it's changed things, everything's changed, right? The ideal customer profile has changed, your value proposition to those customers has completely changed. You got to rethink that. And of course, it's very hard to predict even when this thing eventually comes back, some kind of hybrid mode, you used to be selling to people in an office environment. That's obviously changed. There's a lot that's permanent there. And data is potentially at least the forward indicator, the canary in the coal mine. >> Right. It also is the product and service. So not only can it help you and improve your forecasting models, but it can become a product or service that you're offering. Look at us right now, we would generally be face to face and person to person, but we're using video technology to transfer this content. And then one of the things that I... It took me awhile to realize, but a couple of months after the COVID shutdown, it occurred to me that even as a consulting organization, Caserta focuses on North America. But the reality is that every consultancy is now a global consultancy because we're all doing business remotely. There are no particular or real strong localization issues for doing consulting today. >> So we talked a lot over the years about the role of the CDO, how it's evolved, how it's changed the course of the early... The pre-title days it was coming out of a data quality world. And it's still vital. Of course, as we heard today from the Keynote, it's much more public, much more exposed, different public data sources, but the role has certainly evolved initially into regulated industries like financial, healthcare and government, but now, many, many more organizations have a CDO. My understanding is that you're giving a talk in the business case for the CDO. Help us understand that. >> Yeah. So one of the things that we've been doing here for the last couple of years is a running an ongoing study of how organizations are impacted by the role of the CDO. And really it's more of a correlation and looking at what are some of the qualities of organizations that have a CDO or don't have a CDO. So some of the things we found is that organizations with a CDO nearly twice as often, mention the importance of data and analytics in their annual report organizations with a C level CDO, meaning a true executive are four times more often likely to be using data, to transform the business. And when we're talking about using data and advanced analytics, we found that organizations with a CIO, not a CDO responsible for their data assets are only half as likely to be doing advanced analytics in any way. So there are a number of interesting things that we found about companies that have a CDO and how they operate a bit differently. >> I want to ask you about that. You mentioned the CIO and we're increasingly seeing lines of reporting and peer reporting alter shift. The sands are shifting a little bit. In the early days the CDO and still predominantly I think is an independent organization. We've seen a few cases and increasingly number where they're reporting into the CIO, we've seen the same thing by the way with the chief Information Security Officer, which used to be considered the fox watching the hen house. So we're seeing those shifts. We've also seen the CDO become more aligned with a technical role and sometimes even emerging out of that technical role. >> Yeah. I think the... I don't know, what I've seen more is that the CDOs are emerging from the business, companies are realizing that data is a business asset. It's not an IT asset. There was a time when data was tightly coupled with applications of technologies, but today data is very easily decoupled from those applications and usable in a wider variety of contexts. And for that reason, as data gets recognized as a business, not an IT asset, you want somebody from the business responsible for overseeing that asset. Yes, a lot of CDOs still report to the CIO, but increasingly more CDOs you're seeing and I think you'll see some other surveys from other organizations this week where the CDOs are more frequently reporting up to the CEO level, meaning they're true executives. Along I advocated for the bifurcation of the IT organization into separate I and T organizations. Again, there's no reason other than for historical purposes to keep the data and technology sides of the organizations so intertwined. >> Well, it makes sense that the Chief Data Officer would have an affinity with the lines of business. And you're seeing a lot of organizations, really trying to streamline their data pipeline, their data life cycles, bringing that together, infuse intelligence into that, but also take a systems view and really have the business be intimately involved, if not even owned into the data. You see a lot of emphasis on self-serve, what are you seeing in terms of that data pipeline or the data life cycle, if you will, that used to be wonky, hard core techies, but now it really involving a lot more constituent. >> Yeah. Well, the data life cycle used to be somewhat short. The data life cycles, they're longer and they're more a data networks than a life cycle and or a supply chain. And the reason is that companies are finding alternative uses for their data, not just using it for a single operational purpose or perhaps reporting purpose, but finding that there are new value streams that can be generated from data. There are value streams that can be generated internally. There are a variety of value streams that can be generated externally. So we work with companies to identify what are those variety of value streams? And then test their feasibility, are they ethically feasible? Are they legally feasible? Are they economically feasible? Can they scale? Do you have the technology capabilities? And so we'll run through a process of assessing the ideas that are generated. But the bottom line is that companies are realizing that data is an asset. It needs to be not just measured as one and managed as one, but also monetized as an asset. And as we've talked about previously, data has these unique qualities that it can be used over and over again, and it generate more data when you use it. And it can be used simultaneously for multiple purposes. So companies like, you mentioned, Apple and others have built business models, based on these unique qualities of data. But I think it's really incumbent upon any organization today to do so as well. >> But when you observed those companies that we talk about all the time, data is at the center of their organization. They maybe put people around that data. That's got to be one of the challenge for many of the incumbents is if we talked about the data silos, the different standards, different data quality, that's got to be fairly major blocker for people becoming a "Data-driven organization." >> It is because some organizations were developed as people driven product, driven brand driven, or other things to try to convert. To becoming data-driven, takes a high degree of data literacy or fluency. And I think there'll be a lot of talk about that this week. I'll certainly mention it as well. And so getting the organization to become data fluent and appreciate data as an asset and understand its possibilities and the art of the possible with data, it's a long road. So the culture change that goes along with it is really difficult. And so we're working with 150 year old consumer brand right now that wants to become more data-driven and they're very product driven. And we hear the CIO say, "We want people to understand that we're a data company that just happens to produce this product. We're not a product company that generates data." And once we realized that and started behaving in that fashion, then we'll be able to really win and thrive in our marketplace. >> So one of the key roles of a Chief Data Officers to understand how data affects the monetization of an organization. Obviously there are four profit companies of your healthcare organization saving lives, obviously being profitable as well, or at least staying within the budget, depending upon the structure of the organization. But a lot of people I think oftentimes misunderstand that it's like, "Okay, do I have to become a data broker? Am I selling data directly?" But I think, you pointed out many times and you just did that unlike oil, that's why we don't like that data as a new oil analogy, because it's so much more valuable and can be use, it doesn't fall because of its scarcity. But what are you finding just in terms of people's application of that notion of monetization? Cutting costs, increasing revenue, what are you seeing in the field? What's that spectrum look like? >> So one of the things I've done over the years is compile a library of hundreds and hundreds of examples of how organizations are using data and analytics in innovative ways. And I have a book in process that hopefully will be out this fall. I'm sharing a number of those inspirational examples. So that's the thing that organizations need to understand is that there are a variety of great examples out there, and they shouldn't just necessarily look to their own industry. There are inspirational examples from other industries as well, many clients come to me and they ask, "What are others in my industry doing?" And my flippant response to that is, "Why do you want to be in second place or third place? Why not take an idea from another industry, perhaps a digital product company and apply that to your own business." But like you mentioned, there are a variety of ways to monetize data. It doesn't involve necessarily selling it. You can deliver analytics, you can report on it, you can use it internally to generate improved business process performance. And as long as you're measuring how data's being applied and what its impact is, then you're in a position to claim that you're monetizing it. But if you're not measuring the impact of data on business processes or on customer relationships or partner supplier relationships or anything else, then it's difficult to claim that you're monetizing it. But one of the more interesting ways that we've been working with organizations to monetize their data, certainly in light of GDPR and the California consumer privacy act where I can't sell you my data anymore, but we've identified ways to monetize your customer data in a couple of ways. One is to synthesize the data, create synthetic data sets that retain the original statistical anomalies in the data or features of the data, but don't share actually any PII. But another interesting way that we've been working with organizations to monetize their data is what I call, Inverted data monetization, where again, I can't share my customer data with you, but I can share information about your products and services with my customers. And take a referral fee or a commission, based on that. So let's say I'm a hospital and I can't sell you my patient data, of course, due to variety of regulations, but I know who my diabetes patients are, and I can introduce them to your healthy meal plans, to your gym memberships, to your at home glucose monitoring kits. And again, take a referral fee or a cut of that action. So we're working with customers and the financial services firm industry and in the healthcare industry on just those kinds of examples. So we've identified hundreds of millions of dollars of incremental value for organizations that from their data that we're just sitting on. >> Interesting. Doug because you're a business value strategist at the top, where in the S curve do you see you're able to have the biggest impact. I doubt that you enter organizations where you say, "Oh, they've got it all figured out. They can't use my advice." But as well, sometimes in the early stages, you may not be able to have as big of an impact because there's not top down support or whatever, there's too much technical data, et cetera, where are you finding you can have the biggest impact, Doug? >> Generally we don't come in and run those kinds of data monetization or information innovation exercises, unless there's some degree of executive support. I've never done that at a lower level, but certainly there are lower level more immediate and vocational opportunities for data to deliver value through, to simply analytics. One of the simple examples I give is, I sold a home recently and when you put your house on the market, everybody comes out of the woodwork, the fly by night, mortgage companies, the moving companies, the box companies, the painters, the landscapers, all know you're moving because your data is in the U.S. and the MLS directory. And it was interesting. The only company that didn't reach out to me was my own bank, and so they lost the opportunity to introduce me to a Mortgage they'd retain me as a client, introduce me to my new branch, print me new checks, move the stuff in my safe deposit box, all of that. They missed a simple opportunity. And I'm thinking, this doesn't require rocket science to figure out which of your customers are moving, the MLS database or you can harvest it from Zillow or other sites is basically public domain data. And I was just thinking, how stupid simple would it have been for them to hire a high school programmer, give him a can of red bull and say, "Listen match our customer database to the MLS database to let us know who's moving on a daily or weekly basis." Some of these solutions are pretty simple. >> So is that part of what you do, come in with just hardcore tactical ideas like that? Are you also doing strategy? Tell me more about how you're spending your time. >> I trying to think more of a broader approach where we look at the data itself and again, people have said, "If you tortured enough, what would you tell us? We're just take that angle." We look at examples of how other organizations have monetized data and think about how to apply those and adapt those ideas to the company's own business. We look at key business drivers, internally and externally. We look at edge cases for their customers' businesses. We run through hypothesis generating activities. There are a variety of different kinds of activities that we do to generate ideas. And most of the time when we run these workshops, which last a week or two, we'll end up generating anywhere from 35 to 50 pretty solid ideas for generating new value streams from data. So when we talk about monetizing data, that's what we mean, generating new value streams. But like I said, then the next step is to go through that feasibility assessment and determining which of these ideas you actually want to pursue. >> So you're of course the longtime industry watcher as well, as a former Gartner Analyst, you have to be. My question is, if I think back... I've been around a while. If I think back at the peak of Microsoft's prominence in the PC era, it was like windows 95 and you felt like, "Wow, Microsoft is just so strong." And then of course the Linux comes along and a lot of open source changes and low and behold, a whole new set of leaders emerges. And you see the same thing today with the Trillionaire's Club and you feel like, "Wow, even COVID has been a tailwind for them." But you think about, "Okay, where could the disruption come to these large players that own huge clouds, they have all the data." Is data potentially a disruptor for what appear to be insurmountable odds against the newbies" >> There's always people coming up with new ways to leverage data or new sources of data to capture. So yeah, there's certainly not going to be around for forever, but it's been really fascinating to see the transformation of some companies I think nobody really exemplifies it more than IBM where they emerged from originally selling meat slicers. The Dayton Meat Slicer was their original product. And then they evolved into Manual Business Machines and then Electronic Business Machines. And then they dominated that. Then they dominated the mainframe software industry. Then they dominated the PC industry. Then they dominated the services industry to some degree. And so they're starting to get into data. And I think following that trajectory is something that really any organization should be looking at. When do you actually become a data company? Not just a product company or a service company or top. >> We have Inderpal Bhandari is one of our huge guests here. He's a Chief-- >> Sure. >> Data Officer of IBM, you know him well. And he talks about the journey that he's undertaken to transform the company into a data company. I think a lot of people don't really realize what's actually going on behind the scenes, whether it's financially oriented or revenue opportunities. But one of the things he stressed to me in our interview was that they're on average, they're reducing the end to end cycle time from raw data to insights by 70%, that's on average. And that's just an enormous, for a company that size, it's just enormous cost savings or revenue generating opportunity. >> There's no doubt that the technology behind data pipelines is improving and the process from moving data from those pipelines directly into predictive or diagnostic or prescriptive output is a lot more accelerated than the early days of data warehousing. >> Is the skills barrier is acute? It seems like it's lessened somewhat, the early Hadoop days you needed... Even data scientist... Is it still just a massive skill shortage, or we're starting to attack that. >> Well, I think companies are figuring out a way around the skill shortage by doing things like self service analytics and focusing on more easy to use mainstream type AI or advanced analytics technologies. But there's still very much a need for data scientists and organizations and the difficulty in finding people that are true data scientists. There's no real certification. And so really anybody can call themselves a data scientist but I think companies are getting good at interviewing and determining whether somebody's got the goods or not. But there are other types of skills that we don't really focus on, like the data engineering skills, there's still a huge need for data engineering. Data doesn't self-organize. There are some augmented analytics technologies that will automatically generate analytic output, but there really aren't technologies that automatically self-organize data. And so there's a huge need for data engineers. And then as we talked about, there's a large interest in external data and harvesting that and then ingesting it and even identifying what external data is out there. So one of the emerging roles that we're seeing, if not the sexiest role of the 21st century is the role of the Data Curator, somebody who acts as a librarian, identifying external data assets that are potentially valuable, testing them, evaluating them, negotiating and then figuring out how to ingest that data. So I think that's a really important role for an organization to have. Most companies have an entire department that procures office supplies, but they don't have anybody who's procuring data supplies. And when you think about which is more valuable to an organization? How do you not have somebody who's dedicated to identifying the world of external data assets that are out there? There are 10 million data sets published by government, organizations and NGOs. There are thousands and thousands of data brokers aggregating and sharing data. There's a web content that can be harvested, there's data from your partners and suppliers, there's data from social media. So to not have somebody who's on top of all that it demonstrates gross negligence by the organization. >> That is such an enlightening point, Doug. My last question is, I wonder how... If you can share with us how the pandemic has effected your business personally. As a consultant, you're on the road a lot, obviously not on the road so much, you're doing a lot of chalk talks, et cetera. How have you managed through this and how have you been able to maintain your efficacy with your clients? >> Most of our clients, given that they're in the digital world a bit already, made the switch pretty quick. Some of them took a month or two, some things went on hold but we're still seeing the same level of enthusiasm for data and doing things with data. In fact some companies have taken our (mumbles) that data to be their best defense in a crisis like this. It's affected our business and it's enabled us to do much more international work more easily than we used to. And I probably spend a lot less time on planes. So it gives me more time for writing and speaking and actually doing consulting. So that's been nice as well. >> Yeah, there's that bonus. Obviously theCUBE yes, we're not doing physical events anymore, but hey, we've got two studios operating. And Doug Laney, really appreciate you coming on. (Dough mumbles) Always a great guest and sharing your insights and have a great MIT CDOIQ. >> Thanks, you too, Dave, take care. (mumbles) >> Thanks Doug. All right. And thank you everybody for watching. This is Dave Vellante for theCUBE, our continuous coverage of the MIT Chief Data Officer conference, MIT CDOIQ, will be right back, right after this short break. (bright music)

Published Date : Sep 3 2020

SUMMARY :

symposium brought to you Doug, great to see you again. and the value of data And one of the things of the importance of data, And one of the more difficult the canary in the coal mine. But the reality is that every consultancy a talk in the business case for the CDO. So some of the things we found is that In the early days the CDO is that the CDOs are that data pipeline or the data life cycle, of assessing the ideas that are generated. for many of the incumbents and the art of the possible with data, of the organization. and apply that to your own business." I doubt that you enter organizations and the MLS directory. So is that part of what you do, And most of the time when of Microsoft's prominence in the PC era, the services industry to some degree. is one of our huge guests here. But one of the things he stressed to me is improving and the process the early Hadoop days you needed... and the difficulty in finding people and how have you been able to maintain our (mumbles) that data to be and sharing your insights Thanks, you too, Dave, take care. of the MIT Chief Data Officer conference,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Dave VellantePERSON

0.99+

Doug LaneyPERSON

0.99+

United AirlinesORGANIZATION

0.99+

American AirlinesORGANIZATION

0.99+

AppleORGANIZATION

0.99+

IBMORGANIZATION

0.99+

DougPERSON

0.99+

thousandsQUANTITY

0.99+

hundredsQUANTITY

0.99+

CambridgeLOCATION

0.99+

21st centuryDATE

0.99+

10 millionQUANTITY

0.99+

MicrosoftORGANIZATION

0.99+

70%QUANTITY

0.99+

Inderpal BhandariPERSON

0.99+

two trillionQUANTITY

0.99+

windows 95TITLE

0.99+

North AmericaLOCATION

0.99+

oneQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

U.S.LOCATION

0.99+

a monthQUANTITY

0.99+

35QUANTITY

0.99+

twoQUANTITY

0.99+

third placeQUANTITY

0.99+

OneQUANTITY

0.99+

MLSORGANIZATION

0.98+

two studiosQUANTITY

0.98+

MIT CDOIQ 2020EVENT

0.98+

Trillionaire's ClubORGANIZATION

0.98+

todayDATE

0.98+

this weekDATE

0.98+

Tang CenterLOCATION

0.98+

California consumer privacy actTITLE

0.97+

second placeQUANTITY

0.97+

LinuxTITLE

0.97+

COVIDEVENT

0.97+

GartnerORGANIZATION

0.97+

ZillowORGANIZATION

0.97+

50QUANTITY

0.97+

GDPRTITLE

0.97+

CUBEORGANIZATION

0.97+

this yearDATE

0.97+

MIT Chief Data OfficerEVENT

0.96+

theCUBEORGANIZATION

0.95+

a weekQUANTITY

0.94+

singleQUANTITY

0.94+

CasertaORGANIZATION

0.93+

four timesQUANTITY

0.92+

COVIDOTHER

0.92+

pandemicEVENT

0.92+

2020DATE

0.91+

hundreds of millions of dollarsQUANTITY

0.86+

150 year oldQUANTITY

0.86+

this fallDATE

0.85+

MIT CDOIQEVENT

0.85+

last couple of yearsDATE

0.84+

four profit companiesQUANTITY

0.84+

COVIDORGANIZATION

0.82+

DoughPERSON

0.78+

KeynoteEVENT

0.77+

Krishna Cheriath, Bristol Myers Squibb | MITCDOIQ 2020


 

>> From the Cube Studios in Palo Alto in Boston, connecting with thought leaders all around the world, this is a Cube Conversation. >> Hi everyone, this is Dave Vellante and welcome back to the Cube's coverage of the MIT CDOIQ. God, we've been covering this show since probably 2013, really trying to understand the intersection of data and organizations and data quality and how that's evolved over time. And with me to discuss these issues is Krishna Cheriath, who's the Vice President and Chief Data Officer, Bristol-Myers Squibb. Krishna, great to see you, thanks so much for coming on. >> Thank you so much Dave for the invite, I'm looking forward to it. >> Yeah first of all, how are things in your part of the world? You're in New Jersey, I'm also on the East coast, how you guys making out? >> Yeah, I think these are unprecedented times all around the globe and whether it is from a company perspective or a personal standpoint, it is how do you manage your life, how do you manage your work in these unprecedented COVID-19 times has been a very interesting challenge. And to me, what is most amazing has been, I've seen humanity rise up and so to our company has sort of snap to be able to manage our work so that the important medicines that have to be delivered to our patients are delivered on time. So really proud about how we have done as a company and of course, personally, it has been an interesting journey with my kids from college, remote learning, wife working from home. So I'm very lucky and blessed to be safe and healthy at this time. So hopefully the people listening to this conversation are finding that they are able to manage through their lives as well. >> Obviously Bristol-Myers Squibb, very, very strong business. You guys just recently announced your quarter. There's a biologics facility near me in Devon's, Massachusetts, I drive by it all the time, it's a beautiful facility actually. But extremely broad portfolio, obviously some COVID impact, but you're managing through that very, very well, if I understand it correctly, you're taking a collaborative approach to a COVID vaccine, you're now bringing people physically back to work, you've been very planful about that. My question is from your standpoint, what role did you play in that whole COVID response and what role did data play? >> Yeah, I think it's a two part as you rightly pointed out, the Bristol-Myers Squibb, we have been an active partner on the the overall scientific ecosystem supporting many different targets that is, from many different companies I think. Across biopharmaceuticals, there's been a healthy convergence of scientific innovation to see how can we solve this together. And Bristol-Myers Squibb have been an active participant as our CEO, as well as our Chief Medical Officer and Head of Research have articulated publicly. Within the company itself, from a data and technology standpoint, data and digital is core to the response from a company standpoint to the COVID-19, how do we ensure that our work continues when the entire global workforce pivots to a kind of a remote setting. So that really calls on the digital infrastructure to rise to the challenge, to enable a complete global workforce. And I mean workforce, it is not just employees of the company but the all of the third-party partners and others that we work with, the whole ecosystem needs to work. And I think our digital infrastructure has proven to be extremely resilient than that. From a data perspective, I think it is twofold. One is how does the core book of business of data continue to drive forward to make sure that our companies key priorities are being advanced. Secondarily, we've been partnering with a research and development organization as well as medical organization to look at what kind of real world data insights can really help in answering the many questions around COVID-19. So I think it is twofold. Main summary; one is, how do we ensure that the data and digital infrastructure of the company continues to operate in a way that allows us to progress the company's mission even during a time when globally, we have been switched to a remote working force, except for some essential staff from lab and manufacturing standpoint. And secondarily is how do we look at the real-world evidence as well as the scientific data to be a good partner with other companies to look at progressing the societal innovations needed for this. >> I think it's a really prudent approach because let's face it, sometimes one shot all vaccine can be like playing roulette. So you guys are both managing your risk and just as I say, financially, a very, very successful company in a sound approach. I want to ask you about your organization. We've interviewed many, many Chief Data Officers over the years, and there seems to be some fuzziness as to the organizational structure. It's very clear with you, you report in to the CIO, you came out of a technical bag, you have a technical degree but you also of course have a business degree. So you're dangerous from that standpoint. You got both sides which is critical, I would think in your role, but let's start with the organizational reporting structure. How did that come about and what are the benefits of reporting into the CIO? >> I think the Genesis for that as Bristol-Myers Squibb and when I say Bristol-Myers Squibb, the new Bristol-Myers Squibb is a combination of Heritage Bristol-Myers Squibb and Heritage Celgene after the Celgene acquisition last November. So in the Heritage Bristol-Myers Squibb acquisition, we came to a conclusion that in order for BMS to be able to fully capitalize on our scientific innovation potential as well as to drive data-driven decisions across the company, having a robust data agenda is key. Now the question is, how do you progress that? Historically, we had approached a very decentralized mechanism that made a different data constituencies. We didn't have a formal role of a Chief Data Officer up until 2018 or so. So coming from that realization that we need to have an effective data agenda to drive forward the necessary data-driven innovations from an analytic standpoint. And equally importantly, from optimizing our execution, we came to conclusion that we need an enterprise-level data organization, we need to have a first among equals if you will, to be mandated by the CEO, his leadership team, to be the kind of an orchestrator of a data agenda for the company, because data agenda cannot be done individually by a singular CDO. It has to be done in partnership with many stakeholders, business, technology, analytics, et cetera. So from that came this notion that we need an enterprise-wide data organization. So we started there. So for awhile, I would joke around that I had all of the accountabilities of the CDO without the lofty title. So this journey started around 2016, where we create an enterprise-wide data organization. And we made a very conscious choice of separating the data organization from analytics. And the reason we did that is when we look at the bowl of Bristol-Myers Squibb, analytics for example, is core and part of our scientific discovery process, research, our clinical development, all of them have deep data science and analytic embedded in it. But we also have other analytics whether it is part of our sales and marketing, whether it is part of our finance and our enabling functions they catch all across global procurement et cetera. So the world of analytics is very broad. BMS did a separation between the world of analytics and from the world of data. Analytics at BMS is in two modes. There is a central analytics organization called Business Insights and Analytics that drive most of the enterprise-level analytics. But then we have embedded analytics in our business areas, which is research and development, manufacturing and supply chain, et cetera, to drive what needs to be closer to the business idea. And the reason for separating that out and having a separate data organization is that none of these analytic aspirations or the business aspirations from data will be met if the world of data is, you don't have the right level of data available, the velocity of data is not appropriate for the use cases, the quality of data is not great or the control of the data. So that we are using the data for the right intent, meeting the compliance and regulatory expectations around the data is met. So that's why we separated out that data world from the analytics world, which is a little bit of a unique construct for us compared to what we see generally in the world of CDOs. And from that standpoint, then the decision was taken to make that report for global CIO. At Bristol-Myers Squibb, they have a very strong CIO organization and IT organization. When I say strong, it is from this lens standpoint. A, it is centralized, we have centralized the budget as well as we have centralized the execution across the enterprise. And the CDO reporting to the CIO with that data-specific agenda, has a lot of value in being able to connect the world of data with the world of technology. So at BMS, their Chief Data Officer organization is a combination of traditional CDO-type accountabilities like data risk management, data governance, data stewardship, but also all of the related technologies around master data management, data lake, data and analytic engineering and a nascent AI data and technology lab. So that construct allows us to be a true enterprise horizontal, supporting analytics, whether it is done in a central analytics organization or embedded analytics teams in the business area, but also equally importantly, focus on the world of data from operational execution standpoint, how do we optimize data to drive operational effectiveness? So that's the construct that we have where CDO reports to the CIO, data organization separated from analytics to really focus around the availability but also the quality and control of data. And the last nuance that is that at BMS, the Chief Data Officer organization is also accountable to be the Data Protection Office. So we orchestrate and facilitate all privacy-related actions across because that allows us to make sure that all personal data that is collected, managed and consumed, meets all of the various privacy standards across the world, as well as our own commitments as a company from across from compliance principles standpoint. >> So that makes a lot of sense to me and thank you for that description. You're not getting in the way of R&D and the scientists, they know data science, they don't need really your help. I mean, they need to innovate at their own pace, but the balance of the business really does need your innovation, and that's really where it seems like you're focused. You mentioned master data management, data lakes, data engineering, et cetera. So your responsibility is for that enterprise data lifecycle to support the business side of things, and I wonder if you could talk a little bit about that and how that's evolved. I mean a lot has changed from the old days of data warehouse and cumbersome ETL and you mentioned, as you say data lakes, many of those have been challenging, expensive, slow, but now we're entering this era of cloud, real-time, a lot of machine intelligence, and I wonder if you could talk about the changes there and how you're looking at and thinking about the data lifecycle and accelerating the time to insights. >> Yeah, I think the way we think about it, we as an organization in our strategy and tactics, think of this as a data supply chain. The supply chain of data to drive business value whether it is through insights and analytics or through operation execution. When you think about it from that standpoint, then we need to get many elements of that into an effective stage. This could be the technologies that is part of that data supply chain, you reference some of them, the master data management platforms, data lake platforms, the analytics and reporting capabilities and business intelligence capabilities that plug into a data backbone, which is that I would say the technology, swim lane that needs to get right. Along with that, what we also need to get right for that effective data supply chain is that data layer. That is, how do you make sure that there is the right data navigation capability, probably you make sure that we have the right ontology mapping and the understanding around the data. How do we have data navigation? It is something that we have invested very heavily in. So imagine a new employee joining BMS, any organization our size has a pretty wide technology ecosystem and data ecosystem. How do you navigate that, how do we find the data? Data discovery has been a key focus for us. So for an effective data supply chain, then we knew that and we have instituted our roadmap to make sure that we have a robust technology orchestration of it, but equally important is an effective data operations orchestration. Both needs to go hand in hand for us to be able to make sure that that supply chain is effective from a business use case and analytic use standpoint. So that has led us on a journey from a cloud perspective, since you refer that in your question, is we have invested very heavily to move from very disparate set of data ecosystems to a more converse cloud-based data backbone. That has been a big focus at the BMS since 2016, whether it is from a research and development standpoint or from commercialization, it is our word for the sales and marketing or manufacturing and supply chain and HR, et cetera. How do we create a converged data backbone that allows us to use that data as a resource to drive many different consumption patterns? Because when you imagine an enterprise of our size, we have many different consumers of the data. So those consumers have different consumption needs. You have deep data science population who just needs access to the data and they have data science platforms but they are at once programmers as well, to the other end of the spectrum where executives need pre-packaged KPIs. So the effective orchestration of the data ecosystem at BMS through a data supply chain and the data backbone, there's a couple of things for us. One, it drives productivity of our data consumers, the scientific researchers, analytic community or other operational staff. And second, in a world where we need to make sure that the data consumption appalls ethical standards as well as privacy and other regulatory expectations, we are able to build it into our system and process the necessary controls to make sure that the consumption and the use of data meets our highest trust advancements standards. >> That makes a lot of sense. I mean, converging your data like that, people always talk about stove pipes. I know it's kind of a bromide but it's true, and allows you to sort of inject consistent policies. What about automation? How has that affected your data pipeline recently and on your journey with things like data classification and the like? >> I think in pursuing a broad data automation journey, one of the things that we did was to operate at two different speed points. In a historically, the data organizations have been bundled with long-running data infrastructure programs. By the time you complete them, their business context have moved on and the organization leaders are also exhausted from having to wait from these massive programs to reach its full potential. So what we did very intentionally from our data automation journey is to organize ourselves in two speed dimensions. First, a concept called Rapid Data Lab. The idea is that recognizing the reality that the data is not well automated and orchestrated today, we need a SWAT team of data engineers, data SMEs to partner with consumers of data to make sure that we can make effective data supply chain decisions here and now, and enable the business to answer questions of today. Simultaneously in a longer time horizon, we need to do the necessary work of moving the data automation to a better footprint. So enterprise data lake investments, where we built services based on, we had chosen AWS as the cloud backbone for data. So how do we use the AWS services? How do we wrap around it with the necessary capabilities so that we have a consistent reference and technical architecture to drive the many different function journeys? So we organized ourselves into speed dimensions; the Rapid Data Lab teams focus around partnering with the consumers of data to help them with data automation needs here and now, and then a secondary team focused around the convergence of data into a better cloud-based data backbone. So that allowed us to one, make an impact here and now and deliver value from data to the dismiss here and now. Secondly, we also learned a lot from actually partnering with consumers of data on what needs to get adjusted over a period of time in our automation journey. >> It makes sense, I mean again, that whole notion of converged data, putting data at the core of your business, you brought up AWS, I wonder if I could ask you a question. You don't have to comment on specific vendors, but there's a conversation we have in our community. You have AWS huge platform, tons of partners, a lot of innovation going on and you see innovation in areas like the cloud data warehouse or data science tooling, et cetera, all components of that data pipeline. As well, you have AWS with its own tooling around there. So a question we often have in the community is will technologists and technology buyers go for kind of best of breed and cobble together different services or would they prefer to have sort of the convenience of a bundled service from an AWS or a Microsoft or Google, or maybe they even go best of breeds for all cloud. Can you comment on that, what's your thinking? >> I think, especially for organizations, our size and breadth, having a converged to convenient, all of the above from a single provider does not seem practical and feasible, because a couple of reasons. One, the heterogeneity of the data, the heterogeneity of consumption of the data and we are yet to find a single stack provider who can meet all of the different needs. So I am more in the best of breed camp with a few caveats, a hybrid best of breed, if you will. It is important to have a converged the data backbone for the enterprise. And so whether you invest in a singular cloud or private cloud or a combination, you need to have a clear intention strategy around where are you going to host the data and how is the data is going to be organized. But you could have a lot more flexibility in the consumption of data. So once you have the data converged into, in our case, we converged on AWS-based backbone. We allow many different consumptions of the data, because I think the analytic and insights layer, data science community within R&D is different from a data science community in the supply chain context, we have business intelligence needs, we have a catered needs and then there are other data needs that needs to be funneled into software as service platforms like the sales forces of the world, to be able to drive operational execution as well. So when you look at it from that context, having a hybrid model of best of breed, whether you have a lot more convergence from a data backbone standpoint, but then allow for best of breed from an analytic and consumption of data is more where my heart and my brain is. >> I know a lot of companies would be excited to hear that answer, but I love it because it fosters competition and innovation. I wish I could talk for you forever, but you made me think of another question which is around self-serve. On your journey, are you at the point where you can deliver self-serve to the lines of business? Is that something that you're trying to get to? >> Yeah, I think it does. The self-serve is an absolutely important point because I think the traditional boundaries of what you consider the classical IT versus a classical business is great. I think there is an important gray area in the middle where you have a deep citizen data scientist in the business community who really needs to be able to have access to the data and I have advanced data science and programming skills. So self-serve is important but in that, companies need to be very intentional and very conscious of making sure that you're allowing that self-serve in a safe containment sock. Because at the end of the day, whether it is a cyber risk or data risk or technology risk, it's all real. So we need to have a balanced approach between promoting whether you call it data democratization or whether you call it self-serve, but you need to balance that with making sure that you're meeting the right risk mitigation strategy standpoint. So that's how then our focus is to say, how do we promote self-serve for the communities that they need self-serve, where they have deeper levels of access? How do we set up the right safe zones for those which may be the appropriate mitigation from a cyber risk or data risk or technology risk. >> Security pieces, again, you keep bringing up topics that I could talk to you forever on, but I heard on TV the other night, I heard somebody talking about how COVID has affected, because of remote access, affected security. And it's like hey, give everybody access. That was sort of the initial knee-jerk response, but the example they gave as well, if your parents go out of town and the kid has a party, you may have some people show up that you don't want to show up. And so, same issue with remote working, work from home. Clearly you guys have had to pivot to support that, but where does the security organization fit? Does that report separate alongside the CIO? Does it report into the CIO? Are they sort of peers of yours, how does that all work? >> Yeah, I think at Bristol-Myers Squibb, we have a Chief Information Security Officer who is a peer of mine, who also reports to the global CIO. The CDO and the CSO are effective partners and are two sides of the coin and trying to advance a total risk mitigation strategy, whether it is from a cyber risk standpoint, which is the focus of the Chief Information Security Officer and whether it is the general data consumption risk. And that is the focus from a Chief Data Officer in the capacities that I have. And together, those are two sides of a coin that the CIO needs to be accountable for. So I think that's how we have orchestrated it, because I think it is important in these worlds where you want to be able to drive data-driven innovation but you want to be able to do that in a way that doesn't open the company to unwanted risk exposures as well. And that is always a delicate balancing act, because if you index too much on risk and then high levels of security and control, then you could lose productivity. But if you index too much on productivity, collaboration and open access and data, it opens up the company for risks. So it is a delicate balance within the two. >> Increasingly, we're seeing that reporting structure evolve and coalesce, I think it makes a lot of sense. I felt like at some point you had too many seats at the executive leadership table, too many kind of competing agendas. And now your structure, the CIO is obviously a very important position. I'm sure has a seat at the leadership table, but also has the responsibility for managing that sort of data as an asset versus a liability which my view, has always been sort of the role of the Head of Information. I want to ask you, I want to hit the Escape key a little bit and ask you about data as a resource. You hear a lot of people talk about data is the new oil. We often say data is more valuable than oil because you can use it, it doesn't follow the laws of scarcity. You could use data in infinite number of places. You can only put oil in your car or your house. How do you think about data as a resource today and going forward? >> Yeah, I think the data as the new oil paradigm in my opinion, was an unhealthy, and it prompts different types of conversations around that. I think for certain companies, data is indeed an asset. If you're a company that is focused on information products and data products and that is core of your business, then of course there's monetization of data and then data as an asset, just like any other assets on the company's balance sheet. But for many enterprises to further their mission, I think considering data as a resource, I think is a better focus. So as a vital resource for the company, you need to make sure that there is an appropriate caring and feeding for it, there is an appropriate management of the resource and an appropriate evolution of the resource. So that's how I would like to consider it, it is a personal end of one perspective, that data as a resource that can power the mission of the company, the new products and services, I think that's a good, healthy way to look at it. At the center of it though, a lot of strategies, whether people talk about a digital strategy, whether the people talk about data strategy, what is important is a company to have a pool north star around what is the core mission of the company and what is the core strategy of the company. For Bristol-Myers Squibb, we are about transforming patients' lives through science. And we think about digital and data as key value levers and drivers of that strategy. So digital for the sake of digital or data strategy for the sake of data strategy is meaningless in my opinion. We are focused on making sure that how do we make sure that data and digital is an accelerant and has a value lever for the company's mission and company strategy. So that's why thinking about data as a resource, as a key resource for our scientific researchers or a key resource for our manufacturing team or a key resource for our sales and marketing, allows us to think about the actions and the strategies and tactics we need to deploy to make that effective. >> Yeah, that makes a lot of sense, you're constantly using that North star as your guideline and how data contributes to that mission. Krishna Cheriath, thanks so much for coming on the Cube and supporting the MIT Chief Data Officer community, it was a really pleasure having you. >> Thank you so much for Dave, hopefully you and the audience is safe and healthy during these times. >> Thank you for that and thank you for watching everybody. This is Vellante for the Cube's coverage of the MIT CDOIQ Conference 2020 gone virtual. Keep it right there, we'll right back right after this short break. (lively upbeat music)

Published Date : Sep 3 2020

SUMMARY :

leaders all around the world, coverage of the MIT CDOIQ. I'm looking forward to it. so that the important medicines I drive by it all the time, and digital infrastructure of the company of reporting into the CIO? So that's the construct that we have and accelerating the time to insights. and the data backbone, and allows you to sort of and enable the business to in areas like the cloud data warehouse and how is the data is to the lines of business? in the business community that I could talk to you forever on, that the CIO needs to be accountable for. about data is the new oil. that can power the mission of the company, and supporting the MIT Chief and healthy during these times. of the MIT CDOIQ Conference

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Dave VellantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Bristol-Myers SquibbORGANIZATION

0.99+

New JerseyLOCATION

0.99+

AWSORGANIZATION

0.99+

DevonLOCATION

0.99+

Palo AltoLOCATION

0.99+

Rapid Data LabORGANIZATION

0.99+

2013DATE

0.99+

Krishna CheriathPERSON

0.99+

two sidesQUANTITY

0.99+

twoQUANTITY

0.99+

COVID-19OTHER

0.99+

CelgeneORGANIZATION

0.99+

FirstQUANTITY

0.99+

CubeORGANIZATION

0.99+

KrishnaPERSON

0.99+

Heritage Bristol-Myers SquibbORGANIZATION

0.99+

2018DATE

0.99+

both sidesQUANTITY

0.99+

BothQUANTITY

0.98+

BostonLOCATION

0.98+

2016DATE

0.98+

CDOTITLE

0.98+

two modesQUANTITY

0.98+

COVIDOTHER

0.98+

firstQUANTITY

0.98+

Bristol-Myers SquibbORGANIZATION

0.98+

last NovemberDATE

0.98+

Data Protection OfficeORGANIZATION

0.98+

OneQUANTITY

0.98+

two partQUANTITY

0.98+

SecondlyQUANTITY

0.98+

secondQUANTITY

0.98+

MITORGANIZATION

0.98+

bothQUANTITY

0.98+

MIT CDOIQ Conference 2020EVENT

0.97+

Heritage CelgeneORGANIZATION

0.97+

oneQUANTITY

0.97+

COVID-19 timesOTHER

0.96+

todayDATE

0.96+

BMSORGANIZATION

0.96+

single providerQUANTITY

0.95+

single stackQUANTITY

0.93+

Bristol Myers SquibbPERSON

0.93+

one shotQUANTITY

0.92+

Cube StudiosORGANIZATION

0.9+

one perspectiveQUANTITY

0.9+

Bristol-MyersORGANIZATION

0.9+

Business InsightsORGANIZATION

0.89+

two speedQUANTITY

0.89+

twofoldQUANTITY

0.84+

secondaryQUANTITY

0.8+

SecondarilyQUANTITY

0.77+

MIT CDOIQORGANIZATION

0.76+

MassachusettsLOCATION

0.75+

MITCDOIQ 2020EVENT

0.74+

VellantePERSON

0.72+

DataPERSON

0.71+

Chief Data OfficerPERSON

0.61+

Sriram Raghavan, IBM Research AI | IBM Think 2020


 

(upbeat music) >> Announcer: From the cube Studios in Palo Alto and Boston, it's the cube! Covering IBM Think. Brought to you by IBM. >> Hi everybody, this is Dave Vellante of theCUBE, and you're watching our coverage of the IBM digital event experience. A multi-day program, tons of content, and it's our pleasure to be able to bring in experts, practitioners, customers, and partners. Sriram Raghavan is here. He's the Vice President of IBM Research in AI. Sriram, thanks so much for coming on thecUBE. >> Thank you, pleasure to be here. >> I love this title, I love the role. It's great work if you're qualified for it.(laughs) So, tell us a little bit about your role and your background. You came out of Stanford, you had the pleasure, I'm sure, of hanging out in South San Jose at the Almaden labs. Beautiful place to create. But give us a little background. >> Absolutely, yeah. So, let me start, maybe go backwards in time. What do I do now? My role's responsible for AI strategy, planning, and execution in IBM Research across our global footprint, all our labs worldwide and their working area. I also work closely with the commercial parts. The parts of IBM, our Software and Services business that take the innovation, AI innovation, from IBM Research to market. That's the second part of what I do. And where did I begin life in IBM? As you said, I began life at our Almaden Research Center up in San Jose, up in the hills. Beautiful, I had in a view. I still think it's the best view I had. I spent many years there doing work at the intersection of AI and large-scale data management, NLP. Went back to India, I was running the India lab there for a few years, and now I'm back here in New York running AI strategy. >> That's awesome. Let's talk a little bit about AI, the landscape of AI. IBM has always made it clear that you're not doing consumer AI. You're really tying to help businesses. But how do you look at the landscape? >> So, it's a great question. It's one of those things that, you know, we constantly measure ourselves and our partners tell us. I think we, you've probably heard us talk about the cloud journey . But look barely 20% of the workloads are in the cloud, 80% still waiting. AI, at that number is even less. But, of course, it varies. Depending on who you ask, you would say AI adoption is anywhere from 4% to 30% depending on who you ask in this case. But I think it's more important to look at where is this, directionally? And it's very, very clear. Adoption is rising. The value is more, it's getting better appreciated. But I think more important, I think is, there is broader recognition, awareness and investment, knowing that to get value out of AI, you start with where AI begins, which is data. So, the story around having a solid enterprise information architecture as the base on which to drive AI, is starting to happen. So, as the investments in data platform, becoming making your data ready for AI, starts to come through. We're definitely seeing that adoption. And I think, you know, the second imperative that businesses look for obviously is the skills. The tools and the skills to scale AI. It can't take me months and months and hours to go build an AI model, I got to accelerate it, and then comes operationalizing. But this is happening, and the upward trajectory is very, very clear. >> We've been talking a lot on theCUBE over the last couple of years, it's not the innovation engine of our industry is no longer Moore's Law, it's a combination of data. You just talked about data. Applying machine technology to that data, being able to scale it, across clouds, on-prem, wherever the data lives. So. >> Right. >> Having said that, you know, you've had a journey. You know, you started out kind of playing "Jeopardy!", if you will. It was a very narrow use case, and you're expanding that use case. I wonder if you could talk about that journey, specifically in the context of your vision. >> Yeah. So, let me step back and say for IBM Research AI, when I think about how we, what's our strategy and vision, we think of it as in two parts. One part is the evolution of the science and techniques behind AI. And you said it, right? From narrow, bespoke AI that all it can do is this one thing that it's really trained for, it takes a large amount of data, a lot of computing power. Two, how do you have the techniques and the innovation for AI to learn from one use case to the other? Be less data hungry, less resource hungry. Be more trustworthy and explainable. So, we call that the journey from narrow to broad AI. And one part of our strategy, as scientists and technologists, is the innovation to make that happen. So that's sort of one part. But, as you said, as people involved in making AI work in the enterprise, and IBM Research AI vision would be incomplete without the second part, which is, what are the challenges in scaling and operationalizing AI? It isn't sufficient that I can tell you AI can do this, how do I make AI do this so that you get the right ROI, the investment relative to the return makes sense and you can scale and operationalize. So, we took both of these imperatives. The AI narrow-to-broad journey, and the need to scale and operationalize. And what of the things that are making it hard? The things that make scaling and operationalizing harder: data challenges, we talked about that, skills challenges, and the fact that in enterprises, you have to govern and manage AI. And we took that together and we think of our AI agenda in three pieces: Advancing, trusting, and scaling AI. Advancing is the piece of pushing the boundary, making AI narrow to broad. Trusting is building AI which is trustworthy, is explainable, you can control and understand its behavior, make sense of it and all of the technology that goes with it. And scaling AI is when we address the problem of, how do I, you know, reduce the time and cost for data prep? How do I reduce the time for model tweaking and engineering? How do I make sure that a model that you build today, when something changes in the data, I can quickly allow for you to close the loop and improve the model? All of the things, think of day-two operations of AI. All of that is part of our scaling AI strategy. So advancing, trusting, scaling is sort of the three big mantras around which the way we think about our AI. >> Yeah, so I've been doing a little work in this around this notion of DataOps. Essentially, you know, DevOps applied to the data and the data pipeline, and I had a great conversation recently with Inderpal Bhandari, IBM's Global Chief Data Officer, and he explained to me how, first of all, customers will tell you, it's very hard to operationalize AIs. He and his team took that challenge on themselves and have had some great success. And, you know, we all know the problem. It's that, you know AI has to wait for the data. It has to wait for the data to be cleansed and wrangled. Can AI actually help with that part of the problem, compressing that? >> 100%. In fact, the way we think of the automation and scaling story is what we call the "AI For AI" story. So, AI in service of helping you build the AI that helps you make this with speed, right? So, and I think of it really in three parts. It's AI for data automation, our DataOps. AI used in better discovery, better cleansing, better configuration, faster linking, quality assessment, all of that. Using AI to do all of those data problems that you had to do. And I called it AI for data automation. The second part is using AI to automatically figure out the best model. And that's AI for data science automation, which is, feature engineering, hyperparameter optimization, having them all do work, why should a data scientist take weeks and months experimenting? If the AI can accelerate that from weeks to a matter of hours? That's data science automation. And then comes the important part, also, which is operations automation. Okay, I've put a data model into an application. How do I monitor its behavior? If the data that it's seeing is different from the data it was trained on, how do I quickly detect it? And a lot of the work from Research that was part of that Watson OpenScale offering is really addressing the operational side. So AI for data, AI for data science automation, and AI to help automate production of AI, is the way we break that problem up. >> So, I always like to ask folks that are deep into R&D, how they are ultimately are translating into commercial products and offerings? Because ultimately, you got to make money to fund more R&D. So, can you talk a little bit about how you do that, what your focus is there? >> Yeah, so that's a great question, and I'm going to use a few examples as well. But let me say at the outset, this is a very, very closed partnership. So when we, the Research part of AI and our portfolio, it's a closed partnership where we're constantly both drawing problem as well as building technology that goes into the offering. So, a lot of our work, much of our work in AI automation that we were talking about, is part of our Watson Studio, Watson Machine Learning, Watson OpenScale. In fact, OpenScale came out of Research working Trusted AI, and is now a centerpiece of our Watson project. Let me give a very different example. We have a very, very strong portfolio and focus in NLP, Natural Language Processing. And this directly goes into capabilities out of Watson Assistant, which is our system for conversational support and customer support, and Watson Discovery, which is about making enterprise understand unstructurally. And a great example of that is the Working Project Debater that you might have heard, which is a grand challenge in Research about building a machine that can do debate. Now, look, we weren't looking to go sell you a debating machine. But what did we build as part of doing that, is advances in NLP that are all making their way into assistant and discovery. And we actually just talked about earlier this year, announced a set of capabilities around better clustering, advanced summarization, deeper sentiment analysis. These made their way into Assistant and Discovery but are born out of research innovation and solving a grand problem like building a debating machine. That's just an example of how that journey from research to product happens. >> Yeah, the Debater documentary, I've seen some of that. It's actually quite astounding. I don't know what you're doing there. It sounds like you're taking natural language and turning it into complex queries with data science and AI, but it's quite amazing. >> Yes, and I would encourage you, you will see that documentary, by the way, on Channel 7, in the Think Event. And I would encourage you, actually the documentary around how Debater happened, sort of featuring back of the you know, backdoor interviews with the scientist who created it was actually featured last minute at Copenhagen International Documentary Festival. I'll invite viewers to go to Channel 7 and Data and AI Tech On-Demand to go take a look at that documentary. >> Yeah, you should take a look at it. It's actually quite astounding and amazing. Sriram, what are you working on these days? What kind of exciting projects or what's your focus area today? >> Look, I think there are three imperatives that we're really focused on, and one is very, you know, just really the project you're talking about, NLP. NLP in the enterprise, look, text is a language of business, right? Text is the way business is communicated. Within each other, with their partners, with the entire world. So, helping machines understand language, but in an enterprise context, recognizing that data and the enterprises live in complex documents, unstructured documents, in e-mail, they live in conversations with the customers. So, really pushing the boundary on how all our customers and clients can make sense of this vast volume of unstructured data by pushing the advances of NLP, that's one focus area. Second focus area, we talked about trust and how important that is. And we've done amazing work in monitoring and explainability. And we're really focused now on this emerging area of causality. Using causality to explain, right? The model makes this because the model believes this is what it wants, it's a beautiful way. And the third big focus continues to be on automation. So, NLP, trust, automation. Those are, like, three big focus areas for us. >> sriram, how far do you think we can take AI? I know it's a topic of conversation, but from your perspective, deep into the research, how far can it go? And maybe how far should it go? >> Look, I think we are, let me answer it this way. I think the arc of the possible is enormous. But I think we are at this inflection point in which I think the next wave of AI, the AI that's going to help us this narrow-to-broad journey we talked about, look, the narrow-to-broad journey's not like a one-week, one-year. We're talking about a decade of innovation. But I think we are at a point where we're going to see a wave of AI that we like to call "neuro-symbolic AI," which is AI that brings together two sort of fundamentally different approaches to building intelligence systems. One approach of building intelligence system is what we call "knowledge driven." Understand data, understand concept, logically, reasonable. We human beings do that. That was really the way AI was born. The more recent last couple of decades of AI was data driven, Machine learning. Give me vast volumes of data, I'll use neural techniques, deep learning, to to get value. We're at a point where we're going to bring both of them together. Cause you can't build trustworthy, explainable systems using only one, you can't get away from not using all of the data that you have to make them. So, neuro-symbolic AI is, I think, going to be the linchpin of how we advance AI and make it more powerful and trustworthy. >> So, are you, like, living your childhood dream here or what? >> Look, for me I'm fascinated. I've always been fascinated. And any time you can't find a technology person who hasn't dreamt of building an intelligent machine. To have a job where I can work across our worldwide set of 3,000 plus researchers and think and brainstorm on strategy with AI. And then, most importantly, not to forget, right? That you talked about being able to move it into our portfolios so it actually makes a difference for our clients. I think it's a dream job and a whole lot of fun. >> Well, Sriram, it was great having you on theCUBE. A lot of fun, interviewing folks like you. I feel a little bit smarter just talking to you. So thanks so much for coming on. >> Fantastic. It's been a pleasure to be here. >> And thank you for watching, everybody. You're watching theCUBE's coverage of IBM Think 2020. This is Dave Vellante. We'll be right back right after this short break. (upbeat music)

Published Date : May 7 2020

SUMMARY :

Brought to you by IBM. and it's our pleasure to be at the Almaden labs. that take the innovation, AI innovation, But how do you look at the landscape? But look barely 20% of the it's not the innovation I wonder if you could and the innovation for AI to learn and the data pipeline, and And a lot of the work from So, can you talk a little that goes into the offering. Yeah, the Debater documentary, of featuring back of the Sriram, what are you and the enterprises live the data that you have to make them. And any time you can't just talking to you. a pleasure to be here. And thank you for watching, everybody.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

IBMORGANIZATION

0.99+

Sriram RaghavanPERSON

0.99+

New YorkLOCATION

0.99+

80%QUANTITY

0.99+

20%QUANTITY

0.99+

BostonLOCATION

0.99+

SriramPERSON

0.99+

IBM ResearchORGANIZATION

0.99+

Palo AltoLOCATION

0.99+

Inderpal BhandariPERSON

0.99+

two partsQUANTITY

0.99+

second partQUANTITY

0.99+

bothQUANTITY

0.99+

4%QUANTITY

0.99+

IndiaLOCATION

0.99+

One partQUANTITY

0.99+

one partQUANTITY

0.99+

Channel 7ORGANIZATION

0.99+

one-yearQUANTITY

0.99+

San JoseLOCATION

0.99+

sriramPERSON

0.99+

one-weekQUANTITY

0.99+

3,000 plus researchersQUANTITY

0.99+

TwoQUANTITY

0.99+

three partsQUANTITY

0.98+

Copenhagen International Documentary FestivalEVENT

0.98+

South San JoseLOCATION

0.98+

Second focusQUANTITY

0.98+

30%QUANTITY

0.98+

three piecesQUANTITY

0.98+

DataORGANIZATION

0.98+

One approachQUANTITY

0.97+

earlier this yearDATE

0.97+

JeopardyTITLE

0.96+

AlmadenORGANIZATION

0.96+

oneQUANTITY

0.95+

OpenScaleORGANIZATION

0.95+

threeQUANTITY

0.94+

one focus areaQUANTITY

0.94+

third bigQUANTITY

0.93+

Watson AssistantTITLE

0.92+

one use caseQUANTITY

0.92+

MooreORGANIZATION

0.92+

todayDATE

0.91+

StanfordLOCATION

0.91+

Almaden Research CenterORGANIZATION

0.9+

one thingQUANTITY

0.88+

2020TITLE

0.87+

waveEVENT

0.87+

WatsonTITLE

0.86+

three big mantrasQUANTITY

0.85+

> 100%QUANTITY

0.85+

two sortQUANTITY

0.84+

ThinkCOMMERCIAL_ITEM

0.83+

second imperativeQUANTITY

0.81+

Global Chief Data OfficerPERSON

0.8+

three imperativesQUANTITY

0.76+

last couple of yearsDATE

0.76+

DebaterTITLE

0.76+

WatsonORGANIZATION

0.72+

NLPORGANIZATION

0.72+

StudioORGANIZATION

0.72+

dayQUANTITY

0.67+

twoQUANTITY

0.65+

VicePERSON

0.65+

theCUBEORGANIZATION

0.63+

Watson DiscoveryTITLE

0.62+

theCUBETITLE

0.6+

Chris Lynch, AtScale | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by, SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts, everybody. You're watching theCUBE, the leader in live tech coverage. I'm Dave Vellante with my co-host, Paul Gillan. Chris Lynch, good friend is here CEO, newly minted CEO and AtScale and legend. Good to see you. >> In my own mind. >> In mine too. >> It's great to be here. >> It's awesome, thank you for taking time. I know how busy you are, you're running around like crazy your next big thing. I was excited to hear that you got back into it. I predicted it a while ago you were a very successful venture capitalists but at heart, you're startup guy, aren't ya? >> Yeah 100%, 100%. I couldn't be more thrilled, I feel invigorated. I think I've told you many times, when you've interviewed me and asked me about the transition from being an entrepreneur to being a VC and since it's a PG show, I've got a different analog than the one I usually give you. I used to be a movie star and now I'm an executive producer of movies. Now am back to being a movie star, hopefully. >> yeah well, so you told me when you first became a VC you said, I look for startups that have a 10X impact either 10X value, 10X cost reduction. What was it that attracted you to AtScale? What's the 10X? >> AtScale, addresses $150 billion market problem which is basically bringing traditional BI to the cloud. >> That's the other thing you told me, big markets. >> Yeah, so that's the first thing massive market opportunity. The second is, the innovation component and where the 10X comes we're uniquely qualified to virtualize data into the pipeline and out. So I like to say that we're the bridge between BI and AI and back. We make every BI user, a citizen data scientist and that's a game changer. And that's sort of the new futuristic component of what we do. So one part is steeped in, that $150 billion BI marketplace in a traditional analytics platforms and then the second piece is into you delivering the data, into these BI excuse me, these AI machine learning platforms. >> Do you see that ultimately getting integrated into some kind of larger, data pipeline framework. I mean, maybe it lives in the cloud or maybe on prem, how do you see that evolving over time? >> So I believe that, with AtScale as one single pane of glass, we basically are providing an API, to the data and to the user, one single API. The reason that today we haven't seen the delivery of the promise of big data is because we don't have big data. Fortunate 2000 companies don't have big data. They have lots of data but to me big data means you can have one logical view of that data and get the best data pumped into these models in these tools, and today that's not the case. They're constricted by location they're constricted by vendor they're constricted by whether it's in the cloud or on prem. We eliminate those restrictions. >> The single API, I think is important actually. Because when you look at some of these guys what they're doing with their data pipeline they might have 10 or 15 unique API's that they're trying to manage. So there's a simplification aspect to, I suppose. >> One of the knocks on traditional BI has always been the need for extract databases and all the ETL that goes that's involved in that. Do you guys avoid that stage? You go to the production data directly or what's the-- >> It's a great question. The way I put it is, we bring Moses to the mountain the mountain being the data, Moses being the user. Traditionally, what people have been trying to do is bring the mountain to Moses, doesn't scale. At AtScale, we provide an abstraction a logical distraction between the data and the BI user. >> You don't touch, you don't move the data. >> We don't move the data. Which is what's unique and that's what's delivering I think, way more than a 10X delivery in value. >> Because you leave the data in place you bring that value to wherever the data is. Which is the original concept of Hadoop, by the way. That was what was profound about Hadoop everybody craps on it now, but that was the game changer and if you could take advantage of that that's how you tap your 10X. >> To the difference is, we're not, to your point we're not moving the data. Hadoop, in my humble opinion why it plateaued is because to get the value, you had to ask the user to bring and put data in yet another platform. And the reason that we're not delivering on big data as an industry, I believe is because we've too many data sources, too many platforms too many consumers of data and too many producers. As we build all these islands of data, with no connectivity. The idea is, we'll create this big data lake and we're going to physically put everything in there. Guess what? Someday turned out to be never. Because people aren't going to deal with the business disruption. We move thousands of users from a platform like Teradata to a platform like Snowflake or Google BigQuery, we don't care. We're a multi-cloud and we're a hybrid cloud. But we do it without any disruption. You're using Excel, you just continue and use it. You just see the results are faster. You use Tableau, same difference. >> So we had all the vertical rock stars in here. So we had Colin in yesterday, we had Stonebraker around earlier. Andy Palmer just came on and Chris here with the CEO who ultimately sold the company to HP. That really didn't do anything with it and then spun it off and now it's back. Aaron was, he had a spring in his step yesterday. So when you think about, Vertica. The technology behind Vertica go back 10 years and where we come now give us a little journey of, your data journey. >> So I think it plays into the, the original assertion is that, vertical is a best-in-class platform for analytics but it was yet another platform. The analog I give now, is now we have Snowflake and six months, 12 months from now we're going to have another one. And that creates a set of problems if you have to live in the physical world. Because you've all these islands of data and I believe, it's about the data not about the models, it's about the data. You can't get optimal results if you don't have an optimal access to the pertinent data. I believe that having that Universal API is going to make the next platform that more valuable. You're not going to be making the trade-off is, okay we have this platform that has some neat capability but the trade-off is from an enterprise architecture perspective we're never going to be able to connect all this stuff. That's how all of these things proliferated. My view is, in a world where you have that single pane of glass, that abstraction layer between the user and the data. Then innovation can be spawned quicker and you can use these tools effectively 'cause you're not compromising being able to get a logical view of the data and get access to it as a user. >> What's your issue with Snowflake you mentioned them, Mugli's company-- >> No issue, they're a great partner of ours. We eliminate the friction between the user going from an on-prem solution to the cloud. >> Slootman just took over there. So you know where that's going. >> Yep (laughing) >> Frank's got the magic touch. Okay good, you say they're a partner yours how are you guys partnering? >> They refer us into customers that, if you want to buy Snowflake now the next issue is, how do i migrate? You don't. You put our virtualization layer in and then we allow you access to Snowflake in a non-disruptive way, versus having to move data into their system or into a particular cloud which creates sales friction. >> Moving data is just, you want to avoid it at all cost. >> I do want to ask you because I met with your predecessors, Dave Mariani last year and I know he was kind of a reluctant CEO he didn't really want to be CEO but wanted to be CTO, which is what he is now. How did that come about, that they found you that you connected with them and decided this was the right opportunity. >> That's a great question. I actually looked at the company at the seed stage when I was in venture, but I had this thing as you know that, I wanted to move companies to Boston and they're about my vintage age-wise and he's married with four kids so that wasn't in the cards. I said look, it doesn't make sense for me to seed this company 'cause I can't give you the time you're out in California everything I'm instrumenting is around Boston. We parted friends. And I was skeptical whether he could build this 'cause people have been talking about building a heterogeneous universal semantic layer, for years and it's never come to fruition. And then he read in Fortune or Forbes that I was leaving Accomplice and that I was looking for one more company to operate. He reached out and he told me what they were doing that hey, we really built it but we need help and I don't want to run this. It's not right for the company and the opportunity So he said, "I'll come and I'll consult to you." I put together a plan and I had my Vertica and data robot. NekTony guys do the technical diligence to make sure that the architecture wasn't wedded to the dupe, like all the other ones were and when I saw it wasn't then I knew the market opportunity was to take that, rifle and point it at that legacy $150 billion BI market not at the billion dollar market of Hadoop. And when we did that, we've been growing at 162% quarter-over-quarter. We've built development centers in Bulgaria. We've moved all operations, non-technical to Boston here down in our South Station. We've been on fire and we are the partner of choice of every cloud manner, because we eliminate the sales friction, for customers being able to take advantage of movement to the cloud and we're able through our intelligent pipeline and capability. We're able to reduce the cost significantly of queries because we understand and we were able to intelligently cash those queries. >> Sales ops is here, all-- >> Sales marketing, customer support, customer success and we're building a machine learning team here at Dev team here. >> Where are you in that sort of Boston build-out? >> We have an office on 711 Atlantic that we opened in the fall. We're actually moving from 4,000 square feet to 10,000 this month. In less than six months and we'll house by the first year, 100 employees in Boston 100 in Bulgaria and about that same hundred in San Mateo. >> Are you going after net new business mainly? Or there's a lot of legacy BI out there are you more displacing those products? >> A couple of things. What we find is that, customers want to evolve into the cloud, they don't want a revolution they want a evolution. So we allow them, because we support hybrid cloud to keep some data behind the firewall and then experiment with moving other data to the cloud platform of choice but we're still providing that one logical view. I would say most of our customers are looking to reap platform, off of Teradata or something onto a, another platform like Snowflake. And then we have a set of customers that see that as part of the solution but not the whole solution. They're more true hybrids but I would say that 80% of our customers are traditional BI customers that are trying to contemporize their environments and be able to take advantage of tabular support and multidimensional, the things that we do in addition to the cube world. >> They can keep whatever they're using. >> Correct, that's the key. >> Did you do the series D, you did, right? >> Yes, Morgan Stanely led. >> So you're not actively but you're good for now, It was like $50 million >> Yeah we raised $50 million. >> You're good for a bit. Who's in the Chris Lynch target? (laughs) Who's the enemy? Vertica, I could say it was the traditional database guys. Who's the? >> We're in a unique position, we're almost Switzerland so we could be friend to foe, of anybody in that ecosystem because we can, non-disruptively re-platform customers between legacy platforms or from legacy platforms to the cloud. We're an interesting position. >> So similar to the file sharing. File virtualization company >> The Copier. >> Copier yeah. >> It puts us in an interesting position. They need to be friends with us and at the same time I'm sure that they're concerned about the capabilities we have but we have a number of retail customers for instance that have asked us to move down from Amazon to Google BigQuery, which we accommodate and because we can do that non-disruptively. The cost and the ability to move is eliminated. It gives customers true freedom of choice. >> How worried are you, that AWS tries to replicate what you guys do. You're in their sights. >> I think there are technical, legal and structural barriers to them doing that. The technical is, this team has been at it for six and a half years. So to do what we do, they'll have to do what we've done. Structurally from a business perspective if they could, I'm not sure they want to. The way to think about Amazon is, they're no different than Teradata, except for they want the same vendor lock-in except they want it to be the Amazon Cloud when Teradata wanted it to be, their data warehouse. >> They don't promote multi-cloud versus-- >> Yeah, they don't want multi-cloud they don't want >> On Prem >> Customers to have a freedom of choice. Would they really enable a heterogeneous abstraction layer, I don't think they would nor do I think any of the big guys would. They all claim to have this capability for their system. It's like the old IBM adage I'm in prison but the food's going to get three squares a day, I get cable TV but I'm in prison. (laughing) >> Awesome, all right, parting thoughts. >> Parting thoughts, oh geez you got to give me a question I'm not that creative. >> What's next, for you guys? What should we be paying attention to? >> I think you're going to see some significant announcements in September regarding the company and relationships that I think will validate the impact we're having in the market. >> Give you some leverage >> Yeah, will give us, better channel leverage. We have a major technical announcement that I think will be significant to the marketplace and what will be highly disruptive to some of the people you just mentioned. In terms of really raising the bar for customers to be able to have the freedom of choice without any sort of vendor lock-in. And I think that that will create some counter strike which we'll be ready for. (laughing) >> If you've never heard of AtScale before trust me you're going to in the next 18 months. Chris Lynch, thanks so much for coming on theCUBE. >> It's my pleasure. >> Great to see you. All right, keep it right there everybody we're back with our next guest, right after this short break you're watching theCUBE from MIT, right back. (upbeat music)

Published Date : Aug 2 2019

SUMMARY :

Brought to you by, SiliconANGLE Media. Good to see you. that you got back into it. and asked me about the transition What was it that attracted you to AtScale? traditional BI to the cloud. That's the other thing and then the second piece is into you I mean, maybe it lives in the cloud and get the best data Because when you look and all the ETL that goes is bring the mountain don't move the data. We don't move the data. and if you could take advantage of that is because to get the value, So when you think about, Vertica. and I believe, it's about the data We eliminate the friction between the user So you know where that's going. Frank's got the magic touch. and then we allow you access to Snowflake you want to avoid it that they found you and it's never come to fruition. and we're building a by the first year, 100 employees in Boston the things that we do Who's in the Chris Lynch target? to the cloud. So similar to the file sharing. about the capabilities we have tries to replicate what you guys do. So to do what we do, they'll I'm in prison but the food's you got to give me a question in September regarding the to some of the people you just mentioned. in the next 18 months. Great to see you.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillanPERSON

0.99+

Dave VellantePERSON

0.99+

Chris LynchPERSON

0.99+

AmazonORGANIZATION

0.99+

BulgariaLOCATION

0.99+

SeptemberDATE

0.99+

ChrisPERSON

0.99+

AWSORGANIZATION

0.99+

10QUANTITY

0.99+

HPORGANIZATION

0.99+

Andy PalmerPERSON

0.99+

Dave MarianiPERSON

0.99+

CaliforniaLOCATION

0.99+

AaronPERSON

0.99+

BostonLOCATION

0.99+

San MateoLOCATION

0.99+

$150 billionQUANTITY

0.99+

$50 millionQUANTITY

0.99+

$150 billionQUANTITY

0.99+

MosesPERSON

0.99+

80%QUANTITY

0.99+

4,000 square feetQUANTITY

0.99+

last yearDATE

0.99+

second pieceQUANTITY

0.99+

162%QUANTITY

0.99+

South StationLOCATION

0.99+

AtScaleORGANIZATION

0.99+

Morgan StanelyPERSON

0.99+

100%QUANTITY

0.99+

four kidsQUANTITY

0.99+

ExcelTITLE

0.99+

six and a half yearsQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

IBMORGANIZATION

0.99+

TableauTITLE

0.99+

yesterdayDATE

0.99+

firstQUANTITY

0.99+

secondQUANTITY

0.99+

TeradataORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

less than six monthsQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

FrankPERSON

0.99+

todayDATE

0.98+

this monthDATE

0.98+

SwitzerlandLOCATION

0.98+

HadoopTITLE

0.98+

10XQUANTITY

0.98+

100 employeesQUANTITY

0.98+

one partQUANTITY

0.98+

SlootmanPERSON

0.98+

10,000QUANTITY

0.97+

VerticaORGANIZATION

0.97+

MugliORGANIZATION

0.97+

GoogleORGANIZATION

0.97+

15 unique APIQUANTITY

0.96+

hundredQUANTITY

0.96+

six monthsQUANTITY

0.96+

three squares a dayQUANTITY

0.96+

thousands of usersQUANTITY

0.96+

NekTonyORGANIZATION

0.96+

FortuneTITLE

0.96+

12 monthsQUANTITY

0.95+

single APIQUANTITY

0.95+

711 AtlanticLOCATION

0.95+

2000 companiesQUANTITY

0.94+

OneQUANTITY

0.94+

next 18 monthsDATE

0.94+

ColinPERSON

0.93+

one more companyQUANTITY

0.92+

one single APIQUANTITY

0.92+

single paneQUANTITY

0.91+

Susan Wilson, Informatica & Blake Andrews, New York Life | MIT CDOIQ 2019


 

(techno music) >> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts everybody, we're here with theCUBE at the MIT Chief Data Officer Information Quality Conference. I'm Dave Vellante with my co-host Paul Gillin. Susan Wilson is here, she's the vice president of data governance and she's the leader at Informatica. Blake Anders is the corporate vice president of data governance at New York Life. Folks, welcome to theCUBE, thanks for coming on. >> Thank you. >> Thank you. >> So, Susan, interesting title; VP, data governance leader, Informatica. So, what are you leading at Informatica? >> We're helping our customers realize their business outcomes and objectives. Prior to joining Informatica about 7 years ago, I was actually a customer myself, and so often times I'm working with our customers to understand where they are, where they going, and how to best help them; because we recognize data governance is more than just a tool, it's a capability that represents people, the processes, the culture, as well as the technology. >> Yeah so you've walked the walk, and you can empathize with what your customers are going through. And Blake, your role, as the corporate VP, but more specifically the data governance lead. >> Right, so I lead the data governance capabilities and execution group at New York Life. We're focused on providing skills and tools that enable government's activities across the enterprise at the company. >> How long has that function been in place? >> We've been in place for about two and half years now. >> So, I don't know if you guys heard Mark Ramsey this morning, the key-note, but basically he said, okay, we started with enterprise data warehouse, we went to master data management, then we kind of did this top-down enterprise data model; that all failed. So we said, all right, let's pump the governance. Here you go guys, you fix our corporate data problem. Now, right tool for the right job but, and so, we were sort of joking, did data governance fail? No, you always have to have data governance. It's like brushing your teeth. But so, like I said, I don't know if you heard that, but what are your thoughts on that sort of evolution that he described? As sort of, failures of things like EDW to live up to expectations and then, okay guys over to you. Is that a common theme? >> It is a common theme, and what we're finding with many of our customers is that they had tried many of the, if you will, the methodologies around data governance, right? Around policies and structures. And we describe this as the Data 1.0 journey, which was more application-centric reporting to Data 2.0 to data warehousing. And a lot of the failed attempts, if you will, at centralizing, if you will, all of your data, to now Data 3.0, where we look at the explosion of data, the volumes of data, the number of data consumers, the expectations of the chief data officer to solve business outcomes; crushing under the scale of, I can't fit all of this into a centralized data at repository, I need something that will help me scale and to become more agile. And so, that message does resonate with us, but we're not saying data warehouses don't exist. They absolutely do for trusted data sources, but the ability to be agile and to address many of your organizations needs and to be able to service multiple consumers is top-of-mind for many of our customers. >> And the mind set from 1.0 to 2.0 to 3.0 has changed. From, you know, data as a liability, to now data as this massive asset. It's sort of-- >> Value, yeah. >> Yeah, and the pendulum is swung. It's almost like a see-saw. Where, and I'm not sure it's ever going to flip back, but it is to a certain extent; people are starting to realize, wow, we have to be careful about what we do with our data. But still, it's go, go, go. But, what's the experience at New York Life? I mean, you know. A company that's been around for a long time, conservative, wants to make sure risk averse, obviously. >> Right. >> But at the same time, you want to keep moving as the market moves. >> Right, and we look at data governance as really an enabler and a value-add activity. We're not a governance practice for the sake of governance. We're not there to create a lot of policies and restrictions. We're there to add value and to enable innovation in our business and really drive that execution, that efficiency. >> So how do you do that? Square that circle for me, because a lot of people think, when people think security and governance and compliance they think, oh, that stifles innovation. How do you make governance an engine of innovation? >> You provide transparency around your data. So, it's transparency around, what does the data mean? What data assets do we have? Where can I find that? Where are my most trusted sources of data? What does the quality of that data look like? So all those things together really enable your data consumers to take that information and create new value for the company. So it's really about enabling your value creators throughout the organization. >> So data is an ingredient. I can tell you where it is, I can give you some kind of rating as to the quality of that data and it's usefulness. And then you can take it and do what you need to do with it in your specific line of business. >> That's right. >> Now you said you've been at this two and half years, so what stages have you gone through since you first began the data governance initiative. >> Sure, so our first year, year and half was really focused on building the foundations, establishing the playbook for data governance and building our processes and understanding how data governance needed to be implemented to fit New York Life in the culture of the company. The last twelve months or so has really been focused on operationalizing governance. So we've got the foundations in place, now it's about implementing tools to further augment those capabilities and help assist our data stewards and give them a better skill set and a better tool set to do their jobs. >> Are you, sort of, crowdsourcing the process? I mean, you have a defined set of people who are responsible for governance, or is everyone taking a role? >> So, it is a two-pronged approach, we do have dedicated data stewards. There's approximately 15 across various lines of business throughout the company. But, we are building towards a data democratization aspect. So, we want people to be self-sufficient in finding the data that they need and understanding the data. And then, when they have questions, relying on our stewards as a network of subject matter experts who also have some authorizations to make changes and adapt the data as needed. >> Susan, one of the challenges that we see is that the chief data officers often times are not involved in some of these skunkworks AI projects. They're sort of either hidden, maybe not even hidden, but they're in the line of business, they're moving. You know, there's a mentality of move fast and break things. The challenge with AI is, if you start operationalizing AI and you're breaking things without data quality, without data governance, you can really affect lives. We've seen it. In one of these unintended consequences. I mean, Facebook is the obvious example and there are many, many others. But, are you seeing that? How are you seeing organizations dealing with that problem? >> As Blake was mentioning often times what it is about, you've got to start with transparency, and you got to start with collaborating across your lines of businesses, including the data scientists, and including in terms of what they are doing. And actually provide that level of transparency, provide a level of collaboration. And a lot of that is through the use of our technology enablers to basically go out and find where the data is and what people are using and to be able to provide a mechanism for them to collaborate in terms of, hey, how do I get access to that? I didn't realize you were the SME for that particular component. And then also, did you realize that there is a policy associated to the data that you're managing and it can't be shared externally or with certain consumer data sets. So, the objective really is around how to create a platform to ensure that any one in your organization, whether I'm in the line of business, that I don't have a technical background, or someone who does have a technical background, they can come and access and understand that information and connect with their peers. >> So you're helping them to discover the data. What do you do at that stage? >> What we do at that stage is, creating insights for anyone in the organization to understand it from an impact analysis perspective. So, for example, if I'm going to make changes, to as well as discovery. Where exactly is my information? And so we have-- >> Right. How do you help your customers discover that data? >> Through machine learning and artificial intelligence capabilities of our, specifically, our data catalog, that allows us to do that. So we use such things like similarity based matching which help us to identify. It doesn't have to be named, in miscellaneous text one, it could be named in that particular column name. But, in our ability to scan and discover we can identify in that column what is potentially social security number. It might have resided over years of having this data, but you may not realize that it's still stored there. Our ability to identify that and report that out to the data stewards as well as the data analysts, as well as to the privacy individuals is critical. So, with that being said, then they can actually identify the appropriate policies that need to be adhered to, alongside with it in terms of quality, in terms of, is there something that we need to archive. So that's where we're helping our customers in that aspect. >> So you can infer from the data, the meta data, and then, with a fair degree of accuracy, categorize it and automate that. >> Exactly. We've got a customer that actually ran this and they said that, you know, we took three people, three months to actually physically tag where all this information existed across something like 7,000 critical data elements. And, basically, after the set up and the scanning procedures, within seconds we were able to get within 90% precision. Because, again, we've dealt a lot with meta data. It's core to our artificial intelligence and machine learning. And it's core to how we built out our platforms to share that meta data, to do something with that meta data. It's not just about sharing the glossary and the definition information. We also want to automate and reduce the manual burden. Because we recognize with that scale, manual documentation, manual cataloging and tagging just, >> It doesn't work. >> It doesn't work. It doesn't scale. >> Humans are bad at it. >> They're horrible at it. >> So I presume you have a chief data officer at New York Life, is that correct? >> We have a chief data and analytics officer, yes. >> Okay, and you work within that group? >> Yes, that is correct. >> Do you report it to that? >> Yes, so-- >> And that individual, yeah, describe the organization. >> So that sits in our lines of business. Originally, our data governance office sat in technology. And then, our early 2018 we actually re-orged into the business under the chief data and analytics officer when that role was formed. So we sit under that group along with a data solutions and governance team that includes several of our data stewards and also some others, some data engineer-type roles. And then, our center for data science and analytics as well that contains a lot of our data science teams in that type of work. >> So in thinking about some of these, I was describing to Susan, as these skunkworks projects, is the data team, the chief data officer's team involved in those projects or is it sort of a, go run water through the pipes, get an MVP and then you guys come in. How does that all work? >> We're working to try to centralize that function as much as we can, because we do believe there's value in the left hand knowing what the right hand is doing in those types of things. So we're trying to build those communications channels and build that network of data consumers across the organization. >> It's hard right? >> It is. >> Because the line of business wants to move fast, and you're saying, hey, we can help. And they think you're going to slow them down, but in fact, you got to make the case and show the success because you're actually not going to slow them down to terms of the ultimate outcome. I think that's the case that you're trying to make, right? >> And that's one of the things that we try to really focus on and I think that's one of the advantages to us being embedded in the business under the CDAO role, is that we can then say our objectives are your objectives. We are here to add value and to align with what you're working on. We're not trying to slow you down or hinder you, we're really trying to bring more to the table and augment what you're already trying to achieve. >> Sometimes getting that organization right means everything, as we've seen. >> Absolutely. >> That's right. >> How are you applying governance discipline to unstructured data? >> That's actually something that's a little bit further down our road map, but one of the things that we have started doing is looking at our taxonomy's for structured data and aligning those with the taxonomy's that we're using to classify unstructured data. So, that's something we're in the early stages with, so that when we get to that process of looking at more of our unstructured content, we can, we already have a good feel for there's alignment between the way that we think about and organize those concepts. >> Have you identified automation tools that can help to bring structure to that unstructured data? >> Yes, we have. And there are several tools out there that we're continuing to investigate and look at. But, that's one of the key things that we're trying to achieve through this process is bringing structure to unstructured content. >> So, the conference. First year at the conference. >> Yes. >> Kind of key take aways, things that interesting to you, learnings? >> Oh, yes, well the number of CDO's that are here and what's top of mind for them. I mean, it ranges from, how do I stand up my operating model? We just had a session just about 30 minutes ago. A lot of questions around, how do I set up my organization structure? How do I stand up my operating model so that I could be flexible? To, right, the data scientists, to the folks that are more traditional in structured and trusted data. So, still these things are top-of-mind and because they're recognizing the market is also changing too. And the growing amount of expectations, not only solving business outcomes, but also regulatory compliance, privacy is also top-of-mind for a lot of customers. In terms of, how would I get started? And what's the appropriate structure and mechanism for doing so? So we're getting a lot of those types of questions as well. So, the good thing is many of us have had years of experience in this phase and the convergence of us being able to support our customers, not only in our principles around how we implement the framework, but also the technology is really coming together very nicely. >> Anything you'd add, Blake? >> I think it's really impressive to see the level of engagement with thought leaders and decision makers in the data space. You know, as Susan mentioned, we just got out of our session and really, by the end of it, it turned into more of an open discussion. There was just this kind of back and forth between the participants. And so it's really engaging to see that level of passion from such a distinguished group of individuals who are all kind of here to share thoughts and ideas. >> Well anytime you come to a conference, it's sort of any open forum like this, you learn a lot. When you're at MIT, it's like super-charged. With the big brains. >> Exactly, you feel it when you come on the campus. >> You feel smarter when you walk out of here. >> Exactly, I know. >> Well, guys, thanks so much for coming to theCUBE. It was great to have you. >> Thank you for having us. We appreciate it, thank you. >> You're welcome. All right, keep it right there everybody. Paul and I will be back with our next guest. You're watching theCUBE from MIT in Cambridge. We'll be right back. (techno music)

Published Date : Aug 2 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Susan Wilson is here, she's the vice president So, what are you leading at Informatica? and how to best help them; but more specifically the data governance lead. Right, so I lead the data governance capabilities and then, okay guys over to you. And a lot of the failed attempts, if you will, And the mind set from 1.0 to 2.0 to 3.0 has changed. Where, and I'm not sure it's ever going to flip back, But at the same time, Right, and we look at data governance So how do you do that? What does the quality of that data look like? and do what you need to do with it so what stages have you gone through in the culture of the company. in finding the data that they need is that the chief data officers often times and to be able to provide a mechanism What do you do at that stage? So, for example, if I'm going to make changes, How do you help your customers discover that data? and report that out to the data stewards and then, with a fair degree of accuracy, categorize it And it's core to how we built out our platforms It doesn't work. And that individual, And then, our early 2018 we actually re-orged is the data team, the chief data officer's team and build that network of data consumers but in fact, you got to make the case and show the success and to align with what you're working on. Sometimes getting that organization right but one of the things that we have started doing is bringing structure to unstructured content. So, the conference. And the growing amount of expectations, and decision makers in the data space. it's sort of any open forum like this, you learn a lot. when you come on the campus. Well, guys, thanks so much for coming to theCUBE. Thank you for having us. Paul and I will be back with our next guest.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillinPERSON

0.99+

SusanPERSON

0.99+

Dave VellantePERSON

0.99+

PaulPERSON

0.99+

Susan WilsonPERSON

0.99+

BlakePERSON

0.99+

InformaticaORGANIZATION

0.99+

CambridgeLOCATION

0.99+

Mark RamseyPERSON

0.99+

Blake AndersPERSON

0.99+

three monthsQUANTITY

0.99+

three peopleQUANTITY

0.99+

FacebookORGANIZATION

0.99+

New York LifeORGANIZATION

0.99+

early 2018DATE

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

First yearQUANTITY

0.99+

oneQUANTITY

0.99+

90%QUANTITY

0.99+

two and half yearsQUANTITY

0.98+

firstQUANTITY

0.98+

approximately 15QUANTITY

0.98+

7,000 critical data elementsQUANTITY

0.97+

about two and half yearsQUANTITY

0.97+

first yearQUANTITY

0.96+

twoQUANTITY

0.96+

about 30 minutes agoDATE

0.96+

theCUBEORGANIZATION

0.95+

Blake AndrewsPERSON

0.95+

MIT Chief Data Officer andEVENT

0.93+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.91+

EDWORGANIZATION

0.86+

last twelve monthsDATE

0.86+

skunkworksORGANIZATION

0.85+

CDAOORGANIZATION

0.85+

this morningDATE

0.83+

MITORGANIZATION

0.83+

7 years agoDATE

0.78+

yearQUANTITY

0.75+

Information Quality Symposium 2019EVENT

0.74+

3.0OTHER

0.66+

York LifeORGANIZATION

0.66+

2.0OTHER

0.59+

MIT CDOIQ 2019EVENT

0.58+

halfQUANTITY

0.52+

Data 2.0OTHER

0.52+

Data 3.0TITLE

0.45+

1.0OTHER

0.43+

DataOTHER

0.21+

Robert Abate, Global IDS | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (futuristic music) >> Welcome back to Cambridge, Massachusetts everybody. You're watching theCUBE, the leader in live tech coverage. We go out to the events and we extract the signal from the noise. This is day two, we're sort of wrapping up the Chief Data Officer event. It's MIT CDOIQ, it started as an information quality event and with the ascendancy of big data the CDO emerged and really took center stage here. And it's interesting to know that it's kind of come full circle back to information quality. People are realizing all this data we have, you know the old saying, garbage in, garbage out. So the information quality worlds and this chief data officer world have really come colliding together. Robert Abate is here, he's the Vice President and CDO of Global IDS and also the co-chair of next year's, the 14th annual MIT CDOIQ. Robert, thanks for coming on. >> Oh, well thank you. >> Now you're a CDO by background, give us a little history of your career. >> Sure, sure. Well I started out with an Electrical Engineering degree and went into applications development. By 2000, I was leading the Ralph Lauren's IT, and I realized when Ralph Lauren hired me, he was getting ready to go public. And his problem was he had hired eight different accounting firms to do eight different divisions. And each of those eight divisions were reporting a number, but the big number didn't add up, so he couldn't go public. So he searched the industry to find somebody who could figure out the problem. Now I was, at the time, working in applications and had built this system called Service Oriented Architectures, a way of integrating applications. And I said, "Well I don't know if I could solve the problem, "but I'll give it a shot." And what I did was, just by taking each silo as it's own problem, which was what EID Accounting Firm had done, I was able to figure out that one of Ralph Lauren's policies was if you buy a garment, you can return it anytime, anywhere, forever, however long you own it. And he didn't think about that, but what that meant is somebody could go to a Bloomingdale's, buy a garment and then go to his outlet store and return it. Well, the cross channels were different systems. So the outlet stores were his own business, retail was a different business, there was a completely different, each one had their own AS/400, their own data. So what I quickly learned was, the problem wasn't the systems, the problem was the data. And it took me about two months to figure it out and he offered me a job, he said well, I was a consultant at the time, he says, "I'm offering you a job, you're going to run my IT." >> Great user experience but hard to count. >> (laughs) Hard to count. So that's when I, probably 1999 was when that happened. I went into data and started researching-- >> Sorry, so how long did it take you to figure that out? You said a couple of months? >> A couple of months, I think it was about two months. >> 'Cause jeez, it took Oracle what, 10 years to build Fusion with SOA? That's pretty good. (laughs) >> This was a little bit of luck. When we started integrating the applications we learned that the messages that we were sending back and forth didn't match, and we said, "Well that's impossible, it can't not match." But what didn't match was it was coming from one channel and being returned in another channel, and the returns showed here didn't balance with the returns on this side. So it was a data problem. >> So a forensics showdown. So what did you do after? >> After that I went into ICICI Bank which was a large bank in India who was trying to integrate their systems, and again, this was a data problem. But they heard me giving a talk at a conference on how SOA had solved the data challenge, and they said, "We're a bank with a wholesale, a retail, "and other divisions, "and we can't integrate the systems, can you?" I said, "Well yeah, I'd build a website "and make them web services and now what'll happen is "each of those'll kind of communicate." And I was at ICICI Bank for about six months in Mumbai, and finished that which was a success, came back and started consulting because now a lot of companies were really interested in this concept of Service Oriented Architectures. Back then when we first published on it, myself, Peter Aiken, and a gentleman named Joseph Burke published on it in 1996. The publisher didn't accept the book, it was a really interesting thing. We wrote the book called, "Services Based Architectures: A Way to Integrate Systems." And the way Wiley & Sons, or most publishers work is, they'll have three industry experts read your book and if they don't think what you're saying has any value, they, forget about it. So one guy said this is brilliant, one guy says, "These guys don't know what they're talking about," and the third guy says, "I don't even think what they're talking about is feasible." So they decided not to publish. Four years later it came back and said, "We want to publish the book," and Peter said, "You know what, they lost their chance." We were ahead of them by four years, they didn't understand the technology. So that was kind of cool. So from there I went into consulting, eventually took a position as the Head of Enterprise and Director of Enterprise Information Architecture with Walmart. And Walmart, as you know, is a huge entity, almost the size of the federal government. So to build an architecture that integrates Walmart would've been a challenge, a behemoth challenge, and I took it on with a phenomenal team. >> And when was this, like what timeframe? >> This was 2010, and by the end of 2010 we had presented an architecture to the CIO and the rest of the organization, and they came back to me about a week later and said, "Look, everybody agrees what you did was brilliant, "but nobody knows how to implement it. "So we're taking you away, "you're no longer Director of Information Architecture, "you're now Director of Enterprise Information Management. "Build it. "Prove that what you say you could do, you could do." So we built something called the Data CAFE, and CAFE was an acronym, it stood for: Collaborative Analytics Facility for the Enterprise. What we did was we took data from one of the divisions, because you didn't want to take on the whole beast, boil the ocean. We picked Sam's Club and we worked with their CFO, and because we had information about customers we were able to build a room with seven 80 inch monitors that surrounded anyone in the room. And in the center was the Cisco telecommunications so you could be a part of a meeting. >> The TelePresence. >> TelePresence. And we built one room in one facility, and one room in another facility, and we labeled the monitors, one red, one blue, one green, and we said, "There's got to be a way where we can build "data science so it's interactive, so somebody, "an executive could walk into the room, "touch the screen, and drill into features. "And in another room "the features would be changing simultaneously." And that's what we built. The room was brought up on Black Friday of 2013, and we were able to see the trends of sales on the East Coast that we quickly, the executives in the room, and these are the CEO of Walmart and the heads of Sam's Club and the like, they were able to change the distribution in the Mountain Time Zone and west time zones because of the sales on the East Coast gave them the idea, well these things are going to sell, and these things aren't. And they saw a tremendous increase in productivity. We received the 2014, my team received the 2014 Walmart Innovation Project of the Year. >> And that's no slouch. Walmart has always been heavily data-oriented. I don't know if it's urban legend or not, but the famous story in the '80s of the beer and the diapers, right? Walmart would position beer next to diapers, why would they do that? Well the father goes in to buy the diapers for the baby, picks up a six pack while he's on the way, so they just move those proximate to each other. (laughs) >> In terms of data, Walmart really learned that there's an advantage to understanding how to place items in places that, a path that you might take in a store, and knowing that path, they actually have a term for it, I believe it's called, I'm sorry, I forgot the name but it's-- >> Selling more stuff. (laughs) >> Yeah, it's selling more stuff. It's the way you position items on a shelf. And Walmart had the brilliance, or at least I thought it was brilliant, that they would make their vendors the data champion. So the vendor, let's say Procter & Gamble's a vendor, and they sell this one product the most. They would then be the champion for that aisle. Oh, it's called planogramming. So the planogramming, the way the shelves were organized, would be set up by Procter & Gamble for that entire area, working with all their other vendors. And so Walmart would give the data to them and say, "You do it." And what I was purporting was, well, we shouldn't just be giving the data away, we should be using that data. And that was the advent of that. From there I moved to Kimberly-Clark, I became Global Director of Enterprise Data Management and Analytics. Their challenge was they had different teams, there were four different instances of SAP around the globe. One for Latin America, one for North America called the Enterprise Edition, one for EMEA, Europe, Middle East, and Africa, and one for Asia-Pacific. Well when you have four different instances of SAP, that means your master data doesn't exist because the same thing that happens in this facility is different here. And every company faces this challenge. If they implement more than one of a system the specialty fields get used by different companies in different ways. >> The gold standard, the gold version. >> The golden version. So I built a team by bringing together all the different international teams, and created one team that was able to integrate best practices and standards around data governance, data quality. Built BI teams for each of the regions, and then a data science and advanced analytics team. >> Wow, so okay, so that makes you uniquely qualified to coach here at the conference. >> Oh, I don't know about that. (laughs) There are some real, there are some geniuses here. >> No but, I say that because these are your peeps. >> Yes, they are, they are. >> And so, you're a practitioner, this conference is all about practitioners talking to practitioners, it's content-heavy, There's not a lot of fluff. Lunches aren't sponsored, there's no lanyard sponsor and it's not like, you know, there's very subtle sponsor desks, you have to have sponsors 'cause otherwise the conference's not enabled, and you've got costs associated with it. But it's a very intimate event and I think you guys want to keep it that way. >> And I really believe you're dead-on. When you go to most industry conferences, the industry conferences, the sponsors, you know, change the format or are heavily into the format. Here you have industry thought leaders from all over the globe. CDOs of major Fortune 500 companies who are working with their peers and exchanging ideas. I've had conversations with a number of CDOs and the thought leadership at this conference, I've never seen this type of thought leadership in any conference. >> Yeah, I mean the percentage of presentations by practitioners, even when there's a vendor name, they have a practitioner, you know, internal practitioner presenting so it's 99.9% which is why people attend. We're moving venues next year, I understand. Just did a little tour of the new venue, so, going to be able to accommodate more attendees, so that's great. >> Yeah it is. >> So what are your objectives in thinking ahead a year from now? >> Well, you know, I'm taking over from my current peer, Dr. Arka Mukherjee, who just did a phenomenal job of finding speakers. People who are in the industry, who are presenting challenges, and allowing others to interact. So I hope could do a similar thing which is, find with my peers people who have real world challenges, bring them to the forum so they can be debated. On top of that, there are some amazing, you know, technology change is just so fast. One of the areas like big data I remember only five years ago the chart of big data vendors maybe had 50 people on it, now you would need the table to put all the vendors. >> Who's not a data vendor, you know? >> Who's not a data vendor? (laughs) So I would think the best thing we could do is, is find, just get all the CDOs and CDO-types into a room, and let us debate and talk about these points and issues. I've seen just some tremendous interactions, great questions, people giving advice to others. I've learned a lot here. >> And how about long term, where do you see this going? How many CDOs are there in the world, do you know? Is that a number that's known? >> That's a really interesting point because, you know, only five years ago there weren't that many CDOs to be called. And then Gartner four years ago or so put out an article saying, "Every company really should have a CDO." Not just for the purpose of advancing your data, and to Doug Laney's point that data is being monetized, there's a need to have someone responsible for information 'cause we're in the Information Age. And a CIO really is focused on infrastructure, making sure I've got my PCs, making sure I've got a LAN, I've got websites. The focus on data has really, because of the Information Age, has turned data into an asset. So organizations realize, if you utilize that asset, let me reverse this, if you don't use data as an asset, you will be out of business. I heard a quote, I don't know if it's true, "Only 10 years ago, 250 of the Fortune 10 no longer exists." >> Yeah, something like that, the turnover's amazing. >> Many of those companies were companies that decided not to make the change to be data-enabled, to make data decision processing. Companies still use data warehouses, they're always going to use them, and a warehouse is a rear-view mirror, it tells you what happened last week, last month, last year. But today's businesses work forward-looking. And just like driving a car, it'd be really hard to drive your car through a rear-view mirror. So what companies are doing today are saying, "Okay, let's start looking at this as forward-looking, "a prescriptive and predictive analytics, "rather than just what happened in the past." I'll give you an example. In a major company that is a supplier of consumer products, they were leading in the industry and their sales started to drop, and they didn't know why. Well, with a data science team, we were able to determine by pulling in data from the CDC, now these are sources that only 20 years ago nobody ever used to bring in data in the enterprise, now 60% of your data is external. So we brought in data from the CDC, we brought in data on maternal births from the national government, we brought in data from the Census Bureau, we brought in data from sources of advertising and targeted marketing towards mothers. Pulled all that data together and said, "Why are diaper sales down?" Well they were targeting the large regions of the country and putting ads in TV stations in New York and California, big population centers. Birth rates in population centers have declined. Birth rates in certain other regions, like the south, and the Bible Belt, if I can call it that, have increased. So by changing the marketing, their product sales went up. >> Advertising to Texas. >> Well, you know, and that brings to one of the points, I heard a lecture today about ethics. We made it a point at Walmart that if you ran a query that reduced a result to less than five people, we wouldn't allow you to see the result. Because, think about it, I could say, "What is my neighbor buying? "What are you buying?" So there's an ethical component to this as well. But that, you know, data is not political. Data is not chauvinistic. It doesn't discriminate, it just gives you facts. It's the interpretation of that that is hard CDOs, because we have to say to someone, "Look, this is the fact, and your 25 years "of experience in the business, "granted, is tremendous and it's needed, "but the facts are saying this, "and that would mean that the business "would have to change its direction." And it's hard for people to do, so it requires that. >> So whether it's called the chief data officer, whatever the data czar rubric is, the head of analytics, there's obviously the data quality component there whatever that is, this is the conference for, as I called them, your peeps, for that role in the organization. People often ask, "Will that role be around?" I think it's clear, it's solidifying. Yes, you see the chief digital officer emerging and there's a lot of tailwinds there, but the information quality component, the data architecture component, it's here to stay. And this is the premiere conference, the premiere event, that I know of anyway. There are a couple of others, perhaps, but it's great to see all the success. When I first came here in 2013 there were probably about 130 folks here. Today, I think there were 500 people registered almost. Next year, I think 600 is kind of the target, and I think it's very reasonable with the new space. So congratulations on all the success, and thank you for stepping up to the co-chair role, I really appreciate it. >> Well, let me tell you I thank you guys. You provide a voice at these IT conferences that we really need, and that is the ability to get the message out. That people do think and care, the industry is not thoughtless and heartless. With all the data breaches and everything going on there's a lot of fear, fear, loathing, and anticipation. But having your voice, kind of like ESPN and a sports show, gives the technology community, which is getting larger and larger by the day, a voice and we need that so, thank you. >> Well thank you, Robert. We appreciate that, it was great to have you on. Appreciate the time. >> Great to be here, thank you. >> All right, and thank you for watching. We'll be right back with out next guest as we wrap up day two of MIT CDOIQ. You're watching theCUBE. (futuristic music)

Published Date : Aug 1 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and also the co-chair of next year's, give us a little history of your career. So he searched the industry to find somebody (laughs) Hard to count. 10 years to build Fusion with SOA? and the returns showed here So what did you do after? and the third guy says, And in the center was the Cisco telecommunications and the heads of Sam's Club and the like, Well the father goes in to buy the diapers for the baby, (laughs) So the planogramming, the way the shelves were organized, and created one team that was able to integrate so that makes you uniquely qualified to coach here There are some real, there are some geniuses here. and it's not like, you know, the industry conferences, the sponsors, you know, Yeah, I mean the percentage of presentations by One of the areas like big data I remember just get all the CDOs and CDO-types into a room, because of the Information Age, and the Bible Belt, if I can call it that, have increased. It's the interpretation of that that is hard CDOs, the data architecture component, it's here to stay. and that is the ability to get the message out. We appreciate that, it was great to have you on. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
PeterPERSON

0.99+

WalmartORGANIZATION

0.99+

Peter AikenPERSON

0.99+

Robert AbatePERSON

0.99+

RobertPERSON

0.99+

Procter & GambleORGANIZATION

0.99+

CiscoORGANIZATION

0.99+

IndiaLOCATION

0.99+

MumbaiLOCATION

0.99+

Census BureauORGANIZATION

0.99+

2010DATE

0.99+

1996DATE

0.99+

New YorkLOCATION

0.99+

last weekDATE

0.99+

last yearDATE

0.99+

last monthDATE

0.99+

60%QUANTITY

0.99+

BloomingdaleORGANIZATION

0.99+

Next yearDATE

0.99+

1999DATE

0.99+

TexasLOCATION

0.99+

25 yearsQUANTITY

0.99+

10 yearsQUANTITY

0.99+

one roomQUANTITY

0.99+

2014DATE

0.99+

2013DATE

0.99+

Doug LaneyPERSON

0.99+

Sam's ClubORGANIZATION

0.99+

ICICI BankORGANIZATION

0.99+

99.9%QUANTITY

0.99+

Wiley & SonsORGANIZATION

0.99+

50 peopleQUANTITY

0.99+

Arka MukherjeePERSON

0.99+

next yearDATE

0.99+

JosPERSON

0.99+

TodayDATE

0.99+

third guyQUANTITY

0.99+

2000DATE

0.99+

todayDATE

0.99+

oneQUANTITY

0.99+

500 peopleQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

one channelQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

OracleORGANIZATION

0.99+

eachQUANTITY

0.99+

OneQUANTITY

0.99+

CDCORGANIZATION

0.99+

less than five peopleQUANTITY

0.99+

Ralph LaurenORGANIZATION

0.99+

one guyQUANTITY

0.99+

six packQUANTITY

0.99+

ESPNORGANIZATION

0.99+

four years agoDATE

0.98+

AfricaLOCATION

0.98+

SOATITLE

0.98+

five years agoDATE

0.98+

CaliforniaLOCATION

0.98+

GartnerORGANIZATION

0.98+

three industry expertsQUANTITY

0.98+

Global IDSORGANIZATION

0.98+

Four years laterDATE

0.98+

600QUANTITY

0.98+

20 years agoDATE

0.98+

East CoastLOCATION

0.98+

250QUANTITY

0.98+

Middle EastLOCATION

0.98+

four yearsQUANTITY

0.98+

one teamQUANTITY

0.97+

monthsQUANTITY

0.97+

firstQUANTITY

0.97+

about two monthsQUANTITY

0.97+

Latin AmericaLOCATION

0.97+

Aaron Kalb, Alation | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. (dramatic music) >> Welcome back to Cambridge, Massachusetts, everybody. This is theCUBE, the leader in live tech coverage. We go out to the events, and we extract the signal from then noise. And, we're here at the MIT CDOIQ, the Chief Data Officer conference. I'm Dave Vellante with my cohost Paul Gillin. Day two of our wall to wall coverage. Aaron Kalb is here. He's the cofounder and chief data officer of Alation. Aaron, thanks for making the time to come on. >> Thanks so much Dave and Paul for having me. >> You're welcome. So, words matter, you know, and we've been talking about data, and big data, and the three Vs, and data is the new oil, and all this stuff. You gave a talk this week about, you know, "We're maybe not talking the right language "when it comes to data." What did you mean by all that? >> Absolutely, so I get a little bit frustrated by some of these cliques we hear at conference after conference, and the one I, sort of, took aim at in this talk is, data is the new oil. I think what people want to invoke with that is to say, in the same way that oil powered the industrial age, data's powering the information age. Just saying, data's really cool and trendy and important. That's true, but there are a lot of other associations and contexts that people have with data, and some of them don't really apply as, I'm sorry, with oil. And, some of them apply, as well, to data. >> So, is data more valuable than oil? >> Well, I think they're each valuable in different ways, but I think there's a couple issues with the metaphor. One is that data is scarce and dwindling, and part of value comes from the fact that it's so rare. Whereas, the experience with data is that it's so plentiful and abundant, we're almost drowning in it. And so, what I contend is, instead of talking about data as compared to oil, we should talk about data compared to water. And, the idea is, you know, water is very plentiful on the planet, but sometimes, you know, if you have saltwater or contaminated water, you can't drink it. Water is good for different purposes, depending on its form, and so it's all about getting the right data for the right purpose, like water. >> Well, we've certainly, at least in my opinion, fought wars, Paul, over oil. >> And, over water. >> And, certainly, conflicts over water. Do you think we'll be fighting wars over data? Or, are we already? >> No, we might be. One of my favorite talks from the sessions here was a keynote by the CDO for the Department of Defense, who was talking about, you know, the civic duty about transparency but was observing that, actually, more IP addresses from China and Russia are looking at our public datasets than from within the country. So, you know, it's definitely a resource that can be very powerful. >> So, what was the reaction to your premise from the audience. What kind of questions did you get? >> You know, people actually responded very favorably, including some folks from the oil and gas industry, which I was pleased to find. We have a lot of customers in energy, so that was cool. But, what it was nice being here at MIT and just really geeking out about language and linguistics and data with a bunch of CDOs and other people who are, kind of, data intellectuals. >> Right, so if data is not the new oil. >> And, water isn't really a good analogy either, because the supply of water is finite. >> That's true. >> So, what is data? >> Yeah. >> Space? >> Yeah, it's a good point. >> Matter? >> Maybe it is like the universe in that it's always expanding, right, somehow. Right, because any thing, any physic which is on the planet probably won't be growing at that exponential speed. >> So, give us the punchline. >> Well, so I contend that water, while imperfect, is, actually, a really good metaphor that helps for a lot of things. It has properties like the fact that if it's a data quality issue, it flows downstream like pollution in a river. It's the fact that it can come in different forms, useful for different purposes. You might have gray water, right, which is good enough for, you know, irrigation or industrial purposes, but not safe to drink. And so, you rely on metadata to get the data that's in the right form. And, you know, the talk is more fun because you've a lot of visual examples that make this clear. >> Yeah, of course, yeah. >> I actually had one person in the audience say that he used a similar analogy in his own company, so it's fun to trade notes. >> So, chief data officer is a relatively new title for you, is it not? In terms of your role at Alation. >> Yeah, that's right, and the most fun thing about my job is being able to interact with all of the other CDOs and CDAOs at a conference like this. And, it was cool to see. I believe this conference doubled since the last year. Is that right? >> No. >> No, it's up about a hundred, though. >> Right. >> Well. >> And, it's about double from three years ago. >> And, when we first started, in 2013, yeah. >> 130 people, yeah. >> Yeah, it was a very small and intimate event. >> Yeah, here we're outgrowing this building, it seems. >> Yeah, they're kicking us out. >> I think what's interesting is, you know, if we do a little bit of analysis, this is a small data, within our own company, you know, our biggest and most visionary customers typically bought Alation. The buyer champion either was a CDO or they weren't a CDO when they bought the software and have since been promoted to be a CDO. And so, seeing this trend of more and more CDOs cropping up is really exciting for us. And also, just hearing all of the people at the conference saying, two trends we're hearing. A move from, sort of, infrastructure and technology to driving business value, and a move from defense and governance to, sort of, playing offense and doing revenue generation with data. Both of those trends are really exciting for us. >> So, don't hate me for asking this question, because what a lot of companies will do is, they'll give somebody a CDO title, and it's, kind of, a little bit of gimmick, right, to go to market. And, they'll drag you into sales, because I'm sure they do, as a cofounder. But, as well, I know CDOs at tech companies that are actually trying to apply new techniques, figure out how data contributes to their business, how they can cut costs, raise revenue. Do you have an internal role, as well? >> Absolutely, yeah. >> Explain that. >> So, Alation, you know, we're about 250 people, so we're not at the same scale as many of the attendees here. But, we want to learn, you know, from the best, and always apply everything that we learn internally as well. So, obviously, analytics, data science is a huge role in our internal operations. >> And so, what kinds of initiatives are you driving internally? Is it, sort of, cost initiatives, efficiency, innovation? >> Yeah, I think it's all of the above, right. Every single division and both in the, sort of, operational efficiency and cost cutting side as well as figuring out the next big bet to make, can be informed by data. And, our goal was to empower a curious and rational world, and our every decision be based not on the highest paid person's opinion, but on the best evidence possible. And so, you know, the goal of my function is largely to enable that both centrally and within each business unit. >> I want to talk to you about data catalogs a bit because it's a topic close to my heart. I've talked to a lot of data catalog companies over the last couple years, and it seems like, for one thing, the market's very crowded right now. It seems to me. Would you agree there are a lot of options out there? >> Yeah, you know, it's been interesting because when we started it, we were basically the first company to make this technology and to, kind of, use this term, data catalog, in this way. And, it's been validating to see, you know, a lot of big players and other startups even, kind of, coming to that terminology. But, yeah, it has gotten more crowded, and I think our customers who, or our prospects, used to ask us, you know, "What is it that you do? "Explain this catalog metaphor to me," are now saying, "Yeah, catalogs, heard about that." >> It doesn't need to be defined anymore. >> "Which one should I pick? "Why you?" Yeah. >> What distinguished one product from another, you know? What are the major differentiation points? >> Yeah, I think one thing that's interesting is, you know, my talk was about how the metaphors we use shape the way we think. And, I think there's a sense in which, kind of, the history of each company shapes their philosophy and their approach, so we've always been a data catalog company. That's our one product. Some of the other catalog vendors come from ETL background, so they're a lot more focused on technical metadata and infrastructure. Some of the catalog products grew out of governance, and so it's, sort of, governance first, no sorry, defense first and then offense secondary. So, I think that's one of the things, I think, we encourage our prospects to look at, is, kind of, the soul of the company and how that affects their decisions. The other thing is, of course, technology. And, what we at Alation are really excited about, and it's been validating to hear Gartner and others and a lot of the people here, like the GSK keynote speaker yesterday, talking about the importance of comprehensiveness and on taking a behavioral approach, right. We have our Behavioral IO technology that really says, "Let's not look at all the bits and the bytes, "but how are people using the data to drive results?" As our core differentiator. >> Do your customers generally standardize on one data catalog, or might they have multiple catalogs for multiple purposes? >> Yeah, you know, we heard a term more last season, of catalog of catalogs, you know. And, people here can get arbitrarily, you know, meta, meta, meta data, where we like to go there. I think the customers we see most successful tend to have one catalog that serves this function of the single source of reference. Many of our customers will say, you know, that their catalog serves as, sort of, their internal Google for data. Or, the one stop shop where you could find everything. Even though they may have many different sources, Typically you don't want to have siloed catalogs. It makes it harder to find what you're looking for. >> Let's play a little word association with some metaphors. Data lake. (laughter) >> Data lake's another one that I sort of hate. If you think about it, people had data warehouses and didn't love them, but at least, when you put something into a warehouse, you can get it out, right. If you throw something into a lake, you know, there's really no hope you're ever going to find it. It's probably not going to be in great shape, and we're not surprised to find that many folks who invested heavily in data lakes are now having to invest in a layer over it, to make it comprehensible and searchable. >> So, yeah, the lake is where we hide the stolen cars. Data swamp. >> Yeah, I mean, I think if your point is it's worse than lake, it works. But, I think we can do better a lake, right. >> How about data ocean? (laughter) >> You know, out of respect for John Furrier, I'll say it's fantastic. But, to us we think, you know, it isn't really about the size. The more data you have, people think the more data the better. It's actually the more data the worse unless you have a mechanism for finding the little bit of data that is relevant and useful for your task and put it to use. >> And to, want to set up, enter the catalog. So, technically, how does the catalog solve that problem? >> Totally, so if we think about, maybe let's go to the warehouse, for example. But, it works just as well on a data lake in practice. >> Yeah, cool. >> Through the catalog is. It starts with the inventory, you know, what's on every single shelf. But, if you think about what Amazon has done, they have the inventory warehouse in the back, but what you see as a consumer is a simple search interface, where you type in the word of the product you're looking for. And then, you see ranked suggestions for different items, you know, toasters, lamps, whatever, books I want to buy. Same thing for data. I can type in, you know, if I'm at the DOD, you know, information about aircraft, or information about, you know, drug discovery if I'm at GSK. And, I should be able to therefore see all of the different data sets that I have. And, that's true in almost any catalog, that you can do some search over the curated data sets there. With Alation in particular, what I can see is, who's using it, how are they using it, what are they joining it with, what results do they find in that process. And, that can really accelerate the pace of discovery. >> Go ahead. >> I'm sorry, Dave. To what degree can you automate some of that detail, like who's using it and what it's being used for. I mean, doesn't that rely on people curating the catalog? Or, to what degree can you automate that? >> Yeah, so it's a great question. I think, sometimes, there's a sense with AI or ML that it's like the computer is making the decisions or making things up. Which is, obviously, very scary. Usually, the training data comes from humans. So, our goal is to learn from humans in two ways. There's learning from humans where humans explicitly teach you. Somebody goes and says, "This is goal standard data versus this is, "you know, low quality data." And, they do that manually. But, there's also learning implicitly from people. So, in the same way on amazon.com, if I buy one item and then buy another, I'm doing that for my own purposes, but Amazon can do collaborative filtering over all of these trends and say, "You might want to buy this item." We can do a similar thing where we parse the query logs, parse the usage logs and be eye tools, and can basically watch what people are doing for their own purposes. Not to, you know, extra work on top of their job to help us. We can learn from that and make everybody more effective. >> Aaron, is data classification a part of all this? Again, when we started in the industry, data classification was a manual exercise. It's always been a challenge. Certainly, people have applied math to it. You've seen support vector machines and probabilistic latent cement tech indexing being used to classify data. Have we solved that problem, as an industry? Can you automate the classification of data on creation or use at this point in time? >> Well, one thing that came up in a few talks about AI and ML here is, regardless of the algorithm you're using, whether it's, you know, IFH or SVM, or something really modern and exciting that keeps learning. >> Stuff that's been around forever or, it's like you say, some new stuff, right. >> Yeah, you know, actually, I think it was said best by Michael Collins at the DOD, that data is more important than the algorithm because even the best algorithm is useless without really good training data. Plus, the algorithm's, kind of, everyone's got them. So, really often, training data is the limiting reactant in getting really good classification. One thing we try to do at Alation is create an upward spiral where maybe some data is curated manually, and then we can use that as a seed to make some suggestions about how to label other data. And then, it's easier to just do a confirm or deny of a guess than to actually manually label everything. So, then you get more training, get it faster, and it kind of accelerates that way instead of being a big burden. >> So, that's really the advancement in the last five to what, five, six years. Where you're able to use machine intelligence to, sort of, solve that problem as opposed to brute forcing it with some algorithm. Is that fair? >> Yeah, I think that's right, and I think what gets me very excited is when you can have these interactive loops where the human helps the computer, which helps the human. You get, again, this upward spiral. Instead of saying, "We have to have all of this, "you know, manual step done "before we even do the first step," or trying to have an algorithm brute force it without any human intervention. >> It's kind of like notes key mode on write, except it actually works. I'm just kidding to all my ADP friends. All right, Aaron, hey. Thanks very much for coming on theCUBE, but give your last word on the event. I think, is this your first one or no? >> This is our first time here. >> Yeah, okay. So, what are your thoughts? >> I think we'll be back. It's just so exciting to get people who are thinking really big about data but are also practitioners who are solving real business problems. And, just the exchange of ideas and best practices has been really inspiring for me. >> Yeah, that's great. >> Yeah. >> Well, thank you for the support of the event, and thanks for coming on theCUBE. It was great to see you again. >> Thanks Dave, thanks Paul. >> All right, you're welcome. >> Thank you, sir. >> All right, keep it right there, everybody. We'll be back with our next guest right after this short break. You're watching theCUBE from MIT CDOIQ. Be right back. (upbeat music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. Aaron, thanks for making the time to come on. and data is the new oil, and all this stuff. in the same way that oil powered the industrial age, And, the idea is, you know, water is very plentiful Well, we've certainly, at least in my opinion, Do you think we'll be fighting wars over data? So, you know, it's definitely a resource What kind of questions did you get? We have a lot of customers in energy, so that was cool. because the supply of water is finite. Maybe it is like the universe And, you know, the talk is more fun because you've a lot I actually had one person in the audience say So, chief data officer is a relatively Yeah, that's right, and the most fun thing I think what's interesting is, you know, And, they'll drag you into sales, But, we want to learn, you know, from the best, And so, you know, the goal of my function I want to talk to you about data catalogs a bit And, it's been validating to see, you know, "Which one should I pick? Yeah, I think one thing that's interesting is, you know, Or, the one stop shop where you could find everything. Data lake. when you put something into a warehouse, So, yeah, the lake is where we hide the stolen cars. But, I think we can do better a lake, right. But, to us we think, you know, So, technically, how does the catalog solve that problem? maybe let's go to the warehouse, for example. I can type in, you know, if I'm at the DOD, you know, Or, to what degree can you automate that? Not to, you know, extra work on top of their job to help us. Can you automate the classification of data whether it's, you know, IFH or SVM, or something it's like you say, some new stuff, right. Yeah, you know, actually, I think it was said best in the last five to what, five, six years. when you can have these interactive loops I'm just kidding to all my ADP friends. So, what are your thoughts? And, just the exchange of ideas It was great to see you again. We'll be back with our next guest

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Michael CollinsPERSON

0.99+

Paul GillinPERSON

0.99+

PaulPERSON

0.99+

AmazonORGANIZATION

0.99+

DavePERSON

0.99+

2013DATE

0.99+

Aaron KalbPERSON

0.99+

Dave VellantePERSON

0.99+

AaronPERSON

0.99+

fiveQUANTITY

0.99+

Department of DefenseORGANIZATION

0.99+

six yearsQUANTITY

0.99+

John FurrierPERSON

0.99+

amazon.comORGANIZATION

0.99+

yesterdayDATE

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

AlationPERSON

0.99+

AlationORGANIZATION

0.99+

GartnerORGANIZATION

0.99+

one itemQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

first stepQUANTITY

0.99+

last yearDATE

0.99+

GSKORGANIZATION

0.99+

bothQUANTITY

0.99+

DODORGANIZATION

0.99+

one personQUANTITY

0.99+

GoogleORGANIZATION

0.99+

130 peopleQUANTITY

0.98+

OneQUANTITY

0.98+

first timeQUANTITY

0.98+

MITORGANIZATION

0.98+

one productQUANTITY

0.97+

three years agoDATE

0.97+

this weekDATE

0.97+

twoQUANTITY

0.97+

MIT CDOIQORGANIZATION

0.96+

MIT Chief Data Officer andEVENT

0.96+

one data catalogQUANTITY

0.96+

eachQUANTITY

0.96+

each companyQUANTITY

0.95+

BothQUANTITY

0.95+

one thingQUANTITY

0.95+

first oneQUANTITY

0.94+

one catalogQUANTITY

0.93+

two trendsQUANTITY

0.93+

theCUBEORGANIZATION

0.93+

firstQUANTITY

0.92+

first companyQUANTITY

0.92+

last couple yearsDATE

0.92+

CDOORGANIZATION

0.91+

about a hundredQUANTITY

0.91+

single shelfQUANTITY

0.88+

about 250 peopleQUANTITY

0.88+

single sourceQUANTITY

0.87+

ChinaLOCATION

0.87+

2019DATE

0.86+

Day twoQUANTITY

0.86+

oneQUANTITY

0.85+

each business unitQUANTITY

0.82+

MIT CDOIQEVENT

0.79+

ADPORGANIZATION

0.79+

couple issuesQUANTITY

0.76+

Information Quality Symposium 2019EVENT

0.76+

One thingQUANTITY

0.7+

single divisionQUANTITY

0.69+

one stopQUANTITY

0.68+

RussiaLOCATION

0.64+

threeQUANTITY

0.61+

doubleQUANTITY

0.59+

favoriteQUANTITY

0.5+

CDOIQEVENT

0.46+

ChiefPERSON

0.42+

Jeanne Ross, MIT CISR | MIT CDOIQ 2019


 

(techno music) >> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. >> Welcome back to MIT CDOIQ. The CDO Information Quality Conference. You're watching theCUBE, the leader in live tech coverage. My name is Dave Vellante. I'm here with my co-host, Paul Gillin. This is our day two of our two day coverage. Jean Ross is here. She's the principle research scientist at MIT CISR, Jean good to see you again. >> Nice to be here! >> Welcome back. Okay, what do all these acronyms stand for, I forget. MIT CISR. >> CISR which we pronounce scissor, is the Center for Information Systems Research. It's a research center that's been at MIT since 1974, studying how big companies use technology effectively. >> So and, what's your role as a research scientist? >> As a research scientist, I work with both researchers and with company leaders to understand what's going on out there, and try to present some simple succinct ideas about how companies can generate greater value from information technology. >> Well, I guess not much has changed in information technology since 1974. (laughing) So let's fast forward to the big, hot trend, digital transformation, digital business. What's the difference between a business and a digital business? >> Right now, you're hoping there's no difference for you and your business. >> (chuckling) Yeah, for sure. >> The main thing about a digital business is it's being inspired by technology. So in the past, we would establish a strategy, and then we would check out technology and say, okay, how can technology make us more effective with that strategy? Today, and this has been driven a lot by start-ups, we have to stop and say, well wait a minute, what is technology making possible? Because if we're not thinking about it, there sure are a lot of students at MIT who are, and we're going to miss the boat. We're going to get Ubered if you will, somebody's going to think of a value proposition that we should be offering and aren't, and we'll be left in the dust. So, our digital businesses are those that are recognizing the opportunities that digital technologies make possible. >> Now, and what about data? In terms of the role of digital business, it seems like that's an underpinning of a digital business. Is it not? >> Yeah, the single biggest capability that digital technologies provide, is ubiquitous data that's readily accessible anytime. So when we think about being inspired by technology, we could reframe that as inspired by the availability of ubiquitous data that's readily accessible. >> Your premise about the difference between digitization and digital business is interesting. It's more than just a sematic debate. Do companies now, when companies talk about digital transformation these days, in fact, are most of them of thinking of digitization rather than really transformative business change? >> Yeah, this is so interesting to me. In 2006, we wrote a book that said, you need to become more agile, and you need to rely on information technology to get you there. And these are basic things like SAP and salesforce.com and things like that. Just making sure that your core processes are disciplined and reliable and predictable. We said this in 2006. What we didn't know is that we were explaining digitization, which is very effective use of technology in your underlying process. Today, when somebody says to me, we're going digital, I'm thinking about the new value propositions, the implications of the data, right? And they're often actually saying they're finally doing what we thought they should do in 2006. The problem is, in 2006, we said get going on this, it's a long journey. This could take you six, 10 years to accomplish. And then we gave examples of companies that took six to 10 years. LEGO, and USAA and really great companies. And now, companies are going, "Ah, you know, we really ought to do that". They don't have six to 10 years. They get this done now, or they're in trouble, and it's still a really big deal. >> So how realistic is it? I mean, you've got big established companies that have got all these information silos, as we've been hearing for the last two days, just pulling their information together, knowing what they've got is a huge challenge for them. Meanwhile, you're competing with born on the web, digitally native start-ups that don't have any of that legacy, is it really feasible for these companies to reinvent themselves in the way you're talking about? Or should they just be buying the companies that have already done it? >> Well good luck with buying, because what happens is that when a company starts up, they can do anything, but they can't do it to scale. So most of these start-ups are going to have to sell themselves because they don't know anything about scale. And the problem is, the companies that want to buy them up know about the scale of big global companies but they don't know how to do this seamlessly because they didn't do the basic digitization. They relied on basically, a lot of heroes in their company to pull of the scale. So now they have to rely more on technology than they did in the past, but they still have a leg up if you will, on the start-up that doesn't want to worry about the discipline of scaling up a good idea. They'd rather just go off and have another good idea, right? They're perpetual entrepreneurs if you will. So if we look at the start-ups, they're not really your concern. Your concern is the very well run company, that's been around, knows how to be inspired by technology and now says, "Oh I see what you're capable of doing, "or should be capable of doing. "I think I'll move into your space". So this, the Amazon's, and the USAA's and the LEGO's who say "We're good at what we do, "and we could be doing more". We're watching Schneider Electric, Phillips's, Ferovial. These are big ole companies who get digital, and they are going to start moving into a lot of people's territory. >> So let's take the example of those incumbents that you've used as examples of companies that are leaning into digital, and presumably doing a good job of it, they've got a lot of legacy debt, as you know people call it technical debt. The question I have is how they're using machine intelligence. So if you think about Facebook, Amazon, Microsoft, Google, they own horizontal technologies around machine intelligence. The incumbents that you mentioned, do not. Now do they close the gap? They're not going to build their own A.I. They're going to buy it, and then apply it. It's how they apply it that's going to be the difference. So do you agree with that premise, and where are they getting it, do they have the skill sets to do it, how are they closing that gap? >> They're definitely partnering. When you say they're not going to build any of it, that's actually not quite true. They're going to build a lot around the edges. They'll rely on partners like Microsoft and Google to provide some of the core, >> Yes, right. >> But they are bringing in their own experts to take it to the, basically to the customer level. How do I take, let me just take Schneider Electric for an example. They have gone from being an electrical equipment manufacturer, to a purveyor of energy management solutions. It's quite a different value proposition. To do that, they need a lot of intelligence. Some of it is data analytics of old, and some of it is just better representation on dashboards and things like that. But there is a layer of intelligence that is new, and it is absolutely essential to them by relying on partners and their own expertise in what they do for customers, and then co-creating a fair amount with customers, they can do things that other companies cannot. >> And they're developing a software presumably, a SAS revenue stream as part of that, right? >> Yeah, absolutely. >> How about the innovators dilemma though, the problem that these companies often have grown up, they're very big, they're very profitable, they see disruption coming, but they are unable to make the change, their shareholders won't let them make the change, they know what they have to do, but they're simply not able to do it, and then they become paralyzed. Is there a -- I mean, looking at some of the companies you just mentioned, how did they get over that mindset? >> This is real leadership from CEO's, who basically explain to their boards and to their investors, this is our future, we are... we're either going this direction or we're going down. And they sell it. It's brilliant salesmanship, and it's why when we go out to study great companies, we don't have that many to choose from. I mean, they are hard to find, right? So you are at such a competitive advantage right now. If you understand, if your own internal processes are cleaned up and you know how to rely on the E.R.P's and the C.R.M's, to get that done, and on the other hand, you're using the intelligence to provide value propositions, that new technologies and data make possible, that is an incredibly powerful combination, but you have to invest. You have to convince your boards and your investors that it's a good idea, you have to change your talent internally, and the biggest surprise is, you have to convince your customers that they want something from you that they never wanted before. So you got a lot of work to do to pull this off. >> Right now, in today's economy, the economy is sort of lifting all boats. But as we saw when the .com implosion happened in 2001, often these breakdown gives birth to great, new companies. Do you see that the next recession, which is inevitably coming, will be sort of the turning point for some of these companies that can't change? >> It's a really good question. I do expect that there are going to be companies that don't make it. And I think that they will fail at different rates based on their, not just the economy, but their industry, and what competitors do, and things like that. But I do think we're going to see some companies fail. We're going to see many other companies understand that they are too complex. They are simply too complex. They cannot do things end to end and seamlessly and present a great customer experience, because they're doing everything. So we're going to see some pretty dramatic changes, we're going to see failure, it's a fair assumption that when we see the economy crash, it's also going to contribute, but that's, it's not the whole story. >> But when the .com blew up, you had the internet guys that actually had a business model to make money, and the guys that didn't, the guys that didn't went away, and then you also had the incumbents that embrace the internet, so when we came out of that .com downturn, you had the survivors, who was Google and eBay, and obviously Amazon, and then you had incumbent companies who had online retailing, and e-tailing and e-commerce etc, who thrived. I would suspect you're going to see something similar, but I wonder what you guys think. The street today is rewarding growth. And we got another near record high today after the rate cut yesterday. And so, but companies that aren't making money are getting rewarded, 'cause they're growing. Well when the recession comes, those guys are going to get crushed. >> Right. >> Yeah. >> And you're going to have these other companies emerge, and you'll see the winners, are going to be those ones who have truly digitized, not just talking the talk, or transformed really, to use your definition. That's what I would expect. I don't know, what do you think about that? >> I totally agree. And, I mean, we look at industries like retail, and they have been fundamentally transformed. There's still lots of opportunities for innovation, and we're going to see some winners that have kind of struggled early but not given up, and they're kind of finding their footing. But we're losing some. We're losing a lot, right? I think the surprise is that we thought digital was going to replace what we did. We'd stop going to stores, we'd stop reading books, we wouldn't have newspapers anymore. And it hasn't done that. Its only added, it hasn't taken anything away. >> It could-- >> I don't think the newspaper industry has been unscathed by digital. >> No, nor has retail. >> Nor has retail, right. >> No, no no, not unscathed, but here's the big challenge. Is if I could substitute, If I could move from newspaper to online, I'm fine. You don't get to do that. You add online to what you've got, right? And I think this right now is the big challenge. Is that nothing's gone away, at least yet. So we have to sustain the business we are, so that it can feed the business we want to be. And we have to make that transition into new capabilities. I would argue that established companies need to become very binary, that there are people that do nothing but sustain and make better and better and better, who they are. While others, are creating the new reality. You see this in auto companies by the way. They're creating not just the autonomous automobiles, but the mobility services, the whole new value propositions, that will become a bigger and bigger part of their revenue stream, but right now are tiny. >> So, here's the scary thing to me. And again, I'd love to hear your thoughts on this. And I've been an outspoken critic of Liz Warren's attack on big tech. >> Absolutely. >> I just think if they're breaking the law, and they're really acting like monopolies, the D.O.J and F.T.C should do something, but to me, you don't just break up big tech because they're good capitalists. Having said that, one of the things that scares me is, when you see Apple getting into payment systems, Amazon getting into grocery and logistics. Digital allows you to do something that's never happened before which is, you can traverse industries. >> Yep. >> Yeah, absolutely >> You used to have this stack of industries, and if you were in that industry, you're stuck in healthcare, you're stuck in financial services or whatever it was. And today, digital allows you to traverse those. >> It absolutely does. And so in theory, Amazon and Apple and Facebook and Google, they can attack virtually any industry and they kind of are. >> Yeah they kind are. I would certainly not break up anything. I would really look hard though at acquisitions, because I think that's where some of this is coming from. They can stop the overwhelming growth, but I do think you're right. That you get these opportunities from digital that are just so much easier because they're basically sharing information and technology, not building buildings and equipment and all that kind of thing. But I think there all limits to all this. I do not fear these companies. I think there, we need some law, we need some regulations, they're fine. They are adding a lot of value and the great companies, I mean, you look at the Schneider's and the Phillips, yeah they fear what some of them can do, but they're looking forward to what they provide underneath. >> Doesn't Cloud change the equation here? I mean, when you think of something like Amazon getting into the payments business, or Google in the payments business, you know it used to be that the creating of global payments processing network, just going global was a huge barrier to entry. Now, you don't have nearly that same level of impediment right? I mean the cloud eliminates much of the traditional barrier. >> Yeah, but I'll tell you what limits it, is complexity. Every company we've studied gets a little over anxious and becomes too complex, and they cannot run themselves effectively anymore. It happens to everyone. I mean, remember when we were terrified about what Microsoft was going to become? But then it got competition because it's trying to do so many things, and somebody else is offering, Sales Force and others, something simpler. And this will happen to every company that gets overly ambitious. Something simpler will come along, and everybody will go "Oh thank goodness". Something simpler. >> Well with Microsoft, I would argue two things. One is the D.O.J put some handcuffs on them , and two, with Steve Ballmer, I wouldn't get his nose out of Windows, and then finally stuck on a (mumbles) (laughter) >> Well it's they had a platform shift. >> Well this is exactly it. They will make those kind of calls . >> Sure, and I think that talks to their legacy, that they won't end up like Digital Equipment Corp or Wang and D.G, who just ignored the future and held onto the past. But I think, a colleague of ours, David Moschella wrote a book, it's called "Seeing Digital". And his premise was we're moving from a world of remote cloud services, to one where you have to, to use your word, ubiquitous digital services that you can access upon which you can build your business and new business models. I mean, the simplest example is Waves, you mentioned Uber. They're using Cloud, they're using OAuth.in with Google, Facebook or LinkedIn and they've got a security layer, there's an A.I layer, there's all your BlockChain, mobile, cognitive, it's all these sets of services that are now ubiquitous on which you're building, so you're leveraging, he calls it the matrix, to the extent that these companies that you're studying, these incumbents can leverage that matrix, they should be fine. >> Yes. >> The part of the problem is, they say "No, we're going to invent everything ourselves, we're going to build it all ourselves". To use Andy Jassy's term, it's non-differentiated heavy lifting, slows them down, but there's no reason why they can't tap that matrix, >> Absolutely >> And take advantage of it. Where I do get scared is, the Facebooks, Apples, Googles, Amazons, they're matrix companies, their data is at their core, and they get this. It's not like they're putting data around the core, data is the core. So your thoughts on that? I mean, it looks like your slide about disruption, it's coming. >> Yeah, yeah, yeah, yeah. >> No industry is safe. >> Yeah, well I'll go back to the complexity argument. We studied complexity at length, and complexity is a killer. And as we get too ambitious, and we're constantly looking for growth, we start doing things that create more and more tensions in our various lines of business, causes to create silos, that then we have to coordinate. I just think every single company that, no cloud is going to save us from this. It, complexity will kill us. And we have to keep reminding ourselves to limit that complexity, and we've just not seen the example of the company that got that right. Sooner or later, they just kind of chop them, you know, create problems for themselves. >> Well isn't that inherent though in growth? >> Absolutely! >> It's just like, big companies slow down. >> That's right. >> They can't make decisions as quickly. >> That's right. >> I haven't seen a big company yet that moves nimbly. >> Exactly, and that's the complexity thing-- >> Well wait a minute, what about AWS? They're a 40 billion dollar company. >> Oh yeah, yeah, yeah >> They're like the agile gorilla. >> Yeah, yeah, yeah. >> I mean, I think they're breaking the rule, and my argument would be, because they have data at their core, and they've got that, its a bromide, but that common data model, that they can apply now to virtually any business. You know, we're been expecting, a lot of people have been expecting that growth to attenuate. I mean it hasn't yet, we'll see. But they're like a 40 billion dollar firm-- >> No that's a good example yeah. >> So we'll see. And Microsoft, is the other one. Microsoft is demonstrating double digit growth. For such a large company, it's astounding. I wonder, if the law of large numbers is being challenged, so. >> Yeah, well it's interesting. I do think that what now constitutes "so big" that you're really going to struggle with the complexity. I think that has definitely been elevated a lot. But I still think there will be a point at which human beings can't handle-- >> They're getting away. >> Whatever level of complexity we reach, yeah. >> Well sure, right because even though this great new, it's your point. Cloud technology, you know, there's going to be something better that comes along. Even, I think Jassy might have said, If we had to do it all over again, we would have built the whole thing on lambda functions >> Yeah. >> Oh, yeah. >> Not on, you know so there you go. >> So maybe someone else does that-- >> Yeah, there you go. >> So now they've got their hybrid. >> Yeah, yeah. >> Yeah, absolutely. >> You know maybe it'll take another ten years, but well Jean, thanks so much for coming to theCUBE, >> it was great to have you. >> My pleasure! >> Appreciate you coming back. >> Really fun to talk. >> All right, keep right there everybody, Paul Gillin and Dave Villante, we'll be right back from MIT CDOIQ, you're watching theCUBE. (chuckles) (techno music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. Jean good to see you again. Okay, what do all these acronyms stand for, I forget. is the Center for Information Systems Research. to understand what's going on out there, So let's fast forward to the big, hot trend, for you and your business. We're going to get Ubered if you will, Now, and what about data? Yeah, the single biggest capability and digital business is interesting. information technology to get you there. to reinvent themselves in the way you're talking about? and they are going to start moving into It's how they apply it that's going to be the difference. They're going to build a lot around the edges. and it is absolutely essential to them I mean, looking at some of the companies you just mentioned, and the biggest surprise is, you have to convince often these breakdown gives birth to great, new companies. I do expect that there are going to be companies and then you also had the incumbents I don't know, what do you think about that? and they have been fundamentally transformed. I don't think the newspaper industry so that it can feed the business we want to be. So, here's the scary thing to me. but to me, you don't just break up big tech and if you were in that industry, they can attack virtually any industry and they kind of are. But I think there all limits to all this. I mean, when you think of something like and they cannot run themselves effectively anymore. One is the D.O.J put some handcuffs on them , Well this is exactly it. Sure, and I think that talks to their legacy, The part of the problem is, they say data is the core. that then we have to coordinate. Well wait a minute, what about AWS? that growth to attenuate. And Microsoft, is the other one. I do think that what now constitutes "so big" that you're there's going to be something better that comes along. Paul Gillin and Dave Villante,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

Paul GillinPERSON

0.99+

AmazonORGANIZATION

0.99+

David MoschellaPERSON

0.99+

FacebookORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Jean RossPERSON

0.99+

2006DATE

0.99+

sixQUANTITY

0.99+

Steve BallmerPERSON

0.99+

Jeanne RossPERSON

0.99+

Liz WarrenPERSON

0.99+

LEGOORGANIZATION

0.99+

AppleORGANIZATION

0.99+

Schneider ElectricORGANIZATION

0.99+

Dave VillantePERSON

0.99+

AmazonsORGANIZATION

0.99+

GooglesORGANIZATION

0.99+

JeanPERSON

0.99+

FacebooksORGANIZATION

0.99+

PhillipsORGANIZATION

0.99+

USAAORGANIZATION

0.99+

Center for Information Systems ResearchORGANIZATION

0.99+

ApplesORGANIZATION

0.99+

Andy JassyPERSON

0.99+

AWSORGANIZATION

0.99+

FerovialORGANIZATION

0.99+

Digital Equipment CorpORGANIZATION

0.99+

2001DATE

0.99+

1974DATE

0.99+

two dayQUANTITY

0.99+

twoQUANTITY

0.99+

UberORGANIZATION

0.99+

D.O.JORGANIZATION

0.99+

yesterdayDATE

0.99+

eBayORGANIZATION

0.99+

40 billion dollarQUANTITY

0.99+

MITORGANIZATION

0.99+

JassyPERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

todayDATE

0.99+

10 yearsQUANTITY

0.99+

ten yearsQUANTITY

0.99+

TodayDATE

0.99+

OneQUANTITY

0.99+

CISRORGANIZATION

0.98+

MIT CISRORGANIZATION

0.98+

Seeing DigitalTITLE

0.98+

two thingsQUANTITY

0.98+

singleQUANTITY

0.97+

UberedORGANIZATION

0.97+

LinkedInORGANIZATION

0.97+

WindowsTITLE

0.96+

OAuth.inTITLE

0.96+

oneQUANTITY

0.94+

Wang and D.GORGANIZATION

0.94+

CDO Information Quality ConferenceEVENT

0.94+

D.O.JPERSON

0.87+

Gokula Mishra | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE covering MIT Chief Data Officer and Information Quality Symposium 2019 brought to you by SiliconANGLE Media. (upbeat techno music) >> Hi everybody, welcome back to Cambridge, Massachusetts. You're watching theCUBE, the leader in tech coverage. We go out to the events. We extract the signal from the noise, and we're here at the MIT CDOIQ Conference, Chief Data Officer Information Quality Conference. It is the 13th year here at the Tang building. We've outgrown this building and have to move next year. It's fire marshal full. Gokula Mishra is here. He is the Senior Director of Global Data and Analytics and Supply Chain-- >> Formerly. Former, former Senior Director. >> Former! I'm sorry. It's former Senior Director of Global Data Analytics and Supply Chain at McDonald's. Oh, I didn't know that. I apologize my friend. Well, welcome back to theCUBE. We met when you were at Oracle doing data. So you've left that, you're on to your next big thing. >> Yes, thinking through it. >> Fantastic, now let's start with your career. You've had, so you just recently left McDonald's. I met you when you were at Oracle, so you cut over to the dark side for a while, and then before that, I mean, you've been a practitioner all your life, so take us through sort of your background. >> Yeah, I mean my beginning was really with a company called Tata Burroughs. Those days we did not have a lot of work getting done in India. We used to send people to U.S. so I was one of the pioneers of the whole industry, coming here and working on very interesting projects. But I was lucky to be working on mostly data analytics related work, joined a great company called CS Associates. I did my Master's at Northwestern. In fact, my thesis was intelligent databases. So, building AI into the databases and from there on I have been with Booz Allen, Oracle, HP, TransUnion, I also run my own company, and Sierra Atlantic, which is part of Hitachi, and McDonald's. >> Awesome, so let's talk about use of data. It's evolved dramatically as we know. One of the themes in this conference over the years has been sort of, I said yesterday, the Chief Data Officer role emerged from the ashes of sort of governance, kind of back office information quality compliance, and then ascended with the tailwind of the Big Data meme, and it's kind of come full circle. People are realizing actually to get value out of data, you have to have information quality. So those two worlds have collided together, and you've also seen the ascendancy of the Chief Digital Officer who has really taken a front and center role in some of the more strategic and revenue generating initiatives, and in some ways the Chief Data Officer has been a supporting role to that, providing the quality, providing the compliance, the governance, and the data modeling and analytics, and a component of it. First of all, is that a fair assessment? How do you see the way in which the use of data has evolved over the last 10 years? >> So to me, primarily, the use of data was, in my mind, mostly around financial reporting. So, anything that companies needed to run their company, any metrics they needed, any data they needed. So, if you look at all the reporting that used to happen it's primarily around metrics that are financials, whether it's around finances around operations, finances around marketing effort, finances around reporting if it's a public company reporting to the market. That's where the focus was, and so therefore a lot of the data that was not needed for financial reporting was what we call nowadays dark data. This is data we collect but don't do anything with it. Then, as the capability of the computing, and the storage, and new technologies, and new techniques evolve, and are able to handle more variety and more volume of data, then people quickly realize how much potential they have in the other data outside of the financial reporting data that they can utilize too. So, some of the pioneers leverage that and actually improved a lot in their efficiency of operations, came out with innovation. You know, GE comes to mind as one of the companies that actually leverage data early on, and number of other companies. Obviously, you look at today data has been, it's defining some of the multi-billion dollar company and all they have is data. >> Well, Facebook, Google, Amazon, Microsoft. >> Exactly. >> Apple, I mean Apple obviously makes stuff, but those other companies, they're data companies. I mean largely, and those five companies have the highest market value on the U.S. stock exchange. They've surpassed all the other big leaders, even Berkshire Hathaway. >> So now, what is happening is because the market changes, the forces that are changing the behavior of our consumers and customers, which I talked about which is everyone now is digitally engaging with each other. What that does is all the experiences now are being captured digitally, all the services are being captured digitally, all the products are creating a lot of digital exhaust of data and so now companies have to pay attention to engage with their customers and partners digitally. Therefore, they have to make sure that they're leveraging data and analytics in doing so. The other thing that has changed is the time to decision to the time to act on the data inside that you get is shrinking, and shrinking, and shrinking, so a lot more decision-making is now going real time. Therefore, you have a situation now, you have the capability, you have the technology, you have the data now, you have to make sure that you convert that in what I call programmatic kind of data decision-making. Obviously, there are people involved in more strategic decision-making. So, that's more manual, but at the operational level, it's going more programmatic decision-making. >> Okay, I want to talk, By the way, I've seen a stat, I don't know if you can confirm this, that 80% of the data that's out there today is dark data or it's data that's behind a firewall or not searchable, not open to Google's crawlers. So, there's a lot of value there-- >> So, I would say that percent is declining over time as companies have realized the value of data. So, more and more companies are removing the silos, bringing those dark data out. I think the key to that is companies being able to value their data, and as soon as they are able to value their data, they are able to leverage a lot of the data. I still believe there's a large percent still not used or accessed in companies. >> Well, and of course you talked a lot about data monetization. Doug Laney, who's an expert in that topic, we had Doug on a couple years ago when he, just after, he wrote Infonomics. He was on yesterday. He's got a very detailed prescription as to, he makes strong cases as to why data should be valued like an asset. I don't think anybody really disagrees with that, but then he gave kind of a how-to-do-it, which will, somewhat, make your eyes bleed, but it was really well thought out, as you know. But you talked a lot about data monetization, you talked about a number of ways in which data can contribute to monetization. Revenue, cost reduction, efficiency, risk, and innovation. Revenue and cost is obvious. I mean, that's where the starting point is. Efficiency is interesting. I look at efficiency as kind of a doing more with less but it's sort of a cost reduction, but explain why it's not in the cost bucket, it's different. >> So, it is first starts with doing what we do today cheaper, better, faster, and doing more comes after that because if you don't understand, and data is the way to understand how your current processes work, you will not take the first step. So, to take the first step is to understand how can I do this process faster, and then you focus on cheaper, and then you focus on better. Of course, faster is because of some of the market forces and customer behavior that's driving you to do that process faster. >> Okay, and then the other one was risk reduction. I think that makes a lot of sense here. Actually, let me go back. So, one of the key pieces of it, of efficiency is time to value. So, if you can compress the time, or accelerate the time and you get the value that means more cash in house faster, whether it's cost reduction or-- >> And the other aspect you look at is, can you automate more of the processes, and in that way it can be faster. >> And that hits the income statement as well because you're reducing headcount cost of your, maybe not reducing headcount cost, but you're getting more out of different, out ahead you're reallocating them to more strategic initiatives. Everybody says that but the reality is you hire less people because you just automated. And then, risk reduction, so the degree to which you can lower your expected loss. That's just instead thinking in insurance terms, that's tangible value so certainly to large corporations, but even midsize and small corporations. Innovation, I thought was a good one, but maybe you could use an example of, give us an example of how in your career you've seen data contribute to innovation. >> So, I'll give an example of oil and gas industry. If you look at speed of innovation in the oil and gas industry, they were all paper-based. I don't know how much you know about drilling. A lot of the assets that goes into figuring out where to drill, how to drill, and actually drilling and then taking the oil or gas out, and of course selling it to make money. All of those processes were paper based. So, if you can imagine trying to optimize a paper-based innovation, it's very hard. Not only that, it's very, very by itself because it's on paper, it's in someone's drawer or file. So, it's siloed by design and so one thing that the industry has gone through, they recognize that they have to optimize the processes to be better, to innovate, to find, for example, shale gas was a result output of digitizing the processes because otherwise you can't drill faster, cheaper, better to leverage the shale gas drilling that they did. So, the industry went through actually digitizing a lot of the paper assets. So, they went from not having data to knowingly creating the data that they can use to optimize the process and then in the process they're innovating new ways to drill the oil well cheaper, better, faster. >> In the early days of oil exploration in the U.S. go back to the Osage Indian tribe in northern Oklahoma, and they brilliantly, when they got shuttled around, they pushed him out of Kansas and they negotiated with the U.S. government that they maintain the mineral rights and so they became very, very wealthy. In fact, at one point they were the wealthiest per capita individuals in the entire world, and they used to hold auctions for various drilling rights. So, it was all gut feel, all the oil barons would train in, and they would have an auction, and it was, again, it was gut feel as to which areas were the best, and then of course they evolved, you remember it used to be you drill a little hole, no oil, drill a hole, no oil, drill a hole. >> You know how much that cost? >> Yeah, the expense is enormous right? >> It can vary from 10 to 20 million dollars. >> Just a giant expense. So, now today fast-forward to this century, and you're seeing much more sophisticated-- >> Yeah, I can give you another example in pharmaceutical. They develop new drugs, it's a long process. So, one of the initial process is to figure out what molecules this would be exploring in the next step, and you could have thousand different combination of molecules that could treat a particular condition, and now they with digitization and data analytics, they're able to do this in a virtual world, kind of creating a virtual lab where they can test out thousands of molecules. And then, once they can bring it down to a fewer, then the physical aspect of that starts. Think about innovation really shrinking their processes. >> All right, well I want to say this about clouds. You made the statement in your keynote that how many people out there think cloud is cheaper, or maybe you even said cheap, but cheaper I inferred cheaper than an on-prem, and so it was a loaded question so nobody put their hand up they're afraid, but I put my hand up because we don't have any IT. We used to have IT. It was a nightmare. So, for us it's better but in your experience, I think I'm inferring correctly that you had meant cheaper than on-prem, and certainly we talked to many practitioners who have large systems that when they lift and shift to the cloud, they don't change their operating model, they don't really change anything, they get a bill at the end of the month, and they go "What did this really do for us?" And I think that's what you mean-- >> So what I mean, let me make it clear, is that there are certain use cases that cloud is and, as you saw, that people did raise their hand saying "Yeah, I have use cases where cloud is cheaper." I think you need to look at the whole thing. Cost is one aspect. The flexibility and agility of being able to do things is another aspect. For example, if you have a situation where your stakeholder want to do something for three weeks, and they need five times the computing power, and the data that they are buying from outside to do that experiment. Now, imagine doing that in a physical war. It's going to take a long time just to procure and get the physical boxes, and then you'll be able to do it. In cloud, you can enable that, you can get GPUs depending on what problem we are trying to solve. That's another benefit. You can get the fit for purpose computing environment to that and so there are a lot of flexibility, agility all of that. It's a new way of managing it so people need to pay attention to the cost because it will add to the cost. The other thing I will point out is that if you go to the public cloud, because they make it cheaper, because they have hundreds and thousands of this canned CPU. This much computing power, this much memory, this much disk, this much connectivity, and they build thousands of them, and that's why it's cheaper. Well, if your need is something that's very unique and they don't have it, that's when it becomes a problem. Either you need more of those and the cost will be higher. So, now we are getting to the IOT war. The volume of data is growing so much, and the type of processing that you need to do is becoming more real-time, and you can't just move all this bulk of data, and then bring it back, and move the data back and forth. You need a special type of computing, which is at the, what Amazon calls it, adds computing. And the industry is kind of trying to design it. So, that is an example of hybrid computing evolving out of a cloud or out of the necessity that you need special purpose computing environment to deal with new situations, and all of it can't be in the cloud. >> I mean, I would argue, well I guess Microsoft with Azure Stack was kind of the first, although not really. Now, they're there but I would say Oracle, your former company, was the first one to say "Okay, we're going to put the exact same infrastructure on prem as we have in the public cloud." Oracle, I would say, was the first to truly do that-- >> They were doing hybrid computing. >> You now see Amazon with outposts has done the same, Google kind of has similar approach as Azure, and so it's clear that hybrid is here to stay, at least for some period of time. I think the cloud guys probably believe that ultimately it's all going to go to the cloud. We'll see it's going to be a long, long time before that happens. Okay! I'll give you last thoughts on this conference. You've been here before? Or is this your first one? >> This is my first one. >> Okay, so your takeaways, your thoughts, things you might-- >> I am very impressed. I'm a practitioner and finding so many practitioners coming from so many different backgrounds and industries. It's very, very enlightening to listen to their journey, their story, their learnings in terms of what works and what doesn't work. It is really invaluable. >> Yeah, I tell you this, it's always a highlight of our season and Gokula, thank you very much for coming on theCUBE. It was great to see you. >> Thank you. >> You're welcome. All right, keep it right there everybody. We'll be back with our next guest, Dave Vellante. Paul Gillin is in the house. You're watching theCUBE from MIT. Be right back! (upbeat techno music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. He is the Senior Director of Global Data and Analytics Former, former Senior Director. We met when you were at Oracle doing data. I met you when you were at Oracle, of the pioneers of the whole industry, and the data modeling and analytics, So, if you look at all the reporting that used to happen the highest market value on the U.S. stock exchange. So, that's more manual, but at the operational level, that 80% of the data that's out there today and as soon as they are able to value their data, Well, and of course you talked a lot and data is the way to understand or accelerate the time and you get the value And the other aspect you look at is, Everybody says that but the reality is you hire and of course selling it to make money. the mineral rights and so they became very, very wealthy. and you're seeing much more sophisticated-- So, one of the initial process is to figure out And I think that's what you mean-- and the type of processing that you need to do I mean, I would argue, and so it's clear that hybrid is here to stay, and what doesn't work. Yeah, I tell you this, Paul Gillin is in the house.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

AmazonORGANIZATION

0.99+

HitachiORGANIZATION

0.99+

AppleORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Doug LaneyPERSON

0.99+

five timesQUANTITY

0.99+

OracleORGANIZATION

0.99+

KansasLOCATION

0.99+

TransUnionORGANIZATION

0.99+

Paul GillinPERSON

0.99+

HPORGANIZATION

0.99+

three weeksQUANTITY

0.99+

IndiaLOCATION

0.99+

10QUANTITY

0.99+

Sierra AtlanticORGANIZATION

0.99+

Gokula MishraPERSON

0.99+

DougPERSON

0.99+

hundredsQUANTITY

0.99+

Berkshire HathawayORGANIZATION

0.99+

five companiesQUANTITY

0.99+

80%QUANTITY

0.99+

U.S.LOCATION

0.99+

Booz AllenORGANIZATION

0.99+

Tata BurroughsORGANIZATION

0.99+

first stepQUANTITY

0.99+

GokulaPERSON

0.99+

next yearDATE

0.99+

thousandsQUANTITY

0.99+

McDonald'sORGANIZATION

0.99+

one aspectQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

firstQUANTITY

0.99+

yesterdayDATE

0.99+

thousands of moleculesQUANTITY

0.99+

first oneQUANTITY

0.99+

OneQUANTITY

0.98+

GEORGANIZATION

0.98+

northern OklahomaLOCATION

0.98+

todayDATE

0.97+

CS AssociatesORGANIZATION

0.97+

20 million dollarsQUANTITY

0.97+

oneQUANTITY

0.96+

FirstQUANTITY

0.96+

Global Data and Analytics and Supply ChainORGANIZATION

0.95+

MIT CDOIQ ConferenceEVENT

0.95+

13th yearQUANTITY

0.94+

U.S. governmentORGANIZATION

0.93+

two worldsQUANTITY

0.92+

Azure StackTITLE

0.91+

one thingQUANTITY

0.9+

one pointQUANTITY

0.9+

NorthwesternORGANIZATION

0.9+

couple years agoDATE

0.89+

MIT Chief Data Officer and Information Quality Symposium 2019EVENT

0.87+

this centuryDATE

0.85+

Tang buildingLOCATION

0.85+

Global Data Analytics andORGANIZATION

0.83+

Chief Data Officer Information Quality ConferenceEVENT

0.81+

MITORGANIZATION

0.78+

theCUBEORGANIZATION

0.77+

thousand different combination of moleculesQUANTITY

0.74+

lastDATE

0.67+

yearsDATE

0.66+

U.S.ORGANIZATION

0.66+

billion dollarQUANTITY

0.65+

themesQUANTITY

0.65+

Osage IndianOTHER

0.64+

Julie Johnson, Armored Things | MIT CDOIQ 2019


 

>> From Cambridge Massachusetts, it's The Cube covering MIT Chief Data Officer, and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (electronic music) >> Welcome back to MIT in Cambridge, Massachusets everybody. You're watching The Cube, the leader in live tech coverage. My name is Dave Vellante I'm here with Paul Gillin. Day two of the of the MIT Chief Data Officer Information Quality Conference. One of the things we like to do, at these shows, we love to profile Boston area start-ups that are focused on data, and in particular we love to focus on start-ups that are founded by women. Julie Johnson is here, She's the Co-founder and CEO of Armored Things. Julie, great to see you again. Thanks for coming on. >> Great to see you. >> So why did you start Armored Things? >> You know, Armored Things was created around a mission to keep people safe. Early in the time where were looking at starting this company, incidents like Las Vegas happened, Parkland happened, and we realized that the world of security and operations was really stuck in the past right? It's a manual solutions generally driven by a human instinct, anecdotal evidence, and tools like Walkie-Talkies and video cameras. We knew there had to be a better way right? In the world of Data that we live in today, I would ask if either of you got in your car this morning without turning on Google Maps to see where you were going, and the best route with traffic. We want to help universities, ball parks, corporate campuses do that for people. How do we keep our people safe? By understanding how they live. >> Yeah, and stay away from Lambert Street in Cambridge by the way. >> (laughing) >> Okay so, you know in people, when they think about security they think about cyber, they think about virtual security, et cetera et cetera, but there's also the physical security aspect. Can you talk about the balance of those two? >> Yeah, and I think both are very important. We actually tend to mimic some of the revolutions that have happened on the cyber security side over the last 10 years with what we're trying to do in the world of physical security. So, folks watching this who are familiar with cyber security might understand concepts like anomaly detection, SIEM and SOAR for orchestrated response. We very much believe that similar concepts can be applied to the physical world, but the unique thing about the physical world, is that it has defined boundaries, right? People behave in accordance with their environment. So, how do we take the lessons learned in cyber security over 10 to 15 years, and apply them to that physical world? I also believe that physical and cyber security are converging. So, are there things that we know in the physical world because of how we approach the problem? That can be a leading indicator of a threat in either the physical world or the digital world. What many people don't understand is that for some of these cyber security hacks, the first weak link is physical access to your network, to your data, to your systems. How do we actually help you get an eye on that, so you already have some context when you notice it in the digital realm. >> So, go back to the two examples you sited earlier, the two shooting examples. Could those have been prevented or mitigated in some way using the type of technology you're building? >> Yeah, I hate to say that you could ever prevent an incident like that. Everyone wants us to do better. Our goal is to get a better sense predicatively of the leading indicators that tell you you have a problem. So, because we're fundamentally looking at patterns of people and flow, I want to know when a normal random environment starts to disperse in a certain way, or if I have a bottle neck in my environment. Because if then I have that type of incident occur, I already know where my hotspots are, where my pockets of risk are. So, I can address it that much more efficiently from a response perspective. >> So if people are moving quickly away from a venue, it might be and indication that there's something wrong- >> It could be, Yeah. That demands attention. >> Yeah, when you go to a baseball game, or when you go to work I would imagine that you generally have a certain pattern of behavior. People know conceptually what those patterns are. But, we're the first effort to bring them data to prove what those patterns are so that they can actually use that data to consistently re-examine their operations, re-examine their security from a staffing perspective, from a management perspective, to make sure that they're using all the data that's at their disposal. >> Seems like there would be many other applications beyond security of this type of analysis. Are you committed to the security space, or do you have broader ambitions? >> Are we committed to the security space is a hundred percent. I would say the number one reason why people join our team, and the number one reason why people call us to be customers is for security. There's a better way to do things. We fundamentally believe that every ball park, every university, every corporate campus, needs a better way. I think what we've seen though is exactly what you're saying. As we built our software, for security in these venues, and started with an understanding of people and flow, there's a lot that falls out of that right? How do I open gates that are more effective based on patterns of entry and exit. How do I make sure that my staffing's appropriate for the number of people I have in my environment. There's lots of other contextual information that can ultimately drive a bottom line or top line revenue. So, you take a pro sports venue for example. If we know that on a 10 degree colder day people tend to eagres more early in the game, how do we adjust our food and beverage strategy to save money on hourly workers, so that we're not over staffing in a period of time that doesn't need those resources. >> She's talking about the physical and the logical security worlds coming together, and security of course has always been about data, but 10 years ago it was staring at logs increasing the machines are helping us do that, and software is helping us do that. So can you add some color to at least the trends in the market generally, and then maybe specifically what you're doing bringing machine intelligence to the data to make us more secure. >> Sure, and I hate to break it to you, but logs are still a pretty big part of what people are watching on a daily basis, as are video cameras. We've seen a lot of great technology evolve in the video management system realm. Very advanced technology great at object recognition and detecting certain behaviors with a video only solution, right? How do we help pinpoint certain behaviors on a specific frame or specific camera. The only problem with that is, if you have people watching those cameras, you're still relying on humans in the loop to catch a malicious behavior, to respond in the event that they're notified about something unusual. That still becomes a manual process. What we do, is we use data to watch not only cameras, but we are watching your cameras, your Wi-Fi, access control. Contextual data from public transit, or weather. How do we get this greater understanding of your environment that helps us watch everything so that we can surface the things that you want the humans in the loop to pay attention to, right? So, we're not trying to remove the human, we're trying to help them focus their time and make decisions that are backed by data in the most efficient way possible. >> How about the concerns about The Surveillance Society? In some countries, it's just taken for granted now that you're on camera all the time. In the US that's a little bit more controversial. Is what your doing, do you have to be sensitive to that in designing the tools you're building? >> Yeah, and I think to Dave's question, there are solutions like facial recognition which are very much working on identifying the individual. We have a philosophy as a company, that security doesn't necessarily start with the individual, it starts with the aggregate. How do we understand at an aggregate macro level, the patterns in an environment. Which means I don't have to identify Paul, or I don't have to identify Dave. I want to look for what's usual and unusual, and use that as the basis of my response. There's certain instances where you want to know who people are. Do I want to know who my security personnel are so I can dispatch them more efficiently? Absolutely. Let's opt those people in and allow them to share the information they need to share to be better resources for our environment. But, that's the exception not the norm. If we make the norm privacy first, I think we'll be really successful in this emerging GDPR data centric world. >> But I could see somebody down the road saying hey can you help us find this bad guy? And my kids at camp this week, This is his 7th year of camp, and this year was the first year my wife, she was able to sign up for a facial recognition thing. So, we used to have to scroll through hundreds and hundreds of pictures to see oh, there he is! And so Deb signs up for this thing, and then it pings you when your son has a picture taken. >> Yeah. And I was like, That's awesome. Oh. (laughing) >> That's great until you think about it. >> But there aren't really any clear privacy laws today. And so you guys are saying, look it, we're looking at the big picture. >> That's right. >> But that day is coming isn't it? >> There's certain environments that care more than others. If you think about universities, which is where we first started building our technology, they cared greatly about the privacy of their students. Health care is a great example. We want to make sure that we're protecting peoples personal data at a different level. Not only because that's the right thing to do, but also from a regulatory perspective. So, how do we give them the same security without compromising the privacy. >> Talk about Bottom line. You mentioned to us earlier that you just signed a contract with a sports franchise, you're actually going to help them, help save them money by deploying their resources more efficiently. How does your technology help the bottom line? >> Sure, you're average sporting venue, is getting great information at the point a ticket is scanned or a ticket is purchased, they have very little visibility beyond that into the customer journey during an event at their venue. So, if you think about again, patterns of people and flow from a security perspective, at our core we're helping them staff the right gates, or figure out where people need to be based on hot spots in their environment. But, what that also results in is an ability to drive other operational benefits. Do we have a zone that's very low utilization that we could use as maybe even a benefit to our avid fans. Send them to that area, get traffic in that area, and now give them a better concession experience because of it, right? Where they're going to end up spending more money because they're not waiting in line in the different zone. So, how do we give them a dashboard in real time, but also alerts or reports that they can use on an ongoing basis to change their decision making going forward. >> So, give us the company overview. Where are you guys at with funding, head count, all that good stuff. >> So, we raised a seed round with some great Boston and Silicon Valley investors a year ago. So, that was Glasswing is a Boston AI focused fund, has been a great partner for us, and Inovia which is Canada's largest VC fund recently opened a Silicon Valley office. We just started raising a series A about a week ago. I'm excited to say those conversation have been going really well so far. We have some potential strategic partners who we're excited about who know data better then anyone else that we think would help us accelerate our business. We also have a few folks who are very familiar with the large venue space. You know, the distributed campuses, the sporting and entertainment venues. So, we're out looking for the right partner to lead our series A round, and take our business to the next level, but where we are today with five really great branded customers, I think we'll have 20 by the end of next year, and we won't stop fighting 'till we're at every ball park, every football stadium, every convention center, school. >> The big question, at some point will you be able to eliminate security lines? (laughing) >> I don't think that's my core mission. (laughing) But, optimistically I'd love to help you. Right, I think there's some very talented people working on that challenge, so I'll defer that one to them. >> And rough head count today? >> We have 23 people. >> You're 23 people so- >> Yeah, I headquartered in Boston Post Office Square. >> Awesome, great location. So, and you say you've got five customers, so you're generating revenue? >> Yes >> Okay, good. Well, thank you for coming in The Cube >> Yeah, thank you. >> And best of luck with the series A- >> I appreciate it and going forward >> Yeah, great. >> All right, and thank you for watching. Paul Gillin and I will be back right after this short break. This is The Cube from MIT Chief Data Officer Information Quality Conference in Cambridge. We'll be right back. (electronic music)

Published Date : Aug 1 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Julie, great to see you again. to see where you were going, in Cambridge by the way. Okay so, you know in people, How do we actually help you get an eye on that, So, go back to the two examples you sited earlier, Yeah, I hate to say that you could ever prevent That demands attention. data to prove what those patterns are or do you have broader ambitions? and the number one reason why people bringing machine intelligence to the data Sure, and I hate to break it to you, sensitive to that in designing the tools you're building? Yeah, and I think to Dave's question, and then it pings you when your son And I was like, That's awesome. And so you guys are saying, Not only because that's the right thing to do, You mentioned to us earlier that you So, if you think about again, Where are you guys at with funding, head count, and take our business to the next level, so I'll defer that one to them. So, and you say you've got five customers, Well, thank you for coming in The Cube All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Paul GillinPERSON

0.99+

Dave VellantePERSON

0.99+

Julie JohnsonPERSON

0.99+

JuliePERSON

0.99+

CambridgeLOCATION

0.99+

7th yearQUANTITY

0.99+

InoviaORGANIZATION

0.99+

PaulPERSON

0.99+

Lambert StreetLOCATION

0.99+

BostonLOCATION

0.99+

fiveQUANTITY

0.99+

two examplesQUANTITY

0.99+

10 degreeQUANTITY

0.99+

USLOCATION

0.99+

five customersQUANTITY

0.99+

23 peopleQUANTITY

0.99+

twoQUANTITY

0.99+

todayDATE

0.99+

DebPERSON

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

Armored ThingsORGANIZATION

0.99+

bothQUANTITY

0.99+

Google MapsTITLE

0.99+

Silicon ValleyLOCATION

0.99+

GlasswingORGANIZATION

0.99+

OneQUANTITY

0.99+

this weekDATE

0.98+

Cambridge MassachusettsLOCATION

0.98+

a year agoDATE

0.98+

first yearQUANTITY

0.98+

series AOTHER

0.98+

hundred percentQUANTITY

0.98+

20QUANTITY

0.98+

Day twoQUANTITY

0.97+

The CubeTITLE

0.97+

Las VegasLOCATION

0.97+

firstQUANTITY

0.97+

CanadaLOCATION

0.96+

GDPRTITLE

0.96+

Chief Data OfficerEVENT

0.95+

over 10QUANTITY

0.94+

10 years agoDATE

0.94+

this yearDATE

0.94+

Surveillance SocietyORGANIZATION

0.93+

Boston Post Office SquareLOCATION

0.92+

15 yearsQUANTITY

0.91+

first effortQUANTITY

0.91+

end of next yearDATE

0.89+

MITORGANIZATION

0.89+

this morningDATE

0.88+

two shooting examplesQUANTITY

0.85+

about a week agoDATE

0.83+

ThiPERSON

0.83+

ArmoredORGANIZATION

0.83+

football stadiumQUANTITY

0.82+

oneQUANTITY

0.82+

2019DATE

0.81+

Information Quality SymposiumEVENT

0.8+

hundreds of picturesQUANTITY

0.79+

great branded customersQUANTITY

0.77+

last 10 yearsDATE

0.73+

hundreds andQUANTITY

0.73+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.72+

MassachusetsLOCATION

0.7+

ParklandORGANIZATION

0.7+

every ball parkQUANTITY

0.7+

one reasonQUANTITY

0.69+

Walkie-ORGANIZATION

0.66+

first weak linkQUANTITY

0.66+

convention centerQUANTITY

0.65+

The CubeORGANIZATION

0.64+

corporate campusQUANTITY

0.64+

ball parkQUANTITY

0.61+

MIT ChiefORGANIZATION

0.59+

TalkiesTITLE

0.57+

universityQUANTITY

0.57+

Data Officer Information Quality ConferenceEVENT

0.54+

Colin Mahony, Vertica | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts everybody, you're watching The Cube, the leader in tech coverage. My name is Dave Vellante here with my cohost Paul Gillin. This is day one of our two day coverage of the MIT CDOIQ conferences. CDO, Chief Data Officer, IQ, information quality. Colin Mahoney is here, he's a good friend and long time CUBE alum. I haven't seen you in awhile, >> I know >> But thank you so much for taking some time, you're like a special guest here >> Thank you, yeah it's great to be here, thank you. >> Yeah, so, this is not, you know, something that you would normally attend. I caught up with you, invited you in. This conference has started as, like back office governance, information quality, kind of wonky stuff, hidden. And then when the big data meme took off, kind of around the time we met. The Chief Data Officer role emerged, the whole Hadoop thing exploded, and then this conference kind of got bigger and bigger and bigger. Still intimate, but very high level, very senior. It's kind of come full circle as we've been saying, you know, information quality still matters. You have been in this data business forever, so I wanted to invite you in just to get your perspectives, we'll talk about what's new with what's going on in your company, but let's go back a little bit. When we first met and even before, you saw it coming, you kind of invested your whole career into data. So, take us back 10 years, I mean it was so different, remember it was Batch, it was Hadoop, but it was cool. There was a lot of cool >> It's still cool. (laughs) projects going on, and it's still cool. But, take a look back. >> Yeah, so it's changed a lot, look, I got into it a while ago, I've always loved data, I had no idea, the explosion and the three V's of data that we've seen over the last decade. But, data's really important, and it's just going to get more and more important. But as I look back I think what's really changed, and even if you just go back a decade I mean, there's an insatiable appetite for data. And that is not slowing down, it hasn't slowed down at all, and I think everybody wants that perfect solution that they can ask any question and get an immediate answers to. We went through the Hadoop boom, I'd argue that we're going through the Hadoop bust, but what people actually want is still the same. You know, they want real answers, accurate answers, they want them quickly, and they want it against all their information and all their data. And I think that Hadoop evolved a lot as well, you know, it started as one thing 10 years ago, with MapReduce and I think in the end what it's really been about is disrupting the storage market. But if you really look at what's disrupting storage right now, public clouds, S3, right? That's the new data league. So there's always a lot of hype cycles, everybody talks about you know, now it's Cloud, everything, for maybe the last 10 years it was a lot of Hadoop, but at the end of the day I think what people want to do with data is still very much the same. And a lot of companies are still struggling with it, hence the role for Chief Data Officers to really figure out how do I monetize data on the one hand and how to I protect that asset on the other hand. >> Well so, and the cool this is, so this conference is not a tech conference, really. And we love tech, we love talking about this, this is why I love having you on. We kind of have a little Vertica thread that I've created here, so Colin essentially, is the current CEO of Vertica, I know that's not your title, you're GM and Senior Vice President, but you're running Vertica. So, Michael Stonebreaker's coming on tomorrow, >> Yeah, excellent. >> Chris Lynch is coming on tomorrow, >> Oh, great, yeah. >> we've got Andy Palmer >> Awesome, yeah. >> coming up as well. >> Pretty cool. (laughs) >> So we have this connection, why is that important? It's because, you know, Vertica is a very cool company and is all about data, and it was all about disrupting, sort of the traditional relational database. It's kind of doing more with data, and if you go back to the roots of Vertica, it was like how do you do things faster? How do you really take advantage of data to really drive new business? And that's kind of what it's all about. And the tech behind it is really cool, we did your conference for many, many years. >> It's coming back by the way. >> Is it? >> Yeah, this March, so March 30th. >> Oh, wow, mark that down. >> At Boston, at the new Encore Hotel. >> Well we better have theCUBE there, bro. (laughs) >> Yeah, that's great. And yeah, you've done that conference >> Yep. >> haven't you before? So very cool customers, kind of leading edge, so I want to get to some of that, but let's talk the disruption for a minute. So you guys started with the whole architecture, MPP and so forth. And you talked about Cloud, Cloud really disrupted Hadoop. What are some of the other technology disruptions that you're seeing in the market space? >> I think, I mean, you know, it's hard not to talk about AI machine learning, and what one means versus the other, who knows right? But I think one thing that is definitely happening is people are leveraging the volumes of data and they're trying to use all the processing power and storage power that we have to do things that humans either are too expensive to do or simply can't do at the same speed and scale. And so, I think we're going through a renaissance where a lot more is being automated, certainly on the Vertica roadmap, and our path has always been initially to get the data in and then we want the platform to do a lot more for our customers, lots more analytics, lots more machine-learning in the platform. So that's definitely been a lot of the buzz around, but what's really funny is when you talk to a lot of customers they're still struggling with just some basic stuff. Forget about the predictive thing, first you've got to get to what happened in the past. Let's give accurate reporting on what's actually happening. The other big thing I think as a disruption is, I think IOT, for all the hype that it's getting it's very real. And every device is kicking off lots of information, the feedback loop of AB testing or quality testing for predictive maintenance, it's happening almost instantly. And so you're getting massive amounts of new data coming in, it's all this machine sensor type data, you got to figure out what it means really quick, and then you actually have to do something and act on it within seconds. And that's a whole new area for so many people. It's not their traditional enterprise data network warehouse and you know, back to you comment on Stonebreaker, he got a lot of this right from the beginning, you know, and I think he looked at the architectures, he took a lot of the best in class designs, we didn't necessarily invent everything, but we put a lot of that together. And then I think the other you've got to do is constantly re-invent your platform. We came out with our Eon Mode to run cloud native, we just got rated the best cloud data warehouse from a net promoter score rating perspective, so, but we got to keep going you know, we got to keep re-inventing ourselves, but leverage everything that we've done in the past as well. >> So one of the things that you said, which is kind of relevant for here, Paul, is you're still seeing a real data quality issue that customers are wrestling with, and that's a big theme here, isn't it? >> Absolutely, and the, what goes around comes around, as Dave said earlier, we're still talking about information quality 13 years after this conference began. Have the tools to improve quality improved all that much? >> I think the tools have improved, I think that's another area where machine learning, if you look at Tamr, and I know you're going to have Andy here tomorrow, they're leveraging a lot of the augmented things you can do with the processing to make it better. But I think one thing that makes the problem worse now, is it's gotten really easy to pour data in. It's gotten really easy to store data without having to have the right structure, the right quality, you know, 10 years ago, 20 years ago, everything was perfect before it got into the platform. Right, everything was, there was quality, everything was there. What's been happening over the last decade is you're pumping data into these systems, nobody knows if it's redundant data, nobody knows if the quality's any good, and the amount of data is massive. >> And it's cheap to store >> Very cheap to store. >> So people keep pumping it in. >> But I think that creates a lot of issues when it comes to data quality. So, I do think the technology's gotten better, I think there's a lot of companies that are doing a great job with it, but I think the challenge has definitely upped. >> So, go ahead. >> I'm sorry. You mentioned earlier that we're seeing the death of Hadoop, but I'd like you to elaborate on that becuase (Dave laughs) Hadoop actually came up this morning in the keynote, it's part of what GlaxoSmithKline did. Came up in a conversation I had with the CEO of Experian last week, I mean, it's still out there, why do you think it's in decline? >> I think, I mean first of all if you look at the Hadoop vendors that are out there, they've all been struggling. I mean some of them are shutting down, two of them have merged and they've got killed lately. I think there are some very successful implementations of Hadoop. I think Hadoop as a storage environment is wonderful, I think you can process a lot of data on Hadoop, but the problem with Hadoop is it became the panacea that was going to solve all things data. It was going to be the database, it was going to be the data warehouse, it was going to do everything. >> That's usually the kiss of death, isn't it? >> It's the kiss of death. And it, you know, the killer app on Hadoop, ironically, became SQL. I mean, SQL's the killer app on Hadoop. If you want to SQL engine, you don't need Hadoop. But what we did was, in the beginning Mike sort of made fun of it, Stonebreaker, and joked a lot about he's heard of MapReduce, it's called Group By, (Dave laughs) and that created a lot of tension between the early Vertica and Hadoop. I think, in the end, we embraced it. We sit next to Hadoop, we sit on top of Hadoop, we sit behind it, we sit in front of it, it's there. But I think what the reality check of the industry has been, certainly by the business folks in these companies is it has not fulfilled all the promises, it has not fulfilled a fraction on the promises that they bet on, and so they need to figure those things out. So I don't think it's going to go away completely, but I think its best success has been disrupting the storage market, and I think there's some much larger disruptions of technologies that frankly are better than HTFS to do that. >> And the Cloud was a gamechanger >> And a lot of them are in the cloud. >> Which is ironic, 'cause you know, cloud era, (Colin laughs) they didn't really have a cloud strategy, neither did Hortonworks, neither did MapR and, it just so happened Amazon had one, Google had one, and Microsoft has one, so, it's just convenient to-- >> Well, how is that affecting your business? We've seen this massive migration to the cloud (mumbles) >> It's actually been great for us, so one of the things about Vertica is we run everywhere, and we made a decision a while ago, we had our own data warehouse as a service offering. It might have been ahead of its time, never really took off, what we did instead is we pivoted and we say "you know what? "We're going to invest in that experience "so it's a SaaS-like experience, "but we're going to let our customers "have full control over the cloud. "And if they want to go to Amazon they can, "if they want to go to Google they can, "if they want to go to Azure they can." And we really invested in that and that experience. We're up on the Amazon marketplace, we have lots of customers running up on Amazon Cloud as well as Google and Azure now, and then about two years ago we went down and did this endeavor to completely re-architect our product so that we could separate compute and storage so that our customers could actually take advantage of the cloud economics as well. That's been huge for us, >> So you scale independent-- >> Scale independently, cloud native, add compute, take away compute, and for our existing customers, they're loving the hybrid aspect, they love that they can still run on Premise, they love that they can run up on a public cloud, they love that they can run in both places. So we will continue to invest a lot in that. And it is really, really important, and frankly, I think cloud has helped Vertica a lot, because being able to provision hardware quickly, being able to tie in to these public clouds, into our customers' accounts, give them control, has been great and we're going to continue on that path. >> Because Vertica's an ISV, I mean you're a software company. >> We're a software company. >> I know you were a part of HP for a while, and HP wanted to mash that in and run it on it's hardware, but software runs great in the cloud. And then to you it's another hardware platform. >> It's another hardware platform, exactly. >> So give us the update on Micro Focus, Micro Focus acquired Vertica as part of the HPE software business, how many years ago now? Two years ago? >> Less than two years ago. >> Okay, so how's that going, >> It's going great. >> Give us the update there. >> Yeah, so first of all it is great, HPE and HP were wonderful to Vertica, but it's great being part of a software company. Micro Focus is a software company. And more than just a software company it's a company that has a lot of experience bridging the old and the new. Leveraging all of the investments that you've made but also thinking about cloud and all these other things that are coming down the pike. I think for Vertica it's been really great because, as you've seen Vertica has gotten its identity back again. And that's something that Micro Focus is very good at. You can look at what Micro Focus did with SUSE, the Linux company, which actually you know, now just recently spun out of Micro Focus but, letting organizations like Vertica that have this culture, have this product, have this passion, really focus on our market and our customers and doing the right thing by them has been just really great for us and operating as a software company. The other nice thing is that we do integrate with a lot of other products, some of which came from the HPE side, some of which came from Micro Focus, security products is an example. The other really nice thing is we've been doing this insource thing at Micro Focus where we open up our source code to some of the other teams in Micro Focus and they've been contributing now in amazing ways to the product. In ways that we would just never be able to scale, but with 4,000 engineers strong in Micro Focus, we've got a much larger development organization that can actually contribute to the things that Vertica needs to do. And as we go into the cloud and as we do a lot more operational aspects, the experience that these teams have has been incredible, and security's another great example there. So overall it's been great, we've had four different owners of Vertica, our job is to continue what we do on the innovation side in the culture, but so far Micro Focus has been terrific. >> Well, I'd like to say, you're kind of getting that mojo back, because you guys as an independent company were doing your own thing, and then you did for a while inside of HP, >> We did. >> And that obviously changed, 'cause they wanted more integration, but, and Micro Focus, they know what they're doing, they know how to do acquisitions, they've been very successful. >> It's a very well run company, operationally. >> The SUSE piece was really interesting, spinning that out, because now RHEL is part of IBM, so now you've got SUSE as the lone independent. >> Yeah. >> Yeah. >> But I want to ask you, go back to a technology question, is NoSQL the next Hadoop? Are these databases, it seems to be that the hot fad now is NoSQL, it can do anything. Is the promise overblown? >> I think, I mean NoSQL has been out almost as long as Hadoop, and I, we always say not only SQL, right? Mike's said this from day one, best tool for the job. Nothing is going to do every job well, so I think that there are, whether it's key value stores or other types of NoSQL engines, document DB's, now you have some of these DB's that are running on different chips, >> Graph, yeah. >> there's always, yeah, graph DBs, there's always going to be specialty things. I think one of the things about our analytic platform is we can do, time series is a great example. Vertica's a great time series database. We can compete with specialized time series databases. But we also offer a lot of, the other things that you can do with Vertica that you wouldn't be able to do on a database like that. So, I always think there's going to be specialty products, I also think some of these can do a lot more workloads than you might think, but I don't see as much around the NoSQL movement as say I did a few years ago. >> But so, and you mentioned the cloud before as kind of, your position on it I think is a tailwind, not to put words in your mouth, >> Yeah, yeah, it's a great tailwind. >> You're in the Amazon marketplace, I mean they have products that are competitive, right? >> They do, they do. >> But, so how are you differentiating there? >> I think the way we differentiate, whether it's Redshift from Amazon, or BigQuery from Google, or even what Azure DB does is, first of all, Vertica, I think from, feature functionality and performance standpoint is ahead. Number one, I think the second thing, and we hear this from a lot of customers, especially at the C-level is they don't want to be locked into these full stacks of the clouds. Having the ability to take a product and run it across multiple clouds is a big thing, because the stack lock-in now, the full stack lock-in of these clouds is scary. It's really easy to develop in their ecosystems but you get very locked into them, and I think a lot of people are concerned about that. So that works really well for Vertica, but I think at the end of the day it's just, it's the robustness of the product, we continue to innovate, when you look at separating compute and storage, believe it or not, a lot of these cloud-native databases don't do that. And so we can actually leverage a lot of the cloud hardware better than the native cloud databases do themselves. So, like I said, we have to keep going, those guys aren't going to stop, and we actually have great relationships with those companies, we work really well with the clouds, they seem to care just as much about their cloud ecosystem as their own database products, and so I think that's going to continue as well. >> Well, Colin, congratulations on all the success >> Yeah, thank you, yeah. >> It's awesome to see you again and really appreciate you coming to >> Oh thank you, it's great, I appreciate the invite, >> MIT. >> it's great to be here. >> All right, keep it right there everybody, Paul and I will be back with our next guest from MIT, you're watching theCUBE. (electronic jingle)

Published Date : Jul 31 2019

SUMMARY :

brought to you by SiliconANGLE Media. I haven't seen you in awhile, kind of around the time we met. It's still cool. but at the end of the day I think is the current CEO of Vertica, (laughs) and if you go back to the roots of Vertica, at the new Encore Hotel. Well we better have theCUBE there, bro. And yeah, you've done that conference but let's talk the disruption for a minute. but we got to keep going you know, Have the tools to improve quality the right quality, you know, But I think that creates a lot of issues but I'd like you to elaborate on that becuase I think you can process a lot of data on Hadoop, and so they need to figure those things out. so one of the things about Vertica is we run everywhere, and frankly, I think cloud has helped Vertica a lot, I mean you're a software company. And then to you it's another hardware platform. the Linux company, which actually you know, and Micro Focus, they know what they're doing, so now you've got SUSE as the lone independent. is NoSQL the next Hadoop? Nothing is going to do every job well, the other things that you can do with Vertica and so I think that's going to continue as well. Paul and I will be back with our next guest from MIT,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Andy PalmerPERSON

0.99+

Paul GillinPERSON

0.99+

Dave VellantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

Colin MahoneyPERSON

0.99+

PaulPERSON

0.99+

ColinPERSON

0.99+

IBMORGANIZATION

0.99+

VerticaORGANIZATION

0.99+

Chris LynchPERSON

0.99+

HPEORGANIZATION

0.99+

Michael StonebreakerPERSON

0.99+

HPORGANIZATION

0.99+

Micro FocusORGANIZATION

0.99+

HadoopTITLE

0.99+

Colin MahonyPERSON

0.99+

last weekDATE

0.99+

AndyPERSON

0.99+

March 30thDATE

0.99+

NoSQLTITLE

0.99+

MikePERSON

0.99+

ExperianORGANIZATION

0.99+

tomorrowDATE

0.99+

SQLTITLE

0.99+

two dayQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

BostonLOCATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

4,000 engineersQUANTITY

0.99+

Two years agoDATE

0.99+

SUSETITLE

0.99+

Azure DBTITLE

0.98+

second thingQUANTITY

0.98+

20 years agoDATE

0.98+

10 years agoDATE

0.98+

oneQUANTITY

0.98+

VerticaTITLE

0.98+

HortonworksORGANIZATION

0.97+

MapReduceORGANIZATION

0.97+

one thingQUANTITY

0.97+

Mark Krzysko, US Department of Defense | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's The Cube, covering MIT Chief data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, everybody. We're here at Tang building at MIT for the MIT CDOIQ Conference. This is the 13th annual MIT CDOIQ. It started as a information quality conference and grew through the big data era, the Chief Data Officer emerged and now it's sort of a combination of those roles. That governance role, the Chief Data Officer role. Critical for organizations for quality and data initiatives, leading digital transformations ans the like. I'm Dave Vallante with my cohost Paul Gillin, you're watching The Cube, the leader in tech coverage. Mark Chrisco is here, the deputy, sorry, Principle Deputy Director for Enterprise Information at the Department of Defense. Good to see you again, thanks for coming on. >> Oh, thank you for having me. >> So, Principle Deputy Director Enterprise Information, what do you do? >> I do data. I do acquisition data. I'm the person in charge of lining the acquisition data for the programs for the Under Secretary and the components so a strong partnership with the army, navy, and air force to enable the department and the services to execute their programs better, more efficiently, and be efficient in the data management. >> What is acquisition data? >> So acquisition data generally can be considered best in the shorthand of cost schedule performance data. When a program is born, you have to manage, you have to be sure it's resourced, you're reporting up to congress, you need to be sure you have insight into the programs. And finally, sometimes you have to make decisions on those programs. So, cost schedule performance is a good shorthand for it. >> So kind of the key metrics and performance metrics around those initiatives. And how much of that is how you present that data? The visualization of it. Is that part of your role or is that, sort of, another part of the organization you partner with, or? >> Well, if you think about it, the visualization can take many forms beyond that. So a good part of the role is finding the authoritative trusted source of that data, making sure it's accurate so we don't spend time disagreeing on different data sets on cost schedule performance. The major programs are tremendously complex and large and involve and awful lot of data in the a buildup to a point where you can look at that. It's just not about visualizing, it's about having governed authoritative data that is, frankly, trustworthy that you can can go operate in. >> What are some of the challenges of getting good quality data? >> Well, I think part of the challenge was having a common lexicon across the department and the services. And as I said, the partnership with the services had been key in helping define and creating a semantic data model for the department that we can use. So we can have agreement on what it would mean when we were using it and collecting it. The services have thrown all in and, in their perspective, have extended that data model down through their components to their programs so they can better manage the programs because the programs are executed at a service level, not at an OSD level. >> Can you make that real? I mean, is there an example you can give us of what you mean by a common semantic model? >> So for cost schedule, let's take a very simple one, program identification. Having a key number for that, having a long name, a short name, and having just the general description of that, were in various states amongst the systems. We've had decades where, however the system was configured, configured it the way they wanted to. It was largely not governed and then trying to bring those data sets together were just impossible to do. So even with just program identification. Since the majority of the programs and numbers are executed at a service level, we worked really hard to get the common words and meanings across all the programs. >> So it's a governance exercise the? >> Yeah. It is certainly a governance exercise. I think about it as not so much as, in the IT world or the data world will call it governance, it's leadership. Let's settle on some common semantics here that we can all live with and go forward and do that. Because clearly there's needs for other pieces of data that we may or may not have but establishing a core set of common meanings across the department has proven very valuable. >> What are some of the key data challenges that the DOD faces? And how is your role helping address them? >> Well in our case, and I'm certain there's a myriad of data choices across the department. In our place it was clarity in and the governance of this. Many of the pieces of data were required by statute, law, police, or regulation. We came out of eras where data was the piece of a report and not really considered data. And we had to lead our ways to beyond the report to saying, "No, we're really "talking about key data management." So we've been at this for a few years and working with the services, that has been a challenge. I think we're at the part where we've established the common semantics for the department to go forward with that. And one of the challenges that I think is the access and dissemination of knowing what you can share and when you can share it. Because Michael Candolim said earlier that the data in mosaic, sometimes you really need to worry about it from our perspective. Is too much publicly available or should we protect on behalf of the government? >> That's a challenge. Is the are challenge in terms of, I'm sure there is but I wonder if you can describe it or maybe talk about how you might have solved it, maybe it's not a big deal, but you got to serve the mission of the organization. >> Absolutely. >> That's, like, number one. But at the same time, you've got stakeholders and they're powerful politicians and they have needs and there's transparency requirements, there are laws. They're not always aligned, those two directives, are they? >> No, thank goodness I don't have to deal with misalignments of those. We try to speak in the truth of here's the data and the decisions across the organization of our reports still go to congress, they go to congress on an annual basis through the selected acquisition report. And, you know, we are better understanding what we need to protect and how to advice congress on what should be protected and why. I would not say that's an easy proposition. The demands for those data come from the GAO, come from congress, come from the Inspector General and having to navigate that requires good access and dissemination controls and knowing why. We've sponsored some research though the RAND organization to help us look and understand why you have got to protect it and what policies, rules, and regulations are. And all those reports have been public so we could be sure that people would understand what it is. We're coming out of an era where data was not considered as it is today where reports were easily stamped with a little rubber stamp but data now moves at the velocities of milliseconds not as the velocity of reports. So we really took a comprehensive look at that. How do you manage data in a world where it is data and it is on infrastructures like data models. >> So, the future of war. Everybody talks about cyber as the future of war. There's a lot of data associated with that. How does that change what you guys do? Or does it? >> Well, I think from an acquisition perspective, you would think, you know. In that discussion that you just presented us, we're micro in that. We're equipping and acquiring through acquisitions. What we've done is we make sure that our data is shareable, you know? Open I, API structures. Having our data models. Letting the war fighters have our data so they could better understand where information is here. Letting other communities to better help that. By us doing our jobs where we sit, we can contribute to their missions and we've aways been every sharing in that. >> Is technology evolving to the point where, let's assume you could dial back 10 or 15 years and you had the nirvana of data quality. We know how fast technology is changing but is it changing as an enabler to really leverage that quality of data in ways that you might not have even envision 10 or 15 years ago? >> I think technology is. I think a lot of this is not in tools, it's now in technique and management practices. I think many of us find ourselves rethinking of how to do this now that you have data, now that you have tools that you can get them. How can you adopt better and faster? That requires a cultural change to organization. In some cases it requires more advanced skills, in other cases it requires you to think differently about the problems. I always like to consider that we, at some point, thought about it as a process-driven organization. Step one to step two to step three. Now process is ubiquitous because data becomes ubiquitous and you could refactor your processes and decisions much more efficiently and effectively. >> What are some of the information quality problems you have to wrestle with? >> Well, in our case, by setting a definite semantic meaning, we kicked the quality problems to those who provide the authoritative data. And if they had a quality problem, we said, "Here's your data. "We're going to now use it." So it spurs, it changes the model of them ensuring the quality of those who own the data. And by working with the services, they've worked down through their data issues and have used us a bit as the foil for cleaning up their data errors that they have from different inputs. And I like to think about it as flipping the model of saying, "It's not my job to drive quality, "it's my job to drive clarity, "it's their job to drive the quality into the system." >> Let's talk about this event. So, you guys are long-time contributors to the event. Mark, have you been here since the beginning? Or close to it? >> Um... About halfway through I think. >> When the focus was primarily on information quality? >> Yes. >> Was it CDOIQ at the time or was it IQ? >> It was the very beginnings of CDOIQ. It was right before it became CDOIQ. >> Early part of this decade? >> Yes. >> Okay. >> It was Information Quality Symposium originally, is that was attracted you to it? >> Well, yes, I was interested in it because I think there were two things that drew my interest. One, a colleague had told me about it and we were just starting the data journey at that point. And it was talking about information quality and it was out of a business school in the MIT slenton side of the house. And coming from a business perspective, it was not just the providence of IT, I wanted to learn form others because I sit on the business side of the equation. Not a pure IT-ist or technology. And I came here to learn. I've never stopped learning through my entire journey here. >> What have you learned this week? >> Well, there's an awful lot I learned. I think it's been... This space is evolving so rapidly with the law, policy, and regulation. Establishing the CDOs, establishing the roles, getting hear from the CDOs, getting to hear from visions, hear from Michael Conlan and hear from others in the federal agencies. Having them up here and being able to collaborate and talk to them. Also hearing from the technology people, the people that're bringing solutions to the table. And then, I always say this is a bit like group therapy here because many of us have similar problems, we have different start and end points and learning from each other has proven to be very valuable. From the hallway conversations to hearing somebody and seeing how they thought about the products, seeing how commercial industry has implemented data management. And you have a lot of similarity of focus of people dealing with trying to bring data to bring value to the organizations and understanding their transformations, it's proven invaluable. >> Well, what did the appointment of the DOD's first CDO last year, what statement did that make to the organization? >> That data's important. Data are important. And having a CDO in that and, when Micheal came on board, we shared some lessons learned and we were thinking about how to do that, you know? As I said, I function in a, arguably a silo of the institution is the acquisition data. But we were copying CDO homework so it helped in my mind that we can go across to somebody else that would understand and could understand what we're trying to do and help us. And I think it becomes, the CDO community has always been very sharing and collaborative and I hold that true with Micheal today. >> It's kind of the ethos of this event. I mean, obviously you guys have been heavily involved. We've always been thrilled to cover this. I think we started in 2013 and we've seen it grow, it's kind of fire marshal full now. We got to get to a new facility, I understand. >> Fire marshal full. >> Next year. So that's congratulations to all the success. >> Yeah, I think it's important and we've now seen, you know, you hear it, you can read it in every newspaper, every channel out there, that data are important. And what's more important than the factor of governance and the factor of bringing safety and security to the nation? >> I do feel like a lot in, certainly in commercial world, I don't know if it applies in the government, but a lot of these AI projects are moving really fast. Especially in Silicon Valley, there's this move fast and break things mentality. And I think that's part of why you're seeing some of these big tech companies struggle right now because they're moving fast and they're breaking things without the governance injected and many CDOs are not heavily involved in some of these skunk works projects and it's almost like they're bolting on governance which has never been a great formula for success in areas like governance and compliance and security. You know, the philosophy of designing it in has tangible benefits. I wonder if you could comment on that? >> Yeah, I can talk about it as we think about it in our space and it may be limited. AI is a bit high on the hype curve as you might imagine right now, and the question would be is can it solve a problem that you have? Well, you just can't buy a piece of software or a methodology and have it solve a problem if you don't know what problem you're trying to solve and you wouldn't understand the answer when it gave it to you. And I think we have to raise our data intellectualism across the organization to better work with these products because they certainly represent utility but it's not like you give it with no fences on either side or you open up your aperture to find basic solution on this. How you move forward with it is your workforce has got to be in tune with that, you have to understand some of the data, at least the basics, and particularly with products when you get the machine learning AI deep learning, the models are going to be moving so fast that you have to intellectually understand them because you'll never be able to go all the way back and stubby pencil back to an answer. And if you don't have the skills and the math and the understanding of how these things are put together, it may not bring the value that they can bring to us. >> Mark, thanks very much for coming on The Cube. >> Thank you very much. >> Great to see you again and appreciate all the work you guys both do for the community. All right. And thank you for watching. We'll be right back with our next guest right after this short break. You're watching The Cube from MIT CDOIQ.

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Good to see you again, thanks for coming on. and be efficient in the data management. And finally, sometimes you have to make another part of the organization you partner with, or? and involve and awful lot of data in the a buildup And as I said, the partnership with the services and having just the general description of that, in the IT world or the data world And one of the challenges that I think but you got to serve the mission of the organization. But at the same time, you've got stakeholders and the decisions across the organization How does that change what you guys do? In that discussion that you just presented us, and you had the nirvana of data quality. rethinking of how to do this now that you have data, So it spurs, it changes the model of them So, you guys are long-time contributors to the event. About halfway through I think. It was the very beginnings of CDOIQ. in the MIT slenton side of the house. getting hear from the CDOs, getting to hear from visions, and we were thinking about how to do that, you know? It's kind of the ethos of this event. So that's congratulations to all the success. and the factor of bringing safety I don't know if it applies in the government, across the organization to better work with these products all the work you guys both do for the community.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Peter BurrisPERSON

0.99+

Dave VellantePERSON

0.99+

Michael DellPERSON

0.99+

Rebecca KnightPERSON

0.99+

MichaelPERSON

0.99+

ComcastORGANIZATION

0.99+

ElizabethPERSON

0.99+

Paul GillanPERSON

0.99+

Jeff ClarkPERSON

0.99+

Paul GillinPERSON

0.99+

NokiaORGANIZATION

0.99+

SavannahPERSON

0.99+

DavePERSON

0.99+

RichardPERSON

0.99+

MichealPERSON

0.99+

Carolyn RodzPERSON

0.99+

Dave VallantePERSON

0.99+

VerizonORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

Eric SeidmanPERSON

0.99+

PaulPERSON

0.99+

Lisa MartinPERSON

0.99+

GoogleORGANIZATION

0.99+

KeithPERSON

0.99+

Chris McNabbPERSON

0.99+

JoePERSON

0.99+

CarolynPERSON

0.99+

QualcommORGANIZATION

0.99+

AlicePERSON

0.99+

2006DATE

0.99+

JohnPERSON

0.99+

NetflixORGANIZATION

0.99+

AWSORGANIZATION

0.99+

congressORGANIZATION

0.99+

EricssonORGANIZATION

0.99+

AT&TORGANIZATION

0.99+

Elizabeth GorePERSON

0.99+

Paul GillenPERSON

0.99+

Madhu KuttyPERSON

0.99+

1999DATE

0.99+

Michael ConlanPERSON

0.99+

2013DATE

0.99+

Michael CandolimPERSON

0.99+

PatPERSON

0.99+

Yvonne WassenaarPERSON

0.99+

Mark KrzyskoPERSON

0.99+

BostonLOCATION

0.99+

Pat GelsingerPERSON

0.99+

DellORGANIZATION

0.99+

Willie LuPERSON

0.99+

IBMORGANIZATION

0.99+

YvonnePERSON

0.99+

HertzORGANIZATION

0.99+

AndyPERSON

0.99+

2012DATE

0.99+

MicrosoftORGANIZATION

0.99+

Lisa Ehrlinger, Johannes Kepler University | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Hi, everybody, welcome back to Cambridge, Massachusetts. This is theCUBE, the leader in tech coverage. I'm Dave Vellante with my cohost, Paul Gillin, and we're here covering the MIT Chief Data Officer Information Quality Conference, #MITCDOIQ. Lisa Ehrlinger is here, she's the Senior Researcher at the Johannes Kepler University in Linz, Austria, and the Software Competence Center in Hagenberg. Lisa, thanks for coming in theCUBE, great to see you. >> Thanks for having me, it's great to be here. >> You're welcome. So Friday you're going to lay out the results of the study, and it's a study of Data Quality Tools. Kind of the long tail of tools, some of those ones that may not have made the Gartner Magic Quadrant and maybe other studies, but talk about the study and why it was initiated. >> Okay, so the main motivation for this study was actually a very practical one, because we have many company projects with companies from different domains, like steel industry, financial sector, and also focus on automotive industry at our department at Johannes Kepler University in Linz. We have experience with these companies for more than 20 years, actually, in this department, and what reoccurred was the fact that we spent the majority of time in such big data projects on data quality measurement and improvement tasks. So at some point we thought, okay, what possibilities are there to automate these tasks and what tools are out there on the market to automate these data quality tasks. So this was actually the motivation why we thought, okay, we'll look at those tools. Also, companies ask us, "Do you have any suggestions? "Which tool performs best in this-and-this domain?" And I think this study answers some questions that have not been answered so far in this particular detail, in these details. For example, Gartner Magic Quadrant of Data Quality Tools, it's pretty interesting but it's very high-level and focusing on some global windows, but it does not look on the specific measurement functionalities. >> Yeah, you have to have some certain number of whatever, customers or revenue to get into the Magic Quadrant. So there's a long tail that they don't cover. But talk a little bit more about the methodology, was it sort of you got hands-on or was it more just kind of investigating what the capabilities of the tools were, talking to customers? How did you come to the conclusions? >> We actually approached this from a very scientific side. We conducted a systematic search, which tools are out there on the market, not only industrial tools, but also open-sourced tools were included. And I think this gives a really nice digest of the market from different perspectives, because we also include some tools that have not been investigated by Gartner, for example, like more BTQ, Data Quality, or Apache Griffin, which has really nice monitoring capabilities, but lacks some other features from these comprehensive tools, of course. >> So was the goal of the methodology largely to capture a feature function analysis of being able to compare that in terms of binary, did it have it or not, how robust is it? And try to develop a common taxonomy across all these tools, is that what you did? >> So we came up with a very detailed requirements catalog, which is divided into three fields, like the focuses on data profiling to get a first insight into data quality. The second is data quality management in terms of dimensions, metrics, and rules. And the third part is dedicated to data quality monitoring over time, and for all those three categories, we came up with different case studies on a database, on a test database. And so we conducted, we looked, okay, does this tool, yes, support this feature, no, or partially? And when partially, to which extent? So I think, especially on the partial assessment, we got a lot into detail in our survey, which is available on Archive online already. So the preliminary results are already online. >> How do you find it? Where is it available? >> On Archive. >> Archive? >> Yes. >> What's the URL, sorry. Archive.com, or .org, or-- >> Archive.org, yeah. >> Archive.org. >> But actually there is a ID I have not with me currently, but I can send you afterwards, yeah. >> Yeah, maybe you can post that with the show notes. >> We can post it afterwards. >> I was amazed, you tested 667 tools. Now, I would've expected that there would be 30 or 40. Where are all of these, what do all of these long tail tools do? Are they specialized by industry or by function? >> Oh, sorry, I think we got some confusion here, because we identified 667 tools out there on the market, but we narrowed this down. Because, as you said, it's quite impossible to observe all those tools. >> But the question still stands, what is the difference, what are these very small, niche tools? What do they do? >> So most of them are domain-specific, and I think this really highlights also these very basic early definition about data quality, of like data qualities defined as fitness for use, and we can pretty much see it here that we excluded the majority of these tools just because they assess some specific kind of data, and we just really wanted to find tools that are generally applicable for different kinds of data, for structured data, unstructured data, and so on. And most of these tools, okay, someone came up with, we want to assess the quality of our, I don't know, like geological data or something like that, yeah. >> To what extent did you consider other sort of non-technical factors? Did you do that at all? I mean, was there pricing or complexity of downloading or, you know, is there a free version available? Did you ignore those and just focus on the feature function, or did those play a role? >> So basically the focus was on the feature function, but of course we had to contact the customer support. Especially with the commercial tools, we had to ask them to provide us with some trial licenses, and there we perceived different feedback from those companies, and I think the best comprehensive study here is definitely Gartner Magic Quadrant for Data Quality Tools, because they give a broad assessment here, but what we also highlight in our study are companies that have a very open support and they are very willing to support you. For example, Informatica Data Quality, we perceived a really close interaction with them in terms of support, trial licenses, and also like specific functionality. Also Experian, our contact from Experian from France was really helpful here. And other companies, like IBM, they focus on big vendors, and here, it was not able to assess these tools, for example, yeah. >> Okay, but the other differences of the Magic Quadrant is you guys actually used the tools, played with them, experienced firsthand the customer experience. >> Exactly, yeah. >> Did you talk to customers as well, or, because you were the customer, you had that experience. >> Yes, I were the customer, but I was also happy to attend some data quality event in Vienna, and there I met some other customers who had experience with single tools. Not of course this wide range we observed, but it was interesting to get feedback on single tools and verify our results, and it matched pretty good. >> How large was the team that ran the study? >> Five people. >> Five people, and how long did it take you from start to finish? >> Actually, we performed it for one year, roughly. The assessment. And I think it's a pretty long time, especially when you see how quick the market responds, especially in the open source field. But nevertheless, you need to make some cut, and I think it's a very recent study now, and there is also the idea to publish it now, the preliminary results, and we are happy with that. >> Were there any surprises in the results? >> I think the main results, or one of the surprises was that we think that there is definitely more potential for automation, but not only for automation. I really enjoyed the keynote this morning that we need more automation, but at the same time, we think that there is also the demand for more declaration. We observed some tools that say, yeah, we apply machine learning, and then you look into their documentation and find no information, which algorithm, which parameters, which thresholds. So I think this is definitely, especially if you want to assess the data quality, you really need to know what algorithm and how it's attuned and give the user, which in most case will be a technical person with technical background, like some chief data officer. And he or she really needs to have the possibility to tune these algorithms to get reliable results and to know what's going on and why, which records are selected, for example. >> So now what? You're presenting the results, right? You're obviously here at this conference and other conferences, and so it's been what, a year, right? >> Yes. >> And so what's the next wave? What's next for you? >> The next wave, we're currently working on a project which is called some Knowledge Graph for Data Quality Assessment, which should tackle two problems in ones. The first is to come up with a semantic representation of your data landscape in your company, but not only the data landscape itself in terms of gathering meta data, but also to automatically improve or annotate this data schema with data profiles. And I think what we've seen in the tools, we have a lot of capabilities for data profiling, but this is usually left to the user ad hoc, and here, we store it centrally and allow the user to continuously verify newly incoming data if this adheres to this standard data profile. And I think this is definitely one step into the way into more automation, and also I think it's the most... The best thing here with this approach would be to overcome this very arduous way of coming up with all the single rules within a team, but present the data profile to a group of data, within your data quality project to those peoples involved in the projects, and then they can verify the project and only update it and refine it, but they have some automated basis that is presented to them. >> Oh, great, same team or new team? >> Same team, yeah. >> Oh, great. >> We're continuing with it. >> Well, Lisa, thanks so much for coming to theCUBE and sharing the results of your study. Good luck with your talk on Friday. >> Thank you very much, thank you. >> All right, and thank you for watching. Keep it right there, everybody. We'll be back with our next guest right after this short break. From MIT CDOIQ, you're watching theCUBE. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and the Software Competence Center in Hagenberg. it's great to be here. Kind of the long tail of tools, Okay, so the main motivation for this study of the tools were, talking to customers? And I think this gives a really nice digest of the market And the third part is dedicated to data quality monitoring What's the URL, sorry. but I can send you afterwards, yeah. Yeah, maybe you can post that I was amazed, you tested 667 tools. Oh, sorry, I think we got some confusion here, and I think this really highlights also these very basic So basically the focus was on the feature function, Okay, but the other differences of the Magic Quadrant Did you talk to customers as well, or, and there I met some other customers and we are happy with that. or one of the surprises was that we think but present the data profile to a group of data, and sharing the results of your study. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Lisa EhrlingerPERSON

0.99+

Paul GillinPERSON

0.99+

IBMORGANIZATION

0.99+

Dave VellantePERSON

0.99+

HagenbergLOCATION

0.99+

LisaPERSON

0.99+

ViennaLOCATION

0.99+

LinzLOCATION

0.99+

Five peopleQUANTITY

0.99+

30QUANTITY

0.99+

Johannes Kepler UniversityORGANIZATION

0.99+

40QUANTITY

0.99+

FridayDATE

0.99+

one yearQUANTITY

0.99+

667 toolsQUANTITY

0.99+

FranceLOCATION

0.99+

three categoriesQUANTITY

0.99+

third partQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

ExperianORGANIZATION

0.99+

secondQUANTITY

0.99+

two problemsQUANTITY

0.99+

more than 20 yearsQUANTITY

0.99+

GartnerORGANIZATION

0.99+

single toolsQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.98+

firstQUANTITY

0.98+

MIT CDOIQORGANIZATION

0.98+

a yearQUANTITY

0.97+

three fieldsQUANTITY

0.97+

Apache GriffinORGANIZATION

0.97+

Archive.orgOTHER

0.96+

.orgOTHER

0.96+

one stepQUANTITY

0.96+

Linz, AustriaLOCATION

0.95+

oneQUANTITY

0.94+

singleQUANTITY

0.94+

first insightQUANTITY

0.93+

theCUBEORGANIZATION

0.92+

2019DATE

0.92+

this morningDATE

0.91+

BTQORGANIZATION

0.91+

MIT Chief Data Officer andEVENT

0.9+

Archive.comOTHER

0.88+

InformaticaORGANIZATION

0.85+

Software Competence CenterORGANIZATION

0.84+

Information Quality Symposium 2019EVENT

0.81+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.72+

Data QualityORGANIZATION

0.67+

#MITCDOIQEVENT

0.65+

Magic QuadrantCOMMERCIAL_ITEM

0.63+

MagicCOMMERCIAL_ITEM

0.45+

nextEVENT

0.44+

waveEVENT

0.43+

Magic QuadrantORGANIZATION

0.43+

waveDATE

0.41+

MagicTITLE

0.39+

Veda Bawo, Raymond James & Althea Davis, ING Bank | MIT CDOIQ 2019


 

>> From Cambridge Massachusetts, it's the CUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by silicon angle media. >> Welcome back to Cambridge Massachusetts everybody you're watching the cube. The leader in live tech coverage. The cubes two day coverage of MIT's CDOIQ. The chief data officer information quality event. Thirteenth year we started here in 2013. I'm Dave Vallante with my co-host Paul Gillin. Veda Bawo. Bowo. Bawo. Sorry Veda Bawo is here. Did I get that right? >> That's close enough. >> The director of data governance at Raymond James and Althea Davis the former chief data officer of ING bank challengers and growth markets. Ladies welcome to the cube thanks so much for coming on. >> Thank you. >> Thank you. >> Hi Vita, talk about your role at Raymond James. Relatively new role for you? >> It is a relatively new role. So I recently left fifth third bank as their managing director of data governance and I've moved on to Raymond James in sunny Florida. And I am now the director of data governance for Raymond James. So it's a global financial services company they do asset wealth management, investment banking, retail banking. So I'm excited, I'm very excited about it. >> So we've been talking all day and actually several years about how the chief data officer role kind of emerged from the back office of the data governance. >> Mmm >> And the information quality and now its come you know front and center. And actually we've seen a full circle because now it's all about data quality again. So Althea as the former CDO right is that a fair assessment that it sort of came out of the ashes of the back room. >> Yeah, I mean its definitely a fair assessment. That's where we got started. That's how we got our budgets that's how we got our teams. However, now we have to serve many masters. We have to deal with all of the privacy, we have to deal with the multiple compliancies. We have to deal with the data operations and we have to deal with all of the new, sexy emerging technologies. So to do AI and data science you need a lot of data. You need data rich. You need it to be knowledge management, you need it to be information management. And it needs to be intelligent. So we need to actually raise the bar on what we do and at the same time get the credibility from our sea sweet peers. >> Well I think we no longer have the. We don't have the luxury of being just a cost center anymore . >> No. >> Right, we have to generate revenue. So it's about data monetization. It's about partnering with our businesses to make sure that we're helping to drive strategy and deliver results for the broader organization. >> So you got to hit the bottom line. >> Yeah. >> Either raise revenue or cut costs >> Yeah absolutely >> You know directly that can be tangibly monetized. >> Exactly keep them out of jail. Right. Save money >> That too. >> Save money, make money. (inaudible laughter) keep them out of jail. >> Like both CDO's you do not study for this career path because it didn't exist a few years ago. So talk about your backgrounds and how you came to come into this role Veda. >> Yeah absolutely so you know you talked about you know data kind of starting in the bowels of the back office. So I am that person right. So I am an accountant by training. So I am the person who is non legally entity controllership by book journal entries I've closed the books. I've done regulatory reporting so I know what it feels like to have to deal with dirty data every single month end, every single quarter end right. And I know the pain of having to cleanse it and having to deal with our business partners and having experienced that gave me the passion to want to do better. Right so I want to influence my partners upstream to do better as well as to take away some of the pain points that my teams experiencing over and over again it really was groundhog day. So that really made me feel passionate about going into the data discipline. Right and so you know the benefit is great it's not an easy journey but yeah out of accounting finance and that kind of back office operational support was boring right. A data evangelist and some passionate were about it. >> Which made sense because you have to have quality. >> Absolutely. >> Consistency. You have to have so called single version of the truth. >> Absolutely because you look regularly there's light for the financial reports to be accurate. All the time. (laughter) >> Exactly >> How about you? >> I came at it from a totally different angle. I was a marketeer so I was a business manager, a marketeer I was working with the big retail brands you know the Nikes and the Levi's strauss's of the world. So I came to it from a value chain perspective from marketing you know from rolling out retail chains across Europe. And I went from there as a line management position and all the pains of the different types of data we needed and then did quite a bit of consulting with some of the big consultancies accenture. And then rolled more into the data migration so dealing with those huge change projects and having teams from all of the world. And knowing the pains what all of the guys didn't want to work on. I got it all on my plate. But it put me in position to be a really solid chief data officer. >> Somebody it was called like data chicks or something like that (laughter) and I snuck in I was like the lone >> Data chicks >> I was like the lone data dude >> You can be a data chick. It's okay no judgement here. >> And so one of the things that one of the CDO's said there. She was a woman obviously. And she said you know I think that and the stat was there was a higher proportion of women as CDO's than there were across tech which is like I don't know fifty seventeen percent. And she's positive that the reason was because it's like a thankless job that nobody wants and so I just wonder as woman CDO your thoughts on that is that true. >> Well first of all we're the newest to the table right so you're the new kid on the block it doesn't matter if you're man or woman you're the new kid on the block so you know the CFO's got the four thousand year history behind him or her. The CIO or CTO they've got the fifty, sixty year up on us. So we're new. So you have to calve out your space and I do think that a lot of women by nature like to take on things big. To do things that other people don't want to do. So I can see how women kind of fell into that. But, at the same time you know data it's an asset and it is the newest asset. And it's definitely misunderstood. So I do think that you know women you know we kind of fell into it but it was actually something that happened good for women because there's a big future in data. >> Well let's just be realistic right. Woman have unique skillset. I may be a little bias but we have a unique skillset. We're able to solve problems creatively. Right there's no one size fits all solution for data. There's no accounting pronouncement that tells me how to handle and manage my data. Right I have to kind of figure it out as I go along and pivot when something doesn't work. I think that's something that is very natural to women. >> Yeah. >> I think that contributes to us kind of taking on these roles. >> Can I just do a little survey here (laughter) We hear that the chief data officer of function is defined differently at different organizations. Now you both are in financial services. You both have a chief data function. Are you doing the same thing? (laughter) >> Absolutely not! (laughter) >> You know this is data by design. I mean I'm getting lucky I've had teams that go the whole gammon right so. From the compliancy side through to the data operations through to all of the like I said the exotics, sexy you know emerging technologies stuff with the data scientists. So I've had the whole thing. I've also had my last position at ING bank I had to you know lead a team of chief data officers across three different continents Australia, Asia and also Eastern and Western Europe. So it's totally different than you know maybe another company that they've only got to chief data officer working on data quality and data governance. >> So again another challenge of being the new kid on the block right. Defining roles and responsibilities. There's no one globally, universally accepted definition of what a chief data officer should do. >> Right >> Right is data science in or out are analytics in or out. Right. >> Security sometimes. >> Security right sometimes privacy is it or out. Do you have operational responsibilities or are you truly just a second line governance function right? There's a mixed bag out there in the industry. I don't know that we have one answer that we know for sure is true. But I do know for sure is that data is not an IT function. >> Well okay. That's really important. >> It's not an IT asset. >> Yeah. >> I want to say that it's not an IT asset. It is an information asset or a data asset which is a different asset than an IT asset or a financial asset or a human asset. >> But and that's the other big change is that fifteen. Ten to fifteen years ago data was assumed to be a liability right. >> Mmm. >> Federal rules set up a civil procedure we got to get rid of the data or you know we're going to get sued. Number one and number two is that data because it's digital you know people say data is the new oil. I always say it's not. It's more important than oil. >> It's like blood. >> Oil you can only use in one use case. Data you can reuse over and over again. >> Reuse, reuse perpetual. It goes on and on and on. And every time you reuse it the value increases. So I would agree with you it is not the new oil. It is much bigger than that and it needs to I mean I know from some of my colleagues in the profession. We talk about borrowing from other more mature disciplines to make data management, information management and knowledge management much more robust and be much more professional. We also need to be more professional about it as the data leaders. >> So when you're a little panel today. One of the things that you guys addressed is what keeps the CDO up at night. >> Yes >> I presume it's data. (laughter) >> No, no, no. >> It's our payers that don't get it. (laughter) >> That's what keeps us up at night. >> Its the sponsors that keep us up at night. (laughter) So what was that discussion like? >> So yeah I mean it was a lively discussion. Um, great attendance at the panel so we appreciate everyone who came out and supported. >> Full house. >> Definitely a full house. Great reviews so far. >> Yep. >> Okay, so the thing that definitely keeps folks up at night and I'm going to start with my standard one which is quality. Right you can have all of the fancy tools, right you can have a million data scientists but if the quality is not good or sufficient. Then you're no where. So quality is fundamentally the thing that the CDO has to always pay attention to. And there's no magic you know pill or magic right potion that's going to make the quality right. It's something that the entire organization has a rally around. And it's not a one thing done right it has to be a sustainable approach to making sure the quality is good enough so that you can actually reap the benefits or derive the value right from your data. >> Absolutely and I would say you know following on from the quality and I consider that trustworthiness of the data. I would say as a chief data officer you're coming to the table. You're coming to the executive table you need to bring it all so you need to be impactful. You need to be absolutely relevant to your peers. You also need to be able to make their teams in a position to act. So it needs to be actionable. And if you don't have all of that combination with the trustworthiness you're dead in the water. So it is a hard act and that's why there is a high attrition for chief data officers. You know it's a hard job. But I think it's very much worthwhile because this particular asset this new asset we haven't been able to even scratch the surface of what it could mean for us a society and for commercial organizations or government organizations. >> To your point it's not a technology problem when Mark Ramsay who was surveying the audience this morning. He said you know why have we had so many failures and the first hand that went up said. It's because of relations with the database. >> And I wanted to say it's not a technology problem. >> It's a hearts, minds and haves >> Absolutely. Absolutely. You couldn't make an impact to your data landscape without changing your technology. >> You said at the outset how important it is for you to show a bottom line impact. >> Right >> What's one project you've worked on or that you've led in your tenure that did that. >> If we're talking about for example I can't say specifics but if we're looking at one of institutions I worked at in an insurance firm and we looked at the customer journey. So we worked with some of the different departments that traditionally did not get access to data for them to be able to be effective at their jobs. But they wanted to do in marketing was create actually new products to make you know increase the wallet from the existing customers other things they wanted to do was for example, when there were problems with the customers instead of customer you know leaving you know the journey they were able to bring them back in by getting access to the data. So we either gave them insight like you know looking back to make sure that things didn't happen wrong the next time or we helped them giving them information so they could develop new products so this is all about going to market. So that's absolutely bottom line. It's not just all cost efficiency and products to begin . >> Yeah pipeline. (laughter) >> And that's really valid but you know. >> Absolutely so I'll give you one example where the data organization partnered with our data scientists. To try to figure out the best location for various branches. For that particular institution. And it was taking right trillions of data points right about current footprint as well as other information about geographic information that was out there publicly available. Taking that and using the analytics to figure out okay where should we have our branches, our ATM's etc... and then conslidating the footprint or expanding where appropriate. So that is bottom line impact for sure. >> I remember in the early part of the two thousands I remember reading a Harvard business review article about gut feel trumps data every time. But that's an example where no way. >> Nope. >> You could never do better with the gut than that example that you just gave. >> Absolutely. >> Veda. I want to ask you a question. I don't know if you've heard Mark Ramsays talk this morning but he sort of. He sort of declared that data governance was over. >> Mmm. >> And as the director of data governance >> Never! >> I wondered if you would disagree with that. >> Never! >> Look. >> Were you surprised? >> It's just like saying that I should stop brushing my teeth. Right I always will have to maintain a certain level of data hygiene. And I don't think that employees and executives and organizations have reached a level of maturity where I can trust them to maintain that level of hygiene independently. And therefore I need a governance function. I need to check to make sure you brush your teeth in the morning and in the evening. Right and I need you to go for your annual exam to make sure you don't have any cavities that weren't detected. Right so I think that there's still a role for governance to play. It will evolve over time for sure. Right as you know the landscape changes but I think there's still a role right for like governance. >> And that wasn't my takeaway part. I think he said that basically enterprise data warehouse fail massive data management fail. The single data model failed so we punted to governance and that's not going to solve the enterprise data problem. >> I think it's a one leg in the stool. It's one leg in the stool. ` >> Yeah I think I would really sum it up as a monolithic data storage approach failed. Like that. And then our attention went to data governance but that's not going to solve it either. Look, data management is about twelve different data capabilties it's a discipline so we give the title data governance but it means multiple things. And I think that if we're more educated and we have more confidence on what we're doing on those different areas. Plus information and knowledge management then we're way ahead of the game. I mean knowledge graphs and semantics. That puts companies you know at the top of that you know corporate inequality gap that we're looking at right now. Where you know companies are you know five and thousand times more valuable then their competition and the gap is just going to get bigger considering if some of those companies at the bottom of the gap are you know just keep on doing the same thing. >> I agree I was just trying to get you worked up. (laughter) >> Well you did. >> It's going to be a different kind of show. >> But that point you're making. Microsoft, Apple, Amazon and Google, Facebook. Top five companies in terms of market cap. And they're all data companies. They surpass all the financial services, all the energy companies, all the manufacturers. >> And Alibaba same thing. >> Oh yeah. >> They're doing the same thing. >> They're coming right up there. With four or five hundred billion. >> They're all doing the knowledge approach. They're doing all of this stuff and that's a much more comprehensive approach to looking at it as a full spectrum and if we keep on in the financial industry or any industry keep on just kind of looking at little bits and pieces. It's not going to work. It's a lot of talk but there's no action. >> We are losing right. I know that Fintechs are right fringing upon are territory. Right if Amazon can provide a credit card or lend you money or extend you credit. They're now functioning as a traditional bank would. If we're not paying attention to them as real competitors. We've lost the battle. >> That's a really important point you're making because it's all digital now. >> Absolutely. >> You used to be you'd never see companies traverse industries and now you see it Apple pay and Amazon and healthcare. >> Yeah. >> And government organizations teaming up with corporations and individuals. Everything is free flowing so that means the knowledge and the data and the information also needs to flow freely but it needs to be managed. >> Now you're into a whole realm of privacy and security. >> And regulations right. Regulations for the non right traditional banks. So we're doing banking transactions. >> Do you think traditional banks will lose control over the payment systems? >> If they don't move with the time they will. If they don't. I mean it's not something that's going to happen tomorrow but you know there is a category of bank called Challenger banks so there's a reason. You know even within their own niche there's a group of banks. >> I mean not even just payments right. Think about cash transactions like if I do money transfer am I going to my traditional bank to do it or am I going to cashapp. >> I think it's interesting particularly in the retail banking business where you know one banking app looks pretty much like other and people don't go to branches anymore and so that brand affinity that used to exist is harder and harder to maintain and I wonder what role does data play in reestablishing that connection. >> Well for me right I get really excited and sometimes annoyed when I can open up my app for my bank and I can see the pie chart of my spending. They're using my data to inform me about my behaviors sometimes a good story, sometimes a bad story. But they're using it to inform me. That's making me more loyal to that particular institution right so I can also link all of my financial accounts in that one institutions app and I can see a full list of all of my credit cards, all of my loans, all of my investments in one stop shopping. That's making me go to their app more often versus the other options that are out there. So I think we can use the data in order to endear the customer source but we have to be smart about it. >> That's the accountant in you. I just refuse to not look. (laughter) >> You can afford to not look. I can't. >> Thank you. >> Thanks for riling us up. >> Alright thank you for watching everybody we'll be right back with our next guest right after this short break. You're watching the cube from MIT in Boston, Cambridge. Right back. (atmospheric music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by silicon angle media. Did I get that right? and Althea Davis the former chief data officer Hi Vita, talk about your role at Raymond James. And I am now the director of data of the data governance. So Althea as the former CDO right is that So to do AI and data science you need a lot of data. We don't have the luxury of being and deliver results for the broader organization. Right. keep them out of jail. you came to come into this role Veda. And I know the pain of having to cleanse it You have to have so called single version of the truth. light for the financial reports to be accurate. So I came to it from a value chain perspective You can be a data chick. And she's positive that the reason was because But, at the same time you know data it's an asset Right I have to kind of figure it out as I go along I think that contributes to us kind of We hear that the chief data officer of function I had to you know lead a team of chief data officers the new kid on the block right. Right is data science in or out are I don't know that we have one answer that we know That's really important. I want to say that it's not an IT asset. But and that's the other big change is that fifteen. we got to get rid of the data or you know Data you can reuse over and over again. So I would agree with you it is not the new oil. One of the things that you guys addressed I presume it's data. It's our payers that don't get it. Its the sponsors that keep us up at night. Um, great attendance at the panel so we appreciate Great reviews so far. the thing that the CDO has to always pay attention to. So it needs to be actionable. and the first hand that went up said. You couldn't make an impact to your data it is for you to show a bottom line impact. or that you've led in your tenure that did that. actually new products to make you know increase (laughter) Absolutely so I'll give you one example I remember in the early part of the two thousands than that example that you just gave. He sort of declared that data governance was over. I need to check to make sure you brush your and that's not going to solve the enterprise data problem. It's one leg in the stool. and the gap is just going to get bigger considering I agree I was just trying to get you worked up. all the energy companies, all the manufacturers. They're coming right up there. It's not going to work. I know that Fintechs are right fringing upon are territory. That's a really important point you're industries and now you see it and the data and the information also needs to Regulations for the non right traditional banks. I mean it's not something that's going to happen tomorrow am I going to my traditional bank to do it banking business where you know one banking app looks and I can see the pie chart of my spending. I just refuse to not look. You can afford to not look. Alright thank you for watching everybody we'll

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Mark RamsayPERSON

0.99+

Paul GillinPERSON

0.99+

MicrosoftORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

Veda BawoPERSON

0.99+

AppleORGANIZATION

0.99+

Dave VallantePERSON

0.99+

GoogleORGANIZATION

0.99+

AlibabaORGANIZATION

0.99+

2013DATE

0.99+

EuropeLOCATION

0.99+

ING BankORGANIZATION

0.99+

VitaPERSON

0.99+

fiveQUANTITY

0.99+

fiftyQUANTITY

0.99+

NikesORGANIZATION

0.99+

fourQUANTITY

0.99+

INGORGANIZATION

0.99+

MITORGANIZATION

0.99+

Raymond JamesORGANIZATION

0.99+

Althea DavisPERSON

0.99+

fifty seventeen percentQUANTITY

0.99+

bothQUANTITY

0.99+

two dayQUANTITY

0.99+

Levi'sORGANIZATION

0.99+

one legQUANTITY

0.99+

Mark RamsaysPERSON

0.99+

tomorrowDATE

0.99+

four thousand yearQUANTITY

0.99+

AustraliaLOCATION

0.99+

two thousandsQUANTITY

0.99+

five hundred billionQUANTITY

0.98+

Cambridge MassachusettsLOCATION

0.98+

AsiaLOCATION

0.98+

BawoPERSON

0.98+

todayDATE

0.98+

VedaPERSON

0.98+

singleQUANTITY

0.97+

Western EuropeLOCATION

0.97+

one exampleQUANTITY

0.97+

OneQUANTITY

0.97+

EasternLOCATION

0.97+

oneQUANTITY

0.97+

TenDATE

0.97+

Boston, CambridgeLOCATION

0.97+

Thirteenth yearQUANTITY

0.97+

fifteenDATE

0.96+

second lineQUANTITY

0.96+

a million data scientistsQUANTITY

0.95+

Raymond JamesPERSON

0.95+

five companiesQUANTITY

0.95+

fifteen years agoDATE

0.94+

fifth third bankQUANTITY

0.94+

BowoPERSON

0.94+

sixty yearQUANTITY

0.93+

trillions of data pointsQUANTITY

0.92+

ING bankORGANIZATION

0.92+

one use caseQUANTITY

0.91+

one projectQUANTITY

0.91+

this morningDATE

0.91+

firstQUANTITY

0.9+

thousand timesQUANTITY

0.9+

2019DATE

0.89+

one answerQUANTITY

0.87+

AltheaPERSON

0.86+

ChallengerORGANIZATION

0.85+

one banking appQUANTITY

0.84+

MIT Chief Data Officer andEVENT

0.83+

three different continentsQUANTITY

0.82+

few years agoDATE

0.81+

this morningDATE

0.8+

single versionQUANTITY

0.78+

number twoQUANTITY

0.76+

Information Quality Symposium 2019EVENT

0.75+

HarvardORGANIZATION

0.73+

payTITLE

0.72+

sunny FloridaLOCATION

0.7+

Michael Conlin, US Department of Defense | MIT CDOIQ 2019


 

(upbeat music) >> From Cambridge, Massachusetts, it's the CUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (upbeat music) >> Welcome back to MIT in Cambridge Massachusetts everybody you're watching the CUBE the leader in live tech coverage. We go out to the events and extract the signal from the noise we hear at the MIT CDOIQ. It's the MIT Chief Data Officer event the 13th annual event. The CUBE started covering this show in 2013. I'm Dave Vellante with Paul Gillin, my co-host, and Michael Conlin is here as the chief data officer of the Department of Defense, Michael welcome, thank you for coming on. >> Thank you, it's a pleasure to be here. >> So the DoD is, I think it's the largest organization in the world, what does the chief data officer of the DoD do on a day to day basis? >> A range of things because we have a range of challenges at the Department of Defense. We are the single largest organization on the planet. We have the greatest scope and scale and complexity. We have the most dangerous competitors of anybody on the planet, it's not a trivial issue for us. So, I've a range of challenges. Challenges around, how do I lift the overall performance of the department using data effectively? How do I help executives make better decisions faster, using more recent, more common data? More common enterprise data is the expression we use. How do I help them become more sophisticated consumers of data and especially data analytics? And, how do we get to the point where, I can compare performance over here with performance over there, on a common basis? And compared to commercial benchmark? Which is now an expectation for us, and ask are we doing this as well as we should, right across the patch? Knowing, that all that data comes from multiple different places to start with. So we have to overcome all those differences and provide that department wide view. That's the essence of the role. And now with the recent passage of the Foundations for Evidenced-Based Policymaking Act, there are a number of additional expectations that go on top of that, but this is ultimately about improving affordability and performance of the department. >> So overall performance of the organization... >> Overall performance. >> ...as well, and maybe that comes from supporting various initiatives, and making sure you're driving performance on that basis as well. >> It does, but our litmus test is are we enabling the National Defense Strategy to succeed? Only reason to touch data is to enable the National Defense Strategy to be more successful than without it. And so we're always measuring ourselves against that. But it is, can we objectively say we're performing better? Can we objectively say that we are more affordable? In terms of the way we support the National Defense Strategy. >> I'm curious about your motivations for taking on this assignment because your background, as I see, is primarily in the private sector. A year ago you joined the US Department of Defense. A huge set of issues that you're tackling now, why'd you do it? >> So I am a capitalist, like most Americans, and I'm a serial entrepreneur. This was my first opportunity to serve government. And when I looked at it, knowing that I could directly support national defense, knowing that I could make a direct meaningful contribution, let me exercise that spirit of patriotism that many of us have, but we just not found ourselves an opportunity. When this opportunity came along I just couldn't say no to it. There's so much to be done and so much appetite for improvement that I just couldn't walk away for this. Now I've to tell you, when you start you take an oath of office to protect and defend the constitution. I don't know, it's maybe a paragraph or maybe it's two paragraphs. It felt like it took an hour to choke it out, because I was suddenly struck with all of this emotion. >> The gravity of what you were doing. >> Yeah, the gravity of what I'm doing. And that was just a reinforcement of the choice I'd already made, obviously right. But the chance to be the first chief data officer of the entire Department of Defense, just an enormous privilege. The chance to bring commercial sector best practices in and really lift the game of the department, again enormous privilege. There's so many people who could do this, probably better than me. The fact that I got the opportunity I just couldn't say no. Just too important, to many places I could see that we could make things better. I think anybody with a patriotic bone in their body would of jumped at the opportunity. >> That's awesome, I love that congratulations on getting that role and seemingly thrive in it. A big part of preserving that capitalist belief, defending the constitution and the American way, it sounds corny, but... >> It's real. >> I'm a patriot as well, is security. And security and data are intertwined. And just the whole future of warfare is dramatically changing. Can you talk about in a format like this, security, you're thinking on that, the department's thinking on that from a CDO's perspective? >> So as you know we have a number of swimlanes within the department and security is very clear swimlane, it's aligned under our chief information officer, but security is everybody's responsibility, of course. Now the longstanding criticism of security people is that they think they best way to secure anything is to permit nobody to touch it. The clear expectation for me as chief data officer is to make sure that information is shared to the right people as rapidly as possible. And, that's a different philosophy. Now I'm really lucky. Lieutenant General Denis Crall our principal cyber advisor, Dana Deasy our CIO, these people understand how important it is to get information in the right place at the right time, make it rapidly available and secure it every step along the way. We embrace the zero trust mantra. And because we embrace the zero trust mantra we're directly concerned with defending the data itself. And as long as we defend the data and the same mechanisms are the mechanisms we use to let people share it, suddenly the tension goes away. Suddenly we all have the same goal. Because the goal is not to prevent use of data, it's to enable use of data in a secure way. So the traditional tension that might be in that place doesn't exist in the department. Very productive, very professional level of collaboration with those folks in this space. Very sophisticated people. >> When we were talking before we went live you mentioned that the DoD has 10,000 plus operational systems... >> That's correct. >> A portfolio of that magnitude just overwhelming, I mean how did you know what to do first when you moved into this job, or did you have a clear mandate when you were hired? >> So I did have a clear mandate when I was hired and luckily that was spelled out. We knew what to do first because we sat down with actual leaders of the department and asked them what their goals were for improving the performance of the department. And everything starts from that conversation. You find those executives that what to improve performance, you understand what those goals are, and what data they need to manage that improvement. And you capture all the critical business questions they need answers to. From that point on they're bought in to everything that happens, right. Because they want those answers to those critical business questions. They have performance targets of their own, this is now aligned with. And so you have the support you need to go down the rest of the path of finding the data, standardizing it, et cetera. In order to deliver the answers to those questions. But it all starts which either the business mission leaders or the warfighting mission leaders who define the steps they're taking to implement the National Defense Strategy. Everything gets lined up against that, you get instant support and you know you're going after the right thing. This is not, an if you build it they will come. This is not, a driftnet the organization try to gather up all the data. This is spear fishing for specific answers to materially important questions, and everything we do is done on that basis. >> We hear Mark Ramsey this morning talk about the... He showed a picture of stove pipes and then he complicated that picture by showing multiple copies within each of those stove pipes, and says this is organizations that we've all lived in. >> That's my organization too. >> So talk about some of those data challenges at the DoD and how you're addressing those, specifically how you're enabling soldiers in the field to get the right data to the field when they need it. >> So what we'll be delicate when we talk about what we do for soldiers in the field. >> Understood, yeah. >> That tends to be sensitive. >> Understand why, sure. >> But all of those dynamics that Mark described in that presentation are present in every large cooperation I've ever served. And that includes the Department of Defense. That heterogeneity and sprawl of IT that what I would refer to, he showed us a hair ball of IT. Every large organization has a hair ball of IT. And data scattered all over the place. We took many of the same steps that he described in terms of organizing and presenting meaningful answers to questions, in almost exactly the same sequence. The challenge as you heard me use the statistics that our CIO's published digital monetization strategies, which calls out that we have roughly 10,000 operational systems. Well, every one of them is different. Every one's put in place by a different group of people at a different time, with a different set of requirements, and a different budget, and a different focus. You know organizational scope. We're just like he showed. We're trying to blend all that in to a common view. So we have to find what's the real authoritative piece of data, cause it's not all of those systems. It's only a subset of those systems. And you have to do all of the mapping and translations, to make the result add up. Otherwise you double count or you miss something. This is work in progress. This will always be a work in progress to any large organization. So I don't want to give you impression it's all sorted. Definitely not all sorted. But, the reality is we're trying to get to the point where people can see the data that's available and that's a requirement by the way under the Foundations Act that we have a data catalog, an authoritative data catalog so people can see it and they have the ability to then request access to that through automation. This is what's critical, you need to be able to request access and have it arbitraged on the basis of whether you should directly have access based on your role, your workflow, et cetera, but it should happen in real time. You don't want to wait weeks, or months, or however long for some paperwork to move around. So this all has to become highly automated. So, what's the data, who can access it under what policy, for what purpose? Our roles and responsibilities? Identity management? All this is a combined set of solutions that we have to put in place. I'm mostly worried about a subset of that. My colleagues in these other swimlanes are working to do the rest. Most people in the department have access to data they need in their space. That hasn't been a problem. The problem is you go from space to space, you have to learn a new set of systems and a new set of techniques for a new set of data formats which means you have to be retrained. That really limits our freedom of maneuver of human beings. In the ideal world you'd be able to move from any job in any part of the department to the same job in another part of the department with no retraining whatsoever. You'd be instantly able to make a contribution. That's what we're trying to get to. So that's a different kind of a challenge, right. How do we get that level of consistency in the user experience, a modern user experience. So that if I'm a real estate manager, or I'm a medical business manager, or I'm a clinical professional, or I'm whatever, I can go from this location in this part of the department to that location in that part and my experience is the same. It's completely modern, and it's completely consistent. No retraining. >> How much of that challenge pie is people, process and technology? How would you split that opportunity? >> Well everything starts for a process perspective. Because if you automate a bad process, you just make more mistakes in less time at greater costs. Obviously that's not the ideal. But the biggest single challenge is people. It's talent, it's culture. Both on the demand side and on the supply side. If fact a lot of what I talked about in my remarks, was the additional changes we need to put in place to bring people into a more modern approach to data, more modern consumption. And look, we have pockets of excellence. And they can hold their own against any team, any place on the planet. But they are pockets of excellence. And what we're trying to do is raise the entire organization's performance. So it's people, people, and people and then the other stuff. But the products, don't care about (laughs). >> We often here about... >> They're going to change in 12 to 18 months. I'm a technologist, I'm hands on. The products are going to change rapidly, I make no emotional commitment to products. But the people that's a different story. >> Well we know that in the commercial world we often hear that cultural resistance is what sabotages modernization efforts. The DoD is sort of the ultimate top-down organization. It is any easier to get buy-in because the culture is sort of command and control oriented? >> It's hard in the DoD, it's not easier in the DoD. Ultimately people respond to their performance incentives. That's the dirty secrets performance incentives, they work every time. So unless you restructure performance measures and incentives for people their behavior's never going to change. They need to see their personal future in the future you're prescribing. And if they don't see it, you're going to get resistance every time. They're going to do what they believe they're incented to do. Making those changes, cascading those performance measures down, has been difficult because much of the decision-making processes in the department have been based on slow-moving systems and slow-moving data. I mean think about it, our budget planning process was created by Robert McNamara, as the Secretary of Defense. It requires you to plan everything for five years. And it takes more than a year to plan a single year's worth of activities, it's slow-moving. And we have regulation, we have legislation, we're a law-abiding organization, we do what we have to do. All of those things slow things down. And there's a culture of expecting macro-level consensus building. Which means everybody feels they can say no. If everybody can say no, then change becomes peanut butter spread across an organization. When you peanut butter spread across something our size and scale, the layer's pretty thin. So we have the same problem that other organizations have. There is clearly a perception of top-down change and if the Secretary or the Deputy Secretary issue an instruction people will obey it. It just takes some time to work it's way down into all the detailed combinations and permutations. Cause you have to make sophisticated decisions now. How am I going to change for my performance measures for that group to that group? And that takes time and energy and thought. There's a natural sort of pipeline effect in this. So there's real tension I think in between this perception of top-down and people will obey the orders their given. But when you're trying to integrate those changes into a board set of policy and process and people, that takes time and energy. >> And as a result the leaders have to be circumspect about the orders they give because they want to see success. They want to make sure that what they say is actually implemented or it reflects poorly on the organization. >> I think that out leaders are absolutely concerned about accomplishing the outcomes that they set out. And I think that they are rightfully determined to get the change as rapidly as possible. I would not expect them to be circumspect. I would anticipate that they would be firm and clear in the direction that they set and they would set aggressive targets because you need aggressive targets to get aggressively changed outcomes. Now. >> But they would have to choose wisely, they can't just fire off orders and expect everything to be done. I would think that they got to really think about what they want to get done, and put all the wood behind the arrow as you... >> I think that they constantly balance all those considerations. I must say, I did not appreciate before I joined the department the extraordinary caliber of leadership we enjoy. We have people with real insight and experience, and high intellectual horsepower making the decisions in the department. We've been blessed with the continuing stream of them at all of the senior ranks. These people could go anywhere, or do anything that they wanted in the economy and they've chosen to be in the department. And they bring enormous intellectual firepower to bear on challenges. >> Well you mentioned the motivation at the top of the segment, that's largely pretty powerful. >> Yeah, oh absolutely. >> I want to ask you, we have to break, but the organizational structure, you talked about the CIO, actually the responsibility for security within the CIO. >> Sure. >> To whom do you report. What's the organization look like? >> So I report to the Chief Management Officer of the Department of Defense. So if you think about the order of precedents, there's the Secretary of Defense, the Deputy Secretary of Defense and third in order is the Chief Management Officer. I report to the Chief Management Officer. >> As does the CIO, is that right? >> As does the CIO, as does the CIO. And actually this is quite typical in large organizations, that you don't have the CDO and the CIO in the same space because the concerns are very different. They have to collaborate but very different concerns. We used to see CDOs reporting to CIOs that's fallen dramatically in terms of the frequency you see that. Cause we now recognize that's just a failure mode. So you don't want to go down that path. The number one most common reporting relationship is actually to a CEO, the chief executive officer, of an organization. It's all about, what executive is driving performance for the organization? That's the person the CDO should report to. And I'm blessed in that I do find myself reporting to the executive driving organizational improvement. For me, that's a critical thing. That would make the difference between whether I could succeed or whether I'm doomed to fail. >> COO would be common too in a commercial organization. >> Yeah, in certain commercial organizations, it's a COO. It just depends on the nature of the business and their maturity with data. But if you're in the... If data's the business, CDO will report to the CEO. There are other organizations where it'll be the COO or CFO, it just depends on the nature of that business. And in our case I'm quite fortunate. >> Well Michael, thank you for, not only the coming to the CUBE but the service you're providing to the country, we really appreciate your insights and... >> It's a pleasure meeting you. >> It's a pleasure meeting you. All right, keep it right there everybody we'll be right back with our next guest. You're watching the CUBE live from MIT CDOIQ, be right back. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and Michael Conlin is here as the chief data officer More common enterprise data is the expression we use. and maybe that comes from supporting various initiatives, In terms of the way we support as I see, is primarily in the private sector. I just couldn't say no to it. But the chance to be the first chief data officer defending the constitution and the American way, And just the whole future of warfare Because the goal is not to prevent use of data, you mentioned that the DoD has 10,000 plus This is not, a driftnet the organization and says this is organizations that we've all lived in. enabling soldiers in the field to get the right data for soldiers in the field. in any part of the department to the same job Both on the demand side and on the supply side. But the people that's a different story. The DoD is sort of the ultimate top-down organization. and if the Secretary or the Deputy Secretary And as a result the leaders have to be circumspect about in the direction that they set and they would set behind the arrow as you... the extraordinary caliber of leadership we enjoy. of the segment, that's largely pretty powerful. but the organizational structure, you talked about the CIO, What's the organization look like? of the Department of Defense. dramatically in terms of the frequency you see that. It just depends on the nature of the business to the CUBE but the service you're providing to the country, It's a pleasure meeting you.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JimPERSON

0.99+

DavePERSON

0.99+

JohnPERSON

0.99+

JeffPERSON

0.99+

Paul GillinPERSON

0.99+

MicrosoftORGANIZATION

0.99+

DavidPERSON

0.99+

Lisa MartinPERSON

0.99+

PCCWORGANIZATION

0.99+

Dave VolantePERSON

0.99+

AmazonORGANIZATION

0.99+

Michelle DennedyPERSON

0.99+

Matthew RoszakPERSON

0.99+

Jeff FrickPERSON

0.99+

Rebecca KnightPERSON

0.99+

Mark RamseyPERSON

0.99+

GeorgePERSON

0.99+

Jeff SwainPERSON

0.99+

Andy KesslerPERSON

0.99+

EuropeLOCATION

0.99+

Matt RoszakPERSON

0.99+

Frank SlootmanPERSON

0.99+

John DonahoePERSON

0.99+

Dave VellantePERSON

0.99+

Dan CohenPERSON

0.99+

Michael BiltzPERSON

0.99+

Dave NicholsonPERSON

0.99+

Michael ConlinPERSON

0.99+

IBMORGANIZATION

0.99+

MeloPERSON

0.99+

John FurrierPERSON

0.99+

NVIDIAORGANIZATION

0.99+

Joe BrockmeierPERSON

0.99+

SamPERSON

0.99+

MattPERSON

0.99+

Jeff GarzikPERSON

0.99+

CiscoORGANIZATION

0.99+

Dave VellantePERSON

0.99+

JoePERSON

0.99+

George CanuckPERSON

0.99+

AWSORGANIZATION

0.99+

AppleORGANIZATION

0.99+

Rebecca NightPERSON

0.99+

BrianPERSON

0.99+

Dave ValantePERSON

0.99+

NUTANIXORGANIZATION

0.99+

NeilPERSON

0.99+

MichaelPERSON

0.99+

Mike NickersonPERSON

0.99+

Jeremy BurtonPERSON

0.99+

FredPERSON

0.99+

Robert McNamaraPERSON

0.99+

Doug BalogPERSON

0.99+

2013DATE

0.99+

Alistair WildmanPERSON

0.99+

KimberlyPERSON

0.99+

CaliforniaLOCATION

0.99+

Sam GroccotPERSON

0.99+

AlibabaORGANIZATION

0.99+

RebeccaPERSON

0.99+

twoQUANTITY

0.99+

Mark Ramsey, Ramsey International LLC | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts. It's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts, everybody. We're here at MIT, sweltering Cambridge, Massachusetts. You're watching theCUBE, the leader in live tech coverage, my name is Dave Vellante. I'm here with my co-host, Paul Gillin. Special coverage of the MITCDOIQ. The Chief Data Officer event, this is the 13th year of the event, we started seven years ago covering it, Mark Ramsey is here. He's the Chief Data and Analytics Officer Advisor at Ramsey International, LLC and former Chief Data Officer of GlaxoSmithKline. Big pharma, Mark, thanks for coming onto theCUBE. >> Thanks for having me. >> You're very welcome, fresh off the keynote. Fascinating keynote this evening, or this morning. Lot of interest here, tons of questions. And we have some as well, but let's start with your history in data. I sat down after 10 years, but I could have I could have stretched it to 20. I'll sit down with the young guns. But there was some folks in there with 30 plus year careers. How about you, what does your data journey look like? >> Well, my data journey, of course I was able to stand up for the whole time because I was in the front, but I actually started about 32, a little over 32 years ago and I was involved with building. What I always tell folks is that Data and Analytics has been a long journey, and the name has changed over the years, but we've been really trying to tackle the same problems of using data as a strategic asset. So when I started I was with an insurance and financial services company, building one of the first data warehouse environments in the insurance industry, and that was in the 87, 88 range, and then once I was able to deliver that, I ended up transitioning into being in consulting for IBM and basically spent 18 years with IBM in consulting and services. When I joined, the name had evolved from Data Warehousing to Business Intelligence and then over the years it was Master Data Management, Customer 360. Analytics and Optimization, Big Data. And then in 2013, I joined Samsung Mobile as their first Chief Data Officer. So, moving out of consulting, I really wanted to own the end-to-end delivery of advanced solutions in the Data Analytics space and so that made the transition to Samsung quite interesting, very much into consumer electronics, mobile phones, tablets and things of that nature, and then in 2015 I joined GSK as their first Chief Data Officer to deliver a Data Analytics solution. >> So you have long data history and Paul, Mark took us through. And you're right, Mark-o, it's a lot of the same narrative, same wine, new bottle but the technology's obviously changed. The opportunities are greater today. But you took us through Enterprise Data Warehouse which was ETL and then MAP and then Master Data Management which is kind of this mapping and abstraction layer, then an Enterprise Data Model, top-down. And then that all failed, so we turned to Governance which has been very very difficult and then you came up with another solution that we're going to dig into, but is it the same wine, new bottle from the industry? >> I think it has been over the last 20, 30 years, which is why I kind of did the experiment at the beginning of how long folks have been in the industry. I think that certainly, the technology has advanced, moving to reduction in the amount of schema that's required to move data so you can kind of move away from the map and move type of an approach of a data warehouse but it is tackling the same type of problems and like I said in the session it's a little bit like Einstein's phrase of doing the same thing over and over again and expecting a different answer is certainly the definition of insanity and what I really proposed at the session was let's come at this from a very different perspective. Let's actually use Data Analytics on the data to make it available for these purposes, and I do think I think it's a different wine now and so I think it's just now a matter of if folks can really take off and head that direction. >> What struck me about, you were ticking off some of the issues that have failed like Data Warehouses, I was surprised to hear you say Data Governance really hasn't worked because there's a lot of talk around that right now, but all of those are top-down initiatives, and what you did at GSK was really invert that model and go from the bottom up. What were some of the barriers that you had to face organizationally to get the cooperation of all these people in this different approach? >> Yeah, I think it's still key. It's not a complete bottoms up because then you do end up really just doing data for the sake of data, which is also something that's been tried and does not work. I think it has to be a balance and that's really striking that right balance of really tackling the data at full perspective but also making sure that you have very definitive use cases to deliver value for the organization and then striking the balance of how you do that and I think of the things that becomes a struggle is you're talking about very large breadth and any time you're covering multiple functions within a business it's getting the support of those different business functions and I think part of that is really around executive support and what that means, I did mention it in the session, that executive support to me is really stepping up and saying that the data across the organization is the organization's data. It isn't owned by a particular person or a particular scientist, and I think in a lot of organization, that gatekeeper mentality really does put barriers up to really tackling the full breadth of the data. >> So I had a question around digital initiatives. Everywhere you go, every C-level Executive is trying to get digital right, and a lot of this is top-down, a lot of it is big ideas and it's kind of the North Star. Do you think that that's the wrong approach? That maybe there should be a more tactical line of business alignment with that threaded leader as opposed to this big picture. We're going to change and transform our company, what are your thoughts? >> I think one of the struggles is just I'm not sure that organizations really have a good appreciation of what they mean when they talk about digital transformation. I think there's in most of the industries it is an initiative that's getting a lot of press within the organizations and folks want to go through digital transformation but in some cases that means having a more interactive experience with consumers and it's maybe through sensors or different ways to capture data but if they haven't solved the data problem it just becomes another source of data that we're going to mismanage and so I do think there's a risk that we're going to see the same outcome from digital that we have when folks have tried other approaches to integrate information, and if you don't solve the basic blocking and tackling having data that has higher velocity and more granularity, if you're not able to solve that because you haven't tackled the bigger problem, I'm not sure it's going to have the impact that folks really expect. >> You mentioned that at GSK you collected 15 petabytes of data of which only one petabyte was structured. So you had to make sense of all that unstructured data. What did you learn about that process? About how to unlock value from unstructured data as a result of that? >> Yeah, and I think this is something. I think it's extremely important in the unstructured data to apply advanced analytics against the data to go through a process of making sense of that information and a lot of folks talk about or have talked about historically around text mining of trying to extract an entity out of unstructured data and using that for the value. There's a few steps before you even get to that point, and first of all it's classifying the information to understand which documents do you care about and which documents do you not care about and I always use the story that in this vast amount of documents there's going to be, somebody has probably uploaded the cafeteria menu from 10 years ago. That has no scientific value, whereas a protocol document for a clinical trial has significant value, you don't want to look through manually a billion documents to separate those, so you have to apply the technology even in that first step of classification, and then there's a number of steps that ultimately lead you to understanding the relationship of the knowledge that's in the documents. >> Side question on that, so you had discussed okay, if it's a menu, get rid of it but there's certain restrictions where you got to keep data for decades. It struck me, what about work in process? Especially in the pharmaceutical industry. I mean, post Federal Rules of Civil Procedure was everybody looking for a smoking gun. So, how are organizations dealing with what to keep and what to get rid of? >> Yeah, and I think certainly the thinking has been to remove the excess and it's to your point, how do you draw the line as to what is excess, right, so you don't want to just keep every document because then if an organization is involved in any type of litigation and there's disclosure requirements, you don't want to have to have thousands of documents. At the same time, there are requirements and so it's like a lot of things. It's figuring out how do you abide by the requirements, but that is not an easy thing to do, and it really is another driver, certainly document retention has been a big thing over a number of years but I think people have not applied advanced analytics to the level that they can to really help support that. >> Another Einstein bro-mahd, you know. Keep everything you must but no more. So, you put forth a proposal where you basically had this sort of three approaches, well, combined three approaches. The crawlers to go, the spiders to go out and do the discovery and I presume that's where the classification is done? >> That's really the identification of all of the source information >> Okay, so find out what you got, okay. >> so that's kind of the start. Find out what you have. >> Step two is the data repository. Putting that in, I thought it was when I heard you I said okay it must be a logical data repository, but you said you basically told the CIO we're copying all the data and putting it into essentially one place. >> A physical location, yes. >> Okay, and then so I got another question about that and then use bots in the pipeline to move the data and then you sort of drew the diagram of the back end to all the databases. Unstructured, structured, and then all the fun stuff up front, visualization. >> Which people love to focus on the fun stuff, right? Especially, you can't tell how many articles are on you got to apply deep learning and machine learning and that's where the answers are, we have to have the data and that's the piece that people are missing. >> So, my question there is you had this tactical mindset, it seems like you picked a good workload, the clinical trials and you had at least conceptually a good chance of success. Is that a fair statement? >> Well, the clinical trials was one aspect. Again, we tackled the entire data landscape. So it was all of the data across all of R&D. It wasn't limited to just, that's that top down and bottom up, so the bottom up is tackle everything in the landscape. The top down is what's important to the organization for decision making. >> So, that's actually the entire R&D application portfolio. >> Both internal and external. >> So my follow up question there is so that largely was kind of an inside the four walls of GSK, workload or not necessarily. My question was what about, you hear about these emerging Edge applications, and that's got to be a nightmare for what you described. In other words, putting all the data into one physical place, so it must be like a snake swallowing a basketball. Thoughts on that? >> I think some of it really does depend on you're always going to have these, IOT is another example where it's a large amount of streaming information, and so I'm not proposing that all data in every format in every location needs to be centralized and homogenized, I think you have to add some intelligence on top of that but certainly from an edge perspective or an IOT perspective or sensors. The data that you want to then make decisions around, so you're probably going to have a filter level that will impact those things coming in, then you filter it down to where you're going to really want to make decisions on that and then that comes together with the other-- >> So it's a prioritization exercise, and that presumably can be automated. >> Right, but I think we always have these cases where we can say well what about this case, and you know I guess what I'm saying is I've not seen organizations tackle their own data landscape challenges and really do it in an aggressive way to get value out of the data that's within their four walls. It's always like I mentioned in the keynote. It's always let's do a very small proof of concept, let's take a very narrow chunk. And what ultimately ends up happening is that becomes the only solution they build and then they go to another area and they build another solution and that's why we end up with 15 or 25-- (all talk over each other) >> The conventional wisdom is you start small. >> And fail. >> And you go on from there, you fail and that's now how you get big things done. >> Well that's not how you support analytic algorithms like machine learning and deep learning. You can't feed those just fragmented data of one aspect of your business and expect it to learn intelligent things to then make recommendations, you've got to have a much broader perspective. >> I want to ask you about one statistic you shared. You found 26 thousand relational database schemas for capturing experimental data and you standardized those into one. How? >> Yeah, I mean we took advantage of the Tamr technology that Michael Stonebraker created here at MIT a number of years ago which is really, again, it's applying advanced analytics to the data and using the content of the data and the characteristics of the data to go from dispersed schemas into a unified schema. So if you look across 26 thousand schemas using machine learning, you then can understand what's the consolidated view that gives you one perspective across all of those different schemas, 'cause ultimately when you give people flexibility they love to take advantage of it but it doesn't mean that they're actually doing things in an extremely different way, 'cause ultimately they're capturing the same kind of data. They're just calling things different names and they might be using different formats but in that particular case we use Tamr very heavily, and that again is back to my example of using advanced analytics on the data to make it available to do the fun stuff. The visualization and the advanced analytics. >> So Mark, the last question is you well know that the CDO role emerged in these highly regulated industries and I guess in the case of pharma quasi-regulated industries but now it seems to be permeating all industries. We have Goka-lan from McDonald's and virtually every industry is at least thinking about this role or has some kind of de facto CDO, so if you were slotted in to a CDO role, let's make it generic. I know it depends on the industry but where do you start as a CDO for an organization large company that doesn't have a CDO. Even a mid-sized organization, where do you start? >> Yeah, I mean my approach is that a true CDO is maximizing the strategic value of data within the organization. It isn't a regulatory requirement. I know a lot of the banks started there 'cause they needed someone to be responsible for data quality and data privacy but for me the most critical thing is understanding the strategic objectives of the organization and how will data be used differently in the future to drive decisions and actions and the effectiveness of the business. In some cases, there was a lot of discussion around monetizing the value of data. People immediately took that to can we sell our data and make money as a different revenue stream, I'm not a proponent of that. It's internally monetizing your data. How do you triple the size of the business by using data as a strategic advantage and how do you change the executives so what is good enough today is not good enough tomorrow because they are really focused on using data as their decision making tool, and that to me is the difference that a CDO needs to make is really using data to drive those strategic decision points. >> And that nuance you mentioned I think is really important. Inderpal Bhandari, who is the Chief Data Officer of IBM often says how can you monetize the data and you're right, I don't think he means selling data, it's how does data contribute, if I could rephrase what you said, contribute to the value of the organization, that can be cutting costs, that can be driving new revenue streams, that could be saving lives if you're a hospital, improving productivity. >> Yeah, and I think what I've shared typically shared with executives when I've been in the CDO role is that they need to change their behavior, right? If a CDO comes in to an organization and a year later, the executives are still making decisions on the same data PowerPoints with spinning logos and they said ooh, we've got to have 'em. If they're still making decisions that way then the CDO has not been successful. The executives have to change what their level of expectation is in order to make a decision. >> Change agents, top down, bottom up, last question. >> Going back to GSK, now that they've completed this massive data consolidation project how are things different for that business? >> Yeah, I mean you look how Barron joined as the President of R&D about a year and a half ago and his primary focus is using data and analytics and machine learning to drive the decision making in the discovery of a new medicine and the environment that has been created is a key component to that strategic initiative and so they are actually completely changing the way they're selecting new targets for new medicines based on data and analytics. >> Mark, thanks so much for coming on theCUBE. >> Thanks for having me. >> Great keynote this morning, you're welcome. All right, keep it right there everybody. We'll be back with our next guest. This is theCUBE, Dave Vellante with Paul Gillin. Be right back from MIT. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Special coverage of the MITCDOIQ. I could have stretched it to 20. and so that made the transition to Samsung and then you came up with another solution on the data to make it available some of the issues that have failed striking the balance of how you do that and it's kind of the North Star. the bigger problem, I'm not sure it's going to You mentioned that at GSK you against the data to go through a process of Especially in the pharmaceutical industry. as to what is excess, right, so you and do the discovery and I presume Okay, so find out what you so that's kind of the start. all the data and putting it into essentially one place. and then you sort of drew the diagram of and that's the piece that people are missing. So, my question there is you had this Well, the clinical trials was one aspect. My question was what about, you hear about these and homogenized, I think you have to exercise, and that presumably can be automated. and then they go to another area and that's now how you get big things done. Well that's not how you support analytic and you standardized those into one. on the data to make it available to do the fun stuff. and I guess in the case of pharma the difference that a CDO needs to make is of the organization, that can be Yeah, and I think what I've shared and the environment that has been created This is theCUBE, Dave Vellante with Paul Gillin.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

IBMORGANIZATION

0.99+

Paul GillinPERSON

0.99+

MarkPERSON

0.99+

Mark RamseyPERSON

0.99+

15 petabytesQUANTITY

0.99+

SamsungORGANIZATION

0.99+

Inderpal BhandariPERSON

0.99+

Michael StonebrakerPERSON

0.99+

2013DATE

0.99+

PaulPERSON

0.99+

GlaxoSmithKlineORGANIZATION

0.99+

BarronPERSON

0.99+

Ramsey International, LLCORGANIZATION

0.99+

26 thousand schemasQUANTITY

0.99+

GSKORGANIZATION

0.99+

18 yearsQUANTITY

0.99+

2015DATE

0.99+

thousandsQUANTITY

0.99+

EinsteinPERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

tomorrowDATE

0.99+

Samsung MobileORGANIZATION

0.99+

26 thousandQUANTITY

0.99+

Ramsey International LLCORGANIZATION

0.99+

30 plus yearQUANTITY

0.99+

a year laterDATE

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

Federal Rules of Civil ProcedureTITLE

0.99+

20QUANTITY

0.99+

25QUANTITY

0.99+

BothQUANTITY

0.99+

first stepQUANTITY

0.99+

one petabyteQUANTITY

0.98+

todayDATE

0.98+

15QUANTITY

0.98+

oneQUANTITY

0.98+

three approachesQUANTITY

0.98+

13th yearQUANTITY

0.98+

one aspectQUANTITY

0.97+

MITORGANIZATION

0.97+

seven years agoDATE

0.97+

McDonald'sORGANIZATION

0.96+

MIT Chief Data Officer andEVENT

0.95+

R&DORGANIZATION

0.95+

10 years agoDATE

0.95+

this morningDATE

0.94+

this eveningDATE

0.93+

one placeQUANTITY

0.93+

one perspectiveQUANTITY

0.92+

about a year and a half agoDATE

0.91+

over 32 years agoDATE

0.9+

a lot of talkQUANTITY

0.9+

a billion documentsQUANTITY

0.9+

CDOTITLE

0.89+

decadesQUANTITY

0.88+

one statisticQUANTITY

0.87+

2019DATE

0.85+

first dataQUANTITY

0.84+

of years agoDATE

0.83+

Step twoQUANTITY

0.8+

TamrOTHER

0.77+

Information Quality Symposium 2019EVENT

0.77+

PowerPointsTITLE

0.76+

documentsQUANTITY

0.75+

theCUBEORGANIZATION

0.75+

one physicalQUANTITY

0.73+

10 yearsQUANTITY

0.72+

87, 88 rangeQUANTITY

0.71+

PresidentPERSON

0.7+

Chief Data OfficerPERSON

0.7+

Enterprise Data WarehouseORGANIZATION

0.66+

Goka-lanORGANIZATION

0.66+

first Chief DataQUANTITY

0.63+

first Chief Data OfficerQUANTITY

0.63+

EdgeTITLE

0.63+

tonsQUANTITY

0.62+

Keynote Analysis | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's The Cube! Covering MIT Chief Data Officer and Information Qualities Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome to Cambridge, Massachusetts everybody. You're watching The Cube, the leader in live tech coverage. My name is Dave Vellante and I'm here with my cohost Paul Gillin. And we're covering the 13th annual MIT CDOIQ conference. The Cube first started here in 2013 when the whole industry Paul, this segment of the industry was kind of moving out of the ashes of the compliance world and the data quality world and kind of that back office role, and it had this tailwind of the so called big data movement behind it. And the Chief Data Officer was emerging very strongly within as we've talked about many times in theCube, within highly regulated industries like financial services and government and healthcare and now we're seeing data professionals from all industries join this symposium at MIT as I say 13th year, and we're now seeing a lot of discussion about not only the role of the Chief Data Officer, but some of what we heard this morning from Mark Ramsey some of the failures along the way of all these north star data initiatives, and kind of what to do about it. So this conference brings together several hundred practitioners and we're going to be here for two days just unpacking all the discussions the major trends that touch on data. The data revolution, whether it's digital transformation, privacy, security, blockchain and the like. Now Paul, you've been involved in this conference for a number of years, and you've seen it evolve. You've seen that chief data officer role both emerge from the back office into a c-level executive role, and now spanning a very wide scope of responsibilities. Your thoughts? >> It's been like being part of a soap opera for the last eight years that I've been part of this conference because as you said Dave, we've gone through all of these transitions. In the early days this conference actually started as an information qualities symposium. It has evolved to become about chief data officer and really about the data as an asset to the organization. And I thought that the presentation we saw this morning, Mark Ramsey's talk, we're going to have him on later, very interesting about what they did at GlaxoSmithKline to get their arms around all of the data within that organization. Now a project like that would've unthinkable five years ago, but we've seen all of these new technologies come on board, essentially they've created a massive search engine for all of their data. We're seeing organizations beginning to get their arms around this massive problem. And along the way I say it's a soap opera because along the way we've seen failure after failure, we heard from Mark this morning that data governance is a failure too. That was news to me! All of these promising initiatives that have started and fallen flat or failed to live up to their potential, the chief data officer role has emerged out of that to finally try to get beyond these failures and really get their arms around that organizational data and it's a huge project, and it's something that we're beginning to see some organization succeed at. >> So let's talk a little bit about the role. So the chief data officer in many ways has taken a lot of the heat off the chief information officer, right? It used to be CIO stood for career is over. Well, when you throw all the data problems at an individual c-level executive, that really is a huge challenge. And so, with the cloud it's created opportunities for CIOs to actually unburden themselves of some of the crapplications and actually focus on some of the mission critical stuff that they've always been really strong at and focus their budgets there. But the chief data officer has had somewhat of an unclear scope. Different organizations have different roles and responsibilities. And there's overlap with the chief digital officer. There's a lot of emphasis on monetization whether that's increasing revenue or cutting costs. And as we heard today from the keynote speaker Mark Ramsey, a lot of the data initiatives have failed. So what's your take on that role and its viability and its longterm staying power? >> I think it's coming together. I think last year we saw the first evidence of that. I talked to a number of CDOs last year as well as some of the analysts who were at this conference, and there was pretty good clarity beginning to emerge about what they chief data officer role stood for. I think a lot of what has driven this is this digital transformation, the hot buzz word of 2019. The foundation of digital transformation is a data oriented culture. It's structuring the entire organization around data, and when you get to that point when an organization is ready to do that, then the role of the CDO I think becomes crystal clear. It's not so much just an extract transform load discipline. It's not just technology, it's not just governance. It really is getting that data, pulling that data together and putting it at the center of the organization. That's the value that the CDO can provide, I think organizations are coming around to that. >> Yeah and so we've seen over the last 10 years the decrease, the rapid decrease in cost, the cost of storage. Microprocessor performance we've talked about endlessly. And now you've got the machine intelligence piece layering in. In the early days Hadoop was the hot tech, and interesting now nobody talks even about Hadoop. Rarely. >> Yet it was discussed this morning. >> It was mentioned today. It is a fundamental component of infrastructures. >> Yeah. >> But what it did is it dramatically lowered the cost of storing data, and allowing people to leave data in place. The old adage of ship a five megabytes of code to a petabyte of data versus the reverse. Although we did hear today from Mark Ramsey that they copied all the data into a centralized location so I got some questions on that. But the point I want to make is that was really early days. We're now entered an era and it's underscored by if you look at the top five companies in terms of market cap in the US stock market, obviously Microsoft is now over a trillion. Microsoft, Apple, Amazon, Google and Facebook. Top five. They're data companies, their assets are all data driven. They've surpassed the banks, the energy companies, of course any manufacturing automobile companies, et cetera, et cetera. So they're data companies, and they're wrestling with big issues around security. You can't help but open the paper and see issues on security. Yesterday was the big Capital One. The Equifax issue was resolved in terms of the settlement this week, et cetera, et cetera. Facebook struggling mightily with whether or not how to deal fake news, how to deal with deep fakes. Recently it shut down likes for many Instagram accounts in some countries because they're trying to protect young people who are addicted to this. Well, they need to shut down likes for business accounts. So what kids are doing is they're moving over to the business Instagram accounts. Well when that happens, it exposes their emails automatically so they've all kinds of privacy landmines and people don't know how to deal with them. So this data explosion, while there's a lot of energy and excitement around it, brings together a lot of really sticky issues. And that falls right in the lap of the chief data officer, doesn't it? >> We're in uncharted territory and all of the examples you used are problems that we couldn't have foreseen, those companies couldn't have foreseen. A problem may be created but then the person who suffers from that problem changes their behavior and it creates new problems as you point out with kids shifting where they're going to communicate with each other. So these are all uncharted waters and I think it's got to be scary if you're a company that does have large amounts of consumer data in particular, consumer packaged goods companies for example, you're looking at what's happening to these big companies and these data breaches and you know that you're sitting on a lot of customer data yourself, and that's scary. So we may see some backlash to this from companies that were all bought in to the idea of the 360 degree customer view and having these robust data sources about each one of your customers. Turns out now that that's kind of a dangerous place to be. But to your point, these are data companies, the companies that business people look up to now, that they emulate, are companies that have data at their core. And that's not going to change, and that's certainly got to be good for the role of the CDO. >> I've often said that the enterprise data warehouse failed to live up to its expectations and its promises. And Sarbanes-Oxley basically saved EDW because reporting became a critical component post Enron. Mark Ramsey talked today about EDW failing, master data management failing as kind of a mapping and masking exercise. The enterprise data model which was a top down push for a sort of distraction layer, that failed. You had all these failures and so we turned to governance. That failed. And so you've had this series of issues. >> Let me just point out, what do all those have in common? They're all top down. >> Right. >> All top down initiatives. And what Glaxo did is turn that model on its head and left the data where it was. Went and discovered it and figured it out without actually messing with the data. That may be the difference that changes the game. >> Yeah and it's prescription was basically taking a tactical approach to that problem, start small, get quick hits. And then I think they selected a workload that was appropriate for solving this problem which was clinical trials. And I have some questions for him. And of the big things that struck me is the edge. So as you see a new emerging data coming out of the edge, how are organizations going to deal with that? Because I think a lot of what he was talking about was a lot of legacy on-prem systems and data. Think about JEDI, a story we've been following on SiliconANGLE the joint enterprise defense infrastructure. This is all about the DOD basically becoming cloud enabled. So getting data out into the field during wartime fast. We're talking about satellite data, you're talking about telemetry, analytics, AI data. A lot of distributed data at the edge bringing new challenges to how organizations are going to deal with data problems. It's a whole new realm of complexity. >> And you talk about security issues. When you have a lot of data at the edge and you're sending data to the edge, you're bringing it back in from the edge, every device in the middle is from the smart thermostat. at the edge all the way up to the cloud is a potential failure point, a potential vulnerability point. These are uncharted waters, right? We haven't had to do this on a large scale. Organizations like the DOD are going to be the ones that are going to be the leaders in figuring this out because they are so aggressive. They have such an aggressive infrastructure and place. >> The other question I had, striking question listening to Mark Ramsey this morning. Again Mark Ramsey was former data God at GSK, GlaxoSmithKline now a consultant. We're going to hear from a number of folks like him and chief data officers. But he basically kind of poopooed, he used the example of build it and they will come. You know the Kevin Costner movie Field of Dreams. Don't go after the field of dreams. So my question is, and I wonder if you can weigh in on this is, everywhere we go we hear about digital transformation. They have these big digital transformation projects, they generally are top down. Every CEO wants to get digital right. Is that the wrong approach? I want to ask Mark Ramsey that. Are they doing field of dreams type stuff? Is it going to be yet another failure of traditional legacy systems to try to compete with cloud native and born in data era companies? >> Well he mentioned this morning that the research is already showing that digital transformation most initiatives are failing. Largely because of cultural reasons not technical reasons, and I think Ramsey underscored that point this morning. It's interesting that he led off by mentioning business process reengineering which you remember was a big fad in the 1990s, companies threw billions of dollars at trying to reinvent themselves and most of them failed. Is digital transformation headed down the same path? I think so. And not because the technology isn't there, it's because creating a culture where you can break down these silos and you can get everyone oriented around a single view of the organizations data. The bigger the organization the less likely that is to happen. So what does that mean for the CDO? Well, chief information officer at one point we said the CIO stood for career is over. I wonder if there'll be a corresponding analogy for the CDOs at some of these big organizations when it becomes obvious that pulling all that data together is just not feasible. It sounds like they've done something remarkable at GSK, maybe we'll learn from that example. But not all organizations have the executive support, which was critical to what they did, or just the organizational will to organize themselves around that central data storm. >> And I also said before I think the CDO is taking a lot of heat off the CIO and again my inference was the GSK use case and workload was actually quite narrow in clinical trials and was well suited to success. So my takeaway in this, if I were CDO what I would be doing is trying to figure out okay how does data contribute to the monetization of my organization? Maybe not directly selling the data, but what data do I have that's valuable and how can I monetize that in terms of either saving money, supply chain, logistics, et cetera, et cetera, or making money? Some kind of new revenue opportunity. And I would super glue myself for the line of business executive and go after a small hit. You're talking about digital transformations being top down and largely failing. Shadow digital transformations is maybe the answer to that. Aligning with a line of business, focusing on a very narrow use case, and building successes up that way using data as the ingredient to drive value. >> And big ideas. I recently wrote about Experian which launched a service last called Boost that enables the consumers to actually impact their own credit scores by giving Experian access to their bank accounts to see that they are at better credit risk than maybe portrayed in the credit store. And something like 600,000 people signed up in the first six months of this service. That's an example I think of using inspiration, creating new ideas about how data can be applied And in the process by the way, Experian gains data that they can use in other context to better understand their consumer customers. >> So digital meets data. Data is not the new oil, data is more valuable than oil because you can use it multiple times. The same data can be put in your car or in your house. >> Wish we could do that with the oil. >> You can't do that with oil. So what does that mean? That means it creates more data, more complexity, more security risks, more privacy risks, more compliance complexity, but yet at the same time more opportunities. So we'll be breaking that down all day, Paul and myself. Two days of coverage here at MIT, hashtag MITCDOIQ. You're watching The Cube, we'll be right back right after this short break. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

and Information Qualities Symposium 2019. and the data quality world and really about the data as an asset to the organization. and actually focus on some of the mission critical stuff and putting it at the center of the organization. In the early days Hadoop was the hot tech, It is a fundamental component of infrastructures. And that falls right in the lap of and all of the examples you used I've often said that the enterprise data warehouse what do all those have in common? and left the data where it was. And of the big things that struck me is the edge. Organizations like the DOD are going to be the ones Is that the wrong approach? the less likely that is to happen. and how can I monetize that in terms of either saving money, that enables the consumers to actually Data is not the new oil, You can't do that with oil.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Mark RamseyPERSON

0.99+

MicrosoftORGANIZATION

0.99+

Dave VellantePERSON

0.99+

AmazonORGANIZATION

0.99+

PaulPERSON

0.99+

AppleORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

Paul GillinPERSON

0.99+

GoogleORGANIZATION

0.99+

2013DATE

0.99+

RamseyPERSON

0.99+

Kevin CostnerPERSON

0.99+

EnronORGANIZATION

0.99+

last yearDATE

0.99+

DODORGANIZATION

0.99+

ExperianORGANIZATION

0.99+

2019DATE

0.99+

GlaxoSmithKlineORGANIZATION

0.99+

DavePERSON

0.99+

GSKORGANIZATION

0.99+

GlaxoORGANIZATION

0.99+

Two daysQUANTITY

0.99+

five megabytesQUANTITY

0.99+

360 degreeQUANTITY

0.99+

two daysQUANTITY

0.99+

todayDATE

0.99+

Cambridge, MassachusettsLOCATION

0.99+

Field of DreamsTITLE

0.99+

billions of dollarsQUANTITY

0.99+

MarkPERSON

0.99+

EquifaxORGANIZATION

0.99+

YesterdayDATE

0.99+

over a trillionQUANTITY

0.99+

1990sDATE

0.98+

600,000 peopleQUANTITY

0.98+

USLOCATION

0.98+

this weekDATE

0.98+

SiliconANGLE MediaORGANIZATION

0.98+

first six monthsQUANTITY

0.98+

InstagramORGANIZATION

0.98+

The CubeTITLE

0.98+

five years agoDATE

0.97+

Capital OneORGANIZATION

0.96+

first evidenceQUANTITY

0.96+

bothQUANTITY

0.96+

firstQUANTITY

0.95+

MITORGANIZATION

0.93+

this morningDATE

0.91+

HadoopTITLE

0.88+

one pointQUANTITY

0.87+

13th yearQUANTITY

0.86+

MIT CDOIQ conferenceEVENT

0.84+

MITCDOIQTITLE

0.84+

each oneQUANTITY

0.82+

hundred practitionersQUANTITY

0.82+

EDWORGANIZATION

0.81+

last eight yearsDATE

0.81+

MIT Chief Data Officer andEVENT

0.81+

Sarbanes-OxleyPERSON

0.8+

top five companiesQUANTITY

0.78+

The CubeORGANIZATION

0.75+

Top fiveQUANTITY

0.74+

single viewQUANTITY

0.7+

last 10 yearsDATE

0.69+

BoostTITLE

0.68+

a petabyte of dataQUANTITY

0.65+

EDWTITLE

0.64+

SiliconANGLEORGANIZATION

0.64+

Influencer Panel | IBM CDO Summit 2019


 

>> Live from San Francisco, California, it's theCUBE covering the IBM Chief Data Officers Summit, brought to you by IBM. >> Welcome back to San Francisco everybody. I'm Dave Vellante and you're watching theCUBE, the leader in live tech coverage. This is the end of the day panel at the IBM Chief Data Officer Summit. This is the 10th CDO event that IBM has held and we love to to gather these panels. This is a data all-star panel and I've recruited Seth Dobrin who is the CDO of the analytics group at IBM. Seth, thank you for agreeing to chip in and be my co-host in this segment. >> Yeah, thanks Dave. Like I said before we started, I don't know if this is a promotion or a demotion. (Dave laughing) >> We'll let you know after the segment. So, the data all-star panel and the data all-star awards that you guys are giving out a little later in the event here, what's that all about? >> Yeah so this is our 10th CDU Summit. So two a year, so we've been doing this for 5 years. The data all-stars are those people that have been to four at least of the ten. And so these are five of the 16 people that got the award. And so thank you all for participating and I attended these like I said earlier, before I joined IBM they were immensely valuable to me and I was glad to see 16 other people that think it's valuable too. >> That is awesome. Thank you guys for coming on. So, here's the format. I'm going to introduce each of you individually and then ask you to talk about your role in your organization. What role you play, how you're using data, however you want to frame that. And the first question I want to ask is, what's a good day in the life of a data person? Or if you want to answer what's a bad day, that's fine too, you choose. So let's start with Lucia Mendoza-Ronquillo. Welcome, she's the Senior Vice President and the Head of BI and Data Governance at Wells Fargo. You told us that you work within the line of business group, right? So introduce your role and what's a good day for a data person? >> Okay, so my role basically is again business intelligence so I support what's called cards and retail services within Wells Fargo. And I also am responsible for data governance within the business. We roll up into what's called a data governance enterprise. So we comply with all the enterprise policies and my role is to make sure our line of business complies with data governance policies for enterprise. >> Okay, good day? What's a good day for you? >> A good day for me is really when I don't get a call that the regulators are knocking on our doors. (group laughs) Asking for additional reports or have questions on the data and so that would be a good day. >> Yeah, especially in your business. Okay, great. Parag Shrivastava is the Director of Data Architecture at McKesson, welcome. Thanks so much for coming on. So we got a healthcare, couple of healthcare examples here. But, Parag, introduce yourself, your role, and then what's a good day or if you want to choose a bad day, be fun the mix that up. >> Yeah, sounds good. Yeah, so mainly I'm responsible for the leader strategy and architecture at McKesson. What that means is McKesson has a lot of data around the pharmaceutical supply chain, around one-third of the world's pharmaceutical supply chain, clinical data, also around pharmacy automation data, and we want to leverage it for the better engagement of the patients and better engagement of our customers. And my team, which includes the data product owners, and data architects, we are all responsible for looking at the data holistically and creating the data foundation layer. So I lead the team across North America. So that's my current role. And going back to the question around what's a good day, I think I would say the good day, I'll start at the good day. Is really looking at when the data improves the business. And the first thing that comes to my mind is sort of like an example, of McKesson did an acquisition of an eight billion dollar pharmaceutical company in Europe and we were creating the synergy solution which was based around the analytics and data. And actually IBM was one of the partners in implementing that solution. When the solution got really implemented, I mean that was a big deal for me to see that all the effort that we did in plumbing the data, making sure doing some analytics, is really helping improve the business. I think that is really a good day I would say. I mean I wouldn't say a bad day is such, there are challenges, constant challenges, but I think one of the top priorities that we are having right now is to deal with the demand. As we look at the demand around the data, the role of data has got multiple facets to it now. For example, some of the very foundational, evidentiary, and compliance type of needs as you just talked about and then also profitability and the cost avoidance and those kind of aspects. So how to balance between that demand is the other aspect. >> All right good. And we'll get into a lot of that. So Carl Gold is the Chief Data Scientist at Zuora. Carl, tell us a little bit about Zuora. People might not be as familiar with how you guys do software for billing et cetera. Tell us about your role and what's a good day for a data scientist? >> Okay, sure, I'll start by a little bit about Zuora. Zuora is a subscription management platform. So any company who wants to offer a product or service as subscription and you don't want to build your billing and subscription management, revenue recognition, from scratch, you can use a product like ours. I say it lets anyone build a telco with a complicated plan, with tiers and stuff like that. I don't know if that's a good thing or not. You guys'll have to make up your own mind. My role is an interesting one. It's split, so I said I'm a chief data scientist and we work about 50% on product features based on data science. Things like churn prediction, or predictive payment retries are product areas where we offer AI-based solutions. And then but because Zuora is a subscription platform, we have an amazing set of data on the actual performance of companies using our product. So a really interesting part of my role has been leading what we call the subscription economy index and subscription economy benchmarks which are reports around best practices for subscription companies. And it's all based off this amazing dataset created from an anonymized data of our customers. So that's a really exciting part of my role. And for me, maybe this speaks to our level of data governance, I might be able to get some tips from some of my co-panelists, but for me a good day is when all the data for me and everyone on my team is where we left it the night before. And no schema changes, no data, you know records that you were depending on finding removed >> Pipeline failures. >> Yeah pipeline failures. And on a bad day is a schema change, some crucial data just went missing and someone on my team is like, "The code's broken." >> And everybody's stressed >> Yeah, so those are bad days. But, data governance issues maybe. >> Great, okay thank you. Jung Park is the COO of Latitude Food Allergy Care. Jung welcome. >> Yeah hi, thanks for having me and the rest of us here. So, I guess my role I like to put it as I'm really the support team. I'm part of the support team really for the medical practice so, Latitude Food Allergy Care is a specialty practice that treats patients with food allergies. So, I don't know if any of you guys have food allergies or maybe have friends, kids, who have food allergies, but, food allergies unfortunately have become a lot more prevalent. And what we've been able to do is take research and data really from clinical trials and other research institutions and really use that from the clinical trial setting, back to the clinical care model so that we can now treat patients who have food allergies by using a process called oral immunotherapy. It's fascinating and this is really personal to me because my son as food allergies and he's been to the ER four times. >> Wow. >> And one of the scariest events was when he went to an ER out of the country and as a parent, you know you prepare your child right? With the food, he takes the food. He was 13 years old and you had the chaperones, everyone all set up, but you get this call because accidentally he ate some peanut, right. And so I saw this unfold and it scared me so much that this is something I believe we just have to get people treated. So this process allows people to really eat a little bit of the food at a time and then you eat the food at the clinic and then you go home and eat it. Then you come back two weeks later and then you eat a little bit more until your body desensitizes. >> So you build up that immunity >> Exactly. >> and then you watch the data obviously. >> Yeah. So what's a good day for me? When our patients are done for the day and they have a smile on their face because they were able to progress to that next level. >> Now do you have a chief data officer or are you the de facto CFO? >> I'm the de facto. So, my career has been pretty varied. So I've been essentially chief data officer, CIO, at companies small and big. And what's unique about I guess in this role is that I'm able to really think about the data holistically through every component of the practice. So I like to think of it as a patient journey and I'm sure you guys all think of it similarly when you talk about your customers, but from a patient's perspective, before they even come in, you have to make sure the data behind the science of whatever you're treating is proper, right? Once that's there, then you have to have the acquisition part. How do you actually work with the community to make sure people are aware of really the services that you're providing? And when they're with you, how do you engage them? How do you make sure that they are compliant with the process? So in healthcare especially, oftentimes patients don't actually succeed all the way through because they don't continue all the way through. So it's that compliance. And then finally, it's really long-term care. And when you get the long-term care, you know that the patient that you've treated is able to really continue on six months, a year from now, and be able to eat the food. >> Great, thank you for that description. Awesome mission. Rolland Ho is the Vice President of Data and Analytics at Clover Health. Tell us a little bit about Clover Health and then your role. >> Yeah, sure. So Clover is a startup Medicare Advantage plan. So we provide Medicare, private Medicare to seniors. And what we do is we're because of the way we run our health plan, we're able to really lower a lot of the copay costs and protect seniors against out of pocket. If you're on regular Medicare, you get cancer, you have some horrible accident, your out of pocket is infinite potentially. Whereas with Medicare Advantage Plan it's limited to like five, $6,000 and you're always protected. One of the things I'm excited about being at Clover is our ability to really look at how can we bring the value of data analytics to healthcare? Something I've been in this industry for close to 20 years at this point and there's a lot of waste in healthcare. And there's also a lot of very poor application of preventive measures to the right populations. So one of the things that I'm excited about is that with today's models, if you're able to better identify with precision, the right patients to intervene with, then you fundamentally transform the economics of what can be done. Like if you had to pa $1,000 to intervene, but you were only 20% of the chance right, that's very expensive for each success. But, now if your model is 60, 70% right, then now it opens up a whole new world of what you can do. And that's what excites me. In terms of my best day? I'll give you two different angles. One as an MBA, one of my best days was, client calls me up, says, "Hey Rolland, you know, "your analytics brought us over $100 million "in new revenue last year." and I was like, cha-ching! Excellent! >> Which is my half? >> Yeah right. And then on the data geek side the best day was really, run a model, you train a model, you get ridiculous AUC score, so area under the curve, and then you expect that to just disintegrate as you go into validation testing and actual live production. But the 98 AUC score held up through production. And it's like holy cow, the model actually works! And literally we could cut out half of the workload because of how good that model was. >> Great, excellent, thank you. Seth, anything you'd add to the good day, bad day, as a CDO? >> So for me, well as a CDO or as CDO at IBM? 'Cause at IBM I spend most of my time traveling. So a good day is a day I'm home. >> Yeah, when you're not in an (group laughing) aluminum tube. >> Yeah. Hurdling through space (laughs). No, but a good day is when a GDPR compliance just happened, a good day for me was May 20th of last year when IBM was done and we were, or as done as we needed to be for GDPR so that was a good day for me last year. This year is really a good day is when we start implementing some new models to help IBM become a more effective company and increase our bottom line or increase our margins. >> Great, all right so I got a lot of questions as you know and so I want to give you a chance to jump in. >> All right. >> But, I can get it started or have you got something? >> I'll go ahead and get started. So this is a the 10th CDO Summit. So five years. I know personally I've had three jobs at two different companies. So over the course of the last five years, how many jobs, how many companies? Lucia? >> One job with one company. >> Oh my gosh you're boring. (group laughing) >> No, but actually, because I support basically the head of the business, we go into various areas. So, we're not just from an analytics perspective and business intelligence perspective and of course data governance, right? It's been a real journey. I mean there's a lot of work to be done. A lot of work has been accomplished and constantly improving the business, which is the first goal, right? Increasing market share through insights and business intelligence, tracking product performance to really helping us respond to regulators (laughs). So it's a variety of areas I've had to be involved in. >> So one company, 50 jobs. >> Exactly. So right now I wear different hats depending on the day. So that's really what's happening. >> So it's a good question, have you guys been jumping around? Sure, I mean I think of same company, one company, but two jobs. And I think those two jobs have two different layers. When I started at McKesson I was a solution leader or solution director for business intelligence and I think that's how I started. And over the five years I've seen the complete shift towards machine learning and my new role is actually focused around machine learning and AI. That's why we created this layer, so our own data product owners who understand the data science side of things and the ongoing and business architecture. So, same company but has seen a very different shift of data over the last five years. >> Anybody else? >> Sure, I'll say two companies. I'm going on four years at Zuora. I was at a different company for a year before that, although it was kind of the same job, first at the first company, and then at Zuora I was really focused on subscriber analytics and churn for my first couple a years. And then actually I kind of got a new job at Zuora by becoming the subscription economy expert. I become like an economist, even though I don't honestly have a background. My PhD's in biology, but now I'm a subscription economy guru. And a book author, I'm writing a book about my experiences in the area. >> Awesome. That's great. >> All right, I'll give a bit of a riddle. Four, how do you have four jobs, five companies? >> In five years. >> In five years. (group laughing) >> Through a series of acquisition, acquisition, acquisition, acquisition. Exactly, so yeah, I have to really, really count on that one (laughs). >> I've been with three companies over the past five years and I would say I've had seven jobs. But what's interesting is I think it kind of mirrors and kind of mimics what's been going on in the data world. So I started my career in data analytics and business intelligence. But then along with that I had the fortune to work with the IT team. So the IT came under me. And then after that, the opportunity came about in which I was presented to work with compliance. So I became a compliance officer. So in healthcare, it's very interesting because these things are tied together. When you look about the data, and then the IT, and then the regulations as it relates to healthcare, you have to have the proper compliance, both internal compliance, as well as external regulatory compliance. And then from there I became CIO and then ultimately the chief operating officer. But what's interesting is as I go through this it's all still the same common themes. It's how do you use the data? And if anything it just gets to a level in which you become closer with the business and that is the most important part. If you stand alone as a data scientist, or a data analyst, or the data officer, and you don't incorporate the business, you alienate the folks. There's a math I like to do. It's different from your basic math, right? I believe one plus one is equal to three because when you get the data and the business together, you create that synergy and then that's where the value is created. >> Yeah, I mean if you think about it, data's the only commodity that increases value when you use it correctly. >> Yeah. >> Yeah so then that kind of leads to a question that I had. There's this mantra, the more data the better. Or is it more of an Einstein derivative? Collect as much data as possible but not too much. What are your thoughts? Is more data better? >> I'll take it. So, I would say the curve has shifted over the years. Before it used to be data was the bottleneck. But now especially over the last five to 10 years, I feel like data is no longer oftentimes the bottleneck as much as the use case. The definition of what exactly we're going to apply to, how we're going to apply it to. Oftentimes once you have that clear, you can go get the data. And then in the case where there is not data, like in Mechanical Turk, you can all set up experiments, gather data, the cost of that is now so cheap to experiment that I think the bottleneck's really around the business understanding the use case. >> Mm-hmm. >> Mm-hmm. >> And I think the wave that we are seeing, I'm seeing this as there are, in some cases, more data is good, in some cases more data is not good. And I think I'll start it where it is not good. I think where quality is more required is the area where more data is not good. For example like regulation and compliance. So for example in McKesson's case, we have to report on opioid compliance for different states. How much opioid drugs we are giving to states and making sure we have very, very tight reporting and compliance regulations. There, highest quality of data is important. In our data organization, we have very, very dedicated focus around maintaining that quality. So, quality is most important, quantity is not if you will, in that case. Having the right data. Now on the other side of things, where we are doing some kind of exploratory analysis. Like what could be a right category management for our stores? Or where the product pricing could be the right ones. Product has around 140 attributes. We would like to look at all of them and see what patterns are we finding in our models. So there you could say more data is good. >> Well you could definitely see a lot of cases. But certainly in financial services and a lot of healthcare, particularly in pharmaceutical where you don't want work in process hanging around. >> Yeah. >> Some lawyer could find a smoking gun and say, "Ooh see." And then if that data doesn't get deleted. So, let's see, I would imagine it's a challenge in your business, I've heard people say, "Oh keep all the, now we can keep all the data, "it's so inexpensive to store." But that's not necessarily such a good thing is it? >> Well, we're required to store data. >> For N number of years, right? >> Yeah, N number of years. But, sometimes they go beyond those number of years when there's a legal requirements to comply or to answer questions. So we do keep more than, >> Like a legal hold for example. >> Yeah. So we keep more than seven years for example and seven years is the regulatory requirement. But in the case of more data, I'm a data junkie, so I like more data (laughs). Whenever I'm asked, "Is the data available?" I always say, "Give me time I'll find it for you." so that's really how we operate because again, we're the go-to team, we need to be able to respond to regulators to the business and make sure we understand the data. So that's the other key. I mean more data, but make sure you understand what that means. >> But has that perspective changed? Maybe go back 10 years, maybe 15 years ago, when you didn't have the tooling to be able to say, "Give me more data." "I'll get you the answer." Maybe, "Give me more data." "I'll get you the answer in three years." Whereas today, you're able to, >> I'm going to go get it off the backup tapes (laughs). >> (laughs) Yeah, right, exactly. (group laughing) >> That's fortunately for us, Wells Fargo has implemented data warehouse for so many number of years, I think more than 10 years. So we do have that capability. There's certainly a lot of platforms you have to navigate through, but if you are able to navigate, you can get to the data >> Yeah. >> within the required timeline. So I have, astonished you have the technology, team behind you. Jung, you want to add something? >> Yeah, so that's an interesting question. So, clearly in healthcare, there is a lot of data and as I've kind of come closer to the business, I also realize that there's a fine line between collecting the data and actually asking our folks, our clinicians, to generate the data. Because if you are focused only on generating data, the electronic medical records systems for example. There's burnout, you don't want the clinicians to be working to make sure you capture every element because if you do so, yes on the back end you have all kinds of great data, but on the other side, on the business side, it may not be necessarily a productive thing. And so we have to make a fine line judgment as to the data that's generated and who's generating that data and then ultimately how you end up using it. >> And I think there's a bit of a paradox here too, right? The geneticist in me says, "Don't ever throw anything away." >> Right. >> Right? I want to keep everything. But, the most interesting insights often come from small data which are a subset of that larger, keep everything inclination that we as data geeks have. I think also, as we're moving in to kind of the next phase of AI when you can start doing really, really doing things like transfer learning. That small data becomes even more valuable because you can take a model trained on one thing or a different domain and move it over to yours to have a starting point where you don't need as much data to get the insight. So, I think in my perspective, the answer is yes. >> Yeah (laughs). >> Okay, go. >> I'll go with that just to run with that question. I think it's a little bit of both 'cause people touched on different definitions of more data. In general, more observations can never hurt you. But, more features, or more types of things associated with those observations actually can if you bring in irrelevant stuff. So going back to Rolland's answer, the first thing that's good is like a good mental model. My PhD is actually in physical science, so I think about physical science, where you actually have a theory of how the thing works and you collect data around that theory. I think the approach of just, oh let's put in 2,000 features and see what sticks, you know you're leaving yourself open to all kinds of problems. >> That's why data science is not democratized, >> Yeah (laughing). >> because (laughing). >> Right, but first Carl, in your world, you don't have to guess anymore right, 'cause you have real data. >> Well yeah, of course, we have real data, but the collection, I mean for example, I've worked on a lot of customer churn problems. It's very easy to predict customer churn if you capture data that pertains to the value customers are receiving. If you don't capture that data, then you'll never predict churn by counting how many times they login or more crude measures of engagement. >> Right. >> All right guys, we got to go. The keynotes are spilling out. Seth thank you so much. >> That's it? >> Folks, thank you. I know, I'd love to carry on, right? >> Yeah. >> It goes fast. >> Great. >> Yeah. >> Guys, great, great content. >> Yeah, thanks. And congratulations on participating and being data all-stars. >> We'd love to do this again sometime. All right and thank you for watching everybody, it's a wrap from IBM CDOs, Dave Vellante from theCUBE. We'll see you next time. (light music)

Published Date : Jun 25 2019

SUMMARY :

brought to you by IBM. This is the end of the day panel Like I said before we started, I don't know if this is that you guys are giving out a little later And so thank you all for participating and then ask you to talk and my role is to make sure our line of business complies a call that the regulators are knocking on our doors. and then what's a good day or if you want to choose a bad day, And the first thing that comes to my mind So Carl Gold is the Chief Data Scientist at Zuora. as subscription and you don't want to build your billing and someone on my team is like, "The code's broken." Yeah, so those are bad days. Jung Park is the COO of Latitude Food Allergy Care. So, I don't know if any of you guys have food allergies of the food at a time and then you eat the food and then you When our patients are done for the day and I'm sure you guys all think of it similarly Great, thank you for that description. the right patients to intervene with, and then you expect that to just disintegrate Great, excellent, thank you. So a good day is a day I'm home. Yeah, when you're not in an (group laughing) for GDPR so that was a good day for me last year. and so I want to give you a chance to jump in. So over the course of the last five years, Oh my gosh you're boring. and constantly improving the business, So that's really what's happening. and the ongoing and business architecture. in the area. That's great. Four, how do you have four jobs, five companies? In five years. really count on that one (laughs). and you don't incorporate the business, Yeah, I mean if you think about it, Or is it more of an Einstein derivative? But now especially over the last five to 10 years, So there you could say more data is good. particularly in pharmaceutical where you don't want "it's so inexpensive to store." So we do keep more than, Like a legal hold So that's the other key. when you didn't have the tooling to be able to say, (laughs) Yeah, right, exactly. but if you are able to navigate, you can get to the data astonished you have the technology, and then ultimately how you end up using it. And I think there's a bit of a paradox here too, right? to have a starting point where you don't need as much data and you collect data around that theory. you don't have to guess anymore right, if you capture data that pertains Seth thank you so much. I know, I'd love to carry on, right? and being data all-stars. All right and thank you for watching everybody,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
IBMORGANIZATION

0.99+

Dave VellantePERSON

0.99+

EuropeLOCATION

0.99+

Seth DobrinPERSON

0.99+

McKessonORGANIZATION

0.99+

Wells FargoORGANIZATION

0.99+

May 20thDATE

0.99+

five companiesQUANTITY

0.99+

ZuoraORGANIZATION

0.99+

two jobsQUANTITY

0.99+

seven jobsQUANTITY

0.99+

$1,000QUANTITY

0.99+

50 jobsQUANTITY

0.99+

three companiesQUANTITY

0.99+

last yearDATE

0.99+

SethPERSON

0.99+

DavePERSON

0.99+

CloverORGANIZATION

0.99+

Lucia Mendoza-RonquilloPERSON

0.99+

seven yearsQUANTITY

0.99+

fiveQUANTITY

0.99+

two companiesQUANTITY

0.99+

Clover HealthORGANIZATION

0.99+

four yearsQUANTITY

0.99+

Parag ShrivastavaPERSON

0.99+

San FranciscoLOCATION

0.99+

five yearsQUANTITY

0.99+

Rolland HoPERSON

0.99+

$6,000QUANTITY

0.99+

LuciaPERSON

0.99+

eight billion dollarQUANTITY

0.99+

5 yearsQUANTITY

0.99+

CarlPERSON

0.99+

more than seven yearsQUANTITY

0.99+

one companyQUANTITY

0.99+

San Francisco, CaliforniaLOCATION

0.99+

todayDATE

0.99+

North AmericaLOCATION

0.99+

OneQUANTITY

0.99+

FourQUANTITY

0.99+

JungPERSON

0.99+

three jobsQUANTITY

0.99+

Latitude Food Allergy CareORGANIZATION

0.99+

One jobQUANTITY

0.99+

2,000 featuresQUANTITY

0.99+

Carl GoldPERSON

0.99+

four jobsQUANTITY

0.99+

over $100 millionQUANTITY

0.99+

firstQUANTITY

0.99+

bothQUANTITY

0.99+

oneQUANTITY

0.99+

EinsteinPERSON

0.99+

first questionQUANTITY

0.99+

16 peopleQUANTITY

0.99+

threeQUANTITY

0.99+

first goalQUANTITY

0.99+

ParagPERSON

0.99+

IBM Chief Data Officers SummitEVENT

0.99+

RollandPERSON

0.99+

six monthsQUANTITY

0.98+

15 years agoDATE

0.98+

Jung ParkPERSON

0.98+