Image Title

Search Results for cambridge massachusetts:

Priyanka Sharma, CNCF | KubeCon + CloudNativeCon NA 2021


 

hey welcome back to los angeles thecube is live here at kubecon cloud native con 2021 we're so excited to be here in person lisa martin with dave nicholson and we are here with priyanka sharma the executive director of cnc at prayanka welcome to the program thank you so much for having me first of all congratulations on doing an event in person in such a safe clean way i was really impressed when i walked in this morning was asked for my vaccination record my temperature was scanned you're proving you can do these events safely this isn't rocket science so agreed and i'm so glad you appreciate all the measures we've put in place because this is how we can do it in-person interaction is essential for us as human beings for us as professionals and so we owe it to each other to just do the right thing you know have a vaccine requirement wear your masks have these what i call the traffic light uh system where if you have a green a green band it means people can come a little closer it's okay red means please at least six feet of distance and these things go a long way in making an event successful in times like this they do i love that when i saw that mine keeps falling off i'm cold so it keeps falling i'm green just so you know i know you're green what about you i'm green here you can can i have yours that's my favorite and you know you fell off again you had um the three folks that came up who were uh like uh co-chairs co-chairs yeah yeah and uh and they did almost a little almost a little skit yes that on the surface people could say well that's ridiculous and it's like no it's not it's giving everybody the guidelines so that everyone can be comfortable because when i see your green wristband i understand that you are comfortable because i don't want to accidentally reach out to give you a fist bump when you might be particularly of course yeah yeah so no visual cues make it easy yes yeah yeah very very easy very comfortable talk about the energy at the event this is the second full day tina was standing room only yesterday give us an overview of the energy and some of the things that are happening since you can't replicate those hallway networking conversations on video conference i know exactly what you mean man it is so lovely to be in person to meet people and you know for those who are comfortable there's like the fist bumps and the hugs and the big smiles and that energy i haven't seen it in almost two years um and even you know just standing on stage as i was telling you folks uh off camera i've been in this role for over a year and a close to a year and a half i've done three cube cons already but this was my first in person and being on that stage experiencing the energy of the people in that room like when i asked everyone during my keynote i was like are you all proud to be team cloud native and i got a resounding yes back from the audience that's what i'm talking about yeah you know it was amazing what's some of the news that's breaking lots of stuff going on obviously some first one in person in almost two years but talk to me about some of the the news that's breaking here at the event yes so so much new stuff to share um from our side on cncf our journey has been very much about being celebrating our culture and welcoming more and more people into it so that we can have more folks in team cloud native to take various jobs to find fulfillment and all those great things right and all of our announcements are around that theme of people finding a place here people paying it forward in this community and building the culture the first one i would like to share is the announcement of the kubernetes and cloud native associate certification so this is an exam that is going to go live end of the year so people sign up apparently the beta signups went away like this after i announced it so it was really cool wow popular by demanding yeah very very popular and it's it's an exam for folks who are brand new to cloud native and it the studying for it you'll go through you know the fun fundamentals of kubernetes what is the cncf landscape what are the key projects and ultimately you will actually deploy an application using coop cuddle commands and it's such a great primer so so how brand new can someone be when you when you say brand new are you talking about someone who already has a phd in computer science but hasn't done anything in the kubernetes space tell me how brand new can you be uh-huh that's a very good question and it is literally you can come with zero knowledge you would of course have to study for the exam and like go through that journey but the idea is that it is the gateway and so it is possible you're a phd in computer science but you've studied some esoteric part of computer science that's very unconnected to what we do sure go ahead and take it but maybe most likely you would like the more advanced certifications better but if you're let's say a marketer looking to break into the cloud native industry this is the move take this exam and suddenly all these employers you speak their language they'll be impressed that you took it and it's it's an opportunity to advance your career the oh community is huge i was looking at the website the other day 138 000 contributors yes from more than 177 countries 186 is the latest number 186 awesome 289 plus million lines of code written this community is really so productive and so prolific and it's great that you're offering more folks that don't have the background like you were saying to be able to get in and get started absolutely it's our whole thing of bring in more people because as you all probably know there's so much demand for cloud native skill sets across job functions so that's why we're here to help with yeah i you know i i want to double click on this as we say because you hear the word inclusive associated with this whole community so much um you're talking about something that is a certification yeah a marketer okay fine but we're really talking about anyone who has the drive to potentially completely transform their lives yes and in this age where things can be done remotely you don't necessarily have to live in silicon valley or cambridge massachusetts to do this or in one of the other global centers of technology anywhere yeah so that's the that's the kind of energy that's part of this that isn't a part of any large industry focused conference because you really are making opportunities for people of all backgrounds to change their lives so i don't know i don't am i extending a a virtual thank you from all of those people whose lives have been changed and will be changed in the future maybe i am but so but talk about inclusiveness in in you know from from other perspectives yes i think that you know talent drive skills none of these are exclusive to a certain zip code you know people everywhere have great qualities and deserve chances and why shouldn't they be part of a community that as you said is especially inclusive feels especially nice to be a part of and that's what i exhorted the community to do in my keynote yesterday which is that our ranks will grow and we should go out of our way to make sure our ranks grow and we do that by shining a light on our culture telling people to join in lending a hand and you know letting people's personalities shine even when they'll be different from who we are whether in terms of job function or skill set whatever and i think that's the top level um paradigm that we want to have right where we are always welcoming people when we think of inclusiveness it is you know there is certifications like kcna did do a great job there are also efforts that we must always be doing so something that we work on constantly consistently is contributor strategy where we're working on creating ladders and pathways for folks to become open source contributors it is known now that open source contributions lead to job advancement in your career right and so the whole goal is bring people in not just to hang out not just to talk but to actually grow and actually kubecon cloudnativecon is a great example of another little thing we do which is uh we uh award uh underrepresented minorities and people who are who need need-based funds scholarships to attend nice yeah and it's changed one thousand 1518 lives already and we actually uh in uh in this event have announced that we are renaming the scholarship to the dan khan scholarship fund um i i do you folks know dan yes did yeah so dan he breathed life into team cloud native right he built this organization to have the impact that it does today and all the while he was relentlessly focused on diversity equity inclusion so it was it was just like the idea came from within the team and the minute someone said it it just struck a chord with all of us yeah we're like we're doing this no question and it was one of the fastest decisions we've ever made i saw uh some results of a dei micro survey on the website where 75 percent of respondents say this community is becoming more inclusive there's obviously work to go but as a female in technology you feel that you see that as well yes i think i'm very proud of that survey that we did by the way because it's our way we're going to keep doing it it's our way to keep a pulse on the ecosystem because you can keep doing initiatives right but if people are not feeling great then who cares and so um but yes i think dei is a journey if there is no destination right always we have to be thinking harder trying harder to you know i think for example something cncf's done a great job is identifying particularly gender diverse folks who are in the community and maybe could deserve a role of high responsibility so i'm really proud that our technical oversight committee which is our really the top technical people in the ecosystem who desi decide project stuff they are led by a woman there's many women on that and it's they're all very exemplary awesome technologists and so i think um the diversity survey gives us like a hint into like the things people do like and i mean the fact remains we need to do more to source more people to come into the ecosystem we need to always be changing and evolving with the needs of the community right as i mentioned the community is 138 000 strong 6.8 million plus contributions so far you can imagine by opening that dei door just the thought diversity that comes in alone and the number of projects that will come from folks that just come in with a different mindset oh 100 we are already seeing that um we started off as folks who had you know lots of projects from the great big tech companies people who had web scale problems as i call it and that was great but in recent years the end users who are initially just consuming this technology and that too slowly are now hook line and sinker in and we have like argo cd came from intuit which is an end user uh backstage came from spotify which is an end user so this trend is growing and the diversity as you said is continuing yeah i i'm particularly interested in the dynamic where you have people who have their day job if you will where their employer is absolutely 100 encouraging them to participate in the community to develop things that will not only help the employer and that mission but also building uh solutions for everyone and providing enrichment for the for the person and and i i'm i'm going to make a little bit of a prediction i want to get your thoughts on this i think that um one of the silver linings of what we've been through in the pandemic having a lot of people at home having that relationship with your primary employer be just a little bit different and just a little bit more removed i think everyone is realizing that you know what um we all need a passion play to be a part of in addition to whatever we're doing to put bread on the table in the immediate future and so i i think that i want to hear your thoughts there's going to be an explosion in contributions from people and hopefully a lot more openness on the part of employers to let people dedicate their time to this do you do you see that do you think that yes i think i think you're really on to something here um something i mentioned in my keynote right was this conversation i've had with so many that we in this community our identity is cloud native first so we're folks who are in team cloud native before we are working at insert company name you know um google at t spotify whatever it's not a dig on the company it's actually a celebration of those companies because they are liking the developments that happen in open source they are appreciating the value these people are creating and they're employing them so absolutely there is this ongoing trend of folks seeing great value in folks who understand this cloud native projects in particular and of course right because we have been such a great place for industry collaboration lots of vendors have great products make lots of money on these projects and that's as it should be and so the value of the people contributing to these projects is very high and it will only continue to grow i imagine so so here we are in los angeles at kubecon cloud native con 21 what's what's next well uh the good news is this was the first of many to come hybrid events in person plus virtual and the next one is happening in end of may in valencia for europe 22. valencia spain and i have heard beautiful weather very nice people amazing food so just for that that alone is worth registration yes i know right it's going to be amazing i'm so excited and i hope i will see you folks there sign me up i've never been to spain i'm there me too let's do it excited let's do it for our spanish-speaking uh viewers i will say claroque he you can't you do you can do it all you can speak spanish on the queue we can have something honestly i'm impressed i'm impressed i can't i can't do that any and you donated your green card so thank you so much so nice congratulations on the event thank you uh for growing the community for and growing the diversity of it and for the the projects that are going on now and we're sure many more to come we look forward to seeing you in valencia in may thank you so much see you in valencia all right we'll see you there for dave nicholson i'm lisa martin we are live in los angeles the cube is covering kubecon and cloudnativecon at 21. stick around we'll be back after a short break with our next guest

Published Date : Oct 14 2021

SUMMARY :

that don't have the background like you

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Priyanka SharmaPERSON

0.99+

lisa martinPERSON

0.99+

valenciaLOCATION

0.99+

dave nicholsonPERSON

0.99+

los angelesLOCATION

0.99+

100QUANTITY

0.99+

yesterdayDATE

0.99+

spainLOCATION

0.99+

more than 177 countriesQUANTITY

0.99+

three folksQUANTITY

0.99+

spotifyORGANIZATION

0.99+

289QUANTITY

0.99+

cambridge massachusettsLOCATION

0.99+

firstQUANTITY

0.99+

one thousandQUANTITY

0.98+

138 000 contributorsQUANTITY

0.98+

spanishOTHER

0.98+

silicon valleyLOCATION

0.97+

a year and a halfQUANTITY

0.97+

kcnaORGANIZATION

0.96+

priyanka sharmaPERSON

0.96+

pandemicEVENT

0.96+

186QUANTITY

0.96+

CloudNativeConEVENT

0.95+

second full dayQUANTITY

0.95+

valencia spainLOCATION

0.95+

138 000 strongQUANTITY

0.95+

zero knowledgeQUANTITY

0.93+

tinaPERSON

0.93+

europeLOCATION

0.92+

todayDATE

0.92+

danPERSON

0.92+

cncORGANIZATION

0.92+

oneQUANTITY

0.91+

CNCFORGANIZATION

0.9+

first oneQUANTITY

0.9+

prayankaORGANIZATION

0.9+

6.8 million plusQUANTITY

0.89+

75 percent ofQUANTITY

0.88+

almost two yearsQUANTITY

0.88+

kubeconORGANIZATION

0.88+

over a year and aQUANTITY

0.88+

almost two yearsQUANTITY

0.84+

googleORGANIZATION

0.83+

KubeConEVENT

0.81+

cloudnativeconORGANIZATION

0.78+

million linesQUANTITY

0.77+

this morningDATE

0.77+

NA 2021EVENT

0.74+

end of mayDATE

0.73+

cncfORGANIZATION

0.71+

least six feetQUANTITY

0.71+

of moneyQUANTITY

0.71+

conEVENT

0.69+

lot of peopleQUANTITY

0.68+

one of the fastestQUANTITY

0.66+

2021DATE

0.61+

endDATE

0.6+

1518 livesQUANTITY

0.59+

executivePERSON

0.59+

mayDATE

0.57+

cloudORGANIZATION

0.57+

kubecon cloud nativeORGANIZATION

0.56+

dan khanPERSON

0.54+

intuitORGANIZATION

0.53+

tORGANIZATION

0.52+

the silverQUANTITY

0.5+

conLOCATION

0.37+

Wendi Whitmore, IBM | IBM Think 2020


 

>> Narrator: From theCUBE Studios in Palo Alto and Boston, it's theCUBE, covering IBM Think, brought to you by IBM. >> Hi everybody. Welcome back to theCUBE's continuous coverage of IBM Think 2020, the digital version of IBM Think. Wendi Whitmore is here. She's the vice president of IBM X-Force Threat Intelligence. Wendy, thanks for coming on. >> Thanks for having me. I'm excited to be here. >> Yeah, you're welcome. With a name like X-Force. That is a killer name. Tell us about X-Force. How are you protecting us? >> Yeah, we get a lot of interesting questions. So, my team is responsible for a pretty wide range of things. They range from incident response. So, when you think of data breaches, typically organizations will call an outside firm, and they'll jump on a plane and respond to threats on-site. Obviously right now, we're jumping on a bit fewer planes, but we still are helping our customers investigate data breaches, and we are on-site when needed. We also have a team of threat intelligence analysts and researchers, who are experts in a wide range of fields from geopolitical issues to cyber-related issues to industry specific. And then we've also got a team that does data breach simulations in a very immersive environment. We've got facilities at Cambridge Massachusetts, as well as within Europe, and now of course, we're bringing all those virtual as well. So, really anything that helps our clients respond more effectively to a data breach is something that we do. >> So, X-Force is traveling right now on empty planes, I presume. >> We are as needed. So, many clients have certainly shifted to where their whole environments are off-site and working remote as well, but we still have clients who are asking us to work on-site, and in those cases we have added a new protective gear to our go-backs, which are usually equipped with hard drives and disc imaging software and passports, and now we have some additional equipment to bring as well. >> And that breach simulation that you talked about. So that's what, like a penetration test, or in similar type of activities? >> Yeah, great question. No, it's actually an immersive environment where we go in, and actually simulate an entire breach for our clients. So, everything from the initial attack, how they would do the data analytics, to things like, how do they respond to the press, and inquiries from the press about the breach, how do they do media training, how they work with their legal counsel. So, it's really a comprehensive immersive environment that simulates kind of the heart pounding that occurs when you actually respond to a data breach. >> Oh, that's awesome, so that mean best practices in communications as well and the PR. I mean, that is obviously, maybe something that's often overlooked, but something that you guys are applying best practice to. >> Wendi: It's such a huge piece of it now, right? Our organizations are not always graded just on the breach itself, but more so on how they respond and how they communicate. The good news is, in that scenario that you can communicate effectively about a breach, and you can have something pretty negative that happens to your organization, but if you respond well, and you communicate really effectively to your clients and to the public, we've seen time and again that those brands actually have no reputational damage, and if anything, their clients trust them even more moving forward. >> We were early on when recording the, just trying to measure the budget impact of COVID-19, but we were early in recording the work from home shift. About 20% of the CIO organizations that we surveyed, actually spending more, or planning to spend more, but many weren't prepared for this work from home. They had to really beef up, and not just adding licenses of video collaboration software, but security for sure, a VPN infrastructure, et cetera. So, can you talk a little bit about how clients have responded, how you've helped them respond to that shif? How has the threat matrix changed? >> Well, so in terms of the attack surface, you mentioned there's a lot more people working from home, right? So, what we've got is over 220 million people in the United States, over one billion people in India alone, that are now working from home. So as you can imagine, that attack surface has really increased from an attacker perspective, right? And coupled with that, is that since March 1st, we've already seen a 6000% increase in coronavirus related spam. So, you've now got this larger attack surface that organizations need to protect against, and you've got an increase in threats and threat activity that is attacking them. So, from that perspective, pretty difficult for CIOs who are used to defending an environment that may be more on-site, and now have this really wide range of attack surface certainly more difficult for them to respond to. The other thing that we've seen, so one of the things that's super critical in these types of situations is to have an incident response plan, and to make sure that you're testing it. So, in our work that we've done both with our incident response teams, as well as with the teams that train clients in how to respond to breaches more effectively, we've seen that 76% of organizations don't actually have a consistently tested or applied incident response plan, and one in four have no plan at all. So, I will say that in terms of how we're working with clients, the first thing that any organization can do right now, is actually, have a plan and test it. So, if you're starting from scratch, it's really as simple as putting words on paper, understanding how you're going to get a hold of your critical team members, having a backup plan in place for communication strategies if your primary infrastructure goes offline. So making sure you know how to get a hold of your personnel. If you're more mature, then what we're really encouraging our clients to do is have a variety of scenarios that they're testing against, and make sure that they're running through those. So, a great one to practice right now, would be a ransomware attack. In particular, how does your organization respond effectively to it? What do you do when you get the initial notification? Do you have critical and sensitive data that's backed up offline, and not always connected to the network? If so, you're going to be in a much better spot to effectively defend against those attacks and limit any of the negative impact to them. >> So, a couple things I want to sort of follow up in. So, what I heard was you've got more fragile work-from-home infrastructure, and you've got somewhat, well, significantly more vulnerable users. I've often said, bad user behavior is going to trump good security infrastructure every time. So, you've got many more opportunities for the bad guys to get in. And so, I'm hearing that threat response is now more critical than ever. It's always been critical. The communication to the board has been hey, chances are we're going to get infiltrated. We got to find it fast, and it's really about response, incident response. We can build modes, we can build layers, but we have to put a plan for that response. And so, it sounds like that's something that maybe is heightened as a result of this COVID-19 crisis. >> Wendi: Oh, it absolutely is. I think it's now more critical than ever. I think there's two approaches, right? So, one of them would be improvising through chaos, which we don't necessarily encourage, right? There's a difference between that and really managing through disruption, and that's what we're encouraging our clients to do, is look at how we can create sustainable processes and procedures. You may have a very well-established team that does response, but perhaps they haven't worked remotely before. So, that means testing those procedures, now taking them to a scenario where everyone is remote. What does that mean? It may mean that you need to capture less data over the network, because perhaps you just don't have the bandwidth or the capacity to do it. We've certainly looked at how we do that. How do we answer questions that are critically needed from an investigative perspective, for example, but without maybe all the resources that we would prefer to have. So, what we're really looking at, is kind of shifting in the way that we manage through these. And then, you mentioned that users who maybe sometimes make bad decisions, right? We're all guilty of that, because especially with that increase in spam, there's also been an increase in Nation-State actors who are now sending out new lures and new attempts to get access to environments that are related to coronavirus. So, we've got cyber criminals, Nation-State actors, everyone, and we're now at home looking to effectively defend. So, some things that organizations can do with that, would be insuring that they have multi-factor authentication on all remotely accessible systems. So, devices, applications, anything that can be accessed remotely should have multi-factor authentication. That will help limit some of the impact. As it relates to spam, organizations should really be making sure they've got good email spam-filtering systems in place, and if they have the capability to send out some test emails to their employees, they should do that, right? We are getting numb. I will say, our CIO and their office does it at least once a week where I know I'm getting a very well-crafted email, and I have to really think twice, and it's really made me think differently about opening my email, and making sure that I'm doing some due diligence, to make sure I know where the email's coming from. One of the things we do, is also any external email is labeled external, so that way if it's a lure that appears to be, it's coming from another employee, but it's actually coming from an external email address, that's another way to help users make some good decisions, and really limit your attack surface, and reduce the threat. >> I think the points you're making here are very important, because if you think about the work-from-home cadence, it's a lot different. You're not nine to five. I mean, who works nine to five anyway, but your hours are different. Oftentimes, you got children to hone. You got dogs barking, kids are crawling all over us on the video. And so, oftentimes, of course we're frenzied at work, but there's a different kind of frenzy, so you might not be as in tune. So, you're basically saying, exercise that a little bit to get people, like a fire drill, to really get them tuned to being sensitized to such phishing attack. >> Right, well if you think about this from the viewpoint of an attacker, all of those scenarios that you mentioned, where you have a global pandemic. So, we're not just talking about a regional threat, like a hurricane or a tornado. In a case of a pandemic, or any of these type of situations, people are more likely to be reading the news, be probably checking social media more often, so that they can get an understanding of the latest news and information that may impact them. If you're an attacker, you've got now this kind of environment of global chaos that's been created, and you can use it to your advantage, because the reality is, as long as there's money to be made, attackers are going to want to take advantage of that scenario. So, what we're really talking about is, as you're reading your work email, as you're checking your personal email, taking a step back, slowing things down amidst all the distractions, barking dogs and co-workers now that may be at your house, also known as children, right? So, we need to really take a step back, and make sure that we are slowing things down, reading and doing due diligence in opening emails that will help all of the CIO and CISO type organizations more effectively to protect their organizations and their clients as well. >> When you talked about ransomware earlier, and I inferred from your comments that best practice, create an air gap, but I'm wondering also, can analytics play a role there, just in terms of identifying anomalous behavior? What else can I do to protect myself from ransomware? >> Great question. So, on the visibility side, which I think is what you're talking about, right? How do we detect these types of attacks? There's lots of great software out there. Typically, what we would want our visibility at the endpoints. So, usually some sort of EDR tool, which is an endpoint detection and response tool. That's going to allow us to capture things. In the old days, we would talk about antivirus software, and now you really have kind of next generation of antivirus software, which also gives you behavioral analytics and actions on the keyboard. We want to be able to detect that in any size environment. So, the more visibility we have into that, the better, but aside from just adopting new technology, potentially, there are best practices steps that we can take, and I mentioned earlier about making sure that you understand what is your most critical and sensitive data, and that you've got it backed up, and a lot of times we go into environments, and they say, "Well yeah, we have backups." This is great, but what they're not realizing, is that oftentimes those backups are connected to the network at all times, and in the case of a ransomware breach, you typically then will see those backups corrupted as well, and organizations will find themselves in a position where they say, "Well, we don't have any valid backups now "that we can restore from, in order to make sure "that we have a safe environment." And so, it's important that organizations understand and do a survey of what is their most critical and sensitive data, and then make sure that's backed up offline, and I say that, because it's not usually viable for organizations to have all of their data backed up offline. That costs a lot of money. That requires a lot of storage, but to look at really prioritizing their environment, their data within it, and making sure that they can have access to that which is needed, and then ultimately that's going to prevent you even needing to have the conversation about ransomware, because you still have access to that data. >> Yeah Wendi, I think you're making some really important points there. The tech obviously, is critical. People shifting to SD-WAN, securing endpoints, securing gateways, but really the processes are very very important, and I'll just throw out an example. If I'm making a snapshot of the Cloud, I'm not backed up. You better make sure that you understand how to recover from that backup, because just that copy is not a backup. You need the proper type of recovery software. You need to test that. Your thoughts on that. >> Yeah, that's absolutely true. So, what we want to make sure is that during the course of a potential ransomware attack, that the email's critical sensitive data is available offline. So, I mentioned earlier that testing is one of the best things that we're recommending. One of the most effective preparations is having an incident response plan, testing it for particular scenarios, and so in this case, one of the other things that we talk about a lot is limiting the impact of a breach. Every organization is going to get attacked, especially in today's day and age where you've got a larger attack surface. The win is really limiting the impact of that attack, and limiting the cost, and having an incident response plan, and having a team of people, whether they're internal or external that are responsible for responding to attacks, is the number one cost management. The number one decrease in cost is having access to that team. Typically, it will save an organization over a million dollars when the average cost of a data breach is about $4 million. So, that's pretty significant, and ultimately, if we can test, as you mentioned, those backups, that they are available in an offline scenario. In the course of one of those IR program plans or tests, that's great. It's a win for the organization. They can ensure that that data is going to be available, and it really helps them exercise that muscle memory in advance of an actual attack. >> Yeah, so the backup corp is actually becomes a really even more important component now. This has been great information. Where can people go specifically as it relates to COVID-19? I want to go look up a checklist to make sure. I've been scrambling to get my homeworkers up and running, get them productive, but boy, I really want to focus now on the things that I should be doing to button up my organization. Where can I go to learn more about this? >> Yeah, so there's so much great information out there, from everyone in the industry, but IBM is clearly no different. So, what we've done is action repurpose at IBM.com homepage where we've got a tremendous amount of information on COVID-19, and then IBM Security.com as well. Our team that focuses on breach response, has in particular, a site called X-Force Exchange, where we're sharing indicators, and we have a particular component that's related to COVID-19 specifically, and then lastly, we've got a free service, which is a threat intelligence enclave that we are hosting with our partner TruSTAR, that is specific to COVID-19 where industry organizations can sign up and then share in real time, threat indicators related to this, and have really that intelligence that's been also qualified by their peers, and many large organizations are using that to defend their environments. So, a lot of great resources out there. >> Wendy, you're an amazing source of knowledge. Thanks so much for coming on the theCUBE, and thanks to the X-Force team, doing some travel when necessary, and helping people really get a handle on this in this crazy crisis time. So, thank you very much. I really appreciate it. >> You're welcome, and certainly stay safe, and thanks for having me on. >> Back at you. All right, and thank you everybody. This is Dave Vellante for theCUBE. You're watching our continuous coverage of IBM Think 2020 Digital Think. Be right back right after this short break. (uplifting music)

Published Date : May 5 2020

SUMMARY :

brought to you by IBM. of IBM Think 2020, the I'm excited to be here. How are you protecting us? and respond to threats on-site. So, X-Force is traveling right now and in those cases we have that you talked about. that simulates kind of the heart pounding but something that you guys that happens to your organization, organizations that we surveyed, and to make sure that you're testing it. for the bad guys to get in. that are related to coronavirus. to get people, like a fire drill, and make sure that we and in the case of a ransomware breach, You better make sure that you understand is that during the course as it relates to COVID-19? that is specific to COVID-19 and thanks to the X-Force and thanks for having me on. All right, and thank you everybody.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

EuropeLOCATION

0.99+

Wendi WhitmorePERSON

0.99+

IBMORGANIZATION

0.99+

6000%QUANTITY

0.99+

WendiPERSON

0.99+

IndiaLOCATION

0.99+

Palo AltoLOCATION

0.99+

March 1stDATE

0.99+

BostonLOCATION

0.99+

United StatesLOCATION

0.99+

76%QUANTITY

0.99+

WendyPERSON

0.99+

oneQUANTITY

0.99+

COVID-19OTHER

0.99+

about $4 millionQUANTITY

0.99+

twiceQUANTITY

0.99+

two approachesQUANTITY

0.99+

over 220 million peopleQUANTITY

0.99+

bothQUANTITY

0.99+

over one billion peopleQUANTITY

0.99+

OneQUANTITY

0.98+

nineQUANTITY

0.98+

theCUBEORGANIZATION

0.98+

COVID-19TITLE

0.98+

X-ForceTITLE

0.98+

X-Force ExchangeTITLE

0.98+

todayDATE

0.98+

over a million dollarsQUANTITY

0.96+

About 20%QUANTITY

0.96+

fiveQUANTITY

0.95+

X-ForceORGANIZATION

0.95+

theCUBE StudiosORGANIZATION

0.94+

Think 2020COMMERCIAL_ITEM

0.94+

TruSTARORGANIZATION

0.92+

first thingQUANTITY

0.92+

Think 2020 Digital ThinkCOMMERCIAL_ITEM

0.9+

fourQUANTITY

0.84+

Threat IntelligenceORGANIZATION

0.83+

pandemicEVENT

0.82+

once a weekQUANTITY

0.79+

at leastQUANTITY

0.74+

coronavirusOTHER

0.73+

Cambridge MassachusettsLOCATION

0.73+

IBM Security.comORGANIZATION

0.7+

IBM Think 2020COMMERCIAL_ITEM

0.68+

themQUANTITY

0.63+

lot more peopleQUANTITY

0.63+

coupleQUANTITY

0.61+

globalEVENT

0.58+

ThinkCOMMERCIAL_ITEM

0.53+

IBM.comOTHER

0.39+

Darren Murph, GitLab | GitLab Commit 2020


 

>>from San Francisco. It's the Cube covering. Get lab commit 2020 Brought to you by get lab. >>I am stupid, man, and this is the Cubes coverage of Get Lab. Commit 2020 here in San Francisco. Still hard saying 2020 and not thinking we're in the future. Joining us first time on the program. Darren Murph, who's the head of remote at get lab and a little birdie, told me that he also has a Guinness Book of World record. So, >>Darren, before we get >>into your day job, bring us back, is it? It's, uh, it's an interesting thing. >>Yeah, it's true. Surreal to have their records. So I'm the world's most prolific professional blogger, which is kind of crazy of Britain. About 10 million words or so When the record was given, it was about 17,000 articles, which was an article published every two hours, 24 7 for four straight years, which actually makes my chest hurt just a bit saying that out loud. >>But Darrin, I'm going thio going to do the Turing test right now >>because I'm not sure you're human being. I have been a blogger. When I had my regular cadence of 2 to 3 articles a week and knew the you know, 10 p.m. When the kids were in bed when I would do that stuff. The amount of words you're saying, um, you know that does not compute with me, but amazing stuff. Congratulations on. And you still you don't keep up that pace >>anymore. Try actually, part of >>my job here, get Lab is to make the all remote section of our handbook a lot bigger and better than it. ISS. So I'm still cranking away very different capacity than covering consumer electronics, but still cranking. Yeah, all right. >>Remote? Yes. Really interesting topic. When we talk about the future of work, you talk about the gig economy. There's all these ways that, you know, how do we leverage and enable global and changing workforce? And it's really fascinating. Get lab over 1100 employees and completely remote keynote this morning talked about, you know, the woman in New Zealand that's completely cut off with everything except for the Internet. She does her own power and everything like that, but she could just be part of the team and you don't even know. So tell us a little bit. What does that mean? Head of remote for your >>old. So as we've grown, it's It's interesting. There's an intersection of hiring and recruiting, talent branding, but also process. So we have a lot of people joining the company that come from co located spaces. And there's a certain acclamation period to getting used to remote and doing remote. Well, you think about people that have joined a co located company. They walk into an office where they've had a professional design, their office space. So they have ergonomic chairs, ergonomic monitors. Everything is set up for them. But if you're working from home or working for some wires from somewhere outside of an office, now that's on. You did. How do you do meetings? Well, how do you do? A synchronous Well, so part of my role is to work through those processes to make sure everyone, when they're on board and get lab, is giving the best possible experience. If they're coming into a remote role for the first time, >>you do you have any bias towards it makes sense to just build it in my home. Is it good to go to Oh, You know, I live in Cambridge, Massachusetts, and there's this cool place where I can go where they have good coffee and the people I can hang out with everything in between. What are some of the best >>practices there? The beauty of all remote is we don't say you have to work from home. You can work from anywhere. So wherever you're most comfortable, maybe that's a co working space. Maybe that's at a friend's house. Maybe that's a different place. Every month we have people that travel all around the world and every month or in a different time zone. That's the beauty of it. So we have over 1100 employees, but none of them operate their days quite the same as anyone else, and that that's the beauty of a super diverse and inclusive team. And, well, actually reimburse co working space expensing expenses. If you just feel more comfortable working in a group or you need to leave your home for any reason. Okay, >>um, we talked to sit a little bit about, you know, does this remote work for every type of job? His feedback from the light Combinator people were like, If you're in finance, It might not be the best fit on dhe one. Understand? How does the software help does it if if I'm someone that's doing development, you know, doesn't poke me every once in a while and say, Hey, you know, maybe you should eat and sleep every once in a while And you know, you've been going at this a little long time, you know? How does both the kind of the kind of HR and the technology piece fit together? >>Well, we hire people that are managers of one, so having a high degree of autonomy is really important. So you need to have a lot of self awareness in managing your day, and that includes taking breaks. And so we encourage people to take vacation, take breaks whenever they need it, and again everyone is different. So we enable people to take that as they need that. But no doubt when you're hiring, you want to look for things like that. It helps to have some experience in the working world, definitely with interns in junior level level level people, you need to check in with them more often because managing their own time and themselves when you're not in an office setting can feel a bit far, but actually get lab. The product is tailor made for remote teams because it's built by remote and even on the marketing side of things where no code really is involved. We use it to manage entire projects and entire events. And the beauty of that is it hones in on documentation, which is essential to do remote. Well, so we say any part of a project that you're moving forward try to move that forward and add context in a way that someone else who may be asleep right now when they wake up and read your stuff along the way they have context of what you did and can pick it up from there and move it on to the next step so that that helps us work really well, remotely. But honestly, that is probably useful for co located teams as well. And so a lot of people look at us as all you know, you have this all remote team. Things must be drastically different, but the truth is, all remote forces you to do things that you should be doing anyway. Transparency, documentation iteration. We just have to do them much more quickly and much more intentionally. >>Yeah, when everybody gets together event like this, Are they okay being on the same room, or do they want to go documents and things and hand things off? >>The funny thing about that is people will often say I don't know if I could I could work well in a remote environment because I really love the energy of being with other people. And the truth is in person, interactions are vital to a remote company. We have to be really intentional about that as well. So we get a CZ many people, as we can together for things like get lab, commit and get lab, contribute where we invite the entire company. But the subtle differences when you're in a co located space and you see people on a daily basis, sometimes you can take for granted in person interactions you have because you think I'll just see them again and again and again, but never in a remote setting. When you have to be intentional about when you'll see a person, it's it's there's a certain level of energy and, um, you proactively look forward to moments like that because you don't get them is often. So we build a lot of great bonds and relationships around those key in person moments. >>Is there anything along the communication technologies that you recommend you use video conference thing or, you know, phone calls? Or you know what some of the recommended How do I make sure you get, you know, some high bandwith interacting? >>There's a few tools that we use that didn't exist not too long ago, but because they exist now they've made all remote as a concept far more approachable and feasible. Google Docks is a big one. We cover agendas and things like that and something that could be edited by multiple people at once. Zuma's another one. Zoom is really amazing for video communication because many, many dozens of people hundreds of people, could be on the same call. And with very little technical difficulty, everyone can communicate well, which has been amazing for us being able to see each other on connect on a meaningful in a meaningful way, and the last one is actually get lab the product. So we build our handbooks. We have over 3000 pages of publicly accessible Get Live Handbook, How We Do Everything that is All publicly available on the Web and built and edited by Get Lab the product. So as we use get lab product to edit and iterated on their handbook, we as a get lab team see things that could be done better, more efficiently, and that gives us a flywheel of making the product better and then making the handbook better. So, >>Darren, I'm just curious where there any kind of interesting findings that you've had, uh, going to a company this size now with everything remote that you know, surprises >>the team. Well, I've worked remotely my entire career in different stages of remote. So some of the companies have been about 50 50 and some have been most of the people in the office. And then I've been one of maybe 10% of the company that works. Outside of it, you see all different facets of how people and companies communicate when you're in a hybrid remote setting. But the beautiful thing about all remote is it truly makes everyone a first class citizen. So a lot of people will say in a hybrid setting. If I don't go to the office frequently enough, miss out on some things. Or maybe I miss out on praise or promotion opportunities. Things like that. You feel like a second class citizen. So in, in in, in an environment like that, you have to take certain approaches to include people. You have to think about it intentionally to include those remote individuals, whereas if it's all remote, you're all on a level playing field. I think the other interesting thing is we have an amazingly diverse team over 65 countries because we hire the world's best talent from wherever they are. And so you'll be talking with someone on a call and you'll just see what's in their background. You think that looks completely foreign to where I am, and it's an amazing way to engage with someone and learn about them, learn about a new culture and truly keep a more global perspective. And lastly, all remote enables a workforce that may have been rejected at other stages of the workforce. So things like caregivers or working parents or military spouses where their spouse has to move at each new deployment. Ah, lot of these people might say, You know, it's too complicated to continue to reinvent my career with every move along the way in an all remote setting. Your job goes with you as those changes in life happened. And I just think that's going to become more than Norm, where the notion of moving for a career will seem silly, like the career should just follow you, no matter where life takes you. >>Yeah, I guess that lets follow upon that is we've reached a point where people expect, you know, immediate response. It's a text. It's something like that. When I'm dealing with dispersed and remote, how do you make How do you is that something you deal with? Is it something that is a team monitoring and handling that? But how do I make sure that I'm a little? I would I think, that it has to be a little bit more forgiving of not being an instant response. >>I tell you, all remote is actually much better for your mental health insanity than other settings, and it's because it forces us to work a synchronously. There's no other way to do it. We have people spread or call 65 countries, so almost every time zone is covered. But that also means there's almost a guarantee that someone on your team isn't a vastly different times, so they may be asleep the entire time. Europe working. We also allow people to just structure their day day today differently, depending on what they have going on appointments, things they need to attend to with their Children, things like that. So within a single as mindset, it enables all of us to take a step back and just assume that whatever we're doing is done with no one else online, so it can removes the burden of this nonstop string of slag messages where you have to respond to things immediately within a given time frame. We don't operate in that construct. I'll tell you, just from a mental health standpoint, when you have an entire company that embraces that were all given a little more breathing room to do really good. Deep work requires long periods of uninterrupted time, and we've seen massive improvements on the product and just team morale. When we embrace that and I feel like as a whole as a society, we're getting close to a tipping point where people are just to their limit on how many more slack messages or e mails or pings or urgent, urgent, urgent things they could do while also doing their job well. So we may be a little bit ahead of the curve on that. But my hope is that the industry at large embraces that allows the people more time to actually do the work they were hired. >>Darren Murph love the idea. Hope it definitely spreads beyond. Everybody absolutely can use the breathing room and being able to focus because we we know that multitasking really is a myth when it comes down to it. So great to be able to chat with you in person and thank you for all the >>work you're doing. A remote. Thanks for having me here. I appreciate it. All right, check out the cube dot net for all of our coverage, whether you were at an event or watching remote or after it, >>we've got all the content for you to meet him in. Thank you for watching the Cube

Published Date : Jan 15 2020

SUMMARY :

Get lab commit 2020 Brought to you by get lab. I am stupid, man, and this is the Cubes coverage of Get Lab. into your day job, bring us back, is it? So I'm the world's most prolific professional blogger, And you still you don't keep up that pace my job here, get Lab is to make the all remote section of our handbook a lot bigger and better than it. She does her own power and everything like that, but she could just be part of the team and you don't even So we have a lot of people joining the company you do you have any bias towards it makes sense to just build it in my home. The beauty of all remote is we don't say you have to work from home. um, we talked to sit a little bit about, you know, does this remote work for every So you need to have a lot of self awareness in managing your day, and that includes taking breaks. and, um, you proactively look forward to moments like that because you don't get them is a flywheel of making the product better and then making the handbook better. So some of the companies When I'm dealing with dispersed and remote, how do you make How do you is so it can removes the burden of this nonstop string of slag messages where you have to respond to things So great to be able to chat with you in person and all of our coverage, whether you were at an event or watching remote or after it, we've got all the content for you to meet him in.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Darren MurphPERSON

0.99+

New ZealandLOCATION

0.99+

San FranciscoLOCATION

0.99+

2QUANTITY

0.99+

San FranciscoLOCATION

0.99+

65 countriesQUANTITY

0.99+

24QUANTITY

0.99+

10%QUANTITY

0.99+

DarrinPERSON

0.99+

first timeQUANTITY

0.99+

2020DATE

0.99+

GitLabORGANIZATION

0.99+

four straight yearsQUANTITY

0.99+

10 p.m.DATE

0.99+

about 17,000 articlesQUANTITY

0.99+

BritainLOCATION

0.98+

oneQUANTITY

0.98+

hundreds of peopleQUANTITY

0.98+

over 1100 employeesQUANTITY

0.98+

over 3000 pagesQUANTITY

0.98+

bothQUANTITY

0.98+

DarrenPERSON

0.97+

over 65 countriesQUANTITY

0.97+

Cambridge, MassachusettsLOCATION

0.97+

About 10 million wordsQUANTITY

0.97+

EuropeLOCATION

0.96+

todayDATE

0.96+

ZumaPERSON

0.96+

every two hoursQUANTITY

0.94+

about 50 50QUANTITY

0.93+

Guinness Book of WorldTITLE

0.92+

Get Live HandbookTITLE

0.88+

singleQUANTITY

0.88+

first classQUANTITY

0.88+

this morningDATE

0.87+

3 articles a weekQUANTITY

0.87+

GoogleORGANIZATION

0.86+

each new deploymentQUANTITY

0.83+

How We DoTITLE

0.82+

dozens of peopleQUANTITY

0.75+

7QUANTITY

0.75+

second classQUANTITY

0.62+

CubesORGANIZATION

0.58+

onceQUANTITY

0.58+

CubeORGANIZATION

0.57+

labORGANIZATION

0.56+

NormPERSON

0.55+

DocksCOMMERCIAL_ITEM

0.54+

Get LabORGANIZATION

0.53+

cubeORGANIZATION

0.49+

LabORGANIZATION

0.44+

netORGANIZATION

0.41+

CubePERSON

0.29+

Chris Lynch, AtScale | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by, SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts, everybody. You're watching theCUBE, the leader in live tech coverage. I'm Dave Vellante with my co-host, Paul Gillan. Chris Lynch, good friend is here CEO, newly minted CEO and AtScale and legend. Good to see you. >> In my own mind. >> In mine too. >> It's great to be here. >> It's awesome, thank you for taking time. I know how busy you are, you're running around like crazy your next big thing. I was excited to hear that you got back into it. I predicted it a while ago you were a very successful venture capitalists but at heart, you're startup guy, aren't ya? >> Yeah 100%, 100%. I couldn't be more thrilled, I feel invigorated. I think I've told you many times, when you've interviewed me and asked me about the transition from being an entrepreneur to being a VC and since it's a PG show, I've got a different analog than the one I usually give you. I used to be a movie star and now I'm an executive producer of movies. Now am back to being a movie star, hopefully. >> yeah well, so you told me when you first became a VC you said, I look for startups that have a 10X impact either 10X value, 10X cost reduction. What was it that attracted you to AtScale? What's the 10X? >> AtScale, addresses $150 billion market problem which is basically bringing traditional BI to the cloud. >> That's the other thing you told me, big markets. >> Yeah, so that's the first thing massive market opportunity. The second is, the innovation component and where the 10X comes we're uniquely qualified to virtualize data into the pipeline and out. So I like to say that we're the bridge between BI and AI and back. We make every BI user, a citizen data scientist and that's a game changer. And that's sort of the new futuristic component of what we do. So one part is steeped in, that $150 billion BI marketplace in a traditional analytics platforms and then the second piece is into you delivering the data, into these BI excuse me, these AI machine learning platforms. >> Do you see that ultimately getting integrated into some kind of larger, data pipeline framework. I mean, maybe it lives in the cloud or maybe on prem, how do you see that evolving over time? >> So I believe that, with AtScale as one single pane of glass, we basically are providing an API, to the data and to the user, one single API. The reason that today we haven't seen the delivery of the promise of big data is because we don't have big data. Fortunate 2000 companies don't have big data. They have lots of data but to me big data means you can have one logical view of that data and get the best data pumped into these models in these tools, and today that's not the case. They're constricted by location they're constricted by vendor they're constricted by whether it's in the cloud or on prem. We eliminate those restrictions. >> The single API, I think is important actually. Because when you look at some of these guys what they're doing with their data pipeline they might have 10 or 15 unique API's that they're trying to manage. So there's a simplification aspect to, I suppose. >> One of the knocks on traditional BI has always been the need for extract databases and all the ETL that goes that's involved in that. Do you guys avoid that stage? You go to the production data directly or what's the-- >> It's a great question. The way I put it is, we bring Moses to the mountain the mountain being the data, Moses being the user. Traditionally, what people have been trying to do is bring the mountain to Moses, doesn't scale. At AtScale, we provide an abstraction a logical distraction between the data and the BI user. >> You don't touch, you don't move the data. >> We don't move the data. Which is what's unique and that's what's delivering I think, way more than a 10X delivery in value. >> Because you leave the data in place you bring that value to wherever the data is. Which is the original concept of Hadoop, by the way. That was what was profound about Hadoop everybody craps on it now, but that was the game changer and if you could take advantage of that that's how you tap your 10X. >> To the difference is, we're not, to your point we're not moving the data. Hadoop, in my humble opinion why it plateaued is because to get the value, you had to ask the user to bring and put data in yet another platform. And the reason that we're not delivering on big data as an industry, I believe is because we've too many data sources, too many platforms too many consumers of data and too many producers. As we build all these islands of data, with no connectivity. The idea is, we'll create this big data lake and we're going to physically put everything in there. Guess what? Someday turned out to be never. Because people aren't going to deal with the business disruption. We move thousands of users from a platform like Teradata to a platform like Snowflake or Google BigQuery, we don't care. We're a multi-cloud and we're a hybrid cloud. But we do it without any disruption. You're using Excel, you just continue and use it. You just see the results are faster. You use Tableau, same difference. >> So we had all the vertical rock stars in here. So we had Colin in yesterday, we had Stonebraker around earlier. Andy Palmer just came on and Chris here with the CEO who ultimately sold the company to HP. That really didn't do anything with it and then spun it off and now it's back. Aaron was, he had a spring in his step yesterday. So when you think about, Vertica. The technology behind Vertica go back 10 years and where we come now give us a little journey of, your data journey. >> So I think it plays into the, the original assertion is that, vertical is a best-in-class platform for analytics but it was yet another platform. The analog I give now, is now we have Snowflake and six months, 12 months from now we're going to have another one. And that creates a set of problems if you have to live in the physical world. Because you've all these islands of data and I believe, it's about the data not about the models, it's about the data. You can't get optimal results if you don't have an optimal access to the pertinent data. I believe that having that Universal API is going to make the next platform that more valuable. You're not going to be making the trade-off is, okay we have this platform that has some neat capability but the trade-off is from an enterprise architecture perspective we're never going to be able to connect all this stuff. That's how all of these things proliferated. My view is, in a world where you have that single pane of glass, that abstraction layer between the user and the data. Then innovation can be spawned quicker and you can use these tools effectively 'cause you're not compromising being able to get a logical view of the data and get access to it as a user. >> What's your issue with Snowflake you mentioned them, Mugli's company-- >> No issue, they're a great partner of ours. We eliminate the friction between the user going from an on-prem solution to the cloud. >> Slootman just took over there. So you know where that's going. >> Yep (laughing) >> Frank's got the magic touch. Okay good, you say they're a partner yours how are you guys partnering? >> They refer us into customers that, if you want to buy Snowflake now the next issue is, how do i migrate? You don't. You put our virtualization layer in and then we allow you access to Snowflake in a non-disruptive way, versus having to move data into their system or into a particular cloud which creates sales friction. >> Moving data is just, you want to avoid it at all cost. >> I do want to ask you because I met with your predecessors, Dave Mariani last year and I know he was kind of a reluctant CEO he didn't really want to be CEO but wanted to be CTO, which is what he is now. How did that come about, that they found you that you connected with them and decided this was the right opportunity. >> That's a great question. I actually looked at the company at the seed stage when I was in venture, but I had this thing as you know that, I wanted to move companies to Boston and they're about my vintage age-wise and he's married with four kids so that wasn't in the cards. I said look, it doesn't make sense for me to seed this company 'cause I can't give you the time you're out in California everything I'm instrumenting is around Boston. We parted friends. And I was skeptical whether he could build this 'cause people have been talking about building a heterogeneous universal semantic layer, for years and it's never come to fruition. And then he read in Fortune or Forbes that I was leaving Accomplice and that I was looking for one more company to operate. He reached out and he told me what they were doing that hey, we really built it but we need help and I don't want to run this. It's not right for the company and the opportunity So he said, "I'll come and I'll consult to you." I put together a plan and I had my Vertica and data robot. NekTony guys do the technical diligence to make sure that the architecture wasn't wedded to the dupe, like all the other ones were and when I saw it wasn't then I knew the market opportunity was to take that, rifle and point it at that legacy $150 billion BI market not at the billion dollar market of Hadoop. And when we did that, we've been growing at 162% quarter-over-quarter. We've built development centers in Bulgaria. We've moved all operations, non-technical to Boston here down in our South Station. We've been on fire and we are the partner of choice of every cloud manner, because we eliminate the sales friction, for customers being able to take advantage of movement to the cloud and we're able through our intelligent pipeline and capability. We're able to reduce the cost significantly of queries because we understand and we were able to intelligently cash those queries. >> Sales ops is here, all-- >> Sales marketing, customer support, customer success and we're building a machine learning team here at Dev team here. >> Where are you in that sort of Boston build-out? >> We have an office on 711 Atlantic that we opened in the fall. We're actually moving from 4,000 square feet to 10,000 this month. In less than six months and we'll house by the first year, 100 employees in Boston 100 in Bulgaria and about that same hundred in San Mateo. >> Are you going after net new business mainly? Or there's a lot of legacy BI out there are you more displacing those products? >> A couple of things. What we find is that, customers want to evolve into the cloud, they don't want a revolution they want a evolution. So we allow them, because we support hybrid cloud to keep some data behind the firewall and then experiment with moving other data to the cloud platform of choice but we're still providing that one logical view. I would say most of our customers are looking to reap platform, off of Teradata or something onto a, another platform like Snowflake. And then we have a set of customers that see that as part of the solution but not the whole solution. They're more true hybrids but I would say that 80% of our customers are traditional BI customers that are trying to contemporize their environments and be able to take advantage of tabular support and multidimensional, the things that we do in addition to the cube world. >> They can keep whatever they're using. >> Correct, that's the key. >> Did you do the series D, you did, right? >> Yes, Morgan Stanely led. >> So you're not actively but you're good for now, It was like $50 million >> Yeah we raised $50 million. >> You're good for a bit. Who's in the Chris Lynch target? (laughs) Who's the enemy? Vertica, I could say it was the traditional database guys. Who's the? >> We're in a unique position, we're almost Switzerland so we could be friend to foe, of anybody in that ecosystem because we can, non-disruptively re-platform customers between legacy platforms or from legacy platforms to the cloud. We're an interesting position. >> So similar to the file sharing. File virtualization company >> The Copier. >> Copier yeah. >> It puts us in an interesting position. They need to be friends with us and at the same time I'm sure that they're concerned about the capabilities we have but we have a number of retail customers for instance that have asked us to move down from Amazon to Google BigQuery, which we accommodate and because we can do that non-disruptively. The cost and the ability to move is eliminated. It gives customers true freedom of choice. >> How worried are you, that AWS tries to replicate what you guys do. You're in their sights. >> I think there are technical, legal and structural barriers to them doing that. The technical is, this team has been at it for six and a half years. So to do what we do, they'll have to do what we've done. Structurally from a business perspective if they could, I'm not sure they want to. The way to think about Amazon is, they're no different than Teradata, except for they want the same vendor lock-in except they want it to be the Amazon Cloud when Teradata wanted it to be, their data warehouse. >> They don't promote multi-cloud versus-- >> Yeah, they don't want multi-cloud they don't want >> On Prem >> Customers to have a freedom of choice. Would they really enable a heterogeneous abstraction layer, I don't think they would nor do I think any of the big guys would. They all claim to have this capability for their system. It's like the old IBM adage I'm in prison but the food's going to get three squares a day, I get cable TV but I'm in prison. (laughing) >> Awesome, all right, parting thoughts. >> Parting thoughts, oh geez you got to give me a question I'm not that creative. >> What's next, for you guys? What should we be paying attention to? >> I think you're going to see some significant announcements in September regarding the company and relationships that I think will validate the impact we're having in the market. >> Give you some leverage >> Yeah, will give us, better channel leverage. We have a major technical announcement that I think will be significant to the marketplace and what will be highly disruptive to some of the people you just mentioned. In terms of really raising the bar for customers to be able to have the freedom of choice without any sort of vendor lock-in. And I think that that will create some counter strike which we'll be ready for. (laughing) >> If you've never heard of AtScale before trust me you're going to in the next 18 months. Chris Lynch, thanks so much for coming on theCUBE. >> It's my pleasure. >> Great to see you. All right, keep it right there everybody we're back with our next guest, right after this short break you're watching theCUBE from MIT, right back. (upbeat music)

Published Date : Aug 2 2019

SUMMARY :

Brought to you by, SiliconANGLE Media. Good to see you. that you got back into it. and asked me about the transition What was it that attracted you to AtScale? traditional BI to the cloud. That's the other thing and then the second piece is into you I mean, maybe it lives in the cloud and get the best data Because when you look and all the ETL that goes is bring the mountain don't move the data. We don't move the data. and if you could take advantage of that is because to get the value, So when you think about, Vertica. and I believe, it's about the data We eliminate the friction between the user So you know where that's going. Frank's got the magic touch. and then we allow you access to Snowflake you want to avoid it that they found you and it's never come to fruition. and we're building a by the first year, 100 employees in Boston the things that we do Who's in the Chris Lynch target? to the cloud. So similar to the file sharing. about the capabilities we have tries to replicate what you guys do. So to do what we do, they'll I'm in prison but the food's you got to give me a question in September regarding the to some of the people you just mentioned. in the next 18 months. Great to see you.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillanPERSON

0.99+

Dave VellantePERSON

0.99+

Chris LynchPERSON

0.99+

AmazonORGANIZATION

0.99+

BulgariaLOCATION

0.99+

SeptemberDATE

0.99+

ChrisPERSON

0.99+

AWSORGANIZATION

0.99+

10QUANTITY

0.99+

HPORGANIZATION

0.99+

Andy PalmerPERSON

0.99+

Dave MarianiPERSON

0.99+

CaliforniaLOCATION

0.99+

AaronPERSON

0.99+

BostonLOCATION

0.99+

San MateoLOCATION

0.99+

$150 billionQUANTITY

0.99+

$50 millionQUANTITY

0.99+

$150 billionQUANTITY

0.99+

MosesPERSON

0.99+

80%QUANTITY

0.99+

4,000 square feetQUANTITY

0.99+

last yearDATE

0.99+

second pieceQUANTITY

0.99+

162%QUANTITY

0.99+

South StationLOCATION

0.99+

AtScaleORGANIZATION

0.99+

Morgan StanelyPERSON

0.99+

100%QUANTITY

0.99+

four kidsQUANTITY

0.99+

ExcelTITLE

0.99+

six and a half yearsQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

IBMORGANIZATION

0.99+

TableauTITLE

0.99+

yesterdayDATE

0.99+

firstQUANTITY

0.99+

secondQUANTITY

0.99+

TeradataORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

less than six monthsQUANTITY

0.99+

SnowflakeORGANIZATION

0.99+

FrankPERSON

0.99+

todayDATE

0.98+

this monthDATE

0.98+

SwitzerlandLOCATION

0.98+

HadoopTITLE

0.98+

10XQUANTITY

0.98+

100 employeesQUANTITY

0.98+

one partQUANTITY

0.98+

SlootmanPERSON

0.98+

10,000QUANTITY

0.97+

VerticaORGANIZATION

0.97+

MugliORGANIZATION

0.97+

GoogleORGANIZATION

0.97+

15 unique APIQUANTITY

0.96+

hundredQUANTITY

0.96+

six monthsQUANTITY

0.96+

three squares a dayQUANTITY

0.96+

thousands of usersQUANTITY

0.96+

NekTonyORGANIZATION

0.96+

FortuneTITLE

0.96+

12 monthsQUANTITY

0.95+

single APIQUANTITY

0.95+

711 AtlanticLOCATION

0.95+

2000 companiesQUANTITY

0.94+

OneQUANTITY

0.94+

next 18 monthsDATE

0.94+

ColinPERSON

0.93+

one more companyQUANTITY

0.92+

one single APIQUANTITY

0.92+

single paneQUANTITY

0.91+

Susan Wilson, Informatica & Blake Andrews, New York Life | MIT CDOIQ 2019


 

(techno music) >> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts everybody, we're here with theCUBE at the MIT Chief Data Officer Information Quality Conference. I'm Dave Vellante with my co-host Paul Gillin. Susan Wilson is here, she's the vice president of data governance and she's the leader at Informatica. Blake Anders is the corporate vice president of data governance at New York Life. Folks, welcome to theCUBE, thanks for coming on. >> Thank you. >> Thank you. >> So, Susan, interesting title; VP, data governance leader, Informatica. So, what are you leading at Informatica? >> We're helping our customers realize their business outcomes and objectives. Prior to joining Informatica about 7 years ago, I was actually a customer myself, and so often times I'm working with our customers to understand where they are, where they going, and how to best help them; because we recognize data governance is more than just a tool, it's a capability that represents people, the processes, the culture, as well as the technology. >> Yeah so you've walked the walk, and you can empathize with what your customers are going through. And Blake, your role, as the corporate VP, but more specifically the data governance lead. >> Right, so I lead the data governance capabilities and execution group at New York Life. We're focused on providing skills and tools that enable government's activities across the enterprise at the company. >> How long has that function been in place? >> We've been in place for about two and half years now. >> So, I don't know if you guys heard Mark Ramsey this morning, the key-note, but basically he said, okay, we started with enterprise data warehouse, we went to master data management, then we kind of did this top-down enterprise data model; that all failed. So we said, all right, let's pump the governance. Here you go guys, you fix our corporate data problem. Now, right tool for the right job but, and so, we were sort of joking, did data governance fail? No, you always have to have data governance. It's like brushing your teeth. But so, like I said, I don't know if you heard that, but what are your thoughts on that sort of evolution that he described? As sort of, failures of things like EDW to live up to expectations and then, okay guys over to you. Is that a common theme? >> It is a common theme, and what we're finding with many of our customers is that they had tried many of the, if you will, the methodologies around data governance, right? Around policies and structures. And we describe this as the Data 1.0 journey, which was more application-centric reporting to Data 2.0 to data warehousing. And a lot of the failed attempts, if you will, at centralizing, if you will, all of your data, to now Data 3.0, where we look at the explosion of data, the volumes of data, the number of data consumers, the expectations of the chief data officer to solve business outcomes; crushing under the scale of, I can't fit all of this into a centralized data at repository, I need something that will help me scale and to become more agile. And so, that message does resonate with us, but we're not saying data warehouses don't exist. They absolutely do for trusted data sources, but the ability to be agile and to address many of your organizations needs and to be able to service multiple consumers is top-of-mind for many of our customers. >> And the mind set from 1.0 to 2.0 to 3.0 has changed. From, you know, data as a liability, to now data as this massive asset. It's sort of-- >> Value, yeah. >> Yeah, and the pendulum is swung. It's almost like a see-saw. Where, and I'm not sure it's ever going to flip back, but it is to a certain extent; people are starting to realize, wow, we have to be careful about what we do with our data. But still, it's go, go, go. But, what's the experience at New York Life? I mean, you know. A company that's been around for a long time, conservative, wants to make sure risk averse, obviously. >> Right. >> But at the same time, you want to keep moving as the market moves. >> Right, and we look at data governance as really an enabler and a value-add activity. We're not a governance practice for the sake of governance. We're not there to create a lot of policies and restrictions. We're there to add value and to enable innovation in our business and really drive that execution, that efficiency. >> So how do you do that? Square that circle for me, because a lot of people think, when people think security and governance and compliance they think, oh, that stifles innovation. How do you make governance an engine of innovation? >> You provide transparency around your data. So, it's transparency around, what does the data mean? What data assets do we have? Where can I find that? Where are my most trusted sources of data? What does the quality of that data look like? So all those things together really enable your data consumers to take that information and create new value for the company. So it's really about enabling your value creators throughout the organization. >> So data is an ingredient. I can tell you where it is, I can give you some kind of rating as to the quality of that data and it's usefulness. And then you can take it and do what you need to do with it in your specific line of business. >> That's right. >> Now you said you've been at this two and half years, so what stages have you gone through since you first began the data governance initiative. >> Sure, so our first year, year and half was really focused on building the foundations, establishing the playbook for data governance and building our processes and understanding how data governance needed to be implemented to fit New York Life in the culture of the company. The last twelve months or so has really been focused on operationalizing governance. So we've got the foundations in place, now it's about implementing tools to further augment those capabilities and help assist our data stewards and give them a better skill set and a better tool set to do their jobs. >> Are you, sort of, crowdsourcing the process? I mean, you have a defined set of people who are responsible for governance, or is everyone taking a role? >> So, it is a two-pronged approach, we do have dedicated data stewards. There's approximately 15 across various lines of business throughout the company. But, we are building towards a data democratization aspect. So, we want people to be self-sufficient in finding the data that they need and understanding the data. And then, when they have questions, relying on our stewards as a network of subject matter experts who also have some authorizations to make changes and adapt the data as needed. >> Susan, one of the challenges that we see is that the chief data officers often times are not involved in some of these skunkworks AI projects. They're sort of either hidden, maybe not even hidden, but they're in the line of business, they're moving. You know, there's a mentality of move fast and break things. The challenge with AI is, if you start operationalizing AI and you're breaking things without data quality, without data governance, you can really affect lives. We've seen it. In one of these unintended consequences. I mean, Facebook is the obvious example and there are many, many others. But, are you seeing that? How are you seeing organizations dealing with that problem? >> As Blake was mentioning often times what it is about, you've got to start with transparency, and you got to start with collaborating across your lines of businesses, including the data scientists, and including in terms of what they are doing. And actually provide that level of transparency, provide a level of collaboration. And a lot of that is through the use of our technology enablers to basically go out and find where the data is and what people are using and to be able to provide a mechanism for them to collaborate in terms of, hey, how do I get access to that? I didn't realize you were the SME for that particular component. And then also, did you realize that there is a policy associated to the data that you're managing and it can't be shared externally or with certain consumer data sets. So, the objective really is around how to create a platform to ensure that any one in your organization, whether I'm in the line of business, that I don't have a technical background, or someone who does have a technical background, they can come and access and understand that information and connect with their peers. >> So you're helping them to discover the data. What do you do at that stage? >> What we do at that stage is, creating insights for anyone in the organization to understand it from an impact analysis perspective. So, for example, if I'm going to make changes, to as well as discovery. Where exactly is my information? And so we have-- >> Right. How do you help your customers discover that data? >> Through machine learning and artificial intelligence capabilities of our, specifically, our data catalog, that allows us to do that. So we use such things like similarity based matching which help us to identify. It doesn't have to be named, in miscellaneous text one, it could be named in that particular column name. But, in our ability to scan and discover we can identify in that column what is potentially social security number. It might have resided over years of having this data, but you may not realize that it's still stored there. Our ability to identify that and report that out to the data stewards as well as the data analysts, as well as to the privacy individuals is critical. So, with that being said, then they can actually identify the appropriate policies that need to be adhered to, alongside with it in terms of quality, in terms of, is there something that we need to archive. So that's where we're helping our customers in that aspect. >> So you can infer from the data, the meta data, and then, with a fair degree of accuracy, categorize it and automate that. >> Exactly. We've got a customer that actually ran this and they said that, you know, we took three people, three months to actually physically tag where all this information existed across something like 7,000 critical data elements. And, basically, after the set up and the scanning procedures, within seconds we were able to get within 90% precision. Because, again, we've dealt a lot with meta data. It's core to our artificial intelligence and machine learning. And it's core to how we built out our platforms to share that meta data, to do something with that meta data. It's not just about sharing the glossary and the definition information. We also want to automate and reduce the manual burden. Because we recognize with that scale, manual documentation, manual cataloging and tagging just, >> It doesn't work. >> It doesn't work. It doesn't scale. >> Humans are bad at it. >> They're horrible at it. >> So I presume you have a chief data officer at New York Life, is that correct? >> We have a chief data and analytics officer, yes. >> Okay, and you work within that group? >> Yes, that is correct. >> Do you report it to that? >> Yes, so-- >> And that individual, yeah, describe the organization. >> So that sits in our lines of business. Originally, our data governance office sat in technology. And then, our early 2018 we actually re-orged into the business under the chief data and analytics officer when that role was formed. So we sit under that group along with a data solutions and governance team that includes several of our data stewards and also some others, some data engineer-type roles. And then, our center for data science and analytics as well that contains a lot of our data science teams in that type of work. >> So in thinking about some of these, I was describing to Susan, as these skunkworks projects, is the data team, the chief data officer's team involved in those projects or is it sort of a, go run water through the pipes, get an MVP and then you guys come in. How does that all work? >> We're working to try to centralize that function as much as we can, because we do believe there's value in the left hand knowing what the right hand is doing in those types of things. So we're trying to build those communications channels and build that network of data consumers across the organization. >> It's hard right? >> It is. >> Because the line of business wants to move fast, and you're saying, hey, we can help. And they think you're going to slow them down, but in fact, you got to make the case and show the success because you're actually not going to slow them down to terms of the ultimate outcome. I think that's the case that you're trying to make, right? >> And that's one of the things that we try to really focus on and I think that's one of the advantages to us being embedded in the business under the CDAO role, is that we can then say our objectives are your objectives. We are here to add value and to align with what you're working on. We're not trying to slow you down or hinder you, we're really trying to bring more to the table and augment what you're already trying to achieve. >> Sometimes getting that organization right means everything, as we've seen. >> Absolutely. >> That's right. >> How are you applying governance discipline to unstructured data? >> That's actually something that's a little bit further down our road map, but one of the things that we have started doing is looking at our taxonomy's for structured data and aligning those with the taxonomy's that we're using to classify unstructured data. So, that's something we're in the early stages with, so that when we get to that process of looking at more of our unstructured content, we can, we already have a good feel for there's alignment between the way that we think about and organize those concepts. >> Have you identified automation tools that can help to bring structure to that unstructured data? >> Yes, we have. And there are several tools out there that we're continuing to investigate and look at. But, that's one of the key things that we're trying to achieve through this process is bringing structure to unstructured content. >> So, the conference. First year at the conference. >> Yes. >> Kind of key take aways, things that interesting to you, learnings? >> Oh, yes, well the number of CDO's that are here and what's top of mind for them. I mean, it ranges from, how do I stand up my operating model? We just had a session just about 30 minutes ago. A lot of questions around, how do I set up my organization structure? How do I stand up my operating model so that I could be flexible? To, right, the data scientists, to the folks that are more traditional in structured and trusted data. So, still these things are top-of-mind and because they're recognizing the market is also changing too. And the growing amount of expectations, not only solving business outcomes, but also regulatory compliance, privacy is also top-of-mind for a lot of customers. In terms of, how would I get started? And what's the appropriate structure and mechanism for doing so? So we're getting a lot of those types of questions as well. So, the good thing is many of us have had years of experience in this phase and the convergence of us being able to support our customers, not only in our principles around how we implement the framework, but also the technology is really coming together very nicely. >> Anything you'd add, Blake? >> I think it's really impressive to see the level of engagement with thought leaders and decision makers in the data space. You know, as Susan mentioned, we just got out of our session and really, by the end of it, it turned into more of an open discussion. There was just this kind of back and forth between the participants. And so it's really engaging to see that level of passion from such a distinguished group of individuals who are all kind of here to share thoughts and ideas. >> Well anytime you come to a conference, it's sort of any open forum like this, you learn a lot. When you're at MIT, it's like super-charged. With the big brains. >> Exactly, you feel it when you come on the campus. >> You feel smarter when you walk out of here. >> Exactly, I know. >> Well, guys, thanks so much for coming to theCUBE. It was great to have you. >> Thank you for having us. We appreciate it, thank you. >> You're welcome. All right, keep it right there everybody. Paul and I will be back with our next guest. You're watching theCUBE from MIT in Cambridge. We'll be right back. (techno music)

Published Date : Aug 2 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Susan Wilson is here, she's the vice president So, what are you leading at Informatica? and how to best help them; but more specifically the data governance lead. Right, so I lead the data governance capabilities and then, okay guys over to you. And a lot of the failed attempts, if you will, And the mind set from 1.0 to 2.0 to 3.0 has changed. Where, and I'm not sure it's ever going to flip back, But at the same time, Right, and we look at data governance So how do you do that? What does the quality of that data look like? and do what you need to do with it so what stages have you gone through in the culture of the company. in finding the data that they need is that the chief data officers often times and to be able to provide a mechanism What do you do at that stage? So, for example, if I'm going to make changes, How do you help your customers discover that data? and report that out to the data stewards and then, with a fair degree of accuracy, categorize it And it's core to how we built out our platforms It doesn't work. And that individual, And then, our early 2018 we actually re-orged is the data team, the chief data officer's team and build that network of data consumers but in fact, you got to make the case and show the success and to align with what you're working on. Sometimes getting that organization right but one of the things that we have started doing is bringing structure to unstructured content. So, the conference. And the growing amount of expectations, and decision makers in the data space. it's sort of any open forum like this, you learn a lot. when you come on the campus. Well, guys, thanks so much for coming to theCUBE. Thank you for having us. Paul and I will be back with our next guest.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillinPERSON

0.99+

SusanPERSON

0.99+

Dave VellantePERSON

0.99+

PaulPERSON

0.99+

Susan WilsonPERSON

0.99+

BlakePERSON

0.99+

InformaticaORGANIZATION

0.99+

CambridgeLOCATION

0.99+

Mark RamseyPERSON

0.99+

Blake AndersPERSON

0.99+

three monthsQUANTITY

0.99+

three peopleQUANTITY

0.99+

FacebookORGANIZATION

0.99+

New York LifeORGANIZATION

0.99+

early 2018DATE

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

First yearQUANTITY

0.99+

oneQUANTITY

0.99+

90%QUANTITY

0.99+

two and half yearsQUANTITY

0.98+

firstQUANTITY

0.98+

approximately 15QUANTITY

0.98+

7,000 critical data elementsQUANTITY

0.97+

about two and half yearsQUANTITY

0.97+

first yearQUANTITY

0.96+

twoQUANTITY

0.96+

about 30 minutes agoDATE

0.96+

theCUBEORGANIZATION

0.95+

Blake AndrewsPERSON

0.95+

MIT Chief Data Officer andEVENT

0.93+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.91+

EDWORGANIZATION

0.86+

last twelve monthsDATE

0.86+

skunkworksORGANIZATION

0.85+

CDAOORGANIZATION

0.85+

this morningDATE

0.83+

MITORGANIZATION

0.83+

7 years agoDATE

0.78+

yearQUANTITY

0.75+

Information Quality Symposium 2019EVENT

0.74+

3.0OTHER

0.66+

York LifeORGANIZATION

0.66+

2.0OTHER

0.59+

MIT CDOIQ 2019EVENT

0.58+

halfQUANTITY

0.52+

Data 2.0OTHER

0.52+

Data 3.0TITLE

0.45+

1.0OTHER

0.43+

DataOTHER

0.21+

Show Wrap | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's three Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back. We're here to wrap up the M I T. Chief data officer officer, information quality. It's hashtag m i t CDO conference. You're watching the Cube. I'm David Dante, and Paul Gill is my co host. This is two days of coverage. We're wrapping up eyes. Our analysis of what's going on here, Paul, Let me let me kick it off. When we first started here, we talked about that are open. It was way saw the chief data officer role emerged from the back office, the information quality role. When in 2013 the CEO's that we talked to when we asked them what was their scope. We heard things like, Oh, it's very wide. Involves analytics, data science. Some CEOs even said Oh, yes, security is actually part of our purview because all the cyber data so very, very wide scope. Even in some cases, some of the digital initiatives were sort of being claimed. The studios were staking their claim. The reality was the CDO also emerged out of highly regulated industries financialservices healthcare government. And it really was this kind of wonky back office role. And so that's what my compliance, that's what it's become again. We're seeing that CEOs largely you're not involved in a lot of the emerging. Aye, aye initiatives. That's what we heard, sort of anecdotally talking to various folks At the same time. I feel as though the CDO role has been more fossilized than it was before. We used to ask, Is this role going to be around anymore? We had C I. Ose tell us that the CEO Rose was going to disappear, so you had both ends of the spectrum. But I feel as though that whatever it's called CDO Data's our chief analytics off officer, head of data, you know, analytics and governance. That role is here to stay, at least for for a fair amount of time and increasingly, issues of privacy and governance. And at least the periphery of security are gonna be supported by that CD a role. So that's kind of takeaway Number one. Let me get your thoughts. >> I think there's a maturity process going on here. What we saw really in 2016 through 2018 was, ah, sort of a celebration of the arrival of the CDO. And we're here, you know, we've got we've got power now we've got an agenda. And that was I mean, that was a natural outcome of all this growth and 90% of organizations putting sea Dios in place. I think what you're seeing now is a realization that Oh, my God, this is a mess. You know what I heard? This year was a lot less of this sort of crowing about the ascendance of sea Dios and Maura about We've got a big integration problem of big data cleansing problem, and we've got to get our hands down to the nitty gritty. And when you talk about, as you said, we had in here so much this year about strategic initiatives, about about artificial intelligence, about getting involved in digital business or customer experience transformation. What we heard this year was about cleaning up data, finding the data that you've got organizing it, applying meditator, too. It is getting in shape to do something with it. There's nothing wrong with that. I just think it's part of the natural maturation process. Organizations now have to go through Tiu to the dirty process of cleaning up this data before they can get to the next stage, which was a couple of three years out for most of >> the second. Big theme, of course. We heard this from the former head of analytics. That G s K on the opening keynote is the traditional methods have failed the the Enterprise Data Warehouse, and we've actually studied this a lot. You know, my analogy is often you snake swallowing a basketball, having to build cubes. E D W practitioners would always used to call it chasing the chips until we come up with a new chip. Oh, we need that because we gotta run faster because it's taking us hours and hours, weeks days to run these analytics. So that really was not an agile. It was a rear view mirror looking thing. And Sarbanes Oxley saved the E. D. W. Business because reporting became part of compliance thing perspective. The master data management piece we've heard. Do you consistently? We heard Mike Stone Breaker, who's obviously a technology visionary, was right on. It doesn't scale through this notion of duping. Everything just doesn't work and manually creating rules. It's just it's just not the right approach. This we also heard the top down data data enterprise data model doesn't works too complicated, can operationalize it. So what they do, they kick the can to governance. The Duke was kind of a sidecar, their big data that failed to live up to its promises. And so it's It's a big question as to whether or not a I will bring that level of automation we heard from KPMG. Certainly, Mike Stone breaker again said way heard this, uh, a cz well, from Andy Palmer. They're using technology toe automate and scale that big number one data science problem, which is? They spend all their time wrangling data. We'll see if that if that actually lives up >> to his probable is something we did here today from several of our guests. Was about the promise of machine learning to automate this day to clean up process and as ah Mark Ramsay kick off the conference saying that all of these efforts to standardize data have failed in the past. This does look, He then showed how how G s K had used some of the tools that were represented here using machine learning to actually clean up the data at G S. K. So there is. And I heard today a lot of optimism from the people we talked to about the capability of Chris, for example, talking about the capability of machine learning to bring some order to solve this scale scale problem Because really organizing data creating enterprise data models is a scale problem, and the only way you can solve that it's with with automation, Mike Stone breaker is right on top of that. So there was optimism at this event. There was kind of an ooh, kind of, ah, a dismay at seeing all the data problems they have to clean up, but also promised that tools are on the way that could do that. >> Yeah, The reason I'm an optimist about this role is because data such a hard problem. And while there is a feeling of wow, this is really a challenge. There's a lot of smart people here who are up for the challenge and have the d n a for it. So the role, that whole 360 thing. We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, which is really bringing machine intelligence to the table. We haven't heard that as much at this event. It's now front and center. It's just another example of a I injecting itself into virtually every aspect every corner of the industry. And again, I often jokes. Same wine, new bottle. Our industry has a habit of doing that, but it's cyclical, but it is. But we seem to be making consistent progress. >> And the machine learning, I thought was interesting. Several very guest spoke to machine learning being applied to the plumbing projects right now to cleaning up data. Those are really self contained projects. You can manage those you can. You can determine out test outcomes. You can vet the quality of the of the algorithms. It's not like you're putting machine learning out there in front of the customer where it could potentially do some real damage. There. They're vetting their burning in machine, learning in a environment that they control. >> Right, So So, Amy, Two solid days here. I think that this this conference has really grown when we first started here is about 130 people, I think. And now it was 500 registrants. This'd year. I think 600 is the sort of the goal for next year. Moving venues. The Cube has been covering this all but one year since 2013. Hope to continue to do that. Paul was great working with you. Um, always great work. I hope we can, uh we could do more together. We heard the verdict is bringing back its conference. You put that together. So we had column. Mahoney, um, had the vertical rock stars on which was fun. Com Mahoney, Mike Stone breaker uh, Andy Palmer and Chris Lynch all kind of weighed in, which was great to get their perspectives kind of the days of MPP and how that's evolved improving on traditional relational database. And and now you're Stone breaker. Applying all these m i. Same thing with that scale with Chris Lynch. So it's fun to tow. Watch those guys all Boston based East Coast folks some news. We just saw the news hit President Trump holding up jet icon contractors is we've talked about. We've been following that story very closely and I've got some concerns over that. It's I think it's largely because he doesn't like Bezos in The Washington Post Post. Exactly. You know, here's this you know, America first. The Pentagon says they need this to be competitive with China >> and a I. >> There's maybe some you know, where there's smoke. There's fire there, so >> it's more important to stick in >> the eye. That's what it seems like. So we're watching that story very closely. I think it's I think it's a bad move for the executive branch to be involved in those type of decisions. But you know what I know? Well, anyway, Paul awesome working with you guys. Thanks. And to appreciate you flying out, Sal. Good job, Alex Mike. Great. Already wrapping up. So thank you for watching. Go to silicon angle dot com for all the news. Youtube dot com slash silicon angles where we house our playlist. But the cube dot net is the main site where we have all the events. It will show you what's coming up next. We've got a bunch of stuff going on straight through the summer. And then, of course, VM World is the big kickoff for the fall season. Goto wicked bond dot com for all the research. We're out. Thanks for watching Dave. A lot day for Paul Gillon will see you next time.

Published Date : Aug 1 2019

SUMMARY :

Brought to you by in 2013 the CEO's that we talked to when we asked them what was their scope. And that was I mean, And Sarbanes Oxley saved the E. data models is a scale problem, and the only way you can solve that it's with with automation, We talked about the traditional methods, you know, kind of failing, and in the third piece that touched on, And the machine learning, I thought was interesting. We just saw the news hit President Trump holding up jet icon contractors There's maybe some you know, where there's smoke. And to appreciate you flying out, Sal.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Andy PalmerPERSON

0.99+

David DantePERSON

0.99+

Chris LynchPERSON

0.99+

ChrisPERSON

0.99+

2013DATE

0.99+

PaulPERSON

0.99+

Paul GillPERSON

0.99+

Mike StonePERSON

0.99+

2016DATE

0.99+

Paul GillonPERSON

0.99+

Mike Stone BreakerPERSON

0.99+

Silicon Angle MediaORGANIZATION

0.99+

2018DATE

0.99+

RosePERSON

0.99+

Alex MikePERSON

0.99+

BezosPERSON

0.99+

G s KORGANIZATION

0.99+

MahoneyPERSON

0.99+

BostonLOCATION

0.99+

KPMGORGANIZATION

0.99+

90%QUANTITY

0.99+

SalPERSON

0.99+

third pieceQUANTITY

0.99+

DavePERSON

0.99+

500 registrantsQUANTITY

0.99+

two daysQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

todayDATE

0.99+

next yearDATE

0.99+

Mark RamsayPERSON

0.99+

360QUANTITY

0.99+

this yearDATE

0.99+

MauraPERSON

0.99+

G S. K.ORGANIZATION

0.98+

YoutubeORGANIZATION

0.98+

AmyPERSON

0.98+

PentagonORGANIZATION

0.98+

C I. OsePERSON

0.98+

Sarbanes OxleyPERSON

0.97+

firstQUANTITY

0.97+

This yearDATE

0.96+

one yearQUANTITY

0.96+

Mike Stone breakerPERSON

0.95+

Enterprise Data WarehouseORGANIZATION

0.95+

DiosPERSON

0.94+

Two solid daysQUANTITY

0.94+

secondQUANTITY

0.94+

three yearsQUANTITY

0.92+

about 130 peopleQUANTITY

0.91+

600QUANTITY

0.9+

DukeORGANIZATION

0.89+

VM WorldEVENT

0.88+

dot comORGANIZATION

0.85+

ChinaORGANIZATION

0.84+

E. D. W.ORGANIZATION

0.83+

CubeORGANIZATION

0.8+

MITORGANIZATION

0.77+

East CoastLOCATION

0.75+

M I T.PERSON

0.75+

2019DATE

0.74+

President TrumpPERSON

0.71+

both endsQUANTITY

0.71+

threeQUANTITY

0.68+

M I T.EVENT

0.64+

cube dot netORGANIZATION

0.59+

ChiefPERSON

0.58+

The Washington Post PostTITLE

0.57+

AmericaORGANIZATION

0.56+

Goto wickedORGANIZATION

0.54+

CEOPERSON

0.54+

coupleQUANTITY

0.54+

CDOORGANIZATION

0.45+

StonePERSON

0.43+

CDOIQTITLE

0.24+

Bob Parr & Sreekar Krishna, KPMG US | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to Cambridge, Massachusetts. Everybody watching the Cuban leader live tech coverage. We here covering the M I t CDO conference M I t CEO Day to wrapping up. Bob Parr is here. He's a partner in principle at KPMG, and he's joined by Streetcar Krishna, who is the managing director of data science. Aye, aye. And innovation at KPMG. Gents, welcome to the Cube. Thank >> thank you. Let's start with your >> roles. So, Bob, where do you focus >> my focus? Ah, within KPMG, we've got three main business lines audit tax, an advisory. And so I'm the advisory chief date officer. So I'm more focused on how we use data competitively in the market. More the offense side of our focus. So, you know, how do we make sure that our teams have the data they need to deliver value? Uh, much as possible working concert with the enterprise? CDO uh, who's more focused on our infrastructure, Our standards, security, privacy and those >> you've focused on making KPMG better A >> supposed exactly clients. OK, >> I also have a second hat, and I also serve financial service is si Dios as well. So Okay, so >> get her out of a dual role. I got sales guys in >> streetcar. What was your role? >> Yeah, You know, I focus a lot on data science, artificial intelligence and overall innovation s o my reaction. I actually represent a centre of >> excellence within KPMG that focuses on the I machine learning natural language processing. And I work with Bob's Division to actually advance the data site off the store because all the eye needs data. And without data, there's no algorithms, So we're focusing a lot on How do we use a I to make data Better think about their equality. Think about data lineage. Think about all of the problems that data has. How can we make it better using algorithms? And I focused a lot on that working with Bob, But no, it's it's customers and internal. I mean, you know, I were a horizontal within the form, So we help customers. We help internal, we focus a lot on the market. >> So, Bob, you mentioned used data offensively. So 10 12 years ago, it was data was a liability. You had to get rid of it. Keep it no longer than you had to, because you're gonna get soon. So email archives came in and obviously thinks flipped after the big data. But so what do you What are you seeing in terms of that shift from From the defense data to the offensive? >> Yeah, and it's it's really you know, when you think about it and let me define sort of offense versus defense. Who on the defense side, historically, that's where most of CEOs have played. That's risk regulatory reporting, privacy, um, even litigation support those types of activities today. Uh, and really, until about a year and 1/2 ago, we really saw most CEOs still really anchored in that I run a forum with a number of studios and financial service is, and every year we get them together and asked him the same set of questions. This was the first year where they said that you know what my primary focus now is. Growth. It's bringing efficiency is trying to generate value on the offensive side. It's not like the regulatory work's going away, certainly in the face of some of the pending privacy regulation. But you know, it's It's a sign that the volume of use cases as the investments in their digital transformations are starting to kick out, as well as the volumes of data that are available. The raw material that's available to them in terms of third party data in terms of the the just the general volumes that that exist that are streaming into the organization and the overall literacy in the business units are creating this, this massive demand. And so they're having to >> respond because of getting a handle on the data they're actually finding. Word is, they're categorizing it there, there, >> yeah, organizing that. That is still still a challenge. Um, I think it's better with when you have a very narrow scope of critical data elements going back to the structure data that we're talking it with the regulatory reporting when you start to get into the three offense, the generating value, getting the customer experience, you know, really exploring. You know that side of it. There's there's a ton of new muscle that has to be built new muscle in terms of data quality, new muscle in terms of um, really more scalable operating model. I think that's a big issue right now with Si Dios is, you know, we've got ah, we're used to that limited swath of CDs and they've got Stewardship Network. That's very labor intensive. A lot of manual processes still, um, and and they have some good basic technology, but it's a lot of its rules based. And when you do you think about those how that constraints going to scale when you have all of this demand. You know, when you look at the customer experience analytics that they want to do when you look at, you know, just a I applied to things like operations. The demand on the focus there is is is gonna start to create a fundamental shift >> this week are one of things that I >> have scene, and maybe it's just my small observation space. But I wonder, if you could comment Is that seems like many CBO's air not directly involved in the aye aye initiatives. Clearly, the chief digital officer is involved, but the CDO zehr kind of, you know, in the background still, you see that? >> That's a fantastic question, and I think this is where we're seeing some off the cutting it change that is happening in the industry. And when Barbara presenter idea that we can often civilly look at data, this is what it is that studios for a long time have become more reactive in their roles. And that is that is starting to come forefront now. So a lot of institutions were working with are asking What's the next generation Roll off a CDO and why are they in the background and why are they not in the foreground? And this is when you become more often they were proactive with data and the digital officers are obviously focused on, you know, the transformation that has to happen. But the studios are their backbone in order to make the transformation. Really. And if the CDO started, think about their data as an asset did as a product did us a service. The judicial officers are right there because those are the real, you know, like the data data they're living so CDO can really become from my back office to really become a business line. We've >> seen taking the reins in machine learning in machine learning projects and cos you work with. Who >> was driving that? Yeah. Great question. So we are seeing, like, you know, different. I would put them in buckets, right? There is no one mortal fits all. We're seeing different generations within the company's. Some off. The ones were just testing out the market. There's two keeping it in their technology space in their back office. Take idea and, you know, in in forward I d let me call them where they are starting to experiment with this. But you see, the mature organizations on the other end of the spectrum, they are integrating action, learning and a I right into the business line because they want to see ex souls having the technology right by their side so they can lead leverage. Aye, aye. And machine learning spot right for the business right there. And that is where we're seeing know some of the new models. Come on. >> I think the big shift from a CDO perspective is using a i to prep data for a That's that's fundamentally where you know, where the data science was distributed. Some of that data science has to come back and free the integration for equality for data prepping because you've got all this data third party and other from customer streaming into the organization. And you know, the work that you're doing around, um, anomaly detection is it transcends developing the rules, doing the profiling, doing the rules. You know, the very manual, the very labor intensive process you've got to get away from that >> is used in order for this to be scale goes and a I to figure out which out goes to apply t >> clean to prepare the data toe, see what algorithms we can use. So it's basically what we're calling a eye for data rather than just data leading into a I. So it's I mean, you know, you developed a technology for one off our clients and pretty large financial service. They were getting closer, like 1,000,000,000 data points every day. And there was no way manually, you could go through the same quality controls and all of those processes. So we automated it through algorithms, and these algorithms are learning the behavior of data as they flow into the organization, and they're able to proactively tell their problems are starting very much. And this is the new face that we see in in the industry, you cannot scale the traditional data governance using manual processes, we have to go to the next generation where a i natural language processing and think about on structure data, right? I mean, that is, like 90% off. The organization is unstructured data, and we have not talked about data quality. We have not talked about data governance. For a lot of these sources of information, now is the time. Hey, I can do it. >> And I think that raised a great question. If you look at unstructured and a lot of the data sources, as you start to take more of an offensive stance will be unstructured. And the data quality, what it means to apply data quality isn't the the profiling and the rules generation the way you would with standard data. So the teams, the skills that CEOs have in their organizations, have to change. You have to start to, and, you know, it's a great example where, you know, you guys were ingesting documents and there was handwriting all over the documents, you know, and >> yeah, you know, you're a great example, Bob. Like you no way would ask the client, like, you know, is this document gonna scanned into the system so my algorithm can run and they're like, Yeah, everything is good. I mean, the deal is there, but when you then start scanning it, you realize there's handwriting and the information is in the handwriting. So all the algorithms breakdown now >> tribal knowledge striving Exactly. >> Exactly. So that's what we're seeing. You know, if I if we talk about the digital transformation in data in the city organization, it is this idea dart. Nothing is left unseen. Some algorithm or some technology, has seen everything that is coming into. The organization has has has a para 500. So you can tell you where the problems are. And this is what algorithms do. This scale beautifully. >> So the data quality approaches are evolving, sort of changing. So rather than heavy, heavy emphasis on masking or duplication and things like that, you would traditionally think of participating the difficult not that that goes away. But it's got to evolve to use machine >> intelligence. Exactly what kind of >> skill sets people need thio achieve that Is it Is it the same people or do we need to retrain them or bring in new skills. >> Yeah, great question. And I can talk from the inspector off. Where is disrupting every industry now that we know, right? But we knew when you look at what skills are >> required, all of the eye, including natural language processing, machine learning, still require human in the loop. And >> that is the training that goes in there. And who do you who are the >> people who have that knowledge? It is the business analyst. It's the data analyst who are the knowledge betters the C suite and the studios. They are able to make decisions. But the day today is still with the data analyst. >> Those s Emmys. Those sm >> means So we have to obscure them to really start >> interacting with these new technologies where they are the leaders, rather than just waiting for answers to come through. And >> when that happens now being as a data scientist, my job is easy because they're Siamese, are there? I deploy the technology. They're semi's trained algorithms on a regular basis. Then it is a fully fungible model which is evolving with the business. And no longer am I spending time re architect ing my rules. And like my, you know, what are the masking capabilities I need to have? It is evolving us. >> Does that change the >> number one problem that you hear from data scientists, which is the 80% of the time >> spent on wrangling cleaning data 10 15 20% run into sm. He's being concerned that they're gonna be replaced by the machine. Their training. >> I actually see them being really enabled now where they're spending 80% of the time doing boring job off, looking at data. Now they're spending 90% of their time looking at the elements future creative in which requires human intelligence to say, Hey, this is different because off X, >> y and Z so let's let's go out. It sounds like a lot of what machine learning is being used for now in your domain is clean things up its plumbing. It's basic foundation work. So go out. Three years after all that work has been done and the data is clean. Where are your clients talking about going next with machine learning? Bob, did you want? >> I mean, it's a whole. It varies by by industry, obviously, but, um but it covers the gamut from, you know, and it's generally tied to what's driving their strategies. So if you look at a financial service is organization as an example today, you're gonna have, you know, really a I driving a lot of the behind the scenes on the customer experience. It's, you know, today with your credit card company. It's behind the scenes doing fraud detection. You know, that's that's going to continue. So it's take the critical functions that were more data. It makes better models that, you know, that that's just going to explode. And I think they're really you can look across all the functions, from finance to to marketing to operations. I mean, it's it's gonna be pervasive across, you know all of that. >> So if I may, I don't top award. While Bob was saying, I think what's gonna what What our clients are asking is, how can I exhilarate the decision making? Because at the end of the day on Lee, all our leaders are focused on making decisions, and all of this data science is leading up to their decision, and today you see like you know what you brought up, like 80% of the time is wasted in cleaning the data. So only 20% time was spent in riel experimentation and analytics. So your decision making time was reduced to 20% off the effort that I put in the pipeline. What if now I can make it 80% of the time? They're I put in the pipeline, better decisions are gonna come on the train. So when I go into a meeting and I'm saying like, Hey, can you show me what happened in this particular region or in this particular part of the country? Previously, it would have been like, Oh, can you come back in two weeks? I will have the data ready, and I will tell you the answer. But in two weeks, the business has ran away and the CDO know or the C Street doesn't require the same answer. But where we're headed as as the data quality improves, you can get to really time questions and decisions. >> So decision, sport, business, intelligence. Well, we're getting better. Isn't interesting to me. Six months to build a cube, we'd still still not good enough. Moving too fast. As the saying goes, data is plentiful. Insights aren't Yes, you know, in your view, well, machine intelligence. Finally, close that gap. Get us closer to real time decision >> making. It will eventually. But there's there's so much that we need to. Our industry needs to understand first, and it really ingrained. And, you know, today there is still a fundamental trust issues with a I you know, it's we've done a lot of work >> watch Black box or a part of >> it. Part of it. I think you know, the research we've done. And some of this is nine countries, 2400 senior executives. And we asked some, ah, a lot of questions around their data and trusted analytics, and 92% of them came back with. They have some fundamental trust issues with their data and their analytics and and they feel like there's reputational risk material reputational risk. This isn't getting one little number wrong on one of the >> reports about some more of an >> issue, you know, we also do a CEO study, and we've done this many years in a row going back to 2017. We started asked them okay, making a lot of companies their data driven right. When it comes to >> what they say they're doing well, They say they're day driven. That's the >> point. At the end of the day, they making strategic decisions where you have an insight that's not intuitive. Do you trust your gut? Go with the analytics back then. You know, 67% said they go with their gut, So okay, this is 2017. This industry's moving quickly. There's tons and tons of investment. Look at it. 2018 go down. No, went up 78%. So it's not aware this issue there is something We're fundamentally wrong and you hit it on. It's a part of its black box, and part of it's the date equality and part of its bias. And there's there's all of these things flowing around it. And so when we dug into that, we said, Well, okay, if that exists, how are we going to help organizations get their arms around this issue and start digging into that that trust issue and really it's the front part is, is exactly what we're talking about in terms of data quality, both structured more traditional approaches and unstructured, using the handwriting example in those types of techniques. But then you get into the models themselves, and it's, you know, the critical thing she had to worry about is, you know, lineage. So from an integrity perspective, where's the data coming from? Whether the sources for the change controls on some of that, they need to look at explain ability, gain at the black box part where you can you tell me the inferences decisions are those documented. And this is important for this me, the human in the loop to get confidence in the algorithm as well as you know, that executive group. So they understand there's a structure set of processes around >> Moneyball. Problem is actually pretty confined. It's pretty straightforward. Dono 32 teams are throwing minor leagues, but the data models pretty consistent through the problem with organizations is I didn't know data model is consistent with the organization you mentioned, Risk Bob. The >> other problem is organizational inertia. If they don't trust it, what is it? What is a P and l manage to do when he or she wants to preserve? Yeah, you know, their exit position. They attacked the data. You know, I don't believe that well, which which is >> a fundamental point, which is culture. Yes. I mean, you can you can have all the data, science and all the governance that you want. But if you don't work culture in parallel with all this, it's it's not gonna stick. And and that's, I think the lot of the leading organisations, they're starting to really dig into this. We hear a lot of it literacy. We hear a lot about, you know, top down support. What does that really mean? It means, you know, senior executives are placing bats around and linking demonstrably linking the data and the role of data days an asset into their strategies and then messaging it out and being specific around the types of investments that are going to reinforce that business strategy. So that's absolutely critical. And then literacy absolutely fundamental is well, because it's not just the executives and the data scientists that have to get this. It's the guy in ops that you're trying to get you. They need to understand, you know, not only tools, but it's less about the tools. But it's the techniques, so it's not. The approach is being used, are more transparent and and that you know they're starting to also understand, you know, the issues of privacy and data usage rights. That's that's also something that we can't leave it the curb. With all this >> innovation, it's also believing that there's an imperative. I mean, there's a lot of for all the talk about digital transformation hear it everywhere. Everybody's trying to get digital, right? But there's still a lot of complacency in the organization in the lines of business in operation to save. We're actually doing really well. You know, we're in financial service is health care really hasn't been disrupted. This is Oh, it's coming, it's coming. But there's still a lot of I'll be retired by then or hanging. Actually, it's >> also it's also the fact that, you know, like in the previous generation, like, you know, if I had to go to a shopping, I would go into a shop and if I wanted by an insurance product, I would call my insurance agent. But today the New world, it's just a top off my screen. I have to go from Amazon, so some other some other app, and this is really this is what is happening to all of our kind. Previously that they start their customers, pocketed them in different experience. Buckets. It's not anymore that's real in front of them. So if you don't get into their digital transformation, a customer is not going to discount you by saying, Oh, you're not Amazon. So I'm not going to expect that you're still on my phone and you're only two types of here, so you have to become really digital >> little surprises that you said you see the next. The next stage is being decision support rather than customer experience, because we hear that for CEOs, customer experience is top of mind right now. >> No natural profile. There are two differences, right? One is external facing is absolutely the customer internal facing. It's absolutely the decision making, because that's how they're separating. The internal were, says the external, and you know most of the meetings that we goto Customer insight is the first place where analytics is starting where data is being cleaned up. Their questions are being asked about. Can I master my customer records? Can I do a good master off my vendor list? That is where they start. But all of that leads to good decision making to support the customers. So it's like that external towards internal view well, back >> to the offense versus defense and the shift. I mean, it absolutely is on the offense side. So it is with the customer, and that's a more directly to the business strategy. So it's get That's the area that's getting the money, the support and people feel like it's they're making an impact with it there. When it's it's down here in some admin area, it's below the water line, and, you know, even though it's important and it flows up here, it doesn't get the VIN visibility. So >> that's great conversation. You coming on? You got to leave it there. Thank you for watching right back with our next guest, Dave Lot. Paul Gillen from M I t CDO I Q Right back. You're watching the Cube

Published Date : Aug 1 2019

SUMMARY :

Brought to you by We here covering the M I t CDO conference M I t CEO Day to wrapping Let's start with your So, Bob, where do you focus And so I'm the advisory chief date officer. I also have a second hat, and I also serve financial service is si Dios as well. I got sales guys in What was your role? Yeah, You know, I focus a lot on data science, artificial intelligence and I mean, you know, I were a horizontal within the form, So we help customers. seeing in terms of that shift from From the defense data to the offensive? Yeah, and it's it's really you know, when you think about it and let me define sort of offense versus respond because of getting a handle on the data they're actually finding. getting the customer experience, you know, really exploring. if you could comment Is that seems like many CBO's air not directly involved in And this is when you become more often they were proactive with data and the digital officers seen taking the reins in machine learning in machine learning projects and cos you work with. So we are seeing, like, you know, different. And you know, the work that you're doing around, um, anomaly detection is So it's I mean, you know, you developed a technology for one off our clients and pretty and the rules generation the way you would with standard data. I mean, the deal is there, but when you then start scanning it, So you can tell you where the problems are. So the data quality approaches are evolving, Exactly what kind of do we need to retrain them or bring in new skills. And I can talk from the inspector off. machine learning, still require human in the loop. And who do you who are the But the day today is still with the data Those s Emmys. And And like my, you know, what are the masking capabilities I need to have? He's being concerned that they're gonna be replaced by the machine. 80% of the time doing boring job off, looking at data. the data is clean. And I think they're really you and all of this data science is leading up to their decision, and today you see like you know what you brought Insights aren't Yes, you know, fundamental trust issues with a I you know, it's we've done a lot of work I think you know, the research we've done. issue, you know, we also do a CEO study, and we've done this many years That's the in the algorithm as well as you know, that executive group. is I didn't know data model is consistent with the organization you mentioned, Yeah, you know, science and all the governance that you want. the organization in the lines of business in operation to save. also it's also the fact that, you know, like in the previous generation, little surprises that you said you see the next. The internal were, says the external, and you know most of the meetings it's below the water line, and, you know, even though it's important and it flows up here, Thank you for

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
BarbaraPERSON

0.99+

KPMGORGANIZATION

0.99+

BobPERSON

0.99+

20%QUANTITY

0.99+

AmazonORGANIZATION

0.99+

90%QUANTITY

0.99+

80%QUANTITY

0.99+

Bob ParrPERSON

0.99+

2017DATE

0.99+

Silicon Angle MediaORGANIZATION

0.99+

Dave LotPERSON

0.99+

2018DATE

0.99+

67%QUANTITY

0.99+

nine countriesQUANTITY

0.99+

92%QUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

2400 senior executivesQUANTITY

0.99+

Six monthsQUANTITY

0.99+

three offenseQUANTITY

0.99+

firstQUANTITY

0.99+

Paul GillenPERSON

0.99+

LeePERSON

0.99+

todayDATE

0.99+

78%QUANTITY

0.99+

Sreekar KrishnaPERSON

0.99+

two typesQUANTITY

0.99+

OneQUANTITY

0.98+

32 teamsQUANTITY

0.98+

second hatQUANTITY

0.98+

Three yearsQUANTITY

0.98+

two differencesQUANTITY

0.98+

10DATE

0.98+

bothQUANTITY

0.97+

twoQUANTITY

0.97+

two weeksQUANTITY

0.97+

this weekDATE

0.96+

oneQUANTITY

0.95+

M I t CDOEVENT

0.95+

C StreetORGANIZATION

0.93+

M I t CEO DayEVENT

0.93+

Streetcar KrishnaPERSON

0.92+

about a year andDATE

0.91+

2019DATE

0.9+

CubanOTHER

0.9+

CBOORGANIZATION

0.88+

first yearQUANTITY

0.88+

Si DiosORGANIZATION

0.87+

12 years agoDATE

0.86+

10QUANTITY

0.84+

RiskPERSON

0.81+

1,000,000,000 data pointsQUANTITY

0.8+

CDOTITLE

0.8+

ParrPERSON

0.79+

CubeORGANIZATION

0.79+

1/2 agoDATE

0.78+

CDOORGANIZATION

0.78+

tons andQUANTITY

0.76+

dualQUANTITY

0.72+

15QUANTITY

0.71+

DonoORGANIZATION

0.7+

one little numberQUANTITY

0.69+

MITORGANIZATION

0.67+

threeQUANTITY

0.64+

500OTHER

0.63+

boxTITLE

0.61+

M I T.EVENT

0.6+

Cube BobORGANIZATION

0.59+

Robert Abate, Global IDS | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (futuristic music) >> Welcome back to Cambridge, Massachusetts everybody. You're watching theCUBE, the leader in live tech coverage. We go out to the events and we extract the signal from the noise. This is day two, we're sort of wrapping up the Chief Data Officer event. It's MIT CDOIQ, it started as an information quality event and with the ascendancy of big data the CDO emerged and really took center stage here. And it's interesting to know that it's kind of come full circle back to information quality. People are realizing all this data we have, you know the old saying, garbage in, garbage out. So the information quality worlds and this chief data officer world have really come colliding together. Robert Abate is here, he's the Vice President and CDO of Global IDS and also the co-chair of next year's, the 14th annual MIT CDOIQ. Robert, thanks for coming on. >> Oh, well thank you. >> Now you're a CDO by background, give us a little history of your career. >> Sure, sure. Well I started out with an Electrical Engineering degree and went into applications development. By 2000, I was leading the Ralph Lauren's IT, and I realized when Ralph Lauren hired me, he was getting ready to go public. And his problem was he had hired eight different accounting firms to do eight different divisions. And each of those eight divisions were reporting a number, but the big number didn't add up, so he couldn't go public. So he searched the industry to find somebody who could figure out the problem. Now I was, at the time, working in applications and had built this system called Service Oriented Architectures, a way of integrating applications. And I said, "Well I don't know if I could solve the problem, "but I'll give it a shot." And what I did was, just by taking each silo as it's own problem, which was what EID Accounting Firm had done, I was able to figure out that one of Ralph Lauren's policies was if you buy a garment, you can return it anytime, anywhere, forever, however long you own it. And he didn't think about that, but what that meant is somebody could go to a Bloomingdale's, buy a garment and then go to his outlet store and return it. Well, the cross channels were different systems. So the outlet stores were his own business, retail was a different business, there was a completely different, each one had their own AS/400, their own data. So what I quickly learned was, the problem wasn't the systems, the problem was the data. And it took me about two months to figure it out and he offered me a job, he said well, I was a consultant at the time, he says, "I'm offering you a job, you're going to run my IT." >> Great user experience but hard to count. >> (laughs) Hard to count. So that's when I, probably 1999 was when that happened. I went into data and started researching-- >> Sorry, so how long did it take you to figure that out? You said a couple of months? >> A couple of months, I think it was about two months. >> 'Cause jeez, it took Oracle what, 10 years to build Fusion with SOA? That's pretty good. (laughs) >> This was a little bit of luck. When we started integrating the applications we learned that the messages that we were sending back and forth didn't match, and we said, "Well that's impossible, it can't not match." But what didn't match was it was coming from one channel and being returned in another channel, and the returns showed here didn't balance with the returns on this side. So it was a data problem. >> So a forensics showdown. So what did you do after? >> After that I went into ICICI Bank which was a large bank in India who was trying to integrate their systems, and again, this was a data problem. But they heard me giving a talk at a conference on how SOA had solved the data challenge, and they said, "We're a bank with a wholesale, a retail, "and other divisions, "and we can't integrate the systems, can you?" I said, "Well yeah, I'd build a website "and make them web services and now what'll happen is "each of those'll kind of communicate." And I was at ICICI Bank for about six months in Mumbai, and finished that which was a success, came back and started consulting because now a lot of companies were really interested in this concept of Service Oriented Architectures. Back then when we first published on it, myself, Peter Aiken, and a gentleman named Joseph Burke published on it in 1996. The publisher didn't accept the book, it was a really interesting thing. We wrote the book called, "Services Based Architectures: A Way to Integrate Systems." And the way Wiley & Sons, or most publishers work is, they'll have three industry experts read your book and if they don't think what you're saying has any value, they, forget about it. So one guy said this is brilliant, one guy says, "These guys don't know what they're talking about," and the third guy says, "I don't even think what they're talking about is feasible." So they decided not to publish. Four years later it came back and said, "We want to publish the book," and Peter said, "You know what, they lost their chance." We were ahead of them by four years, they didn't understand the technology. So that was kind of cool. So from there I went into consulting, eventually took a position as the Head of Enterprise and Director of Enterprise Information Architecture with Walmart. And Walmart, as you know, is a huge entity, almost the size of the federal government. So to build an architecture that integrates Walmart would've been a challenge, a behemoth challenge, and I took it on with a phenomenal team. >> And when was this, like what timeframe? >> This was 2010, and by the end of 2010 we had presented an architecture to the CIO and the rest of the organization, and they came back to me about a week later and said, "Look, everybody agrees what you did was brilliant, "but nobody knows how to implement it. "So we're taking you away, "you're no longer Director of Information Architecture, "you're now Director of Enterprise Information Management. "Build it. "Prove that what you say you could do, you could do." So we built something called the Data CAFE, and CAFE was an acronym, it stood for: Collaborative Analytics Facility for the Enterprise. What we did was we took data from one of the divisions, because you didn't want to take on the whole beast, boil the ocean. We picked Sam's Club and we worked with their CFO, and because we had information about customers we were able to build a room with seven 80 inch monitors that surrounded anyone in the room. And in the center was the Cisco telecommunications so you could be a part of a meeting. >> The TelePresence. >> TelePresence. And we built one room in one facility, and one room in another facility, and we labeled the monitors, one red, one blue, one green, and we said, "There's got to be a way where we can build "data science so it's interactive, so somebody, "an executive could walk into the room, "touch the screen, and drill into features. "And in another room "the features would be changing simultaneously." And that's what we built. The room was brought up on Black Friday of 2013, and we were able to see the trends of sales on the East Coast that we quickly, the executives in the room, and these are the CEO of Walmart and the heads of Sam's Club and the like, they were able to change the distribution in the Mountain Time Zone and west time zones because of the sales on the East Coast gave them the idea, well these things are going to sell, and these things aren't. And they saw a tremendous increase in productivity. We received the 2014, my team received the 2014 Walmart Innovation Project of the Year. >> And that's no slouch. Walmart has always been heavily data-oriented. I don't know if it's urban legend or not, but the famous story in the '80s of the beer and the diapers, right? Walmart would position beer next to diapers, why would they do that? Well the father goes in to buy the diapers for the baby, picks up a six pack while he's on the way, so they just move those proximate to each other. (laughs) >> In terms of data, Walmart really learned that there's an advantage to understanding how to place items in places that, a path that you might take in a store, and knowing that path, they actually have a term for it, I believe it's called, I'm sorry, I forgot the name but it's-- >> Selling more stuff. (laughs) >> Yeah, it's selling more stuff. It's the way you position items on a shelf. And Walmart had the brilliance, or at least I thought it was brilliant, that they would make their vendors the data champion. So the vendor, let's say Procter & Gamble's a vendor, and they sell this one product the most. They would then be the champion for that aisle. Oh, it's called planogramming. So the planogramming, the way the shelves were organized, would be set up by Procter & Gamble for that entire area, working with all their other vendors. And so Walmart would give the data to them and say, "You do it." And what I was purporting was, well, we shouldn't just be giving the data away, we should be using that data. And that was the advent of that. From there I moved to Kimberly-Clark, I became Global Director of Enterprise Data Management and Analytics. Their challenge was they had different teams, there were four different instances of SAP around the globe. One for Latin America, one for North America called the Enterprise Edition, one for EMEA, Europe, Middle East, and Africa, and one for Asia-Pacific. Well when you have four different instances of SAP, that means your master data doesn't exist because the same thing that happens in this facility is different here. And every company faces this challenge. If they implement more than one of a system the specialty fields get used by different companies in different ways. >> The gold standard, the gold version. >> The golden version. So I built a team by bringing together all the different international teams, and created one team that was able to integrate best practices and standards around data governance, data quality. Built BI teams for each of the regions, and then a data science and advanced analytics team. >> Wow, so okay, so that makes you uniquely qualified to coach here at the conference. >> Oh, I don't know about that. (laughs) There are some real, there are some geniuses here. >> No but, I say that because these are your peeps. >> Yes, they are, they are. >> And so, you're a practitioner, this conference is all about practitioners talking to practitioners, it's content-heavy, There's not a lot of fluff. Lunches aren't sponsored, there's no lanyard sponsor and it's not like, you know, there's very subtle sponsor desks, you have to have sponsors 'cause otherwise the conference's not enabled, and you've got costs associated with it. But it's a very intimate event and I think you guys want to keep it that way. >> And I really believe you're dead-on. When you go to most industry conferences, the industry conferences, the sponsors, you know, change the format or are heavily into the format. Here you have industry thought leaders from all over the globe. CDOs of major Fortune 500 companies who are working with their peers and exchanging ideas. I've had conversations with a number of CDOs and the thought leadership at this conference, I've never seen this type of thought leadership in any conference. >> Yeah, I mean the percentage of presentations by practitioners, even when there's a vendor name, they have a practitioner, you know, internal practitioner presenting so it's 99.9% which is why people attend. We're moving venues next year, I understand. Just did a little tour of the new venue, so, going to be able to accommodate more attendees, so that's great. >> Yeah it is. >> So what are your objectives in thinking ahead a year from now? >> Well, you know, I'm taking over from my current peer, Dr. Arka Mukherjee, who just did a phenomenal job of finding speakers. People who are in the industry, who are presenting challenges, and allowing others to interact. So I hope could do a similar thing which is, find with my peers people who have real world challenges, bring them to the forum so they can be debated. On top of that, there are some amazing, you know, technology change is just so fast. One of the areas like big data I remember only five years ago the chart of big data vendors maybe had 50 people on it, now you would need the table to put all the vendors. >> Who's not a data vendor, you know? >> Who's not a data vendor? (laughs) So I would think the best thing we could do is, is find, just get all the CDOs and CDO-types into a room, and let us debate and talk about these points and issues. I've seen just some tremendous interactions, great questions, people giving advice to others. I've learned a lot here. >> And how about long term, where do you see this going? How many CDOs are there in the world, do you know? Is that a number that's known? >> That's a really interesting point because, you know, only five years ago there weren't that many CDOs to be called. And then Gartner four years ago or so put out an article saying, "Every company really should have a CDO." Not just for the purpose of advancing your data, and to Doug Laney's point that data is being monetized, there's a need to have someone responsible for information 'cause we're in the Information Age. And a CIO really is focused on infrastructure, making sure I've got my PCs, making sure I've got a LAN, I've got websites. The focus on data has really, because of the Information Age, has turned data into an asset. So organizations realize, if you utilize that asset, let me reverse this, if you don't use data as an asset, you will be out of business. I heard a quote, I don't know if it's true, "Only 10 years ago, 250 of the Fortune 10 no longer exists." >> Yeah, something like that, the turnover's amazing. >> Many of those companies were companies that decided not to make the change to be data-enabled, to make data decision processing. Companies still use data warehouses, they're always going to use them, and a warehouse is a rear-view mirror, it tells you what happened last week, last month, last year. But today's businesses work forward-looking. And just like driving a car, it'd be really hard to drive your car through a rear-view mirror. So what companies are doing today are saying, "Okay, let's start looking at this as forward-looking, "a prescriptive and predictive analytics, "rather than just what happened in the past." I'll give you an example. In a major company that is a supplier of consumer products, they were leading in the industry and their sales started to drop, and they didn't know why. Well, with a data science team, we were able to determine by pulling in data from the CDC, now these are sources that only 20 years ago nobody ever used to bring in data in the enterprise, now 60% of your data is external. So we brought in data from the CDC, we brought in data on maternal births from the national government, we brought in data from the Census Bureau, we brought in data from sources of advertising and targeted marketing towards mothers. Pulled all that data together and said, "Why are diaper sales down?" Well they were targeting the large regions of the country and putting ads in TV stations in New York and California, big population centers. Birth rates in population centers have declined. Birth rates in certain other regions, like the south, and the Bible Belt, if I can call it that, have increased. So by changing the marketing, their product sales went up. >> Advertising to Texas. >> Well, you know, and that brings to one of the points, I heard a lecture today about ethics. We made it a point at Walmart that if you ran a query that reduced a result to less than five people, we wouldn't allow you to see the result. Because, think about it, I could say, "What is my neighbor buying? "What are you buying?" So there's an ethical component to this as well. But that, you know, data is not political. Data is not chauvinistic. It doesn't discriminate, it just gives you facts. It's the interpretation of that that is hard CDOs, because we have to say to someone, "Look, this is the fact, and your 25 years "of experience in the business, "granted, is tremendous and it's needed, "but the facts are saying this, "and that would mean that the business "would have to change its direction." And it's hard for people to do, so it requires that. >> So whether it's called the chief data officer, whatever the data czar rubric is, the head of analytics, there's obviously the data quality component there whatever that is, this is the conference for, as I called them, your peeps, for that role in the organization. People often ask, "Will that role be around?" I think it's clear, it's solidifying. Yes, you see the chief digital officer emerging and there's a lot of tailwinds there, but the information quality component, the data architecture component, it's here to stay. And this is the premiere conference, the premiere event, that I know of anyway. There are a couple of others, perhaps, but it's great to see all the success. When I first came here in 2013 there were probably about 130 folks here. Today, I think there were 500 people registered almost. Next year, I think 600 is kind of the target, and I think it's very reasonable with the new space. So congratulations on all the success, and thank you for stepping up to the co-chair role, I really appreciate it. >> Well, let me tell you I thank you guys. You provide a voice at these IT conferences that we really need, and that is the ability to get the message out. That people do think and care, the industry is not thoughtless and heartless. With all the data breaches and everything going on there's a lot of fear, fear, loathing, and anticipation. But having your voice, kind of like ESPN and a sports show, gives the technology community, which is getting larger and larger by the day, a voice and we need that so, thank you. >> Well thank you, Robert. We appreciate that, it was great to have you on. Appreciate the time. >> Great to be here, thank you. >> All right, and thank you for watching. We'll be right back with out next guest as we wrap up day two of MIT CDOIQ. You're watching theCUBE. (futuristic music)

Published Date : Aug 1 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and also the co-chair of next year's, give us a little history of your career. So he searched the industry to find somebody (laughs) Hard to count. 10 years to build Fusion with SOA? and the returns showed here So what did you do after? and the third guy says, And in the center was the Cisco telecommunications and the heads of Sam's Club and the like, Well the father goes in to buy the diapers for the baby, (laughs) So the planogramming, the way the shelves were organized, and created one team that was able to integrate so that makes you uniquely qualified to coach here There are some real, there are some geniuses here. and it's not like, you know, the industry conferences, the sponsors, you know, Yeah, I mean the percentage of presentations by One of the areas like big data I remember just get all the CDOs and CDO-types into a room, because of the Information Age, and the Bible Belt, if I can call it that, have increased. It's the interpretation of that that is hard CDOs, the data architecture component, it's here to stay. and that is the ability to get the message out. We appreciate that, it was great to have you on. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
PeterPERSON

0.99+

WalmartORGANIZATION

0.99+

Peter AikenPERSON

0.99+

Robert AbatePERSON

0.99+

RobertPERSON

0.99+

Procter & GambleORGANIZATION

0.99+

CiscoORGANIZATION

0.99+

IndiaLOCATION

0.99+

MumbaiLOCATION

0.99+

Census BureauORGANIZATION

0.99+

2010DATE

0.99+

1996DATE

0.99+

New YorkLOCATION

0.99+

last weekDATE

0.99+

last yearDATE

0.99+

last monthDATE

0.99+

60%QUANTITY

0.99+

BloomingdaleORGANIZATION

0.99+

Next yearDATE

0.99+

1999DATE

0.99+

TexasLOCATION

0.99+

25 yearsQUANTITY

0.99+

10 yearsQUANTITY

0.99+

one roomQUANTITY

0.99+

2014DATE

0.99+

2013DATE

0.99+

Doug LaneyPERSON

0.99+

Sam's ClubORGANIZATION

0.99+

ICICI BankORGANIZATION

0.99+

99.9%QUANTITY

0.99+

Wiley & SonsORGANIZATION

0.99+

50 peopleQUANTITY

0.99+

Arka MukherjeePERSON

0.99+

next yearDATE

0.99+

JosPERSON

0.99+

TodayDATE

0.99+

third guyQUANTITY

0.99+

2000DATE

0.99+

todayDATE

0.99+

oneQUANTITY

0.99+

500 peopleQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

one channelQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

OracleORGANIZATION

0.99+

eachQUANTITY

0.99+

OneQUANTITY

0.99+

CDCORGANIZATION

0.99+

less than five peopleQUANTITY

0.99+

Ralph LaurenORGANIZATION

0.99+

one guyQUANTITY

0.99+

six packQUANTITY

0.99+

ESPNORGANIZATION

0.99+

four years agoDATE

0.98+

AfricaLOCATION

0.98+

SOATITLE

0.98+

five years agoDATE

0.98+

CaliforniaLOCATION

0.98+

GartnerORGANIZATION

0.98+

three industry expertsQUANTITY

0.98+

Global IDSORGANIZATION

0.98+

Four years laterDATE

0.98+

600QUANTITY

0.98+

20 years agoDATE

0.98+

East CoastLOCATION

0.98+

250QUANTITY

0.98+

Middle EastLOCATION

0.98+

four yearsQUANTITY

0.98+

one teamQUANTITY

0.97+

monthsQUANTITY

0.97+

firstQUANTITY

0.97+

about two monthsQUANTITY

0.97+

Latin AmericaLOCATION

0.97+

Andy Palmer, TAMR | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M. I. T. Chief Data officer and Information Quality Symposium 2019 Brought to you by Silicon Angle Media >> Welcome back to M I. T. Everybody watching the Cube. The leader in live tech coverage we hear a Day two of the M I t chief data officer information Quality Conference Day Volonte with Paul Dillon. Andy Palmer's here. He's the co founder and CEO of Tamer. Good to see again. It's great to see it actually coming out. So I didn't ask this to Mike. I could kind of infirm from someone's dances. But why did you guys start >> Tamer? >> Well, it really started with an academic project that Mike was doing over at M. I. T. And I was over in of artists at the time. Is the chief get officer over there? And what we really found was that there were a lot of companies really suffering from data mastering as the primary bottleneck in their company did used great new tech like the vertical system that we've built and, you know, automated a lot of their warehousing and such. But the real bottleneck was getting lots of data integrated and mastered really, really >> quickly. Yeah, He took us through the sort of problems with obviously the d. W. In terms of scaling master data management and the scanning problems was Was that really the problem that you were trying to solve? >> Yeah, it really was. And when we started, I mean, it was like, seven years ago, eight years ago, now that we started the company and maybe almost 10 when we started working on the academic project, and at that time, people weren't really thinking are worried about that. They were still kind of digesting big data. A zit was called, but I think what Mike and I kind of felt was going on was that people were gonna get over the big data, Um, and the volume of data. And we're going to start worrying about the variety of the data and how to make the data cleaner and more organized. And, uh, I think I think way called that one pretty much right. Maybe >> we're a little >> bit early, but but I think now variety is the big problem >> with the other thing about your big day. Big data's oftentimes associated with Duke, which was a batch and then you sort of saw the shifter real time and spark was gonna fix all that. And so what are you seeing in terms of the trends in terms of how data is being used to drive almost near real time business decisions. >> You know, Mike and I came out really specifically back in 2007 and declared that we thought, uh, Hadoop and H D f s was going to be far less impactful than other people. >> 07 >> Yeah, Yeah. And Mike Mike actually was really aggressive and saying it was gonna be a disaster. And I think we've finally seen that actually play out of it now that the bloom is off the rose, so to speak. And so they're They're these fundamental things that big companies struggle with in terms of their data and, you know, cleaning it up and organizing it and making it, Iike want. Anybody that's worked at one of these big companies can tell you that the data that they get from most of their internal system sucks plain and simple, and so cleaning up that data, turning it into something it's an asset rather than liability is really what what tamers all about? And it's kind of our mission. We're out there to do this and it sort of pails and compare. Do you think about the amount of money that some of these companies have spent on systems like ASAP on you're like, Yeah, but all the data inside of the systems so bad and so, uh, ugly and unuseful like we're gonna fix that problem. >> So you're you're you're special sauce and machine learning. Where are you applying machine learning most most effectively when >> we apply machine learning to probably the least sexy problem on the planet. There are a lot of companies out there that use machine learning and a I t o do predictive algorithms and all kinds of cool stuff. All we do with machine learning is actually use it to clean up data and organize data. Get it ready for people to use a I I I started in the eye industry back in the late 19 eighties on, you know, really, I learned from the sky. Marvin Minsky and Mark Marvin taught me two things. First was garbage in garbage out. There's no algorithm that's worth anything unless you've got great data, and the 2nd 1 is it's always about the human in the machine working together. And I've really been working on those two same principles most of my career, and Tamer really brings both of those together. Our goal is to prepare data so that it can be used analytically inside of these companies, that it's actually high quality and useful. And the way we do that involves bringing together the machine, mostly these advanced machine learning algorithms with humans, subject matter experts inside of these companies that actually know all the ins and outs and all the intricacies of the data inside of their company. >> So say garbage in garbage out. If you don't have good training data course you're not going good ML model. How much how much upfront work is required. G. I know it was one of your customers and how much time is required to put together on ML model that can deal with 20,000,000 records like that? >> Well, you know, the amazing thing that this happened for us in the last five years, especially is that now we've got we've built enough models from scratch inside of these large global 2000 companies that very rarely do we go into a place where there we don't already have a model that's pre built. That they can use is a starting point. And I think that's the same thing that's happening in modeling in general. If you look a great companies like data robot Andi and even in in the Python community ml live that the accessibility of these modeling tools and the models themselves are actually so they're commoditized. And so most of our models and most of the projects we work on, we've already got a model. That's a starting point. We don't really have to start from scratch. >> You mentioned gonna ta I in the eighties Is that is the notion of a I Is it same as it was in the eighties and now we've just got the tooling, the horsepower, the data to take advantage of it is the concept changed? The >> math is all the same, like, you know, absolutely full stop, like there's really no new math. The two things I think that have changed our first. There's a lot more data that's available now, and, you know, uh, neural nets are a great example, right? in Marvin's things that, you know when you look at Google translate and how aggressively they used neural nets, it was the quantity of data that was available that actually made neural nets work. The second thing that that's that's changed is the cheap availability of Compute that Now the largest supercomputer in the world is available to rent by the minute. And so we've got all this data. You've got all this really cheap compute. And then third thing is what you alluded to earlier. The accessibility of all the math that now it's becoming so simple and easy to apply these math techniques, and they're becoming you know, it's It's almost to the point where the average data scientists not the advance With the average data, scientists can do a practice. Aye, aye. Techniques that 20 years ago required five PhDs. >> It's not surprising that Google, with its new neural net technology, all the search data that it has has been so successful. It's a surprise you that that Amazon with Alexa was able to compete so effectively. >> Oh, I think that I would never underestimate Amazon and their ability to, you know, build great tact. They've done some amazing work. One of my favorite Mike and I actually, one of our favorite examples in the last, uh, three years, they took their red shift system, you know, that competed with with Veronica and they they re implemented it and, you know, as a compiled system and it really runs incredibly fast. I mean, that that feat of engineering, what was truly exceptional >> to hear you say that Because it wasn't Red Shift originally Park. So yeah, that's right, Larry Ellison craps all over Red Shift because it's just open source offer that they just took and repackage. But you're saying they did some major engineering to Oh >> my gosh, yeah, It's like Mike and I both way Never. You know, we always compared par, excelled over tika, and, you know, we always knew we were better in a whole bunch of ways. But this this latest rewrite that they've done this compiled version like it's really good. >> So as a guy has been doing a eye for 30 years now, and it's really seeing it come into its own, a lot of a I project seems right now are sort of low hanging fruit is it's small scale stuff where you see a I in five years what kind of projects are going our bar company's gonna be undertaking and what kind of new applications are gonna come out of this? But >> I think we're at the very beginning of this cycle, and actually there's a lot more potential than has been realized. So I think we are in the pick the low hanging fruit kind of a thing. But some of the potential applications of A I are so much more impactful, especially as we modernize core infrastructure in the enterprise. So the enterprise is sort of living with this huge legacy burden. And we always air encouraging a tamer our customers to think of all their existing legacy systems is just dated generating machines and the faster they can get that data into a state where they can start doing state of the art A. I work on top of it, the better. And so really, you know, you gotta put the legacy burden aside and kind of draw this line in the sand so that as you really get, build their muscles on the A. I side that you can take advantage of that with all the data that they're generating every single day. >> Everything about these data repose. He's Enterprise Data Warehouse. You guys built better with MPP technology. Better data warehouses, the master data management stuff, the top down, you know, Enterprise data models, Dupin in big data, none of them really lived up to their promise, you know? Yeah, it's kind of somewhat unfair toe toe like the MPP guys because you said, Hey, we're just gonna run faster. And you did. But you didn't say you're gonna change the world and all that stuff, right? Where's e d? W? Did Do you feel like this next wave is actually gonna live up to the promise? >> I think the next phase is it's very logical. Like, you know, I know you're talking to Chris Lynch here in a minute, and you know what? They're doing it at scale and at scale and tamer. These companies are all in the same general area. That's kind of related to how do you take all this data and actually prepare it and turn it into something that's consumable really quickly and easily for all of these new data consumers in the enterprise and like so that that's the next logical phase in this process. Now, will this phase be the one that finally sort of meets the high expectations that were set 2030 years ago with enterprise data warehousing? I don't know, but we're certainly getting closer >> to I kind of hoped knockers, and we'll have less to do any other cool stuff that you see out there. That was a technology just >> I'm huge. I'm fanatical right now about health care. I think that the opportunity for health care to be transformed with technology is, you know, almost makes everything else look like chump change. What aspect of health care? Well, I think that the most obvious thing is that now, with the consumer sort of in the driver seat in healthcare, that technology companies that come in and provide consumer driven solutions that meet the needs of patients, regardless of how dysfunctional the health care system is, that's killer stuff. We had a great company here in Boston called Pill Pack was a great example of that where they just build something better for consumers, and it was so popular and so, you know, broadly adopted again again. Eventually, Amazon bought it for $1,000,000,000. But those kinds of things and health care Pill pack is just the beginning. There's lots and lots of those kinds of opportunities. >> Well, it's right. Healthcare's ripe for disruption on, and it hasn't been hit with the digital destruction. And neither is financialservices. Really? Certainly, defenses has not yet another. They're high risk industry, so Absolutely takes longer. Well, Andy, thanks so much for making the time. You know, You gotta run. Yeah. Yeah. Thank you. All right, keep it right. Everybody move back with our next guest right after this short break. You're watching the Cube from M I T c B O Q. Right back.

Published Date : Aug 1 2019

SUMMARY :

you by Silicon Angle Media But why did you guys start like the vertical system that we've built and, you know, the problem that you were trying to solve? now that we started the company and maybe almost 10 when we started working on the academic And so what are you seeing in terms of the trends in terms of how data that we thought, uh, Hadoop and H D f s was going to be far big companies struggle with in terms of their data and, you know, cleaning it up and organizing Where are you applying machine the eye industry back in the late 19 eighties on, you know, If you don't have good training data course And so most of our models and most of the projects we work on, we've already got a model. math is all the same, like, you know, absolutely full stop, like there's really no new math. It's a surprise you that that Amazon implemented it and, you know, as a compiled system and to hear you say that Because it wasn't Red Shift originally Park. we always compared par, excelled over tika, and, you know, we always knew we were better in a whole bunch of ways. And so really, you know, you gotta put the legacy of them really lived up to their promise, you know? That's kind of related to how do you take all this data and actually to I kind of hoped knockers, and we'll have less to do any other cool stuff that you see out health care to be transformed with technology is, you know, Well, Andy, thanks so much for making the time.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MikePERSON

0.99+

AndyPERSON

0.99+

Andy PalmerPERSON

0.99+

Mark MarvinPERSON

0.99+

2007DATE

0.99+

AmazonORGANIZATION

0.99+

Paul DillonPERSON

0.99+

BostonLOCATION

0.99+

$1,000,000,000QUANTITY

0.99+

Chris LynchPERSON

0.99+

Marvin MinskyPERSON

0.99+

Larry EllisonPERSON

0.99+

FirstQUANTITY

0.99+

bothQUANTITY

0.99+

30 yearsQUANTITY

0.99+

GoogleORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

Silicon Angle MediaORGANIZATION

0.99+

second thingQUANTITY

0.99+

third thingQUANTITY

0.99+

20,000,000 recordsQUANTITY

0.99+

two same principlesQUANTITY

0.99+

seven years agoDATE

0.99+

eight years agoDATE

0.99+

Mike MikePERSON

0.98+

three yearsQUANTITY

0.98+

late 19 eightiesDATE

0.98+

firstQUANTITY

0.98+

five yearsQUANTITY

0.98+

2030 years agoDATE

0.98+

2nd 1QUANTITY

0.98+

oneQUANTITY

0.98+

OneQUANTITY

0.98+

two thingsQUANTITY

0.97+

five PhDsQUANTITY

0.97+

Day twoQUANTITY

0.97+

VeronicaPERSON

0.97+

M I. T.PERSON

0.96+

MarvinPERSON

0.96+

20 years agoDATE

0.96+

PythonTITLE

0.96+

eightiesDATE

0.94+

2019DATE

0.94+

2000 companiesQUANTITY

0.94+

Red ShiftTITLE

0.94+

DukeORGANIZATION

0.93+

AlexaTITLE

0.91+

last five yearsDATE

0.9+

M I tEVENT

0.88+

almost 10QUANTITY

0.87+

TAMRPERSON

0.86+

AndiPERSON

0.8+

M. I. T.ORGANIZATION

0.79+

TamerORGANIZATION

0.78+

Information Quality SymposiumEVENT

0.78+

Quality Conference Day VolonteEVENT

0.77+

TamerPERSON

0.77+

Google translateTITLE

0.75+

single dayQUANTITY

0.71+

HPERSON

0.71+

ChiefPERSON

0.66+

HadoopPERSON

0.64+

MITORGANIZATION

0.63+

CubeORGANIZATION

0.61+

moreQUANTITY

0.6+

M. I. T.PERSON

0.57+

Pill packCOMMERCIAL_ITEM

0.56+

Pill PackORGANIZATION

0.53+

D f sORGANIZATION

0.48+

ParkTITLE

0.44+

CDOIQEVENT

0.32+

CubePERSON

0.27+

Michael Stonebraker, TAMR | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to Cambridge, Massachusetts. Everybody, You're watching the Cube, the leader in live tech coverage, and we're covering the M I t CDO conference M I t. CDO. My name is David Monty in here with my co host, Paul Galen. Mike Stone breakers here. The legend is founder CTO of Of Tamer, as well as many other companies. Inventor Michael. Thanks for coming back in the Cube. Good to see again. Nice to be here. So this is kind of ah, repeat pattern for all of us. We kind of gather here in August that the CDO conference You're always the highlight of the show. You gave a talk this week on the top 10. Big data mistakes. You and I are one of the few. You were the few people who still use the term big data. I happen to like it. Sad that it's out of vogue already, but people associated with the doo doop it's kind of waning, but regardless, so welcome. How'd the talk go? What were you talking about. >> So I talked to a lot of people who were doing analytics. We're doing operation Offer operational day of data at scale, and they always make most of them make a collection of bad mistakes. And so the talk waas a litany of the blunders that I've seen people make, and so the audience could relate to the blunders about most. Most of the enterprise is represented. Make a bunch of the blunders. So I think no. One blunder is not planning on moving most everything to the cloud. >> So that's interesting, because a lot of people would would would love to debate that, but and I would imagine you probably could have done this 10 years ago in a lot of the blunders would be the same, but that's one that wouldn't have been there. But so I tend to agree. I was one of the two hands that went up this morning, and vocalist talk when he asked, Is the cloud cheaper for us? It is anyway. But so what? Why should everybody move everything? The cloud aren't there laws of physics, laws of economics, laws of the land that suggest maybe you >> shouldn't? Well, I guess 22 things and then a comment. First thing is James Hamilton, who's no techies. Techie works for Amazon. We know James. So he claims that he could stand up a server for 25% of your cost. I have no reason to disbelieve him. That number has been pretty constant for a few years, so his cost is 1/4 of your cost. Sooner or later, prices are gonna reflect costs as there's a race to the bottom of cloud servers. So >> So can I just stop you there for a second? Because you're some other date on that. All you have to do is look at a W S is operating margin and you'll see how profitable they are. They have software like economics. Now we're deploying servers. So sorry to interrupt, but so carry. So >> anyway, sooner or later, they're gonna have their gonna be wildly cheaper than you are. The second, then yet is from Dave DeWitt, whose database wizard. And here's the current technology that that Microsoft Azure is using. As of 18 months ago, it's shipping containers and parking lots, chilled water in power in Internet, Ian otherwise sealed roof and walls optional. So if you're doing raised flooring in Cambridge versus I'm doing shipping containers in the Columbia River Valley, who's gonna be a lot cheaper? And so you know the economies of scale? I mean, that, uh, big, big cloud guys are building data centers as fast as they can, using the cheapest technology around. You put up the data center every 10 years on dhe. You do it on raised flooring in Cambridge. So sooner or later, the cloud guys are gonna be a lot cheaper. And the only thing that isn't gonna the only thing that will change that equation is For example, my lab is up the street with Frank Gehry building, and we have we have an I t i t department who runs servers in Cambridge. Uh, and they claim they're cheaper than the cloud. And they don't pay rent for square footage and they don't pay for electricity. So yeah, if if think externalities, If there are no externalities, the cloud is assuredly going to be cheaper. And then the other thing is that most everybody tonight that I talk thio including me, has very skewed resource demands. So in the cloud finding three servers, except for the last day of the month on the last day of the month. I need 20 servers. I just do it. If I'm doing on Prem, I've got a provision for peak load. And so again, I'm just way more expensive. So I think sooner or later these combinations of effects was going to send everybody to the cloud for most everything, >> and my point about the operating margins is difference in price and cost. I think James Hamilton's right on it. If he If you look at the actual cost of deploying, it's even lower than the price with the market allows them to their growing at 40 plus percent a year and a 35 $40,000,000,000 run rate company sooner, Sooner or >> later, it's gonna be a race to the lot of you >> and the only guys are gonna win. You have guys have the best cost structure. A >> couple other highlights from your talk. >> Sure, I think 2nd 2nd thing like Thio Thio, no stress is that machine learning is going to be a game is going to be a game changer for essentially everybody. And not only is it going to be autonomous vehicles. It's gonna be automatic. Check out. It's going to be drone delivery of most everything. Uh, and so you can, either. And it's gonna affect essentially everybody gonna concert of, say, categorically. Any job that is easy to understand is going to get automated. And I think that's it's gonna be majorly impactful to most everybody. So if you're in Enterprise, you have two choices. You can be a disrupt or or you could be a disruptive. And so you can either be a taxi company or you can be you over, and it's gonna be a I machine learning that's going going to be determined which side of that equation you're on. So I was a big blunder that I see people not taking ml incredibly seriously. >> Do you see that? In fact, everyone I talked who seems to be bought in that this is we've got to get on the bandwagon. Yeah, >> I'm just pointing out the obvious. Yeah, yeah, I think, But one that's not quite so obvious you're is a lot of a lot of people I talked to say, uh, I'm on top of data science. I've hired a group of of 10 data scientists, and they're doing great. And when I talked, one vignette that's kind of fun is I talked to a data scientist from iRobot, which is the guys that have the vacuum cleaner that runs around your living room. So, uh, she said, I spend 90% of my time locating the data. I want to analyze getting my hands on it and cleaning it, leaving the 10% to do data science job for which I was hired. Of the 10% I spend 90% fixing the data cleaning errors in my data so that my models work. So she spends 99% of her time on what you call data preparation 1% of her time doing the job for which he was hired. So data science is not about data science. It's about data integration, data cleaning, data, discovery. >> But your new latest venture, >> so tamer does that sort of stuff. And so that's But that's the rial data science problem. And a lot of people don't realize that yet, And, uh, you know they will. I >> want to ask you because you've been involved in this by my count and starting up at least a dozen companies. Um, 99 Okay, It's a lot. >> It's not overstated. You estimated high fall. How do you How >> do you >> decide what challenge to move on? Because they're really not. You're not solving the same problems. You're You're moving on to new problems. How do you decide? What's the next thing that interests you? Enough to actually start a company. Okay, >> that's really easy. You know, I'm on the faculty of M i t. My job is to think of news new ship and investigate it, and I come up. No, I'm paid to come up with new ideas, some of which have commercial value, some of which don't and the ones that have commercial value, like, commercialized on. So it's whatever I'm doing at the time on. And that's why all the things I've commercialized, you're different >> s so going back to tamer data integration platform is a lot of companies out there claim to do it day to get integration right now. What did you see? What? That was the deficit in the market that you could address. >> Okay, great question. So there's the traditional data. Integration is extract transforming load systems and so called Master Data management systems brought to you by IBM in from Attica. Talent that class of folks. So a dirty little secret is that that technology does not scale Okay, in the following sense that it's all well, e t l doesn't scale for a different reason with an m d l e t l doesn't scale because e t. L is based on the premise that somebody really smart comes up with a global data model For all the data sources you want put together. You then send a human out to interview each business unit to figure out exactly what data they've got and then how to transform it into the global data model. How to load it into your data warehouse. That's very human intensive. And it doesn't scale because it's so human intensive. So I've never talked to a data warehouse operator who who says I integrate the average I talk to says they they integrate less than 10 data sources. Some people 20. If you twist my arm hard, I'll give you 50. So a Here. Here's a real world problem, which is Toyota Motor Europe. I want you right now. They have a distributor in Spain, another distributor in France. They have a country by country distributor, sometimes canton by Canton. Distribute distribution. So if you buy a Toyota and Spain and move to France, Toyota develops amnesia. The French French guys know nothing about you. So they've got 250 separate customer databases with 40,000,000 total records in 50 languages. And they're in the process of integrating that. It was single customer database so that they can Duke custom. They could do the customer service we expect when you cross cross and you boundary. I've never seen an e t l system capable of dealing with that kind of scale. E t l dozen scale to this level of problem. >> So how do you solve that problem? >> I'll tell you that they're a tamer customer. I'll tell you all about it. Let me first tell you why MGM doesn't scare. >> Okay. Great. >> So e t l says I now have all your data in one place in the same format, but now you've got following problems. You've got a d duplicated because if if I if I bought it, I bought a Toyota in Spain, I bought another Toyota in France. I'm both databases. So if you want to avoid double counting customers, you got a dupe. Uh, you know, got Duke 30,000,000 records. And so MGM says Okay, you write some rules. It's a rule based technology. So you write a rule. That's so, for example, my favorite example of a rule. I don't know if you guys like to downhill downhill skiing, All right? I love downhill skiing. So ski areas, Aaron, all kinds of public databases assemble those all together. Now you gotta figure out which ones are the same the same ski area, and they're called different names in different addresses and so forth. However, a vertical drop from bottom to the top is the same. Chances are they're the same ski area. So that's a rule that says how to how to put how to put data together in clusters. And so I now have a cluster for mount sanity, and I have a problem which is, uh, one address says something rather another address as something else. Which one is right or both? Right, so now you want. Now you have a gold. Let's call the golden Record problem to basically decide which, which, which data elements among a variety that maybe all associated with the same entity are in fact correct. So again, MDM, that's a rule's a rule based system. So it's a rule based technology and rule systems don't scale the best example I can give you for why Rules systems don't scale. His tamer has another customer. General Electric probably heard of them, and G wanted to do spend analytics, and so they had 20,000,000 spend transactions. Frank the year before last and spend transaction is I paid $12 to take a cab from here here to the airport, and I charged it to cost center X Y Z 20,000,000 of those so G has a pre built classification system for spend, so they have parts and underneath parts or computers underneath computers and memory and so forth. So pre existing preexisting class classifications for spend they want to simply classified 20,000,000 spent transactions into this pre existing hierarchy. So the traditional technology is, well, let's write some rules. So G wrote 500 rules, which is about the most any single human I can get there, their arms around so that classified 2,000,000 of the 20,000,000 transactions. You've now got 18 to go and another 500 rules is not going to give you 2,000,000 more. It's gonna give you love diminishing returns, right? So you have to write a huge number of rules and no one can possibly understand. So the technology simply doesn't scale, right? So in the case of G, uh, they had tamer health. Um, solve this. Solved this classification problem. Tamer used their 2,000,000 rule based, uh, tag records as training data. They used an ML model, then work off the training data classifies remaining 18,000,000. So the answer is machine learning. If you don't use machine learning, you're absolutely toast. So the answer to MDM the answer to MGM doesn't scale. You've got to use them. L The answer to each yell doesn't scale. You gotta You're putting together disparate records can. The answer is ml So you've got to replace humans by machine learning. And so that's that seems, at least in this conference, that seems to be resonating, which is people are understanding that at scale tradition, traditional data integration, technology's just don't work >> well and you got you got a great shot out on yesterday from the former G S K Mark Grams, a leader Mark Ramsay. Exactly. Guys. And how they solve their problem. He basically laid it out. BTW didn't work and GM didn't work, All right. I mean, kick it, kick the can top down data modelling, didn't work, kicked the candid governance That's not going to solve the problem. And But Tamer did, along with some other tooling. Obviously, of course, >> the Well, the other thing is No. One technology. There's no silver bullet here. It's going to be a bunch of technologies working together, right? Mark Ramsay is a great example. He used his stream sets and a bunch of other a bunch of other startup technology operating together and that traditional guys >> Okay, we're good >> question. I want to show we have time. >> So with traditional vendors by and large or 10 years behind the times, And if you want cutting edge stuff, you've got to go to start ups. >> I want to jump. It's a different topic, but I know that you in the past were critic of know of the no sequel movement, and no sequel isn't going away. It seems to be a uh uh, it seems to be actually gaining steam right now. What what are the flaws in no sequel? It has your opinion changed >> all? No. So so no sequel originally meant no sequel. Don't use it then. Then the marketing message changed to not only sequel, So sequel is fine, but no sequel does others. >> Now it's all sequel, right? >> And my point of view is now. No sequel means not yet sequel because high level language, high level data languages, air good. Mongo is inventing one Cassandra's inventing one. Those unless you squint, look like sequel. And so I think the answer is no sequel. Guys are drifting towards sequel. Meanwhile, Jason is That's a great idea. If you've got your regular data sequel, guys were saying, Sure, let's have Jason is the data type, and I think the only place where this a fair amount of argument is schema later versus schema first, and I pretty much think schema later is a bad idea because schema later really means you're creating a data swamp exactly on. So if you >> have to fix it and then you get a feel of >> salary, so you're storing employees and salaries. So, Paul salaries recorded as dollars per month. Uh, Dave, salary is in euros per week with a lunch allowance minds. So if you if you don't, If you don't deal with irregularities up front on data that you care about, you're gonna create a mess. >> No scheme on right. Was convenient of larger store, a lot of data cheaply. But then what? Hard to get value out of it created. >> So So I think the I'm not opposed to scheme later. As long as you realize that you were kicking the can down the road and you're just you're just going to give your successor a big mess. >> Yeah, right. Michael, we gotta jump. But thank you so much. Sure appreciate it. All right. Keep it right there, everybody. We'll be back with our next guest right into the short break. You watching the cue from M i t cdo Ike, you right back

Published Date : Aug 1 2019

SUMMARY :

Brought to you by We kind of gather here in August that the CDO conference You're always the highlight of the so the audience could relate to the blunders about most. physics, laws of economics, laws of the land that suggest maybe you So he claims that So can I just stop you there for a second? And so you know the and my point about the operating margins is difference in price and cost. You have guys have the best cost structure. And so you can either be a taxi company got to get on the bandwagon. leaving the 10% to do data science job for which I was hired. But that's the rial data science problem. want to ask you because you've been involved in this by my count and starting up at least a dozen companies. How do you How You're You're moving on to new problems. No, I'm paid to come up with new ideas, s so going back to tamer data integration platform is a lot of companies out there claim to do and so called Master Data management systems brought to you by IBM I'll tell you that they're a tamer customer. So the answer to MDM the I mean, kick it, kick the can top down data modelling, It's going to be a bunch of technologies working together, I want to show we have time. and large or 10 years behind the times, And if you want cutting edge It's a different topic, but I know that you in the past were critic of know of the no sequel movement, No. So so no sequel originally meant no So if you So if you if Hard to get value out of it created. So So I think the I'm not opposed to scheme later. But thank you so much.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
MichaelPERSON

0.99+

JamesPERSON

0.99+

Mark RamsayPERSON

0.99+

James HamiltonPERSON

0.99+

Paul GalenPERSON

0.99+

Dave DeWittPERSON

0.99+

ToyotaORGANIZATION

0.99+

David MontyPERSON

0.99+

General ElectricORGANIZATION

0.99+

2,000,000QUANTITY

0.99+

FranceLOCATION

0.99+

AmazonORGANIZATION

0.99+

20,000,000QUANTITY

0.99+

10%QUANTITY

0.99+

Michael StonebrakerPERSON

0.99+

CambridgeLOCATION

0.99+

IBMORGANIZATION

0.99+

50QUANTITY

0.99+

$12QUANTITY

0.99+

SpainLOCATION

0.99+

18,000,000QUANTITY

0.99+

25%QUANTITY

0.99+

20 serversQUANTITY

0.99+

90%QUANTITY

0.99+

Columbia River ValleyLOCATION

0.99+

99%QUANTITY

0.99+

18QUANTITY

0.99+

AaronPERSON

0.99+

DavePERSON

0.99+

AugustDATE

0.99+

Silicon Angle MediaORGANIZATION

0.99+

three serversQUANTITY

0.99+

35 $40,000,000,000QUANTITY

0.99+

50 languagesQUANTITY

0.99+

500 rulesQUANTITY

0.99+

22 thingsQUANTITY

0.99+

10 data scientistsQUANTITY

0.99+

Mike StonePERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

MGMORGANIZATION

0.99+

less than 10 data sourcesQUANTITY

0.99+

IanPERSON

0.99+

PaulPERSON

0.99+

1%QUANTITY

0.99+

bothQUANTITY

0.99+

Toyota Motor EuropeORGANIZATION

0.99+

Of TamerORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

oneQUANTITY

0.99+

singleQUANTITY

0.99+

AtticaORGANIZATION

0.99+

10 years agoDATE

0.99+

yesterdayDATE

0.99+

iRobotORGANIZATION

0.99+

Mark GramsPERSON

0.99+

TAMRPERSON

0.99+

10 yearsQUANTITY

0.99+

20QUANTITY

0.98+

1/4QUANTITY

0.98+

250 separate customer databasesQUANTITY

0.98+

CassandraPERSON

0.98+

First thingQUANTITY

0.98+

30,000,000 recordsQUANTITY

0.98+

both databasesQUANTITY

0.98+

18 months agoDATE

0.98+

firstQUANTITY

0.98+

M I t CDOEVENT

0.98+

One blunderQUANTITY

0.98+

TamerPERSON

0.98+

one placeQUANTITY

0.98+

secondQUANTITY

0.97+

two choicesQUANTITY

0.97+

tonightDATE

0.97+

each business unitQUANTITY

0.97+

Thio ThioPERSON

0.97+

two handsQUANTITY

0.96+

this weekDATE

0.96+

FrankPERSON

0.95+

DukeORGANIZATION

0.95+

Aaron Kalb, Alation | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. (dramatic music) >> Welcome back to Cambridge, Massachusetts, everybody. This is theCUBE, the leader in live tech coverage. We go out to the events, and we extract the signal from then noise. And, we're here at the MIT CDOIQ, the Chief Data Officer conference. I'm Dave Vellante with my cohost Paul Gillin. Day two of our wall to wall coverage. Aaron Kalb is here. He's the cofounder and chief data officer of Alation. Aaron, thanks for making the time to come on. >> Thanks so much Dave and Paul for having me. >> You're welcome. So, words matter, you know, and we've been talking about data, and big data, and the three Vs, and data is the new oil, and all this stuff. You gave a talk this week about, you know, "We're maybe not talking the right language "when it comes to data." What did you mean by all that? >> Absolutely, so I get a little bit frustrated by some of these cliques we hear at conference after conference, and the one I, sort of, took aim at in this talk is, data is the new oil. I think what people want to invoke with that is to say, in the same way that oil powered the industrial age, data's powering the information age. Just saying, data's really cool and trendy and important. That's true, but there are a lot of other associations and contexts that people have with data, and some of them don't really apply as, I'm sorry, with oil. And, some of them apply, as well, to data. >> So, is data more valuable than oil? >> Well, I think they're each valuable in different ways, but I think there's a couple issues with the metaphor. One is that data is scarce and dwindling, and part of value comes from the fact that it's so rare. Whereas, the experience with data is that it's so plentiful and abundant, we're almost drowning in it. And so, what I contend is, instead of talking about data as compared to oil, we should talk about data compared to water. And, the idea is, you know, water is very plentiful on the planet, but sometimes, you know, if you have saltwater or contaminated water, you can't drink it. Water is good for different purposes, depending on its form, and so it's all about getting the right data for the right purpose, like water. >> Well, we've certainly, at least in my opinion, fought wars, Paul, over oil. >> And, over water. >> And, certainly, conflicts over water. Do you think we'll be fighting wars over data? Or, are we already? >> No, we might be. One of my favorite talks from the sessions here was a keynote by the CDO for the Department of Defense, who was talking about, you know, the civic duty about transparency but was observing that, actually, more IP addresses from China and Russia are looking at our public datasets than from within the country. So, you know, it's definitely a resource that can be very powerful. >> So, what was the reaction to your premise from the audience. What kind of questions did you get? >> You know, people actually responded very favorably, including some folks from the oil and gas industry, which I was pleased to find. We have a lot of customers in energy, so that was cool. But, what it was nice being here at MIT and just really geeking out about language and linguistics and data with a bunch of CDOs and other people who are, kind of, data intellectuals. >> Right, so if data is not the new oil. >> And, water isn't really a good analogy either, because the supply of water is finite. >> That's true. >> So, what is data? >> Yeah. >> Space? >> Yeah, it's a good point. >> Matter? >> Maybe it is like the universe in that it's always expanding, right, somehow. Right, because any thing, any physic which is on the planet probably won't be growing at that exponential speed. >> So, give us the punchline. >> Well, so I contend that water, while imperfect, is, actually, a really good metaphor that helps for a lot of things. It has properties like the fact that if it's a data quality issue, it flows downstream like pollution in a river. It's the fact that it can come in different forms, useful for different purposes. You might have gray water, right, which is good enough for, you know, irrigation or industrial purposes, but not safe to drink. And so, you rely on metadata to get the data that's in the right form. And, you know, the talk is more fun because you've a lot of visual examples that make this clear. >> Yeah, of course, yeah. >> I actually had one person in the audience say that he used a similar analogy in his own company, so it's fun to trade notes. >> So, chief data officer is a relatively new title for you, is it not? In terms of your role at Alation. >> Yeah, that's right, and the most fun thing about my job is being able to interact with all of the other CDOs and CDAOs at a conference like this. And, it was cool to see. I believe this conference doubled since the last year. Is that right? >> No. >> No, it's up about a hundred, though. >> Right. >> Well. >> And, it's about double from three years ago. >> And, when we first started, in 2013, yeah. >> 130 people, yeah. >> Yeah, it was a very small and intimate event. >> Yeah, here we're outgrowing this building, it seems. >> Yeah, they're kicking us out. >> I think what's interesting is, you know, if we do a little bit of analysis, this is a small data, within our own company, you know, our biggest and most visionary customers typically bought Alation. The buyer champion either was a CDO or they weren't a CDO when they bought the software and have since been promoted to be a CDO. And so, seeing this trend of more and more CDOs cropping up is really exciting for us. And also, just hearing all of the people at the conference saying, two trends we're hearing. A move from, sort of, infrastructure and technology to driving business value, and a move from defense and governance to, sort of, playing offense and doing revenue generation with data. Both of those trends are really exciting for us. >> So, don't hate me for asking this question, because what a lot of companies will do is, they'll give somebody a CDO title, and it's, kind of, a little bit of gimmick, right, to go to market. And, they'll drag you into sales, because I'm sure they do, as a cofounder. But, as well, I know CDOs at tech companies that are actually trying to apply new techniques, figure out how data contributes to their business, how they can cut costs, raise revenue. Do you have an internal role, as well? >> Absolutely, yeah. >> Explain that. >> So, Alation, you know, we're about 250 people, so we're not at the same scale as many of the attendees here. But, we want to learn, you know, from the best, and always apply everything that we learn internally as well. So, obviously, analytics, data science is a huge role in our internal operations. >> And so, what kinds of initiatives are you driving internally? Is it, sort of, cost initiatives, efficiency, innovation? >> Yeah, I think it's all of the above, right. Every single division and both in the, sort of, operational efficiency and cost cutting side as well as figuring out the next big bet to make, can be informed by data. And, our goal was to empower a curious and rational world, and our every decision be based not on the highest paid person's opinion, but on the best evidence possible. And so, you know, the goal of my function is largely to enable that both centrally and within each business unit. >> I want to talk to you about data catalogs a bit because it's a topic close to my heart. I've talked to a lot of data catalog companies over the last couple years, and it seems like, for one thing, the market's very crowded right now. It seems to me. Would you agree there are a lot of options out there? >> Yeah, you know, it's been interesting because when we started it, we were basically the first company to make this technology and to, kind of, use this term, data catalog, in this way. And, it's been validating to see, you know, a lot of big players and other startups even, kind of, coming to that terminology. But, yeah, it has gotten more crowded, and I think our customers who, or our prospects, used to ask us, you know, "What is it that you do? "Explain this catalog metaphor to me," are now saying, "Yeah, catalogs, heard about that." >> It doesn't need to be defined anymore. >> "Which one should I pick? "Why you?" Yeah. >> What distinguished one product from another, you know? What are the major differentiation points? >> Yeah, I think one thing that's interesting is, you know, my talk was about how the metaphors we use shape the way we think. And, I think there's a sense in which, kind of, the history of each company shapes their philosophy and their approach, so we've always been a data catalog company. That's our one product. Some of the other catalog vendors come from ETL background, so they're a lot more focused on technical metadata and infrastructure. Some of the catalog products grew out of governance, and so it's, sort of, governance first, no sorry, defense first and then offense secondary. So, I think that's one of the things, I think, we encourage our prospects to look at, is, kind of, the soul of the company and how that affects their decisions. The other thing is, of course, technology. And, what we at Alation are really excited about, and it's been validating to hear Gartner and others and a lot of the people here, like the GSK keynote speaker yesterday, talking about the importance of comprehensiveness and on taking a behavioral approach, right. We have our Behavioral IO technology that really says, "Let's not look at all the bits and the bytes, "but how are people using the data to drive results?" As our core differentiator. >> Do your customers generally standardize on one data catalog, or might they have multiple catalogs for multiple purposes? >> Yeah, you know, we heard a term more last season, of catalog of catalogs, you know. And, people here can get arbitrarily, you know, meta, meta, meta data, where we like to go there. I think the customers we see most successful tend to have one catalog that serves this function of the single source of reference. Many of our customers will say, you know, that their catalog serves as, sort of, their internal Google for data. Or, the one stop shop where you could find everything. Even though they may have many different sources, Typically you don't want to have siloed catalogs. It makes it harder to find what you're looking for. >> Let's play a little word association with some metaphors. Data lake. (laughter) >> Data lake's another one that I sort of hate. If you think about it, people had data warehouses and didn't love them, but at least, when you put something into a warehouse, you can get it out, right. If you throw something into a lake, you know, there's really no hope you're ever going to find it. It's probably not going to be in great shape, and we're not surprised to find that many folks who invested heavily in data lakes are now having to invest in a layer over it, to make it comprehensible and searchable. >> So, yeah, the lake is where we hide the stolen cars. Data swamp. >> Yeah, I mean, I think if your point is it's worse than lake, it works. But, I think we can do better a lake, right. >> How about data ocean? (laughter) >> You know, out of respect for John Furrier, I'll say it's fantastic. But, to us we think, you know, it isn't really about the size. The more data you have, people think the more data the better. It's actually the more data the worse unless you have a mechanism for finding the little bit of data that is relevant and useful for your task and put it to use. >> And to, want to set up, enter the catalog. So, technically, how does the catalog solve that problem? >> Totally, so if we think about, maybe let's go to the warehouse, for example. But, it works just as well on a data lake in practice. >> Yeah, cool. >> Through the catalog is. It starts with the inventory, you know, what's on every single shelf. But, if you think about what Amazon has done, they have the inventory warehouse in the back, but what you see as a consumer is a simple search interface, where you type in the word of the product you're looking for. And then, you see ranked suggestions for different items, you know, toasters, lamps, whatever, books I want to buy. Same thing for data. I can type in, you know, if I'm at the DOD, you know, information about aircraft, or information about, you know, drug discovery if I'm at GSK. And, I should be able to therefore see all of the different data sets that I have. And, that's true in almost any catalog, that you can do some search over the curated data sets there. With Alation in particular, what I can see is, who's using it, how are they using it, what are they joining it with, what results do they find in that process. And, that can really accelerate the pace of discovery. >> Go ahead. >> I'm sorry, Dave. To what degree can you automate some of that detail, like who's using it and what it's being used for. I mean, doesn't that rely on people curating the catalog? Or, to what degree can you automate that? >> Yeah, so it's a great question. I think, sometimes, there's a sense with AI or ML that it's like the computer is making the decisions or making things up. Which is, obviously, very scary. Usually, the training data comes from humans. So, our goal is to learn from humans in two ways. There's learning from humans where humans explicitly teach you. Somebody goes and says, "This is goal standard data versus this is, "you know, low quality data." And, they do that manually. But, there's also learning implicitly from people. So, in the same way on amazon.com, if I buy one item and then buy another, I'm doing that for my own purposes, but Amazon can do collaborative filtering over all of these trends and say, "You might want to buy this item." We can do a similar thing where we parse the query logs, parse the usage logs and be eye tools, and can basically watch what people are doing for their own purposes. Not to, you know, extra work on top of their job to help us. We can learn from that and make everybody more effective. >> Aaron, is data classification a part of all this? Again, when we started in the industry, data classification was a manual exercise. It's always been a challenge. Certainly, people have applied math to it. You've seen support vector machines and probabilistic latent cement tech indexing being used to classify data. Have we solved that problem, as an industry? Can you automate the classification of data on creation or use at this point in time? >> Well, one thing that came up in a few talks about AI and ML here is, regardless of the algorithm you're using, whether it's, you know, IFH or SVM, or something really modern and exciting that keeps learning. >> Stuff that's been around forever or, it's like you say, some new stuff, right. >> Yeah, you know, actually, I think it was said best by Michael Collins at the DOD, that data is more important than the algorithm because even the best algorithm is useless without really good training data. Plus, the algorithm's, kind of, everyone's got them. So, really often, training data is the limiting reactant in getting really good classification. One thing we try to do at Alation is create an upward spiral where maybe some data is curated manually, and then we can use that as a seed to make some suggestions about how to label other data. And then, it's easier to just do a confirm or deny of a guess than to actually manually label everything. So, then you get more training, get it faster, and it kind of accelerates that way instead of being a big burden. >> So, that's really the advancement in the last five to what, five, six years. Where you're able to use machine intelligence to, sort of, solve that problem as opposed to brute forcing it with some algorithm. Is that fair? >> Yeah, I think that's right, and I think what gets me very excited is when you can have these interactive loops where the human helps the computer, which helps the human. You get, again, this upward spiral. Instead of saying, "We have to have all of this, "you know, manual step done "before we even do the first step," or trying to have an algorithm brute force it without any human intervention. >> It's kind of like notes key mode on write, except it actually works. I'm just kidding to all my ADP friends. All right, Aaron, hey. Thanks very much for coming on theCUBE, but give your last word on the event. I think, is this your first one or no? >> This is our first time here. >> Yeah, okay. So, what are your thoughts? >> I think we'll be back. It's just so exciting to get people who are thinking really big about data but are also practitioners who are solving real business problems. And, just the exchange of ideas and best practices has been really inspiring for me. >> Yeah, that's great. >> Yeah. >> Well, thank you for the support of the event, and thanks for coming on theCUBE. It was great to see you again. >> Thanks Dave, thanks Paul. >> All right, you're welcome. >> Thank you, sir. >> All right, keep it right there, everybody. We'll be back with our next guest right after this short break. You're watching theCUBE from MIT CDOIQ. Be right back. (upbeat music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. Aaron, thanks for making the time to come on. and data is the new oil, and all this stuff. in the same way that oil powered the industrial age, And, the idea is, you know, water is very plentiful Well, we've certainly, at least in my opinion, Do you think we'll be fighting wars over data? So, you know, it's definitely a resource What kind of questions did you get? We have a lot of customers in energy, so that was cool. because the supply of water is finite. Maybe it is like the universe And, you know, the talk is more fun because you've a lot I actually had one person in the audience say So, chief data officer is a relatively Yeah, that's right, and the most fun thing I think what's interesting is, you know, And, they'll drag you into sales, But, we want to learn, you know, from the best, And so, you know, the goal of my function I want to talk to you about data catalogs a bit And, it's been validating to see, you know, "Which one should I pick? Yeah, I think one thing that's interesting is, you know, Or, the one stop shop where you could find everything. Data lake. when you put something into a warehouse, So, yeah, the lake is where we hide the stolen cars. But, I think we can do better a lake, right. But, to us we think, you know, So, technically, how does the catalog solve that problem? maybe let's go to the warehouse, for example. I can type in, you know, if I'm at the DOD, you know, Or, to what degree can you automate that? Not to, you know, extra work on top of their job to help us. Can you automate the classification of data whether it's, you know, IFH or SVM, or something it's like you say, some new stuff, right. Yeah, you know, actually, I think it was said best in the last five to what, five, six years. when you can have these interactive loops I'm just kidding to all my ADP friends. So, what are your thoughts? And, just the exchange of ideas It was great to see you again. We'll be back with our next guest

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Michael CollinsPERSON

0.99+

Paul GillinPERSON

0.99+

PaulPERSON

0.99+

AmazonORGANIZATION

0.99+

DavePERSON

0.99+

2013DATE

0.99+

Aaron KalbPERSON

0.99+

Dave VellantePERSON

0.99+

AaronPERSON

0.99+

fiveQUANTITY

0.99+

Department of DefenseORGANIZATION

0.99+

six yearsQUANTITY

0.99+

John FurrierPERSON

0.99+

amazon.comORGANIZATION

0.99+

yesterdayDATE

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

AlationPERSON

0.99+

AlationORGANIZATION

0.99+

GartnerORGANIZATION

0.99+

one itemQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

first stepQUANTITY

0.99+

last yearDATE

0.99+

GSKORGANIZATION

0.99+

bothQUANTITY

0.99+

DODORGANIZATION

0.99+

one personQUANTITY

0.99+

GoogleORGANIZATION

0.99+

130 peopleQUANTITY

0.98+

OneQUANTITY

0.98+

first timeQUANTITY

0.98+

MITORGANIZATION

0.98+

one productQUANTITY

0.97+

three years agoDATE

0.97+

this weekDATE

0.97+

twoQUANTITY

0.97+

MIT CDOIQORGANIZATION

0.96+

MIT Chief Data Officer andEVENT

0.96+

one data catalogQUANTITY

0.96+

eachQUANTITY

0.96+

each companyQUANTITY

0.95+

BothQUANTITY

0.95+

one thingQUANTITY

0.95+

first oneQUANTITY

0.94+

one catalogQUANTITY

0.93+

two trendsQUANTITY

0.93+

theCUBEORGANIZATION

0.93+

firstQUANTITY

0.92+

first companyQUANTITY

0.92+

last couple yearsDATE

0.92+

CDOORGANIZATION

0.91+

about a hundredQUANTITY

0.91+

single shelfQUANTITY

0.88+

about 250 peopleQUANTITY

0.88+

single sourceQUANTITY

0.87+

ChinaLOCATION

0.87+

2019DATE

0.86+

Day twoQUANTITY

0.86+

oneQUANTITY

0.85+

each business unitQUANTITY

0.82+

MIT CDOIQEVENT

0.79+

ADPORGANIZATION

0.79+

couple issuesQUANTITY

0.76+

Information Quality Symposium 2019EVENT

0.76+

One thingQUANTITY

0.7+

single divisionQUANTITY

0.69+

one stopQUANTITY

0.68+

RussiaLOCATION

0.64+

threeQUANTITY

0.61+

doubleQUANTITY

0.59+

favoriteQUANTITY

0.5+

CDOIQEVENT

0.46+

ChiefPERSON

0.42+

Lars Toomre, Brass Rat Capital | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to M I. T. Everybody. This is the Cube. The leader in live coverage. My name is David wanted. I'm here with my co host, Paul Gill, in this day to coverage of the M I t cdo I Q conference. A lot of acronym stands for M I. T. Of course, the great institution. But Chief Data officer information quality event is his 13th annual event. Lars to Maria's here is the managing partner of Brass Rat Capital. Cool name Lars. Welcome to the Cube. Great. Very much. Glad I start with a name brass around Capitol was That's >> rat is reference to the M I t school. Okay, Beaver? Well, he is, but the students call it a brass rat, and I'm third generation M i t. So it's just seen absolutely appropriate. That is a brass rods and capital is not a reference to money, but is actually referenced to the intellectual capital. They if you have five or six brass rats in the same company, you know, we Sometimes engineers arrive and they could do some things. >> And it Boy, if you put in some data data capital in there, you really explosions. We cause a few problems. So we're gonna talk about some new regulations that are coming down. New legislation that's coming down that you exposed me to yesterday, which is gonna have downstream implications. You get ahead of this stuff and understand it. You can really first of all, prepare, make sure you're in compliance, but then potentially take advantage for your business. So explain to us this notion of open government act. >> Um, in the last five years, six years or so, there's been an effort going on to increase the transparency across all levels of government. Okay, State, local and federal government. The first of federal government laws was called the the Open Data Act of 2014 and that was an act. They was acted unanimously by Congress and signed by Obama. They was taking the departments of the various agencies of the United States government and trying to roll up all the expenses into one kind of expense. This is where we spent our money and who got the money and doing that. That's what they were trying to do. >> Big picture type of thing. >> Yeah, big picture type thing. But unfortunately, it didn't work, okay? Because they forgot to include this odd word called mentalities. So the same departments meant the same thing. Data problem. They have a really big data problem. They still have it. So they're to G et o reports out criticizing how was done, and the government's gonna try and correct it. Then in earlier this year, there was another open government date act which said in it was signed by Trump. Now, this time you had, like, maybe 25 negative votes, but essentially otherwise passed Congress completely. I was called the Open as all capital O >> P E >> n Government Data act. Okay, and that's not been implemented yet. But there's live talking around this conference today in various Chief date officers are talking about this requirement that every single non intelligence defense, you know, vital protection of the people type stuff all the like, um, interior, treasury, transportation, those type of systems. If you produce a report these days, which is machine, I mean human readable. You must now in two years or three years. I forget the exact invitation date. Have it also be machine readable. Now, some people think machine riddle mil means like pdf formats, but no, >> In fact, what the government did is it >> said it must be machine readable. So you must be able to get into the reports, and you have to be able to extract out the information and attach it to the tree of knowledge. Okay, so we're all of sudden having context like they're currently machine readable, Quote unquote, easy reports. But you can get into those SEC reports. You pull out the net net income information and says its net income, but you don't know what it attaches to on the tree of knowledge. So, um, we are helping the government in some sense able, machine readable type reporting that weaken, do machine to machine without people being involved. >> Would you say the tree of knowledge You're talking about the constant >> man tick semantic tree of knowledge so that, you know, we all come from one concept like the human is example of a living thing living beast, a living Beeston example Living thing. So it also goes back, and they're serving as you get farther and farther out the tree, there's more distance or semantic distance, but you can attach it back to concept so you can attach context to the various data. Is this essentially metadata? That's what people call it. But if I would go over see sale here at M I t, they would turn around. They call it the Tree of Knowledge or semantic data. Okay, it's referred to his semantic dated, So you are passing not only the data itself, but the context that >> goes along with the data. Okay, how does this relate to the financial transparency? >> Well, Financial Transparency Act was introduced by representative Issa, who's a Republican out of California. He's run the government Affairs Committee in the House. He retired from Congress this past November, but in 2017 he introduced what's got referred to his H R 15 30 Um, and the 15 30 is going to dramatically change the way, um, financial regulators work in the United States. Um, it is about it was about to be introduced two weeks ago when the labor of digital currency stuff came up. So it's been delayed a little bit because they're trying to add some of the digital currency legislation to that law. >> A front run that Well, >> I don't know exactly what the remember soul coming out of Maxine Waters Committee. So the staff is working on a bunch of different things at once. But, um, we own g was asked to consult with them on looking at the 15 30 act and saying, How would we improve quote unquote, given our technical, you know, not doing policy. We just don't have the technical aspects of the act. How would we want to see it improved? So one of the things we have advised is that for the first time in the United States codes history, they're gonna include interesting term called ontology. You know what intelligence? Well, everyone gets scared by the word. And when I read run into people, they say, Are you a doctor? I said, no, no, no. I'm just a date. A guy. Um, but an intolerant tea is like a taxonomy, but it had order has important, and an ontology allows you to do it is ah, kinda, you know, giving some context of linking something to something else. And so you're able Thio give Maur information with an intolerant that you're able to you with a tax on it. >> Okay, so it's a taxonomy on steroids? >> Yes, exactly what? More flexible, >> Yes, but it's critically important for artificial intelligence machine warning because if I can give them until ology of sort of how it goes up and down the semantics, I can turn around, do a I and machine learning problems on the >> order of 100 >> 1000 even 10,000 times faster. And it has context. It has contacts in just having a little bit of context speeds up these problems so dramatically so and it is that what enables the machine to machine? New notion? No, the machine to machine is coming in with son called SP R M just standard business report model. It's a OMG sophistication of way of allowing the computers or machines, as we call them these days to get into a standard business report. Okay, so let's say you're ah drug company. You have thio certify you >> drugged you manufactured in India, get United States safely. Okay, you have various >> reporting requirements on the way. You've got to give extra easy the FDA et cetera that will always be a standard format. The SEC has a different format. FERC has a different format. Okay, so what s p r m does it allows it to describe in an intolerant he what's in the report? And then it also allows one to attach an ontology to the cells in the report. So if you like at a sec 10 Q 10 k report, you can attach a US gap taxonomy or ontology to it and say, OK, net income annual. That's part of the income statement. You should never see that in a balance sheet type item. You know his example? Okay. Or you can for the first time by having that context you can say are solid problem, which suggested that you can file these machine readable reports that air wrong. So they believe or not, There were about 50 cases in the last 10 years where SEC reports have been filed where the assets don't equal total liabilities, plus cheryl equity, you know, just they didn't add >> up. So this to, >> you know, to entry accounting doesn't work. >> Okay, so so you could have the machines go and check scale. Hey, we got a problem We've >> got a problem here, and you don't have to get humans evolved. So we're gonna, um uh, Holland in Australia or two leaders ahead of the United States. In this area, they seem dramatic pickups. I mean, Holland's reporting something on the order of 90%. Pick up Australia's reporting 60% pickup. >> We say pick up. You're talking about pickup of errors. No efficiency, productivity, productivity. Okay, >> you're taking people out of the whole cycle. It's dramatic. >> Okay, now what's the OMG is rolling on the hoof. Explain the OMG >> Object Management Group. I'm not speaking on behalf of them. It's a membership run organization. You remember? I am a >> member of cold. >> I'm a khalid of it. But I don't represent omg. It's the membership has to collectively vote that this is what we think. Okay, so I can't speak on them, right? I have a pretty significant role with them. I run on behalf of OMG something called the Federated Enterprise Risk Management Group. That's the group which is focusing on risk management for large entities like the federal government's Veterans Affairs or Department offense upstairs. I think talking right now is the Chief date Officer for transportation. OK, that's a large organization, which they, they're instructed by own be at the, um, chief financial officer level. The one number one thing to do for the government is to get an effective enterprise worst management model going in the government agencies. And so they come to own G let just like NIST or just like DARPA does from the defense or intelligence side, saying we need to have standards in this area. So not only can we talk thio you effectively, but we can talk with our industry partners effectively on space. Programs are on retail, on medical programs, on finance programs, and so they're at OMG. There are two significant financial programs, or Sanders, that exist once called figgy financial instrument global identifier, which is a way of identifying a swap. Its way of identifying a security does not have to be used for a que ce it, but a worldwide. You can identify that you know, IBM stock did trade in Tokyo, so it's a different identifier has different, you know, the liberals against the one trading New York. Okay, so those air called figgy identifiers them. There are attributes associated with that security or that beast the being identified, which is generally comes out of 50 which is the financial industry business ontology. So you know, it says for a corporate bond, it has coupon maturity, semi annual payment, bullets. You know, it is an example. So that gives you all the information that you would need to go through to the calculation, assuming you could have a calculation routine to do it, then you need thio. Then turn around and set up your well. Call your environment. You know where Ford Yield Curves are with mortgage backed securities or any portable call. Will bond sort of probabilistic lee run their numbers many times and come up with effective duration? Um, And then you do your Vader's analytics. No aggregating the portfolio and looking at Shortfalls versus your funding. Or however you're doing risk management and then finally do reporting, which is where the standardized business reporting model comes in. So that kind of the five parts of doing a full enterprise risk model and Alex So what >> does >> this mean for first? Well, who does his impact on? What does it mean for organizations? >> Well, it's gonna change the world for basically everyone because it's like doing a clue ends of a software upgrade. Conversion one's version two point. Oh, and you know how software upgrades Everyone hates and it hurts because everyone's gonna have to now start using the same standard ontology. And, of course, that Sarah Ontology No one completely agrees with the regulators have agreed to it. The and the ultimate controlling authority in this thing is going to be F sock, which is the Dodd frank mandated response to not ever having another chart. So the secretary of Treasury heads it. It's Ah, I forget it's the, uh, federal systemic oversight committee or something like that. All eight regulators report into it. And, oh, if our stands is being the adviser Teff sock for all the analytics, what these laws were doing, you're getting over farm or more power to turn around and look at how we're going to find data across the three so we can come up consistent analytics and we can therefore hopefully take one day. Like Goldman, Sachs is pre payment model on mortgages. Apply it to Citibank Portfolio so we can look at consistency of analytics as well. It is only apply to regulated businesses. It's gonna apply to regulated financial businesses. Okay, so it's gonna capture all your mutual funds, is gonna capture all your investment adviser is gonna catch her. Most of your insurance companies through the medical air side, it's gonna capture all your commercial banks is gonna capture most of you community banks. Okay, Not all of them, because some of they're so small, they're not regularly on a federal basis. The one regulator which is being skipped at this point, is the National Association Insurance Commissioners. But they're apparently coming along as well. Independent federal legislation. Remember, they're regulated on the state level, not regularly on the federal level. But they've kind of realized where the ball's going and, >> well, let's make life better or simply more complex. >> It's going to make life horrible at first, but we're gonna take out incredible efficiency gains, probably after the first time you get it done. Okay, is gonna be the problem of getting it done to everyone agreeing. We use the same definitions >> of the same data. Who gets the efficiency gains? The regulators, The companies are both >> all everyone. Can you imagine that? You know Ah, Goldman Sachs earnings report comes out. You're an analyst. Looking at How do I know what Goldman? Good or bad? You have your own equity model. You just give the model to the semantic worksheet and all turn around. Say, Oh, those numbers are all good. This is what expected. Did it? Did it? Didn't you? Haven't. You could do that. There are examples of companies here in the United States where they used to have, um, competitive analysis. Okay. They would be taking somewhere on the order of 600 to 7. How 100 man hours to do the competitive analysis by having an available electronically, they cut those 600 hours down to five to do a competitive analysis. Okay, that's an example of the type of productivity you're gonna see both on the investment side when you're doing analysis, but also on the regulatory site. Can you now imagine you get a regulatory reports say, Oh, there's they're out of their way out of whack. I can tell you this fraud going on here because their numbers are too much in X y z. You know, you had to fudge numbers today, >> and so the securities analyst can spend Mme. Or his or her time looking forward, doing forecasts exactly analysis than having a look back and reconcile all this >> right? And you know, you hear it through this conference, for instance, something like 80 to 85% of the time of analysts to spend getting the data ready. >> You hear the same thing with data scientists, >> right? And so it's extent that we can helped define the data. We're going thio speed things up dramatically. But then what's really instinct to me, being an M I t engineer is that we have great possibilities. An A I I mean, really great possibilities. Right now, most of the A miles or pattern matching like you know, this idea using face shield technology that's just really doing patterns. You can do wonderful predictive analytics of a I and but we just need to give ah lot of the a m a. I am a I models the contact so they can run more quickly. OK, so we're going to see a world which is gonna found funny, But we're going to see a world. We talk about semantic analytics. Okay. Semantic analytics means I'm getting all the inputs for the analysis with context to each one of the variables. And when I and what comes out of it will be a variable results. But you also have semantics with it. So one in the future not too distant future. Where are we? We're in some of the national labs. Where are you doing it? You're doing pipelines of one model goes to next model goes the next mile. On it goes Next model. So you're gonna software pipelines, Believe or not, you get them running out of an Excel spreadsheet. You know, our modern Enhanced Excel spreadsheet, and that's where the future is gonna be. So you really? If you're gonna be really good in this business, you're gonna have to be able to use your brain. You have to understand what data means You're going to figure out what your modeling really means. What happens if we were, You know, normally for a lot of the stuff we do bell curves. Okay, well, that doesn't have to be the only distribution you could do fat tail. So if you did fat tail descriptions that a bell curve gets you much different results. Now, which one's better? I don't know, but, you know, and just using example >> to another cut in the data. So our view now talk about more about the tech behind this. He's mentioned a I What about math? Machine learning? Deep learning. Yeah, that's a color to that. >> Well, the tech behind it is, believe or not, some relatively old tech. There is a technology called rd F, which is kind of turned around for a long time. It's a science kind of, ah, machine learning, not machine wearing. I'm sorry. Machine code type. Fairly simplistic definitions. Lots of angle brackets and all this stuff there is a higher level. That was your distracted, I think put into standard in, like, 2000 for 2005. Called out. Well, two point. Oh, and it does a lot at a higher level. The same stuff that already f does. Okay, you could also create, um, believer, not your own special ways of a communicating and ontology just using XML. Okay, So, uh, x b r l is an enhanced version of XML, okay? And so some of these older technologies, quote unquote old 20 years old, are essentially gonna be driving a lot of this stuff. So you know you know Corbett, right? Corba? Is that what a maid omg you know, on the communication and press thing, do you realize that basically every single device in the world has a corpus standard at okay? Yeah, omg Standard isn't all your smartphones and all your computers. And and that's how they communicate. It turns out that a lot of this old stuff quote unquote, is so rigidly well defined. Well done that you can build modern stuff that takes us to the Mars based on these old standards. >> All right, we got to go. But I gotta give you the award for the most acronyms >> HR 15 30 fi G o m g s b r >> m fsoc tarp. Oh, fr already halfway. We knew that Owl XML ex brl corba, Which of course >> I do. But that's well done. Like thanks so much for coming. Everyone tried to have you. All right, keep it right there, everybody, We'll be back with our next guest from M i t cdo I Q right after this short, brief short message. Thank you

Published Date : Aug 1 2019

SUMMARY :

Brought to you by A lot of acronym stands for M I. T. Of course, the great institution. in the same company, you know, we Sometimes engineers arrive and they could do some things. And it Boy, if you put in some data data capital in there, you really explosions. of the United States government and trying to roll up all the expenses into one kind So they're to G et o reports out criticizing how was done, and the government's I forget the exact invitation You pull out the net net income information and says its net income, but you don't know what it attaches So it also goes back, and they're serving as you get farther and farther out the tree, Okay, how does this relate to the financial and the 15 30 is going to dramatically change the way, So one of the things we have advised is that No, the machine to machine is coming in with son Okay, you have various So if you like at a sec Okay, so so you could have the machines go and check scale. I mean, Holland's reporting something on the order of 90%. We say pick up. you're taking people out of the whole cycle. Explain the OMG You remember? go through to the calculation, assuming you could have a calculation routine to of you community banks. gains, probably after the first time you get it done. of the same data. You just give the model to the semantic worksheet and all turn around. and so the securities analyst can spend Mme. And you know, you hear it through this conference, for instance, something like 80 to 85% of the time You have to understand what data means You're going to figure out what your modeling really means. to another cut in the data. on the communication and press thing, do you realize that basically every single device But I gotta give you the award for the most acronyms We knew that Owl Thank you

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillPERSON

0.99+

ObamaPERSON

0.99+

TrumpPERSON

0.99+

IBMORGANIZATION

0.99+

LarsPERSON

0.99+

IndiaLOCATION

0.99+

2017DATE

0.99+

DavidPERSON

0.99+

fiveQUANTITY

0.99+

GoldmanORGANIZATION

0.99+

IssaPERSON

0.99+

Federated Enterprise Risk Management GroupORGANIZATION

0.99+

80QUANTITY

0.99+

600 hoursQUANTITY

0.99+

Financial Transparency ActTITLE

0.99+

CongressORGANIZATION

0.99+

60%QUANTITY

0.99+

Maxine Waters CommitteeORGANIZATION

0.99+

Silicon Angle MediaORGANIZATION

0.99+

TokyoLOCATION

0.99+

90%QUANTITY

0.99+

20 yearsQUANTITY

0.99+

United StatesLOCATION

0.99+

MariaPERSON

0.99+

600QUANTITY

0.99+

National Association Insurance CommissionersORGANIZATION

0.99+

Brass Rat CapitalORGANIZATION

0.99+

CaliforniaLOCATION

0.99+

CitibankORGANIZATION

0.99+

Goldman SachsORGANIZATION

0.99+

ExcelTITLE

0.99+

FERCORGANIZATION

0.99+

Lars ToomrePERSON

0.99+

15 30TITLE

0.99+

2005DATE

0.99+

two leadersQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SECORGANIZATION

0.99+

AustraliaLOCATION

0.99+

three yearsQUANTITY

0.99+

threeQUANTITY

0.99+

7QUANTITY

0.99+

NISTORGANIZATION

0.99+

Open Data Act of 2014TITLE

0.99+

25 negative votesQUANTITY

0.99+

85%QUANTITY

0.99+

todayDATE

0.99+

50QUANTITY

0.99+

two yearsQUANTITY

0.99+

SarahPERSON

0.99+

yesterdayDATE

0.99+

Veterans AffairsORGANIZATION

0.99+

five partsQUANTITY

0.99+

bothQUANTITY

0.98+

first timeQUANTITY

0.98+

RepublicanORGANIZATION

0.98+

oneQUANTITY

0.98+

two weeks agoDATE

0.98+

one conceptQUANTITY

0.98+

DARPAORGANIZATION

0.98+

10,000 timesQUANTITY

0.98+

firstQUANTITY

0.98+

New YorkLOCATION

0.98+

AlexPERSON

0.98+

United States governmentORGANIZATION

0.98+

VaderPERSON

0.98+

one dayQUANTITY

0.98+

about 50 casesQUANTITY

0.98+

TreasuryORGANIZATION

0.97+

government Affairs CommitteeORGANIZATION

0.97+

MarsLOCATION

0.97+

Object Management GroupORGANIZATION

0.97+

Government Data actTITLE

0.96+

earlier this yearDATE

0.96+

OMGORGANIZATION

0.96+

TeffPERSON

0.96+

100QUANTITY

0.96+

six yearsQUANTITY

0.96+

BeaverPERSON

0.95+

two significant financial programsQUANTITY

0.94+

two pointQUANTITY

0.94+

third generationQUANTITY

0.94+

Jeanne Ross, MIT CISR | MIT CDOIQ 2019


 

(techno music) >> From Cambridge, Massachusetts, it's theCUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. >> Welcome back to MIT CDOIQ. The CDO Information Quality Conference. You're watching theCUBE, the leader in live tech coverage. My name is Dave Vellante. I'm here with my co-host, Paul Gillin. This is our day two of our two day coverage. Jean Ross is here. She's the principle research scientist at MIT CISR, Jean good to see you again. >> Nice to be here! >> Welcome back. Okay, what do all these acronyms stand for, I forget. MIT CISR. >> CISR which we pronounce scissor, is the Center for Information Systems Research. It's a research center that's been at MIT since 1974, studying how big companies use technology effectively. >> So and, what's your role as a research scientist? >> As a research scientist, I work with both researchers and with company leaders to understand what's going on out there, and try to present some simple succinct ideas about how companies can generate greater value from information technology. >> Well, I guess not much has changed in information technology since 1974. (laughing) So let's fast forward to the big, hot trend, digital transformation, digital business. What's the difference between a business and a digital business? >> Right now, you're hoping there's no difference for you and your business. >> (chuckling) Yeah, for sure. >> The main thing about a digital business is it's being inspired by technology. So in the past, we would establish a strategy, and then we would check out technology and say, okay, how can technology make us more effective with that strategy? Today, and this has been driven a lot by start-ups, we have to stop and say, well wait a minute, what is technology making possible? Because if we're not thinking about it, there sure are a lot of students at MIT who are, and we're going to miss the boat. We're going to get Ubered if you will, somebody's going to think of a value proposition that we should be offering and aren't, and we'll be left in the dust. So, our digital businesses are those that are recognizing the opportunities that digital technologies make possible. >> Now, and what about data? In terms of the role of digital business, it seems like that's an underpinning of a digital business. Is it not? >> Yeah, the single biggest capability that digital technologies provide, is ubiquitous data that's readily accessible anytime. So when we think about being inspired by technology, we could reframe that as inspired by the availability of ubiquitous data that's readily accessible. >> Your premise about the difference between digitization and digital business is interesting. It's more than just a sematic debate. Do companies now, when companies talk about digital transformation these days, in fact, are most of them of thinking of digitization rather than really transformative business change? >> Yeah, this is so interesting to me. In 2006, we wrote a book that said, you need to become more agile, and you need to rely on information technology to get you there. And these are basic things like SAP and salesforce.com and things like that. Just making sure that your core processes are disciplined and reliable and predictable. We said this in 2006. What we didn't know is that we were explaining digitization, which is very effective use of technology in your underlying process. Today, when somebody says to me, we're going digital, I'm thinking about the new value propositions, the implications of the data, right? And they're often actually saying they're finally doing what we thought they should do in 2006. The problem is, in 2006, we said get going on this, it's a long journey. This could take you six, 10 years to accomplish. And then we gave examples of companies that took six to 10 years. LEGO, and USAA and really great companies. And now, companies are going, "Ah, you know, we really ought to do that". They don't have six to 10 years. They get this done now, or they're in trouble, and it's still a really big deal. >> So how realistic is it? I mean, you've got big established companies that have got all these information silos, as we've been hearing for the last two days, just pulling their information together, knowing what they've got is a huge challenge for them. Meanwhile, you're competing with born on the web, digitally native start-ups that don't have any of that legacy, is it really feasible for these companies to reinvent themselves in the way you're talking about? Or should they just be buying the companies that have already done it? >> Well good luck with buying, because what happens is that when a company starts up, they can do anything, but they can't do it to scale. So most of these start-ups are going to have to sell themselves because they don't know anything about scale. And the problem is, the companies that want to buy them up know about the scale of big global companies but they don't know how to do this seamlessly because they didn't do the basic digitization. They relied on basically, a lot of heroes in their company to pull of the scale. So now they have to rely more on technology than they did in the past, but they still have a leg up if you will, on the start-up that doesn't want to worry about the discipline of scaling up a good idea. They'd rather just go off and have another good idea, right? They're perpetual entrepreneurs if you will. So if we look at the start-ups, they're not really your concern. Your concern is the very well run company, that's been around, knows how to be inspired by technology and now says, "Oh I see what you're capable of doing, "or should be capable of doing. "I think I'll move into your space". So this, the Amazon's, and the USAA's and the LEGO's who say "We're good at what we do, "and we could be doing more". We're watching Schneider Electric, Phillips's, Ferovial. These are big ole companies who get digital, and they are going to start moving into a lot of people's territory. >> So let's take the example of those incumbents that you've used as examples of companies that are leaning into digital, and presumably doing a good job of it, they've got a lot of legacy debt, as you know people call it technical debt. The question I have is how they're using machine intelligence. So if you think about Facebook, Amazon, Microsoft, Google, they own horizontal technologies around machine intelligence. The incumbents that you mentioned, do not. Now do they close the gap? They're not going to build their own A.I. They're going to buy it, and then apply it. It's how they apply it that's going to be the difference. So do you agree with that premise, and where are they getting it, do they have the skill sets to do it, how are they closing that gap? >> They're definitely partnering. When you say they're not going to build any of it, that's actually not quite true. They're going to build a lot around the edges. They'll rely on partners like Microsoft and Google to provide some of the core, >> Yes, right. >> But they are bringing in their own experts to take it to the, basically to the customer level. How do I take, let me just take Schneider Electric for an example. They have gone from being an electrical equipment manufacturer, to a purveyor of energy management solutions. It's quite a different value proposition. To do that, they need a lot of intelligence. Some of it is data analytics of old, and some of it is just better representation on dashboards and things like that. But there is a layer of intelligence that is new, and it is absolutely essential to them by relying on partners and their own expertise in what they do for customers, and then co-creating a fair amount with customers, they can do things that other companies cannot. >> And they're developing a software presumably, a SAS revenue stream as part of that, right? >> Yeah, absolutely. >> How about the innovators dilemma though, the problem that these companies often have grown up, they're very big, they're very profitable, they see disruption coming, but they are unable to make the change, their shareholders won't let them make the change, they know what they have to do, but they're simply not able to do it, and then they become paralyzed. Is there a -- I mean, looking at some of the companies you just mentioned, how did they get over that mindset? >> This is real leadership from CEO's, who basically explain to their boards and to their investors, this is our future, we are... we're either going this direction or we're going down. And they sell it. It's brilliant salesmanship, and it's why when we go out to study great companies, we don't have that many to choose from. I mean, they are hard to find, right? So you are at such a competitive advantage right now. If you understand, if your own internal processes are cleaned up and you know how to rely on the E.R.P's and the C.R.M's, to get that done, and on the other hand, you're using the intelligence to provide value propositions, that new technologies and data make possible, that is an incredibly powerful combination, but you have to invest. You have to convince your boards and your investors that it's a good idea, you have to change your talent internally, and the biggest surprise is, you have to convince your customers that they want something from you that they never wanted before. So you got a lot of work to do to pull this off. >> Right now, in today's economy, the economy is sort of lifting all boats. But as we saw when the .com implosion happened in 2001, often these breakdown gives birth to great, new companies. Do you see that the next recession, which is inevitably coming, will be sort of the turning point for some of these companies that can't change? >> It's a really good question. I do expect that there are going to be companies that don't make it. And I think that they will fail at different rates based on their, not just the economy, but their industry, and what competitors do, and things like that. But I do think we're going to see some companies fail. We're going to see many other companies understand that they are too complex. They are simply too complex. They cannot do things end to end and seamlessly and present a great customer experience, because they're doing everything. So we're going to see some pretty dramatic changes, we're going to see failure, it's a fair assumption that when we see the economy crash, it's also going to contribute, but that's, it's not the whole story. >> But when the .com blew up, you had the internet guys that actually had a business model to make money, and the guys that didn't, the guys that didn't went away, and then you also had the incumbents that embrace the internet, so when we came out of that .com downturn, you had the survivors, who was Google and eBay, and obviously Amazon, and then you had incumbent companies who had online retailing, and e-tailing and e-commerce etc, who thrived. I would suspect you're going to see something similar, but I wonder what you guys think. The street today is rewarding growth. And we got another near record high today after the rate cut yesterday. And so, but companies that aren't making money are getting rewarded, 'cause they're growing. Well when the recession comes, those guys are going to get crushed. >> Right. >> Yeah. >> And you're going to have these other companies emerge, and you'll see the winners, are going to be those ones who have truly digitized, not just talking the talk, or transformed really, to use your definition. That's what I would expect. I don't know, what do you think about that? >> I totally agree. And, I mean, we look at industries like retail, and they have been fundamentally transformed. There's still lots of opportunities for innovation, and we're going to see some winners that have kind of struggled early but not given up, and they're kind of finding their footing. But we're losing some. We're losing a lot, right? I think the surprise is that we thought digital was going to replace what we did. We'd stop going to stores, we'd stop reading books, we wouldn't have newspapers anymore. And it hasn't done that. Its only added, it hasn't taken anything away. >> It could-- >> I don't think the newspaper industry has been unscathed by digital. >> No, nor has retail. >> Nor has retail, right. >> No, no no, not unscathed, but here's the big challenge. Is if I could substitute, If I could move from newspaper to online, I'm fine. You don't get to do that. You add online to what you've got, right? And I think this right now is the big challenge. Is that nothing's gone away, at least yet. So we have to sustain the business we are, so that it can feed the business we want to be. And we have to make that transition into new capabilities. I would argue that established companies need to become very binary, that there are people that do nothing but sustain and make better and better and better, who they are. While others, are creating the new reality. You see this in auto companies by the way. They're creating not just the autonomous automobiles, but the mobility services, the whole new value propositions, that will become a bigger and bigger part of their revenue stream, but right now are tiny. >> So, here's the scary thing to me. And again, I'd love to hear your thoughts on this. And I've been an outspoken critic of Liz Warren's attack on big tech. >> Absolutely. >> I just think if they're breaking the law, and they're really acting like monopolies, the D.O.J and F.T.C should do something, but to me, you don't just break up big tech because they're good capitalists. Having said that, one of the things that scares me is, when you see Apple getting into payment systems, Amazon getting into grocery and logistics. Digital allows you to do something that's never happened before which is, you can traverse industries. >> Yep. >> Yeah, absolutely >> You used to have this stack of industries, and if you were in that industry, you're stuck in healthcare, you're stuck in financial services or whatever it was. And today, digital allows you to traverse those. >> It absolutely does. And so in theory, Amazon and Apple and Facebook and Google, they can attack virtually any industry and they kind of are. >> Yeah they kind are. I would certainly not break up anything. I would really look hard though at acquisitions, because I think that's where some of this is coming from. They can stop the overwhelming growth, but I do think you're right. That you get these opportunities from digital that are just so much easier because they're basically sharing information and technology, not building buildings and equipment and all that kind of thing. But I think there all limits to all this. I do not fear these companies. I think there, we need some law, we need some regulations, they're fine. They are adding a lot of value and the great companies, I mean, you look at the Schneider's and the Phillips, yeah they fear what some of them can do, but they're looking forward to what they provide underneath. >> Doesn't Cloud change the equation here? I mean, when you think of something like Amazon getting into the payments business, or Google in the payments business, you know it used to be that the creating of global payments processing network, just going global was a huge barrier to entry. Now, you don't have nearly that same level of impediment right? I mean the cloud eliminates much of the traditional barrier. >> Yeah, but I'll tell you what limits it, is complexity. Every company we've studied gets a little over anxious and becomes too complex, and they cannot run themselves effectively anymore. It happens to everyone. I mean, remember when we were terrified about what Microsoft was going to become? But then it got competition because it's trying to do so many things, and somebody else is offering, Sales Force and others, something simpler. And this will happen to every company that gets overly ambitious. Something simpler will come along, and everybody will go "Oh thank goodness". Something simpler. >> Well with Microsoft, I would argue two things. One is the D.O.J put some handcuffs on them , and two, with Steve Ballmer, I wouldn't get his nose out of Windows, and then finally stuck on a (mumbles) (laughter) >> Well it's they had a platform shift. >> Well this is exactly it. They will make those kind of calls . >> Sure, and I think that talks to their legacy, that they won't end up like Digital Equipment Corp or Wang and D.G, who just ignored the future and held onto the past. But I think, a colleague of ours, David Moschella wrote a book, it's called "Seeing Digital". And his premise was we're moving from a world of remote cloud services, to one where you have to, to use your word, ubiquitous digital services that you can access upon which you can build your business and new business models. I mean, the simplest example is Waves, you mentioned Uber. They're using Cloud, they're using OAuth.in with Google, Facebook or LinkedIn and they've got a security layer, there's an A.I layer, there's all your BlockChain, mobile, cognitive, it's all these sets of services that are now ubiquitous on which you're building, so you're leveraging, he calls it the matrix, to the extent that these companies that you're studying, these incumbents can leverage that matrix, they should be fine. >> Yes. >> The part of the problem is, they say "No, we're going to invent everything ourselves, we're going to build it all ourselves". To use Andy Jassy's term, it's non-differentiated heavy lifting, slows them down, but there's no reason why they can't tap that matrix, >> Absolutely >> And take advantage of it. Where I do get scared is, the Facebooks, Apples, Googles, Amazons, they're matrix companies, their data is at their core, and they get this. It's not like they're putting data around the core, data is the core. So your thoughts on that? I mean, it looks like your slide about disruption, it's coming. >> Yeah, yeah, yeah, yeah. >> No industry is safe. >> Yeah, well I'll go back to the complexity argument. We studied complexity at length, and complexity is a killer. And as we get too ambitious, and we're constantly looking for growth, we start doing things that create more and more tensions in our various lines of business, causes to create silos, that then we have to coordinate. I just think every single company that, no cloud is going to save us from this. It, complexity will kill us. And we have to keep reminding ourselves to limit that complexity, and we've just not seen the example of the company that got that right. Sooner or later, they just kind of chop them, you know, create problems for themselves. >> Well isn't that inherent though in growth? >> Absolutely! >> It's just like, big companies slow down. >> That's right. >> They can't make decisions as quickly. >> That's right. >> I haven't seen a big company yet that moves nimbly. >> Exactly, and that's the complexity thing-- >> Well wait a minute, what about AWS? They're a 40 billion dollar company. >> Oh yeah, yeah, yeah >> They're like the agile gorilla. >> Yeah, yeah, yeah. >> I mean, I think they're breaking the rule, and my argument would be, because they have data at their core, and they've got that, its a bromide, but that common data model, that they can apply now to virtually any business. You know, we're been expecting, a lot of people have been expecting that growth to attenuate. I mean it hasn't yet, we'll see. But they're like a 40 billion dollar firm-- >> No that's a good example yeah. >> So we'll see. And Microsoft, is the other one. Microsoft is demonstrating double digit growth. For such a large company, it's astounding. I wonder, if the law of large numbers is being challenged, so. >> Yeah, well it's interesting. I do think that what now constitutes "so big" that you're really going to struggle with the complexity. I think that has definitely been elevated a lot. But I still think there will be a point at which human beings can't handle-- >> They're getting away. >> Whatever level of complexity we reach, yeah. >> Well sure, right because even though this great new, it's your point. Cloud technology, you know, there's going to be something better that comes along. Even, I think Jassy might have said, If we had to do it all over again, we would have built the whole thing on lambda functions >> Yeah. >> Oh, yeah. >> Not on, you know so there you go. >> So maybe someone else does that-- >> Yeah, there you go. >> So now they've got their hybrid. >> Yeah, yeah. >> Yeah, absolutely. >> You know maybe it'll take another ten years, but well Jean, thanks so much for coming to theCUBE, >> it was great to have you. >> My pleasure! >> Appreciate you coming back. >> Really fun to talk. >> All right, keep right there everybody, Paul Gillin and Dave Villante, we'll be right back from MIT CDOIQ, you're watching theCUBE. (chuckles) (techno music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. Jean good to see you again. Okay, what do all these acronyms stand for, I forget. is the Center for Information Systems Research. to understand what's going on out there, So let's fast forward to the big, hot trend, for you and your business. We're going to get Ubered if you will, Now, and what about data? Yeah, the single biggest capability and digital business is interesting. information technology to get you there. to reinvent themselves in the way you're talking about? and they are going to start moving into It's how they apply it that's going to be the difference. They're going to build a lot around the edges. and it is absolutely essential to them I mean, looking at some of the companies you just mentioned, and the biggest surprise is, you have to convince often these breakdown gives birth to great, new companies. I do expect that there are going to be companies and then you also had the incumbents I don't know, what do you think about that? and they have been fundamentally transformed. I don't think the newspaper industry so that it can feed the business we want to be. So, here's the scary thing to me. but to me, you don't just break up big tech and if you were in that industry, they can attack virtually any industry and they kind of are. But I think there all limits to all this. I mean, when you think of something like and they cannot run themselves effectively anymore. One is the D.O.J put some handcuffs on them , Well this is exactly it. Sure, and I think that talks to their legacy, The part of the problem is, they say data is the core. that then we have to coordinate. Well wait a minute, what about AWS? that growth to attenuate. And Microsoft, is the other one. I do think that what now constitutes "so big" that you're there's going to be something better that comes along. Paul Gillin and Dave Villante,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

Paul GillinPERSON

0.99+

AmazonORGANIZATION

0.99+

David MoschellaPERSON

0.99+

FacebookORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Jean RossPERSON

0.99+

2006DATE

0.99+

sixQUANTITY

0.99+

Steve BallmerPERSON

0.99+

Jeanne RossPERSON

0.99+

Liz WarrenPERSON

0.99+

LEGOORGANIZATION

0.99+

AppleORGANIZATION

0.99+

Schneider ElectricORGANIZATION

0.99+

Dave VillantePERSON

0.99+

AmazonsORGANIZATION

0.99+

GooglesORGANIZATION

0.99+

JeanPERSON

0.99+

FacebooksORGANIZATION

0.99+

PhillipsORGANIZATION

0.99+

USAAORGANIZATION

0.99+

Center for Information Systems ResearchORGANIZATION

0.99+

ApplesORGANIZATION

0.99+

Andy JassyPERSON

0.99+

AWSORGANIZATION

0.99+

FerovialORGANIZATION

0.99+

Digital Equipment CorpORGANIZATION

0.99+

2001DATE

0.99+

1974DATE

0.99+

two dayQUANTITY

0.99+

twoQUANTITY

0.99+

UberORGANIZATION

0.99+

D.O.JORGANIZATION

0.99+

yesterdayDATE

0.99+

eBayORGANIZATION

0.99+

40 billion dollarQUANTITY

0.99+

MITORGANIZATION

0.99+

JassyPERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

todayDATE

0.99+

10 yearsQUANTITY

0.99+

ten yearsQUANTITY

0.99+

TodayDATE

0.99+

OneQUANTITY

0.99+

CISRORGANIZATION

0.98+

MIT CISRORGANIZATION

0.98+

Seeing DigitalTITLE

0.98+

two thingsQUANTITY

0.98+

singleQUANTITY

0.97+

UberedORGANIZATION

0.97+

LinkedInORGANIZATION

0.97+

WindowsTITLE

0.96+

OAuth.inTITLE

0.96+

oneQUANTITY

0.94+

Wang and D.GORGANIZATION

0.94+

CDO Information Quality ConferenceEVENT

0.94+

D.O.JPERSON

0.87+

Gokula Mishra | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE covering MIT Chief Data Officer and Information Quality Symposium 2019 brought to you by SiliconANGLE Media. (upbeat techno music) >> Hi everybody, welcome back to Cambridge, Massachusetts. You're watching theCUBE, the leader in tech coverage. We go out to the events. We extract the signal from the noise, and we're here at the MIT CDOIQ Conference, Chief Data Officer Information Quality Conference. It is the 13th year here at the Tang building. We've outgrown this building and have to move next year. It's fire marshal full. Gokula Mishra is here. He is the Senior Director of Global Data and Analytics and Supply Chain-- >> Formerly. Former, former Senior Director. >> Former! I'm sorry. It's former Senior Director of Global Data Analytics and Supply Chain at McDonald's. Oh, I didn't know that. I apologize my friend. Well, welcome back to theCUBE. We met when you were at Oracle doing data. So you've left that, you're on to your next big thing. >> Yes, thinking through it. >> Fantastic, now let's start with your career. You've had, so you just recently left McDonald's. I met you when you were at Oracle, so you cut over to the dark side for a while, and then before that, I mean, you've been a practitioner all your life, so take us through sort of your background. >> Yeah, I mean my beginning was really with a company called Tata Burroughs. Those days we did not have a lot of work getting done in India. We used to send people to U.S. so I was one of the pioneers of the whole industry, coming here and working on very interesting projects. But I was lucky to be working on mostly data analytics related work, joined a great company called CS Associates. I did my Master's at Northwestern. In fact, my thesis was intelligent databases. So, building AI into the databases and from there on I have been with Booz Allen, Oracle, HP, TransUnion, I also run my own company, and Sierra Atlantic, which is part of Hitachi, and McDonald's. >> Awesome, so let's talk about use of data. It's evolved dramatically as we know. One of the themes in this conference over the years has been sort of, I said yesterday, the Chief Data Officer role emerged from the ashes of sort of governance, kind of back office information quality compliance, and then ascended with the tailwind of the Big Data meme, and it's kind of come full circle. People are realizing actually to get value out of data, you have to have information quality. So those two worlds have collided together, and you've also seen the ascendancy of the Chief Digital Officer who has really taken a front and center role in some of the more strategic and revenue generating initiatives, and in some ways the Chief Data Officer has been a supporting role to that, providing the quality, providing the compliance, the governance, and the data modeling and analytics, and a component of it. First of all, is that a fair assessment? How do you see the way in which the use of data has evolved over the last 10 years? >> So to me, primarily, the use of data was, in my mind, mostly around financial reporting. So, anything that companies needed to run their company, any metrics they needed, any data they needed. So, if you look at all the reporting that used to happen it's primarily around metrics that are financials, whether it's around finances around operations, finances around marketing effort, finances around reporting if it's a public company reporting to the market. That's where the focus was, and so therefore a lot of the data that was not needed for financial reporting was what we call nowadays dark data. This is data we collect but don't do anything with it. Then, as the capability of the computing, and the storage, and new technologies, and new techniques evolve, and are able to handle more variety and more volume of data, then people quickly realize how much potential they have in the other data outside of the financial reporting data that they can utilize too. So, some of the pioneers leverage that and actually improved a lot in their efficiency of operations, came out with innovation. You know, GE comes to mind as one of the companies that actually leverage data early on, and number of other companies. Obviously, you look at today data has been, it's defining some of the multi-billion dollar company and all they have is data. >> Well, Facebook, Google, Amazon, Microsoft. >> Exactly. >> Apple, I mean Apple obviously makes stuff, but those other companies, they're data companies. I mean largely, and those five companies have the highest market value on the U.S. stock exchange. They've surpassed all the other big leaders, even Berkshire Hathaway. >> So now, what is happening is because the market changes, the forces that are changing the behavior of our consumers and customers, which I talked about which is everyone now is digitally engaging with each other. What that does is all the experiences now are being captured digitally, all the services are being captured digitally, all the products are creating a lot of digital exhaust of data and so now companies have to pay attention to engage with their customers and partners digitally. Therefore, they have to make sure that they're leveraging data and analytics in doing so. The other thing that has changed is the time to decision to the time to act on the data inside that you get is shrinking, and shrinking, and shrinking, so a lot more decision-making is now going real time. Therefore, you have a situation now, you have the capability, you have the technology, you have the data now, you have to make sure that you convert that in what I call programmatic kind of data decision-making. Obviously, there are people involved in more strategic decision-making. So, that's more manual, but at the operational level, it's going more programmatic decision-making. >> Okay, I want to talk, By the way, I've seen a stat, I don't know if you can confirm this, that 80% of the data that's out there today is dark data or it's data that's behind a firewall or not searchable, not open to Google's crawlers. So, there's a lot of value there-- >> So, I would say that percent is declining over time as companies have realized the value of data. So, more and more companies are removing the silos, bringing those dark data out. I think the key to that is companies being able to value their data, and as soon as they are able to value their data, they are able to leverage a lot of the data. I still believe there's a large percent still not used or accessed in companies. >> Well, and of course you talked a lot about data monetization. Doug Laney, who's an expert in that topic, we had Doug on a couple years ago when he, just after, he wrote Infonomics. He was on yesterday. He's got a very detailed prescription as to, he makes strong cases as to why data should be valued like an asset. I don't think anybody really disagrees with that, but then he gave kind of a how-to-do-it, which will, somewhat, make your eyes bleed, but it was really well thought out, as you know. But you talked a lot about data monetization, you talked about a number of ways in which data can contribute to monetization. Revenue, cost reduction, efficiency, risk, and innovation. Revenue and cost is obvious. I mean, that's where the starting point is. Efficiency is interesting. I look at efficiency as kind of a doing more with less but it's sort of a cost reduction, but explain why it's not in the cost bucket, it's different. >> So, it is first starts with doing what we do today cheaper, better, faster, and doing more comes after that because if you don't understand, and data is the way to understand how your current processes work, you will not take the first step. So, to take the first step is to understand how can I do this process faster, and then you focus on cheaper, and then you focus on better. Of course, faster is because of some of the market forces and customer behavior that's driving you to do that process faster. >> Okay, and then the other one was risk reduction. I think that makes a lot of sense here. Actually, let me go back. So, one of the key pieces of it, of efficiency is time to value. So, if you can compress the time, or accelerate the time and you get the value that means more cash in house faster, whether it's cost reduction or-- >> And the other aspect you look at is, can you automate more of the processes, and in that way it can be faster. >> And that hits the income statement as well because you're reducing headcount cost of your, maybe not reducing headcount cost, but you're getting more out of different, out ahead you're reallocating them to more strategic initiatives. Everybody says that but the reality is you hire less people because you just automated. And then, risk reduction, so the degree to which you can lower your expected loss. That's just instead thinking in insurance terms, that's tangible value so certainly to large corporations, but even midsize and small corporations. Innovation, I thought was a good one, but maybe you could use an example of, give us an example of how in your career you've seen data contribute to innovation. >> So, I'll give an example of oil and gas industry. If you look at speed of innovation in the oil and gas industry, they were all paper-based. I don't know how much you know about drilling. A lot of the assets that goes into figuring out where to drill, how to drill, and actually drilling and then taking the oil or gas out, and of course selling it to make money. All of those processes were paper based. So, if you can imagine trying to optimize a paper-based innovation, it's very hard. Not only that, it's very, very by itself because it's on paper, it's in someone's drawer or file. So, it's siloed by design and so one thing that the industry has gone through, they recognize that they have to optimize the processes to be better, to innovate, to find, for example, shale gas was a result output of digitizing the processes because otherwise you can't drill faster, cheaper, better to leverage the shale gas drilling that they did. So, the industry went through actually digitizing a lot of the paper assets. So, they went from not having data to knowingly creating the data that they can use to optimize the process and then in the process they're innovating new ways to drill the oil well cheaper, better, faster. >> In the early days of oil exploration in the U.S. go back to the Osage Indian tribe in northern Oklahoma, and they brilliantly, when they got shuttled around, they pushed him out of Kansas and they negotiated with the U.S. government that they maintain the mineral rights and so they became very, very wealthy. In fact, at one point they were the wealthiest per capita individuals in the entire world, and they used to hold auctions for various drilling rights. So, it was all gut feel, all the oil barons would train in, and they would have an auction, and it was, again, it was gut feel as to which areas were the best, and then of course they evolved, you remember it used to be you drill a little hole, no oil, drill a hole, no oil, drill a hole. >> You know how much that cost? >> Yeah, the expense is enormous right? >> It can vary from 10 to 20 million dollars. >> Just a giant expense. So, now today fast-forward to this century, and you're seeing much more sophisticated-- >> Yeah, I can give you another example in pharmaceutical. They develop new drugs, it's a long process. So, one of the initial process is to figure out what molecules this would be exploring in the next step, and you could have thousand different combination of molecules that could treat a particular condition, and now they with digitization and data analytics, they're able to do this in a virtual world, kind of creating a virtual lab where they can test out thousands of molecules. And then, once they can bring it down to a fewer, then the physical aspect of that starts. Think about innovation really shrinking their processes. >> All right, well I want to say this about clouds. You made the statement in your keynote that how many people out there think cloud is cheaper, or maybe you even said cheap, but cheaper I inferred cheaper than an on-prem, and so it was a loaded question so nobody put their hand up they're afraid, but I put my hand up because we don't have any IT. We used to have IT. It was a nightmare. So, for us it's better but in your experience, I think I'm inferring correctly that you had meant cheaper than on-prem, and certainly we talked to many practitioners who have large systems that when they lift and shift to the cloud, they don't change their operating model, they don't really change anything, they get a bill at the end of the month, and they go "What did this really do for us?" And I think that's what you mean-- >> So what I mean, let me make it clear, is that there are certain use cases that cloud is and, as you saw, that people did raise their hand saying "Yeah, I have use cases where cloud is cheaper." I think you need to look at the whole thing. Cost is one aspect. The flexibility and agility of being able to do things is another aspect. For example, if you have a situation where your stakeholder want to do something for three weeks, and they need five times the computing power, and the data that they are buying from outside to do that experiment. Now, imagine doing that in a physical war. It's going to take a long time just to procure and get the physical boxes, and then you'll be able to do it. In cloud, you can enable that, you can get GPUs depending on what problem we are trying to solve. That's another benefit. You can get the fit for purpose computing environment to that and so there are a lot of flexibility, agility all of that. It's a new way of managing it so people need to pay attention to the cost because it will add to the cost. The other thing I will point out is that if you go to the public cloud, because they make it cheaper, because they have hundreds and thousands of this canned CPU. This much computing power, this much memory, this much disk, this much connectivity, and they build thousands of them, and that's why it's cheaper. Well, if your need is something that's very unique and they don't have it, that's when it becomes a problem. Either you need more of those and the cost will be higher. So, now we are getting to the IOT war. The volume of data is growing so much, and the type of processing that you need to do is becoming more real-time, and you can't just move all this bulk of data, and then bring it back, and move the data back and forth. You need a special type of computing, which is at the, what Amazon calls it, adds computing. And the industry is kind of trying to design it. So, that is an example of hybrid computing evolving out of a cloud or out of the necessity that you need special purpose computing environment to deal with new situations, and all of it can't be in the cloud. >> I mean, I would argue, well I guess Microsoft with Azure Stack was kind of the first, although not really. Now, they're there but I would say Oracle, your former company, was the first one to say "Okay, we're going to put the exact same infrastructure on prem as we have in the public cloud." Oracle, I would say, was the first to truly do that-- >> They were doing hybrid computing. >> You now see Amazon with outposts has done the same, Google kind of has similar approach as Azure, and so it's clear that hybrid is here to stay, at least for some period of time. I think the cloud guys probably believe that ultimately it's all going to go to the cloud. We'll see it's going to be a long, long time before that happens. Okay! I'll give you last thoughts on this conference. You've been here before? Or is this your first one? >> This is my first one. >> Okay, so your takeaways, your thoughts, things you might-- >> I am very impressed. I'm a practitioner and finding so many practitioners coming from so many different backgrounds and industries. It's very, very enlightening to listen to their journey, their story, their learnings in terms of what works and what doesn't work. It is really invaluable. >> Yeah, I tell you this, it's always a highlight of our season and Gokula, thank you very much for coming on theCUBE. It was great to see you. >> Thank you. >> You're welcome. All right, keep it right there everybody. We'll be back with our next guest, Dave Vellante. Paul Gillin is in the house. You're watching theCUBE from MIT. Be right back! (upbeat techno music)

Published Date : Aug 1 2019

SUMMARY :

brought to you by SiliconANGLE Media. He is the Senior Director of Global Data and Analytics Former, former Senior Director. We met when you were at Oracle doing data. I met you when you were at Oracle, of the pioneers of the whole industry, and the data modeling and analytics, So, if you look at all the reporting that used to happen the highest market value on the U.S. stock exchange. So, that's more manual, but at the operational level, that 80% of the data that's out there today and as soon as they are able to value their data, Well, and of course you talked a lot and data is the way to understand or accelerate the time and you get the value And the other aspect you look at is, Everybody says that but the reality is you hire and of course selling it to make money. the mineral rights and so they became very, very wealthy. and you're seeing much more sophisticated-- So, one of the initial process is to figure out And I think that's what you mean-- and the type of processing that you need to do I mean, I would argue, and so it's clear that hybrid is here to stay, and what doesn't work. Yeah, I tell you this, Paul Gillin is in the house.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VellantePERSON

0.99+

AmazonORGANIZATION

0.99+

HitachiORGANIZATION

0.99+

AppleORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Doug LaneyPERSON

0.99+

five timesQUANTITY

0.99+

OracleORGANIZATION

0.99+

KansasLOCATION

0.99+

TransUnionORGANIZATION

0.99+

Paul GillinPERSON

0.99+

HPORGANIZATION

0.99+

three weeksQUANTITY

0.99+

IndiaLOCATION

0.99+

10QUANTITY

0.99+

Sierra AtlanticORGANIZATION

0.99+

Gokula MishraPERSON

0.99+

DougPERSON

0.99+

hundredsQUANTITY

0.99+

Berkshire HathawayORGANIZATION

0.99+

five companiesQUANTITY

0.99+

80%QUANTITY

0.99+

U.S.LOCATION

0.99+

Booz AllenORGANIZATION

0.99+

Tata BurroughsORGANIZATION

0.99+

first stepQUANTITY

0.99+

GokulaPERSON

0.99+

next yearDATE

0.99+

thousandsQUANTITY

0.99+

McDonald'sORGANIZATION

0.99+

one aspectQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

firstQUANTITY

0.99+

yesterdayDATE

0.99+

thousands of moleculesQUANTITY

0.99+

first oneQUANTITY

0.99+

OneQUANTITY

0.98+

GEORGANIZATION

0.98+

northern OklahomaLOCATION

0.98+

todayDATE

0.97+

CS AssociatesORGANIZATION

0.97+

20 million dollarsQUANTITY

0.97+

oneQUANTITY

0.96+

FirstQUANTITY

0.96+

Global Data and Analytics and Supply ChainORGANIZATION

0.95+

MIT CDOIQ ConferenceEVENT

0.95+

13th yearQUANTITY

0.94+

U.S. governmentORGANIZATION

0.93+

two worldsQUANTITY

0.92+

Azure StackTITLE

0.91+

one thingQUANTITY

0.9+

one pointQUANTITY

0.9+

NorthwesternORGANIZATION

0.9+

couple years agoDATE

0.89+

MIT Chief Data Officer and Information Quality Symposium 2019EVENT

0.87+

this centuryDATE

0.85+

Tang buildingLOCATION

0.85+

Global Data Analytics andORGANIZATION

0.83+

Chief Data Officer Information Quality ConferenceEVENT

0.81+

MITORGANIZATION

0.78+

theCUBEORGANIZATION

0.77+

thousand different combination of moleculesQUANTITY

0.74+

lastDATE

0.67+

yearsDATE

0.66+

U.S.ORGANIZATION

0.66+

billion dollarQUANTITY

0.65+

themesQUANTITY

0.65+

Osage IndianOTHER

0.64+

Matt Kobe, Chicago Bulls | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M. I. T. Chief Data officer and Information Quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to M. I. T. In Cambridge, Massachusetts. Everybody You're watching The Cube, the Leader and Live Tech coverage. My name is Dave Volante, and it's my pleasure to introduce Matt Kobe, who's the vice president of business strategy Analytics of Chicago Bulls. We love talking sports. We love talking data. Matt. Thanks for coming on. >> No problem getting a date. So talk about >> your role. Is the head of analytics for the Bulls? >> Sure. So I work exclusively on the business side of the operation. So we have a separate team that those the basketball side, which is kind of your players stuff. But on the business side, um, what we're focused on is really two things. One is being essentially internal consultants for the rest of the customer facing functions. So we work a lot with ticketing, allow its sponsorship, um, marketing digital, all of those folks that engage with our customer base and then on the backside back end of it, we're building out the technical infrastructure for the organization right. So everything from data warehouse to C. R M to email marketing All of that sits with my team. And so we were a lot of hats, which is exciting. But at the end of the day, we're trying to use data to enhance the customer and fan experience. Um and that's our aim. And that's what we're driving towards >> success in sports. In a larger respect. It's come down to don't be offended by this. Who's got the best geeks? So now your side of the house is not about like you say, player performance about the business performances. But that's it. That's a big part of getting the best players. I mean, if it's successful and all the nuances of the N B, A salary cap and everything else, but I think there is one, and so that makes it even more important. But you're helping fund. You know that in various ways, but so are the other two teams that completely separate. Is there a Chinese wall between them? Are you part of the sort of same group? >> Um, we're pretty separate. So the basketball folks do their thing. The business folks do their thing from an analytic standpoint. We meet and we collaborate on tools and other methods of actually doing the analysis. But in terms of, um, the analysis itself, there is a little bit of separation there, and mainly that is from priority standpoint. Obviously, the basketball stuff is the most important stuff. And so if we're working on both sides that we'd always be doing the basketball stuff and the business stuff needs to get done, >> drag you into exactly okay. But which came first? The chicken or the egg was It was the sort of post Moneyball activity applied to the N B. A. And I want to ask you a question about that. And then somebody said, Hey, we should do this for the business side. Or was the business side of sort of always there? >> I think I think, the business side and probably the last 5 to 7 years you've really seen it grown. So if you look at the N. B. A. I've been with the Bulls for five years. If you look at the N. B. A. 78 years ago, there was a handful of Business analytics teams and those those teams had one or two people at him. Now every single team in the NBA has some sort of business analytics team, and the average staff is seven. So my staff is six full time folks pushed myself, so we'll write it right at the average. And I think what you've seen is everything has become more complex in sports. Right? If you look at ticketing, you've got all the secondary markets. You have all this data flowing in, and they need someone to make sense of all that data. If you look at sponsorship sponsorship, his transition from selling a sign that sits on the side of the court for these truly integrated partnerships, where our partners are coming to us and saying, What do we get out of? This was our return. And so you're seeing a lot more part lot more collaboration between analytics and sponsorship to go back to those partners and say, Hey, here's what we delivered And so I think you it started on the basketball side, certainly because that's that's where the, you know that is the most important piece. But it quickly followed on the business side because they saw the value that that type of thinking can bring in the business. >> So I know this is not, you know, your swim lane, but But, you know, the lore of Billy Beane and Moneyball and all that, a sort of the starting point for sports analytics. Is that Is that Is that a fair characterization? Yeah. I mean, was that Was that really the main spring? >> I think it It probably started even before that. I think if you have got to see Billy being at the M I t Sports Analytics conference and him thought he always references kind of Bill James is first, and so I think it started. Baseball was I wouldn't say the easiest place to start, But it was. It's a one versus one, right? It's pitcher versus batter. In a lot of cases, basketball is a little bit more fluid. It's a team. Sport is a little harder, but I think as technology has advanced, there's been more and more opportunities to do the analytics on the basketball side and on the business side. I think what you're seeing is this huge. What we've heard the first day and 1/2 here, this huge influx of data, not nearly to the levels of the MasterCard's and others of the world. But as more and more things moved to the mobile phone, I think you're going to see this huge influx of data on the business side, and you're going to need the same systems in the same sort of approach to tackle it. >> S O. Bill James is the ultimate sports geek, and he's responsible for all these stats that, no, none of us understand. He's why we don't pay attention to batting average anymore. Of course, I still do. So let's talk about the business side of things. If you think about the business of baseball, you know it's all about maximizing the gate. Yeah, there's there's some revenue, a lot of revenue course from TV. But it's not like football, which is dominated by the by the TV. Basketball, I think, is probably a mix right. You got 80 whatever 82 game season, so filling up the stadium is important. Obviously, N v A has done a great job of of really getting it right. Free agency is like, fascinating. Now >> it's 12 months a year >> scored way. Talk about the NBA all the time and of course, you know, people like celebrities like LeBron have certainly helped, and now a whole batch of others. But what's the money side of the n ba look like? Where's the money coming from? >> Yeah, I mean, I think you certainly have broadcast right, but in many ways, like national broadcast sort of takes care of it itself. In some ways, from the standpoint of my team, doesn't have a lot of control over national broadcast money. That's a league level thing. And so the things that we have control over the two big buckets are ticketing and sponsorship. Those those are the two big buckets of revenue that my team spends a lot of time on. Ticketing is, is one that is important from the standpoint, as you say, which is like, How do we fill the building right? We've got 41 home game, supposed three preseason games. We got 44 events a year. Our goal is to fill the building for all 44 of those events. We do a pretty good job of doing it, but that has cascading effects into other revenue streams. Right, As you think about concessions and merchandise and sponsorship, it's a lot easier to spell spot cell of sponsorship when you're building is full, then if you're building isn't full. And so our focus is on. How do we? How do we fill the building in the most efficient way possible? And as you have things like the secondary market and people have access to tickets in different ways than they did 10 to 15 years ago, I think that becomes increasingly complex. Um, but that's the fun area that's like, That's where we spend a lot of time. There's the pricing, There's inventory management. It's a lot of, you know, is you look a traditional cpg. There's there's some of those same principles being applied, which is how do you are you looking airline right there? They're selling a plane. It's an asset you have to fill. We have ah, building. That's an asset we have to fill, and how do we fill it in the most optimal way? >> So the idea of surge pricing demand supply, But so several years ago, the Red Sox went to a tiered pricing. You guys do the same If the Sox are playing Kansas City Royals tickets way cheaper than if they're playing the Yankees. You guys do a similar. So >> we do it for single game tickets. So far are season ticket holders. It's the same price for every game, but on the price for primary tickets for single games, right? So if we're playing, you know this year will be the Clippers and the Lakers. That price is going to be much more expensive, so we dynamically price on a game to game basis. But our season ticket holders pay this. >> Why don't you do it for the season ticket holders? Um, just haven't gone there yet. >> Yeah, I mean, there's some teams have, right, so there's a few different approaches you convey. Lovely price. Those tickets, I think, for for us, the there's in years past. In the last few years, in particular, there's been a couple of flagship games, and then every other game feels similar. I think this will be the first year where you have 8 to 10 teams that really have a shot at winning the title, and so I think you'll see a more balanced schedule. Um, and so we've We've talked about it a lot. We just haven't gone to that made that move yet? >> Well, a season ticket holder that shares his tickets with seven other guys with red sauce. You could buy a BMW. You share the tickets, so but But I would love it if they didn't do the tiered. Pricing is a season ticket holder, so hope you hold off a while, but I don't know. It could maximize revenues if the Red Sox that was probably not a stupid thing is they're smart people. What about the sponsorships? Is fascinating about the partners looking for our ally. How are you measuring that? You're building your forging a tighter relationship, obviously, with the sponsors in these partners. Yeah, what's that are? Why look like it's >> measured? A variety of relies, largely based on the assets that they deliver. But I think every single partner we talk to these days, I also leave the sponsorship team. So I oversee. It's It's rare in sports, but I stayed over business strategy and Alex and sponsorship team. Um, it's not my title, but in practice, that's what I do. And I think everyone we talked to wants digital right? They want we've got over 25,000,000 social media followers with the Bulls, right? We've got 19,000,000 on Facebook alone. And so sponsors see those numbers and they know that we can deliver impression. They know we can deliver engagement and they want access to those channels. And so, from a return on, I always call a return on objectives, right? Return on investment is a little bit tricky, but return on objectives is if we're trying to reel brand awareness, we're gonna go back to them and say, Here's how many people came to our arena and saw your logo and saw the feature that you had on the scoreboard. If you're on our social media channels or a website, here's the number of impressions you got. Here is the number of engagements you got. I think where we're at now is Maura's Bad Morris. Still better, right? Everyone wants the big numbers. I think where you're starting to see it move, though, is that more isn't always better. We want the right folks engaging with our brands, and that's really what we're starting to think about is if you get 10,000,000 impressions, but they're 10,000,000 impressions to the wrong group of potential customers, that's not terribly helpful. for a brand. We're trying to work with our brands to reach the right demographics that they want to reach in order to actually build that brand awareness they want to build. >> What, What? Your primary social channels. Twitter, Obviously. >> So every platform has a different purpose way. Have Facebook, Twitter, instagram, Snapchat. We're in a week. We bow in in China and you know, every platform has a different function. Twitter's obviously more real time news. Um, you know the timeline stuff, it falls off really quick. Instagram is really the artistic piece of it on, and then Facebook is a blend of both, and so that's kind of how we deploy our channels. We have a whole social team that generates content and pushes that content out. But those are the channels we use and those air incredibly valuable. Now what you're starting to see is those channels are changing very rapidly, based on their own set of algorithms, of how they deliver content of fans. And so we're having to continue to adapt to those changing environments in those social >> show impressions. In the term, impressions varies by various platforms. So so I know. I know I'm more familiar with Twitter impressions. They have the definition. It's not just somebody who might have seen it. It's somebody that they believe actually spent a few seconds looking at. They have some algorithm to figure that out. Yeah. Is that a metric that you finding your brands are are buying into, for example? >> Yeah. I mean, I think certainly there they view it's kind of the old, you know, when you bought TV ads, it's how many households. So my commercial right, it's It's a similar type of metric of how many eyeballs saw a piece of content that we put out. I think we're the metrics. More people are starting to care about his engagements, which is how many of you actually engaged with that piece of content, whether it's a like a common a share, because then that's actual. Yeah, you might have seen it for three seconds, but we know how things work. You're scrolling pretty fast, But if you actually stopped to engage it with something, that's where I think brands are starting to see value. And as we think about our content, we have ah framework that our digital team uses. But one of the pillars of that is thumb stopping. We want to create content that is some stopping that people actually engage with. And that's been a big focus of ours. Last couple years, >> I presume. Using video, huge >> video We've got a whole graphics team that does custom graphics for whether it's stats or for history, historical anniversaries. We have a hole in house production team that does higher end, and then our digital team does more kind of straight from the phone raw footage. So we're using a variety of different mediums toe reach our fans >> that What's your background? How'd you get into all of this? >> I spent seven years in consulting, so I worked for Deloitte on their strategy group out of Chicago, And I worked for CPG companies like at the intersection of Retailer and CPG. So a lot of in store promotional work helping brands think through just General Revenue management, pricing strategy, promotional strategy and, um stumbled upon greatness with the Bulls job. A friend gave me the heads up that they were looking to fill this type of role and I was able to get my resume in the mix and I was lucky enough to get get the job, and it's been when I started. We're single, single, single, so it's a team of one. Five years later, we're a team of six, and we'll probably keep growing. So it's been an exciting ride and >> your background is >> maths. That's eyes business. Undergrad. And then I got a went Indian undergrad business and then went to Kellogg. Northwestern got an MBA on strategy, so that's my background. But it's, you know, I've dabbled in sports. I worked for the Chicago 2016 Olympic bid back in the day when I was at Deloitte. Um, and so it's been It's always been a dream of mine. I just never knew how I get there like I was wanted to work in sports. They just don't know the path. And I'm lucky enough to find the path a lot earlier than I thought. >> How about this conference? I know you have been the other M I T. Event. How about this one? How we found some of the key takeaways. Think you >> think it's been great because a lot of the conferences we go to our really sports focus? So you've got the M. I T Sports Analytics conference. You have seat. You have n b a type, um, programming that they put on. But it's nice to get out of sports and sort of see how other bigger industries are thinking about some of the problems specifically around data management and the influx of data and how they're thinking about it. It's always nice to kind of elevated. Just have some room to breathe and think and meet people that are not in sports and start to build those, you know, relationships and with thought leaders and things like that. So it's been great. It's my first time here. What are probably back >> good that Well, hopefully get to see a game, even though that stocks are playing that well. Thanks so much for coming in Cuba. No problems here on your own. You have me. It was great to have you. All right. Keep right, everybody. I'll be back with our next guest with Paul Gill on day Volante here in the house. You're watching the cue from M I T CEO. I cube. Right back

Published Date : Aug 1 2019

SUMMARY :

Brought to you by Silicon Angle Media. Welcome back to M. I. T. In Cambridge, Massachusetts. So talk about Is the head of analytics for the Bulls? But on the business side, um, what we're focused on is really two things. the house is not about like you say, player performance about the business performances. always be doing the basketball stuff and the business stuff needs to get done, A. And I want to ask you a question about that. it started on the basketball side, certainly because that's that's where the, you know that is the most important So I know this is not, you know, your swim lane, but But, you know, the lore of Billy Beane I think if you have got to see Billy being at the M So let's talk about the business side of things. Talk about the NBA all the time and of course, you know, And so the things that we have control over the two big buckets are So the idea of surge pricing demand supply, But so several years ago, It's the same price for every game, Why don't you do it for the season ticket holders? I think this will be the first year where you have 8 to 10 teams that really have a shot at winning so hope you hold off a while, but I don't know. Here is the number of engagements you got. Twitter, Obviously. Um, you know the timeline stuff, it falls off really quick. Is that a metric that you finding your brands are are More people are starting to care about his engagements, which is how many of you actually engaged with that piece of content, I presume. We have a hole in house production team A friend gave me the heads up that they were looking to fill this type of role and I was able to get my resume in the But it's, you know, I've dabbled I know you have been the other M I T. Event. you know, relationships and with thought leaders and things like that. good that Well, hopefully get to see a game, even though that stocks are playing that well.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Dave VolantePERSON

0.99+

Matt KobePERSON

0.99+

19,000,000QUANTITY

0.99+

CubaLOCATION

0.99+

8QUANTITY

0.99+

DeloitteORGANIZATION

0.99+

Red SoxORGANIZATION

0.99+

ClippersORGANIZATION

0.99+

ChinaLOCATION

0.99+

BillyPERSON

0.99+

five yearsQUANTITY

0.99+

Bill JamesPERSON

0.99+

sevenQUANTITY

0.99+

ChicagoLOCATION

0.99+

MattPERSON

0.99+

YankeesORGANIZATION

0.99+

Paul GillPERSON

0.99+

LakersORGANIZATION

0.99+

seven yearsQUANTITY

0.99+

BMWORGANIZATION

0.99+

three secondsQUANTITY

0.99+

oneQUANTITY

0.99+

Chicago BullsORGANIZATION

0.99+

80QUANTITY

0.99+

Silicon Angle MediaORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

singleQUANTITY

0.99+

MasterCardORGANIZATION

0.99+

two teamsQUANTITY

0.99+

two big bucketsQUANTITY

0.99+

82 gameQUANTITY

0.99+

SoxORGANIZATION

0.99+

seven other guysQUANTITY

0.99+

M. I T Sports AnalyticsEVENT

0.99+

10,000,000 impressionsQUANTITY

0.99+

BullsORGANIZATION

0.99+

three preseason gamesQUANTITY

0.99+

M I t Sports AnalyticsEVENT

0.99+

two thingsQUANTITY

0.99+

two peopleQUANTITY

0.99+

firstQUANTITY

0.99+

single gamesQUANTITY

0.99+

Five years laterDATE

0.98+

TwitterORGANIZATION

0.98+

several years agoDATE

0.98+

10 teamsQUANTITY

0.98+

41 home gameQUANTITY

0.98+

NorthwesternORGANIZATION

0.98+

both sidesQUANTITY

0.98+

first timeQUANTITY

0.98+

FacebookORGANIZATION

0.98+

LeBronPERSON

0.98+

bothQUANTITY

0.98+

10DATE

0.98+

AlexPERSON

0.98+

this yearDATE

0.97+

Kansas City RoyalsORGANIZATION

0.97+

OneQUANTITY

0.97+

12 months a yearQUANTITY

0.97+

first yearQUANTITY

0.97+

78 years agoDATE

0.95+

single game ticketsQUANTITY

0.95+

M I T. EventEVENT

0.94+

1/2QUANTITY

0.94+

IndianOTHER

0.94+

InstagramORGANIZATION

0.94+

instagramORGANIZATION

0.93+

7 yearsQUANTITY

0.92+

first dayQUANTITY

0.92+

15 years agoDATE

0.92+

44 of those eventsQUANTITY

0.91+

six fullQUANTITY

0.91+

Maura's Bad MorrisORGANIZATION

0.9+

a weekQUANTITY

0.9+

SnapchatORGANIZATION

0.9+

M. I. T.PERSON

0.9+

over 25,000,000 social media followersQUANTITY

0.88+

secondsQUANTITY

0.88+

Last couple yearsDATE

0.88+

N. B.LOCATION

0.87+

Julie Johnson, Armored Things | MIT CDOIQ 2019


 

>> From Cambridge Massachusetts, it's The Cube covering MIT Chief Data Officer, and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (electronic music) >> Welcome back to MIT in Cambridge, Massachusets everybody. You're watching The Cube, the leader in live tech coverage. My name is Dave Vellante I'm here with Paul Gillin. Day two of the of the MIT Chief Data Officer Information Quality Conference. One of the things we like to do, at these shows, we love to profile Boston area start-ups that are focused on data, and in particular we love to focus on start-ups that are founded by women. Julie Johnson is here, She's the Co-founder and CEO of Armored Things. Julie, great to see you again. Thanks for coming on. >> Great to see you. >> So why did you start Armored Things? >> You know, Armored Things was created around a mission to keep people safe. Early in the time where were looking at starting this company, incidents like Las Vegas happened, Parkland happened, and we realized that the world of security and operations was really stuck in the past right? It's a manual solutions generally driven by a human instinct, anecdotal evidence, and tools like Walkie-Talkies and video cameras. We knew there had to be a better way right? In the world of Data that we live in today, I would ask if either of you got in your car this morning without turning on Google Maps to see where you were going, and the best route with traffic. We want to help universities, ball parks, corporate campuses do that for people. How do we keep our people safe? By understanding how they live. >> Yeah, and stay away from Lambert Street in Cambridge by the way. >> (laughing) >> Okay so, you know in people, when they think about security they think about cyber, they think about virtual security, et cetera et cetera, but there's also the physical security aspect. Can you talk about the balance of those two? >> Yeah, and I think both are very important. We actually tend to mimic some of the revolutions that have happened on the cyber security side over the last 10 years with what we're trying to do in the world of physical security. So, folks watching this who are familiar with cyber security might understand concepts like anomaly detection, SIEM and SOAR for orchestrated response. We very much believe that similar concepts can be applied to the physical world, but the unique thing about the physical world, is that it has defined boundaries, right? People behave in accordance with their environment. So, how do we take the lessons learned in cyber security over 10 to 15 years, and apply them to that physical world? I also believe that physical and cyber security are converging. So, are there things that we know in the physical world because of how we approach the problem? That can be a leading indicator of a threat in either the physical world or the digital world. What many people don't understand is that for some of these cyber security hacks, the first weak link is physical access to your network, to your data, to your systems. How do we actually help you get an eye on that, so you already have some context when you notice it in the digital realm. >> So, go back to the two examples you sited earlier, the two shooting examples. Could those have been prevented or mitigated in some way using the type of technology you're building? >> Yeah, I hate to say that you could ever prevent an incident like that. Everyone wants us to do better. Our goal is to get a better sense predicatively of the leading indicators that tell you you have a problem. So, because we're fundamentally looking at patterns of people and flow, I want to know when a normal random environment starts to disperse in a certain way, or if I have a bottle neck in my environment. Because if then I have that type of incident occur, I already know where my hotspots are, where my pockets of risk are. So, I can address it that much more efficiently from a response perspective. >> So if people are moving quickly away from a venue, it might be and indication that there's something wrong- >> It could be, Yeah. That demands attention. >> Yeah, when you go to a baseball game, or when you go to work I would imagine that you generally have a certain pattern of behavior. People know conceptually what those patterns are. But, we're the first effort to bring them data to prove what those patterns are so that they can actually use that data to consistently re-examine their operations, re-examine their security from a staffing perspective, from a management perspective, to make sure that they're using all the data that's at their disposal. >> Seems like there would be many other applications beyond security of this type of analysis. Are you committed to the security space, or do you have broader ambitions? >> Are we committed to the security space is a hundred percent. I would say the number one reason why people join our team, and the number one reason why people call us to be customers is for security. There's a better way to do things. We fundamentally believe that every ball park, every university, every corporate campus, needs a better way. I think what we've seen though is exactly what you're saying. As we built our software, for security in these venues, and started with an understanding of people and flow, there's a lot that falls out of that right? How do I open gates that are more effective based on patterns of entry and exit. How do I make sure that my staffing's appropriate for the number of people I have in my environment. There's lots of other contextual information that can ultimately drive a bottom line or top line revenue. So, you take a pro sports venue for example. If we know that on a 10 degree colder day people tend to eagres more early in the game, how do we adjust our food and beverage strategy to save money on hourly workers, so that we're not over staffing in a period of time that doesn't need those resources. >> She's talking about the physical and the logical security worlds coming together, and security of course has always been about data, but 10 years ago it was staring at logs increasing the machines are helping us do that, and software is helping us do that. So can you add some color to at least the trends in the market generally, and then maybe specifically what you're doing bringing machine intelligence to the data to make us more secure. >> Sure, and I hate to break it to you, but logs are still a pretty big part of what people are watching on a daily basis, as are video cameras. We've seen a lot of great technology evolve in the video management system realm. Very advanced technology great at object recognition and detecting certain behaviors with a video only solution, right? How do we help pinpoint certain behaviors on a specific frame or specific camera. The only problem with that is, if you have people watching those cameras, you're still relying on humans in the loop to catch a malicious behavior, to respond in the event that they're notified about something unusual. That still becomes a manual process. What we do, is we use data to watch not only cameras, but we are watching your cameras, your Wi-Fi, access control. Contextual data from public transit, or weather. How do we get this greater understanding of your environment that helps us watch everything so that we can surface the things that you want the humans in the loop to pay attention to, right? So, we're not trying to remove the human, we're trying to help them focus their time and make decisions that are backed by data in the most efficient way possible. >> How about the concerns about The Surveillance Society? In some countries, it's just taken for granted now that you're on camera all the time. In the US that's a little bit more controversial. Is what your doing, do you have to be sensitive to that in designing the tools you're building? >> Yeah, and I think to Dave's question, there are solutions like facial recognition which are very much working on identifying the individual. We have a philosophy as a company, that security doesn't necessarily start with the individual, it starts with the aggregate. How do we understand at an aggregate macro level, the patterns in an environment. Which means I don't have to identify Paul, or I don't have to identify Dave. I want to look for what's usual and unusual, and use that as the basis of my response. There's certain instances where you want to know who people are. Do I want to know who my security personnel are so I can dispatch them more efficiently? Absolutely. Let's opt those people in and allow them to share the information they need to share to be better resources for our environment. But, that's the exception not the norm. If we make the norm privacy first, I think we'll be really successful in this emerging GDPR data centric world. >> But I could see somebody down the road saying hey can you help us find this bad guy? And my kids at camp this week, This is his 7th year of camp, and this year was the first year my wife, she was able to sign up for a facial recognition thing. So, we used to have to scroll through hundreds and hundreds of pictures to see oh, there he is! And so Deb signs up for this thing, and then it pings you when your son has a picture taken. >> Yeah. And I was like, That's awesome. Oh. (laughing) >> That's great until you think about it. >> But there aren't really any clear privacy laws today. And so you guys are saying, look it, we're looking at the big picture. >> That's right. >> But that day is coming isn't it? >> There's certain environments that care more than others. If you think about universities, which is where we first started building our technology, they cared greatly about the privacy of their students. Health care is a great example. We want to make sure that we're protecting peoples personal data at a different level. Not only because that's the right thing to do, but also from a regulatory perspective. So, how do we give them the same security without compromising the privacy. >> Talk about Bottom line. You mentioned to us earlier that you just signed a contract with a sports franchise, you're actually going to help them, help save them money by deploying their resources more efficiently. How does your technology help the bottom line? >> Sure, you're average sporting venue, is getting great information at the point a ticket is scanned or a ticket is purchased, they have very little visibility beyond that into the customer journey during an event at their venue. So, if you think about again, patterns of people and flow from a security perspective, at our core we're helping them staff the right gates, or figure out where people need to be based on hot spots in their environment. But, what that also results in is an ability to drive other operational benefits. Do we have a zone that's very low utilization that we could use as maybe even a benefit to our avid fans. Send them to that area, get traffic in that area, and now give them a better concession experience because of it, right? Where they're going to end up spending more money because they're not waiting in line in the different zone. So, how do we give them a dashboard in real time, but also alerts or reports that they can use on an ongoing basis to change their decision making going forward. >> So, give us the company overview. Where are you guys at with funding, head count, all that good stuff. >> So, we raised a seed round with some great Boston and Silicon Valley investors a year ago. So, that was Glasswing is a Boston AI focused fund, has been a great partner for us, and Inovia which is Canada's largest VC fund recently opened a Silicon Valley office. We just started raising a series A about a week ago. I'm excited to say those conversation have been going really well so far. We have some potential strategic partners who we're excited about who know data better then anyone else that we think would help us accelerate our business. We also have a few folks who are very familiar with the large venue space. You know, the distributed campuses, the sporting and entertainment venues. So, we're out looking for the right partner to lead our series A round, and take our business to the next level, but where we are today with five really great branded customers, I think we'll have 20 by the end of next year, and we won't stop fighting 'till we're at every ball park, every football stadium, every convention center, school. >> The big question, at some point will you be able to eliminate security lines? (laughing) >> I don't think that's my core mission. (laughing) But, optimistically I'd love to help you. Right, I think there's some very talented people working on that challenge, so I'll defer that one to them. >> And rough head count today? >> We have 23 people. >> You're 23 people so- >> Yeah, I headquartered in Boston Post Office Square. >> Awesome, great location. So, and you say you've got five customers, so you're generating revenue? >> Yes >> Okay, good. Well, thank you for coming in The Cube >> Yeah, thank you. >> And best of luck with the series A- >> I appreciate it and going forward >> Yeah, great. >> All right, and thank you for watching. Paul Gillin and I will be back right after this short break. This is The Cube from MIT Chief Data Officer Information Quality Conference in Cambridge. We'll be right back. (electronic music)

Published Date : Aug 1 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Julie, great to see you again. to see where you were going, in Cambridge by the way. Okay so, you know in people, How do we actually help you get an eye on that, So, go back to the two examples you sited earlier, Yeah, I hate to say that you could ever prevent That demands attention. data to prove what those patterns are or do you have broader ambitions? and the number one reason why people bringing machine intelligence to the data Sure, and I hate to break it to you, sensitive to that in designing the tools you're building? Yeah, and I think to Dave's question, and then it pings you when your son And I was like, That's awesome. And so you guys are saying, Not only because that's the right thing to do, You mentioned to us earlier that you So, if you think about again, Where are you guys at with funding, head count, and take our business to the next level, so I'll defer that one to them. So, and you say you've got five customers, Well, thank you for coming in The Cube All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Paul GillinPERSON

0.99+

Dave VellantePERSON

0.99+

Julie JohnsonPERSON

0.99+

JuliePERSON

0.99+

CambridgeLOCATION

0.99+

7th yearQUANTITY

0.99+

InoviaORGANIZATION

0.99+

PaulPERSON

0.99+

Lambert StreetLOCATION

0.99+

BostonLOCATION

0.99+

fiveQUANTITY

0.99+

two examplesQUANTITY

0.99+

10 degreeQUANTITY

0.99+

USLOCATION

0.99+

five customersQUANTITY

0.99+

23 peopleQUANTITY

0.99+

twoQUANTITY

0.99+

todayDATE

0.99+

DebPERSON

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

Armored ThingsORGANIZATION

0.99+

bothQUANTITY

0.99+

Google MapsTITLE

0.99+

Silicon ValleyLOCATION

0.99+

GlasswingORGANIZATION

0.99+

OneQUANTITY

0.99+

this weekDATE

0.98+

Cambridge MassachusettsLOCATION

0.98+

a year agoDATE

0.98+

first yearQUANTITY

0.98+

series AOTHER

0.98+

hundred percentQUANTITY

0.98+

20QUANTITY

0.98+

Day twoQUANTITY

0.97+

The CubeTITLE

0.97+

Las VegasLOCATION

0.97+

firstQUANTITY

0.97+

CanadaLOCATION

0.96+

GDPRTITLE

0.96+

Chief Data OfficerEVENT

0.95+

over 10QUANTITY

0.94+

10 years agoDATE

0.94+

this yearDATE

0.94+

Surveillance SocietyORGANIZATION

0.93+

Boston Post Office SquareLOCATION

0.92+

15 yearsQUANTITY

0.91+

first effortQUANTITY

0.91+

end of next yearDATE

0.89+

MITORGANIZATION

0.89+

this morningDATE

0.88+

two shooting examplesQUANTITY

0.85+

about a week agoDATE

0.83+

ThiPERSON

0.83+

ArmoredORGANIZATION

0.83+

football stadiumQUANTITY

0.82+

oneQUANTITY

0.82+

2019DATE

0.81+

Information Quality SymposiumEVENT

0.8+

hundreds of picturesQUANTITY

0.79+

great branded customersQUANTITY

0.77+

last 10 yearsDATE

0.73+

hundreds andQUANTITY

0.73+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.72+

MassachusetsLOCATION

0.7+

ParklandORGANIZATION

0.7+

every ball parkQUANTITY

0.7+

one reasonQUANTITY

0.69+

Walkie-ORGANIZATION

0.66+

first weak linkQUANTITY

0.66+

convention centerQUANTITY

0.65+

The CubeORGANIZATION

0.64+

corporate campusQUANTITY

0.64+

ball parkQUANTITY

0.61+

MIT ChiefORGANIZATION

0.59+

TalkiesTITLE

0.57+

universityQUANTITY

0.57+

Data Officer Information Quality ConferenceEVENT

0.54+

Stewart Bond, IDC | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's three Cube covering M. I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to M I. T. CDO I Q everybody, you're watching the cube we got. We go out to the events we extract the signal from the noise is day one of this conference. Chief Data Officer event. I'm Dave, along with my co host, Paul Gillen. Stuart Bond is here is a research director of International Data Corporation I DC Stewart. Welcome to the Cube. Thanks for coming on. Thank you for having me. You're very welcome. So your space data intelligence tell us about your swim lane? Sure. >> So my role it I. D. C is a ZAY. Follow the data integration and data intelligence software market. So I follow all the different vendors in the market. I look at what kinds of solutions they're bringing to market, what kinds of problems. They're solving both business and technical for their clients. And so I can then report on the trends and market sizes, forecasts and such, And within that part of what I what I cover is everything from data integration which is more than traditionally E T l change data capture data movements, data, virtualization types of technologies as well as what we call date integrity of one. And I'm calling data intelligence, which is all of the Tell the metadata about the data. It's the data catalogs meditating management's data lineage. It's the data quality data profiling, master data intelligence. It's all of the data about the data and understanding really answering what I call a entering the five W's and h of data. It's the who, what, where, when, why and how. Data. So that's the market that I'm covering and following, and that's why I'm >> here. Were you here this morning for Mark Ramsey's Yes, I talk. So he kind of went to you. Heard it started with the D W kind of through E T L under the bus. Well, MGM, then the Enterprise data model said all that failed. But that stuff's not going away, and I'm sure they're black. So still using, you know, all those all that tooling today. So what was your reaction to that you were not in your head and yeah, it's true or saying, Well, maybe there's a little we'll have what we've been saying. The mainframe is gonna go away for years and >> still around, so I think they're obviously there's still those technologies out there and they're still being used. You can look at any of the major dtl vendors and there's new ones coming to the market, so that's still alive and well. There's no doubt that it's out there and its biggest segment of the market that I followed. So there's no source tooling, right? Yes, >> there's no doubt that it's still >> there. But Mark's vision of where things are going, where things are heading with, with data intelligence really being at the Cory talk about those spiders talked about that central depository of information about knowledge of the data. That's where things are heading to, whether you call it a data hub, whether you call it a date, a platform, not really a one big, huge data pop for one big, huge data depository, but one a place where you can go to get the information but natives you can find out where the data is. You could find out what it means, both the business context as well as the technical information you find out who's using that data. You can find out when it's being used, Why it's being used in. Why do we even have it and how it should >> be used? So it's being used >> appropriately. So you would say that his vision, actually what he implemented was visionary skating. They skated to the puck, so to speak, and that's we're going >> to see more of that. Where are seeing more of that? That's why we've seen such a jump in the number of vendors that air providing data catalogue solutions. I did, Uh, I d. C has this work product calling market glance. I did that >> beginning of 2018. >> I just did it again. In the middle of this year, the number of vendors that offer data catalogue solutions has significantly interest 240% increase in the number of vendors that offer that now itself of a small base. These air, not exhaustive studies. It may be that I didn't know about all those data catalogue vendors a year and 1/2 ago, but may also be that people are now saying that we've got a data catalogue, >> but you've really got a >> peel back the layers a little bit. Understand what these different data catalysts are and what they're doing because not all of them are crediting. >> We'll hear Radar. You don't know about it. 99% of the world mark talked this morning about some interesting new technologies. They were using Spider Ring to find the data bots to classify the data tools wrangle the data. I mean, there's a lot of new technology being applied to this area. What? Which of those technologies do you think has the greatest promise right now? And how? How how automated can this process become? >> It's the spider ring, and it's the cataloging of the data. It's understanding what you've got out there that is growing crazy. Just started to track that it's growing a lot that has the most promised. And as I said, I think that's going to be the data platform in the future. Is the intelligence knowing about where your data is? You men go on, get it. You know it's not a matter of all. The data is one place anymore. Data's everywhere Date is in hybrid cloud. It's in on premise. It's in private. Cloud isn't hosted. It's everywhere. I just did a survey. I got the results back in June 2019 just a month ago, and the data is all over the place. So really having that knowledge having that intelligence about where your data is, that has the most promise. As faras, the automation is concerned. Next step there. It's not just about collecting the information about where your data is, but it's actually applying the analytics, the machine learning and the artificial intelligence to that metadata collection that you've got so that you can then start to create those bots to create those pipelines to start to automate those tasks. We're starting to see some vendors move in that area, moving that direction. There's a lot of promise there >> you guys, at least when I remember. You see, the software is pretty robust taxonomy. I'm sure it's evolved over the years. So how do you sort of define your space? I'm interested in How big is that space, you know, in terms of market size and is a growing and where do you see it going? >> Right. So my my coverage of data integration and data intelligence is fairly small. It's a small, little marketed. I D. C. I'm part of a larger team that looks a data management, the analytics and information management. So we've got people on our team like a damn vessel. Who covers the analytics? Advanced Analytics show Nautical Palo Carlson. He's been on the cable covers, innovative technologies, those I apologize. I don't have that number off the top. >> Okay, No, But your space, my space is it. That's that Software market is so fragmented. And what I d. C has always done well, as you put people on those fragments and you know, deep in there. So So how you've been ableto not make your eyes bleed when you do that, challenging so the data and put it all together. >> It's important. Integration markets about 66 and 1/2 1,000,000,000 >> dollars. Substantial size. Yeah, but again, a lot of vendors Growing number of events in the markets growing, >> the market continues to grow as the data is becoming more distributed, more dispersed. There's no need to continue to integrate that data. There's also that need that growing >> need for that date intelligence. It's not >> just, you know, we've had a lot of enquiries lately about data being fed into machine learning artificial intelligence and people realizing our data isn't clean. We have to clean up our data because we're garbage in garbage. Out is probably more important now than ever before because you don't have someone saying, I don't think that day is right. You've got machines were looking at data instead. The technology that's out there and the problem with data quality. It's on a new problem. It's the same problem we've had for years. All of the technology is there to clean that data up, and that's a part of what I saw. I look at the data quality vendors experience here, sink sort in all of the other data quality capabilities that you get from in from Attica, from Tahoe or from a click podium. Metal is there, and so that part is growing. And there's a lot of more interest in that data quality and that data intelligence side again so the right data can be used. Good data can be used to trust in that data. Can the increase we used for the right reasons as well That's adding that context. Understand that Samantha having all that metadata that goes around that data so that could be used. Most of >> it is one of those markets that you may be relatively small. It's not 100,000,000,000 but it it enables a lot of larger markets. So okay, so it's 66 and 1/2 1,000,000,000 it's growing. It is a growing single digits, double digits. It's growing. It's hovering around the double dip double. It is okay, it's 10%. And then and then who were the, You know, big players who was driving the shares there? Is there a dominant player there? Bunch of >> so infirm. Atticus Number one in the market. Okay, followed by IBM. And I say peas right up there. Sass is there. Tell End is making a good Uh, okay, they're making a nice with Yeah, but there there's a number of different players. There's There's a lot of different players in that market. >> And in the leading market share player has what, 10%? 15%? 50%? Is it like a dominant divine spot? That's tough to say. You got a big It's over 1,000,000,000,000,000,000 right? So they've got maybe 1/6 of the market. Okay, so but it's not like Cisco as 2/3 of the networking market or anything like that. And what about the cloud guys? A participating in this guy's deal with >> the cloud guys? Yeah, the ClA got so there are some pure cloud solutions. There's a relative, for example. Pure cloud MBM mastered a management there. There's I'd say there's less pure cloud than there used to be. But, you know, but someone like an infra matic is really pushing that clouds presence in that cloud >> running these tools, this tooling in in the cloud But the cloud guys directly or not competing at this >> point. So Amazon Google? Yes, Those cloud guys. Yes. Okay, there, there. Google announced data flow back in our data. Sorry. Data fusion back. Google. >> Yeah, that's right. >> And so there they've got an e t l two on the cloud now. Ah, Amazon has blue yet which is both a catalog and an e t l tool. Microsoft course has data factory in azure. >> So those guys are coming on. I'm guessing if you talk to in dramatic and they said, Well, they're not as robust as we are. And we got a big install base and we go multi cloud is that kind of posturing of the incumbents or yeah, that's posturing. And maybe that's I don't mean it is a pejorative. If I were, those guys would be doing the same thing. You know, we were talking earlier about how the cloud guys essentially killed the Duke. All right, do you Do you see the same thing happening here, or is it well, the will the tool vendors be able to stay ahead in your view, >> depends on how they execute. If they're there and they're available in the cloud along with along with those clapper viers, they're able to provide solutions in the same same way the same elasticity, the same type of consumption based pricing models that pod vendors air offering. They can compete with that. They still have a better solution. Easton What >> in multi cloud in hybrid is a big part of their value problems that the cloud guys aren't really going hard after. I mean, this sort of dangling your toe in the water, some of them some of the >> cloud guys they have. They have the hybrid capabilities because they've got some of what they're what they built comes from on premises, worlds as well. So they've got that ability. Microsoft in particular >> on Google, >> Google that the data fusion came out of >> You're saying, But it's part of the Antos initiative. Er, >> um, I apologize. Folks are watching, >> but soup of acronyms notices We're starting a little bit. What tools have you seen or technology? Have you seen making governance of unstructured data? That looks promising? Uh, so I don't really cover >> the instructor data space that much. What I can say is Justus in the structure data world. It's about the metadata. It's about having the proper tags about that unstructured data. It's about getting the information of that unstructured data so that it can then be governed appropriately, making structure out of that, that is, I can't really say, because I don't cover that market explicitly. But I think again it comes back to the same type of data intelligence having that intelligence about that data by understanding what's in there. >> What advice are you giving to, you know, the buyers in your community and the sellers in your community, >> So the buyer's within the market. I talk a lot about that. The need for that data intelligence, so data governance to me is not a technology you can't go by data governance data governance is an organizational disappoint. Technology is a part of that. To me, the data intelligence technology is a part of that. So, really, organizations, if they really want a good handle, get a good handle on what data they have, how to use that, how to be enabled by that data. They need to have that date intelligence into go look for solutions that can help him pull that data intelligence out. But the other part of that is measurement. It's critical to measure because you can't improve what you're not measuring. So you know that type of approach to it is critical Eve, and you've got to be able to have people in the organization. You've got to be able to have cooperation collaboration across the business. I t. The the gifted office chief Officer office. You've gotta have that collaboration. You've gotta have accountability and for in order for that, to really be successful. For the vendors in the space hybrid is the new reality. In my survey data, it shows clearly that hybrid is where things are. It's not just cloud, it's not just on promise Tiebreak. That's where the future is. They've got to be able to have solutions that work in that environment. Working that hybrid cloud ability has got to be able to have solutions that can be purchased and used again in the same sort of elastic type of method that they're able to get consumers able to get. Service is from other vendors in that same >> height, so we gotta run. Thank you so much for sharing your insights and your data. And I know we were fired. I was firing a lot of questions. Did pretty well, not having the report in front of me. I know what that's like. So thank you for sharing and good luck with your challenges in the future. You got You got a lot of a lot of data to collect and a lot of fast moving markets. So come back any time. Share with you right now, Okay? And thank you for watching Paul and I will be back with our next guest right after this short break from M I t cdo. Right back

Published Date : Aug 1 2019

SUMMARY :

Brought to you by Silicon Angle Media. We go out to the events we extract the signal from the noise is day one of this conference. It's all of the So what was your reaction to that you were You can look at any of the major dtl vendors and there's new ones coming to the market, the information but natives you can find out where the data is. So you would say that his vision, actually what he implemented in the number of vendors that air providing data catalogue solutions. significantly interest 240% increase in the number of vendors that offer that now peel back the layers a little bit. 99% of the world mark It's not just about collecting the information about where your data is, but it's actually applying the I'm sure it's evolved over the years. I don't have that number off the top. that, challenging so the data and put it all together. It's important. number of events in the markets growing, the market continues to grow as the data is becoming more distributed, need for that date intelligence. All of the technology is there to clean that data up, and that's a part of what I saw. It's hovering around the double dip double. There's There's a lot of different players in that market. And in the leading market share player has what, 10%? Yeah, the ClA got so there are some pure cloud solutions. Google announced data flow back in our And so there they've got an e t l two on the cloud now. of the incumbents or yeah, that's posturing. They can compete with that. I mean, this sort of dangling your toe in the water, some of them some of the They have the hybrid capabilities because they've got some You're saying, But it's part of the Antos initiative. Folks are watching, What tools have you seen or technology? It's about getting the information of that So the buyer's within the market. not having the report in front of me.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Paul GillenPERSON

0.99+

June 2019DATE

0.99+

AmazonORGANIZATION

0.99+

50%QUANTITY

0.99+

CiscoORGANIZATION

0.99+

10%QUANTITY

0.99+

15%QUANTITY

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

MarkPERSON

0.99+

Mark RamseyPERSON

0.99+

SamanthaPERSON

0.99+

IBMORGANIZATION

0.99+

Stuart BondPERSON

0.99+

AtticaORGANIZATION

0.99+

66QUANTITY

0.99+

Silicon Angle MediaORGANIZATION

0.99+

International Data CorporationORGANIZATION

0.99+

240%QUANTITY

0.99+

DavePERSON

0.99+

Cambridge, MassachusettsLOCATION

0.99+

99%QUANTITY

0.99+

a month agoDATE

0.99+

1/6QUANTITY

0.99+

bothQUANTITY

0.99+

100,000,000,000QUANTITY

0.98+

MGMORGANIZATION

0.98+

oneQUANTITY

0.98+

DukeORGANIZATION

0.98+

a yearDATE

0.97+

PaulPERSON

0.95+

todayDATE

0.94+

EvePERSON

0.94+

M I. T. CDOPERSON

0.94+

Nautical Palo CarlsonORGANIZATION

0.93+

this morningDATE

0.92+

ClAORGANIZATION

0.92+

EastonPERSON

0.92+

1/2 agoDATE

0.9+

about 66QUANTITY

0.89+

2/3QUANTITY

0.88+

1/2 1,000,000,000QUANTITY

0.88+

CoryORGANIZATION

0.86+

one placeQUANTITY

0.86+

singleQUANTITY

0.85+

Stewart BondPERSON

0.84+

day oneQUANTITY

0.83+

I DCORGANIZATION

0.83+

I. D. CPERSON

0.82+

this yearDATE

0.81+

over 1,000,000,000,000,000,000QUANTITY

0.8+

yearsQUANTITY

0.79+

StewartPERSON

0.78+

middleDATE

0.75+

Spider RingCOMMERCIAL_ITEM

0.74+

beginningDATE

0.72+

2018DATE

0.72+

RadarORGANIZATION

0.72+

doubleQUANTITY

0.68+

2019DATE

0.68+

MITORGANIZATION

0.68+

TiebreakORGANIZATION

0.64+

threeQUANTITY

0.64+

TahoeLOCATION

0.63+

M. I T.PERSON

0.59+

IDCORGANIZATION

0.54+

CubeCOMMERCIAL_ITEM

0.54+

thoseQUANTITY

0.51+

JustusPERSON

0.5+

AntosORGANIZATION

0.48+

CDOIQEVENT

0.34+

Joe Caserta & Doug Laney, Caserta | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's three Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Hi already. We're back in Cambridge, Massachusetts at the M I t. Chief data officer Information quality event. Hashtag m i t cdo i Q. And I'm David Dante. He's Paul Gillen. Day one of our two day coverage of this event. This is the Cube, the leader in live tech coverage. Joe Caserta is here is the president of Caserta and Doug Laney, who is principal data strategist at Caserta, both Cube alarm guys. Great to see you again, Joe. What? Did you pick up this guy? How did that all came on here a couple of years ago? We had a great conversation. I read the book, Loved it. So congratulations. A nice pickup. >> We're very fortunate to have. >> Thanks. So I'm fortunate to be here, >> so Okay, well, what attracted you to Cassard? Oh, >> it's Joe's got a tremendous reputation. His his team of consultants has a great reputation. We both felt there was an opportunity to build some data strategy competency on top of that and leverage some of those in Phanom. Its ideas that I've been working on over the years. >> Great. Well, congratulations. And so, Joe, you and I have talked many times. And the reason I like talking because you know what's going on in the market place? You could you could siphon. What's riel? What's hype? So what do you see? It is the big trends in this data space, and then we'll get into it. Yeah, sure. Um, trends >> are chief data officer has been evolving over the last couple of years. You know, when we started doing this several years ago, there was just a handful of people, maybe 30 40 people. Now, there's 450 people here today, and it's been evolving. People are still trying to find their feet. Exactly what the chief date officers should be doing where they are in the hierarchy. Should they report to the c e o the C I O u the other CDO, which is a digital officer. So I think you know, hierarchically. That's still figuring it out politically. They're figuring it out, but technically also, they're still trying to figure it out. You know what's been happening over the past three years is the evolution of data going from traditional data warehousing and business intelligence. To get inside out of data just isn't working anymore. Eso evolving that moving it forward to more modern data engineering we've been doing for the past couple of years with quote unquote big data on That's not working anymore either, right? Because it's been evolving so fast. So now we're on, like, maybe Data three dato. And now we're talking about just pure automate everything. We have to automate everything. And we have to change your mindset from from having output of a data solution to an outcome to date a solution. And that's why I hired Doug, because way have to figure out not only had to get this data and look at it and analyze really had to monetize it, right? It's becoming a revenue stream for your business if you're doing it right and Doug is the leader in the industry, how to figure that >> you keep keep premise of your book was you gotta start valuing data and its fundamental you put forth a number of approaches and techniques and examples of companies doing that. Since you've published in phenomena Microsoft Apple, Amazon, Google and Facebook. Of the top five market value cos they've surpassed all the financial service is guys all ExxonMobil's and any manufacturer? Automobile makers? And what of a data companies, right? Absolutely. But intrinsically we know there's value their way any closer to the prescription that you put forth. >> Yeah, it's really no surprise and extra. We found that data companies have, ah, market to book value. That's nearly 33 times the market average, so Apple and others are much higher than that. But on average, if you look at the data product companies, they're valued much higher than other companies, probably because data can be reused in multiple ways. That's one of the core tenets of intra nomics is that Data's is non depleted ble regenerative, reusable asset and that companies that get that an architect of businesses based on those economics of information, um, can really perform well and not just data companies, but >> any company. That was a key takeaway of the book. The data doesn't conform to the laws of scarcity. Every says data is the new oil. It's like, No, it's not more valuable. So what are some examples in writing your book and customers that you work with. Where do you see Cos outside of these big data driven firms, breaking new ground and uses of data? I >> think the biggest opportunity is really not with the big giant Cos it's really with. Most of our most valuable clients are small companies with large volumes of data. You know if and the reason why they can remain small companies with large volumes of data is the thing that holds back the big giant enterprises is they have so much technical. Dad, it's very hard. They're like trying to, you know, raise the Titanic, right? You can't really. It's not agile enough. You need something that small and agile in order to pivot because it is changing so fast every time there's a solution created, it's obsolete. We have to greet the new solution on dhe when you have a big old processes. Big old technologies, big old mind sets on big old cultures. It's very hard to be agile. >> So is there no hope? I mean, the reason I ask the question was, What hope can you give some of these smokestack companies that they can become data centric? Yeah, What you >> see is that there was a There was a move to build big, monolithic data warehouses years ago and even Data Lakes. And what we find is that through the wealth of examples of companies that have benefited in significant ways from data and analytics, most of those solutions are very vocational. They're very functionally specific. They're not enterprise class, yada, yada, kind of kind of projects. They're focused on a particular business problem or monetizing or leveraging data in a very specific way, and they're generating millions of dollars of value. But again they tend to be very, very functionally specific. >> The other trend that we're seeing is also that the technology and the and the end result of what you're doing with your data is one thing. But really, in order to make that shift, if your big enterprises culture to really change all of the people within the organization to migrate from being a conventional wisdom run company to be a data really analytics driven company, and that takes a lot of change management, a lot of what we call data therapy way actually launched a new practice within the organization that Doug is actually and I are collaborating on to really mature because that is the next wave is really we figured out the data part. We figured out the technology part, but now it's the people part people. Part is really why we're not way ahead of where we even though we're way ahead of where we were a couple of years ago, we should be even further. Culturally, it's very, very challenging, and we need to address that head on. >> And that zeta skills issue that they're sort of locked into their existing skill sets and processes. Or is it? It's fear of the unknown what we're doing, you know? What about foam? Oh, yeah, Well, I mean, there are people >> jumping into bed to do this, right? So there is that part in an exciting part of it. But there's also just fear, you know, and fear of the unknown and, you know, part of what we're trying to do. And why were you trying Thio push Doug's book not for sales, but really just to share the knowledge and remove the mystery and let people see what they can actually do with this data? >> Yeah, it's more >> than just date illiteracy. So there's a lot of talk of the industry about data literacy programs and educating business people on the data and educating data people on the business. And that's obviously important. But what Joe is talking about is something bigger than that. It's really cultural, and it's something that is changed to the company's DNA. >> So where do you attack that problem? It doesn't have to go from the top down. You go into the middle. It has to >> be from the top down. It has to be. It has to be because my boss said to do it all right. >> Well, otherwise they well, they might do it. But the organization's because if you do, it >> is a grassroots movement on Lee. The folks who are excited, right? The foam of people, right? They're the ones who are gonna be excited. But they're going to evolve in adopt anyway, right? But it's the rest of the organization, and that needs to be a top down, Um, approach. >> It was interesting hearing this morning keynote speakers. You scored a throw on top down under the bus, but I had the same reaction is you can't do it without that executive buying. And of course, we defined, I guess in the session what that was. Amazon has an interesting concept for for any initiative, like every initiative that's funded has to have what they call a threaded leader. Another was some kind of And if they don't, if they don't have a threat of leader, there's like an incentive system tau dime on initiative. Kill it. It kind of forces top down. Yeah, you know, So >> when we interview our clients, we have a litmus test and the limits. It's kind of a ready in this test. Do you have the executive leadership to actually make this project successful? And in a lot of cases, they don't And you know, we'll have to say will call us when you're ready, you know, or because one of the challenges another part of the litmus test is this IittIe driven. If it's I t driven is gonna be very tough to get embraced by the rest of the business. So way need to really be able to have that executive leadership from the business to say this is something that we need >> to do to survive. Yeah, and, you know, with without the top down support. You could play small ball. But if you're playing the Yankees, you're gonna win one >> of the reasons why when it's I t driven, it's very challenging is because the people part right is a different budget from the i T budget. And when we start talking about data therapy, right and human resource is and training and education of just culture and data literacy, which is not necessary technical, that that becomes a challenge internally figuring out, like how to pay for Andi how to get it done with a corporate politics. >> So So the CDO crowd definitely parts of your book that they should be adopting because to me, there their main job is okay. How does data support the monetization of my organization? Raising revenue, cutting costs, improving productivity, saving lives. You call it value. And so that seems to be the starting point. At the same time. In this conference, you grew out of the ashes of back room information quality of the big data height, but exploded and have kind of gone full circle. So But I wonder, I mean, is the CDO crowd still focused on that monetization? Certainly I think we all agree they should be, but they're getting sucked back into a governance role. Can they do both, I guess, is >> my question. Well, governance has been, has been a big issue the past few years with all of the new compliance regulation and focus on on on ensuring compliance with them. But there's often a just a pendulum swing back, and I think there's a swing back to adding business value. And so we're seeing a lot of opportunities to help companies monetize their data broadly in a variety of ways. A CZ you mentioned not just in one way and, um, again those you need to be driven from the top. We have a process that we go through to generate ideas, and that's wonderful. Generating ideas. No is fairly straightforward enough. But then running them through kind of a feasibility government, starting with you have the executive support for that is a technology technologically feasible, managerially feasible, ethically feasible and so forth. So we kind of run them through that gauntlet next. >> One of my concerns is that chief data officer, the level of involvement that year he has in these digital initiatives again is digital initiative of Field of Dreams. Maybe it is. But everywhere you go the CEO is trying to get digital right, and it seems like the chief data officer is not necessarily front and center in those. Certainly a I projects, which are skunk works. But it's the chief digital officer that's driving it. So how how do you see in those roles playoff >> In the less panel that I've just spoken, very similar question was asked. And again, we're trying to figure out the hierarchy of where the CDO should live in an organization. Um, I find that the biggest place it fails typically is if it rolls up to a C I. O. Right. If you think the data is a technical issue, you're wrong, Right? Data is a business issue, Andi. I also think for any company to survive today, they have to have a digital presence. And so digital presence is so tightly coupled to data that I find the best success is when the chief date officer reports directly to the chief digital officer. Chief Digital officer has a vision for the user experience for the customer customers Ella to figure out. How do we get that customer engaged and that directly is dependent on insight. Right on analytics. You know, if the four of us were to open up, any application on our phone, even for the same product, would have four different experiences based on who we are, who are peers are what we bought in the past, that's all based on analytics. So the business application of the digital presence is tightly couple tow Analytics, which is driven by the chief state officer. >> That's the first time I've heard that. I think that's the right organizational structure. Did see did. JJ is going to be sort of the driver, right? The strategy. That's where the budget's gonna go and the chief date office is gonna have that supporting role that's vital. The enabler. Yeah, I think the chief data officer is a long term play. Well, we have a lot of cheap date officers. Still, 10 years from now, I think that >> data is not a fad. I think Data's just become more and more important. And will they ultimately leapfrog the chief digital officer and report to the CEO? Maybe someday, but for now, I think that's where they belong. >> You know what's company started managing their labor and workforce is as an actual asset, even though it's not a balance sheet. Asked for obvious reasons in the 19 sixties that gave rise to the chief human resource officer, which we still see today and his company start to recognize information as an asset, you need an executive leader to oversee and be responsible for that asset. >> Conceptually, it's always been data is an asset and a liability. And, you know, we've always thought about balancing terms. Your book sort of put forth a formula for actually formalizing. That's right. Do you think it's gonna happen our lifetime? What exactly clear on it, what you put forth in your book in terms of organizations actually valuing data specifically on the balance sheet. So that's >> an accounting question and one that you know that you leave to the accounting professionals. But there have been discussion papers published by the accounting standards bodies to discuss that issue. We're probably at least 10 years away, but I think respective weather data is that about what she'd asked or not. It's an imperative organizations to behave as if it is one >> that was your point it's probably not gonna happen, but you got a finger in terms that you can understand the value because it comes >> back to you can't manage what you don't measure and measuring the value of potential value or quality of your information. Or what day do you have your in a poor position to manage it like one. And if you're not manage like an asset, then you're really not probably able to leverage it like one. >> Give us a little commercial for I do want to say that I do >> think in our lifetime we will see it become an asset. There are lots of intangible assets that are on the books, intellectual property contracts. I think data that supports both of those things are equally is important. And they will they will see the light. >> Why are those five companies huge market cap winners, where they've surpassed all the evaluation >> of a business that the data that they have is considered right? So it should be part of >> the assets in the books. All right, we gotta wraps, But give us Give us the The Caserta Commercial. Well, concert is >> a consultancy that does essentially three things. We do data advisory work, which, which Doug is heading up. We do data architecture and strategy, and we also do just implementation of solutions. Everything from data engineering gate architecture and data science. >> Well, you made a good bet on data. Thanks for coming on, you guys. Great to see you again. Thank you. That's a wrap on day one, Paul. And I'll be back tomorrow for day two with the M I t cdo m I t cdo like you. Thanks for watching. We'll see them all.

Published Date : Jul 31 2019

SUMMARY :

Brought to you by Great to see you again, Joe. Its ideas that I've been working on over the years. And the reason I like talking because you know what's going on in the market place? So I think you that you put forth. We found that data companies have, ah, market to book value. The data doesn't conform to the laws of scarcity. We have to greet the new solution on dhe when you have a big old processes. But again they tend to be very, very functionally specific. But really, in order to make that shift, if your big enterprises It's fear of the unknown what we're But there's also just fear, you know, and fear of the unknown and, people on the data and educating data people on the business. It doesn't have to go from the top down. It has to be because my boss said to do it all But the organization's because if you do, But it's the rest of the organization, and that needs to be a top down, And of course, we defined, I guess in the session what that was. And in a lot of cases, they don't And you know, we'll have to say will call us when you're ready, Yeah, and, you know, with without the top down support. of the reasons why when it's I t driven, it's very challenging is because the people part And so that seems to be the starting point. Well, governance has been, has been a big issue the past few years with all of the new compliance regulation One of my concerns is that chief data officer, the level of involvement experience for the customer customers Ella to figure out. JJ is going to be sort of the driver, right? data is not a fad. to the chief human resource officer, which we still see today and his company start to recognize information What exactly clear on it, what you put forth in your book in terms of an accounting question and one that you know that you leave to the accounting professionals. back to you can't manage what you don't measure and measuring the value of potential value or quality of your information. assets that are on the books, intellectual property contracts. the assets in the books. a consultancy that does essentially three things. Great to see you again.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JoePERSON

0.99+

Paul GillenPERSON

0.99+

AmazonORGANIZATION

0.99+

David DantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

AppleORGANIZATION

0.99+

ExxonMobilORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

Joe CasertaPERSON

0.99+

PaulPERSON

0.99+

Silicon Angle MediaORGANIZATION

0.99+

five companiesQUANTITY

0.99+

DougPERSON

0.99+

450 peopleQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

fourQUANTITY

0.99+

YankeesORGANIZATION

0.99+

JJPERSON

0.99+

tomorrowDATE

0.99+

bothQUANTITY

0.99+

two dayQUANTITY

0.99+

LeePERSON

0.99+

Doug LaneyPERSON

0.99+

todayDATE

0.98+

OneQUANTITY

0.98+

CassardPERSON

0.98+

AndiPERSON

0.97+

CubeORGANIZATION

0.97+

The Caserta CommercialORGANIZATION

0.97+

oneQUANTITY

0.97+

day oneQUANTITY

0.97+

first timeQUANTITY

0.97+

day twoQUANTITY

0.96+

several years agoDATE

0.96+

one thingQUANTITY

0.93+

Day oneQUANTITY

0.93+

three thingsQUANTITY

0.92+

PhanomLOCATION

0.92+

CasertaORGANIZATION

0.91+

this morningDATE

0.91+

nearly 33 timesQUANTITY

0.9+

couple of years agoDATE

0.9+

millions of dollarsQUANTITY

0.9+

last couple of yearsDATE

0.9+

Doug Laney,PERSON

0.9+

waveEVENT

0.89+

19 sixtiesDATE

0.87+

2019DATE

0.86+

Thio pushPERSON

0.85+

past couple of yearsDATE

0.84+

years agoDATE

0.84+

Data three datoORGANIZATION

0.84+

one wayQUANTITY

0.84+

nextEVENT

0.83+

past three yearsDATE

0.81+

TitanicCOMMERCIAL_ITEM

0.8+

30 40 peopleQUANTITY

0.8+

least 10 yearsQUANTITY

0.75+

topQUANTITY

0.75+

M I T.EVENT

0.75+

MIT CDOIQEVENT

0.7+

Field of DreamsORGANIZATION

0.7+

past few yearsDATE

0.7+

threeQUANTITY

0.7+

five marketQUANTITY

0.69+

CDOORGANIZATION

0.68+

of peopleQUANTITY

0.66+

M I t.EVENT

0.65+

yearsQUANTITY

0.64+

CasertaPERSON

0.63+

CosORGANIZATION

0.56+

EllaPERSON

0.56+

kORGANIZATION

0.53+

Colin Mahony, Vertica | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019, brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, Massachusetts everybody, you're watching The Cube, the leader in tech coverage. My name is Dave Vellante here with my cohost Paul Gillin. This is day one of our two day coverage of the MIT CDOIQ conferences. CDO, Chief Data Officer, IQ, information quality. Colin Mahoney is here, he's a good friend and long time CUBE alum. I haven't seen you in awhile, >> I know >> But thank you so much for taking some time, you're like a special guest here >> Thank you, yeah it's great to be here, thank you. >> Yeah, so, this is not, you know, something that you would normally attend. I caught up with you, invited you in. This conference has started as, like back office governance, information quality, kind of wonky stuff, hidden. And then when the big data meme took off, kind of around the time we met. The Chief Data Officer role emerged, the whole Hadoop thing exploded, and then this conference kind of got bigger and bigger and bigger. Still intimate, but very high level, very senior. It's kind of come full circle as we've been saying, you know, information quality still matters. You have been in this data business forever, so I wanted to invite you in just to get your perspectives, we'll talk about what's new with what's going on in your company, but let's go back a little bit. When we first met and even before, you saw it coming, you kind of invested your whole career into data. So, take us back 10 years, I mean it was so different, remember it was Batch, it was Hadoop, but it was cool. There was a lot of cool >> It's still cool. (laughs) projects going on, and it's still cool. But, take a look back. >> Yeah, so it's changed a lot, look, I got into it a while ago, I've always loved data, I had no idea, the explosion and the three V's of data that we've seen over the last decade. But, data's really important, and it's just going to get more and more important. But as I look back I think what's really changed, and even if you just go back a decade I mean, there's an insatiable appetite for data. And that is not slowing down, it hasn't slowed down at all, and I think everybody wants that perfect solution that they can ask any question and get an immediate answers to. We went through the Hadoop boom, I'd argue that we're going through the Hadoop bust, but what people actually want is still the same. You know, they want real answers, accurate answers, they want them quickly, and they want it against all their information and all their data. And I think that Hadoop evolved a lot as well, you know, it started as one thing 10 years ago, with MapReduce and I think in the end what it's really been about is disrupting the storage market. But if you really look at what's disrupting storage right now, public clouds, S3, right? That's the new data league. So there's always a lot of hype cycles, everybody talks about you know, now it's Cloud, everything, for maybe the last 10 years it was a lot of Hadoop, but at the end of the day I think what people want to do with data is still very much the same. And a lot of companies are still struggling with it, hence the role for Chief Data Officers to really figure out how do I monetize data on the one hand and how to I protect that asset on the other hand. >> Well so, and the cool this is, so this conference is not a tech conference, really. And we love tech, we love talking about this, this is why I love having you on. We kind of have a little Vertica thread that I've created here, so Colin essentially, is the current CEO of Vertica, I know that's not your title, you're GM and Senior Vice President, but you're running Vertica. So, Michael Stonebreaker's coming on tomorrow, >> Yeah, excellent. >> Chris Lynch is coming on tomorrow, >> Oh, great, yeah. >> we've got Andy Palmer >> Awesome, yeah. >> coming up as well. >> Pretty cool. (laughs) >> So we have this connection, why is that important? It's because, you know, Vertica is a very cool company and is all about data, and it was all about disrupting, sort of the traditional relational database. It's kind of doing more with data, and if you go back to the roots of Vertica, it was like how do you do things faster? How do you really take advantage of data to really drive new business? And that's kind of what it's all about. And the tech behind it is really cool, we did your conference for many, many years. >> It's coming back by the way. >> Is it? >> Yeah, this March, so March 30th. >> Oh, wow, mark that down. >> At Boston, at the new Encore Hotel. >> Well we better have theCUBE there, bro. (laughs) >> Yeah, that's great. And yeah, you've done that conference >> Yep. >> haven't you before? So very cool customers, kind of leading edge, so I want to get to some of that, but let's talk the disruption for a minute. So you guys started with the whole architecture, MPP and so forth. And you talked about Cloud, Cloud really disrupted Hadoop. What are some of the other technology disruptions that you're seeing in the market space? >> I think, I mean, you know, it's hard not to talk about AI machine learning, and what one means versus the other, who knows right? But I think one thing that is definitely happening is people are leveraging the volumes of data and they're trying to use all the processing power and storage power that we have to do things that humans either are too expensive to do or simply can't do at the same speed and scale. And so, I think we're going through a renaissance where a lot more is being automated, certainly on the Vertica roadmap, and our path has always been initially to get the data in and then we want the platform to do a lot more for our customers, lots more analytics, lots more machine-learning in the platform. So that's definitely been a lot of the buzz around, but what's really funny is when you talk to a lot of customers they're still struggling with just some basic stuff. Forget about the predictive thing, first you've got to get to what happened in the past. Let's give accurate reporting on what's actually happening. The other big thing I think as a disruption is, I think IOT, for all the hype that it's getting it's very real. And every device is kicking off lots of information, the feedback loop of AB testing or quality testing for predictive maintenance, it's happening almost instantly. And so you're getting massive amounts of new data coming in, it's all this machine sensor type data, you got to figure out what it means really quick, and then you actually have to do something and act on it within seconds. And that's a whole new area for so many people. It's not their traditional enterprise data network warehouse and you know, back to you comment on Stonebreaker, he got a lot of this right from the beginning, you know, and I think he looked at the architectures, he took a lot of the best in class designs, we didn't necessarily invent everything, but we put a lot of that together. And then I think the other you've got to do is constantly re-invent your platform. We came out with our Eon Mode to run cloud native, we just got rated the best cloud data warehouse from a net promoter score rating perspective, so, but we got to keep going you know, we got to keep re-inventing ourselves, but leverage everything that we've done in the past as well. >> So one of the things that you said, which is kind of relevant for here, Paul, is you're still seeing a real data quality issue that customers are wrestling with, and that's a big theme here, isn't it? >> Absolutely, and the, what goes around comes around, as Dave said earlier, we're still talking about information quality 13 years after this conference began. Have the tools to improve quality improved all that much? >> I think the tools have improved, I think that's another area where machine learning, if you look at Tamr, and I know you're going to have Andy here tomorrow, they're leveraging a lot of the augmented things you can do with the processing to make it better. But I think one thing that makes the problem worse now, is it's gotten really easy to pour data in. It's gotten really easy to store data without having to have the right structure, the right quality, you know, 10 years ago, 20 years ago, everything was perfect before it got into the platform. Right, everything was, there was quality, everything was there. What's been happening over the last decade is you're pumping data into these systems, nobody knows if it's redundant data, nobody knows if the quality's any good, and the amount of data is massive. >> And it's cheap to store >> Very cheap to store. >> So people keep pumping it in. >> But I think that creates a lot of issues when it comes to data quality. So, I do think the technology's gotten better, I think there's a lot of companies that are doing a great job with it, but I think the challenge has definitely upped. >> So, go ahead. >> I'm sorry. You mentioned earlier that we're seeing the death of Hadoop, but I'd like you to elaborate on that becuase (Dave laughs) Hadoop actually came up this morning in the keynote, it's part of what GlaxoSmithKline did. Came up in a conversation I had with the CEO of Experian last week, I mean, it's still out there, why do you think it's in decline? >> I think, I mean first of all if you look at the Hadoop vendors that are out there, they've all been struggling. I mean some of them are shutting down, two of them have merged and they've got killed lately. I think there are some very successful implementations of Hadoop. I think Hadoop as a storage environment is wonderful, I think you can process a lot of data on Hadoop, but the problem with Hadoop is it became the panacea that was going to solve all things data. It was going to be the database, it was going to be the data warehouse, it was going to do everything. >> That's usually the kiss of death, isn't it? >> It's the kiss of death. And it, you know, the killer app on Hadoop, ironically, became SQL. I mean, SQL's the killer app on Hadoop. If you want to SQL engine, you don't need Hadoop. But what we did was, in the beginning Mike sort of made fun of it, Stonebreaker, and joked a lot about he's heard of MapReduce, it's called Group By, (Dave laughs) and that created a lot of tension between the early Vertica and Hadoop. I think, in the end, we embraced it. We sit next to Hadoop, we sit on top of Hadoop, we sit behind it, we sit in front of it, it's there. But I think what the reality check of the industry has been, certainly by the business folks in these companies is it has not fulfilled all the promises, it has not fulfilled a fraction on the promises that they bet on, and so they need to figure those things out. So I don't think it's going to go away completely, but I think its best success has been disrupting the storage market, and I think there's some much larger disruptions of technologies that frankly are better than HTFS to do that. >> And the Cloud was a gamechanger >> And a lot of them are in the cloud. >> Which is ironic, 'cause you know, cloud era, (Colin laughs) they didn't really have a cloud strategy, neither did Hortonworks, neither did MapR and, it just so happened Amazon had one, Google had one, and Microsoft has one, so, it's just convenient to-- >> Well, how is that affecting your business? We've seen this massive migration to the cloud (mumbles) >> It's actually been great for us, so one of the things about Vertica is we run everywhere, and we made a decision a while ago, we had our own data warehouse as a service offering. It might have been ahead of its time, never really took off, what we did instead is we pivoted and we say "you know what? "We're going to invest in that experience "so it's a SaaS-like experience, "but we're going to let our customers "have full control over the cloud. "And if they want to go to Amazon they can, "if they want to go to Google they can, "if they want to go to Azure they can." And we really invested in that and that experience. We're up on the Amazon marketplace, we have lots of customers running up on Amazon Cloud as well as Google and Azure now, and then about two years ago we went down and did this endeavor to completely re-architect our product so that we could separate compute and storage so that our customers could actually take advantage of the cloud economics as well. That's been huge for us, >> So you scale independent-- >> Scale independently, cloud native, add compute, take away compute, and for our existing customers, they're loving the hybrid aspect, they love that they can still run on Premise, they love that they can run up on a public cloud, they love that they can run in both places. So we will continue to invest a lot in that. And it is really, really important, and frankly, I think cloud has helped Vertica a lot, because being able to provision hardware quickly, being able to tie in to these public clouds, into our customers' accounts, give them control, has been great and we're going to continue on that path. >> Because Vertica's an ISV, I mean you're a software company. >> We're a software company. >> I know you were a part of HP for a while, and HP wanted to mash that in and run it on it's hardware, but software runs great in the cloud. And then to you it's another hardware platform. >> It's another hardware platform, exactly. >> So give us the update on Micro Focus, Micro Focus acquired Vertica as part of the HPE software business, how many years ago now? Two years ago? >> Less than two years ago. >> Okay, so how's that going, >> It's going great. >> Give us the update there. >> Yeah, so first of all it is great, HPE and HP were wonderful to Vertica, but it's great being part of a software company. Micro Focus is a software company. And more than just a software company it's a company that has a lot of experience bridging the old and the new. Leveraging all of the investments that you've made but also thinking about cloud and all these other things that are coming down the pike. I think for Vertica it's been really great because, as you've seen Vertica has gotten its identity back again. And that's something that Micro Focus is very good at. You can look at what Micro Focus did with SUSE, the Linux company, which actually you know, now just recently spun out of Micro Focus but, letting organizations like Vertica that have this culture, have this product, have this passion, really focus on our market and our customers and doing the right thing by them has been just really great for us and operating as a software company. The other nice thing is that we do integrate with a lot of other products, some of which came from the HPE side, some of which came from Micro Focus, security products is an example. The other really nice thing is we've been doing this insource thing at Micro Focus where we open up our source code to some of the other teams in Micro Focus and they've been contributing now in amazing ways to the product. In ways that we would just never be able to scale, but with 4,000 engineers strong in Micro Focus, we've got a much larger development organization that can actually contribute to the things that Vertica needs to do. And as we go into the cloud and as we do a lot more operational aspects, the experience that these teams have has been incredible, and security's another great example there. So overall it's been great, we've had four different owners of Vertica, our job is to continue what we do on the innovation side in the culture, but so far Micro Focus has been terrific. >> Well, I'd like to say, you're kind of getting that mojo back, because you guys as an independent company were doing your own thing, and then you did for a while inside of HP, >> We did. >> And that obviously changed, 'cause they wanted more integration, but, and Micro Focus, they know what they're doing, they know how to do acquisitions, they've been very successful. >> It's a very well run company, operationally. >> The SUSE piece was really interesting, spinning that out, because now RHEL is part of IBM, so now you've got SUSE as the lone independent. >> Yeah. >> Yeah. >> But I want to ask you, go back to a technology question, is NoSQL the next Hadoop? Are these databases, it seems to be that the hot fad now is NoSQL, it can do anything. Is the promise overblown? >> I think, I mean NoSQL has been out almost as long as Hadoop, and I, we always say not only SQL, right? Mike's said this from day one, best tool for the job. Nothing is going to do every job well, so I think that there are, whether it's key value stores or other types of NoSQL engines, document DB's, now you have some of these DB's that are running on different chips, >> Graph, yeah. >> there's always, yeah, graph DBs, there's always going to be specialty things. I think one of the things about our analytic platform is we can do, time series is a great example. Vertica's a great time series database. We can compete with specialized time series databases. But we also offer a lot of, the other things that you can do with Vertica that you wouldn't be able to do on a database like that. So, I always think there's going to be specialty products, I also think some of these can do a lot more workloads than you might think, but I don't see as much around the NoSQL movement as say I did a few years ago. >> But so, and you mentioned the cloud before as kind of, your position on it I think is a tailwind, not to put words in your mouth, >> Yeah, yeah, it's a great tailwind. >> You're in the Amazon marketplace, I mean they have products that are competitive, right? >> They do, they do. >> But, so how are you differentiating there? >> I think the way we differentiate, whether it's Redshift from Amazon, or BigQuery from Google, or even what Azure DB does is, first of all, Vertica, I think from, feature functionality and performance standpoint is ahead. Number one, I think the second thing, and we hear this from a lot of customers, especially at the C-level is they don't want to be locked into these full stacks of the clouds. Having the ability to take a product and run it across multiple clouds is a big thing, because the stack lock-in now, the full stack lock-in of these clouds is scary. It's really easy to develop in their ecosystems but you get very locked into them, and I think a lot of people are concerned about that. So that works really well for Vertica, but I think at the end of the day it's just, it's the robustness of the product, we continue to innovate, when you look at separating compute and storage, believe it or not, a lot of these cloud-native databases don't do that. And so we can actually leverage a lot of the cloud hardware better than the native cloud databases do themselves. So, like I said, we have to keep going, those guys aren't going to stop, and we actually have great relationships with those companies, we work really well with the clouds, they seem to care just as much about their cloud ecosystem as their own database products, and so I think that's going to continue as well. >> Well, Colin, congratulations on all the success >> Yeah, thank you, yeah. >> It's awesome to see you again and really appreciate you coming to >> Oh thank you, it's great, I appreciate the invite, >> MIT. >> it's great to be here. >> All right, keep it right there everybody, Paul and I will be back with our next guest from MIT, you're watching theCUBE. (electronic jingle)

Published Date : Jul 31 2019

SUMMARY :

brought to you by SiliconANGLE Media. I haven't seen you in awhile, kind of around the time we met. It's still cool. but at the end of the day I think is the current CEO of Vertica, (laughs) and if you go back to the roots of Vertica, at the new Encore Hotel. Well we better have theCUBE there, bro. And yeah, you've done that conference but let's talk the disruption for a minute. but we got to keep going you know, Have the tools to improve quality the right quality, you know, But I think that creates a lot of issues but I'd like you to elaborate on that becuase I think you can process a lot of data on Hadoop, and so they need to figure those things out. so one of the things about Vertica is we run everywhere, and frankly, I think cloud has helped Vertica a lot, I mean you're a software company. And then to you it's another hardware platform. the Linux company, which actually you know, and Micro Focus, they know what they're doing, so now you've got SUSE as the lone independent. is NoSQL the next Hadoop? Nothing is going to do every job well, the other things that you can do with Vertica and so I think that's going to continue as well. Paul and I will be back with our next guest from MIT,

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
DavePERSON

0.99+

Andy PalmerPERSON

0.99+

Paul GillinPERSON

0.99+

Dave VellantePERSON

0.99+

MicrosoftORGANIZATION

0.99+

GoogleORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

Colin MahoneyPERSON

0.99+

PaulPERSON

0.99+

ColinPERSON

0.99+

IBMORGANIZATION

0.99+

VerticaORGANIZATION

0.99+

Chris LynchPERSON

0.99+

HPEORGANIZATION

0.99+

Michael StonebreakerPERSON

0.99+

HPORGANIZATION

0.99+

Micro FocusORGANIZATION

0.99+

HadoopTITLE

0.99+

Colin MahonyPERSON

0.99+

last weekDATE

0.99+

AndyPERSON

0.99+

March 30thDATE

0.99+

NoSQLTITLE

0.99+

MikePERSON

0.99+

ExperianORGANIZATION

0.99+

tomorrowDATE

0.99+

SQLTITLE

0.99+

two dayQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.99+

BostonLOCATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

4,000 engineersQUANTITY

0.99+

Two years agoDATE

0.99+

SUSETITLE

0.99+

Azure DBTITLE

0.98+

second thingQUANTITY

0.98+

20 years agoDATE

0.98+

10 years agoDATE

0.98+

oneQUANTITY

0.98+

VerticaTITLE

0.98+

HortonworksORGANIZATION

0.97+

MapReduceORGANIZATION

0.97+

one thingQUANTITY

0.97+

Mark Krzysko, US Department of Defense | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's The Cube, covering MIT Chief data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Welcome back to Cambridge, everybody. We're here at Tang building at MIT for the MIT CDOIQ Conference. This is the 13th annual MIT CDOIQ. It started as a information quality conference and grew through the big data era, the Chief Data Officer emerged and now it's sort of a combination of those roles. That governance role, the Chief Data Officer role. Critical for organizations for quality and data initiatives, leading digital transformations ans the like. I'm Dave Vallante with my cohost Paul Gillin, you're watching The Cube, the leader in tech coverage. Mark Chrisco is here, the deputy, sorry, Principle Deputy Director for Enterprise Information at the Department of Defense. Good to see you again, thanks for coming on. >> Oh, thank you for having me. >> So, Principle Deputy Director Enterprise Information, what do you do? >> I do data. I do acquisition data. I'm the person in charge of lining the acquisition data for the programs for the Under Secretary and the components so a strong partnership with the army, navy, and air force to enable the department and the services to execute their programs better, more efficiently, and be efficient in the data management. >> What is acquisition data? >> So acquisition data generally can be considered best in the shorthand of cost schedule performance data. When a program is born, you have to manage, you have to be sure it's resourced, you're reporting up to congress, you need to be sure you have insight into the programs. And finally, sometimes you have to make decisions on those programs. So, cost schedule performance is a good shorthand for it. >> So kind of the key metrics and performance metrics around those initiatives. And how much of that is how you present that data? The visualization of it. Is that part of your role or is that, sort of, another part of the organization you partner with, or? >> Well, if you think about it, the visualization can take many forms beyond that. So a good part of the role is finding the authoritative trusted source of that data, making sure it's accurate so we don't spend time disagreeing on different data sets on cost schedule performance. The major programs are tremendously complex and large and involve and awful lot of data in the a buildup to a point where you can look at that. It's just not about visualizing, it's about having governed authoritative data that is, frankly, trustworthy that you can can go operate in. >> What are some of the challenges of getting good quality data? >> Well, I think part of the challenge was having a common lexicon across the department and the services. And as I said, the partnership with the services had been key in helping define and creating a semantic data model for the department that we can use. So we can have agreement on what it would mean when we were using it and collecting it. The services have thrown all in and, in their perspective, have extended that data model down through their components to their programs so they can better manage the programs because the programs are executed at a service level, not at an OSD level. >> Can you make that real? I mean, is there an example you can give us of what you mean by a common semantic model? >> So for cost schedule, let's take a very simple one, program identification. Having a key number for that, having a long name, a short name, and having just the general description of that, were in various states amongst the systems. We've had decades where, however the system was configured, configured it the way they wanted to. It was largely not governed and then trying to bring those data sets together were just impossible to do. So even with just program identification. Since the majority of the programs and numbers are executed at a service level, we worked really hard to get the common words and meanings across all the programs. >> So it's a governance exercise the? >> Yeah. It is certainly a governance exercise. I think about it as not so much as, in the IT world or the data world will call it governance, it's leadership. Let's settle on some common semantics here that we can all live with and go forward and do that. Because clearly there's needs for other pieces of data that we may or may not have but establishing a core set of common meanings across the department has proven very valuable. >> What are some of the key data challenges that the DOD faces? And how is your role helping address them? >> Well in our case, and I'm certain there's a myriad of data choices across the department. In our place it was clarity in and the governance of this. Many of the pieces of data were required by statute, law, police, or regulation. We came out of eras where data was the piece of a report and not really considered data. And we had to lead our ways to beyond the report to saying, "No, we're really "talking about key data management." So we've been at this for a few years and working with the services, that has been a challenge. I think we're at the part where we've established the common semantics for the department to go forward with that. And one of the challenges that I think is the access and dissemination of knowing what you can share and when you can share it. Because Michael Candolim said earlier that the data in mosaic, sometimes you really need to worry about it from our perspective. Is too much publicly available or should we protect on behalf of the government? >> That's a challenge. Is the are challenge in terms of, I'm sure there is but I wonder if you can describe it or maybe talk about how you might have solved it, maybe it's not a big deal, but you got to serve the mission of the organization. >> Absolutely. >> That's, like, number one. But at the same time, you've got stakeholders and they're powerful politicians and they have needs and there's transparency requirements, there are laws. They're not always aligned, those two directives, are they? >> No, thank goodness I don't have to deal with misalignments of those. We try to speak in the truth of here's the data and the decisions across the organization of our reports still go to congress, they go to congress on an annual basis through the selected acquisition report. And, you know, we are better understanding what we need to protect and how to advice congress on what should be protected and why. I would not say that's an easy proposition. The demands for those data come from the GAO, come from congress, come from the Inspector General and having to navigate that requires good access and dissemination controls and knowing why. We've sponsored some research though the RAND organization to help us look and understand why you have got to protect it and what policies, rules, and regulations are. And all those reports have been public so we could be sure that people would understand what it is. We're coming out of an era where data was not considered as it is today where reports were easily stamped with a little rubber stamp but data now moves at the velocities of milliseconds not as the velocity of reports. So we really took a comprehensive look at that. How do you manage data in a world where it is data and it is on infrastructures like data models. >> So, the future of war. Everybody talks about cyber as the future of war. There's a lot of data associated with that. How does that change what you guys do? Or does it? >> Well, I think from an acquisition perspective, you would think, you know. In that discussion that you just presented us, we're micro in that. We're equipping and acquiring through acquisitions. What we've done is we make sure that our data is shareable, you know? Open I, API structures. Having our data models. Letting the war fighters have our data so they could better understand where information is here. Letting other communities to better help that. By us doing our jobs where we sit, we can contribute to their missions and we've aways been every sharing in that. >> Is technology evolving to the point where, let's assume you could dial back 10 or 15 years and you had the nirvana of data quality. We know how fast technology is changing but is it changing as an enabler to really leverage that quality of data in ways that you might not have even envision 10 or 15 years ago? >> I think technology is. I think a lot of this is not in tools, it's now in technique and management practices. I think many of us find ourselves rethinking of how to do this now that you have data, now that you have tools that you can get them. How can you adopt better and faster? That requires a cultural change to organization. In some cases it requires more advanced skills, in other cases it requires you to think differently about the problems. I always like to consider that we, at some point, thought about it as a process-driven organization. Step one to step two to step three. Now process is ubiquitous because data becomes ubiquitous and you could refactor your processes and decisions much more efficiently and effectively. >> What are some of the information quality problems you have to wrestle with? >> Well, in our case, by setting a definite semantic meaning, we kicked the quality problems to those who provide the authoritative data. And if they had a quality problem, we said, "Here's your data. "We're going to now use it." So it spurs, it changes the model of them ensuring the quality of those who own the data. And by working with the services, they've worked down through their data issues and have used us a bit as the foil for cleaning up their data errors that they have from different inputs. And I like to think about it as flipping the model of saying, "It's not my job to drive quality, "it's my job to drive clarity, "it's their job to drive the quality into the system." >> Let's talk about this event. So, you guys are long-time contributors to the event. Mark, have you been here since the beginning? Or close to it? >> Um... About halfway through I think. >> When the focus was primarily on information quality? >> Yes. >> Was it CDOIQ at the time or was it IQ? >> It was the very beginnings of CDOIQ. It was right before it became CDOIQ. >> Early part of this decade? >> Yes. >> Okay. >> It was Information Quality Symposium originally, is that was attracted you to it? >> Well, yes, I was interested in it because I think there were two things that drew my interest. One, a colleague had told me about it and we were just starting the data journey at that point. And it was talking about information quality and it was out of a business school in the MIT slenton side of the house. And coming from a business perspective, it was not just the providence of IT, I wanted to learn form others because I sit on the business side of the equation. Not a pure IT-ist or technology. And I came here to learn. I've never stopped learning through my entire journey here. >> What have you learned this week? >> Well, there's an awful lot I learned. I think it's been... This space is evolving so rapidly with the law, policy, and regulation. Establishing the CDOs, establishing the roles, getting hear from the CDOs, getting to hear from visions, hear from Michael Conlan and hear from others in the federal agencies. Having them up here and being able to collaborate and talk to them. Also hearing from the technology people, the people that're bringing solutions to the table. And then, I always say this is a bit like group therapy here because many of us have similar problems, we have different start and end points and learning from each other has proven to be very valuable. From the hallway conversations to hearing somebody and seeing how they thought about the products, seeing how commercial industry has implemented data management. And you have a lot of similarity of focus of people dealing with trying to bring data to bring value to the organizations and understanding their transformations, it's proven invaluable. >> Well, what did the appointment of the DOD's first CDO last year, what statement did that make to the organization? >> That data's important. Data are important. And having a CDO in that and, when Micheal came on board, we shared some lessons learned and we were thinking about how to do that, you know? As I said, I function in a, arguably a silo of the institution is the acquisition data. But we were copying CDO homework so it helped in my mind that we can go across to somebody else that would understand and could understand what we're trying to do and help us. And I think it becomes, the CDO community has always been very sharing and collaborative and I hold that true with Micheal today. >> It's kind of the ethos of this event. I mean, obviously you guys have been heavily involved. We've always been thrilled to cover this. I think we started in 2013 and we've seen it grow, it's kind of fire marshal full now. We got to get to a new facility, I understand. >> Fire marshal full. >> Next year. So that's congratulations to all the success. >> Yeah, I think it's important and we've now seen, you know, you hear it, you can read it in every newspaper, every channel out there, that data are important. And what's more important than the factor of governance and the factor of bringing safety and security to the nation? >> I do feel like a lot in, certainly in commercial world, I don't know if it applies in the government, but a lot of these AI projects are moving really fast. Especially in Silicon Valley, there's this move fast and break things mentality. And I think that's part of why you're seeing some of these big tech companies struggle right now because they're moving fast and they're breaking things without the governance injected and many CDOs are not heavily involved in some of these skunk works projects and it's almost like they're bolting on governance which has never been a great formula for success in areas like governance and compliance and security. You know, the philosophy of designing it in has tangible benefits. I wonder if you could comment on that? >> Yeah, I can talk about it as we think about it in our space and it may be limited. AI is a bit high on the hype curve as you might imagine right now, and the question would be is can it solve a problem that you have? Well, you just can't buy a piece of software or a methodology and have it solve a problem if you don't know what problem you're trying to solve and you wouldn't understand the answer when it gave it to you. And I think we have to raise our data intellectualism across the organization to better work with these products because they certainly represent utility but it's not like you give it with no fences on either side or you open up your aperture to find basic solution on this. How you move forward with it is your workforce has got to be in tune with that, you have to understand some of the data, at least the basics, and particularly with products when you get the machine learning AI deep learning, the models are going to be moving so fast that you have to intellectually understand them because you'll never be able to go all the way back and stubby pencil back to an answer. And if you don't have the skills and the math and the understanding of how these things are put together, it may not bring the value that they can bring to us. >> Mark, thanks very much for coming on The Cube. >> Thank you very much. >> Great to see you again and appreciate all the work you guys both do for the community. All right. And thank you for watching. We'll be right back with our next guest right after this short break. You're watching The Cube from MIT CDOIQ.

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. Good to see you again, thanks for coming on. and be efficient in the data management. And finally, sometimes you have to make another part of the organization you partner with, or? and involve and awful lot of data in the a buildup And as I said, the partnership with the services and having just the general description of that, in the IT world or the data world And one of the challenges that I think but you got to serve the mission of the organization. But at the same time, you've got stakeholders and the decisions across the organization How does that change what you guys do? In that discussion that you just presented us, and you had the nirvana of data quality. rethinking of how to do this now that you have data, So it spurs, it changes the model of them So, you guys are long-time contributors to the event. About halfway through I think. It was the very beginnings of CDOIQ. in the MIT slenton side of the house. getting hear from the CDOs, getting to hear from visions, and we were thinking about how to do that, you know? It's kind of the ethos of this event. So that's congratulations to all the success. and the factor of bringing safety I don't know if it applies in the government, across the organization to better work with these products all the work you guys both do for the community.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Peter BurrisPERSON

0.99+

Dave VellantePERSON

0.99+

Michael DellPERSON

0.99+

Rebecca KnightPERSON

0.99+

MichaelPERSON

0.99+

ComcastORGANIZATION

0.99+

ElizabethPERSON

0.99+

Paul GillanPERSON

0.99+

Jeff ClarkPERSON

0.99+

Paul GillinPERSON

0.99+

NokiaORGANIZATION

0.99+

SavannahPERSON

0.99+

DavePERSON

0.99+

RichardPERSON

0.99+

MichealPERSON

0.99+

Carolyn RodzPERSON

0.99+

Dave VallantePERSON

0.99+

VerizonORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

Eric SeidmanPERSON

0.99+

PaulPERSON

0.99+

Lisa MartinPERSON

0.99+

GoogleORGANIZATION

0.99+

KeithPERSON

0.99+

Chris McNabbPERSON

0.99+

JoePERSON

0.99+

CarolynPERSON

0.99+

QualcommORGANIZATION

0.99+

AlicePERSON

0.99+

2006DATE

0.99+

JohnPERSON

0.99+

NetflixORGANIZATION

0.99+

AWSORGANIZATION

0.99+

congressORGANIZATION

0.99+

EricssonORGANIZATION

0.99+

AT&TORGANIZATION

0.99+

Elizabeth GorePERSON

0.99+

Paul GillenPERSON

0.99+

Madhu KuttyPERSON

0.99+

1999DATE

0.99+

Michael ConlanPERSON

0.99+

2013DATE

0.99+

Michael CandolimPERSON

0.99+

PatPERSON

0.99+

Yvonne WassenaarPERSON

0.99+

Mark KrzyskoPERSON

0.99+

BostonLOCATION

0.99+

Pat GelsingerPERSON

0.99+

DellORGANIZATION

0.99+

Willie LuPERSON

0.99+

IBMORGANIZATION

0.99+

YvonnePERSON

0.99+

HertzORGANIZATION

0.99+

AndyPERSON

0.99+

2012DATE

0.99+

MicrosoftORGANIZATION

0.99+

Tom Davenport, Babson College | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back >> to M I. T. Everybody watching the Cube, The leader in live tech coverage. My name is Dave Volonte here with Paul Guillen. My co host, Tom Davenport, is here is the president's distinguished professor at Babson College. Huebel? Um, good to see again, Tom. Thanks for coming on. Glad to be here. So, yeah, this is, uh let's see. The 13th annual M I t. Cdo lucky. >> Yeah, sure. As this year. Our seventh. I >> think so. Really? Maybe we'll offset. So you gave a talk earlier? She would be afraid of the machines, Or should we embrace them? I think we should embrace them, because so far, they are not capable of replacing us. I mean, you know, when we hit the singularity, which I'm not sure we'll ever happen, But it's certainly not going happen anytime soon. We'll have a different answer. But now good at small, narrow task. Not so good at doing a lot of the things that we do. So I think we're fine. Although as I said in my talk, I have some survey data suggesting that large U. S. Corporations, their senior executives, a substantial number of them more than half would liketo automate as many jobs as possible. They say. So that's a little scary. But unfortunately for us human something, it's gonna be >> a while before they succeed. Way had a case last year where McDonald's employees were agitating for increasing the minimum wage and tThe e management used the threat of wrote of robotics sizing, hamburger making process, which can be done right to thio. Get them to back down. Are you think we're going to Seymour of four that were maybe a eyes used as a threat? >> Well, I haven't heard too many other examples. I think for those highly structured, relatively low level task, it's quite possible, particularly if if we do end up raising the minimum wage beyond a point where it's economical, pay humans to do the work. Um, but I would like to think that, you know, if we gave humans the opportunity, they could do Maur than they're doing now in many cases, and one of the things I was saying is that I think companies are. Generally, there's some exceptions, but most companies they're not starting to retrain their workers. Amazon recently announced they're going to spend 700,000,000 to retrain their workers to do things that a I and robots can't. But that's pretty rare. Certainly that level of commitment is very rare. So I think it's time for the companies to start stepping up and saying, How can we develop a better combination of humans and machines? >> The work by, you know, brain Nelson and McAfee, which is a little dated now. But it definitely suggests that there's some things to be concerned about. Of course, ultimately there prescription was one of an optimist and education, and yeah, on and so forth. But you know, the key point there is the machines have always replace humans, but now, in terms of cognitive functions, but you see it everywhere you drive to the airport. Now it's Elektronik billboards. It's not some person putting up the kiosks, etcetera, but you know, is you know, you've you've used >> the term, you know, paid the cow path. We don't want to protect the past from the future. All right, so, to >> your point, retraining education I mean, that's the opportunity here, isn't it? And the potential is enormous. Well, and, you know, let's face it, we haven't had much in the way of productivity improvements in the U. S. Or any other advanced economy lately. So we need some guests, you know, replacement of humans by machines. But my argument has always been You can handle innovation better. You can avoid sort of race to the bottom at automation sometimes leads to, if you think creatively about humans and machines working as colleagues. In many cases, you remember in the PC boom, I forget it with a Fed chairman was it might have been, Greenspan said, You can see progress everywhere except in the product. That was an M. I. T. Professor Robert Solow. >> OK, right, and then >> won the Nobel Prize. But then, shortly thereafter, there was a huge productivity boom. So I mean is there may be a pent up Well, God knows. I mean, um, everybody's wondering. We've been spending literally trillions on I t. And you would think that it would have led toe productivity, But you know, certain things like social media, I think reduced productivity in the workplace and you know, we're all chatting and talking and slacking and sewing all over the place. Maybe that's is not conducive to getting work done. It depends what you >> do with that social media here in our business. It's actually it's phenomenal to see political coverage these days, which is almost entirely consist of reprinting politicians. Tweets >> Exactly. I guess it's made life easier for for them all people reporters sitting in the White House waiting for a press conference. They're not >> doing well. There are many reporters left. Where do you see in your consulting work your academic work? Where do you see a I being used most effectively in organizations right now? And where do you think that's gonna be three years from now? >> Well, I mean, the general category of activity of use case is the sort of someone's calling boring I. It's data integration. One thing that's being discussed a lot of this conference, it's connecting your invoices to your contracts to see Did we actually get the stuff that we contracted for its ah, doing a little bit better job of identifying fraud and doing it faster so all of those things are quite feasible. They're just not that exciting. What we're not seeing are curing cancer, creating fully autonomous vehicles. You know, the really aggressive moonshots that we've been trying for a while just haven't succeeded at what if we kind of expand a I is gonna The rumor, trawlers. New cool stuff that's coming out. So considering all these new checks with detective Aye, aye, Blockchain new security approaches. When do you think that machines will be able to make better diagnoses than doctors? Well, I think you know, in a very narrow sense in some cases, that could do it now. But the thing is, first of all, take a radiologist, which is one of the doctors I think most at risk from this because they don't typically meet with patients and they spend a lot of time looking at images. It turns out that the lab experiments that say you know, these air better than human radiologist say I tend to be very narrow, and what one lab does is different from another lab. So it's just it's gonna take a very long time to make it into, you know, production deployment in the physician's office. We'll probably have to have some regulatory approval of it. You know, the lab research is great. It's just getting it into day to day. Reality is the problem. Okay, So staying in this context of digital a sort of umbrella topic, do you think large retail stores roll largely disappeared? >> Uh, >> some sectors more than others for things that you don't need toe, touch and feel, And soon before you're to them. Certainly even that obviously, it's happening more and more on commerce. What people are saying will disappear. Next is the human at the point of sale. And we've been talking about that for a while. In In grocery, Not so not achieve so much yet in the U. S. Amazon Go is a really interesting experiment where every time I go in there, I tried to shoplift. I took a while, and now they have 12 stores. It's not huge yet, but I think if you're in one of those jobs that a substantial chunk of it is automata ble, then you really want to start looking around thinking, What else can I do to add value to these machines? Do you think traditional banks will lose control of the payment system? Uh, No, I don't because the Finn techs that you see thus far keep getting bought by traditional bank. So my guess is that people will want that certainty. And you know, the funny thing about Blockchain way say in principle it's more secure because it's spread across a lot of different ledgers. But people keep hacking into Bitcoin, so it makes you wonder. I think Blockchain is gonna take longer than way thought as well. So, you know, in my latest book, which is called the Aye Aye Advantage, I start out talking by about Tamara's Law, This guy Roy Amara, who was a futurist, not nearly as well known as Moore's Law. But it said, You know, for every new technology, we tend to overestimate its impact in the short run and underestimated Long, long Ryan. And so I think a I will end up doing great things. We may have sort of tuned it out of the time. It actually happens way finally have autonomous vehicles. We've been talking about it for 50 years. Last one. So one of the Democratic candidates of the 75 Democratic ended last night mentioned the chief manufacturing officer Well, do you see that automation will actually swing the pendulum and bring back manufacturing to the U. S. I think it could if we were really aggressive about using digital technologies in manufacturing, doing three D manufacturing doing, um, digital twins of every device and so on. But we are not being as aggressive as we ought to be. And manufacturing companies have been kind of slow. And, um, I think somewhat delinquent and embracing these things. So they're gonna think, lose the ability to compete. We have to really go at it in a big way to >> bring it. Bring it all back. Just we've got an election coming up. There are a lot of concern following the last election about the potential of a I chatbots Twitter chat bots, deep fakes, technologies that obscure or alter reality. Are you worried about what's coming in the next year? And that that >> could never happen? Paul. We could never see anything deep fakes I'm quite worried about. We don't seem. I know there's some organizations working on how we would certify, you know, an image as being really But we're not there yet. My guess is, certainly by the time the election happens, we're going to have all sorts of political candidates saying things that they never really said through deep fakes and image manipulation. Scary? What do you think about the call to break up? Big check. What's your position on that? I think that sell a self inflicted wound. You know, we just saw, for example, that the automobile manufacturers decided to get together. Even though the federal government isn't asking for better mileage, they said, We'll do it. We'll work with you in union of states that are more advanced. If Big Tak had said, we're gonna work together to develop standards of ethical behavior and privacy and data and so on, they could've prevented some of this unless they change their attitude really quickly. I've seen some of it sales force. People are talking about the need for data standard data protection standards, I must say, change quickly. I think they're going to get legislation imposed and maybe get broken up. It's gonna take awhile. Depends on the next administration, but they're not being smart >> about it. You look it. I'm sure you see a lot of demos of advanced A I type technology over the last year, what is really impressed you. >> You know, I think the biggest advances have clearly been in image recognition looking the other day. It's a big problem with that is you need a lot of label data. It's one of the reasons why Google was able to identify cat photos on the Internet is we had a lot of labeled cat images and the Image net open source database. But the ability to start generating images to do synthetic label data, I think, could really make a big difference in how rapidly image recognition works. >> What even synthetic? I'm sorry >> where we would actually create. We wouldn't have to have somebody go around taking pictures of cats. We create a bunch of different cat photos, label them as cat photos have variations in them, you know, unless we have a lot of variation and images. That's one of the reasons why we can't use autonomous vehicles yet because images differ in the rain and the snow. And so we're gonna have to have synthetic snow synthetic rain to identify those images. So, you know, the GPU chip still realizes that's a pedestrian walking across there, even though it's kind of buzzed up right now. Just a little bit of various ation. The image can throw off the recognition altogether. Tom. Hey, thanks so much for coming in. The Cube is great to see you. We gotta go play Catch. You're welcome. Keep right. Everybody will be back from M I t CDO I Q In Cambridge, Massachusetts. Stable, aren't they? Paul Gillis, You're watching the Cube?

Published Date : Jul 31 2019

SUMMARY :

Brought to you by My co host, Tom Davenport, is here is the president's distinguished professor at Babson College. I I mean, you know, when we hit the singularity, Are you think we're going to Seymour of four that were maybe a eyes used as you know, if we gave humans the opportunity, they could do Maur than they're doing now But you know, the key point there is the machines the term, you know, paid the cow path. Well, and, you know, in the workplace and you know, we're all chatting and talking It's actually it's phenomenal to see reporters sitting in the White House waiting for a press conference. And where do you think that's gonna be three years from now? I think you know, in a very narrow sense in some cases, No, I don't because the Finn techs that you see thus far keep There are a lot of concern following the last election about the potential of a I chatbots you know, an image as being really But we're not there yet. I'm sure you see a lot of demos of advanced A But the ability to start generating images to do synthetic as cat photos have variations in them, you know, unless we have

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
McDonaldORGANIZATION

0.99+

Dave VolontePERSON

0.99+

Paul GillisPERSON

0.99+

Roy AmaraPERSON

0.99+

Paul GuillenPERSON

0.99+

Tom DavenportPERSON

0.99+

AmazonORGANIZATION

0.99+

TomPERSON

0.99+

SeymourPERSON

0.99+

700,000,000QUANTITY

0.99+

12 storesQUANTITY

0.99+

GoogleORGANIZATION

0.99+

Robert SolowPERSON

0.99+

PaulPERSON

0.99+

last yearDATE

0.99+

Silicon Angle MediaORGANIZATION

0.99+

Cambridge, MassachusettsLOCATION

0.99+

oneQUANTITY

0.99+

50 yearsQUANTITY

0.99+

U. S.LOCATION

0.99+

Babson CollegeORGANIZATION

0.99+

HuebelPERSON

0.99+

next yearDATE

0.99+

FedORGANIZATION

0.98+

fourQUANTITY

0.98+

DemocraticORGANIZATION

0.98+

more than halfQUANTITY

0.98+

M I. T.PERSON

0.98+

seventhQUANTITY

0.98+

2019DATE

0.98+

Nobel PrizeTITLE

0.97+

McAfeeORGANIZATION

0.97+

GreenspanPERSON

0.97+

TwitterORGANIZATION

0.96+

OneQUANTITY

0.96+

U. S.LOCATION

0.96+

one labQUANTITY

0.96+

RyanPERSON

0.95+

CatchTITLE

0.95+

this yearDATE

0.95+

last nightDATE

0.94+

Big TakORGANIZATION

0.87+

ProfessorPERSON

0.84+

Aye Aye AdvantageTITLE

0.84+

75QUANTITY

0.84+

Amazon GoORGANIZATION

0.81+

U.ORGANIZATION

0.78+

MaurPERSON

0.77+

trillionsQUANTITY

0.76+

NelsonORGANIZATION

0.73+

TamaraPERSON

0.71+

one of the reasonsQUANTITY

0.71+

White HouseORGANIZATION

0.69+

Big checkORGANIZATION

0.69+

LawTITLE

0.67+

three yearsQUANTITY

0.66+

M I t. CdoEVENT

0.66+

MPERSON

0.65+

MoorePERSON

0.59+

13th annualQUANTITY

0.58+

firstQUANTITY

0.57+

LastQUANTITY

0.54+

AyePERSON

0.52+

MIT CDOIQORGANIZATION

0.51+

M.PERSON

0.48+

FinnORGANIZATION

0.45+

CubeTITLE

0.41+

Lisa Ehrlinger, Johannes Kepler University | MIT CDOIQ 2019


 

>> From Cambridge, Massachusetts, it's theCUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. >> Hi, everybody, welcome back to Cambridge, Massachusetts. This is theCUBE, the leader in tech coverage. I'm Dave Vellante with my cohost, Paul Gillin, and we're here covering the MIT Chief Data Officer Information Quality Conference, #MITCDOIQ. Lisa Ehrlinger is here, she's the Senior Researcher at the Johannes Kepler University in Linz, Austria, and the Software Competence Center in Hagenberg. Lisa, thanks for coming in theCUBE, great to see you. >> Thanks for having me, it's great to be here. >> You're welcome. So Friday you're going to lay out the results of the study, and it's a study of Data Quality Tools. Kind of the long tail of tools, some of those ones that may not have made the Gartner Magic Quadrant and maybe other studies, but talk about the study and why it was initiated. >> Okay, so the main motivation for this study was actually a very practical one, because we have many company projects with companies from different domains, like steel industry, financial sector, and also focus on automotive industry at our department at Johannes Kepler University in Linz. We have experience with these companies for more than 20 years, actually, in this department, and what reoccurred was the fact that we spent the majority of time in such big data projects on data quality measurement and improvement tasks. So at some point we thought, okay, what possibilities are there to automate these tasks and what tools are out there on the market to automate these data quality tasks. So this was actually the motivation why we thought, okay, we'll look at those tools. Also, companies ask us, "Do you have any suggestions? "Which tool performs best in this-and-this domain?" And I think this study answers some questions that have not been answered so far in this particular detail, in these details. For example, Gartner Magic Quadrant of Data Quality Tools, it's pretty interesting but it's very high-level and focusing on some global windows, but it does not look on the specific measurement functionalities. >> Yeah, you have to have some certain number of whatever, customers or revenue to get into the Magic Quadrant. So there's a long tail that they don't cover. But talk a little bit more about the methodology, was it sort of you got hands-on or was it more just kind of investigating what the capabilities of the tools were, talking to customers? How did you come to the conclusions? >> We actually approached this from a very scientific side. We conducted a systematic search, which tools are out there on the market, not only industrial tools, but also open-sourced tools were included. And I think this gives a really nice digest of the market from different perspectives, because we also include some tools that have not been investigated by Gartner, for example, like more BTQ, Data Quality, or Apache Griffin, which has really nice monitoring capabilities, but lacks some other features from these comprehensive tools, of course. >> So was the goal of the methodology largely to capture a feature function analysis of being able to compare that in terms of binary, did it have it or not, how robust is it? And try to develop a common taxonomy across all these tools, is that what you did? >> So we came up with a very detailed requirements catalog, which is divided into three fields, like the focuses on data profiling to get a first insight into data quality. The second is data quality management in terms of dimensions, metrics, and rules. And the third part is dedicated to data quality monitoring over time, and for all those three categories, we came up with different case studies on a database, on a test database. And so we conducted, we looked, okay, does this tool, yes, support this feature, no, or partially? And when partially, to which extent? So I think, especially on the partial assessment, we got a lot into detail in our survey, which is available on Archive online already. So the preliminary results are already online. >> How do you find it? Where is it available? >> On Archive. >> Archive? >> Yes. >> What's the URL, sorry. Archive.com, or .org, or-- >> Archive.org, yeah. >> Archive.org. >> But actually there is a ID I have not with me currently, but I can send you afterwards, yeah. >> Yeah, maybe you can post that with the show notes. >> We can post it afterwards. >> I was amazed, you tested 667 tools. Now, I would've expected that there would be 30 or 40. Where are all of these, what do all of these long tail tools do? Are they specialized by industry or by function? >> Oh, sorry, I think we got some confusion here, because we identified 667 tools out there on the market, but we narrowed this down. Because, as you said, it's quite impossible to observe all those tools. >> But the question still stands, what is the difference, what are these very small, niche tools? What do they do? >> So most of them are domain-specific, and I think this really highlights also these very basic early definition about data quality, of like data qualities defined as fitness for use, and we can pretty much see it here that we excluded the majority of these tools just because they assess some specific kind of data, and we just really wanted to find tools that are generally applicable for different kinds of data, for structured data, unstructured data, and so on. And most of these tools, okay, someone came up with, we want to assess the quality of our, I don't know, like geological data or something like that, yeah. >> To what extent did you consider other sort of non-technical factors? Did you do that at all? I mean, was there pricing or complexity of downloading or, you know, is there a free version available? Did you ignore those and just focus on the feature function, or did those play a role? >> So basically the focus was on the feature function, but of course we had to contact the customer support. Especially with the commercial tools, we had to ask them to provide us with some trial licenses, and there we perceived different feedback from those companies, and I think the best comprehensive study here is definitely Gartner Magic Quadrant for Data Quality Tools, because they give a broad assessment here, but what we also highlight in our study are companies that have a very open support and they are very willing to support you. For example, Informatica Data Quality, we perceived a really close interaction with them in terms of support, trial licenses, and also like specific functionality. Also Experian, our contact from Experian from France was really helpful here. And other companies, like IBM, they focus on big vendors, and here, it was not able to assess these tools, for example, yeah. >> Okay, but the other differences of the Magic Quadrant is you guys actually used the tools, played with them, experienced firsthand the customer experience. >> Exactly, yeah. >> Did you talk to customers as well, or, because you were the customer, you had that experience. >> Yes, I were the customer, but I was also happy to attend some data quality event in Vienna, and there I met some other customers who had experience with single tools. Not of course this wide range we observed, but it was interesting to get feedback on single tools and verify our results, and it matched pretty good. >> How large was the team that ran the study? >> Five people. >> Five people, and how long did it take you from start to finish? >> Actually, we performed it for one year, roughly. The assessment. And I think it's a pretty long time, especially when you see how quick the market responds, especially in the open source field. But nevertheless, you need to make some cut, and I think it's a very recent study now, and there is also the idea to publish it now, the preliminary results, and we are happy with that. >> Were there any surprises in the results? >> I think the main results, or one of the surprises was that we think that there is definitely more potential for automation, but not only for automation. I really enjoyed the keynote this morning that we need more automation, but at the same time, we think that there is also the demand for more declaration. We observed some tools that say, yeah, we apply machine learning, and then you look into their documentation and find no information, which algorithm, which parameters, which thresholds. So I think this is definitely, especially if you want to assess the data quality, you really need to know what algorithm and how it's attuned and give the user, which in most case will be a technical person with technical background, like some chief data officer. And he or she really needs to have the possibility to tune these algorithms to get reliable results and to know what's going on and why, which records are selected, for example. >> So now what? You're presenting the results, right? You're obviously here at this conference and other conferences, and so it's been what, a year, right? >> Yes. >> And so what's the next wave? What's next for you? >> The next wave, we're currently working on a project which is called some Knowledge Graph for Data Quality Assessment, which should tackle two problems in ones. The first is to come up with a semantic representation of your data landscape in your company, but not only the data landscape itself in terms of gathering meta data, but also to automatically improve or annotate this data schema with data profiles. And I think what we've seen in the tools, we have a lot of capabilities for data profiling, but this is usually left to the user ad hoc, and here, we store it centrally and allow the user to continuously verify newly incoming data if this adheres to this standard data profile. And I think this is definitely one step into the way into more automation, and also I think it's the most... The best thing here with this approach would be to overcome this very arduous way of coming up with all the single rules within a team, but present the data profile to a group of data, within your data quality project to those peoples involved in the projects, and then they can verify the project and only update it and refine it, but they have some automated basis that is presented to them. >> Oh, great, same team or new team? >> Same team, yeah. >> Oh, great. >> We're continuing with it. >> Well, Lisa, thanks so much for coming to theCUBE and sharing the results of your study. Good luck with your talk on Friday. >> Thank you very much, thank you. >> All right, and thank you for watching. Keep it right there, everybody. We'll be back with our next guest right after this short break. From MIT CDOIQ, you're watching theCUBE. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and the Software Competence Center in Hagenberg. it's great to be here. Kind of the long tail of tools, Okay, so the main motivation for this study of the tools were, talking to customers? And I think this gives a really nice digest of the market And the third part is dedicated to data quality monitoring What's the URL, sorry. but I can send you afterwards, yeah. Yeah, maybe you can post that I was amazed, you tested 667 tools. Oh, sorry, I think we got some confusion here, and I think this really highlights also these very basic So basically the focus was on the feature function, Okay, but the other differences of the Magic Quadrant Did you talk to customers as well, or, and there I met some other customers and we are happy with that. or one of the surprises was that we think but present the data profile to a group of data, and sharing the results of your study. All right, and thank you for watching.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Lisa EhrlingerPERSON

0.99+

Paul GillinPERSON

0.99+

IBMORGANIZATION

0.99+

Dave VellantePERSON

0.99+

HagenbergLOCATION

0.99+

LisaPERSON

0.99+

ViennaLOCATION

0.99+

LinzLOCATION

0.99+

Five peopleQUANTITY

0.99+

30QUANTITY

0.99+

Johannes Kepler UniversityORGANIZATION

0.99+

40QUANTITY

0.99+

FridayDATE

0.99+

one yearQUANTITY

0.99+

667 toolsQUANTITY

0.99+

FranceLOCATION

0.99+

three categoriesQUANTITY

0.99+

third partQUANTITY

0.99+

Cambridge, MassachusettsLOCATION

0.99+

ExperianORGANIZATION

0.99+

secondQUANTITY

0.99+

two problemsQUANTITY

0.99+

more than 20 yearsQUANTITY

0.99+

GartnerORGANIZATION

0.99+

single toolsQUANTITY

0.99+

SiliconANGLE MediaORGANIZATION

0.98+

firstQUANTITY

0.98+

MIT CDOIQORGANIZATION

0.98+

a yearQUANTITY

0.97+

three fieldsQUANTITY

0.97+

Apache GriffinORGANIZATION

0.97+

Archive.orgOTHER

0.96+

.orgOTHER

0.96+

one stepQUANTITY

0.96+

Linz, AustriaLOCATION

0.95+

oneQUANTITY

0.94+

singleQUANTITY

0.94+

first insightQUANTITY

0.93+

theCUBEORGANIZATION

0.92+

2019DATE

0.92+

this morningDATE

0.91+

BTQORGANIZATION

0.91+

MIT Chief Data Officer andEVENT

0.9+

Archive.comOTHER

0.88+

InformaticaORGANIZATION

0.85+

Software Competence CenterORGANIZATION

0.84+

Information Quality Symposium 2019EVENT

0.81+

MIT Chief Data Officer Information Quality ConferenceEVENT

0.72+

Data QualityORGANIZATION

0.67+

#MITCDOIQEVENT

0.65+

Magic QuadrantCOMMERCIAL_ITEM

0.63+

MagicCOMMERCIAL_ITEM

0.45+

nextEVENT

0.44+

waveEVENT

0.43+

Magic QuadrantORGANIZATION

0.43+

waveDATE

0.41+

MagicTITLE

0.39+

Veda Bawo, Raymond James & Althea Davis, ING Bank | MIT CDOIQ 2019


 

>> From Cambridge Massachusetts, it's the CUBE, covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by silicon angle media. >> Welcome back to Cambridge Massachusetts everybody you're watching the cube. The leader in live tech coverage. The cubes two day coverage of MIT's CDOIQ. The chief data officer information quality event. Thirteenth year we started here in 2013. I'm Dave Vallante with my co-host Paul Gillin. Veda Bawo. Bowo. Bawo. Sorry Veda Bawo is here. Did I get that right? >> That's close enough. >> The director of data governance at Raymond James and Althea Davis the former chief data officer of ING bank challengers and growth markets. Ladies welcome to the cube thanks so much for coming on. >> Thank you. >> Thank you. >> Hi Vita, talk about your role at Raymond James. Relatively new role for you? >> It is a relatively new role. So I recently left fifth third bank as their managing director of data governance and I've moved on to Raymond James in sunny Florida. And I am now the director of data governance for Raymond James. So it's a global financial services company they do asset wealth management, investment banking, retail banking. So I'm excited, I'm very excited about it. >> So we've been talking all day and actually several years about how the chief data officer role kind of emerged from the back office of the data governance. >> Mmm >> And the information quality and now its come you know front and center. And actually we've seen a full circle because now it's all about data quality again. So Althea as the former CDO right is that a fair assessment that it sort of came out of the ashes of the back room. >> Yeah, I mean its definitely a fair assessment. That's where we got started. That's how we got our budgets that's how we got our teams. However, now we have to serve many masters. We have to deal with all of the privacy, we have to deal with the multiple compliancies. We have to deal with the data operations and we have to deal with all of the new, sexy emerging technologies. So to do AI and data science you need a lot of data. You need data rich. You need it to be knowledge management, you need it to be information management. And it needs to be intelligent. So we need to actually raise the bar on what we do and at the same time get the credibility from our sea sweet peers. >> Well I think we no longer have the. We don't have the luxury of being just a cost center anymore . >> No. >> Right, we have to generate revenue. So it's about data monetization. It's about partnering with our businesses to make sure that we're helping to drive strategy and deliver results for the broader organization. >> So you got to hit the bottom line. >> Yeah. >> Either raise revenue or cut costs >> Yeah absolutely >> You know directly that can be tangibly monetized. >> Exactly keep them out of jail. Right. Save money >> That too. >> Save money, make money. (inaudible laughter) keep them out of jail. >> Like both CDO's you do not study for this career path because it didn't exist a few years ago. So talk about your backgrounds and how you came to come into this role Veda. >> Yeah absolutely so you know you talked about you know data kind of starting in the bowels of the back office. So I am that person right. So I am an accountant by training. So I am the person who is non legally entity controllership by book journal entries I've closed the books. I've done regulatory reporting so I know what it feels like to have to deal with dirty data every single month end, every single quarter end right. And I know the pain of having to cleanse it and having to deal with our business partners and having experienced that gave me the passion to want to do better. Right so I want to influence my partners upstream to do better as well as to take away some of the pain points that my teams experiencing over and over again it really was groundhog day. So that really made me feel passionate about going into the data discipline. Right and so you know the benefit is great it's not an easy journey but yeah out of accounting finance and that kind of back office operational support was boring right. A data evangelist and some passionate were about it. >> Which made sense because you have to have quality. >> Absolutely. >> Consistency. You have to have so called single version of the truth. >> Absolutely because you look regularly there's light for the financial reports to be accurate. All the time. (laughter) >> Exactly >> How about you? >> I came at it from a totally different angle. I was a marketeer so I was a business manager, a marketeer I was working with the big retail brands you know the Nikes and the Levi's strauss's of the world. So I came to it from a value chain perspective from marketing you know from rolling out retail chains across Europe. And I went from there as a line management position and all the pains of the different types of data we needed and then did quite a bit of consulting with some of the big consultancies accenture. And then rolled more into the data migration so dealing with those huge change projects and having teams from all of the world. And knowing the pains what all of the guys didn't want to work on. I got it all on my plate. But it put me in position to be a really solid chief data officer. >> Somebody it was called like data chicks or something like that (laughter) and I snuck in I was like the lone >> Data chicks >> I was like the lone data dude >> You can be a data chick. It's okay no judgement here. >> And so one of the things that one of the CDO's said there. She was a woman obviously. And she said you know I think that and the stat was there was a higher proportion of women as CDO's than there were across tech which is like I don't know fifty seventeen percent. And she's positive that the reason was because it's like a thankless job that nobody wants and so I just wonder as woman CDO your thoughts on that is that true. >> Well first of all we're the newest to the table right so you're the new kid on the block it doesn't matter if you're man or woman you're the new kid on the block so you know the CFO's got the four thousand year history behind him or her. The CIO or CTO they've got the fifty, sixty year up on us. So we're new. So you have to calve out your space and I do think that a lot of women by nature like to take on things big. To do things that other people don't want to do. So I can see how women kind of fell into that. But, at the same time you know data it's an asset and it is the newest asset. And it's definitely misunderstood. So I do think that you know women you know we kind of fell into it but it was actually something that happened good for women because there's a big future in data. >> Well let's just be realistic right. Woman have unique skillset. I may be a little bias but we have a unique skillset. We're able to solve problems creatively. Right there's no one size fits all solution for data. There's no accounting pronouncement that tells me how to handle and manage my data. Right I have to kind of figure it out as I go along and pivot when something doesn't work. I think that's something that is very natural to women. >> Yeah. >> I think that contributes to us kind of taking on these roles. >> Can I just do a little survey here (laughter) We hear that the chief data officer of function is defined differently at different organizations. Now you both are in financial services. You both have a chief data function. Are you doing the same thing? (laughter) >> Absolutely not! (laughter) >> You know this is data by design. I mean I'm getting lucky I've had teams that go the whole gammon right so. From the compliancy side through to the data operations through to all of the like I said the exotics, sexy you know emerging technologies stuff with the data scientists. So I've had the whole thing. I've also had my last position at ING bank I had to you know lead a team of chief data officers across three different continents Australia, Asia and also Eastern and Western Europe. So it's totally different than you know maybe another company that they've only got to chief data officer working on data quality and data governance. >> So again another challenge of being the new kid on the block right. Defining roles and responsibilities. There's no one globally, universally accepted definition of what a chief data officer should do. >> Right >> Right is data science in or out are analytics in or out. Right. >> Security sometimes. >> Security right sometimes privacy is it or out. Do you have operational responsibilities or are you truly just a second line governance function right? There's a mixed bag out there in the industry. I don't know that we have one answer that we know for sure is true. But I do know for sure is that data is not an IT function. >> Well okay. That's really important. >> It's not an IT asset. >> Yeah. >> I want to say that it's not an IT asset. It is an information asset or a data asset which is a different asset than an IT asset or a financial asset or a human asset. >> But and that's the other big change is that fifteen. Ten to fifteen years ago data was assumed to be a liability right. >> Mmm. >> Federal rules set up a civil procedure we got to get rid of the data or you know we're going to get sued. Number one and number two is that data because it's digital you know people say data is the new oil. I always say it's not. It's more important than oil. >> It's like blood. >> Oil you can only use in one use case. Data you can reuse over and over again. >> Reuse, reuse perpetual. It goes on and on and on. And every time you reuse it the value increases. So I would agree with you it is not the new oil. It is much bigger than that and it needs to I mean I know from some of my colleagues in the profession. We talk about borrowing from other more mature disciplines to make data management, information management and knowledge management much more robust and be much more professional. We also need to be more professional about it as the data leaders. >> So when you're a little panel today. One of the things that you guys addressed is what keeps the CDO up at night. >> Yes >> I presume it's data. (laughter) >> No, no, no. >> It's our payers that don't get it. (laughter) >> That's what keeps us up at night. >> Its the sponsors that keep us up at night. (laughter) So what was that discussion like? >> So yeah I mean it was a lively discussion. Um, great attendance at the panel so we appreciate everyone who came out and supported. >> Full house. >> Definitely a full house. Great reviews so far. >> Yep. >> Okay, so the thing that definitely keeps folks up at night and I'm going to start with my standard one which is quality. Right you can have all of the fancy tools, right you can have a million data scientists but if the quality is not good or sufficient. Then you're no where. So quality is fundamentally the thing that the CDO has to always pay attention to. And there's no magic you know pill or magic right potion that's going to make the quality right. It's something that the entire organization has a rally around. And it's not a one thing done right it has to be a sustainable approach to making sure the quality is good enough so that you can actually reap the benefits or derive the value right from your data. >> Absolutely and I would say you know following on from the quality and I consider that trustworthiness of the data. I would say as a chief data officer you're coming to the table. You're coming to the executive table you need to bring it all so you need to be impactful. You need to be absolutely relevant to your peers. You also need to be able to make their teams in a position to act. So it needs to be actionable. And if you don't have all of that combination with the trustworthiness you're dead in the water. So it is a hard act and that's why there is a high attrition for chief data officers. You know it's a hard job. But I think it's very much worthwhile because this particular asset this new asset we haven't been able to even scratch the surface of what it could mean for us a society and for commercial organizations or government organizations. >> To your point it's not a technology problem when Mark Ramsay who was surveying the audience this morning. He said you know why have we had so many failures and the first hand that went up said. It's because of relations with the database. >> And I wanted to say it's not a technology problem. >> It's a hearts, minds and haves >> Absolutely. Absolutely. You couldn't make an impact to your data landscape without changing your technology. >> You said at the outset how important it is for you to show a bottom line impact. >> Right >> What's one project you've worked on or that you've led in your tenure that did that. >> If we're talking about for example I can't say specifics but if we're looking at one of institutions I worked at in an insurance firm and we looked at the customer journey. So we worked with some of the different departments that traditionally did not get access to data for them to be able to be effective at their jobs. But they wanted to do in marketing was create actually new products to make you know increase the wallet from the existing customers other things they wanted to do was for example, when there were problems with the customers instead of customer you know leaving you know the journey they were able to bring them back in by getting access to the data. So we either gave them insight like you know looking back to make sure that things didn't happen wrong the next time or we helped them giving them information so they could develop new products so this is all about going to market. So that's absolutely bottom line. It's not just all cost efficiency and products to begin . >> Yeah pipeline. (laughter) >> And that's really valid but you know. >> Absolutely so I'll give you one example where the data organization partnered with our data scientists. To try to figure out the best location for various branches. For that particular institution. And it was taking right trillions of data points right about current footprint as well as other information about geographic information that was out there publicly available. Taking that and using the analytics to figure out okay where should we have our branches, our ATM's etc... and then conslidating the footprint or expanding where appropriate. So that is bottom line impact for sure. >> I remember in the early part of the two thousands I remember reading a Harvard business review article about gut feel trumps data every time. But that's an example where no way. >> Nope. >> You could never do better with the gut than that example that you just gave. >> Absolutely. >> Veda. I want to ask you a question. I don't know if you've heard Mark Ramsays talk this morning but he sort of. He sort of declared that data governance was over. >> Mmm. >> And as the director of data governance >> Never! >> I wondered if you would disagree with that. >> Never! >> Look. >> Were you surprised? >> It's just like saying that I should stop brushing my teeth. Right I always will have to maintain a certain level of data hygiene. And I don't think that employees and executives and organizations have reached a level of maturity where I can trust them to maintain that level of hygiene independently. And therefore I need a governance function. I need to check to make sure you brush your teeth in the morning and in the evening. Right and I need you to go for your annual exam to make sure you don't have any cavities that weren't detected. Right so I think that there's still a role for governance to play. It will evolve over time for sure. Right as you know the landscape changes but I think there's still a role right for like governance. >> And that wasn't my takeaway part. I think he said that basically enterprise data warehouse fail massive data management fail. The single data model failed so we punted to governance and that's not going to solve the enterprise data problem. >> I think it's a one leg in the stool. It's one leg in the stool. ` >> Yeah I think I would really sum it up as a monolithic data storage approach failed. Like that. And then our attention went to data governance but that's not going to solve it either. Look, data management is about twelve different data capabilties it's a discipline so we give the title data governance but it means multiple things. And I think that if we're more educated and we have more confidence on what we're doing on those different areas. Plus information and knowledge management then we're way ahead of the game. I mean knowledge graphs and semantics. That puts companies you know at the top of that you know corporate inequality gap that we're looking at right now. Where you know companies are you know five and thousand times more valuable then their competition and the gap is just going to get bigger considering if some of those companies at the bottom of the gap are you know just keep on doing the same thing. >> I agree I was just trying to get you worked up. (laughter) >> Well you did. >> It's going to be a different kind of show. >> But that point you're making. Microsoft, Apple, Amazon and Google, Facebook. Top five companies in terms of market cap. And they're all data companies. They surpass all the financial services, all the energy companies, all the manufacturers. >> And Alibaba same thing. >> Oh yeah. >> They're doing the same thing. >> They're coming right up there. With four or five hundred billion. >> They're all doing the knowledge approach. They're doing all of this stuff and that's a much more comprehensive approach to looking at it as a full spectrum and if we keep on in the financial industry or any industry keep on just kind of looking at little bits and pieces. It's not going to work. It's a lot of talk but there's no action. >> We are losing right. I know that Fintechs are right fringing upon are territory. Right if Amazon can provide a credit card or lend you money or extend you credit. They're now functioning as a traditional bank would. If we're not paying attention to them as real competitors. We've lost the battle. >> That's a really important point you're making because it's all digital now. >> Absolutely. >> You used to be you'd never see companies traverse industries and now you see it Apple pay and Amazon and healthcare. >> Yeah. >> And government organizations teaming up with corporations and individuals. Everything is free flowing so that means the knowledge and the data and the information also needs to flow freely but it needs to be managed. >> Now you're into a whole realm of privacy and security. >> And regulations right. Regulations for the non right traditional banks. So we're doing banking transactions. >> Do you think traditional banks will lose control over the payment systems? >> If they don't move with the time they will. If they don't. I mean it's not something that's going to happen tomorrow but you know there is a category of bank called Challenger banks so there's a reason. You know even within their own niche there's a group of banks. >> I mean not even just payments right. Think about cash transactions like if I do money transfer am I going to my traditional bank to do it or am I going to cashapp. >> I think it's interesting particularly in the retail banking business where you know one banking app looks pretty much like other and people don't go to branches anymore and so that brand affinity that used to exist is harder and harder to maintain and I wonder what role does data play in reestablishing that connection. >> Well for me right I get really excited and sometimes annoyed when I can open up my app for my bank and I can see the pie chart of my spending. They're using my data to inform me about my behaviors sometimes a good story, sometimes a bad story. But they're using it to inform me. That's making me more loyal to that particular institution right so I can also link all of my financial accounts in that one institutions app and I can see a full list of all of my credit cards, all of my loans, all of my investments in one stop shopping. That's making me go to their app more often versus the other options that are out there. So I think we can use the data in order to endear the customer source but we have to be smart about it. >> That's the accountant in you. I just refuse to not look. (laughter) >> You can afford to not look. I can't. >> Thank you. >> Thanks for riling us up. >> Alright thank you for watching everybody we'll be right back with our next guest right after this short break. You're watching the cube from MIT in Boston, Cambridge. Right back. (atmospheric music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by silicon angle media. Did I get that right? and Althea Davis the former chief data officer Hi Vita, talk about your role at Raymond James. And I am now the director of data of the data governance. So Althea as the former CDO right is that So to do AI and data science you need a lot of data. We don't have the luxury of being and deliver results for the broader organization. Right. keep them out of jail. you came to come into this role Veda. And I know the pain of having to cleanse it You have to have so called single version of the truth. light for the financial reports to be accurate. So I came to it from a value chain perspective You can be a data chick. And she's positive that the reason was because But, at the same time you know data it's an asset Right I have to kind of figure it out as I go along I think that contributes to us kind of We hear that the chief data officer of function I had to you know lead a team of chief data officers the new kid on the block right. Right is data science in or out are I don't know that we have one answer that we know That's really important. I want to say that it's not an IT asset. But and that's the other big change is that fifteen. we got to get rid of the data or you know Data you can reuse over and over again. So I would agree with you it is not the new oil. One of the things that you guys addressed I presume it's data. It's our payers that don't get it. Its the sponsors that keep us up at night. Um, great attendance at the panel so we appreciate Great reviews so far. the thing that the CDO has to always pay attention to. So it needs to be actionable. and the first hand that went up said. You couldn't make an impact to your data it is for you to show a bottom line impact. or that you've led in your tenure that did that. actually new products to make you know increase (laughter) Absolutely so I'll give you one example I remember in the early part of the two thousands than that example that you just gave. He sort of declared that data governance was over. I need to check to make sure you brush your and that's not going to solve the enterprise data problem. It's one leg in the stool. and the gap is just going to get bigger considering I agree I was just trying to get you worked up. all the energy companies, all the manufacturers. They're coming right up there. It's not going to work. I know that Fintechs are right fringing upon are territory. That's a really important point you're industries and now you see it and the data and the information also needs to Regulations for the non right traditional banks. I mean it's not something that's going to happen tomorrow am I going to my traditional bank to do it banking business where you know one banking app looks and I can see the pie chart of my spending. I just refuse to not look. You can afford to not look. Alright thank you for watching everybody we'll

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Mark RamsayPERSON

0.99+

Paul GillinPERSON

0.99+

MicrosoftORGANIZATION

0.99+

AmazonORGANIZATION

0.99+

FacebookORGANIZATION

0.99+

Veda BawoPERSON

0.99+

AppleORGANIZATION

0.99+

Dave VallantePERSON

0.99+

GoogleORGANIZATION

0.99+

AlibabaORGANIZATION

0.99+

2013DATE

0.99+

EuropeLOCATION

0.99+

ING BankORGANIZATION

0.99+

VitaPERSON

0.99+

fiveQUANTITY

0.99+

fiftyQUANTITY

0.99+

NikesORGANIZATION

0.99+

fourQUANTITY

0.99+

INGORGANIZATION

0.99+

MITORGANIZATION

0.99+

Raymond JamesORGANIZATION

0.99+

Althea DavisPERSON

0.99+

fifty seventeen percentQUANTITY

0.99+

bothQUANTITY

0.99+

two dayQUANTITY

0.99+

Levi'sORGANIZATION

0.99+

one legQUANTITY

0.99+

Mark RamsaysPERSON

0.99+

tomorrowDATE

0.99+

four thousand yearQUANTITY

0.99+

AustraliaLOCATION

0.99+

two thousandsQUANTITY

0.99+

five hundred billionQUANTITY

0.98+

Cambridge MassachusettsLOCATION

0.98+

AsiaLOCATION

0.98+

BawoPERSON

0.98+

todayDATE

0.98+

VedaPERSON

0.98+

singleQUANTITY

0.97+

Western EuropeLOCATION

0.97+

one exampleQUANTITY

0.97+

OneQUANTITY

0.97+

EasternLOCATION

0.97+

oneQUANTITY

0.97+

TenDATE

0.97+

Boston, CambridgeLOCATION

0.97+

Thirteenth yearQUANTITY

0.97+

fifteenDATE

0.96+

second lineQUANTITY

0.96+

a million data scientistsQUANTITY

0.95+

Raymond JamesPERSON

0.95+

five companiesQUANTITY

0.95+

fifteen years agoDATE

0.94+

fifth third bankQUANTITY

0.94+

BowoPERSON

0.94+

sixty yearQUANTITY

0.93+

trillions of data pointsQUANTITY

0.92+

ING bankORGANIZATION

0.92+

one use caseQUANTITY

0.91+

one projectQUANTITY

0.91+

this morningDATE

0.91+

firstQUANTITY

0.9+

thousand timesQUANTITY

0.9+

2019DATE

0.89+

one answerQUANTITY

0.87+

AltheaPERSON

0.86+

ChallengerORGANIZATION

0.85+

one banking appQUANTITY

0.84+

MIT Chief Data Officer andEVENT

0.83+

three different continentsQUANTITY

0.82+

few years agoDATE

0.81+

this morningDATE

0.8+

single versionQUANTITY

0.78+

number twoQUANTITY

0.76+

Information Quality Symposium 2019EVENT

0.75+

HarvardORGANIZATION

0.73+

payTITLE

0.72+

sunny FloridaLOCATION

0.7+

Dr. Stuart Madnick, MIT | MIT CDOIQ 2019


 

>> from Cambridge, Massachusetts. It's the Cube covering M I T. Chief data officer and information quality Symposium 2019. Brought to you by Silicon Angle Media. >> Welcome back to M I. T. In Cambridge, Massachusetts. Everybody. You're watching the cube. The leader in live tech coverage. This is M I t CDO I Q the chief data officer and information quality conference. Someday Volonte with my co host, Paul Galen. Professor Dr Stewart, Mad Nick is here. Longtime Cube alum. Ah, long time professor at M i. T soon to be retired, but we're really grateful that you're taking your time toe. Come on. The Cube is great to see you again. >> It's great to see you again. It's been a long time. She worked together and I really appreciate the opportunity to share our spirits. Hear our mighty with your audience. Well, it's really been fun >> to watch this conference evolved were full and it's really amazing. We have to move to a new venue >> next year. I >> understand. And data we talk about the date explosion all the time, But one of the areas that you're focused on and you're gonna talk about today is his ethics and privacy and data causes so many concerns in those two areas. But so give us the highlight of what you're gonna discuss with the audience today. We'll get into >> one of things that makes it so challenging. It is. Data has so many implications. Tow it. And that's why the issue of ethics is so hard to get people to reach agreement on it. We're talking people regarding medicine and the idea big data and a I so know, to be able to really identify causes you need mass amounts of data. That means more data has to be made available as long as it's Elsa data, not mine. Well, not my backyard. If he really So you have this issue where on the one hand, people are concerned about sharing the data. On the other hand, there's so many valuable things would gain by sharing data and getting people to reach agreement is a challenge. Well, one of things >> I wanted to explore with you is how things have changed you back in the day very familiar with Paul you as well with Microsoft, Department of Justice, justice, FTC issues regarding Microsoft. And it wasn't so much around data was really around browsers and bundling things today. But today you see Facebook and Google Amazon coming under fire, and it's largely data related. Listen, Liz Warren, last night again break up big tech your thoughts on similarities and differences between sort of the monopolies of yesterday and the data monopolies of today Should they be broken up? What do you thought? So >> let me broaden the issue a little bit more from Maryland, and I don't know how the demographics of the audience. But I often refer to the characteristics that millennials the millennials in general. I ask my students this question here. Now, how many of you have a Facebook account in almost every class? Facebook. You realize you've given away a lot of nation about yourself. It it doesn't really occurred to them. That may be an issue. I was told by someone that in some countries, Facebook is very popular. That's how they cordoned the kidnappings of teenagers from rich families. They track them. They know they're going to go to this basketball game of the soccer match. You know exactly what I'm going after it. That's the perfect spot to kidnap them, so I don't know whether students think about the fact that when they're putting things on Facebook than making so much of their life at risk. On the other hand, it makes their life richer, more enjoyable. And so that's why these things are so challenging now, getting back to the issue of the break up of the big tech companies. One of the big challenges there is that in order to do the great things that big data has been doing and the things that a I promises do you need lots of data. Having organizations that can gather it all together in a relatively systematic and consistent manner is so valuable breaking up the tech companies. And there's some reasons why people want to do that, but also interferes with that benefit. And that's why I think it's gonna be looked at real Kim, please, to see not only what game maybe maybe breaking up also what losses of disadvantages we're creating >> for ourselves so example might be, perhaps it makes United States less competitive. Visa VI China, in the area of machine intelligence, is one example. The flip side of that is, you know Facebook has every incentive to appropriate our data to sell ads. So it's not an easy, you know, equation. >> Well, even ads are a funny situation for some people having a product called to your attention that something actually really want. But you never knew it before could be viewed as a feature, right? So, you know, in some case of the ads, could be viewed as a feature by some people. And, of course, a bit of intrusion by other people. Well, sometimes we use the search. Google, right? Looking >> for the ad on the side. No longer. It's all ads. You know >> it. I wonder if you see public public sentiment changing in this respect. There's a lot of concerns, certainly at the legislative level now about misuse of data. But Facebook user ship is not going down. Instagram membership is not going down. Uh, indication is that that ordinary citizens don't really care. >> I know that. That's been my I don't have all the data. Maybe you may have seen, but just anecdotally and talking to people in the work we're doing, I agree with you. I think most people maybe a bit dramatic, but at a conference once and someone made a comment that there has not been the digital Pearl Harbor yet. No, there's not been some event that was just so onerous. Is so all by the people. Remember the day it happened kind of thing. And so these things happen and maybe a little bit of press coverage and you're back on your Facebook. How their instagram account the next day. Nothing is really dramatic. Individuals may change now and then, but I don't see massive changes. But >> you had the Equifax hack two years ago. 145,000,000 records. Capital one. Just this week. 100,000,000 records. I mean, that seems pretty Pearl Harbor ish to me. >> Well, it's funny way we're talking about that earlier today regarding different parts of the world. I think in Europe, the general, they really seem to care about privacy. United States that kind of care about privacy in China. They know they have no privacy. But even in us where they care about privacy, exactly how much they care about it is really an issue. And in general it's not enough to move the needle. If it does, it moves it a little bit about the time when they show that smart TVs could be broken into smart. See, TV sales did not Dutch an inch. Not much help people even remember that big scandal a year ago. >> Well, now, to your point about expects, I mean, just this week, I think Equifax came out with a website. Well, you could check whether or not your credentials were. >> It's a new product. We're where we're compromised. And enough in what has been >> as head mind, I said, My wife says it's too. So you had a choice, you know, free monitoring or $125. So that way went okay. Now what? You know, life goes >> on. It doesn't seem like anything really changes. And we were talking earlier about your 1972 book about cyber security, that many of the principles and you outlined in that book are still valid today. Why are we not making more progress against cybercriminals? >> Well, two things. One thing is you gotta realize, as I said before, the Cave man had no privacy problems and no break in problems. But I'm not sure any of us want to go back to caveman era because you've got to realize that for all these bad things. There's so many good things that are happening, things you could now do, which a smartphone you couldn't even visualize doing a decade or two ago. So there's so much excitement, so much for momentum, autonomous cars and so on and so on that these minor bumps in the road are easy to ignore in the enthusiasm and excitement. >> Well and now, as we head into 2020 affection it was. It was fake news in 2016. Now we've got deep fakes. Get the ability to really use video in new ways. Do you see a way out of that problem? A lot of people looking a Blockchain You wrote an article recently, and Blockchain you think it's on hackable? Well, think again. >> What are you seeing? I think one of things we always talk about when we talk about improving privacy and security and organizations, the first thing is awareness. Most people are really small moment of time, aware that there's an issue and it quickly pass in the mind. The analogy I use regarding industrial safety. You go into almost any factory. You'll see a sign over the door every day that says 520 days, his last industrial accident and then a sub line. Please do not be the one to reset it this year. And I often say, When's the last time you went to a data center? And so assign is at 50 milliseconds his last cyber data breach. And so it needs to be something that is really front, the mind and people. And we talk about how to make awareness activities over companies and host household. And that's one of our major movements here is trying to be more aware because we're not aware that you're putting things at risk. You're not gonna do anything about it. >> Last year we contacted Silicon Angle, 22 leading security experts best in one simple question. Are we winning or losing the war against cybercriminals? Unanimously, they said, we're losing. What is your opinion of that question? >> I have a great quote I like to use. The good news is the good guys are getting better than a firewall of cryptographic codes. But the bad guys are getting batter faster, and there's a lot of reasons for that well on all of them. But we came out with a nautical talking about the docking Web, and the reason why it's fascinating is if you go to most companies if they've suffered a data breach or a cyber attack, they'll be very reluctant to say much about unless they really compelled to do so on the dock, where they love to Brent and reputation. I'm the one who broke in the Capital One. And so there's much more information sharing that much more organized, a much more disciplined. I mean, the criminal ecosystem is so much more superior than the chaotic mess we have here on the good guys side of the table. >> Do you see any hope for that? There are service's. IBM has one, and there are others in a sort of anonymous eyes. Security data enable organizations to share sensitive information without risk to their company. You see any hope on the collaboration, Front >> said before the good guys are getting better. The trouble is, at first I thought there was an issue that was enough sharing going on. It turns out we identified over 120 sharing organizations. That's the good news. And the bad news is 120. So IBM is one and another 119 more to go. So it's not a very well coordinated sharing. It's going just one example. The challenges Do I see any hope in the future? Well, in the more distant future, because the challenge we have is that there'll be a cyber attack next week of some form or shape that we've never seen before and therefore what? Probably not well prepared for it. At some point, I'll no longer be able to say that, but I think the cyber attackers and creatures and so on are so creative. They've got another decade of more to go before they run out of >> Steve. We've got from hacktivists to organized crime now nation states, and you start thinking about the future of war. I was talking to Robert Gates, aboutthe former defense secretary, and my question was, Why don't we have the best cyber? Can't we go in the oven? It goes, Yeah, but we also have the most to lose our critical infrastructure, and the value of that to our society is much greater than some of our adversaries. So we have to be very careful. It's kind of mind boggling to think autonomous vehicles is another one. I know that you have some visibility on that. And you were saying that technical challenges of actually achieving quality autonomous vehicles are so daunting that security is getting pushed to the back burner. >> And if the irony is, I had a conversation. I was a visiting professor, sir, at the University of Niece about a 12 14 years ago. And that's before time of vehicles are not what they were doing. Big automotive tele metrics. And I realized at that time that security wasn't really our top priority. I happen to visit organization, doing really Thomas vehicles now, 14 years later, and this conversation is almost identical now. The problems we're trying to solve. A hider problem that 40 years ago, much more challenging problems. And as a result, those problems dominate their mindset and security issues kind of, you know, we'll get around him if we can't get the cot a ride correctly. Why worry about security? >> Well, what about the ethics of autonomous vehicles? Way talking about your programming? You know, if you're gonna hit a baby or a woman or kill your passengers and yourself, what do you tell the machine to Dio, that is, it seems like an unsolvable problem. >> Well, I'm an engineer by training, and possibly many people in the audience are, too. I'm the kind of person likes nice, clear, clean answers. Two plus two is four, not 3.94 point one. That's the school up the street. They deal with that. The trouble with ethic issues is they don't tend to have a nice, clean answer. Almost every study we've done that has these kind of issues on it. And we have people vote almost always have spread across the board because you know any one of these is a bad decision. So which the bad decision is least bad. Like, what's an example that you used the example I use in my class, and we've been using that for well over a year now in class, I teach on ethics. Is you out of the design of an autonomous vehicle, so you must program it to do everything and particular case you have is your in the vehicle. It's driving around the mountain and Swiss Alps. You go around a corner and the vehicle, using all of senses, realize that straight ahead on the right? Ian Lane is a woman in a baby carriage pushing on to this onto the left, just entering the garage way a three gentlemen, both sides a road have concrete barriers so you can stay on your path. Hit the woman the baby carriage via to the left. Hit the three men. Take a shop, right or shot left. Hit the concrete wall and kill yourself. And trouble is, every one of those is unappealing. Imagine the headline kills woman and baby. That's not a very good thing. There actually is a theory of ethics called utility theory that says, better to say three people than to one. So definitely doing on Kim on a kill three men, that's the worst. And then the idea of hitting the concrete wall may feel magnanimous. I'm just killing myself. But as a design of the car, shouldn't your number one duty be to protect the owner of the car? And so people basically do. They close their eyes and flip a coin because they don't want anyone. Those hands, >> not an algorithmic >> response, doesn't leave. >> I want to come back for weeks before we close here to the subject of this conference. Exactly. You've been involved with this conference since the very beginning. How have you seen the conversation changed since that time? >> I think I think it's changing to Wei first. As you know, this record breaking a group of people are expecting here. Close to 500 I think have registered s o much Clea grown kind of over the years, but also the extent to which, whether it was called big data or call a I now whatever is something that was kind of not quite on the radar when we started, I think it's all 15 years ago. He first started the conference series so clearly has become something that is not just something We talk about it in the academic world but is becoming main stay business for corporations Maur and Maur. And I think it's just gonna keep increasing. I think so much of our society so much of business is so dependent on the data in any way, shape or form that we use it and have >> it well, it's come full circle. It's policy and I were talking at are open. This conference kind of emerged from the ashes of the back office information quality and you say the big date and now a I guess what? It's all coming back to information. >> Lots of data. That's no good. Or that you don't understand what they do with this. Not very healthy. >> Well, doctor Magic. Thank you so much. It's a >> relief for all these years. Really Wanna thank you. Thank you, guys, for joining us and helping to spread the word. Thank you. Pleasure. All right, keep it right, everybody. Paul and >> I will be back at M I t cdo right after this short break. You're watching the cue.

Published Date : Jul 31 2019

SUMMARY :

Brought to you by The Cube is great to see you again. It's great to see you again. We have to move to a new venue I But one of the areas that you're focused on and you're gonna talk about today is his ethics and privacy to be able to really identify causes you need mass amounts of data. I wanted to explore with you is how things have changed you back in the One of the big challenges there is that in order to do the great things that big data has been doing The flip side of that is, you know Facebook has every incentive to appropriate our data to sell ads. But you never knew it before could be viewed as a feature, for the ad on the side. There's a lot of concerns, certainly at the legislative level now about misuse of data. Is so all by the people. I mean, that seems pretty Pearl Harbor ish to me. And in general it's not enough to move the needle. Well, now, to your point about expects, I mean, just this week, And enough in what has been So you had a choice, you know, book about cyber security, that many of the principles and you outlined in that book are still valid today. in the road are easy to ignore in the enthusiasm and excitement. Get the ability to really use video in new ways. And I often say, When's the last time you went to a data center? What is your opinion of that question? Web, and the reason why it's fascinating is if you go to most companies if they've suffered You see any hope on the collaboration, in the more distant future, because the challenge we have is that there'll be a cyber attack I know that you have some visibility on that. And if the irony is, I had a conversation. that is, it seems like an unsolvable problem. But as a design of the car, shouldn't your number one How have you seen the conversation so much of business is so dependent on the data in any way, shape or form that we use it and from the ashes of the back office information quality and you say the big date and now a I Or that you don't understand what they do with this. Thank you so much. to spread the word. I will be back at M I t cdo right after this short break.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Ian LanePERSON

0.99+

Stuart MadnickPERSON

0.99+

Liz WarrenPERSON

0.99+

Paul GalenPERSON

0.99+

IBMORGANIZATION

0.99+

MicrosoftORGANIZATION

0.99+

EuropeLOCATION

0.99+

ChinaLOCATION

0.99+

$125QUANTITY

0.99+

PaulPERSON

0.99+

EquifaxORGANIZATION

0.99+

2016DATE

0.99+

StevePERSON

0.99+

FacebookORGANIZATION

0.99+

Robert GatesPERSON

0.99+

GoogleORGANIZATION

0.99+

Silicon AngleORGANIZATION

0.99+

Silicon Angle MediaORGANIZATION

0.99+

ElsaPERSON

0.99+

fourQUANTITY

0.99+

520 daysQUANTITY

0.99+

StewartPERSON

0.99+

Last yearDATE

0.99+

next yearDATE

0.99+

Cambridge, MassachusettsLOCATION

0.99+

TwoQUANTITY

0.99+

KimPERSON

0.99+

2020DATE

0.99+

50 millisecondsQUANTITY

0.99+

Swiss AlpsLOCATION

0.99+

this weekDATE

0.99+

yesterdayDATE

0.99+

three menQUANTITY

0.99+

14 years laterDATE

0.99+

two years agoDATE

0.99+

a year agoDATE

0.99+

three peopleQUANTITY

0.99+

todayDATE

0.99+

AmazonORGANIZATION

0.99+

twoQUANTITY

0.99+

two thingsQUANTITY

0.99+

one simple questionQUANTITY

0.99+

last nightDATE

0.99+

one exampleQUANTITY

0.99+

InstagramORGANIZATION

0.99+

two areasQUANTITY

0.98+

DioPERSON

0.98+

United StatesLOCATION

0.98+

120QUANTITY

0.98+

next weekDATE

0.98+

firstQUANTITY

0.98+

this yearDATE

0.98+

22 leading security expertsQUANTITY

0.98+

three gentlemenQUANTITY

0.98+

OneQUANTITY

0.98+

1972DATE

0.98+

instagramORGANIZATION

0.98+

FTCORGANIZATION

0.98+

oneQUANTITY

0.97+

100,000,000 recordsQUANTITY

0.97+

MagicPERSON

0.97+

145,000,000 recordsQUANTITY

0.97+

Pearl HarborEVENT

0.97+

40 years agoDATE

0.97+

MarylandLOCATION

0.97+

University of NieceORGANIZATION

0.97+

Department of JusticeORGANIZATION

0.96+

One thingQUANTITY

0.95+

over 120 sharing organizationsQUANTITY

0.95+

next dayDATE

0.95+

12 14 years agoDATE

0.94+

15 years agoDATE

0.93+

an inchQUANTITY

0.93+

first thingQUANTITY

0.93+

one exampleQUANTITY

0.92+

Michael Conlin, US Department of Defense | MIT CDOIQ 2019


 

(upbeat music) >> From Cambridge, Massachusetts, it's the CUBE. Covering MIT Chief Data Officer and Information Quality Symposium 2019. Brought to you by SiliconANGLE Media. (upbeat music) >> Welcome back to MIT in Cambridge Massachusetts everybody you're watching the CUBE the leader in live tech coverage. We go out to the events and extract the signal from the noise we hear at the MIT CDOIQ. It's the MIT Chief Data Officer event the 13th annual event. The CUBE started covering this show in 2013. I'm Dave Vellante with Paul Gillin, my co-host, and Michael Conlin is here as the chief data officer of the Department of Defense, Michael welcome, thank you for coming on. >> Thank you, it's a pleasure to be here. >> So the DoD is, I think it's the largest organization in the world, what does the chief data officer of the DoD do on a day to day basis? >> A range of things because we have a range of challenges at the Department of Defense. We are the single largest organization on the planet. We have the greatest scope and scale and complexity. We have the most dangerous competitors of anybody on the planet, it's not a trivial issue for us. So, I've a range of challenges. Challenges around, how do I lift the overall performance of the department using data effectively? How do I help executives make better decisions faster, using more recent, more common data? More common enterprise data is the expression we use. How do I help them become more sophisticated consumers of data and especially data analytics? And, how do we get to the point where, I can compare performance over here with performance over there, on a common basis? And compared to commercial benchmark? Which is now an expectation for us, and ask are we doing this as well as we should, right across the patch? Knowing, that all that data comes from multiple different places to start with. So we have to overcome all those differences and provide that department wide view. That's the essence of the role. And now with the recent passage of the Foundations for Evidenced-Based Policymaking Act, there are a number of additional expectations that go on top of that, but this is ultimately about improving affordability and performance of the department. >> So overall performance of the organization... >> Overall performance. >> ...as well, and maybe that comes from supporting various initiatives, and making sure you're driving performance on that basis as well. >> It does, but our litmus test is are we enabling the National Defense Strategy to succeed? Only reason to touch data is to enable the National Defense Strategy to be more successful than without it. And so we're always measuring ourselves against that. But it is, can we objectively say we're performing better? Can we objectively say that we are more affordable? In terms of the way we support the National Defense Strategy. >> I'm curious about your motivations for taking on this assignment because your background, as I see, is primarily in the private sector. A year ago you joined the US Department of Defense. A huge set of issues that you're tackling now, why'd you do it? >> So I am a capitalist, like most Americans, and I'm a serial entrepreneur. This was my first opportunity to serve government. And when I looked at it, knowing that I could directly support national defense, knowing that I could make a direct meaningful contribution, let me exercise that spirit of patriotism that many of us have, but we just not found ourselves an opportunity. When this opportunity came along I just couldn't say no to it. There's so much to be done and so much appetite for improvement that I just couldn't walk away for this. Now I've to tell you, when you start you take an oath of office to protect and defend the constitution. I don't know, it's maybe a paragraph or maybe it's two paragraphs. It felt like it took an hour to choke it out, because I was suddenly struck with all of this emotion. >> The gravity of what you were doing. >> Yeah, the gravity of what I'm doing. And that was just a reinforcement of the choice I'd already made, obviously right. But the chance to be the first chief data officer of the entire Department of Defense, just an enormous privilege. The chance to bring commercial sector best practices in and really lift the game of the department, again enormous privilege. There's so many people who could do this, probably better than me. The fact that I got the opportunity I just couldn't say no. Just too important, to many places I could see that we could make things better. I think anybody with a patriotic bone in their body would of jumped at the opportunity. >> That's awesome, I love that congratulations on getting that role and seemingly thrive in it. A big part of preserving that capitalist belief, defending the constitution and the American way, it sounds corny, but... >> It's real. >> I'm a patriot as well, is security. And security and data are intertwined. And just the whole future of warfare is dramatically changing. Can you talk about in a format like this, security, you're thinking on that, the department's thinking on that from a CDO's perspective? >> So as you know we have a number of swimlanes within the department and security is very clear swimlane, it's aligned under our chief information officer, but security is everybody's responsibility, of course. Now the longstanding criticism of security people is that they think they best way to secure anything is to permit nobody to touch it. The clear expectation for me as chief data officer is to make sure that information is shared to the right people as rapidly as possible. And, that's a different philosophy. Now I'm really lucky. Lieutenant General Denis Crall our principal cyber advisor, Dana Deasy our CIO, these people understand how important it is to get information in the right place at the right time, make it rapidly available and secure it every step along the way. We embrace the zero trust mantra. And because we embrace the zero trust mantra we're directly concerned with defending the data itself. And as long as we defend the data and the same mechanisms are the mechanisms we use to let people share it, suddenly the tension goes away. Suddenly we all have the same goal. Because the goal is not to prevent use of data, it's to enable use of data in a secure way. So the traditional tension that might be in that place doesn't exist in the department. Very productive, very professional level of collaboration with those folks in this space. Very sophisticated people. >> When we were talking before we went live you mentioned that the DoD has 10,000 plus operational systems... >> That's correct. >> A portfolio of that magnitude just overwhelming, I mean how did you know what to do first when you moved into this job, or did you have a clear mandate when you were hired? >> So I did have a clear mandate when I was hired and luckily that was spelled out. We knew what to do first because we sat down with actual leaders of the department and asked them what their goals were for improving the performance of the department. And everything starts from that conversation. You find those executives that what to improve performance, you understand what those goals are, and what data they need to manage that improvement. And you capture all the critical business questions they need answers to. From that point on they're bought in to everything that happens, right. Because they want those answers to those critical business questions. They have performance targets of their own, this is now aligned with. And so you have the support you need to go down the rest of the path of finding the data, standardizing it, et cetera. In order to deliver the answers to those questions. But it all starts which either the business mission leaders or the warfighting mission leaders who define the steps they're taking to implement the National Defense Strategy. Everything gets lined up against that, you get instant support and you know you're going after the right thing. This is not, an if you build it they will come. This is not, a driftnet the organization try to gather up all the data. This is spear fishing for specific answers to materially important questions, and everything we do is done on that basis. >> We hear Mark Ramsey this morning talk about the... He showed a picture of stove pipes and then he complicated that picture by showing multiple copies within each of those stove pipes, and says this is organizations that we've all lived in. >> That's my organization too. >> So talk about some of those data challenges at the DoD and how you're addressing those, specifically how you're enabling soldiers in the field to get the right data to the field when they need it. >> So what we'll be delicate when we talk about what we do for soldiers in the field. >> Understood, yeah. >> That tends to be sensitive. >> Understand why, sure. >> But all of those dynamics that Mark described in that presentation are present in every large cooperation I've ever served. And that includes the Department of Defense. That heterogeneity and sprawl of IT that what I would refer to, he showed us a hair ball of IT. Every large organization has a hair ball of IT. And data scattered all over the place. We took many of the same steps that he described in terms of organizing and presenting meaningful answers to questions, in almost exactly the same sequence. The challenge as you heard me use the statistics that our CIO's published digital monetization strategies, which calls out that we have roughly 10,000 operational systems. Well, every one of them is different. Every one's put in place by a different group of people at a different time, with a different set of requirements, and a different budget, and a different focus. You know organizational scope. We're just like he showed. We're trying to blend all that in to a common view. So we have to find what's the real authoritative piece of data, cause it's not all of those systems. It's only a subset of those systems. And you have to do all of the mapping and translations, to make the result add up. Otherwise you double count or you miss something. This is work in progress. This will always be a work in progress to any large organization. So I don't want to give you impression it's all sorted. Definitely not all sorted. But, the reality is we're trying to get to the point where people can see the data that's available and that's a requirement by the way under the Foundations Act that we have a data catalog, an authoritative data catalog so people can see it and they have the ability to then request access to that through automation. This is what's critical, you need to be able to request access and have it arbitraged on the basis of whether you should directly have access based on your role, your workflow, et cetera, but it should happen in real time. You don't want to wait weeks, or months, or however long for some paperwork to move around. So this all has to become highly automated. So, what's the data, who can access it under what policy, for what purpose? Our roles and responsibilities? Identity management? All this is a combined set of solutions that we have to put in place. I'm mostly worried about a subset of that. My colleagues in these other swimlanes are working to do the rest. Most people in the department have access to data they need in their space. That hasn't been a problem. The problem is you go from space to space, you have to learn a new set of systems and a new set of techniques for a new set of data formats which means you have to be retrained. That really limits our freedom of maneuver of human beings. In the ideal world you'd be able to move from any job in any part of the department to the same job in another part of the department with no retraining whatsoever. You'd be instantly able to make a contribution. That's what we're trying to get to. So that's a different kind of a challenge, right. How do we get that level of consistency in the user experience, a modern user experience. So that if I'm a real estate manager, or I'm a medical business manager, or I'm a clinical professional, or I'm whatever, I can go from this location in this part of the department to that location in that part and my experience is the same. It's completely modern, and it's completely consistent. No retraining. >> How much of that challenge pie is people, process and technology? How would you split that opportunity? >> Well everything starts for a process perspective. Because if you automate a bad process, you just make more mistakes in less time at greater costs. Obviously that's not the ideal. But the biggest single challenge is people. It's talent, it's culture. Both on the demand side and on the supply side. If fact a lot of what I talked about in my remarks, was the additional changes we need to put in place to bring people into a more modern approach to data, more modern consumption. And look, we have pockets of excellence. And they can hold their own against any team, any place on the planet. But they are pockets of excellence. And what we're trying to do is raise the entire organization's performance. So it's people, people, and people and then the other stuff. But the products, don't care about (laughs). >> We often here about... >> They're going to change in 12 to 18 months. I'm a technologist, I'm hands on. The products are going to change rapidly, I make no emotional commitment to products. But the people that's a different story. >> Well we know that in the commercial world we often hear that cultural resistance is what sabotages modernization efforts. The DoD is sort of the ultimate top-down organization. It is any easier to get buy-in because the culture is sort of command and control oriented? >> It's hard in the DoD, it's not easier in the DoD. Ultimately people respond to their performance incentives. That's the dirty secrets performance incentives, they work every time. So unless you restructure performance measures and incentives for people their behavior's never going to change. They need to see their personal future in the future you're prescribing. And if they don't see it, you're going to get resistance every time. They're going to do what they believe they're incented to do. Making those changes, cascading those performance measures down, has been difficult because much of the decision-making processes in the department have been based on slow-moving systems and slow-moving data. I mean think about it, our budget planning process was created by Robert McNamara, as the Secretary of Defense. It requires you to plan everything for five years. And it takes more than a year to plan a single year's worth of activities, it's slow-moving. And we have regulation, we have legislation, we're a law-abiding organization, we do what we have to do. All of those things slow things down. And there's a culture of expecting macro-level consensus building. Which means everybody feels they can say no. If everybody can say no, then change becomes peanut butter spread across an organization. When you peanut butter spread across something our size and scale, the layer's pretty thin. So we have the same problem that other organizations have. There is clearly a perception of top-down change and if the Secretary or the Deputy Secretary issue an instruction people will obey it. It just takes some time to work it's way down into all the detailed combinations and permutations. Cause you have to make sophisticated decisions now. How am I going to change for my performance measures for that group to that group? And that takes time and energy and thought. There's a natural sort of pipeline effect in this. So there's real tension I think in between this perception of top-down and people will obey the orders their given. But when you're trying to integrate those changes into a board set of policy and process and people, that takes time and energy. >> And as a result the leaders have to be circumspect about the orders they give because they want to see success. They want to make sure that what they say is actually implemented or it reflects poorly on the organization. >> I think that out leaders are absolutely concerned about accomplishing the outcomes that they set out. And I think that they are rightfully determined to get the change as rapidly as possible. I would not expect them to be circumspect. I would anticipate that they would be firm and clear in the direction that they set and they would set aggressive targets because you need aggressive targets to get aggressively changed outcomes. Now. >> But they would have to choose wisely, they can't just fire off orders and expect everything to be done. I would think that they got to really think about what they want to get done, and put all the wood behind the arrow as you... >> I think that they constantly balance all those considerations. I must say, I did not appreciate before I joined the department the extraordinary caliber of leadership we enjoy. We have people with real insight and experience, and high intellectual horsepower making the decisions in the department. We've been blessed with the continuing stream of them at all of the senior ranks. These people could go anywhere, or do anything that they wanted in the economy and they've chosen to be in the department. And they bring enormous intellectual firepower to bear on challenges. >> Well you mentioned the motivation at the top of the segment, that's largely pretty powerful. >> Yeah, oh absolutely. >> I want to ask you, we have to break, but the organizational structure, you talked about the CIO, actually the responsibility for security within the CIO. >> Sure. >> To whom do you report. What's the organization look like? >> So I report to the Chief Management Officer of the Department of Defense. So if you think about the order of precedents, there's the Secretary of Defense, the Deputy Secretary of Defense and third in order is the Chief Management Officer. I report to the Chief Management Officer. >> As does the CIO, is that right? >> As does the CIO, as does the CIO. And actually this is quite typical in large organizations, that you don't have the CDO and the CIO in the same space because the concerns are very different. They have to collaborate but very different concerns. We used to see CDOs reporting to CIOs that's fallen dramatically in terms of the frequency you see that. Cause we now recognize that's just a failure mode. So you don't want to go down that path. The number one most common reporting relationship is actually to a CEO, the chief executive officer, of an organization. It's all about, what executive is driving performance for the organization? That's the person the CDO should report to. And I'm blessed in that I do find myself reporting to the executive driving organizational improvement. For me, that's a critical thing. That would make the difference between whether I could succeed or whether I'm doomed to fail. >> COO would be common too in a commercial organization. >> Yeah, in certain commercial organizations, it's a COO. It just depends on the nature of the business and their maturity with data. But if you're in the... If data's the business, CDO will report to the CEO. There are other organizations where it'll be the COO or CFO, it just depends on the nature of that business. And in our case I'm quite fortunate. >> Well Michael, thank you for, not only the coming to the CUBE but the service you're providing to the country, we really appreciate your insights and... >> It's a pleasure meeting you. >> It's a pleasure meeting you. All right, keep it right there everybody we'll be right back with our next guest. You're watching the CUBE live from MIT CDOIQ, be right back. (upbeat music)

Published Date : Jul 31 2019

SUMMARY :

Brought to you by SiliconANGLE Media. and Michael Conlin is here as the chief data officer More common enterprise data is the expression we use. and maybe that comes from supporting various initiatives, In terms of the way we support as I see, is primarily in the private sector. I just couldn't say no to it. But the chance to be the first chief data officer defending the constitution and the American way, And just the whole future of warfare Because the goal is not to prevent use of data, you mentioned that the DoD has 10,000 plus This is not, a driftnet the organization and says this is organizations that we've all lived in. enabling soldiers in the field to get the right data for soldiers in the field. in any part of the department to the same job Both on the demand side and on the supply side. But the people that's a different story. The DoD is sort of the ultimate top-down organization. and if the Secretary or the Deputy Secretary And as a result the leaders have to be circumspect about in the direction that they set and they would set behind the arrow as you... the extraordinary caliber of leadership we enjoy. of the segment, that's largely pretty powerful. but the organizational structure, you talked about the CIO, What's the organization look like? of the Department of Defense. dramatically in terms of the frequency you see that. It just depends on the nature of the business to the CUBE but the service you're providing to the country, It's a pleasure meeting you.

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
JimPERSON

0.99+

DavePERSON

0.99+

JohnPERSON

0.99+

JeffPERSON

0.99+

Paul GillinPERSON

0.99+

MicrosoftORGANIZATION

0.99+

DavidPERSON

0.99+

Lisa MartinPERSON

0.99+

PCCWORGANIZATION

0.99+

Dave VolantePERSON

0.99+

AmazonORGANIZATION

0.99+

Michelle DennedyPERSON

0.99+

Matthew RoszakPERSON

0.99+

Jeff FrickPERSON

0.99+

Rebecca KnightPERSON

0.99+

Mark RamseyPERSON

0.99+

GeorgePERSON

0.99+

Jeff SwainPERSON

0.99+

Andy KesslerPERSON

0.99+

EuropeLOCATION

0.99+

Matt RoszakPERSON

0.99+

Frank SlootmanPERSON

0.99+

John DonahoePERSON

0.99+

Dave VellantePERSON

0.99+

Dan CohenPERSON

0.99+

Michael BiltzPERSON

0.99+

Dave NicholsonPERSON

0.99+

Michael ConlinPERSON

0.99+

IBMORGANIZATION

0.99+

MeloPERSON

0.99+

John FurrierPERSON

0.99+

NVIDIAORGANIZATION

0.99+

Joe BrockmeierPERSON

0.99+

SamPERSON

0.99+

MattPERSON

0.99+

Jeff GarzikPERSON

0.99+

CiscoORGANIZATION

0.99+

Dave VellantePERSON

0.99+

JoePERSON

0.99+

George CanuckPERSON

0.99+

AWSORGANIZATION

0.99+

AppleORGANIZATION

0.99+

Rebecca NightPERSON

0.99+

BrianPERSON

0.99+

Dave ValantePERSON

0.99+

NUTANIXORGANIZATION

0.99+

NeilPERSON

0.99+

MichaelPERSON

0.99+

Mike NickersonPERSON

0.99+

Jeremy BurtonPERSON

0.99+

FredPERSON

0.99+

Robert McNamaraPERSON

0.99+

Doug BalogPERSON

0.99+

2013DATE

0.99+

Alistair WildmanPERSON

0.99+

KimberlyPERSON

0.99+

CaliforniaLOCATION

0.99+

Sam GroccotPERSON

0.99+

AlibabaORGANIZATION

0.99+

RebeccaPERSON

0.99+

twoQUANTITY

0.99+