Stijn Paul Fireside Chat Accessible Data | Data Citizens'21
>>Really excited about this year's data, citizens with so many of you together. Uh, I'm going to talk today about accessible data, because what good is the data. If you can get it into your hands and shop for it, but you can't understand it. Uh, and I'm here today with, uh, bald, really thrilled to be here with Paul. Paul is an award-winning author on all topics data. I think 20 books with 21st on the way over 300 articles, he's been a frequent speaker. He's an expert in future trends. Uh, he's a VP at cognitive systems, uh, over at IBM teachers' data also, um, at the business school and as a champion of diversity initiatives. Paul, thank you for being here, really the conformance, uh, to the session with you. >>Oh, thanks for having me. It's a privilege. >>So let's get started with, uh, our origins and data poll. Um, and I'll start with a little story of my own. So, uh, I trained as an engineer way back when, uh, and, um, in one of the courses we got as an engineer, it was about databases. So we got the stick thick book of CQL and me being in it for the programming. I was like, well, who needs this stuff? And, uh, I wanted to do my part in terms of making data accessible. So essentially I, I was the only book that I sold on. Uh, obviously I learned some hard lessons, uh, later on, as I did a master's in AI after that, and then joined the database research lab at the university that Libra spun off from. Uh, but Hey, we all learned along the way. And, uh, Paula, I'm really curious. Um, when did you awaken first to data? If you will? >>You know, it's really interesting Stan, because I come from the opposite side, an undergrad in economics, uh, with some, uh, information systems research at the higher level. And so I think I was always attuned to what data could do, but I didn't understand how to get at it and the kinds of nuances around it. So then I started this job, a database company, like 27 years ago, and it started there, but I would say the awakening has never stopped because the data game is always changing. Like I look at these epochs that I've been through data. I was a real relational databases thinking third normal form, and then no SQL databases. And then I watch no SQL be about no don't use SQL, then wait a minute. Not only sequel. And today it's really for the data citizens about wait, no, I need SQL. So, um, I think I'm always waking up in data, so I'll call it a continuum if you will. But that was it. It was trying to figure out the technology behind driving analytics in which I took in school. >>Excellent. And I fully agree with you there. Uh, every couple of years they seem to reinvent new stuff and they want to be able to know SQL models. Let me see. I saw those come and go. Uh, obviously, and I think that's, that's a challenge for most people because in a way, data is a very abstract concepts, um, until you get down in the weeds and then it starts to become really, really messy, uh, until you, you know, from that end button extract a certain insights. Um, and as the next thing I want to talk about with you is that challenging organizations, we're hearing a lot about data, being valuable data, being the new oil data, being the new soil, the new gold, uh, data as an asset is being used as a slogan all over. Uh, people are investing a lot in data over multiple decades. Now there's a lot of new data technologies, always, but still, it seems that organizations fundamentally struggle with getting people access to data. What do you think are some of the key challenges that are underlying the struggles that mud, that organizations seem to face when it comes to data? >>Yeah. Listen, Stan, I'll tell you a lot of people I think are stuck on what I call their data, acumen curves, and you know, data is like a gym membership. If you don't use it, you're not going to get any value on it. And that's what I mean by accurate. And so I like to think that you use the analogy of some mud. There's like three layers that are holding a lot of organizations back at first is just the amount of data. Now, I'm not going to give you some stat about how many times I can go to the moon and back with the data regenerate, but I will give you one. I found interesting stat. The average human being in their lifetime will generate a petabyte of data. How much data is that? If that was my apple music playlist, it would be about 2000 years of nonstop music. >>So that's some kind of playlist. And I think what's happening for the first layer of mud is when I first started writing about data warehousing and analytics, I would be like, go find a needle in the haystack. But now it's really finding a needle in a stack of needles. So much data. So little time that's level one of mine. I think the second thing is people are looking for some kind of magic solution, like Cinderella's glass slipper, and you put it on her. She turns into a princess that's for Disney movies, right? And there's nothing magical about it. It is about skill and acumen and up-skilling. And I think if you're familiar with the duper, you recall the Hadoop craze, that's exactly what happened, right? Like people brought all their data together and everyone was going to be able to access it and give insights. >>And it teams said it was pretty successful, but every line of business I ever talked to said it was a complete failure. And the third layer is governance. That's actually where you're going to find some magic. And the problem in governance is every client I talked to is all about least effort to comply. They don't want to violate GDPR or California consumer protection act or whatever governance overlooks, where they do business and governance. When you don't lead me separate to comply and try not to get fine, but as an accelerant to your analytics, and that gets you out of that third layer of mud. So you start to invoke what I call the wisdom of the crowd. Now imagine taking all these different people with intelligence about the business and giving them access and acumen to hypothesize on thousands of ideas that turn into hundreds, we test and maybe dozens that go to production. So those are three layers that I think every organization is facing. >>Well. Um, I definitely follow on all the days, especially the one where people see governance as a, oh, I have to comply to this, which always hurts me a little bit, honestly, because all good governance is about making things easier while also making sure that they're less riskier. Um, but I do want to touch on that Hadoop thing a little bit, uh, because for me in my a decade or more over at Libra, we saw it come as well as go, let's say around 2015 to 2020 issue. So, and it's still around. Obviously once you put your data in something, it's very hard to make it go away, but I've always felt that had do, you know, it seemed like, oh, now we have a bunch of clusters and a bunch of network engineers. So what, >>Yeah. You know, Stan, I fell for, I wrote the book to do for dummies and it had such great promise. I think the problem is there wasn't enough education on how to extract value out of it. And that's why I say it thinks it's great. They liked clusters and engineers that you just said, but it didn't drive lineup >>Business. Got it. So do you think that the whole paradigm with the clouds that we're now on is going to fundamentally change that or is just an architectural change? >>Yeah. You know, it's, it's a great comment. What you're seeing today now is the movement for the data lake. Maybe a way from repositories, like Hadoop into cloud object stores, right? And then you look at CQL or other interfaces over that not allows me to really scale compute and storage separately, but that's all the technical stuff at the end of the day, whether you're on premise hybrid cloud, into cloud software, as a service, if you don't have the acumen for your entire organization to know how to work with data, get value from data, this whole data citizen thing. Um, you're not going to get the kind of value that goes into your investment, right? And I think that's the key thing that business leaders need to understand is it's not about analytics for kind of science project sakes. It's about analytics to drive. >>Absolutely. We fully agree with that. And I want to touch on that point. You mentioned about the wisdom of the crowds, the concept that I love about, right, and your organization is a big grout full of what we call data citizens. Now, if I remember correctly from the book of the wisdom of the crowds, there's, there's two points that really, you have to take Canada. What is, uh, for the wisdom of the grounds to work, you have to have all the individuals enabled, uh, for them to have access to the right information and to be able to share that information safely kept from the bias from others. Otherwise you're just biasing the outcome. And second, you need to be able to somehow aggregate that wisdom up to a certain decision. Uh, so as Felix mentioned earlier, we all are United by data and it's a data citizen topic. >>I want to touch on with you a little bit, because at Collibra we look at it as anyone who uses data to do their job, right. And 2020 has sort of accelerated digitization. Uh, but apart from that, I've always believed that, uh, you don't have to have data in your title, like a data analyst or a data scientist to be a data citizen. If I take a look at the example inside of Libra, we have product managers and they're trying to figure out which features are most important and how are they used and what patterns of behavior is there. You have a gal managers, and they're always trying to know the most they can about their specific accounts, uh, to be able to serve as them best. So for me, the data citizen is really in its broadest sense. Uh, anyone who uses data to do their job, does that, does that resonate with you? >>Yeah, absolutely. It reminds me of myself. And to be honest in my eyes where I got started from, and I agree, you don't need the word data in your title. What you need to have is curiosity, and that is in your culture and in your being. And, and I think as we look at organizations to transform and take full advantage of their, their data investments, they're going to need great governance. I guarantee you that, but then you're going to have to invest in this data citizen concept. And the first thing I'll tell you is, you know, that kind of acumen, if you will, as a team sport, it's not a departmental sport. So you need to think about what are the upskilling programs of where we can reach across to the technical and the non-technical, you know, lots and lots of businesses rely on Microsoft Excel. >>You have data citizens right there, but then there's other folks who are just flat out curious about stuff. And so now you have to open this up and invest in those people. Like, why are you paying people to think about your business without giving the data? It would be like hiring Tom Brady as a quarterback and telling him not to throw a pass. Right. And I see it all the time. So we kind of limit what we define as data citizen. And that's why I love what you said. You don't need the word data in your title and more so if you don't build the acumen, you don't know how to bring the data together, maybe how to wrangle it, but where did it come from? And where can you fixings? One company I worked with had 17 definitions for a sales individual, 17 definitions, and the talent team and HR couldn't drive to a single definition because they didn't have the data accurate. So when you start thinking of the data citizen, concept it about enabling everybody to shop for data much. Like I would look for a USB cable on Amazon, but also to attach to a business glossary for definition. So we have a common version of what a word means, the lineage of the data who owns it, who did it come from? What did it do? So bring that all together. And, uh, I will tell you companies that invest in the data, citizen concept, outperform companies that don't >>For all of that, I definitely fully agree that there's enough research out there that shows that the ones who are data-driven are capturing the most markets, but also capturing the most growth. So they're capturing the market even faster. And I love what you said, Paul, about, um, uh, the brains, right? You've already paid for the brains you've already invested in. So you may as well leverage them. Um, you may as well recognize and, and enable the data citizens, uh, to get access to the assets that they need to really do their job properly. That's what I want to touch on just a little bit, if, if you're capable, because for me, okay. Getting access to data is one thing, right? And I think you already touched on a few items there, but I'm shopping for data. Now I have it. I have a cul results set in my hands. Let's say, but I'm unable to read and write data. Right? I don't know how to analyze it. I don't know maybe about bias. Uh, maybe I, I, I don't know how to best visualize it. And maybe if I do, maybe I don't know how to craft a compelling persuasion narrative around it to change my bosses decisions. So from your viewpoint, do you think that it's wise for companies to continuously invest in data literacy to continuously upgrade that data citizens? If you will. >>Yeah, absolutely. Forest. I'm going to tell you right now, data literacy years are like dog years stage. So fast, new data types, new sources of data, new ways to get data like API APIs and microservices. But let me take it away from the technical concept for a bit. I want to talk to you about the movie. A star is born. I'm sure most of you have seen it or heard it Bradley Cooper, lady Gaga. So everyone knows the movie. What most people probably don't know is when lady Gaga teamed up with Bradley Cooper to do this movie, she demanded that he sing everything like nothing could be auto-tuned everything line. This is one of the leading actors of Hollywood. They filmed this remake in 42 days and Bradley Cooper spent 18 months on singing lessons. 18 months on a guitar lessons had a voice coach and it's so much and so forth. >>And so I think here's the point. If one of the best actors in the world has to invest three and a half years for 42 days to hit a movie out of the park. Why do we think we don't need a continuous investment in data literacy? Even once you've done your initial training, if you will, over the data, citizen, things are going to change. I don't, you don't. If I, you Stan, if you go to the gym and workout every day for three months, you'll never have to work out for the rest of your life. You would tell me I was ridiculous. So your data literacy is no different. And I will tell you, I have managed thousands of individuals, some of the most technical people around distinguished engineers, fellows, and data literacy comes from curiosity and a culture of never ending learning. That is the number one thing to success. >>And that curiosity, I hire people who are curious, I'll give you one more story. It's about Mozart. And this 21 year old comes to Mozart and he says, Mozart, can you teach me how to compose a symphony? And Mozart looks at this person that says, no, no, you're too young, too young. You compose your fourth symphony when you were 12 and Mozart looks at him and says, yeah, but I didn't go around asking people how to compose a symphony. Right? And so the notion of that story is curiosity. And those people who show up in always want to learn, they're your home run individuals. And they will bring data literacy across the organization. >>I love it. And I'm not going to try and be Mozart, but you know, three and a half years, I think you said two times, 18 months, uh, maybe there's hope for me yet in a singing, you'll be a good singer. Um, Duchy on the, on the, some of the sports references you've made, uh, Paul McGuire, we first connected, uh, I'm not gonna like disclose where you're from, but, uh, I saw he did come up and I know it all sorts of sports that drive to measure everything they can right on the field of the field. So let's imagine that you've done the best analysis, right? You're the most advanced data scientists schooled in the classics, as well as the modernist methods, the best tools you've made a beautiful analysis, beautiful dashboards. And now your coach just wants to put their favorite player on the game, despite what you're building to them. How do you deal with that kind of coaches? >>Yeah. Listen, this is a great question. I think for your data analytics strategy, but also for anyone listening and watching, who wants to just figure out how to drive a career forward? I would give the same advice. So the story you're talking about, indeed hockey, you can figure out where I'm from, but it's around the Ottawa senators, general manager. And he made a quote in an interview and he said, sometimes I want to punch my analytics, people in the head. Now I'm going to tell you, that's not a good culture for analytics. And he goes on to say, they tell me not to play this one player. This one player is very tough. You know, throws four or five hits a game. And he goes, I'd love my analytics people to get hit by bore a wacky and tell me how it feels. That's the player. >>Sure. I'm sure he hits hard, but here's the deal. When he's on the ice, the opposing team gets more shots on goal than the senators do on the opposing team. They score more goals, they lose. And so I think whenever you're trying to convince a movement forward, be it management, be it a project you're trying to fund. I always try to teach something that someone didn't previously know before and make them think, well, I never thought of it that way before. And I think the great opportunity right now, if you're trying to get moving in a data analytics strategy is around this post COVID era. You know, we've seen post COVID now really accelerate, or at least post COVID in certain parts of the world, but accelerate the appetite for digital transformation by about half a decade. Okay. And getting the data within your systems, as you digitize will give you all kinds of types of projects to make people think differently than the way they thought before. >>About data. I call this data exhaust. I'll give you a great example, Uber. I think we're all familiar with Uber. If we all remember back in the days when Uber would offer you search pricing. Okay? So basically you put Uber on your phone, they know everything about you, right? Who are your friends, where you going, uh, even how much batteries on your phone? Well, in a data science paper, I read a long time ago. They recognize that there was a 70% chance that you would accept a surge price. If you had less than 10% of your battery. So 10% of battery on your phone is an example of data exhaust all the lawns that you generate on your digital front end properties. Those are logs. You can take those together and maybe show executive management with data. We can understand why people abandoned their cart at the shipping phase, or what is the amount of shipping, which they abandoned it. When is the signal when our systems are about to go to go down. So, uh, I think that's a tremendous way. And if you look back to the sports, I mean the Atlanta Falcons NFL team, and they monitor their athletes, sleep performance, the Toronto Raptors basketball, they're running AI analytics on people's personalities and everything they tweet and every interview to see if the personality fits. So in sports, I think athletes are the most important commodity, if you will, or asset a yet all these teams are investing in analytics. So I think that's pretty telling, >>Okay, Paul, it looks like we're almost out of time. So in 30 seconds or less, what would you recommend to the data citizens out there? >>Okay. I'm going to give you a four tips in 30 seconds. Number one, remember learning never ends be curious forever. You'll drive your career. Number two, remember companies that invest in analytics and data, citizens outperform those that don't McKinsey says it's about 1.4 times across many KPIs. Number three, stop just collecting the dots and start connecting them with that. You need a strong governance strategy and that's going to help you for the future because the biggest thing in the future is not going to be about analytics, accuracy. It's going to be about analytics, explainability. So accuracy is no longer going to be enough. You're going to have to explain your decisions and finally stay positive and forever test negative. >>Love it. Thank you very much fall. Um, and for all the data seasons is out there. Um, when it comes down to access to data, it's more than just getting your hands on the data. It's also knowing what you can do with it, how you can do that and what you definitely shouldn't be doing with it. Uh, thank you everyone out there and enjoy your learning and interaction with the community. Stay healthy. Bye-bye.
SUMMARY :
If you can get it into your hands and shop for it, but you can't understand it. It's a privilege. Um, when did you awaken first to data? And so I think I was always attuned to what data could do, but I didn't understand how to get Um, and as the next thing I want to talk about with you is And so I like to think that you use And I think if you're familiar with the duper, you recall the Hadoop craze, And the problem in governance is every client I talked to is Obviously once you put your They liked clusters and engineers that you just said, So do you think that the whole paradigm with the clouds that And then you look at CQL or other interfaces over that not allows me to really scale you have to have all the individuals enabled, uh, uh, you don't have to have data in your title, like a data analyst or a data scientist to be a data citizen. and I agree, you don't need the word data in your title. And so now you have to open this up and invest in those people. And I think you already touched on a few items there, but I'm shopping for data. I'm going to tell you right now, data literacy years are like dog years I don't, you don't. And that curiosity, I hire people who are curious, I'll give you one more story. And I'm not going to try and be Mozart, but you know, And he goes on to say, they tell me not to play this one player. And I think the great opportunity And if you look back to the sports, what would you recommend to the data citizens out there? You need a strong governance strategy and that's going to help you for the future thank you everyone out there and enjoy your learning and interaction with the community.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Paul | PERSON | 0.99+ |
Toronto Raptors | ORGANIZATION | 0.99+ |
Paula | PERSON | 0.99+ |
Paul McGuire | PERSON | 0.99+ |
Uber | ORGANIZATION | 0.99+ |
17 definitions | QUANTITY | 0.99+ |
Tom Brady | PERSON | 0.99+ |
Mozart | PERSON | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
Bradley Cooper | PERSON | 0.99+ |
70% | QUANTITY | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
18 months | QUANTITY | 0.99+ |
30 seconds | QUANTITY | 0.99+ |
20 books | QUANTITY | 0.99+ |
12 | QUANTITY | 0.99+ |
hundreds | QUANTITY | 0.99+ |
42 days | QUANTITY | 0.99+ |
fourth symphony | QUANTITY | 0.99+ |
two times | QUANTITY | 0.99+ |
three months | QUANTITY | 0.99+ |
Atlanta Falcons | ORGANIZATION | 0.99+ |
lady Gaga | PERSON | 0.99+ |
Bradley Cooper | PERSON | 0.99+ |
Stan | PERSON | 0.99+ |
2020 | DATE | 0.99+ |
10% | QUANTITY | 0.99+ |
21st | QUANTITY | 0.99+ |
today | DATE | 0.99+ |
one player | QUANTITY | 0.99+ |
CQL | TITLE | 0.99+ |
Cinderella | PERSON | 0.99+ |
second thing | QUANTITY | 0.99+ |
GDPR | TITLE | 0.99+ |
two points | QUANTITY | 0.99+ |
Felix | PERSON | 0.99+ |
dozens | QUANTITY | 0.99+ |
three and a half years | QUANTITY | 0.99+ |
single definition | QUANTITY | 0.99+ |
thousands | QUANTITY | 0.99+ |
second | QUANTITY | 0.99+ |
four | QUANTITY | 0.99+ |
less than 10% | QUANTITY | 0.98+ |
Collibra | ORGANIZATION | 0.98+ |
first | QUANTITY | 0.98+ |
third layer | QUANTITY | 0.98+ |
three layers | QUANTITY | 0.98+ |
one | QUANTITY | 0.98+ |
2015 | DATE | 0.98+ |
about 2000 years | QUANTITY | 0.98+ |
Canada | LOCATION | 0.98+ |
California consumer protection act | TITLE | 0.98+ |
four tips | QUANTITY | 0.97+ |
Disney | ORGANIZATION | 0.97+ |
third | QUANTITY | 0.97+ |
SQL | TITLE | 0.97+ |
Microsoft | ORGANIZATION | 0.97+ |
this year | DATE | 0.96+ |
Hollywood | ORGANIZATION | 0.96+ |
one more story | QUANTITY | 0.96+ |
over 300 articles | QUANTITY | 0.94+ |
27 years ago | DATE | 0.94+ |
one thing | QUANTITY | 0.94+ |
a decade | QUANTITY | 0.94+ |
Duchy | PERSON | 0.93+ |
level one | QUANTITY | 0.92+ |