Ferd Scheepers, ING Group | IBM Think 2018

>> Narrator: Live from Las Vegas, It's the CUBE covering IBM Think 2018, brought to you by IBM. >> Welcome back to the CUBE. We are live at the inaugural IBM Think 2018 event. I'm Lisa Martin, with my co-host Dave Vellante, and we are excited to be joined by one of the keynotes at this inaugural event, Ferd Scheepers, the Chief Information Architect from ING Group. Welcome to the CUBE. >> Thank you very much. Pleasure to be here. >> So, you already mentioned you are doing you said six sessions. I know at least one of them is a keynote. >> Correct. >> So, you've been to IBM events before. You're going to be talking in the cloud and data campus as they call it. Tell us, though, about what you have been doing as really one of the leaders for the last five years of ING becoming a data-driven company. And, also tell us what does data-driven mean to ING? >> Sure, so let's start with the latter. What does data-driven mean for ING? There may be different opinions within ING even, but for me, it's very much, we use data, and make it accessible for everybody in the company to help them drive their decision-making, and at the same time, we use that same data also to help our customers get more understanding of what they actually do with ING, and maybe even outside of ING, and use that data to help them get better services from ING at the right point in time with the right quality they can expect to really elevate our service level to our customers, but also drive decision-making internally. So, how do we do that? Well, very much by driving a data architecture, an information architecture, that started about six years ago when we work together with IBM to create something we now call the ING data lake architecture, which was very much about making it possible for us to bring all those data sources that we have in the company together, qualify them with business terms so that people could actually understand what they were, making sure that we came up with a common language across the bank, so that across all those different lines of business, all those countries, we actually had a common understanding of what we meant with, say, customer. I mean, that sounds very natural for a bank to understand what a customer is, but you might have very different definitions, based on where you come from, and which country. >> Okay, so I have to ask you about some of that data mall and that data journey, because the financial services business, it's always been a data business, but a lot of years ago, maybe even still today, many organizations data exists in silos. So, you talked about making data and data sources accessible to everybody in the company, so they could utilize it, but I'm very curious as to how you went about, basically, busting down the silos of data. What did you have to go through to do that, and do you feel like your employees and your customers actually now do have access to that data as you envisioned? >> I would say we're not there yet, we're on a journey, and that journey has been ongoing for about five years. But, the journey very much started by actually creating the architecture, which was the easy part, but then selling the architecture. And, selling the architecture actually means that you have to go to a different stakeholders with very different stories. So, what's in it for them? What's in it for your CIO's? Well, an easier landscape, a lot of automation, where in the past they had to do manual things, being in control, meaning all the risk items to go down. What's in it for the business sides? Well, that well-articulated business meeting around data, that empowerment of actually the business sides to own the data, to be able to say who has access to it, and what they can do with it. So, it was really about selling this architecture with many different presentations, with many different stakeholders, and then actually building this. The most important thing that I've always said to anybody who asked me, "Why is this successful at ING?" We planned something six years ago, and we've been driving this journey continuously for the last six years in that same direction, and that is really the key to it. If you believe you can do this journey, and have a value after a year, and then you're done, it doesn't work that way. It's a long journey. It takes a lot of investments, and it pays off after you've done that investment after many years. >> So, the joke is, of course, that we all hear, that the data lake turns into a data swamp. So, you went into this, thinking about getting value, obviously, out of the data. How did you make the data not stagnate? What kind of challenges do you have in that regard? >> I think that one of the main things that we did when we came up with this whole architecture is to say from day one, it is a data lake that is governance. Even though we didn't use the word that much, because a few years ago, governance may not have been the most popular term to use. But, in essence, it's what we did. Everything that we have in our data lake is identified. It is governed with different levels of governance. When you talk about customer data, you want to know all the different details about: What is a salary? Does an account includes the accrued interest? All these kinds of things. When you start talking about, maybe, log data, it's a lesser level of governance, but for every asset we have in our data lake, we know what it is, who owns it, more or less, high level, what it means, and in a lot of assets the more key assets of the bank, we know in all the details what's there, and that actually makes sure you don't get into a data swamp, 'cause data swamp pretty much is what a lot of companies, that one that said: Data lake equals a loop, equals put in bits and bytes, and then later you can't find it anymore. >> And, those data sets are categorized? >> They are. >> You've auto-categorized them at the point of creation, or use. Is that right, that's automated? >> We have still a lot of manual activities, but we actually more and more trying to automate this, so taking a lot of data discovery tools, where we look at the data the moment we ingest it into the data lake, we try to auto-classify what it means, and actually even tie it into business terms that we've defined. But, it's still partly also a manual thing, because as a bank, you probably have thousands of things that you could describe on a business term level, and we're still growing through that process of actually classifying everything. >> What about the policies associated with that? That, presumably, is automated for retention, or deletion, or movement, or archiving, or? >> Absolutely, yes. >> That's automated, right? >> Absolutely, yes. So, that ties into the business terms, so we do everything on business term level. So the moment we talk about customer, we have a policy that is on customer, or customer name, or whatever. No matter where that physical asset is, and even which kind of technology it is, it is driven all from that policy on the business term level. >> You have published quite a bit with IBM on data lakes. I mentioned that you are speaking at this event. What are some of the key learnings, as you are now in fifth or sixth year of this journey to Dave's question earlier that you can share about how to not turn a data lake into a data swamp, with maintaining a quality in meeting those internal stakeholder needs and expectations? >> One of the complexes that you see in all major organizations, is that we have, like any other technology out there. I mean, even though we're a good friend of IBM, we don't only have IBM technology. One of the challenges that you have is, the moment you go into the different organizations, or units within your company, they all use different technologies and nobody wants to give it up. But, you don't have a choice, because at this moment, and, you know, that might change over the next few years. The only way to be in control of your entire data landscape is to limit yourself in the technologies that you use, and actually to make sure that you drive the governance from a central perspective and use the technology stack, framework, whatever you want to call it that actually ties governance directly into the technology, into that way that you handle the data. If you think you can do that with every technology out there, and it magically all works together, or you want to do the integration, I would advise against that. I think it's way too much of a challenge, and one of the things I'll actually be presenting upon here at this conference is about open meta data. So a way for us to actually start opening this up and bringing meta data, which in essence means governance, to a more heterogeneous landscape, which is one of the major drivers why we're investing in this ourselves. Even though we like the IBM technology, we still, now and then, want to play with tools from other vendors, or maybe with open source technologies, and it needs to add up. It needs to be governed as well. So, this is a major investment for us, and I think this is something that everybody should have a look upon. >> I want to ask you about innovation and governance, 'cause they're kind of counterpoised, in a lot of people minds, but you were hinting earlier that it used to be a bad word, but maybe we could start getting value out of our governance framework. We got a great studio audience, I'm going to be like a broken record to these guys. I've been saying all morning that innovation is going to come from data. You've got a data lake: Machine intelligence or artificial intelligence, and cloud, at scale, whether it's private or public cloud. So, first question is: Governance and innovation. Are they at odds? And, how do you address that? >> So, I would say they're not at odds, but I do think that the moment you start looking at innovation, you need to take governance as something that is always top of mind. Actually, I think that what we've done so far, by investing heavily into a governed data lake, has helped us with being innovative, because the data foundation is there. The moment you want to look at data that you have within the company, if it's well qualified, if it's known, you know the quality of it, you know where it is, it actually makes it way easier to use enough of this technology to work with the data, because you don't have that problem of trying to find where everything is. I think that's been one of the biggest problems with all the innovation projects that I've seen. You start with this great idea, then you bring it into a company, then everybody says, "Ho, ho, ho, ho. "Not with my data." We have all the data together. We know where it is. We know what to use it for, and, we can actually say that the moment people start playing with their data, within a very well-defined set of rules, that's great. The moment we start bringing that innovation to production, we go to the steps to see whether that actually makes sense, whether we want to change the technology, or whether we need to bring a next level of governance in there. But, because we have everything under control, we can way easily actually play with innovation. >> So, governance brings data quality, data quality brings conviction of your decision-making. Okay, I get that. What about the cloud piece? We talked off-camera. Public cloud, not so much. How do you get scale economies, network effects, etcetera? >> One of the challenges that we've been facing is that the moment to start bringing a lot of technology in your own company, and you have to deploy all of that, it's the issue of bringing all that life cycle management into your organization. It's just a challenge. We've got literally I don't know how many teams, we'll say five, six, seven teams that do nothing else but bring life cycle managements, related updates towards our data lake. I love the cloud's idea, that actually all that stuff is taken away by somebody else. They do they updates, they do the life cycle management. I have a clear separation of my compute versus my storage. That's all the good stuff that cloud brings to me. The scalability, the elasticity, all of that stuff. I can't do all that in public cloud. I mean, we have a lot of customer data, we are very, very sensitive, you know, being from Europe, and especially being in the Netherlands. Now, all the privacy of our customers, so we don't want to bring everything to public clouds, but private cloud as it is today, especially with things like what's now being announced as the IBM Cloud Private for data, bringing a lot of those containerized ways of delivering new technologies into our organization. We did a POC with that, from three months to a few hours. That's the kind of stuff I'm looking for, and now also the metering comes in there, and we can start paying for it in a different way, not by just having a license for a product, with a number, of course, but but actually have that dynamic scaling, even in what we pay for. That has really enabled us to do a lot of new things, and that brings a lot of value. >> Can you touch on that business alcem that you just mentioned a minute ago from three months to three hours? Give a little more context there, that was with IBM Cloud Private for data. >> So, what we did, actually, we did the proof of concept together with IBM, where we looked at a product, in this case, just to try it out, which was Data Sage. In the past, when we have a new version of Data Sage, it will take us, literally, months to get that new version in production, even if it's a small fix, because, in all honesty, the way that the different fixes depend upon each other, the complexity of playing through that, it just takes forever, and it never goes right from day one. What we did is we brought the Data Sage containers into our own private cloud, which happens to be called IPC instead of ICP, which led to a lot of confusion during the whole POC, and we managed to show that we could actually bring the containers from IBM into our old cloud environments, and, literally, we could show that we could do an update in hours. That same update, going through normal process of installing it, doing all the different patches there are for each other, with some of them conflicting, testing it, making sure it all works literally, months. It's a huge success for us. >> So, thinking about the the data journey that you went on, if you had a Mulligan? Does Mulligan translate into your native tongue? Do-over, Mulligan, golf term, right? Pot-shot, take another one. If you had a do-over, what would you do differently? What kind of advice would you give to your colleagues? >> I think I wouldn't change a fundamental step in what we did. I think what we did, the journey was okay. What I probably would have done different is actually, two things. One is, there was and still has quite some focus on creating this ING language, which we call the ING Esperanto, which is one thing we need. We need to have a definition that is cross-country, cross-lines of business, and just like a common understanding. But, it has also translated quite a bit into becoming like an attempt to economical data model. I think we should have shied away a little bit from that and kept it at a definition level a little bit more. The second thing that I probably would have done different, is that instead of trying to do a lot of work together only with IBM, I would have probably invited a second partner from day one, just to make sure this is even more of an industry standard thing. We tried to publish together. We've done a lot of work together, but actually I think that everything we've built shouldn't be an ING proprietary thing. It should be something that's open-source, and we're actually doing that now, more and more. A lot of the stuff we've built, we're pushing to open source, which I think is the right way, because at the end of the day, what we've built is plumbing, and a banker's not in the business of plumbing. We're in the business of helping our customers to achieve great things, and all the stuff behind the scenes, all the plumbing, is something I'd rather buy, and get off the shelf, than I build it myself. >> So, last question, I've heard a number of things about what ING has achieved in terms of a lot of operational efficiencies. You mentioned that this is a journey, and that's probably also another key piece for people who want to learn from you that this is something that is going to take time. Last question, though. You mentioned the word control earlier, and how you had to get buy-in from a lot of stakeholders who probably felt very tied lines divisive to their data. Recommendations and advice for truly building a data-driven culture of a company that's several decades old. >> I would say, go to the highest level in your company and make sure your CEO puts this on the messages to the outside world. I think that one of the biggest achievements we had at some point in time is that our CEO, Ralph Hamers, he talks to the world and he says, "We love analytics. "We want to be a technology company, and we think "analytics is one of the most important things we do, "because it's the best way for us to actually "help our customers to be "a step ahead in life from business." The moment you have that message and you explain it that way to the world, nobody within your company will actually say, "This is a bad idea," 'cause if the boss says so, even within a Dutch organization, everybody buys into it. So, I think just go to your the highest manager in your company, get them on board, get them to speak on it publicly, and you're set. >> Well, Ferd, thanks so much for sharing what you have achieved so far at ING in your current role, and for also sharing your recommendations and advice, lessons learned. We appreciate your time. >> Thank you very much. >> And, good luck on your keynote, and all of your other speaking sessions this week. >> Ferd: Thank you, yeah. And, for Dave Vellante, I'm Lisa Martin. You're watching the CUBE, live on day one of the inaugural IBM Think 2018. Stick around. Dave and I will be right back after a short break.

Published Date : Mar 19 2018

SUMMARY :

brought to you by IBM. We are live at the inaugural IBM Think 2018 event. Thank you very much. So, you already mentioned you are doing Tell us, though, about what you have been doing and at the same time, we use that same data also and do you feel like your employees and your customers and that is really the key to it. So, the joke is, of course, that we all hear, the most popular term to use. at the point of creation, or use. the moment we ingest it into the data lake, So the moment we talk about customer, we have a policy What are some of the key learnings, as you are now One of the challenges that you have is, in a lot of people minds, but you were hinting earlier that the moment people start playing with their data, What about the cloud piece? That's all the good stuff that cloud brings to me. that you just mentioned a minute ago of installing it, doing all the different patches the data journey that you went on, We're in the business of helping our customers to and how you had to get buy-in from a lot of stakeholders "analytics is one of the most important things we do, what you have achieved so far and all of your other speaking sessions this week. live on day one of the inaugural IBM Think 2018.

ENTITIES

Entity	Category	Confidence
Lisa Martin	PERSON	0.99+
Dave Vellante	PERSON	0.99+
Ferd Scheepers	PERSON	0.99+
Dave	PERSON	0.99+
Europe	LOCATION	0.99+
IBM	ORGANIZATION	0.99+
ING	ORGANIZATION	0.99+
Ralph Hamers	PERSON	0.99+
Ferd	PERSON	0.99+
five	QUANTITY	0.99+
Netherlands	LOCATION	0.99+
ING Group	ORGANIZATION	0.99+
six	QUANTITY	0.99+
fifth	QUANTITY	0.99+
Las Vegas	LOCATION	0.99+
six sessions	QUANTITY	0.99+
One	QUANTITY	0.99+
two things	QUANTITY	0.99+
three months	QUANTITY	0.99+
one	QUANTITY	0.99+
second partner	QUANTITY	0.99+
three hours	QUANTITY	0.99+
sixth year	QUANTITY	0.99+
today	DATE	0.98+
seven teams	QUANTITY	0.98+
second thing	QUANTITY	0.98+
IBM Think 2018	EVENT	0.98+
six years ago	DATE	0.98+
this week	DATE	0.98+
first question	QUANTITY	0.98+
about five years	QUANTITY	0.97+
a year	QUANTITY	0.95+
day one	QUANTITY	0.95+
thousands of things	QUANTITY	0.94+
a minute ago	DATE	0.94+
about six years ago	DATE	0.91+
one of the keynotes	QUANTITY	0.87+
few years ago	DATE	0.84+
data lake	ORGANIZATION	0.82+
Data lake	ORGANIZATION	0.82+
CUBE	ORGANIZATION	0.81+
Data Sage	TITLE	0.79+
last six years	DATE	0.78+
next few years	DATE	0.78+
lot of years ago	DATE	0.76+
Dutch	OTHER	0.74+
several decades	QUANTITY	0.71+
one of them	QUANTITY	0.7+
last five	DATE	0.61+
hours	QUANTITY	0.6+
years	QUANTITY	0.55+

Recommend Videos

Sentiment Analysis

AWS Comprehend

Search Results for Data Sage: