Jitesh Ghai, Informatica | Informatica World 2019
>> Live from Las Vegas, it's theCUBE. Covering Informatica World 2019, brought to you by Informatica. >> Welcome back everyone to theCUBE's live coverage of Informatica World here in Las Vegas. I'm your host, Rebecca Knight along with my co-host John Furrier, we are joined by Jitesh Ghai, he is the Senior Vice President and General Manager Data Quality, Security and Governance at Informatica. Thank you so much for coming or returning to the show Jitesh. >> My pleasure, happy to be here. >> So, this is a real moment for data governance, we have the anniversary of GDPR and the California Privacy Act it's a topic at Dabos, there is growing concern among the public and lawmakers over security and privacy, give us the lay of the land from your perspective. >> Right, you know it is a moment for data governance, what's exciting in the space is governance was born out of risk and compliance and managing for risk and compliance, but really what it was mandating was healthy data management practices, how do we give the regulators comfort that our data is of high quality, that we know the lineage of where data is coming from that we know how the business relies on the data what is critical data? And while it was born to give the regulators comfort, what organizations very quickly realized is well when you democratize data, you need to give everybody that comfort, you need to give your data scientists, your data analysts, that same level of contextual understanding of their data right, where did it come from? What's the quality of it? How does the business use it, rely on it? And so that has been a tremendous opportunity for us, we've supported organizations, financial services from a BCBS 239 CCAR, counterparty credit risk, but what's happened is from a data democratization, data scale perspective, self-service analytics perspective, is what moved from terabytes to petabytes. We've moved from data warehouses, to data lakes and you can't democratize data unless there's a governed framework. I don't know, it sounds kind of like wait, democratizing data is supposed to be free data everywhere, but without some governed framework, it's a bit of a mess, and so what we're enabling organizations is the effective consumption and understanding of where their data is, discovering it, so that the right people can consume the data that they care about, the right data scientists can build the right models, the right analysts can build the right reports and the executives get the right confidence on what reports they're getting, what KPI's they're getting. >> One of the things that we talked last year, you had a couple customers on, you had told a great story, you guys had had the benefit as a long-standing company, 25 years in the private for large-customer base, but the markets changed, you mentioned governance I mean we're in the one year-anniversary of GDPR. >> Right. >> And I think everyone's kind of like OK what happened last year? More privacy laws are coming and one of the themes this year is clarity with data, but also in the industry you know access to data, making data addressable, because AI needs data sets, cloud has proven that, SAS business models, using data winning formula, that's clear if you're born in the cloud. Enterprises now want that same kind of SAS-like execution on the applications side, whether it's SAS or using AI for instance, >> Right. >> So when you have more regulation, inherent nature is to oh like more complexity, how are customers dealing with the complexity of this, because they want to free it up, but at the same time they want to make sure that they can respect the laws for individuals, but also governments aren't that smart either so you know, the balance there, what's the strategy? >> And therein lies the challenges with privacy specifically, it's not just about quality counterparty credit risk in like five or seven systems in a data warehouse, it's all the data in your enterprise, it's the data in production, there's the data in your DevOps environment, it's all your data literally, structured all the way to unstructured data like Word, PDFs, Powerpoints. And you need a governing framework around it, you need to enable organizations to be able to discover where is there sensitive information, how is there sensitive information proliferating through the organization? Is it protected? Is it not protected? And what's particularly, you know, we're all consumers, I'm pretty confident some or all of our data has been breached at some point, enabling organizations, what these privacy regulations are doing is they are giving us, as individuals, rights to go to the organizations we transact with and ask them, what are you doing with out data? Forget my data or at least tell me how you're processing it and get my consent for the data. >> Yeah, I mean policy and business models are certainly driving that and with regulation, I see that, but the question is that when you move the impact to the enterprise, you got storage drives. You store it on drives as a storage administrator you've got software abstractions with data, like you guys do. So, it's complicated, so the question is, for you, is what are customers doing now? What's the answer to all this? >> The answer really comes down to you need to scale to the scope of the problem, it's a thousand x-increase, you're going from terabytes to petabytes right? And so, you need an AI, an ML, an intelligent solution that can discover all of this information, but it can map it to John Furrier, this is where John Furrier's information is, it's in the human capital management system, the CRM system, organizations know, may start knowing whether sensitive data is, but hey don't know who it belongs to, so when you go to invoke your right to be forgotten or portability, today, what we're enabling organizations with is hey, we'll help you discover the sensitive information, but we'll also tell you who it belongs to, so that when John shows up or Rebecca, you show up, you just have to punch in their name and we'll tell you all the systems, that it's in. That is something that requires teams of database administrators, lawyers, system administrators that needs to be automated, to truly realize the potential of these privacy regulations, while enabling organizations to continue to innovate and disrupt with data. >> What's your take on whether or not consumers truly understand the scope of these privacy regulations, I mean talking about GDPR and you get the pop-ups that say do you consent and you just say yes, I just need to get to this site and so you blithely, just press yes, yes, yes so you are technically giving your consent, but do you, I mean what's your take, do consumers truly understand what they're doing here? >> You know, I think historically, we've all said yes, yes, yes, over the last, I would say two years with growing regulations and significant breaches, there is a change in customer expectations, you know, there's a stat out there in the event of a data breach, two-thirds of consumers of a particular organization blame the organization for the breach, not the hackers, right, so it's a mindshift in all of us, where you're the custodian of my data I'm counting on you, whatever organization I'm transacting with ,to ensure and preserve my privacy, ensure my data's protected. So, that's a big shift that's happened, so whether you're doing it for regulatory reasons, CCPA North America, there's several other state-wide regulations coming out or GDPR, the consumer expectation, forget regulations, it's brand preservation, it's customer trust, it's customer experience, that organizations are really having to solve for from a privacy standpoint. >> Tell what the news around yesterday around the shift of the trust pieces, because that's a huge deal. Because trust is shifting, expectations are shifting, so when you have shifting expectations, with users and buyers, customers, the experience has to shift. So, take us through what's the new things? >> Well, the new things are, you know, you look at we're enabling organizations to be data-driven, we're enabling organizations to transform, build new products, new services, be more efficient and for that, you need to enable them to get access to data. The counter, the tension on the other end is how do we get them broad-based access while ensuring privacy, right, and that's the balance. How do we enable them to be customer-centric and optimal in engaging with their customers while preserving the privacy of their customers and that really comes down to having a detailed understanding of what your critical data is, where is it in the organization and how an organization is using that data. Enabling an organization to know that they're processing data with the appropriate consent. >> What's interesting to me, when I was with press yesterday, is also the addition of how the cloud players are coming onboard, because you know, one constituent that's not mentioned in that statement is that you guys are kind of keeping an eye on, that are impacted by this, is developers, because you know developers like infrastructures coded with DevOps. Don't want to be provisioning networks and storage, they just write to the API's. Data is kind of going through that similar experience where, if I'm a developer doing an IOT app, I'm just going to use the cloud. I put the data there, I don't need to have a mismatch of mechanisms to deal with some governance compliance rules. >> Correct and that's why it needs to be built-in by design. And you know there's this connotation that- >> Explain that, what does built-in by design mean. >> Well you need to have privacy built-into how you as a business operate, how you as a DevOps team or development team, build products, if that's built-in to how you operate, you enable the innovation without falling into the pitfalls of oh you know what we broke some privacy regulations there we breached our customers trust there, we used data or engaged with them in manner that they weren't comfortable with. >> So, don't retro-fit after the fact? Think holistically on the front-end of the transformation in architecture. >> It's an enabler, in that if you do it right to begin with, you can continue to innovate and engage effectively, versus bolting it on as an afterthought and retro-fitting. >> It really seems like it is this evolution in thinking from this risk and compliance, overdoing this to check all the boxes, versus here are our constraints, but our constraints are actually liberating, is what you're saying. >> Right, but you can't democratize data, without giving the consumers of that data an understanding of the quality of that data, the trustworthiness of that data, the relevance of the data to the business, you give them that and now you're enabling your analytics, your data scientists, your analytics organizations to innovate with that data with confidence and if you do it within a framework of privacy, you're ensuring that you're preserving customer trust while you're automating and building intelligent and engaging customer experiences. >> What I love about the data business right now, is it's exciting because it's real specific examples of impact, security, you know, national security, to hackers, to just general security, privacy of the laws, But, I've seen the development angles interesting too, so when you got these two things moving, customers can ignore this, it's not like back-up and recovery where same kind of ethos is there, you don't want to think about it after the fact, you want to build it in, you know, there's certainly reasons why you do that, in case there's a disaster, but data is highly impactful all the time. This is a challenge, you guys can pull this off. >> Well you know, it's a, with privacy, it's no longer about a few systems, it's all your data and so the scope is the challenge and the scale applies for privacy, the scale applies for making data available enterprise wide and that's where you need and you know we spoke about AI needs data, well data also needs AI. And that's where we're leveraging AI and ML. Building out intelligence, to help organizations solve that problem and not do it manually. >> You know, I've said it on theCUBE, you've probably heard it many times, I say it all the time, scale is the new competitive advantage. Value is the new lock-in. No proprietary software anymore, but technology is needed. I want to ask you, you've been talking about this with some of your customers last year around data is that you need more scale, because AI needs more access to data, because the more visibility into data, the smarter, machine learning and AI applications can become. So Scale is real. What is the, what are you, you guys have some scalosity in your customers, you got the end-to-end, got the catalog and everything is kind of looking good, but you have competition How would you compare to the competition, when people say hey Jitesh, a start-up just popped out or XYZ company's got the solution, why should I go with them or you? What's the difference, what's the competitive angle? >> You know, the way we're thinking the problem is founded on governance is an enabler it's not about locking things down for risk and compliance, because you know, the regulators want to know that this particular warehouse is highly tightly controlled, it's about getting the data out there, it's about enabling end-users to have a contextual understanding when you're doing that for all of your data, within around, that's a thousand X-increase in the data, it's a thousand X-increase in your constituents, you're not supporting, the risk and compliance portions of the organization, you're supporting marketing, you're supporting sales, you're supporting business operations, supply chain, customer-onboarding and so with the problem of scale, practices of the past, which were typically manual laborious, but hey at the risk of non-compliance, we just had to deal with them, don't practically in any way scale, to the requirements of the future which is a thousand X-increase in consumers and that's where intelligence and AI and ML come in. >> The question I have for you is, where should customers store their data? Is there an answer to that on premises or in the cloud? What are they doing? >> The answer is yes, (Knight laughs) the customer should store their data, what we see, the world is going to be hybrid, mainframes are still here, on-premise will still be here many years from now. >> So you're taking the middle of the road here, so >> There's Switzerland. >> You're saying whatever they want on-premise or cloud, is there a preference you see with customers? >> Well, you know it depends on the applications , depends on regulations, historically regulations especially in financial services, have mandated a more on-premise stance, but those regulations, are also evolving and so we see, the global investment banks all of a sudden, we're having all sorts of conversations about enabling them to move select portions of their data estate to the cloud, enabling them to be more agile, so the answer is yes and it will be for a very long time to come. >> Final question, one of the most pressing problems in the technology industry is the skills gap. I want to hear your thoughts on it, how as a Senior Executive at Informatica, how worried are you about finding qualified candidates for your open-roles? >> You know, it is a challenge, good news is, we're a global organization, my teams are globally-distributed. I have teams in Europe, North America and Asia and the good part about that is if you can't find it in the valley, you can certainly find the talent elsewhere, and so while, it is a challenge, we're able to find talented engineers, software developers, data scientists, to help us innovate and build the intelligence capabilities to solve the productivity challenges, the scale challenges of data consumption. >> Jitesh, talk about the skills required for people coming out of school, take your Informatica hat off, put your expertise hat on, data guru hat, knowing that data is going to continue to grow, continue to have more impact across the board, from coding to society affix, whatever, what are some of the key skills in training, classes or courses or areas of expertise that people an dial-up or dig into that might be beneficial to them that may or may not be on the radar curriculum or, say is, part of school curriculum, >> you know we engage with universities in North America, in Europe, in Asia, we have a large development center in India and we're constantly, engaging with them. We're on various boards at various universities, advisory standpoint, big data standpoint and what we're seeing is as we engage with these organizations, we're able to feed back on where the market is going, what the requirements are, the nature of data science, the enabling technologies such as platforms like Spark, languages like Python and so we're working with these schools to share our perspectives, they in turn, are incorporating this into their curriculums and how they train future data scientists. >> When you see a young gun out there that's kicking butt and taking names and data, what are some of the backgrounds? Is it math, is it philosophy, is there a certain kind of pattern that you've seen as the makeup of just the killer data person? >> You know, it's interesting, you mention philosophy, I'm a big, I've hired many philosophy majors that have been some of the best architects, having said that, from a data science perspective, it's all about stats, it's all about math and while that's an important skillset to have, we're also focused on making their lives easier, they're spending 70% of their time, doing data engineering versus data science and so while they are being educated from a stats, from a data science foundation, when they come into the industry, they end up spend 70% of their time doing data engineering, that's where we're helping them as well. >> So study your Socrates and study your stats. >> I like that. (Knight and Furrier laugh) >> Jitesh, thank you so much for coming on theCUBE. >> My pleasure, happy to be here, thank you. >> I'm Rebecca Knight for John Furrier, you are watching theCUBE.
SUMMARY :
brought to you by Informatica. are joined by Jitesh Ghai, he is the the lay of the land from your perspective. so that the right people can consume the data but the markets changed, you mentioned governance one of the themes this year is it's all the data in your enterprise, but the question is that when you move the impact The answer really comes down to you need in customer expectations, you know, there's customers, the experience has to shift. Well, the new things are, you know, is also the addition of how the cloud players And you know into the pitfalls of oh you know what of the transformation in architecture. right to begin with, you can continue to innovate this to check all the boxes, versus here the relevance of the data to the business, about it after the fact, you want to and you know we spoke about AI needs data, is that you need more scale, because AI needs and compliance, because you know, the the customer should store their data, so the answer is yes and it will the most pressing problems in the and the good part about that is if you can't data science, the enabling technologies such as some of the best architects, having said that, (Knight and Furrier laugh) John Furrier, you are watching theCUBE.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Rebecca Knight | PERSON | 0.99+ |
Europe | LOCATION | 0.99+ |
Jitesh | PERSON | 0.99+ |
John Furrier | PERSON | 0.99+ |
70% | QUANTITY | 0.99+ |
five | QUANTITY | 0.99+ |
India | LOCATION | 0.99+ |
last year | DATE | 0.99+ |
Jitesh Ghai | PERSON | 0.99+ |
Informatica | ORGANIZATION | 0.99+ |
John | PERSON | 0.99+ |
North America | LOCATION | 0.99+ |
Rebecca | PERSON | 0.99+ |
25 years | QUANTITY | 0.99+ |
seven systems | QUANTITY | 0.99+ |
Asia | LOCATION | 0.99+ |
Python | TITLE | 0.99+ |
two years | QUANTITY | 0.99+ |
yesterday | DATE | 0.99+ |
GDPR | TITLE | 0.99+ |
Word | TITLE | 0.99+ |
California Privacy Act | TITLE | 0.99+ |
Las Vegas | LOCATION | 0.99+ |
two-thirds | QUANTITY | 0.99+ |
XYZ | ORGANIZATION | 0.99+ |
Switzerland | LOCATION | 0.99+ |
Spark | TITLE | 0.99+ |
this year | DATE | 0.98+ |
today | DATE | 0.97+ |
2019 | DATE | 0.97+ |
One | QUANTITY | 0.97+ |
CCPA North America | ORGANIZATION | 0.97+ |
one | QUANTITY | 0.96+ |
DevOps | TITLE | 0.96+ |
two things | QUANTITY | 0.96+ |
Powerpoints | TITLE | 0.95+ |
Furrier | PERSON | 0.95+ |
Knight | PERSON | 0.94+ |
couple customers | QUANTITY | 0.87+ |
one year-anniversary | QUANTITY | 0.86+ |
Informatica World 2019 | EVENT | 0.84+ |
Informatica World | ORGANIZATION | 0.82+ |
theCUBE | ORGANIZATION | 0.79+ |
a thousand X | QUANTITY | 0.78+ |
one constituent | QUANTITY | 0.77+ |
terabytes | QUANTITY | 0.74+ |
a thousand X | QUANTITY | 0.74+ |
Socrates | PERSON | 0.73+ |
petabytes | QUANTITY | 0.73+ |
Informatica World | EVENT | 0.7+ |
thousand | QUANTITY | 0.67+ |
BCBS 239 | ORGANIZATION | 0.65+ |
SAS | ORGANIZATION | 0.62+ |
PDFs | TITLE | 0.58+ |
SAS | TITLE | 0.53+ |
CCAR | ORGANIZATION | 0.5+ |
Dabos | LOCATION | 0.48+ |
Jitesh Ghai | Informatica World 2017
>> Announcer: Live from San Francisco, it's The Cube covering Informatica World 2017. Brought to you by Informatica. >> Okay, welcome back everyone. We are here live in San Francisco for The Cube's exclusive coverage of Informatica World 2017. I'm John Furrier, this is Siliconangle's flagship program, we go out to the events and (he mumbles). My next guest is Jitesh Ghai who's the Vice President General Manager of data quality and governance for Informatica. Welcome to The Cube, thanks for joining us today. >> Happy to be here, John. Pleasure. >> So, two things right out of the gate. One, data quality and governance, two of the hottest topics in the industry, never mind within Informatica. You guys are announcing a lot of stuff, customers are pretty happy, you got a solid customer base. >> That's right. >> Product's been blooming, you got a big brand behind you now. This is important. There's laws now in place coming online in 2018, I think it's the GDPR. >> That's right. >> And there's a variety of other things, but more importantly customers got to get hold of their data. >> That's right. >> What's your take and what are you announcing here at the show? >> Well, you know, from a data governance and compliance and overall quality standpoint, data governance started off as a stick, a threat of regulatory pressure, but really the heart of what it is is effective access to and consumption of data, trusted data. And through that exercise of the threat of a stick, healthy practices have been implemented and that's resulted in an appreciation for data governance as a carrot, as an opportunity to innovate, innovate with your data to develop new business models. The challenge is as this maturation in the practice of data governance has happened there's been a realization that there's a lot of manual work, there's a lot of collaboration that's required across a cross-functional matrixed organization of stakeholders. And there's the concept of ... >> There's some dogma too, let's just face it, within organizations. I got all this data, I did it this way before. >> Right. >> And now, whoa, the pressure's on to make data work, right, I mean that's the big thing. >> That's exactly right. So, you collaborate, you align, and you agree on what data matters and how you govern it. But then you ultimately have to stop documenting your policies but actually make it real, implement it, and that's where the underlying data management stack comes into place. That could be making it real for regulatory, financial regulations, like BCBS 239 and CCAR, where data quality is essential. It could be making it real for security related regulations where protection is essential, like GDPR, the data protection regulation in the EU. And that's where, Informatica is launching a holistic enterprise data governance offering that enables you to not just document it, or as one CDO said to me, "You know, at some point you've got to stop talking about it, "you actually have to do it." To connecting the conceptual, the policies, with the underlying physical systems, which is where intelligent automation with the underlying data management portfolio, the industry-leading data management portfolio that we have, really delivers significant productivity benefits, it's really redefining the practice of data governance. >> Yeah, most people think of data as being one of those things, it's been kind of like, whether it's healthcare, HIPAA old models, it's always been an excuse to say no. "Whoa, we don't do it that way." Or, "Hey." It's kind of become a no-op kind of thing where, "No, we don't want to do any more than data." But you guys introduced CLAIR which is the acronym for the clairvoyant or AI, it's kind of a clever way to brand. >> That's right. >> That's going to bring in machine learning augmented intelligence and cool things. That only, to me, feels like you're speeding things up. >> That's exactly right. >> When in reality governance is more of a slowdown, so how do you blend the innovation strategy of making data freely available ... >> Right. >> ..and yet managing the control layer of governance, because governance wants to go slow, CLAIR wants to go fast, you know. Help me explain that. >> Well, in short, sometimes you have to go slow to go fast. And that's the heart of what our automated intelligence that CLAIR provides in the practice of data governance, is to ensure that people are getting access to, efficient access to trusted data and consuming it in the right context. And that's where you can set, you can define a set of policies, but ultimately you need those policies to connect to the right data assets within the enterprise. And to do that you need to be able to scan an entire enterprise's data sets to understand where all the data is and understand what that data is. >> Talk about the silver bullet that everyone just wants to buy, the answer to the test, which is ungettable, by the way, I believe, we just had Allegis on, one of your customers, and their differentiation to their competition is that they're using data as an asset but they're not going all algorithmic. There's the human data relationship. >> Absolutely. >> So there's really no silver bullet in data. You could use algorithms like machine learning to speed things up and work on things that are repeatal tasks. >> Right. >> Talk about that dynamic because governance can be accelerated with machine learning, I would imagine, right? >> Absolutely, absolutely. Governance is a practice of ensuring an understanding across people, processes and systems. And to do that you need to collaborate and define who are the people, what are your processes, and what are the systems that are most critical to you. Once you've defined that it's, well, how do we connect that to the underlying data assets that matter, and that's where machine learning really helps. Machine learning tells you that if you define customer id as a critical data element, through machine learning, through CLAIR, we are able to surface up everywhere in your organization where customer id resides. It could be cmd id, it could be customer_id, could be customer space id, cust id. Those are all the inferences we can make, the relationships we can make, and surface all of that up so that people have a clear understanding of where all these data assets reside. >> Jitesh, let's take a step back. I want to get your thoughts on this, I really want you to take a minute to explain something for the folks watching. So, there's a couple of different use cases, at least I've observed in a row and the wikibon team has certainly observed. Some people have an older definition of governance. >> Right. >> What's the current definition from your standpoint? What should people know about governance today that's different than just last year or even a few years ago, what's the new picture, what's the new narrative for governance and the impact to business? >> You know, it's a great question. I held a CDO summit in February, we had about 20 Chief Data Officers in New York and I just held an informal survey. "Who implements data governance programs "for regulatory reasons?" Everybody put their hand up. >> Yeah. >> And then I followed that up with, "Who implements data governance programs "to positively affect the top line?" and everybody put their hand up. That's the big transition that's happened in the industry is a realization that data governance is not just about compliance, it's also about effective policies to better understand your data, work with your data, and innovate with your data. Develop new business models, support your business in developing those new business models so that you can positively affect the top line. >> Another question we get up on The Cube all the time, and we also observe, and we've heard this here from other folks at Informatica and your customers have said, getting to know what you actually have is the first step. >> Right. >> Which sounds counter-intuitive but the reality is that a lot of folks realize there's an asset opportunity, they raise their, hey, top line revenue. I mean, who's not going to raise their hand on that one, right, you get fired. I mean, the reality is this train's coming down the tracks pretty fast, data as an input into value creation. >> That's exactly right. >> So now the first step is oh boy, just signed up for that, raise my hand, now what the hell do I have? >> Right. >> How do you react to that? What's your perspective on that? >> That's where you need to be able to, google indexed the internet to make it more consumable. Actually, a few search engines indexed the internet. Google came up with sophistication through its page-ranking algorithm. Similarly, we are cataloging the enterprise and through CLAIR we're making it so that the right relevant information is surfaced to the right practitioner. >> And that's the key. >> That is the key. >> Accelerating the access method, so increase the surface area of data, have the control catalog for the enterprise. >> That's right. >> Which is like your google search analogy. A little harder than searching the internet, but even google's not doing a great job these days, in my opinion, I should say that. But there's so many new data points coming in. >> That's right. >> So now the followup question is, okay, it's really hard when you start having IOT come in. >> That's right. >> Or gesture data or any kind of data coming in. How do you guys deal with that? How does that rock your world, as they say? >> And that's where effective consumption of data permeates across big data, cloud, as well as streaming data. We have implemented, in service to governance, we've implemented in-stream data quality rules to filter out the noise from the signal in sensor data coming in from aircraft subsystems, as an example. That's a means of, well, first you need to understand what are the events that matter, and that's a policy definition exercise which is a governance exercise. And then there's the implementation of filtering events in realtime so that you're only getting the signal and avoiding the noise, that's another IOT example. >> What's your big, take your Informatica hat off, put your kind of industry citizen hat on. >> Mm-hm. >> What's your view of the marketplace right now? What's the big wave that people are riding? Obviously, data, you could say data, don't say data 'cause we know that already. >> Sure. >> What should people, what do you observe out there in the marketplace that's different, that's changing very rapidly? Obviously we see Amazon stock going up like a hockey stick, obviously cloud is there. What are you getting excited about these days? >> You know, what I'm excited about is bringing broad-based access of data to the right users in the right context, and why that's exciting is because there's an appreciation that it's not the analytics that are important, it's the data that fuels those analytics that's important. 'Cause if you're not delivering trusted, accurate data it's effectively a garbage in, garbage out analytics problem. >> Hence the argument, data or algorithms, which one's more important? >> Right. >> I mean data is more important than algorithms 'cause algorithms need data. >> That's exactly right and that's even more true when you get into non-deterministic algorithms and when you get into machine learning. Your machine learning algorithm is only as good as the data you train it with. >> I mean look, machine learning is not a new thing. Unsupervised machine learning's getting better. >> Right. >> But that's really where the compute comes in, and the more data you have the more modeling you can do. These are new areas that are kind of coming online, so the question is, to you, what new exciting areas are energizing some of these old paradigms? We hear neural nets, I mean, google's just announced neural nets that teach neural nets to make machine learning easier for humans. >> Right. >> Okay. I mean, it has a little bit of computer science baseball but you're seeing machine learning now hitting mainstream. >> Right. >> What's the driver for all this? >> The driver for all this comes down to productivity and automation. It's productivity and automation in autonomous vehicles, it's productivity and automation that's now coming into smart homes, it's productivity and automation that is being introduced through data-driven transformation in the enterprise as well, right, that's the driver. >> It's so funny, one of my undergraduate computer science degrees was databases. And in the '80s it wasn't like you went out to the tub, "Hey, I'm a databaser." (He mimics uncertain mumbling) And now it's like the hottest thing, being a data guy. >> Right. >> And what's also interesting is a lot of the computer science programs have been energized by this whole software defined with cloud data because now they have unlimited, potentially, compute power. >> Right. >> What's your view on the young generation coming in as you look to hire and you look to interview people? What are some of the disciplines that are coming out of the universities and the masters programs that are different than it was even five years ago? What are some trends you're seeing in the young kids coming in, what are they gravitating towards? >> Well, you know, there's always an appreciation of, a greater appreciation for, you know, the phrase I love is, "In god we trust, all others must have data." There's an increasing growing culture around being data-driven. But from a background of young people, it's from a variety of backgrounds, of course computer science but philosophy majors, arts majors in general, all in service to the larger cause of making information more accessible, democratizing data, making it more consumable. >> I think AI, I agree, by the way, I would just add, I think AI, although it is hyped and I don't really want to burst that bubble because it's really promoting software. >> Right. >> I mean, AI's giving people a mental model of, "Oh my god, some pretty amazing things are happening." >> Sure. >> I mean, autonomous vehicles is what most people point to and say, "Hey, wow, that's pretty cool." A Tesla's much different than a classic car. I mean, you test-drive a TESLA you go, "Why am I buying BMW, Audi, Mercedes?" >> Right, exactly. >> It's a no brainer. >> Right. >> Except it's like (he mumbles), you got to get it installed. But, again, that's going to change pretty quickly. >> At this point it's becoming a table sticks exercise. If you're not innovating, if you're not applying intelligence and AI, you're not doing it right. >> Right, final question. What's your advice to your customers who are in the trenches, they raise their hand, they're committed to the mandate, they're going down the digital business transformation route, they recognize that data's the center of the value proposition, and they have to rethink and reimagine their businesses. >> Right. >> What advice do you give them in respect to how to think architecturally about data? >> Well, you know, it all starts with your data-driven transformations are only as good as the data that you're driving your transformations with. So, ensure that that's trusted data. Ensure that that's data you agree as an organization upon, not as a functional group, right. The definition of a customer in support is different from the definition of a customer in sales versus marketing. It's incredibly important to have a shared understanding, an alignment on what you are defining and what you're reporting against, because that's how you're running your business. >> So, the old schema concept, the old database world, know your types. >> Right. >> But then you got the unstructured data coming in as well, that's a tsunami IOT coming in. >> Sure, sure. >> That's going to be undefined, right? >> And the goal and the power of AI is to infer and extract metadata and meaning from this whole landscape of semi-structured and unstructured data. >> So you're of the opinion, I'm sure you're biased with being Informatica, but I'm just saying, I'm sure you're in favor of collect everything and connect the dots as you see fit. >> Well ... >> Or is that ...? >> It's a nuance, you can't collect everything but you can collect the metadata of everything. >> Metadata's important. >> Data that describes the data is what makes this achievable and doable, practically implementable. >> Jitesh Ghai here sharing the metadata, we're getting all the metadata from the industry, sharing it with you here on The Cube. I'm John Furrier here live at Informatica World 2017, exclusive Cube coverage, this is our third year. Go to siliconangle.com, check us out there, and also wikibon.com for our great research. Youtube.com/siliconangle for all the videos. More live coverage here at Informatica World in San Francisco after this short break, stay with us.
SUMMARY :
Brought to you by Informatica. Welcome to The Cube, thanks for joining us today. customers are pretty happy, you got a solid customer base. you got a big brand behind you now. but more importantly customers got to get hold of their data. but really the heart of what it is I did it this way before. right, I mean that's the big thing. and you agree on what data matters and how you govern it. But you guys introduced CLAIR That's going to bring in machine learning so how do you blend the innovation strategy CLAIR wants to go fast, you know. And to do that you need to be able to and their differentiation to their competition to speed things up and work on things And to do that you need to collaborate and the wikibon team has certainly observed. and I just held an informal survey. so that you can positively affect the top line. getting to know what you actually have is the first step. I mean, the reality is this train's coming down the tracks google indexed the internet to make it more consumable. have the control catalog for the enterprise. A little harder than searching the internet, So now the followup question is, okay, How do you guys deal with that? and avoiding the noise, that's another IOT example. What's your big, take your Informatica hat off, What's the big wave that people are riding? in the marketplace that's different, that it's not the analytics that are important, I mean data is more important than algorithms as the data you train it with. I mean look, machine learning is not a new thing. and the more data you have the more modeling you can do. I mean, it has a little bit of computer science baseball in the enterprise as well, right, that's the driver. And in the '80s it wasn't like you went out to the tub, is a lot of the computer science programs a greater appreciation for, you know, the phrase I love is, and I don't really want to burst that bubble I mean, AI's giving people a mental model of, I mean, you test-drive a TESLA you go, you got to get it installed. if you're not applying intelligence and AI, of the value proposition, and they have to rethink are only as good as the data that you're the old database world, know your types. But then you got the unstructured data coming in And the goal and the power of AI collect everything and connect the dots as you see fit. but you can collect the metadata of everything. Data that describes the data Youtube.com/siliconangle for all the videos.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Jitesh Ghai | PERSON | 0.99+ |
BMW | ORGANIZATION | 0.99+ |
Mercedes | ORGANIZATION | 0.99+ |
Audi | ORGANIZATION | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
New York | LOCATION | 0.99+ |
John | PERSON | 0.99+ |
February | DATE | 0.99+ |
Informatica | ORGANIZATION | 0.99+ |
John Furrier | PERSON | 0.99+ |
2018 | DATE | 0.99+ |
first step | QUANTITY | 0.99+ |
last year | DATE | 0.99+ |
San Francisco | LOCATION | 0.99+ |
ORGANIZATION | 0.99+ | |
third year | QUANTITY | 0.99+ |
Tesla | ORGANIZATION | 0.99+ |
today | DATE | 0.99+ |
two | QUANTITY | 0.99+ |
ORGANIZATION | 0.99+ | |
siliconangle.com | OTHER | 0.98+ |
GDPR | TITLE | 0.98+ |
The Cube | ORGANIZATION | 0.98+ |
One | QUANTITY | 0.98+ |
Siliconangle | ORGANIZATION | 0.97+ |
wikibon | ORGANIZATION | 0.97+ |
five years ago | DATE | 0.97+ |
one | QUANTITY | 0.96+ |
Informatica World 2017 | EVENT | 0.96+ |
Jitesh | PERSON | 0.96+ |
first | QUANTITY | 0.95+ |
few years ago | DATE | 0.94+ |
two things | QUANTITY | 0.93+ |
Allegis | ORGANIZATION | 0.93+ |
Youtube.com/siliconangle | OTHER | 0.89+ |
BCBS 239 | TITLE | 0.87+ |
CLAIR | ORGANIZATION | 0.84+ |
CLAIR | PERSON | 0.82+ |
about 20 Chief Data Officers | QUANTITY | 0.82+ |
EU | ORGANIZATION | 0.79+ |
big wave | EVENT | 0.79+ |
HIPAA | TITLE | 0.77+ |
wikibon.com | OTHER | 0.76+ |
CCAR | TITLE | 0.71+ |
TESLA | ORGANIZATION | 0.71+ |
Vice President General Manager | PERSON | 0.7+ |
Informatica World | ORGANIZATION | 0.7+ |
'80s | DATE | 0.68+ |
Cube | COMMERCIAL_ITEM | 0.65+ |
minute | QUANTITY | 0.54+ |
couple | QUANTITY | 0.51+ |
CDO | EVENT | 0.41+ |
CDO | TITLE | 0.41+ |