Jitesh Ghai, Informatica | CUBE Conversation, July 2020

(ambient music) >> Narrator: From the cube studios in Palo Alto in Boston, connecting with thought leaders all around the world, this is a CUBE conversation. >> Hello welcome to this cube conversation. I'm John Furrier, host of theCUBE here in our Palo Alto studios. During this quarantine, crew doing all the interviews, getting all the top story especially during this COVID pandemic. Great conversation here Jitesh Ghai, Senior Vice President and General Manager of Data Management with Informatica, CUBE alumni multi time. We can't be in person this year, because of the pandemic but a lot of great content. We've been doing a lot of interviews with you guys. Jitesh great to see you. Thanks for coming on. >> Hey, great to see you again. We weren't able to make it happen in person this year, >> but if not in person, >> virtually will have to work. >>In our past conversations on theCUBE and through all the Informatica employees it's always been kind of an inside baseball, kind of inside the ropes conversation in the industry >> about data. >> Now more than ever, with the pandemic, you starting to see people seeing it. Oh, I get it now. I get why data is important. I can see why Cloud First, Mobile First, Data First strategies and now Virtual First, is now this transformational scene. Everyone's feeling it, you can't help not ignore it. It's happening. It's also highlighting what's working, what's not. I have to ask you in the current environment Jitesh what are you seeing as some of those opportunities that your customers are dealing with approach to data? 'Cause clearly, you're working with that data layer, there's a lot of innovation opportunities, you've got CLAIRE on the AI side, all great. But now with the pandemic, it's really forcing that conversation. I got to rethink about what's going to happen after and have a really good strategy. >> Yeah, you're exactly right. There's a broad based realization that, I'll take a step back. First, we all know that as global 2000 organizations or in general, we all need to be data driven, we need to make fact based decisions. And there is a lot of that good work that's happened over the last few years as organizations have realized just how important data is to innovate and to deliver new products and services, new business models. What's really happened is that, during this COVID pandemic, there is a greater appreciation for trust in data. Historically, organizations became data driven, we're on the journey of being increasingly data driven. However, there was some element of Oh, gut or experience and that combined with data will get us to the outcomes we're looking for, will enable us to make the decisions. In this pandemic world of great uncertainty, supply chains falling apart on occasion, groceries not getting delivered on time et cetra, et cetra. The appreciation and critical importance on the quality on the trust of data is greater than ever to drive the insights for organizations. Leaders are less hesitant or sorry, leaders are more hesitant to just go with your gut type of approaches. There is a tremendous reliance on data. And we're seeing it in particular, more than ever, as you can imagine in the healthcare provider sector, in the public sector with federal state and local, as all of these organizations are having to make very difficult decisions, and are increasingly relying on high quality, trustworthy governed data to help them make what can be life or death decision. So a big shift and appreciation for the importance and trustworthiness in their data, their data state and their insights. >> So as the GM of data management and Senior Vice President at Informatica, you get a good view of things. I got to ask you love this data 4.0 concept. Talk about what that means to you because you got customers have been doing data management with you guys for a while, but now it's data 4.0 that has a feeling of agility to it. It's got kind of a DevOps vibe. It feels like a lot of automation being discussed and you mentioned trust. What is data 4.0 mean? >> So data 4.0 for us is where AI and ML is powering data management. And so what do I mean by that? There is a greater insight and appreciation for high quality trustworthy data to enable organizations to make fact based decisions to be more data driven. But how do you do that when data is exponentially growing in volume, where data types are increasing, where data is moving increasingly between Clouds, between On-premises and Clouds between various ecosystems, new data sources are emerging, the internet of things is yet another exploding source of data. This is a lot of different types of data, a lot of volume of data, a lot of different locations, and gravity of data where data resides. So the question becomes how do you practically manage this data without intelligence and automation. And that's what the era of data 4.0 is. Where AI and ML is powering data management, making it more intelligent, automating more and more of what was historically manual to enable organizations to scale, to enable them to scale to the breadth of data that they need to get a greater understanding of their data landscape within the enterprise, to get a greater understanding of the quality of the data within their landscape, how it's moving, and the associated privacy implications of how that data is being used, how effectively it's protected, so on and so forth. All underpinned by our CLAIRE engine, which is AI and ML applied to metadata, to deliver the intelligence and enable the automation of the data management operations. >> Awesome. Thanks for taking the time to define that, love that. The question I want to ask you, I'll put you on the spot here because I think this is an important conversation we've been having and also writing a lot about it on siliconangle.com and that is customers say to us, "Hey, John, I'm investing in Cloud Native technologies, using Cloud data warehouse as a data lakes. I need to make this work because this is a scale opportunity. I need to come out of this pandemic with really agile, scalable solutions that I can move fast on my applications." How do you comment on that? What's your thoughts on this because, you guys are in the middle of all this with the data management. >> I couldn't agree more. Increasingly, data workloads are moving to the Cloud. It's projected that by 2022, 75% of all databases will be in the Cloud, and COVID-19 is really accelerating it. It's opening the eyes of leadership of decision makers to be truly Cloud First and Cloud Native, now more than ever. And so organizations, traditional banking organizations, highly regulated industries that have been hesitant to move to the cloud, are now aggressively embarking on that journey. And industries that were early adopters of the Cloud are now accelerating that journey. I mentioned earlier that, we had a very seamless transition as we moved to a work from home environment, and that's because our IT is Cloud First Cloud Native. And why is that? It's because it's through being Cloud First and Cloud Native that you get the resiliency, the agility, the flexibility benefits in these uncertain times. And we're seeing that with the data and analytics stack as well. Customers are accelerating the move to Cloud data warehouses to Cloud data lakes, and become Cloud Native for their data management stack in addition to the data analytics platforms. >> Great stuff which I agree with hundred percent. Cloud Native is where it goes but you aren't they're (laughs) yet. Still on Hybrid and Multi-cloud is a big discussion. I want to get your thoughts >> Completely. >> On how that's going to play up because if you put Hybrid cloud and Multi-cloud I see Public cloud it's amazing, we know that. But Hybrid and Multi-cloud as the next generation of kind of interoperability framework of Cloud services, you're going to have to overlay and manage data governance and privacy. It's going to get more complicated, right? So how are you seeing your customers approach that piece, on the Public side, and then with Hybrid, because that's become a big discussion point. >> So Hybrid is an absolutely critical enabling capability as organizations modernize their on premise estate into the Cloud. You need to be able to move and connect to your On-premise applications, databases, and migrate the data that's important into the Cloud. So Hybrid is an essential capability. When I say Informatica is Cloud First Cloud Native, being Cloud First Cloud Native as a data management as a service provider if you will, requires essentially capabilities of being able to connect to On-premise data sources and therefore, be Hybrid. So Hybrid architecture is an essential part of that. Equally, it's important to enable organizations to understand what needs to go to the Cloud. As you're modernizing your infrastructure, your applications, your data and analytics stack. You don't need to bring everything to the Cloud with you. So there's an opportunity for organizations to introduce efficiencies. And that's done by enabling organizations to really scan the data landscape On-premise, scan the data that already exists in the various Public clouds that they partner with, and understand what's important, what's not, what can be decommissioned and left behind to realize savings and what is important for the business and needs to be moved into a Cloud Native analytic stack. And that's really where our CLAIRE metadata intelligence capabilities come to bear. And that's really what serves as the foundation of data governance, data cataloging and data privacy, to enable organizations to get the right data into the Cloud. To do so, while ensuring privacy. And to ensure that they govern that data in their new now Cloud Native analytics stack, whether it's AWS, Azure, GCP, snowflake data, bricks, all partners, all deep partnerships that we have. >> Jitesh, I want to get your thoughts on something. I was having a Zoom call a couple weeks ago, with a bunch of CXO friends, people, practitioners, probably some of them are probably your customers. It was kind of a social get together. But we were talking about, how the world we're living in pandemic, from COVID data, fake news, and one of the comments was, finally the whole world now realized what my life like. And in referring to how we're seeing fake news and misinformation kind of screw up an election and you got COVID's got 10 zillion different data points and people are making it to tell stories. And what does it really mean? There's a lot of trust involved. People are confused, and all that's going on. Again, in that backdrop, he said that that's my world. >> Right. This is back down to some of the things you're talking about, trust. We've talked about metadata services in the past. This authenticity, the duck democratization has been around for a while in the enterprise, so that dealing with bad data or fake data or too much data, you can make data (laughs) into whatever you want. You got to make sense of it. What's your thoughts on the reaction to his comment? I mean, what does it make you feel? >> Completely agree, completely agree. And that goes back to the earlier comment I made about making fact based decisions that you can have confidence in because the insight is based on trusted data. And so you mentioned data democratization. Our point of view is to democratize data, you have to do it on a foundational governance, right? There's a reason why traffic lights exist, it's to facilitate or at least attempt to facilitate the optimal free flow of traffic without getting into accidents, without causing congestion, so on and so forth. Equally, you need to have a foundation of governance. And I realized that there's an optical tension of democratized data, which is, free data for everybody consume it whenever and however you want, and then governance, which seems to imply, locking things down controlling them. And really, when I say you need a foundation of data governance, you need to enable for organizations to implement guardrails so that data can be effectively democratized. So that data consumers can easily find data. They can understand how trustworthy it is, what the quality of it is, and they can access it in easy way and consume it, while adhering to the appropriate privacy policies that are fit for the use of that particular set of data that a data and data consumer wants to access. And so, how do you practically do that? That's where data 4.0 AI power data management comes into play. In that, you need to build a foundation of what we call intelligent data governance. A foundation of scanning metadata, combining it with business metadata, linking it into an enterprise knowledge graph that gives you an understanding of an organization and enterprises data language. It auto tags auto curates, it gives you insight into the quality of the data, and now enables organizations to publish these curated data sets into a capability, what we call a data marketplace, so that much like Amazon.com, you can shop for the data, you can browse home and garden, electronics various categories. You can identify the data sets that are interesting to you, when you select them, you can look at the quality dimensions that have already been analyzed and associated with the data set. And you can also review the privacy policies that govern the use of that data set. And if you're interested in it, find the data sets, add them to your shopping cart, like you would do with Amazon.com, and check out. And when you do that triggers off an approval workflow to enable organizations to that last mile of governing access. And once approved, we can automatically provision the datasets to wherever you want to analyze them, whether it's in Tableau Power BI, an S3 market, what have you. And that is what I mean by a foundation of intelligent data governance. That is enabling data democratization. >> A common metadata layer gives you capabilities to use AI, I get that, There's a concept that you guys are talking a lot about, this augmentation to the data. This augmented data management activities that go on. What does that mean? Can you describe and explain that further and unpack that? This augmented data management activity? >> Yeah, and what do we mean by augmented data management, it's a really a first step into full blown automation of data management. In the old world, a developer would connect to a source, parse the source schema, connect to another source, parse its source schema, connect to the target, understand the target schema, and then pick the appropriate fields from the various sources, structure it through a mapping and then run a job that transforms the data and delivers it to a target database, in its structure, in its schema, in its format. Now that we have enterprise scale metadata intelligence, we know what source of data looks like, we know what targets exist as you simply pick sources and targets, we're able to automatically generate the mappings and automate this development part of the process so that organizations can more rapidly build out data pipelines to support their AI to operationalize AIML, to enable data science, and to enable analytics. >> Jitesh great insight. I really appreciate you explaining all this concept and unpacking that with me. Final point, I'd love you to have you just take a minute to put the plug in there for Informatica, what you're working on? What are your customers doing? What are some of the best practices coming out of the current situation? Take a minute to talk about that. >> Yeah, thank you, I'm happy to. It really comes down to focusing on enabling organizations to have a complete understanding of their data landscape. And that is, where we're enabling organizations to build an enterprise knowledge graph of technical metadata, business metadata, operational usage metadata, social metadata to understand and link and develop the necessary context to understand what data exists, where how it's used, what its purpose is and whether or not you should be using. And that's where we're building the Google for the enterprise to help organizations develop that. Equally, leveraging that insight, we're building out the necessary that insight and intelligence through CLAIRE, we're building out the automation in the data quality capabilities, in the data integration capabilities, in the metadata management capabilities, in the master data management capabilities, as well as the data privacy capability. So things that our tooling historically used to do manually, we're just automating it so that organizations can more productively access data, understand it and scale their understanding and insight and analytics initiatives with greater trust greater insight. It's all built on a foundation of our intelligent data platform. >> Love it, scaling data. It's that's really the future fast, available, highly available, integrated to the applications for AI. That's the future. >> Exactly right. Data 4.0, (laughs) AI power data management. >> I love talking about data in the future, because I think that's really valuable. And I think developers, and I've always been saying for over a decade now data is a critical piece for the applications, and AI really unlocks that of having it available, and surface is critical. You guys doing a great job. Thanks for the insight, appreciate you Jitesh. Thank you for coming on. >> Thanks for having me. Pleasure to be here. >> You couldn't do it in person with Informatica world but we're getting the conversations here on the remote CUBE, CUBE virtual. I'm John Furrier, you're watching CUBE conversation with Jitesh Ghai Senior Vice President General Manager, Data Manager at Informatica. Thanks for watching. (upbeat music)

Published Date : Jul 13 2020

SUMMARY :

leaders all around the world, because of the pandemic Hey, great to see you again. I have to ask you in the and that combined with data I got to ask you love that they need to get and that is customers say to us, in addition to the data but you aren't they're (laughs) yet. On how that's going to play up and connect to your On-premise and people are making it to tell stories. This is back down to some of the things And that goes back to the There's a concept that you and to enable analytics. of the current situation? and whether or not you should be using. integrated to the applications for AI. AI power data management. data in the future, Pleasure to be here. on the remote CUBE, CUBE virtual.

ENTITIES

Entity	Category	Confidence
Jitesh Ghai	PERSON	0.99+
John Furrier	PERSON	0.99+
Jitesh	PERSON	0.99+
Palo Alto	LOCATION	0.99+
John	PERSON	0.99+
July 2020	DATE	0.99+
Informatica	ORGANIZATION	0.99+
Amazon.com	ORGANIZATION	0.99+
First	QUANTITY	0.99+
AWS	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
2022	DATE	0.99+
Boston	LOCATION	0.99+
CUBE	ORGANIZATION	0.98+
this year	DATE	0.98+
siliconangle.com	OTHER	0.98+
pandemic	EVENT	0.97+
COVID pandemic	EVENT	0.96+
Native	TITLE	0.96+
hundred percent	QUANTITY	0.96+
Cloud Native	TITLE	0.96+
Cloud First Cloud Native	TITLE	0.95+
first step	QUANTITY	0.94+
Cloud Native	TITLE	0.93+
Data First	ORGANIZATION	0.93+
Mobile First	ORGANIZATION	0.93+
Cloud	TITLE	0.92+
Hybrid	TITLE	0.92+
CXO	ORGANIZATION	0.91+
couple weeks ago	DATE	0.9+
10 zillion different data points	QUANTITY	0.9+
over a decade	QUANTITY	0.89+
Cloud First Cloud Native	TITLE	0.89+
Virtual First	ORGANIZATION	0.88+
Cloud First	COMMERCIAL_ITEM	0.86+
Azure	ORGANIZATION	0.85+
COVID-19	OTHER	0.84+
Cloud	COMMERCIAL_ITEM	0.83+
GCP	ORGANIZATION	0.81+
2000 organizations	QUANTITY	0.81+
75%	QUANTITY	0.75+
Tableau Power BI	TITLE	0.75+
one of the comments	QUANTITY	0.75+
Cloud Native	COMMERCIAL_ITEM	0.73+
Cloud First	ORGANIZATION	0.73+
last	DATE	0.72+
Senior	PERSON	0.71+
CLAIRE	PERSON	0.68+
COVID	OTHER	0.67+
years	DATE	0.64+
CUBE	TITLE	0.59+
theCUBE	ORGANIZATION	0.58+
Management	ORGANIZATION	0.52+
President	PERSON	0.5+
First	TITLE	0.48+

Jitesh Ghai, Informatica | CUBE Conversation, July 2020

(ambient music) >> Narrator: From the cube studios in Palo Alto in Boston, connecting with thought leaders all around the world, this is a CUBE conversation. >> Hello welcome to this cube conversation. I'm John Furrier, host of theCUBE here in our Palo Alto studios. During this quarantine, crew doing all the interviews, getting all the top story especially during this COVID pandemic. Great conversation here Jitesh Ghai, Senior Vice President and General Manager of Data Management with Informatica, CUBE alumni multi time. We can't be in person this year, because of the pandemic but a lot of great content. We've been doing a lot of interviews with you guys. Jitesh great to see you. Thanks for coming on. >> Hey, great to see you again. We weren't able to make it happen in person this year, but if not in person, virtually will have to work. >> One of the things, I'm a half glass half full kind of guy but you can't look at this without saying man, it's bad. But it really highlights how things are going on. So first, how are you doing? How's everyone Informatica doing over there? You guys are doing okay? >> We are well, we are well, families well, the Informatica family is well. So overall, can't complain can't complain, I think it was remarkable how quickly we were able to transition to a work from home environment for our global 5000 plus organization. And really, the fact that we're Cloud First Cloud Native, both in our product offerings, as well as an IT organization really helped make that transition seamless. >> In our past conversations on theCUBE and through all the Informatica employees it's always been kind of an inside baseball, kind of inside the ropes conversation in the industry about data. Now more than ever, with the pandemic, you starting to see people seeing it. Oh, I get it now. I get why data is important. I can see why Cloud First, Mobile First, Data First strategies and now Virtual First, is now this transformational scene. Everyone's feeling it, you can't help not ignore it. It's happening. It's also highlighting what's working, what's not. I have to ask you in the current environment Jitesh what are you seeing as some of those opportunities that your customers are dealing with approach to data? 'Cause clearly, you're working with that data layer, there's a lot of innovation opportunities, you've got CLAIRE on the AI side, all great. But now with the pandemic, it's really forcing that conversation. I got to rethink about what's going to happen after and have a really good strategy. >> Yeah, you're exactly right. There's a broad based realization that, I'll take a step back. First, we all know that as global 2000 organizations or in general, we all need to be data driven, we need to make fact based decisions. And there is a lot of that good work that's happened over the last few years as organizations have realized just how important data is to innovate and to deliver new products and services, new business models. What's really happened is that, during this COVID pandemic, there is a greater appreciation for trust in data. Historically, organizations became data driven, we're on the journey of being increasingly data driven. However, there was some element of Oh, gut or experience and that combined with data will get us to the outcomes we're looking for, will enable us to make the decisions. In this pandemic world of great uncertainty, supply chains falling apart on occasion, groceries not getting delivered on time et cetra, et cetra. The appreciation and critical importance on the quality on the trust of data is greater than ever to drive the insights for organizations. Leaders are less hesitant or sorry, leaders are more hesitant to just go with your gut type of approaches. There is a tremendous reliance on data. And we're seeing it in particular, more than ever, as you can imagine in the healthcare provider sector, in the public sector with federal state and local, as all of these organizations are having to make very difficult decisions, and are increasingly relying on high quality, trustworthy governed data to help them make what can be life or death decision. So a big shift and appreciation for the importance and trustworthiness in their data, their data state and their insights. >> So as the GM of data management and Senior Vice President at Informatica, you get a good view of things. I got to ask you love this data 4.0 concept. Talk about what that means to you because you got customers have been doing data management with you guys for a while, but now it's data 4.0 that has a feeling of agility to it. It's got kind of a DevOps vibe. It feels like a lot of automation being discussed and you mentioned trust. What is data 4.0 mean? >> So data 4.0 for us is where AI and ML is powering data management. And so what do I mean by that? There is a greater insight and appreciation for high quality trustworthy data to enable organizations to make fact based decisions to be more data driven. But how do you do that when data is exponentially growing in volume, where data types are increasing, where data is moving increasingly between Clouds, between On-premises and Clouds between various ecosystems, new data sources are emerging, the internet of things is yet another exploding source of data. This is a lot of different types of data, a lot of volume of data, a lot of different locations, and gravity of data where data resides. So the question becomes how do you practically manage this data without intelligence and automation. And that's what the era of data 4.0 is. Where AI and ML is powering data management, making it more intelligent, automating more and more of what was historically manual to enable organizations to scale, to enable them to scale to the breadth of data that they need to get a greater understanding of their data landscape within the enterprise, to get a greater understanding of the quality of the data within their landscape, how it's moving, and the associated privacy implications of how that data is being used, how effectively it's protected, so on and so forth. All underpinned by our CLAIRE engine, which is AI and ML applied to metadata, to deliver the intelligence and enable the automation of the data management operations. >> Awesome. Thanks for taking the time to define that, love that. The question I want to ask you, I'll put you on the spot here because I think this is an important conversation we've been having and also writing a lot about it on siliconangle.com and that is customers say to us, "Hey, John, I'm investing in Cloud Native technologies, using Cloud data warehouse as a data lakes. I need to make this work because this is a scale opportunity. I need to come out of this pandemic with really agile, scalable solutions that I can move fast on my applications." How do you comment on that? What's your thoughts on this because, you guys are in the middle of all this with the data management. >> I couldn't agree more. Increasingly, data workloads are moving to the Cloud. It's projected that by 2022, 75% of all databases will be in the Cloud, and COVID-19 is really accelerating it. It's opening the eyes of leadership of decision makers to be truly Cloud First and Cloud Native, now more than ever. And so organizations, traditional banking organizations, highly regulated industries that have been hesitant to move to the cloud, are now aggressively embarking on that journey. And industries that were early adopters of the Cloud are now accelerating that journey. I mentioned earlier that, we had a very seamless transition as we moved to a work from home environment, and that's because our IT is Cloud First Cloud Native. And why is that? It's because it's through being Cloud First and Cloud Native that you get the resiliency, the agility, the flexibility benefits in these uncertain times. And we're seeing that with the data and analytics stack as well. Customers are accelerating the move to Cloud data warehouses to Cloud data lakes, and become Cloud Native for their data management stack in addition to the data analytics platforms. >> Great stuff which I agree with hundred percent. Cloud Native is where it goes but you aren't they're (laughs) yet. Still on Hybrid and Multi-cloud is a big discussion. I want to get your thoughts >> Completely. >> On how that's going to play up because if you put Hybrid cloud and Multi-cloud I see Public cloud it's amazing, we know that. But Hybrid and Multi-cloud as the next generation of kind of interoperability framework of Cloud services, you're going to have to overlay and manage data governance and privacy. It's going to get more complicated, right? So how are you seeing your customers approach that piece, on the Public side, and then with Hybrid, because that's become a big discussion point. >> So Hybrid is an absolutely critical enabling capability as organizations modernize their on premise estate into the Cloud. You need to be able to move and connect to your On-premise applications, databases, and migrate the data that's important into the Cloud. So Hybrid is an essential capability. When I say Informatica is Cloud First Cloud Native, being Cloud First Cloud Native as a data management as a service provider if you will, requires essentially capabilities of being able to connect to On-premise data sources and therefore, be Hybrid. So Hybrid architecture is an essential part of that. Equally, it's important to enable organizations to understand what needs to go to the Cloud. As you're modernizing your infrastructure, your applications, your data and analytics stack. You don't need to bring everything to the Cloud with you. So there's an opportunity for organizations to introduce efficiencies. And that's done by enabling organizations to really scan the data landscape On-premise, scan the data that already exists in the various Public clouds that they partner with, and understand what's important, what's not, what can be decommissioned and left behind to realize savings and what is important for the business and needs to be moved into a Cloud Native analytic stack. And that's really where our CLAIRE metadata intelligence capabilities come to bear. And that's really what serves as the foundation of data governance, data cataloging and data privacy, to enable organizations to get the right data into the Cloud. To do so, while ensuring privacy. And to ensure that they govern that data in their new now Cloud Native analytics stack, whether it's AWS, Azure, GCP, snowflake data, bricks, all partners, all deep partnerships that we have. >> Jitesh, I want to get your thoughts on something. I was having a Zoom call a couple weeks ago, with a bunch of CXO friends, people, practitioners, probably some of them are probably your customers. It was kind of a social get together. But we were talking about, how the world we're living in pandemic, from COVID data, fake news, and one of the comments was, finally the whole world now realized what my life like. And in referring to how we're seeing fake news and misinformation kind of screw up an election and you got COVID's got 10 zillion different data points and people are making it to tell stories. And what does it really mean? There's a lot of trust involved. People are confused, and all that's going on. Again, in that backdrop, he said that that's my world. >> Right. This is back down to some of the things you're talking about, trust. We've talked about metadata services in the past. This authenticity, the duck democratization has been around for a while in the enterprise, so that dealing with bad data or fake data or too much data, you can make data (laughs) into whatever you want. You got to make sense of it. What's your thoughts on the reaction to his comment? I mean, what does it make you feel? >> Completely agree, completely agree. And that goes back to the earlier comment I made about making fact based decisions that you can have confidence in because the insight is based on trusted data. And so you mentioned data democratization. Our point of view is to democratize data, you have to do it on a foundational governance, right? There's a reason why traffic lights exist, it's to facilitate or at least attempt to facilitate the optimal free flow of traffic without getting into accidents, without causing congestion, so on and so forth. Equally, you need to have a foundation of governance. And I realized that there's an optical tension of democratized data, which is, free data for everybody consume it whenever and however you want, and then governance, which seems to imply, locking things down controlling them. And really, when I say you need a foundation of data governance, you need to enable for organizations to implement guardrails so that data can be effectively democratized. So that data consumers can easily find data. They can understand how trustworthy it is, what the quality of it is, and they can access it in easy way and consume it, while adhering to the appropriate privacy policies that are fit for the use of that particular set of data that a data and data consumer wants to access. And so, how do you practically do that? That's where data 4.0 AI power data management comes into play. In that, you need to build a foundation of what we call intelligent data governance. A foundation of scanning metadata, combining it with business metadata, linking it into an enterprise knowledge graph that gives you an understanding of an organization and enterprises data language. It auto tags auto curates, it gives you insight into the quality of the data, and now enables organizations to publish these curated data sets into a capability, what we call a data marketplace, so that much like Amazon.com, you can shop for the data, you can browse home and garden, electronics various categories. You can identify the data sets that are interesting to you, when you select them, you can look at the quality dimensions that have already been analyzed and associated with the data set. And you can also review the privacy policies that govern the use of that data set. And if you're interested in it, find the data sets, add them to your shopping cart, like you would do with Amazon.com, and check out. And when you do that triggers off an approval workflow to enable organizations to that last mile of governing access. And once approved, we can automatically provision the datasets to wherever you want to analyze them, whether it's in Tableau Power BI, an S3 market, what have you. And that is what I mean by a foundation of intelligent data governance. That is enabling data democratization. >> A common metadata layer gives you capabilities to use AI, I get that, There's a concept that you guys are talking a lot about, this augmentation to the data. This augmented data management activities that go on. What does that mean? Can you describe and explain that further and unpack that? This augmented data management activity? >> Yeah, and what do we mean by augmented data management, it's a really a first step into full blown automation of data management. In the old world, a developer would connect to a source, parse the source schema, connect to another source, parse its source schema, connect to the target, understand the target schema, and then pick the appropriate fields from the various sources, structure it through a mapping and then run a job that transforms the data and delivers it to a target database, in its structure, in its schema, in its format. Now that we have enterprise scale metadata intelligence, we know what source of data looks like, we know what targets exist as you simply pick sources and targets, we're able to automatically generate the mappings and automate this development part of the process so that organizations can more rapidly build out data pipelines to support their AI to operationalize AIML, to enable data science, and to enable analytics. >> Jitesh great insight. I really appreciate you explaining all this concept and unpacking that with me. Final point, I'd love you to have you just take a minute to put the plug in there for Informatica, what you're working on? What are your customers doing? What are some of the best practices coming out of the current situation? Take a minute to talk about that. >> Yeah, thank you, I'm happy to. It really comes down to focusing on enabling organizations to have a complete understanding of their data landscape. And that is, where we're enabling organizations to build an enterprise knowledge graph of technical metadata, business metadata, operational usage metadata, social metadata to understand and link and develop the necessary context to understand what data exists, where how it's used, what its purpose is and whether or not you should be using. And that's where we're building the Google for the enterprise to help organizations develop that. Equally, leveraging that insight, we're building out the necessary that insight and intelligence through CLAIRE, we're building out the automation in the data quality capabilities, in the data integration capabilities, in the metadata management capabilities, in the master data management capabilities, as well as the data privacy capability. So things that our tooling historically used to do manually, we're just automating it so that organizations can more productively access data, understand it and scale their understanding and insight and analytics initiatives with greater trust greater insight. It's all built on a foundation of our intelligent data platform. >> Love it, scaling data. It's that's really the future fast, available, highly available, integrated to the applications for AI. That's the future. >> Exactly right. Data 4.0, (laughs) AI power data management. >> I love talking about data in the future, because I think that's really valuable. And I think developers, and I've always been saying for over a decade now data is a critical piece for the applications, and AI really unlocks that of having it available, and surface is critical. You guys doing a great job. Thanks for the insight, appreciate you Jitesh. Thank you for coming on. >> Thanks for having me. Pleasure to be here. >> You couldn't do it in person with Informatica world but we're getting the conversations here on the remote CUBE, CUBE virtual. I'm John Furrier, you're watching CUBE conversation with Jitesh Ghai Senior Vice President General Manager, Data Manager at Informatica. Thanks for watching. (upbeat music)

Published Date : Jul 9 2020

SUMMARY :

leaders all around the world, because of the pandemic Hey, great to see you again. One of the things, I'm a And really, the fact that I have to ask you in the and that combined with data I got to ask you love that they need to get and that is customers say to us, early adopters of the Cloud but you aren't they're (laughs) yet. On how that's going to play up and connect to your On-premise and people are making it to tell stories. This is back down to some of the things And that goes back to the There's a concept that you and delivers it to a target database, of the current situation? and whether or not you should be using. It's that's really the future fast, AI power data management. data in the future, Pleasure to be here. on the remote CUBE, CUBE virtual.

ENTITIES

Entity	Category	Confidence
Jitesh Ghai	PERSON	0.99+
John Furrier	PERSON	0.99+
Jitesh	PERSON	0.99+
John	PERSON	0.99+
Palo Alto	LOCATION	0.99+
July 2020	DATE	0.99+
Informatica	ORGANIZATION	0.99+
AWS	ORGANIZATION	0.99+
2022	DATE	0.99+
Google	ORGANIZATION	0.99+
First	QUANTITY	0.99+
75%	QUANTITY	0.99+
Amazon.com	ORGANIZATION	0.99+
Boston	LOCATION	0.99+
first	QUANTITY	0.99+
one	QUANTITY	0.98+
pandemic	EVENT	0.98+
this year	DATE	0.98+
siliconangle.com	OTHER	0.98+
hundred percent	QUANTITY	0.98+
CUBE	ORGANIZATION	0.98+
first step	QUANTITY	0.98+
both	QUANTITY	0.97+
Cloud Native	TITLE	0.97+
COVID pandemic	EVENT	0.96+
One	QUANTITY	0.95+
10 zillion different data points	QUANTITY	0.94+
GCP	ORGANIZATION	0.94+
COVID-19	OTHER	0.92+
5000 plus	QUANTITY	0.92+
Cloud	TITLE	0.91+
Cloud Native	TITLE	0.9+
COVID	OTHER	0.87+
couple weeks ago	DATE	0.86+
Azure	ORGANIZATION	0.82+
CLAIRE	PERSON	0.82+
over a decade	QUANTITY	0.81+
2000 organizations	QUANTITY	0.81+
Mobile First	ORGANIZATION	0.8+
CXO	ORGANIZATION	0.79+
Data First	ORGANIZATION	0.78+
Cloud First	COMMERCIAL_ITEM	0.78+
Cloud	COMMERCIAL_ITEM	0.77+
Virtual First	ORGANIZATION	0.77+
Cloud Native	COMMERCIAL_ITEM	0.74+
Senior	PERSON	0.74+
half	QUANTITY	0.73+
Data Management	ORGANIZATION	0.72+
Hybrid	TITLE	0.7+
First	TITLE	0.66+
theCUBE	ORGANIZATION	0.66+
last	DATE	0.66+
Cloud First Cloud Native	TITLE	0.66+
Cloud First Cloud Native	TITLE	0.65+
Tableau Power BI	TITLE	0.64+
years	DATE	0.63+
Native	TITLE	0.62+
First	ORGANIZATION	0.55+
Vice President	PERSON	0.51+

Recommend Videos

Sentiment Analysis

AWS Comprehend

Search Results for Tableau Power BI: