Bruno Aziza, Google | CUBEconversation

(gentle music) >> Welcome to the new abnormal. Yes, you know, the pandemic, it did accelerate the shift to digital, but it's also created disorder in our world. I mean, every day it seems that companies are resetting their office reopening playbooks. They're rethinking policies on large gatherings and vaccination mandates. There's an acute labor shortage in many industries, and we're seeing an inventory glutton in certain goods, like bleach and hand sanitizer. Airline schedules and pricing algorithms, they're all unsettled. Is inflation transitory? Is that a real threat to the economy? GDP forecasts are seesawing. In short, the world is out of whack and the need for fast access to quality, trusted and governed data has never been greater. Can coherent data strategies help solve these problems, or will we have to wait for the world to reach some type of natural equilibrium? And how are companies, like Google, helping customers solve these problems in critical industries, like financial services, retail, manufacturing, and other sectors? And with me to share his perspectives on data is a long-time CUBE alum, Bruno Aziza. He's the head of data analytics at Google. Bruno, my friend, great to see you again, welcome. >> Great to see you, thanks for having me, Dave. >> So you heard my little narrative upfront, how do you see this crazy world of data today? >> I think you're right. I think there's a lot going on in the world of data analytics today. I mean, certainly over the last 30 years, we've all tried to just make the life of people better and give them access more readily to the information that they need. But certainly over the last year and half, two years, we've seen an amazing acceleration in digital transformation. And what I think we're seeing is that even after three decades of investment in the data analytics world, you know, the opportunity is still really out wide and is still available for organizations to get value out of their data. I was looking at some of the latest research in the market, and, you know, only 32% of companies are actually able to say that they get tangible, valuable insights out of their data. So after all these years, we still have a lot of opportunity ahead of us, of course, with the democratization of access to data, but also the advent in machine learning and AI, so that people can make better decisions faster than their competitors. >> So do you think that the pandemic has heightened that sort of awareness as they were sort of forced to pivot to digital, that they're maybe not getting enough out of their data strategies? That maybe their whatever, their organization, their technology, their way they were thinking about data was not adequate and didn't allow them to be agile enough? Why do you think that only 32% are getting that type of value? >> I think it's true. I think, one, digital transformation has been accelerated over the last two years. I think, you know, if you look at research the last two years, I've seen almost a decade of digital acceleration, you know, happening. But I also think that we're hitting a particular time where employees are expecting more from their employers in terms of the type of insights that can get. Consumers are now evolving, right? So they want more information. And I think now technology has evolved to a point where it's a lot easier to provision a data cloud environment so you can get more data out to your constituents. So I think the connection of these three things, expectation of employees, expectation of customers to better customer experiences, and, of course, the global environment, has accelerated quite a bit, you know, where the space can go. And for people like me, you know, 20 years ago, nobody really cared about databases and so forth. And now I feel like, you know, everybody's, you know, understands the value that we can get out of it. And we're kind of getting, you know, in the sexy territory, finally, data now is sexy for everyone and there's a lot of interest in the space. >> You and I met, of course, in the early days of Hadoop. And there were many things about Hadoop that were profound and, of course, many things that, you know, just were overly complex, et cetera. And one of the things we saw was this sort of decentralization. We thought that Hadoop was going to send five megabytes of code to petabytes of data. And what happened is everything, you know, came into this centralized repository and that centralized thinking, the data pipeline organization was very centralized. Are you seeing companies rethink that? I mean, has the cloud changed their thinking? You know, especially as the cloud expands to the edge, on-prem, everywhere. How are you seeing organizations rethink their regimes for data? >> Yeah, I think, you know, we've seen over the last three decades kind of the pendulum, right, from really centralizing everything and making the IT organization kind of the center of excellence for data analytics, all the way to now, you know, providing data as a self-service, you know, application for end-users. And I think what we're seeing now is there's a few forces happening. The first one is, of course, multicloud, right? So the world today is clearly multicloud and it's going to be multicloud for many, many years. So I think not only are now people considering their on-prem information, but they're also looking at data across multiple clouds. And so I think that is a huge force for chief data officers to consider is that, you know, you're not going to have data centralized in one place, nicely organized, because sometimes it's going to be a factor of where you want to be as an organization. Maybe you're going to be partnering with other organizations that have data in other clouds. And so you want to have an architecture that is modern and that accommodates this idea of an open cloud. The second problem that we see is this idea around data governance, intelligent data governance, right? So the world of managing data is becoming more complex because, of course, you're now dealing with many different speeds, you're dealing with many different types of data. And so you want to be able to empower people to get access to the information, without necessarily having to move this data, so they can make quick decisions on the data. So this idea of a data fabric is becoming really important. And then the third trend that we see, of course, is this idea around data sharing, right? People are now looking to use their own data to create a data economy around their business. And so the ability to augment their existing data with external data and create data products around it is becoming more and more important to the chief data officers. So it's really interesting we're seeing a switch from, you know, this chief data officer really only worried about governance, to this we're now worried about innovation, while making sure that security and governance is taken care of. You know, we call this freedom within the framework, which is a great challenge, but a great opportunity for many of these data leaders. >> You mentioned several things there. Self-service, multicloud, the governance key, especially if we can federate that governance in a decentralized world. Data fabric is interesting. I was talking to Zhamak Dehghani this weekend on email. She coined the term data mesh. And there seems to be some confusion, data mesh, data fabric. I think Gartner's using the term fabric. I know like NetApp, I think coined that term, which to me is like an infrastructure layer, you know. But what do you mean by data fabric? >> Well, the first thing that I would say is that it's not up to the vendors to define what it is. It really is up to the customer. The problem that we're seeing these customers trying to fix is you have a diversity of data, right? So you have data stored in the data mart, in a data lake, in a data warehouse, and they all have their specific, you know, reasons for being there. And so this idea of a data fabric is that without moving the data, can you, one, govern it intelligently? And, two, can you provide landing zones for people to actually do their work without having to go through the pain of setting up new infrastructure, or moving information left and right, and creating new applications? So it's this idea of basically taking advantage of your existing environment, but also governing it centrally, and also now providing self-service capabilities so people can do their job easily. So, you know, you might call it a data mesh, you might call it a data fabric. You know, the terminology to me, you know, doesn't seem to be the barrier. The issue today is how do we enable, you know, this freedom for customers? Because, you know, I think what we've seen with vendors out there is they're trying to just take the customer down to their paradigms. So if they believe in all the answers need to be in a data warehouse, they're going to guide the customer there. If they believe that, you know, everything needs to be in a data lake, they're going to guide the customer there. What we believe in is this idea of choice. You should be able to do every single use case. And we should be able to enable you to manage it intelligently, both from an access standpoint, as well as a governance standpoint. >> So when you think about those different, and I like that, you're making it somewhat technology agnostic, so whether it's a data warehouse, or a data lake, or a data hub, a data mart, those are nodes within the mesh or the fabric, right? That are discoverable, accessible, I guess, governed. I think that there's got to be some kind of centralized governance edict, but in a federated governance model so you don't have to move the data around. Is that how you're thinking about it? >> Absolutely, you know, in our recent event, in the Data Cloud Summit, we had Equifax. So the gentleman there was the VP of data governance and data fabric. So you can start seeing now these roles, you know, created around this problem. And really when you listen to what they're trying to do, they're trying to provide as much value as they can without changing the habits of their users. I think that's what's key here, is that the minute you start changing habits, force people into paradigms that maybe, you know, are useful for you as a vendor, but not so useful to the customer, you get into the danger zone. So the idea here is how can you provide a broad enough platform, a platform that is deep enough, so the data can be intelligently managed and also distributed and activated at the point of interaction for the end-user, so they can do their job a lot easier? And that's really what we're about, is how do you make data simpler? How do you make, you know, the process of getting to insight a lot more fluid without changing habits necessarily, both on the IT side and the business side? >> I want to get to specifics on what Google is doing, but the last sort of uber-trends I want to ask you about 'cause, again, we've known each other for a long time. We've seen this data world grow up. And you're right, 20, 30 years ago, nobody cared about database. Well, maybe 30 years ago. But 20 years ago, it was a boring market, right now it's like the hottest thing going. But we saw, you know, bromide like data is the new oil. Well, we found out, well, actually data is more valuable than oil 'cause you can use, you know, data in a lot of different places, oil you can use once. And then the term like data as an asset, and you said data sharing. And it brings up the notion that, you know, you don't want to share your assets, but you do want to share your data as long as it can be governed. So we're starting to change the language that we use to describe data and our thinking is changing. And so it says to me that the next 10 years, aren't going to be like the last 10 years. What are your thoughts on that? >> I think you're absolutely right. I think if you look at how companies are maturing their use of data, obviously the first barrier is, "How do I, as a company, make sure that I take advantage of my data as an asset? How do I turn, you know, all this information into a sustainable, competitive advantage, really top of mind for organizations?" The second piece around it is, "How do I create now this innovation flywheel so that I can create value for my customers, and my employees, and my partners?" And then, finally, "How do I use data as the center of a product that I can then further monetize and create further value into my ecosystem?" I think the piece that's been happening that people have not talked a lot about I think, with the cloud, what's come is it's given us the opportunity to think about data as an ecosystem. Now you and I are partnering on insights. You and I are creating assets that might be the combination of your data and my data. Maybe it's an intelligent application on top of that data that now has become an intelligent, rich experience, if you will, that we can either both monetize or that we can drive value from. And so I think, you know, it's just scratching the surface on that. But I think that's where the next 10 years, to your point, are going to be, is that the companies that win with data are going to create products, intelligent products, out of that data. And they're just going to take us to places that, you know, we are not even thinking about right now. >> Yeah, and I think you're right on. That is going to be one of the big differences in the coming years is data as product. And that brings up sort of the line of business, right? I mean the lines of business heads historically have been kind of removed from the data group, that's why I was asking you about the organization before. But let's get into Google. How do you describe Google's strategy, its approach, and why it's unique? >> You know, I think one of the reasons, so I just, you know, started about a year ago, and one of the reasons for why I found, you know, the Google mission interesting, is that it's really rooted at who we are and what we do. If you think about it, we make data simple. That's really what we're about. And we live that value. If you go to google.com today, what's happening? Right, as an end-user, you don't need any training. You're going to type in whatever it is that you're looking for, and then we're going to return to you highly personalized, highly actionable insights to you as a consumer of insights, if you will. And I think that's where the market is going to. Now, you know, making data simple doesn't mean that you have to have simple infrastructure. In fact, you need to be able to handle sophistication at scale. And so simply our differentiation here is how do we go from highly sophisticated world of the internet, disconnected data, changing all the time, vast volume, and a lot of different types of data, to a simple answer that's actionable to the end-user? It's intelligence. And so our differentiation is around that. Our mission is to make data simple and we use intelligence to take the sophistication and provide to you an answer that's highly actionable, highly relevant, highly personalized for you, so you can go on and do your job, 'cause ultimately the majority of people are not in the data business. And so they need to get the information just like you said, as a business user, that's relevant, actionable, timely, so they can go off and, you know, create value for their organization. >> So I don't think anybody would argue that Google, obviously, are data experts, arguably the best in the world. But it's interesting, some of the uniqueness here that I'm hearing in your language. You used the word multicloud, Amazon doesn't, you know, use that term. So that's a differentiation. And you sell a cloud, right? You sell cloud services, but you're talking about multicloud. You sell databases, but, of course, you host other databases, like Snowflake. So where do you fit in all this? Do you see your role, as the head of data analytics, is to sort of be the chef that helps combine all these different capabilities? Or are you sort of trying to help people adopt Google products and services? How should we think about that? >> Yeah, the best way to think about, you know, I spend 60 to 70% of my time with customers. And the best way I can think about our role is to be your innovation partner as an organization. And, you know, whichever is the scenario that you're going to be using, I think you talked about open cloud, I think another uniqueness of Google is that we have a very partner friendly, you know, approach to the business. Because we realized that when you walk into an enterprise or a digital native, and so forth, they already have a lot of assets that they have accumulated over the years. And it might be technology assets, but also might be knowledge, and know-how, right? So we want to be able to be the innovation vendor that enables you to take these assets, put them together, and create simplicity towards the data. You know, ultimately, you can have all types of complexity in the backend. But what we can do the best for you is make that really simple, really integrated, really unified, so you, as a business user, you don't have to worry about, "Where is my data? Do I need to think about moving data from here to there? Are there things that I can do only if the data is formatted that way and this way?" We want to remove all that complexity, just like we do it on google.com, so you can do your job. And so that's our job, and that's the reason for why people come to us, is because they see that we can be their best innovation partner, regardless where the data is and regardless, you know, what part of the stack they're using. >> Well, I want to take an example, because my example, I mean, I don't know Google's portfolio like you do, obviously, but one of the things I hear from customers is, "We're trying to inject as much machine intelligence into our data as possible. We see opportunities to automate." So I look at something like BigQuery, which has a strong affinity in embedded machine learning and machine intelligence, as an example, maybe of that simplification. But maybe you could pick up on that and give us some other concrete examples. >> Yeah, specifically on products, I mean, there are a lot products we can talk about, and certainly BigQuery has tremendous market momentum. You know, and it's really anchored on this idea that, you know, the idea behind BigQuery is that just add data and we'll do the rest, right? So that's kind of the idea where you can start small and you can scale at incredible, you know, volumes without really having to think about tuning it, about creating indexes, and so forth. Also, we think about BigQuery as the place that people start in order to build their ecosystem. That's why we've invested a lot in machine learning. Just a few years ago, we introduced this functionality called BigQuery Machine Learning, or BigQuery ML, if you're familiar with it. And you notice out of the top 100 customers we have, 80% of these customers are using machine learning right out of, you know, BigQuery. So now why is that? Why is it that it's so easy to use machine learning using BigQuery is because it's built in. It was built from the ground up. Instead of thinking about machine learning as an afterthought, or maybe something that only data scientists have access to that you're going to license just for narrow scenarios, we think about you have your data in a warehouse that can scale, that is equally awesome at small volume as very large volume, and we build on top of that. You know, similarly, we just announced our analytics exchange, which is basically the place where you can now build these data analytics assets that we discussed, so you can now build an ecosystem that creates value for end-users. And so BigQuery is really at the center of a lot of that strategy, but it's not unlike any of the other products that we have. We want to make it simple for people to onboard, simple to scale, to really accomplish, you know, whatever success is ahead of them. >> Well, I think ecosystems is another one of those big differences in the coming decade, because you're able to build ecosystems around data, especially if you can share that data, you know, and do so in a governed and secure way. But it leads to my question on industries, and I'm wondering if you see any patterns emerging in industries? And each industry seems to have its own unique disruption scenario. You know, retail obviously has been, you know, disrupted with online commerce. And healthcare with, of course, the pandemic. Financial services, you wonder, "Okay, are traditional banks going to lose control of payment systems?" Manufacturing you see our reliance on China's supply chain in, of course, North America. Are you seeing any patterns in industry as it pertains to data? And what can you share with us in terms of insights there? >> Yeah, we are. And, I mean, you know, there's obviously the industries that are, you know, very data savvy or data hungry. You think about, you know, the telecommunication industry, you think about manufacturing, you think about financial services and retail. I mean, financial services and retailers are particularly interesting, because they're kind of both in the retail business and having to deal with this level of complexity of they have physical locations and they also have a relationship with people online, so they really want to be able to bring these two worlds together. You know, I think, you know, about those scenarios of Carrefour, for instance. It's a large retailer in Europe that has been able to not only to, you know, onboard on our platform and they're using, you know, everything from BigQuery, all the way to Looker, but also now create the data assets that enable them to differentiate within their own industry. And so we see a lot of that happening across pretty much all industries. It's difficult to think about an industry that is not really taking a hard look at their data strategy recently, especially over the last two years, and really thought about how they're creating innovation. We have actually created what we call design patterns, which are basically blueprints for organization to take on. It's free, it's free guidance, it's free datasets and code that can accelerate their building of these innovative solutions. So think about the, you know, ability to determine propensity to purchase. Or build, you know, a big trend is recommendation systems. Another one is anomaly detection, and this was great because anomaly detection is a scenario that works in telco, but also in financial services. So we certainly are seeing now companies moving up in their level of maturity, because we're making it easier and simpler for them to assemble these technologies and create, you know, what we call data-rich experiences. >> The last question is how you see the emerging edge, IoT, analytics in that space? You know, a lot of the machine learning or AI today is modeling in the cloud, as you well know. But when you think about a lot of the consumer applications, whether it's voice recognition or, you know, or fingerprinting, et cetera, you're seeing some really interesting use cases that could bleed into the enterprise. And we think about AI inferencing at the edge as really driving a lot of value. How do you see that playing out and what's Google's role there? >> So there's a lot going on in that space. I'll give you just a simple example. Maybe something that's easy for the community to understand is there's still ways that we define certain metrics that are not taking into account what actually is happening in reality. I was just talking to a company whose job is to deliver meals to people. And what they have realized is that in order for them to predict exactly the time it's going to take them from the kitchen to your desk, they have to take into account the fact that distance sometimes it's not just horizontal, it's also vertical. So if you're distributing and you're delivering meals, you know, in Singapore, for instance, high density, you have to understand maybe the data coming from the elevators. So you can determine, "Oh, if you're on the 20th floor, now my distance to you, and my ability to forecast exactly when you're going to get that meal, is going to be different than if you are on the fifth floor. And, particularly, if you're ordering at 11:32, versus if you're ordering at 11:58." And so what's happening here is that as people are developing these intelligent systems, they're now starting to input a lot of information that historically we might not have thought about, but that actually is very relevant to the end-user. And so, you know, how do you do that? Again, and you have to have a platform that enables you to have a large diversity of use cases, and that thinks ahead, if you will, of the problems you might run into. Lots and lots of innovation in this space. I mean, we work with, you know, companies like Ford to, you know, reinvent the connected, you know, cars. We work with companies like Vodafone, 700 use cases, to think about how they're going to deal with what they call their data ocean. You know, I thought you would like this term, because we've gone from data lakes to data oceans. And so there is certainly a ton of innovation and certainly, you know, the chief data officers that I have the opportunity to work with are really not short of ideas. I think what's been happening up until now, they haven't had this kind of single, unified, simple experience that they can use in order to onboard quickly and then enable their people to build great, rich-data applications. >> Yeah, we certainly had fun with that over the years, data lake or data ocean. And thank you for remembering that, Bruno. Always a pleasure seeing you. Thanks so much for your time and sharing your perspectives, and informing us about what Google's up to. Can't wait to have you back. >> Thanks for having me, Dave. >> All right, and thank you for watching, everybody. This is Dave Vellante. Appreciate you watching this CUBE Conversation, and we'll see you next time. (gentle music)

Published Date : Aug 9 2021

SUMMARY :

to see you again, welcome. Great to see you, you know, the opportunity And for people like me, you know, you know, came into this all the way to now, you know, But what do you mean by data fabric? You know, the terminology to me, you know, so you don't have to move the data around. is that the minute you But we saw, you know, bromide And so I think, you know, that's why I was asking you and provide to you an answer Amazon doesn't, you know, use that term. and regardless, you know, But maybe you could pick up on that we think about you have your data has been, you know, So think about the, you know, recognition or, you know, of the problems you might run into. And thank you for remembering that, Bruno. and we'll see you next time.

ENTITIES

Entity	Category	Confidence
Ford	ORGANIZATION	0.99+
Carrefour	ORGANIZATION	0.99+
Vodafone	ORGANIZATION	0.99+
Bruno	PERSON	0.99+
Europe	LOCATION	0.99+
Dave Vellante	PERSON	0.99+
Singapore	LOCATION	0.99+
Amazon	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
Dave	PERSON	0.99+
Zhamak Dehghani	PERSON	0.99+
Bruno Aziza	PERSON	0.99+
fifth floor	QUANTITY	0.99+
80%	QUANTITY	0.99+
60	QUANTITY	0.99+
Gartner	ORGANIZATION	0.99+
BigQuery	TITLE	0.99+
20th floor	QUANTITY	0.99+
11:58	DATE	0.99+
second piece	QUANTITY	0.99+
11:32	DATE	0.99+
both	QUANTITY	0.99+
Equifax	ORGANIZATION	0.99+
today	DATE	0.99+
pandemic	EVENT	0.99+
North America	LOCATION	0.99+
one	QUANTITY	0.98+
second problem	QUANTITY	0.98+
Hadoop	TITLE	0.98+
30 years ago	DATE	0.98+
first barrier	QUANTITY	0.98+
Looker	ORGANIZATION	0.98+
Data Cloud Summit	EVENT	0.98+
20 years ago	DATE	0.98+
70%	QUANTITY	0.98+
first one	QUANTITY	0.98+
third trend	QUANTITY	0.97+
32%	QUANTITY	0.96+
first thing	QUANTITY	0.96+
two	QUANTITY	0.96+
two years	QUANTITY	0.96+
each industry	QUANTITY	0.96+
telco	ORGANIZATION	0.95+
two worlds	QUANTITY	0.95+
three decades	QUANTITY	0.95+
Hadoop	PERSON	0.95+
three things	QUANTITY	0.94+
20	DATE	0.94+
BigQuery ML	TITLE	0.93+
few years ago	DATE	0.93+
100 customers	QUANTITY	0.91+
700 use cases	QUANTITY	0.91+
last three decades	DATE	0.89+
single	QUANTITY	0.89+
google.com	ORGANIZATION	0.89+
a year ago	DATE	0.88+
next 10 years	DATE	0.88+
last two years	DATE	0.88+
single use case	QUANTITY	0.85+
China	LOCATION	0.84+
one place	QUANTITY	0.82+

Sudhir Hasbe, Google Cloud | Informatica World 2019

>> Live from Las Vegas, it's theCUBE. Covering Informatica World 2019. Brought to you by Informatica. >> Welcome back, everyone to theCUBE's live coverage of Informatica World 2019 I'm your host, Rebecca Knight, along with my cohost, John Furrier. We are joined by Sudhir Hasbe. He is the director of product management at Google Cloud. Thank you so much for coming on theCUBE. >> Thank you for inviting me. (laughing) >> So, this morning we saw Thomas Kurian up on the main stage to announce the expanded partnership. Big story in Wall Street Journal. Google Cloud and Informatica Team Up to Tame Data. Tell us more about this partnership. >> So if you take a look at the whole journey of data within organizations, lot of data is still siloed in different systems within different environments. Could be a hybrid on-prem. It could be multi-cloud and all. And customers need this whole end-to-end experience where you can go ahead and take that data, move it to Cloud, do data cleansing on it, do data preparation. You want to be able to go ahead and govern the data, know what data you have, like a catalog. Informatica provides all of those capabilities. And if you look at Google Cloud, we have some highly differentiated services like Google BigQuery, which customers love across the globe, to go ahead and use for analytics. We can do large scale analytics. We have customers from few terabytes to 100-plus petabytes, and storing that amount of data in BigQuery, analyzing, getting value out of it. And from there, all the A.I. capabilities that we have built on top of it. This whole journey of taking data from wherever it is, moving it, cleansing it, and then actually getting value out of it with Big Query, as with our A.I. capabilities. That whole end-to-end experience is what customers need. And with this partnership, I think we are bringing all the key components our customers need together for a perfect fit. >> Sadhir, first of all, great to see you. Since Google Next, we just had a great event by the way this year, congratulations. >> Thanks. >> A lot of great momentum in the enterprise. Explain for a minute. What is the relationship, what is the partnership? Just take a quick minute to describe what it is with Informatica that you're doing. >> Yeah, that's great. I think if you take a look at it, you can bring two key areas together in this partnership. There's data management. How do you get data into Cloud, how do you govern it, manage it, understand it. And then there is analyze the data and AI. So the main thing that we're bring together is these two capabilities. What do I mean by that? The two key components that will be available for our customers is the Intelligent Cloud services from Informatica, which will be available on GCP, will run on GCP. This will basically make sure that the whole end-to-end capability for that platform, like data pipelines and data cleansing and preparation, everything is now available natively on GCP. That's one thing. What that will also do is, Informatica team has actually optimized the execution as part of this migration. What that means is, now you'll be able to use products like Data Cloud, Dataproc. You'll be able to use some of the AI capabilities in BigQuery to actually go do the data cleansing and preparation and process-- >> So when you say "execute", you mean "running." >> Yeah, just running software. >> Not executing, go to market, but executing software. >> Executing software. If you have a data pipeline, you can literally layer this Dataproc underneath to go ahead and run some of the key processes. >> And so the value to the customer is seamless-- >> Seamless integration. >> Okay, so as you guys get more enterprise savvy, and it's clear you guys are doing good work, and obviously Thomas has got the chops there. We've covered that on theCUBE many times. As you go forward, this Cloud formula seems to be taking shape. Amazon, Azure, Google, coming in, providing onboarding to Cloud and vice-versa, so those relationships. The customers are scratching their heads, going, "Okay, where do I fit in that?" So, when you talk to customers, how do you explain that? Because, unlike the old days in computer science and the computer industry, there was known practices. You built a data center, you provisioned some servers, you did some things. It was the general-purpose formula. But every company is different. Their journey's different. Their software legacy make-up's different. Could be born in the cloud with on-prem compliance needs. So, how do customers figure this out? What's the playbook? >> I think the big thing is this: There's a trend in the industry, across the board, to go ahead and be more data-driven, build a culture that is data-driven culture. And as customers are looking at it, what they are seeing is, "Hey, traditionally I was doing a lot of stuff. "Managing infrastructure. Let me go build a data center. "Let me buy machines." That is not adding that much value. It is because. "I need to go do that." That's why they did that. But the real value is, if I can get the data, I can go analyze it, I can get better decisions from it. If I can use machine learning to differentiate my services, that's where the value is. So, most customers are looking at it and saying, "Hey, I know what I need to do in the industry now, "is basically go ahead and focus more on insights "and less on infrastructure." But as doing this, the most important thing is, data is still, as you mentioned, siloed. It's different applications, different data centers, still sitting in different places. So, I think what is happening with what we announced today is making it easy to get that data into Google Cloud and then leveraging that to go ahead and get insights. That's where the focus is for us. And as you get more of these capabilities in the cloud as native services, from Infomatica and Google, customers can now focus more on how to derive value from the data. Putting the data into Cloud, cleansing it, and data preparation, and all of that, that becomes easier. >> Okay, so that brings the solution question to the table. With the solutions that you see with Infomatica, because again, they have a broad space, a horizontal, on-prem and cloud, and they have a huge customer base with enterprise, 25 years, and big data is their thing. What us case is their low-hanging fruit right now? Where are people putting their toe in the water? Where are they jumping full in? Where do you see that spectrum of solutions? >> Great question. There are two or three key scenarios that I see across the board with talking to a lot of customers. Even today, I spoke to a lot of customers at this show. And the first main thing I hear is this whole thing, modedernization of the data warehousing and analytics infrastructure. Lot of data is still siloed and stuck into these different data systems that are there within organizations. And, if you want to go ahead and leverage that data to build on top of the data, democratize it with everybody within the organization, or to leverage AI and machine learning on top of it, you need to unwind what you've done and just take that data and put into Cloud and all. I think modernization of data warehouses and analytics infrastructure is one key play across the IT systems and IT operations. >> Before you go on to the next one, I just want to drill down on that. Because one of the things we're hearing, obviously here and all of the places, is that if you constrain the data, machine learning and AI application ultimately fails. >> Yes. >> So, legacy silos. You mentioned that. But also regulatory things. I got to have privacy now, forget my customer, GDPR first-year anniversary, new regulatory things around, all kinds of data, nevermind outside the United States. But the cloud is appealing, of just throwing it in there as one thing. It's an agility lag issue. Because lagging is not good for AI. You want real-time data. You need to have it fast. How does a customer do that? Is it best to store it in the cloud first, on-premise, with mechanisms? What's your take on this? >> I think it's different in different scenarios. I talk a lot of customers on this. Not all data is restricted from going anywhere. I think there are some data sets you want to have good governance in place. For example, if you have PII data, if you have important customer information, you want to make sure that you take the right steps to govern it. You want to anonymize it. You want to make sure that the right amount of data, per the policies within the organization, only gets into the right systems. And I think this is where, also, the partnership is helpful, because with Infomatica, the tooling that they're provided, or as you mentioned over 25 years, allows customers to understand what these data sets are, what value they're providing. And so, you can do anonymization of data before it lands into Cloud and all of that. So I think one thing is the tooling around that, which is critical. And the second thing is, if you can identify data sets that are real-time, and they don't have business-critical or PII-critical data, that you're fine as a business process to be there, then you can derive a lot of data in real time from all the data sets. >> Tell me about Google's big capabilities, because you guys have a lot of internal power platform features. BigQuery is one of them. Is BigQuery the secret weapon? Is that the big power source for managing the data? >> I would just say: Our customers love BigQuery, primarily because of the capability it provides. There are different capabilities. Let me just list a few. One is: We can do analytics at scale. So as organizations grow, even if data sets are small within organization, what I have seen is, over a period of time, when you derive a lot of value from data, you will start collecting more data within organization. And so, you have to think about scale, whether you are starting with one terabyte or one petabyte or 100 petabytes, it doesn't matter. Analyzing data at scale is what we're really good at, at different types of scale. Second is: democratizing data. We have done a good job of making data available through different tooling, existing tooling that customers have invested in and our tooling, to make it available to everybody. AirAsia is a good example. They have been able to go ahead and give right insights to everybody within the organization, which has helped them go save 5 to 10% in their operational costs. So that's one great example of democratizing access to insights. The third big thing is machine learning and AI. We all know there are just lack of resources to do, at once, analytics with AI and machine learning in the industry. So our goal has been democratize it. Make it easy within an organization. So investments that we have done with BigQuery ML, where you can do machine learning with just simple SQL statements or AutoML tables, which basically allows you to just, within the UI, map and say, "That's table in BigQuery, here's a column that I want to predict, and just automatically figure out what model you want to create, and then we can use neural networks to go do that. I think that kind of investments is what customers love about it from the platform side. >> What about the partnership from a particular functional part of the company, marketing? There's the old adage: 50% of my marketing budget is wasted. I just don't know which one. This one could really change that. >> Exactly right. >> So talk a little bit about the impact of it on marketing. >> I think the main thing is, if you think about the biggest challenge that CMOs have within organizations is how do you better marketing analytics and optimize the spend? So, one of the thing that we're doing with the partnership is not just breaking the silos, getting the data in BigQuery, all of that side and data governance. But another thing is with master data management capability that Infomatica brings to table. Now you can have all of your data in BigQuery. You leverage the Customer 360 that MDM provides and now CMOs can actually say, "Hey, I have a complete view of my customer. "I can do better segmentation. I can do better targeting. "I can give them better service." So that is actually going to derive lot of value with our customers. >> I want to just touch on that once, see if I can get this right. What you just said, I think might be the question I was just about to ask, which is: What is unique about Google's analytical portfolio with Infomatica specifically? Because there's other cloud deals they have. They have Azure and AWS. What's unique about you guys and Infomatica? Was it that piece? >> Yeah, I think there are a few things. One is the whole end-to-end experience of basically getting the data, breaking the silos, doing data governance, this tight integration between our product portfolio, where now you can get a great experience within the native GCP environment. That's one. And then on the other side, Cloud for Marketing is a big, big initiative for us. We work with hundreds of thousand of customers across the globe on their marketing spend and optimizing their marketing. And this is one of the areas where we can work together to go ahead and help those CMOs to get more value from their marketing investments. >> One of the conversations we're having here on theCUBE, and really that we're having in the technology industry, is about the skills gap. I want to hear what you're doing at Google to tackle this problem. >> I think one of the big things that we're doing is just trying to-- I have this team internally. In planning, I use "radical simplicity." And radical simplicity is: How do we take things that we are doing today and make it extremely simple for the next generation of innovation that we're doing? All the investments and BigQuery ML, you SQL for mostly everything. One of the other things that we announced at Next was SQL for data flow, SQL pipelines. What that means is, instead of writing Beam or Java code to build data flow pipelines, now you can write SQL commands to go ahead and create a whole pipeline. Similarly, machine learning with SQL. This whole aspect of simplifying capabilities so that you can use SQL and then AutoML, that's one part of it. And the second, of course, we are working with different partners to go ahead and have a lot of training that is available online, where customers don't have to go take classes, like traditional classes, but just go online. All the assets are available, examples are available. One of the big things in BigQuery we have is we have 70-plus public data sets, where you can go, with BigQuery sandbox, without credit card, you can start using it. You can start trying it out. You can use 70-plus data sets that already available and start learning the product. So I think that should help drive more-- >> Google's a real cultural tech company, so you guys obviously based that from Stanford. Very academic field, so you do hire a lot of smart people. But there's a lot of people graduating middle school, high school, college. Berkeley just graduated their first, inaugural class in data science and analytics. What's the skills, specifically, that young kids or people who are either retraining should either reboot, hone, or dial up? Is there any things that you see from people that are successful inside Google? I mean, sometimes you don't have to have that traditional math background or computer science, although math does help; it's key. But what is your observation? What's your personal view on this? >> I think the biggest thing I've noticed is the passion for data. I fundamentally believe that, in the next three to five years, most organizations will be driven with data and insights. Machine learning and AI is going to become more and more important. So this understanding and having the passion for understanding data, answering questions based on data is the first thing that you need to have. And then you can learn the technologies and everything else. They will become simpler and easier to use. But the key thing is this passion for data and having this data-driven decision-making is the biggest thing, so my recommendation to everybody who is going to college today and learning is: Go learn more about how to make better decisions with data. Learn more about tooling around data. Focus on data, and then-- >> It's like an athlete. If you're not at the gym shooting hoops, if you don't love it, if you're not living it, you're probably not going to be any-- (laughing) It's kind of like that. >> Sudhir, thank you so much for coming on theCUBE. It's a pleasure talking to you. >> Thank you. Thanks a lot for having me. >> I'm Rebecca Knight for John Furrier. You are watching theCUBE. (techno music)

Published Date : May 22 2019

SUMMARY :

Brought to you by Informatica. He is the director of product management at Google Cloud. Thank you for inviting me. Google Cloud and Informatica Team Up to Tame Data. at the whole journey of data within organizations, by the way this year, congratulations. What is the relationship, what is the partnership? the AI capabilities in BigQuery to actually go do If you have a data pipeline, you can literally layer and the computer industry, there was known practices. data is still, as you mentioned, siloed. Okay, so that brings the solution question to the table. And the first main thing I hear is obviously here and all of the places, is that all kinds of data, nevermind outside the United States. And the second thing is, if you can identify Is that the big power source for managing the data? And so, you have to think about scale, What about the partnership from a particular So, one of the thing that we're doing with the partnership the question I was just about to ask, which is: One is the whole end-to-end experience One of the conversations we're having here on theCUBE, One of the big things in BigQuery we have I mean, sometimes you don't have to have is the first thing that you need to have. if you don't love it, Sudhir, thank you so much for coming on theCUBE. Thanks a lot for having me. You are watching theCUBE.

ENTITIES

Entity	Category	Confidence
Rebecca Knight	PERSON	0.99+
Amazon	ORGANIZATION	0.99+
two	QUANTITY	0.99+
John Furrier	PERSON	0.99+
Google	ORGANIZATION	0.99+
Sadhir	PERSON	0.99+
Sudhir	PERSON	0.99+
50%	QUANTITY	0.99+
Sudhir Hasbe	PERSON	0.99+
Informatica	ORGANIZATION	0.99+
Thomas Kurian	PERSON	0.99+
one petabyte	QUANTITY	0.99+
first	QUANTITY	0.99+
5	QUANTITY	0.99+
100 petabytes	QUANTITY	0.99+
one terabyte	QUANTITY	0.99+
SQL	TITLE	0.99+
25 years	QUANTITY	0.99+
AWS	ORGANIZATION	0.99+
United States	LOCATION	0.99+
third	QUANTITY	0.99+
Las Vegas	LOCATION	0.99+
Thomas	PERSON	0.99+
Java	TITLE	0.99+
over 25 years	QUANTITY	0.99+
One	QUANTITY	0.99+
Second	QUANTITY	0.99+
BigQuery	TITLE	0.99+
today	DATE	0.99+
Infomatica	ORGANIZATION	0.99+
one	QUANTITY	0.99+
second	QUANTITY	0.98+
one part	QUANTITY	0.98+
Beam	TITLE	0.98+
one thing	QUANTITY	0.98+
Azure	ORGANIZATION	0.98+
70-plus	QUANTITY	0.97+
10%	QUANTITY	0.97+
theCUBE	ORGANIZATION	0.97+
three key scenarios	QUANTITY	0.97+
second thing	QUANTITY	0.97+
AirAsia	ORGANIZATION	0.96+
Dataproc	ORGANIZATION	0.96+
two key components	QUANTITY	0.96+
five years	QUANTITY	0.95+
BigQuery ML	TITLE	0.95+
this year	DATE	0.95+
first-year	QUANTITY	0.94+
Cloud	TITLE	0.94+
two capabilities	QUANTITY	0.94+
first thing	QUANTITY	0.94+
Informatica World 2019	EVENT	0.93+
hundreds of thousand of customers	QUANTITY	0.93+
two key areas	QUANTITY	0.91+
Google Cloud	ORGANIZATION	0.91+
GCP	TITLE	0.91+

Carol Carpenter, Google Cloud & Ayin Vala, Precision Medicine | Google Cloud Next 2018

>> Live from San Francisco, it's the Cube, covering Google Cloud Next 2018. Brought to you by Google Cloud and its ecosystem partners. >> Hello and welcome back to The Cube coverage here live in San Francisco for Google Cloud's conference Next 2018, #GoogleNext18. I'm John Furrier with Jeff Frick, my cohost all week. Third day of three days of wall to wall live coverage. Our next guest, Carol Carpenter, Vice President of Product Marketing for Google Cloud. And Ayin Vala, Chief Data Science Foundation for Precision Medicine. Welcome to The Cube, thanks for joining us. >> Thank you for having us. >> So congratulations, VP of Product Marketing. Great job getting all these announcements out, all these different products. Open source, big query machine learning, Istio, One dot, I mean, all this, tons of products, congratulations. >> Thank you, thank you. It was a tremendous amount of work. Great team. >> So you guys are starting to show real progress in customer traction, customer scale. Google's always had great technology. Consumption side of it, you guys have made progress. Diane Green mentioned on stage, on day one, she mentioned health care. She mentioned how you guys are organizing around these verticals. Health care is one of the big areas. Precision Medicine, AI usage, tell us about your story. >> Yes, so we are a very small non-profit. And we are at the intersection of data science and medical science and we work on projects that have non-profits impact and social impact. And we work on driving and developing projects that have social impact and in personalized medicine. >> So I think it's amazing. I always think with medicine, right, you look back five years wherever you are and you look back five years and think, oh my god, that was completely barbaric, right. They used to bleed people out and here, today, we still help cancer patients by basically poisoning them until they almost die and hopefully it kills the cancer first. You guys are looking at medicine in a very different way and the future medicine is so different than what it is today. And talk about, what is Presicion Medicine? Just the descriptor, it's a very different approach to kind of some of the treatments that we still use today in 2018. It's crazy. >> Yes, so Presicion Medicine has the meaning of personalized medicine. Meaning that we hone it into smaller population of people to trying to see what is the driving factors, individually customized to those populations and find out the different variables that are important for that population of people for detection of the disease, you know, cancer, Alzheimer's, those things. >> Okay, talk about the news. Okay, go ahead. >> Oh, oh, I was just going to say. And to be able to do what he's doing requires a lot of computational power to be able to actually get that precise. >> Right. Talk about the relationship and the news you guys have here. Some interesting stuff. Non-profits, they need compute power, they need, just like an eneterprise. You guys are bringing some change. What's the relationship between you guys? How are you working together? >> So one of our key messages here at this event is really around making computing available for everyone. Making data and analytics and machine learning available for everyone. This whole idea of human-centered AI. And what we've realized is, you know, data is the new natural resource. >> Yeah. >> In the world these days. And companies that know how to take advantage and actually mine insights from the data to solve problems like what they're solving at Precision Medicine. That is really where the new breakthroughs are going to come. So we announced a program here at the event, It's called Data Solutions for Change. It's from Google Cloud and it's a program in addition to our other non-profit programs. So we actually have other programs like Google Earth for non-profits. G Suite for non-profits. This one is very much focused on harnessing and helping non-profits extract insights from data. >> And is it a funding program, is it technology transfer Can you talk about, just a little detail on how it actually works. >> It's actually a combination of three things. One is funding, it's credits for up to $5,000 a month for up to six months. As well as customer support. One thing we've all talked about is the technology is amazing. You often also need to be able to apply some business logic around it and data scientists are somewhat of a challenge to hire these days. >> Yeah. >> So we're also proving free customer support, as well as online learning. >> Talk about an impact of the Cloud technology for the non-proit because6 I, you know, I'm seeing so much activity, certainly in Washington D.C. and around the world, where, you know, since the Jobs Act, fundings have changed. You got great things happening. You can have funding on mission-based funding. And also, the legacy of brand's are changing and open source changes So faster time to value. (laughs) >> Right. >> And without all the, you know, expertise it's an issue. How is Cloud helping you be better at what you do? Can you give some examples? >> Yes, so we had two different problems early on, as a small non-profit. First of all, we needed to scale up computationally. We had in-house servers. We needed a HIPAA complaint way to put our data up. So that's one of the reasons we were able to even use Google Cloud in the beginning. And now, we are able to run our models or entire data sets. Before that, we were only using a small population. And in Presicion Medicine, that's very important 'cause you want to get% entire population. That makes your models much more accurate. The second things was, we wanted to collaborate with people with clinical research backgrounds. And we need to provide a platform for them to be able to use, have the data on there, visualize, do computations, anything they want to do. And being on a Cloud really helped us to collaborate much more smoothly and you know, we only need their Gmail access, you know to Gmail to give them access and things. >> Yeah. >> And we could do it very, very quickly. Whereas before, it would take us months to transfer data. >> Yeah, it's a huge savings. Talk about the machine learning, AutoML's hot at the show, obviously, hot trend. You start to see AI ops coming in and disrupt more of the enterprise side but as data scientists, as you look at some of these machine learnings, I mean, you must get pretty excited. What are you thinking? What's your vision and how you going to use, like BigQuery's got ML built in now. This is like not new, it's Google's been using it for awhile. Are you tapping some of that? And what's your team doing with ML? >> Absolutely. We use BigQuery ML. We were able to use a few months in advance. It's great 'cause our data scientists like to work in BigQuery. They used to see, you know, you query the data right there. You can actually do the machine learning on there too. And you don't have to send it to different part of the platform for that. And it gives you sort of a proof of concept right away. For doing deep learning and those things, we use Cloud ML still, but for early on, you want to see if there is potential in a data. And you're able to do that very quickly with BigQuery ML right there. We also use AutoML Vision. We had access to about a thousand patients for MRI images and we wanted to see if we can detect Alzheimer's based on those. And we used AutoML for that. Actually works well. >> Some of the relationships with doctors, they're not always seen as the most tech savvy. So now they are getting more. As you do all this high-end, geeky stuff, you got to push it out to an interface. Google's really user-centric philosophy with user interfaces has always been kind of known for. Is that in Sheets, is that G Suite? How will you extend out the analysis and the interactions. How do you integrate into the edge work flow? You know? (laughs) >> So one thing I really appreciated for Google Cloud was that it was, seems to me it's built from the ground up for everyone to use. And it was the ease of access was very, was very important to us, like I said. We have data scientisits and statisticians and computer scientists onboard. But we needed a method and a platform that everybody can use. And through this program, they actually.. You guys provide what's called Qwiklab, which is, you know, screenshot of how to spin up a virtual machine and things like that. That, you know, a couple of years ago you have to run, you know, few command lines, too many command lines, to get that. Now it's just a push of a button. So that's just... Makes it much easier to work with people with background and domain knowledge and take away that 80% of the work, that's just a data engineering work that they don't want to do. >> That's awesome stuff. Well congratulations. Carol, a question to you is How does someone get involved in the Data Solutions for Change? An application? Online? Referral? I mean, how do these work? >> All of the above. (John laughs) We do have an online application and we welcome all non-profits to apply if they have a clear objective data problem that they want to solve. We would love to be able to help them. >> Does scope matter, big size, is it more mission? What's the mission criteria? Is there a certain bar to reach, so to speak, or-- >> Yeah, I mean we're most focused on... there really is not size, in terms of size of the non-profit or the breadth. It's much more around, do you have a problem that data and analytics can actually address. >> Yeah. >> So really working on problems that matter. And in addition, we actually announced this week that we are partnering with United Nations on a contest. It's called Sustainable.. It's for Visualize 2030 >> Yeah. >> So there are 17 sustainable development goals. >> Right, righr. >> And so, that's aimed at college students and storytelling to actually address one of these 17 areas. >> We'd love to follow up after the show, talk about some of the projects. since you have a lot of things going on. >> Yeah. >> Use of technology for good really is important right now, that people see that. People want to work for mission-driven organizations. >> Absolutely >> This becomes a clear citeria. Thanks for coming on. Appreciate it. Thanks for coming on today. Acute coverage here at Google Could Next 18 I'm John Furrier with Jeff Fricks. Stay with us. More coverage after this short break. (upbeat music)

Published Date : Jul 26 2018

SUMMARY :

Brought to you by Google Cloud Welcome to The Cube, thanks for joining us. So congratulations, VP of Product Marketing. It was a tremendous amount of work. So you guys are starting to show real progress And we work on driving and developing and you look back five years for that population of people for detection of the disease, Okay, talk about the news. And to be able to do what he's doing and the news you guys have here. And what we've realized is, you know, And companies that know how to take advantage Can you talk about, just a little detail You often also need to be able to apply So we're also proving free customer support, And also, the legacy of brand's are changing And without all the, you know, expertise So that's one of the reasons we And we could do it very, very quickly. and disrupt more of the enterprise side And you don't have to send it to different Some of the relationships with doctors, and take away that 80% of the work, Carol, a question to you is All of the above. It's much more around, do you have a problem And in addition, we actually announced this week and storytelling to actually address one of these 17 areas. since you have a lot of things going on. Use of technology for good really is important right now, Thanks for coming on today.

ENTITIES

Entity	Category	Confidence
Jeff Frick	PERSON	0.99+
Carol Carpenter	PERSON	0.99+
Diane Green	PERSON	0.99+
80%	QUANTITY	0.99+
Ayin Vala	PERSON	0.99+
United Nations	ORGANIZATION	0.99+
Carol	PERSON	0.99+
Google	ORGANIZATION	0.99+
San Francisco	LOCATION	0.99+
Washington D.C.	LOCATION	0.99+
Jeff Fricks	PERSON	0.99+
Precision Medicine	ORGANIZATION	0.99+
John	PERSON	0.99+
five years	QUANTITY	0.99+
John Furrier	PERSON	0.99+
One	QUANTITY	0.99+
three days	QUANTITY	0.99+
Jobs Act	TITLE	0.99+
BigQuery	TITLE	0.99+
G Suite	TITLE	0.99+
2018	DATE	0.99+
17 areas	QUANTITY	0.98+
one	QUANTITY	0.98+
today	DATE	0.98+
Third day	QUANTITY	0.98+
this week	DATE	0.98+
AutoML	TITLE	0.98+
Cloud ML	TITLE	0.98+
up to six months	QUANTITY	0.98+
First	QUANTITY	0.97+
Gmail	TITLE	0.97+
BigQuery ML	TITLE	0.97+
second things	QUANTITY	0.97+
17 sustainable development goals	QUANTITY	0.96+
about a thousand patients	QUANTITY	0.95+
three things	QUANTITY	0.95+
Google Cloud	ORGANIZATION	0.94+
two different problems	QUANTITY	0.94+
Google Earth	TITLE	0.93+
AutoML Vision	TITLE	0.93+
The Cube	ORGANIZATION	0.93+
ML	TITLE	0.93+
Alzheimer	OTHER	0.91+
up to $5,000 a month	QUANTITY	0.91+
day one	QUANTITY	0.87+
couple of years ago	DATE	0.87+
Istio	PERSON	0.87+
first	QUANTITY	0.85+
Vice President	PERSON	0.85+
Google Cloud	TITLE	0.85+
BigQuery ML.	TITLE	0.85+
Next 2018	DATE	0.84+
one thing	QUANTITY	0.83+
Qwiklab	TITLE	0.79+
2030	TITLE	0.78+
Cloud	TITLE	0.76+
#GoogleNext18	EVENT	0.73+
HIPAA	TITLE	0.72+
Data Science Foundation	ORGANIZATION	0.72+
Next 18	TITLE	0.7+
Cube	ORGANIZATION	0.67+
I'm John	TITLE	0.64+
tons	QUANTITY	0.64+
Next	DATE	0.63+
Furrier	PERSON	0.59+
messages	QUANTITY	0.58+

Sudhir Hasbe, Google Cloud | Google Cloud Next 2018

>> Live from San Francisco, it's theCUBE covering Google Cloud Next 2018, brought to you by Google Cloud and its ecosystem partners. (techy music) >> Hey, welcome back, everyone, this is theCUBE Live in San Francisco coverage of Google Cloud Next '18, I'm John Furrier with Jeff Frick. Day three of three days of coverage, kind of getting day three going here. Our next guest, Sudhir, as the director of product management, Google Cloud, has the luxury and great job of managing BigTable, BigQuery, I'm sorry, BigQuery, I guess BigTable, BigQuery. (laughs) Welcome back to the table, good to see you. >> Thank you. >> So, you guys had a great demo yesterday, I want to get your thoughts on that, I want to explore some of the machine learning things that you guys announced, but first I want to get perspective of the show for you guys. What's going on with you guys at the show here, what are some of the big announcements, what's happening? >> A lot of different announcements across the board, so I'm responsible for data analytics on the Google Cloud. One of our key products is Google BigQuery. Large scale, cloud scale data warehouse, a lot of customers using it for bringing all their enterprise data into the data warehouse, analyzing it at scale, you can do petabyte scale queries in seconds, so that's the kind of scale we provide. So, a lot of momentum on that, we announced a lot of things, a lot of enhancements within that. For example, one of the things we announced was we have a new experience, new UI of BigQuery, now you can literally do the query, as I was saying, of petabyte scale or something, any queries that you want, and with one click you can go into Data Studio, which is our DI tool that's available, or you can go in Sheets and then from there quickly go ahead and fire up a connector, connect to BigQuery, get the data in Sheets and do analysis. >> So, ease of use is a focus. >> Ease of use is a major focus for us. As we are growing we want to make sure everybody in the organization can get access to their data, analyze it. That was one, one of the things, which is pretty unique to BigQuery, which is there is a real time collection of information, so you can... There are customers that are actually collecting real time data from click-stream, for example, on their websites or other places, and moving it directly into BigQuery and analyzing it. Example, in-game analytics, if in-game you're actually playing games and you're going to collect those events and do real time analysis, you're going to literally put it into BigQuery at scale and do that. So, a lot of customers using BigQuery at different levels. We also announced Clustering that allows you to reduce the cost, improve efficiency, and make queries almost two X faster for us. So, a lot of announcements other than the machine learning. >> Well, the one thing I saw in the demo I thought was, I mean, it was machine learning, so that's hot topic here, obviously. >> Yes. >> Is you don't have to move the data, and this is something that we've been covering, go back to the Hadoop, back when we first started doing theCUBE, you know, data pipeline, all the complexities involved in moving the data, and at the scale and size of the data all this wrangling was going on just to get some machine learning in. >> Yep. >> So, talk about that new feature where you guys are doing it inside BigQuery. I think that's important, take a minute to explain that. >> Yeah, so when we were talking to our customers one of the biggest challenges they were facing with machine learning in general, or a couple of them were, one, every time you want to do machine learning you are to take data from your core data warehouse, like in BigQuery you have petabytes of scaled data sets, terabytes of data sets. Now, if you want to do machine learning on any portion of it you take it out of BigQuery, move it into some machine learning engine, ML engine, auto-ML, anything, then you realize, "Oh, I missed some of the data that I needed." I go back then again take the data, move it, and you have to go back and forth too much time. There are analysis I think that different organizations have done. 80% of the time the data scientists say they're spending on the moving of data-- >> Right. >> Wrangling data and all of that, so that is one big problem. The second big challenge we were hearing was skillset gap, there are just not that many PhD data scientists in the industry, how do we solve that problem? So, what we said is first problem, how do we solve it, why do people have to move data to the machine learning engines? Why can't I take the machine learning capability, move it inside where the data is, so bring the machine learning closer to data rather than data closer to machine learning. So, that's what BigQuery ML is, it's an ability to run regression-like models inside the data warehouse itself in BigQuery so that you can do that. The second we said the interface can't be complex. Our audiences already know SQL, they're already analyzing data, these folks, business analysts that are using BigQuery are the experts on the data. So, what we said is use your standard SQL, write two lines of code, create model, type of the model you want to run, give us the data, we will just run the machine learning model on the backend and you can do predictions pretty easily. So, that's what we are doing with that. >> That's awesome. >> So, Sudhir, I love to hear that you were driven by that, by your customers, because one of the things we talk about all the time is democratization. >> Yeah. >> If you want innovation you've got to democratize access to the data, and then you got to democratize access to the tools to actually do stuff with the data-- >> Yes. >> That goes way beyond just the hardcore data scientist in the organization-- >> Yeah, exactly. >> And that's really what you're trying to enable the customers to be able to do. >> Absolutely, if you look at it, if you just go on LinkedIn and search for data analyst versus data scientist there is 100 X more analysts in the industry, and our thing was how do we empower these analysts that understand the data, that are familiar with SQL, to go ahead and do data science. Now, we realize they're not going to be expert machine learning folks who understand all the intricacies of how the gradient descent works, all that, that's not their skillset, so our thing was reduce the complexity, make it very simple for them to use. The framework, like just use SQL and we take care of the internal hyper-tuning, the complexity of it, model selection. We try to do that internally within the technology, and they just get a simple interface for that. So, it's really empowering the SQL analyst with an organization to do machine learning with very little to no knowledge of machine learning. >> Right. >> Talk about the history of BigQuery, where did it come from? I mean, Google has this DNA of they do it internally for themselves-- >> Yes. >> Which is a tough customer-- >> Yes. >> In Cloud Spatter we had the product manager on for Cloud Spatter. Dip Dee, she was, like amazing, like okay, baked internally, did that have the same-- >> Yes. >> BigQuery, take a minute to talk about that, because you're now making it consumable for enterprise customers. >> Yeah. >> It's not a just, "Here's BigQuery." >> No. >> Talk about the origination, how it started, why, and how you guys use it internally. >> So, BigQuery internally is called Dremel. There's a paper on Dremel available. I think in 2012 or something we published it. Dremel has been used internally for analytics across Google. So, if you think about Spanner being used for transaction management in the company across all areas, BigQuery, or Dremel internally, is what we use for all large scale data analytics within Google. So, the whole company runs on, analyzes data with it, so our things was how do we take this capability that we are driving, and imagine like, when you have seven products that are more than a billion active users, the amount of data that gets generated, the insights we are giving in Maps and all the different places, a lot of those things are first analyzed in Dremel internally and we're making it available. So, our thing was how do we take that capability that's there internally and make it available to all enterprises. >> Right. >> As Sundhir was saying yesterday, our goal is empower all our customers to go ahead and do more. >> Right. >> And so, this is a way of taking the piece of technology that's powered Google for a while and also make it available to enterprises. >> It's tested, hardened and tested. >> Yeah, absolutely. >> It's not like it's vaporware. >> Yeah, it's not. (laughs) >> No, I mean, this is what I think is important about the show this year. If you look at it, you guys have done a really good job of taking the big guns of Google, the big stuff, and not try to just say, "We're Google and you can be like Google." You've taken it and you've kind of made it consumable. >> Yes. >> This has been a big focus, explain the mindset behind the product management. >> Absolutely, there is actually one of the key things Google is good at doing is taking what's there internally used, but also the research part of it. Actually, Corinna Cortes, who is head of our AI side who does a lot of research in SQL-based machine learning, so again, the-- >> Yeah. >> BigQuery ML is nothing new, like we internally have a research team that has been developing it for a few years. We have been using it internally for running all these models and all, and so what we were able to do it bring product management from our side, like hey, this is really a problem we are facing, moving data, skillset gap, and then we were like, research team was already enabling it and then we had an engineering team which is pretty strong. We were like, okay, let's bring all three triads together and go ahead and make sure we provide a real value to our customers with all of that we're doing, so that's how it came to light. >> So, I just want to get your take, early days like when there was the early Google search appliance, I'll just pick that up, and that was ancient, ancient ago, but one of the digs was, right, it didn't work as well in the enterprise, per se, because you just didn't have the same amount of data when you applied that type of technique to a Google flow of data and a Google flow of queries. So, how's that evolved over time, because you guys, like you said, seven applications with a billion-- >> Yep. >> Users, most enterprises don't have that, so how do they get the same type of performance if they don't have the same kind of throughput to build the models and to get that data, how's that kind of evolved? >> So, this is why I think thinking about, when we think about scale we think about scaling up and scaling down, right? We have customers who are using BigQuery with a few terabytes of data. Not every customer has petabytes scale, but what we're also noticing is these same customers, when they see value in data they collect more. I will give you a real example, Zulily, one of our customers, I used to be there before, so when they started doing real time data collection for doing real time analytics they were collecting like 50 million events a day. Within 18 months they started collecting five billion a day, 100 x improvement, and the reason is they started seeing value. They could take this real time data, analyze it, make some real time experiences possible on their website and all, with all of that they were able to go out and get real valuer for their customers, drive growth, so when customers see that kind of value they collect more data. So, what I would say is yes, a lot of customers start small, but they all have an aspiration to have lots of data, leverage that to create operational efficiency as well as growth, and so as they start doing that I think they will need infrastructure that can scale down and up all the way, and I think that's what we're focusing on, providing that. >> You guys look at the possibility, and I've seen some examples where customers are just, like, they're shell-shocked, and you're almost too good, right? I mean, it's like, "We've been doing "Dremel on a large scale, I bought this "data warehouse like 10 years ago," like what are you talking about? (laughs) I mean, there's a reality of we've been buying IT, enterprises have been buying IT and in comes Google, the gunslinger saying, "Hey, man, you can do all this stuff." There's a little bit of shell-shock factor for some IT people. Some engineering organizations get it right away. How are you guys dealing with this as you make it consumable? >> Yeah. >> There's probably a lot of education. As a product manager do you see, is that something that you think about, is that something you guys talk about? >> Yes, we do, so I think I actually see a difference in how customers, what customers need, enterprise customers versus cloud native companies. As you said, cloud native companies starting new, starting fresh, so it's a very different set of requirement. Enterprise customers, thinking about scale, thinking about security and how do you do that. So, BigQuery is a highly secure data warehouse. The other thing BigQuery has is it's a completely serverless platform, so we take care of the security. We encrypt all the data at rest and when it's moving. The key thing is when we share what is possible and how easy it is to manage and how fast people can start analyzing, you can bring the data. Like you can actually get started with BigQuery in minutes, like you just bring your data in and start analyzing it. You don't have to worry about how many machines do I need, how do I provision it, how many servers do I need. >> Yeah. >> So, enterprises, when they look at-- >> Cloud native ready. >> Yeah. >> All right, so take a minute to explain BigTable versus, I mean, BigTable versus BigQuery. >> Yes. >> What's the difference between the two, one's a data warehouse and the other one is a system for managing data? What's the difference between Big-- >> So, it's a no-SQL system, so I will... The simple example, I will give you a real example how customers use it, right. BigQuery is great for large scale analytics, people who want to take, like, petabyte scale data or terabyte scale data and analyze historical patterns, all of that, and do complex analysis. You want to do machine learning model creation, you can do that. What BigTable is great at is once you have pre-aggregated data you want to go ahead and really fast serving. If you have a website, I don't expect you to run a website and back it with BigQuery, it's not built for that. Whereas BigTable is exactly for that scenario, so for example, you have millions of people coming on the website, they want to see some key metrics that have been pre-created ready to go, you go to BigTable and that can actually do high performance, high throughput. Last statement on that, like almost 10,000-- >> Yeah. >> Requests per second per node and you can just create as many as you want, so you can really create high scale-- >> Auto-scaling, all kinds of stuff there. >> Exactly. >> And that's good for unstructured data as well-- >> Exactly. >> And managing it. >> Absolutely. >> Okay, so structured data, SQL, basically large scale-- >> Yes. >> BigTable for real time-- >> Yes. >> New kinds of datas, different data types. >> Absolutely, yes. >> What else do you have in the bag of goodies in there that you're working on? >> The one big thing that we also announced with this week was a GIS capability within BigQuery. GIS is geographical information, like everything today is location-based, latitude, longitude. Our customers were telling us really difficult to analyze it, right, like I want to know... Example would be we are here, I want to know how many food restaurants are in a two-mile radius of here, which ones are those, how many, should we create the next one here or not. Those kind of analyses are really difficult, so we partnered with Earth Engine, Earth Engine team within Google with Maps, and then what we're launching is ability to do geospatial analysis within BigQuery. Additionally along with that we also have a visualization tool that we launched this week, so folks who haven't seen that should go check that out. One great example I will give you is Geotab, their CEO is here, Neil. He was showing a demo in one of the sessions and he was talking about how he was able to transform his business. I'll give you an example, Geotab is basically into vehicle tracking, so they have these sensors that track different things with vehicles, and then with, and they store everything in BigQuery, collect all of that and all, and his thing was with BigQuery ML and a GIS capability, what he's now able to do is create models that can predict what intersections in a city when it's snowing are going to be dangerous, and for smart cities he can now recommend to cities where and how to invest in these kind of scenarios. Completely transforming his business because his business is not smart cities, his business was vehicle tracking and all, he's like, but with these capabilities they're transforming what they were doing and solving-- >> New discoveries. >> New discoveries, solving new problems, it's amazing. I wonder if you could just dig at a little bit to, you know, the fact that you've got this, these seven billion activities or apps that you can leverage, you know, specific functionality or goals or objectives or priorities in those groups, and now apply those, pull that data, pull that knowledge, pull those use cases into a completely different application on the enterprise. I mean, is that an active process-- >> I don't think that's how people. >> Do people query? >> No, no. >> But how does that happen? >> No, we don't-- >> As a customer. >> As a customer completely different, right? Our focus in Google Cloud is primarily enabling enterprises to collect their data, process their data, innovate on their data. We don't bring in, like, the Google side of it at all, like that's their completely different area that way, so we basically, enterprises, all their data stays within their environment. They basically, we don't touch it, we don't get to access it at all, and they can know it. >> Yeah, yeah, no, I didn't mean that, I meant, you know, like say Maps for instance, it's interesting to see how Maps has evolved over all these years. Every time you open it, oh, and it's directions-- >> Yep. >> Oh, now it's better directions, oh, now it's got gas stations, oh, now it's where the... And it triggered because you said the restaurants that are close by, so it's kind of adding value to the core app on that side, and as you just said, now geolocation can be used on the enterprise side-- >> Yeah, yes. >> And lots of different things, so that-- >> Exactly. >> That's where I meant that kind of connection-- >> Exactly right, so-- >> In terms of the value of what can I do with geolocation. >> Absolutely, exactly, so like, that's exactly what we did. With Earth Engine we had a lot of learnings on geospatial analysis and our thing was how do you make it easy for our enterprise customers to do that. We've partnered with them closely and we said, "Okay, here are the core pieces of things "we can add in BigQuery that will allow you "to do better geospatial analysis, visualize it." One of the big challenges is lat longs, I don't think they're that friendly with analysts, like oh, numbers and all that. So, we actually will turn a UI visualization tool that allows you to just fire a query and see visually on a map where things are, all the points look like and all. >> Awesome. >> So, just simplifying what analysts can do with all these. >> Sudhir, thanks for coming on, really appreciate it and congratulations on your success. Got a lot of great, big products there, hardened internally, now-- >> Yes. >> Making consumable, it's clear here at Google Cloud you guys are recognized that making it consumable-- >> Yep. >> Pre-existing, proven technologies, so I want to give you guys props for that, congratulations. >> Thank you, thanks a lot. >> Thanks for coming on the show. >> Thanks for coming on. >> Thank you. >> It's theCUBE coverage here, Google Cloud coverage, Google Next 2018. I'm John Furrier with Jeff Frick, stay with us, we've got all day with more coverage for day three. Stay with us after this short break. (techy music)

Published Date : Jul 26 2018

SUMMARY :

brought to you by Google Cloud and its ecosystem partners. has the luxury and great job of managing BigTable, What's going on with you guys at the show here, in seconds, so that's the kind of scale we provide. So, a lot of announcements other than the machine learning. Well, the one thing I saw in the demo I thought was, and at the scale and size of the data all this wrangling you guys are doing it inside BigQuery. of them were, one, every time you want to on the backend and you can do predictions pretty easily. So, Sudhir, I love to hear that you were driven by that, enable the customers to be able to do. Absolutely, if you look at it, if you just baked internally, did that have the same-- BigQuery, take a minute to talk about why, and how you guys use it internally. that gets generated, the insights we are giving all our customers to go ahead and do more. and also make it available to enterprises. Yeah, it's not. "We're Google and you can be like Google." the mindset behind the product management. SQL-based machine learning, so again, the-- like hey, this is really a problem we are facing, So, how's that evolved over time, because you guys, I will give you a real example, Zulily, like what are you talking about? As a product manager do you see, is that something that can start analyzing, you can bring the data. All right, so take a minute to explain BigTable so for example, you have millions of people One great example I will give you that you can leverage, you know, specific functionality We don't bring in, like, the Google side of it at all, Every time you open it, oh, and it's directions-- to the core app on that side, and as you just said, on geospatial analysis and our thing was how do you Got a lot of great, big products there, give you guys props for that, congratulations. I'm John Furrier with Jeff Frick, stay with us,

ENTITIES

Entity	Category	Confidence
Paul	PERSON	0.99+
John	PERSON	0.99+
Yusef	PERSON	0.99+
Vodafone	ORGANIZATION	0.99+
Neil	PERSON	0.99+
Verizon	ORGANIZATION	0.99+
Dave	PERSON	0.99+
AWS	ORGANIZATION	0.99+
Webster Bank	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
Deutsche	ORGANIZATION	0.99+
Earth Engine	ORGANIZATION	0.99+
Sudhir	PERSON	0.99+
Europe	LOCATION	0.99+
Jeff Frick	PERSON	0.99+
Microsoft	ORGANIZATION	0.99+
Adolfo Hernandez	PERSON	0.99+
Telco	ORGANIZATION	0.99+
2012	DATE	0.99+
Google	ORGANIZATION	0.99+
Andy Jassy	PERSON	0.99+
Facebook	ORGANIZATION	0.99+
Corinna Cortes	PERSON	0.99+
Dave Brown	PERSON	0.99+
telco	ORGANIZATION	0.99+
24 weeks	QUANTITY	0.99+
John Furrier	PERSON	0.99+
Amazon Web Services	ORGANIZATION	0.99+
100s	QUANTITY	0.99+
Adolfo	PERSON	0.99+
KDDI	ORGANIZATION	0.99+
thousands	QUANTITY	0.99+
London	LOCATION	0.99+
15	QUANTITY	0.99+
Io-Tahoe	ORGANIZATION	0.99+
Yusef Khan	PERSON	0.99+
80%	QUANTITY	0.99+
90%	QUANTITY	0.99+
Sudhir Hasbe	PERSON	0.99+
two	QUANTITY	0.99+
SK Telecom	ORGANIZATION	0.99+
two lines	QUANTITY	0.99+
hundreds	QUANTITY	0.99+
BigQuery	TITLE	0.99+
IBM	ORGANIZATION	0.99+
four weeks	QUANTITY	0.99+
10s	QUANTITY	0.99+
Brazil	LOCATION	0.99+
three	QUANTITY	0.99+
SQL	TITLE	0.99+
San Francisco	LOCATION	0.99+
LinkedIn	ORGANIZATION	0.99+
Global Telco Business Unit	ORGANIZATION	0.99+

Recommend Videos

Sentiment Analysis

AWS Comprehend

Search Results for BigQuery ML: