Phil Bullinger, Western Digital | CUBE Conversation, August 2020

>> Announcer: From theCUBE Studios in Palo Alto and Boston, connecting with thought leaders all around the world, this is a Cube conversation. >> Hey welcome back everybody, Jeff Frick here with theCUBE. We are in our Palo Alto studios, COVID is still going on, so all of the interviews continue to be remote, but we're excited to have a Cube alumni, he hasn't been on for a long time, and this guy has been in the weeds of the storage industry for a very very long time and we're happy to have him on and get an update because there continues to be a lot of exciting developments. He's Phil Bullinger, he is the SVP and general manager, data center business unit from Western Digital joining us, I think for Colorado, so Phil, great to see you, how's the weather in Colorado today? >> Hi Jeff, it's great to be here. Well, it's a hot, dry summer here, I'm sure like a lot of places. But yeah, enjoying the summer through these unusual times. >> It is unusual times, but fortunately there's great things like the internet and heavy duty compute and store out there so we can get together this way. So let's jump into it. You've been in the he business a long time, you've been at Western Digital, you were at EMC, you worked on Isilon, and you were at storage companies before that. And you've seen kind of this never-ending up and to the right slope that we see kind of ad nauseum in terms of the amount of storage demands. It's not going anywhere but up, and please increase complexity in terms of unstructure data, sources of data, speed of data, you know the kind of classic big V's of big data. So I wonder, before we jump into specifics, if you can kind of share your perspective 'cause you've been kind of sitting in the Catford seat, and Western Digital's a really unique company; you not only have solutions, but you also have media that feeds other people solutions. So you guys are really seeing and ultimately all this compute's got to put this data somewhere, and a whole lot of it's sitting on Western Digital. >> Yeah, it's a great intro there. Yeah, it's been interesting, through my career, I've seen a lot of advances in storage technology. Speeds and feeds like we often say, but the advancement through mechanical innovation, electrical innovation, chemistry, physics, just the relentless growth of data has been driven in many ways by the relentless acceleration and innovation of our ability to store that data, and that's been a very virtuous cycle through what, for me, has been 30 years in enterprise storage. There are some really interesting changes going on though I think. If you think about it, in a relatively short amount of time, data has gone from this artifact of our digital lives to the very engine that's driving the global economy. Our jobs, our relationships, our health, our security, they all kind of depend on data now, and for most companies, kind of irrespective of size, how you use data, how you store it, how you monetize it, how you use it to make better decisions to improve products and services, it becomes not just a matter of whether your company's going to thrive or not, but in many industries, it's almost an existential question; is your company going to be around in the future, and it depends on how well you're using data. So this drive to capitalize on the value of data is pretty significant. >> It's a really interesting topic, we've had a number of conversations around trying to get a book value of data, if you will, and I think there's a lot of conversations, whether it's accounting kind of way, or finance, or kind of good will of how do you value this data? But I think we see it intrinsically in a lot of the big companies that are really data based, like the Facebooks and the Amazons and the Netflixes and the Googles, and those types of companies where it's really easy to see, and if you see the valuation that they have, compared to their book value of assets, it's really baked into there. So it's fundamental to going forward, and then we have this thing called COVID hit, which I'm sure you've seen all the memes on social media. What drove your digital transformation, the CEO, the CMO, the board, or COVID-19? And it became this light switch moment where your opportunities to think about it are no more; you've got to jump in with both feet, and it's really interesting to your point that it's the ability to store this and think about it now differently as an asset driving business value versus a cost that IT has to accommodate to put this stuff somewhere, so it's a really different kind of a mind shift and really changes the investment equation for companies like Western Digital about how people should invest in higher performance and higher capacity and more unified and kind of democratizing the accessibility that data, to a much greater set of people with tools that can now start making much more business line and in-line decisions than just the data scientist kind of on Mahogany Row. >> Yeah, as you mentioned, Jeff, here at Western Digital, we have such a unique kind of perch in the industry to see all the dynamics in the OEM space and the hyperscale space and the channel, really across all the global economies about this growth of data. I have worked at several companies and have been familiar with what I would have called big data projects and fleets in the past. But at Western Digital, you have to move the decimal point quite a few digits to the right to get the perspective that we have on just the volume of data that the world has just relentless insatiably consuming. Just a couple examples, for our drive projects we're working on now, our capacity enterprise drive projects, you know, we used to do business case analysis and look at their lifecycle capacities and we measured them in exabytes, and not anymore, now we're talking about zettabyte, we're actually measuring capacity enterprise drive families in terms of how many zettabyte they're going to ship in their lifecycle. If we look at just the consumption of this data, the last 12 months of industry TAM for capacity enterprise compared to the 12 months prior to that, that annual growth rate was north of 60%. And so it's rare to see industries that are growing at that pace. And so the world is just consuming immense amounts of data, and as you mentioned, the COVID dynamics have been both an accelerant in some areas, as well as headwinds in others, but it's certainly accelerated digital transformation. I think a lot of companies we're talking about, digital transformation and hybrid models and COVID has really accelerated that, and it's certainly driving, continues to drive just this relentless need to store and access and take advantage of data. >> Yeah, well Phil, in advance of this interview, I pulled up the old chart with all the different bytes, kilobytes, megabytes, gigabytes, terabytes, petabytes, exabytes, and zettabytes, and just per the Wikipedia page, what is a zettabyte? It's as much information as there are grains of sand in all the world's beaches. For one zettabyte. You're talking about thinking in terms of those units, I mean, that is just mind boggling to think that that is the scale in which we're operating. >> It's really hard to get your head wrapped around a zettabyte of storage, and I think a lot of the industry thinks when we say zettabyte scale era, that it's just a buzz word, but I'm here to say it's a real thing. We're measuring projects in terms of zettabytes now. >> That's amazing. Well, let's jump into some of the technology. So I've been fortunate enough here at theCUBE to be there at a couple of major announcements along the way. We talked before we turned the cameras on, the helium announcement and having the hard drive sit in the fish bowl to get all types of interesting benefits from this less dense air that is helium versus oxygen. I was down at the Mammer and Hammer announcement, which was pretty interesting; big heavy technology moves there, to again, increase the capacity of the hard drive's base systems. You guys are doing a lot of stuff on RISC-V I know is an Open source project, so you guys have a lot of things happening, but now there's this new thing, this new thing called zonedd storage. So first off, before we get into it, why do we need zoned storage, and really what does it now bring to the table in terms of a capability? >> Yeah, great question, Jeff. So why now, right? Because I mentioned storage, I've been in storage for quite some time. In the last, let's just say in the last decade, we've seen the advent of the hyperscale model and certainly a whole nother explosion level of data and just the veracity with which they hyperscalers can create and consume and process and monetize data. And of course with that, has also come a lot of innovation, frankly, in the compute space around how to process that data and moving from what was just a general purpose CPU model to GPU's and DPU's and so we've seen a lot of innovation on that side, but frankly, in the storage side, we haven't seen much change at all in terms of how operating systems, applications, file systems, how they actually use the storage or communicate with the storage. And sure, we've seen advances in storage capacities; hard drives have gone from two to four, to eight, to 10 to 14, 16, and now our leading 18 and 20 terabyte hard drives. And similarly, on the SSD side, now we're dealing with the capacities of seven, and 15, and 30 terabytes. So things have gotten larger, as you expect. And some interfaces have improved, I think NVME, which we'll talk about, has been a nice advance in the industry; it's really now brought a very modern scalable low latency multi-threaded interface to a NAM flash to take advantage of the inherent performance of transistor based persistent storage. But really when you think about it, it hasn't changed a lot. But what has changed is workloads. One thing that definitely has evolved in the space of the last decade or so is this, the thing that's driving a lot of this explosion of data in the industry is around workloads that I would characterize as sequential in nature, they're serial, you can capture it in written. They also have a very consistent life cycle, so you would write them in a big chunk, you would read them maybe in smaller pieces, but the lifecycle of that data, we can treat more as a chunk of data, but the problem is applications, operating systems, vial systems continue to interface with storage using paradigms that are many decades old. The old 512 byte or even Forte, Sector size constructs were developed in the hard drive industry just as convenient paradigms to structure what is an unstructured sea of magnetic grains into something structured that can be used to store and access data. But the reality is when we talk about SSDs, structure really matters, and so what has changed in the industry is the workloads are driving very very fresh looks at how more intelligence can be applied to that application OS storage device interface to drive much greater efficiency. >> Right, so there's two things going on here that I want to drill down on. On one hand, you talked about kind of the introduction of NAND and flash, and treating it like you did, generically you did a regular hard drive. But you could get away and you could do some things because the interface wasn't taking full advantage of the speed that was capable in the NAND. But NVME has changed that, and now forced kind of getting rid of some of those inefficient processes that you could live with, so it's just kind of classic next level step up and capabilities. One is you get the better media, you just kind of plug it into the old way. Now actually you're starting to put in processes that take full advantage of the speed that that flash has. And I think obviously prices have come down dramatically since the first introduction, where before it was always kind of a clustered off or super high end, super low latency, super high value apps, it just continues to spread and proliferate throughout the data center. So what did NVME force you to think about in terms of maximizing the return on the NAND and flash? >> Yeah, NVME, which we've been involved in the standardization, I think it's been a very successful effort, but we have to remember NVME is about a decade old, or even more when the original work started around defining this interface, but it's been very successful. The NVME standard's body is very productive cross company effort, it's really driven a significant change, and what we see now is the rapid adoption of NVME in all of data center architectures, whether it's very large hyperscale to classic on prem enterprise to even smaller applications, it's just a very efficient interface mechanism for connecting SSDs into a server. So we continue to see evolution at NVME, which is great, and we'll talk about ZNS today as one of those evolutions. We're also very keenly interested in NVME protocol over fabrics, and so one of the things that Western Digital has been talking about a lot lately is incorporating NVME over fabrics as a mechanism for now connecting shared storage into multiple post architectures. We think this is a very attractive way to build shared storage architectures of the future that are scalable, that are composable, that really have a lot more agility with respect to rack level infrastructure and applying that infrastructure to applications. >> Right, now one thing that might strike some people as kind of counterintuitive is within the zoned storage in zoning off parts of the media, to think of the data also kind of in these big chunks, is it feels contrary to kind of atomization that we're seeing in the rest of the data center, right? So smaller units of compute, smaller units of store, so that you can assemble and disassemble them in different quantities as needed. So what was the special attribute that you had to think about and actually come back and provide a benefit in actually kind of re-chunking, if you will, in these zones versus trying to get as atomic as possible? >> Yeah, it's a great question, Jeff, and I think it's maybe not intuitive in terms of why zoned storage actually creates a more efficient storage paradigm when you're storing stuff essentially in larger blocks of data, but this is really where the intersection of structure and workload and sort of the nature of the data all come together. If you turn back the clock maybe four or five years when SMR hard drives host managers SMR hard drives first emerged on the scene. This was really taking advantage of the fact that the right head on a hard disk drive is larger than the read head, or the read head can be much smaller, and so the notion of overlapping or shingling the data on the drive, giving the read head a smaller target to read, but the writer a larger write pad to write the data could actually, what we found was it increases aerial density significantly. And so that was really the emergence of this notion of sequentially written larger blocks of data being actually much more efficiently stored when you think about physically how it's being stored. What's very new now and really gaining a lot of traction is the SSD corollary to SMR on the hard drive, on the SSD side, we had the ZNS specification, which is, very similarly where you'd divide up the name space of an SSD into fixed size zones, and those zones are written sequentially, but now those zones are intimately tied to the underlying physical architecture of the NAND itself; the dyes, the planes, the read pages, the erase pages. So that, in treating data as a block, you're actually eliminating a lot of the complexity and the work that an SSD has to do to emulate a legacy hard drive, and in doing so, you're increasing performance and endurance and the predictable performance of the device. >> I just love the way that you kind of twist the lens on the problem, and on one hand, by rule, just looking at my notes here, the zoned storage device is the ZSD's introduce a number of restrictions and limitations and rules that are outside the full capabilities of what you might do. But in doing so, an aggregate, the efficiency, and the performance of the system in the whole is much much better, even though when you first look at it, you think it's more of a limiter, but it's actually opens up. I wonder if there's any kind of performance stats you can share or any kind of empirical data just to give people kind of a feel for what that comes out as. >> So if you think about the potential of zoned storage in general and again, when I talk about zoned storage, there's two components; there's an HDD component of zoned storage that we refer to as SMR, and there's an SSD version of that that we call ZNS. So we think about SMR, the value proposition there is additional capacity. So effectively in the same drive architecture, with roughly the same bill of material used to build the drive, we can overlap or shingle the data on the drive. And generally for the customer, additional capacity. Today with our 18, 20 terabyte offerings that's on the order of just over 10%, but that delta is going to increase significantly going forward to 20% or more. And when you think about a hyperscale customer that has not hundreds or thousands of racks, but tens of thousands of racks. A 10 or 20% improvement in effective capacity is a tremendous TCO benefit, and the reason we do that is obvious. I mean, the economic paradigm that drives large at-scale data centers is total custom ownership, both acquisition costs and operating costs. And if you can put more storage in a square tile of data center space, you're going to generally use less power, you're going to run it more efficiently, you're actually, from an acquisition cost, you're getting a more efficient purchase of that capacity. And in doing that, our innovation, we benefit from it and our customers benefit from it. So the value proposition for zoned storage in capacity enterprise HDV is very clear, it's additional capacity. The exciting thing is, in the SSD side of things, or ZNS, it actually opens up even more value proposition for the customer. Because SSDs have had to emulate hard drives, there's been a lot of inefficiency and complexity inside an enterprise SSD dealing with things like garbage collection and right amplification reducing the endurance of the device. You have to over-provision, you have to insert as much as 20, 25, even 28% additional man bits inside the device just to allow for that extra space, that working space to deal with delete of data that are smaller than the block erase that the device supports. So you have to do a lot of reading and writing of data and cleaning up. It creates for a very complex environment. ZNS by mapping the zoned size with the physical structure of the SSD essentially eliminates garbage collection, it reduces over-provisioning by as much as 10x. And so if you were over provisioning by 20 or 25% on an enterprise SSD, and a ZNS SSD, that can be one or two percent. The other thing I have to keep in mind is enterprise SSD is typically incorporate D RAM and that D RAM is used to help manage all those dynamics that I just mentioned, but with a much simpler structure where the pointers to the data can be managed without all the D RAM. We can actually reduce the amount of D RAM in an enterprise SSD by as much as eight X. And if you think about the MILA material of an enterprise SSD, D RAM is number two on the list in terms of the most expensive bomb components. So ZNS and SSDs actually have a significant customer total cost of ownership impact. It's an exciting standard, and now that we have the standard ratified through the NVME working group, it can really accelerate the development of the software ecosystem around. >> Right, so let's shift gears and talk a little bit about less about the tech and more about the customers and the implementation of this. So you talked kind of generally, but are there certain types of workloads that you're seeing in the marketplace where this is a better fit or is it just really the big heavy lifts where they just need more and this is better? And then secondly, within these hyperscale companies, as well as just regular enterprises that are also seeing their data demands grow dramatically, are you seeing that this is a solution that they want to bring in for kind of the marginal kind of next data center, extension of their data center, or their next cloud region? Or are they doing lift and shift and ripping stuff out? Or do they enough data growth organically that there's plenty of new stuff that they can put in these new systems? >> Yeah, I love that. The large customers don't rip and shift; they ride their assets for a long lifecycle, 'cause with the relentless growth of data, you're primarily investing to handle what's coming in over the transom. But we're seeing solid adoption. And in SMRS you know we've been working on that for a number of years. We've got significant interest and investment, co-investment, our engineering, and our customer's engineering adapting the application environment's to take advantage of SMR. The great thing is now that we've got the NVME, the ZNS standard gratified now in the NVME working group, we've got a very similar, and all approved now, situation where we've got SMR standards that have been approved for some time, and the SATA and SCSI standards. Now we've got the same thing in the NVME standard, and the great thing is once a company goes through the lift, so to speak, to adapt an application, file system, operating system, ecosystem, to zoned storage, it pretty much works seamlessly between HDD and SSD, and so it's not an incremental investment when you're switching technologies. Obviously the early adopters of these technologies are going to be the large companies who design their own infrastructure, who have mega fleets of racks of infrastructure where these efficiencies really really make a difference in terms of how they can monetize that data, how they compete against the landscape of competitors they have. For companies that are totally reliant on kind of off the shelf standard applications, that adoption curve is going to be longer, of course, because there are some software changes that you need to adapt to enable zoned storage. One of the things Western Digital has done and taken the lead on is creating a landing page for the industry with zoned storage.io. It's a webpage that's actually an area where many companies can contribute Open source tools, code, validation environments, technical documentation. It's not a marketeering website, it's really a website built to land actual Open source content that companies can use and leverage and contribute to to accelerate the engineering work to adapt software stacks to zoned storage devices, and to share those things. >> Let me just follow up on that 'cause, again, you've been around for a while, and get your perspective on the power of Open source. And it used to be the best secrets, the best IP were closely guarded and held inside, and now really we're in an age where it's not necessarily. And the brilliant minds and use cases and people out there, just by definition, it's more groups of engineers, more engineers outside your building than inside your building, and how that's really changed kind of a strategy in terms of development when you can leverage Open source. >> Yeah, Open source clearly has accelerated innovation across the industry in so many ways, and it's the paradigm around which companies have built business models and innovated on top of it, I think it's always important as a company to understand what value ad you're bringing, and what value ad the customers want to pay for. What unmet needs in your customers are you trying to solve for, and what's the best mechanism to do that? And do you want to spend your RND recreating things, or leveraging what's available and innovating on top of it? It's all about ecosystem. I mean, the days where a single company could vertically integrate top to bottom a complete end solution, you know, those are fewer and far between. I think it's about collaboration and building ecosystems and operating within those. >> Yeah, it's such an interesting change, and one more thing, again, to get your perspective, you run the data center group, but there's this little thing happening out there that we see growing, IOT, in the industrial internet of things, and edge computing as we try to move more compute and store and power kind of outside the pristine world of the data center and out towards where this data is being collected and processed when you've got latency issues and all kinds of reasons to start to shift the balance of where the compute is and where the store and relies on the network. So when you look back from the storage perspective in your history in this industry and you start to see basically everything is now going to be connected, generating data, and a lot of it is even Opensource. I talked to somebody the other day doing kind of Opensource computer vision on surveillance video. So the amount of stuff coming off of these machines is growing in crazy ways. At the same time, it can't all be processed at the data center, it can't all be kind of shipped back and then have a decision and then ship that information back out to. So when you sit back and look at Edge from your kind of historical perspective, what goes through your mind, what gets you excited, what are some opportunities that you see that maybe the laymen is not paying close enough attention to? >> Yeah, it's really an exciting time in storage. I get asked that question from time to time, having been in storage for more than 30 years, you know, what was the most interesting time? And there's been a lot of them, but I wouldn't trade today's environment for any other in terms of just the velocity with which data is evolving and how it's being used and where it's being used. A TCO equation may describe what a data center looks like, but data locality will determine where it's located, and we're excited about the Edge opportunity. We see that as a pretty significant, meaningful part of the TAM as we look three to five years. Certainly 5G is driving much of that, I think just any time you speed up the speed of the connected fabric, you're going to increase storage and increase the processing the data. So the Edge opportunity is very interesting to us. We think a lot of it is driven by low latency work loads, so the concept of NVME is very appropriate for that, we think, in general SSDs deployed and Edge data centers defined as anywhere from a meter to a few kilometers from the source of the data. We think that's going to be a very strong paradigm. The workloads you mentioned, especially IOT, just machine-generated data in general, now I believe, has eclipsed human generated data, in terms of just the amount of data stored, and so we think that curve is just going to keep going in terms of machine generated data. Much of that data is so well suited for zoned storage because it's sequential, it's sequentially written, it's captured, and it has a very consistent and homogenous lifecycle associated with it. So we think what's going on with zoned storage in general and ZNS and SMR specifically are well suited for where a lot of the data growth is happening. And certainly we're going to see a lot of that at the Edge. >> Well, Phil, it's always great to talk to somebody who's been in the same industry for 30 years and is excited about today and the future. And as excited as they have been throughout their whole careers. So that really bodes well for you, bodes well for Western Digital, and we'll just keep hoping the smart people that you guys have over there, keep working on the software and the physics, and the mechanical engineering and keep moving this stuff along. It's really just amazing and just relentless. >> Yeah, it is relentless. What's exciting to me in particular, Jeff, is we've driven storage advancements largely through, as I said, a number of engineering disciplines, and those are still going to be important going forward, the chemistry, the physics, the electrical, the hardware capabilities. But I think as widely recognized in the industry, it's a diminishing curve. I mean, the amount of energy, the amount of engineering effort, investment, that cost and complexity of these products to get to that next capacity step is getting more difficult, not less. And so things like zoned storage, where we now bring intelligent data placement to this paradigm, is what I think makes this current juncture that we're at very exciting. >> Right, right, well, it's applied AI, right? Ultimately you're going to have more and more compute power driving the storage process and how that stuff is managed. As more cycles become available and they're cheaper, and ultimately compute gets cheaper and cheaper, as you said, you guys just keep finding new ways to move the curve in. And we didn't even get into the totally new material science, which is also coming down the pike at some point in time. >> Yeah, very exciting times. >> It's been great to catch up with you, I really enjoy the Western Digital story; I've been fortunate to sit in on a couple chapters, so again, congrats to you and we'll continue to watch and look forward to our next update. Hopefully it won't be another four years. >> Okay, thanks Jeff, I really appreciate the time. >> All right, thanks a lot. All right, he's Phil, I'm Jeff, you're watching theCUBE. Thanks for watching, we'll see you next time.

Published Date : Aug 25 2020

SUMMARY :

all around the world, this so all of the interviews Hi Jeff, it's great to be here. in terms of the amount of storage demands. be around in the future, that it's the ability to store this and the channel, really across and just per the Wikipedia and I think a lot of the and having the hard drive of data and just the veracity with which kind of the introduction and so one of the things of the data center, right? and so the notion of I just love the way that you kind of and the reason we do that is obvious. and the implementation of this. and the great thing is And the brilliant minds and use cases and it's the paradigm around which and all kinds of reasons to start to shift and increase the processing the data. and the mechanical engineering I mean, the amount of energy, driving the storage process I really enjoy the Western Digital story; really appreciate the time. we'll see you next time.

ENTITIES

Entity	Category	Confidence
Jeff	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Phil Bullinger	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Colorado	LOCATION	0.99+
one	QUANTITY	0.99+
20	QUANTITY	0.99+
Phil	PERSON	0.99+
August 2020	DATE	0.99+
Amazons	ORGANIZATION	0.99+
30 years	QUANTITY	0.99+
both feet	QUANTITY	0.99+
Netflixes	ORGANIZATION	0.99+
18	QUANTITY	0.99+
three	QUANTITY	0.99+
two percent	QUANTITY	0.99+
Boston	LOCATION	0.99+
two	QUANTITY	0.99+
Palo Alto	LOCATION	0.99+
Facebooks	ORGANIZATION	0.99+
15	QUANTITY	0.99+
25%	QUANTITY	0.99+
20%	QUANTITY	0.99+
four	QUANTITY	0.99+
10	QUANTITY	0.99+
Googles	ORGANIZATION	0.99+
hundreds	QUANTITY	0.99+
28%	QUANTITY	0.99+
Today	DATE	0.99+
eight	QUANTITY	0.99+
theCUBE	ORGANIZATION	0.99+
14	QUANTITY	0.99+
five years	QUANTITY	0.99+
30 terabytes	QUANTITY	0.99+
EMC	ORGANIZATION	0.99+
COVID-19	OTHER	0.99+
10x	QUANTITY	0.99+
more than 30 years	QUANTITY	0.99+
seven	QUANTITY	0.99+
Cube	ORGANIZATION	0.99+
two components	QUANTITY	0.99+
Opensource	ORGANIZATION	0.99+
two things	QUANTITY	0.99+
four years	QUANTITY	0.99+
Isilon	ORGANIZATION	0.99+
first	QUANTITY	0.98+
25	QUANTITY	0.98+
20 terabyte	QUANTITY	0.98+
both	QUANTITY	0.98+
18, 20 terabyte	QUANTITY	0.98+
16	QUANTITY	0.98+
over 10%	QUANTITY	0.98+
COVID	OTHER	0.97+
tens of thousands of racks	QUANTITY	0.97+
first introduction	QUANTITY	0.97+
today	DATE	0.96+
TAM	ORGANIZATION	0.96+
theCUBE Studios	ORGANIZATION	0.96+
NVME	ORGANIZATION	0.95+
last decade	DATE	0.94+
Edge	ORGANIZATION	0.94+
Mammer and Hammer	ORGANIZATION	0.93+
One	QUANTITY	0.92+
COVID	ORGANIZATION	0.92+

Janet George, Western Digital | WiDS 2019

>> Live from Stanford University. It's the Cube covering global Women in Data Science conference brought to you by Silicon Angle media. >> Welcome back to the key. We air live at Stanford University for the fourth annual Women in Data Science Conference. The Cube has had the pleasure of being here all four years on I'm welcoming Back to the Cube, one of our distinguished alumni Janet George, the fellow chief data officer, scientists, big data and cognitive computing at Western Digital. Janet, it's great to see you. Thank you. Thank you so much. So I mentioned yes. Fourth, Annie will women in data science. And it's been, I think I met you here a couple of years ago, and we look at the impact. It had a chance to speak with Margo Garrett's in a about an hour ago, one of the co founders of Woods saying, We're expecting twenty thousand people to be engaging today with the Livestream. There are wigs events in one hundred and fifty locations this year, fifty plus countries expecting about one hundred thousand people to engage the attention. The focus that they have on data science and the opportunities that it has is really palpable. Tell us a little bit about Western Digital's continued sponsorship and what makes this important to you? >> So Western distal has recently transformed itself as a company, and we are a data driven company, so we are very much data infrastructure company, and I think that this momentum off A is phenomenal. It's just it's a foundational shift in the way we do business, and this foundational shift is just gaining tremendous momentum. Businesses are realizing that they're going to be in two categories the have and have not. And in order to be in the half category, you have started to embrace a You've got to start to embrace data. You've got to start to embrace scale and you've got to be in the transformation process. You have to transform yourself to put yourself in a competitive position. And that's why Vest Initial is here, where the leaders in storage worldwide and we'd like to be at the heart of their data is. >> So how has Western Digital transform? Because if we look at the evolution of a I and I know you're give you're on a panel tan, you're also giving a breakout on deep learning. But some of the importance it's not just the technical expertise. There's other really important skills. Communication, collaboration, empathy. How has Western digital transformed to really, I guess, maybe transform the human capital to be able to really become broad enough to be ableto tow harness. Aye, aye, for good. >> So we're not just a company that focuses on business for a We're doing a number of initiatives One of the initiatives were doing is a I for good, and we're doing data for good. This is related to working with the U. N. We've been focusing on trying to figure out how climate change the data that impacts climate change, collecting data and providing infrastructure to store massive amounts of species data in the environment that we've never actually collected before. So climate change is a huge area for us. Education is a huge area for us. Diversity is a huge area for us. We're using all of these areas as launching pad for data for good and trying to use data to better mankind and use a eye to better mankind. >> One of the things that is going on at this year's with second annual data fun. And when you talk about data for good, I think this year's Predictive Analytics Challenge was to look at satellite imagery to train the model to evaluate which images air likely tohave oil palm plantations. And we know that there's a tremendous social impact that palm oil and oil palm plantations in that can can impact, such as I think in Borneo and eighty percent reduction in the Oregon ten population. So it's interesting that they're also taking this opportunity to look at data for good. And how can they look at predictive Analytics to understand how to reduce deforestation like you talked about climate and the impact in the potential that a I and data for good have is astronomical? >> That's right. We could not build predictive models. We didn't have the data to put predictive boats predictive models. Now we have the data to put put out massively predictive models that can help us understand what change would look like twenty five years from now and then take corrective action. So we know carbon emissions are causing very significant damage to our environment. And there's something we can do about it. Data is helping us do that. We have the infrastructure, economies of scale. We can build massive platforms that can store this data, and then we can. Alan, it's the state at scale. We have enough technology now to adapt to our ecosystem, to look at disappearing grillers, you know, to look at disappearing insects, to look at just equal system that be living, how, how the ecosystem is going to survive and be better in the next ten years. There's a >> tremendous amount of power that data for good has, when often times whether the Cube is that technology conferences or events like this. The word trust issues yes, a lot in some pretty significant ways. And we often hear that data is not just the life blood of an organization, whether it's in just industry or academia. To have that trust is essential without it. That's right. No, go. >> That's right. So the data we have to be able to be discriminated. That's where the trust comes into factor, right? Because you can create a very good eh? I'm odder, or you can create a bad air more so a lot depends on who is creating the modern. The authorship of the model the creator of the modern is pretty significant to what the model actually does. Now we're getting a lot of this new area ofthe eyes coming in, which is the adversarial neural networks. And these areas are really just springing up because it can be creators to stop and block bad that's being done in the world next. So, for example, if you have malicious attacks on your website or hear militias, data collection on that data is being used against you. These adversarial networks and had built the trust in the data and in the so that is a whole new effort that has started in the latest world, which is >> critical because you mentioned everybody. I think, regardless of what generation you're in that's on. The planet today is aware of cybersecurity issues, whether it's H vac systems with DDOS attacks or it's ah baby boomer, who was part of the fifty million Facebook users whose data was used without their knowledge. It's becoming, I won't say accepted, but very much commonplace, Yes, so training the A I to be used for good is one thing. But I'm curious in terms of the potential that individuals have. What are your thoughts on some of these practices or concepts that we're hearing about data scientists taking something like a Hippocratic oath to start owning accountability for the data that they're working with. I'm just curious. What's >> more, I have a strong opinion on this because I think that data scientists are hugely responsible for what they are creating. We need a diversity of data scientists to have multiple models that are completely divorce, and we have to be very responsible when we start to create. Creators are by default, have to be responsible for their creation. Now where we get into tricky areas off, then you are the human auto or the creator ofthe Anay I model. And now the marshal has self created because it a self learned who owns the patent, who owns the copyright to those when I becomes the creator and whether it's malicious or non malicious right. And that's also ownership for the data scientist. So the group of people that are responsible for creating the environment, creating the morals the question comes into how do we protect the authors, the uses, the producers and the new creators off the original piece of art? Because at the end of the day, when you think about algorithms and I, it's just art its creation and you can use the creation for good or bad. And as the creation recreates itself like a learning on its own with massive amounts of data after an original data scientist has created the model well, how we how to be a confident. So that's a very interesting area that we haven't even touched upon because now the laws have to change. Policies have to change, but we can't stop innovation. Innovation has to go, and at the same time we have to be responsible about what we innovate >> and where do you think we are? Is a society in terms of catching As you mentioned, we can't. We have to continue innovation. Where are we A society and society and starting to understand the different principles of practices that have to be implemented in order for proper management of data, too. Enable innovation to continue at the pace that it needs. >> June. I would say that UK and other countries that kind of better than us, US is still catching up. But we're having great conversations. This is very important, right? We're debating the issues. We're coming together as a community. We're having so many discussions with experts. I'm sitting in so many panels contributing as an Aye aye expert in what we're creating. What? We see its scale when we deploy an aye aye, modern in production. What have we seen as the longevity of that? A marker in a business setting in a non business setting. How does the I perform and were now able to see sustained performance of the model? So let's say you deploy and am are in production. You're able inform yourself watching the sustained performance of that a model and how it is behaving, how it is learning how it's growing, what is its track record. And this knowledge is to come back and be part of discussions and part of being informed so we can change the regulations and be prepared for where this is going. Otherwise will be surprised. And I think that we have started a lot of discussions. The community's air coming together. The experts are coming together. So this is very good news. >> Theologian is's there? The moment of Edward is building. These conversations are happening. >> Yes, and policy makers are actively participating. This is very good for us because we don't want innovators to innovate without the participation of policymakers. We want the policymakers hand in hand with the innovators to lead the charter. So we have the checks and balances in place, and we feel safe because safety is so important. We need psychological safety for anything we do even to have a conversation. We need psychological safety. So imagine having a >> I >> systems run our lives without having that psychological safety. That's bad news for all of us, right? And so we really need to focus on the trust. And we need to focus on our ability to trust the data or a right to help us trust the data or surface the issues that are causing the trust. >> Janet, what a pleasure to have you back on the Cube. I wish we had more time to keep talking, but it's I can't wait till we talk to you next year because what you guys are doing and also your pact, true passion for data science for trust and a I for good is palpable. So thank you so much for carving out some time to stop by the program. Thank you. It's my pleasure. We want to thank you for watching the Cuba and Lisa Martin live at Stanford for the fourth annual Women in Data Science conference. We back after a short break.

Published Date : Mar 4 2019

SUMMARY :

global Women in Data Science conference brought to you by Silicon Angle media. We air live at Stanford University for the fourth annual Women And in order to be in the half category, you have started to embrace a You've got to start Because if we look at the evolution of a initiatives One of the initiatives were doing is a I for good, and we're doing data for good. So it's interesting that they're also taking this opportunity to We didn't have the data to put predictive And we often hear that data is not just the life blood of an organization, So the data we have to be able to be discriminated. But I'm curious in terms of the creating the morals the question comes into how do we protect the We have to continue innovation. And this knowledge is to come back and be part of discussions and part of being informed so we The moment of Edward is building. We need psychological safety for anything we do even to have a conversation. And so we really need to focus on the trust. I can't wait till we talk to you next year because what you guys are doing and also your pact,

ENTITIES

Entity	Category	Confidence
Janet George	PERSON	0.99+
Janet	PERSON	0.99+
Alan	PERSON	0.99+
Borneo	LOCATION	0.99+
next year	DATE	0.99+
fifty million	QUANTITY	0.99+
Western Digital	ORGANIZATION	0.99+
Lisa Martin	PERSON	0.99+
Oregon	LOCATION	0.99+
twenty thousand people	QUANTITY	0.99+
June	DATE	0.99+
Silicon Angle	ORGANIZATION	0.99+
eighty percent	QUANTITY	0.99+
two categories	QUANTITY	0.99+
Annie	PERSON	0.99+
Stanford University	ORGANIZATION	0.99+
Western distal	ORGANIZATION	0.99+
fifty plus countries	QUANTITY	0.98+
Vest Initial	ORGANIZATION	0.98+
one	QUANTITY	0.98+
this year	DATE	0.98+
One	QUANTITY	0.97+
Women in Data Science	EVENT	0.97+
second annual	QUANTITY	0.96+
Facebook	ORGANIZATION	0.96+
today	DATE	0.96+
Cube	ORGANIZATION	0.95+
Stanford	LOCATION	0.95+
Western digital	ORGANIZATION	0.94+
Women in Data Science Conference	EVENT	0.93+
about one hundred thousand people	QUANTITY	0.92+
one hundred and fifty locations	QUANTITY	0.92+
Fourth	QUANTITY	0.91+
Edward	PERSON	0.9+
US	ORGANIZATION	0.89+
Women in Data Science conference	EVENT	0.88+
ten population	QUANTITY	0.88+
couple of years ago	DATE	0.85+
WiDS 2019	EVENT	0.85+
one thing	QUANTITY	0.85+
Cuba	LOCATION	0.85+
Margo Garrett	PERSON	0.84+
about an hour ago	DATE	0.82+
U. N.	LOCATION	0.82+
twenty five years	QUANTITY	0.81+
Livestream	ORGANIZATION	0.77+
next ten years	DATE	0.73+
fourth annual	EVENT	0.69+
annual	QUANTITY	0.65+
half	QUANTITY	0.62+
fourth	EVENT	0.6+
Woods	ORGANIZATION	0.59+
four	QUANTITY	0.58+
UK	LOCATION	0.58+
wigs	QUANTITY	0.56+
Cube	COMMERCIAL_ITEM	0.52+
Anay	PERSON	0.31+

Christopher Bergey, Western Digital | Autotech Council 2018

>> Announcer: From Milpitas, California at the edge of Silicon Valley, it's The CUBE. Covering autonomous vehicles. Brought to you by Western Digital. >> Hey, welcome back everybody. Jeff Frick here with The Cube. We are at the Autotech Council Autonomous Vehicle event at Western Digital. Part of our Data Makes Possible Program with Western Digital where we're looking at all these cool applications and a lot of cutting edge technology that at the end of the day, it's data dependent and data's got to sit somewhere. But really what's interesting here is that the data, and more and more of the data is moving out to the edge and edge computing and nowhere is that more apparent than in autonomous vesicles so we're really excited to have maybe the best title at Western Digital, I don't know. Chris Bergey, VP of Product Marketing. That's not so special, but all the areas that he's involved with: mobile, compute, automotive, connected homes, smart cities, and if that wasn't enough, industrial IOT. Chris, you must be a busy guy. >> Hey, we're having a lot of fun here. This data world is an exciting place to be right now. >> so we're her at the Autonomous Vehicle event. We could talk about smart cities, which is pretty interesting, actually ties to it and internet of things and industrial internets, but what are some of the really unique challenges in autonomous vehicles that most people probably aren't thinking of? >> Well, I think that we all understand that really, autonomous vehicles are being made possible by just the immense amount of sensors that are being put into the car. Not much different than as our smartphones or our phones evolved from really not having a lot of sensors to today's smartphones have many, many sensors. Whether it's sensing your face, gyroscopes, GPS, all these kind of things. The car is having the exact thing happen but many, many more sensors. And, of course, those sensors just drive a tremendous amount of data and then it's really about trying to pull the intelligence out of that data and that's really what the whole artificial intelligence or autonomous is really trying to do is, okay, we've got all this data, how do I understand what's happening in the autonomous vehicle in a very short period of time? >> Right, and there's two really big factors that you've talked about and some of the other things that you've done. I did some homework and one of them is the metadata around the data, so there's the raw data itself that's coming off those sensors, but the metadata is a whole nother level, and a big level, and even more importantly is the context. What is the context of that data and without context, it's just data. It's not really intelligence or smarts or things you can do anything about so that baseline sensor data gets amplified significantly in terms of actually doing anything with that information. >> That's correct. I think one of the examples I give that's easier for people to understand is surveillance, right? We're very familiar with walking into a retail store where there's surveillance cameras and they're recording in the case that maybe there's a theft or something goes wrong, but there's so much data there that's not acutely being processed, right? How may people walked into the store? What was the average time a person came to the store? How many men? How many women? That's the context of the data and that's what's really would be very valuable if you were, say, an owner of the store or a regional manager. So that's really pulling the context out of the raw data. And in the car example, autonomous vehicles, hey, there's going to be something, my sensors are seeing something, and then, of course, you'd use multiple sensors. That's the sensor fusion between them of, "Hey, that's a person, that's a deer, oh, don't worry, "that's a car moving alongside of us and he's "staying in his lane." Those are the types of decisions we're making with this data and that's the context. >> Right, and even they had in the earlier presentation today the reflection of the car off the side of a bus, I mean, these are the nuance things that aren't necessarily obvious when you first start exploring. >> And we're dealing with human life, I mean, so obviously it needs to be right 99.999 plus percent. So that's the challenge, right? It's the corner cases and I think that's what we see with autonomous vehicles. It's really exciting to see the developments going on and, of course, there's been a couple challenges, but we just have so much learning to do to really get to that fifth nine or whatever it is from a probability point of view. And that's where we'll continue to work on those corner cases, but the technology is coming along so fast, it's just mind-boggling how quickly we are starting to attack these more difficult challenges. And we'll get there but it's going to take time like anything. >> The other really important thing, especially now where we're in the rise of Cloud, if you will. Amazon is going bananas. Google Cloud Platform, Microsoft Azure, so we're seeing this huge move of Cloud and enterprise IT. But in a car, right, there's this little thing called latency and this other thing called physics where you've got a real issue when you have to make a quick decision based on data and those sensors when something jumps out in front of the car. So really, the rise of edge computing and moving so much of that stored compute and intelligence into the vehicle and then deciding what goes back to the car to retrain the algorithm. So it's really a shift to back out to the edge, if you will, dependent because of this latency issue. >> Yeah, I mean, they're very complimentary, right? But there's a lot of decisions you can make locally and, obviously, there's a lot of advantages in doing that. Latency being one of them, but just cost of communications and again, what people don't necessarily understand is how big this data is. You see statistics thrown out there, one gigabit per second, two gigabits per second. I mean, that is just massive data. At the end of the day, actually, in some of the development, it's pretty interesting that we have the car developers actually FedExing the terabyte drives that they've captured data because it's the easiest way for them to actually transfer the data. I mean, people think, "Oh, internet connectivity, no problem." You try to ship 80 terabytes in a cost effective manner, FedEx ends up being the best shot right now. So it's pretty interesting. >> The old sneaker, that is pretty funny. But the quantities of this data are so big. I was teasing you on Twitter earlier today. I think we took it up to an xobyte, a zedobyte, a yodabyte, and then the crowd responded. No, it's a brontosaurousbyte is even bigger than a yodabyte. We were at Flink Forward earlier this week and really this whole idea of stream processing, it's really taking new approaches to data processing. You'll be able to take all that stuff in in real time, which probably state of the market now is financial trading and advertising markets. But to do that now in a car where if you make a mistake, there's really significant consequences. It's a really different challenge. >> It is and again, that's really this advent of the sensor data, right? The sensor data is going to swamp probably every other data set that's in the world, but a lot of it's not interesting because you don't know when that interesting event is going to happen. So what you actually find is that you try to put it's intelligence as close as you can to the data, end storage, and again, storage may be 30 seconds to if you had an accident, you want to be able to go back 30 seconds. It may be lifetimes. So just thinking about these data flows and what's the half life of the data relative to the value? But what we're actually finding with many of the machine learning is that data we thought was not valuable, data we thought, "Oh, we have the right amount of granularity," now with machine learning we're going back and saying, "Oh, why didn't we record at an even higher granularity?" We could have pulled out more of these trends or more of these corner cases. So I think that's one of the challenges enterprise are going through right now is that everyone's so scared of getting rid of any data, yet there's just tremendous data growth. And we're sitting right here in the middle of it at Western Digital. >> Well, thankfully for you guys, you're going to store all that data and it is really important, though, because it used to be, it's funny to me. It used to be a sample of things that happened in the past is how you would make your decisions. Now it's not a sample, it's all of what's happening now and hopefully you can make a decision while you still have time to have an impact. So it's a very different world but sampling is going away when, in theory, you don't know what you're going to need that data for and you have the ability to store it. >> Making real-time decisions but then also learning how to use that decision to make better decisions in the future. That's really where Silicon Valley's focused right now. >> All right, Chris, well you're a busy guy so we're going to let you get back to it because you also have to do IOT and industrial internet and mobile an compute. So thanks for taking ... >> And I try to eat in between there too. >> And you try to eat and hopefully see your kids Friday night, so hopefully you'll take >> Absolutely. your wife out to a movie tonight. >> All right, Chris, great to see you. Thanks for taking a few minutes. >> Chris: Thank you very much. >> All right, I'm Jeff Frick. You're watching The CUBE from Autotech Council Autonomous Vehicle event. Thanks for watching.

Published Date : Apr 14 2018

SUMMARY :

Brought to you by Western Digital. and more and more of the data is moving out to the edge Hey, we're having a lot of fun here. and internet of things and industrial internets, that are being put into the car. and a big level, and even more importantly is the context. So that's really pulling the context out of the raw data. necessarily obvious when you first start exploring. I mean, so obviously it needs to be right So it's really a shift to back out to the edge, captured data because it's the easiest way for them But to do that now in a car where if you make a mistake, of the sensor data, right? and hopefully you can make a decision while you still Making real-time decisions but then also learning how to so we're going to let you get back to it And I try to eat your wife out to a movie tonight. All right, Chris, great to see you. All right, I'm Jeff Frick.

ENTITIES

Entity	Category	Confidence
Jeff Frick	PERSON	0.99+
Chris	PERSON	0.99+
Chris Bergey	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Amazon	ORGANIZATION	0.99+
30 seconds	QUANTITY	0.99+
Christopher Bergey	PERSON	0.99+
Silicon Valley	LOCATION	0.99+
one	QUANTITY	0.99+
Friday night	DATE	0.99+
FedEx	ORGANIZATION	0.99+
80 terabytes	QUANTITY	0.99+
Flink Forward	ORGANIZATION	0.99+
today	DATE	0.99+
tonight	DATE	0.98+
Microsoft	ORGANIZATION	0.98+
Milpitas, California	LOCATION	0.97+
two gigabits per second	QUANTITY	0.95+
FedExing	ORGANIZATION	0.95+
Autotech Council Autonomous Vehicle	EVENT	0.95+
fifth nine	QUANTITY	0.93+
one gigabit per second	QUANTITY	0.93+
Twitter	ORGANIZATION	0.93+
first	QUANTITY	0.92+
earlier this week	DATE	0.92+
earlier today	DATE	0.91+
99.999 plus percent	QUANTITY	0.91+
two really big factors	QUANTITY	0.9+
Autotech Council	EVENT	0.84+
Cloud Platform	TITLE	0.82+
Autonomous Vehicle	EVENT	0.79+
The Cube	ORGANIZATION	0.76+
zedobyte	COMMERCIAL_ITEM	0.69+
Google	ORGANIZATION	0.67+
couple	QUANTITY	0.63+
Autotech Council	ORGANIZATION	0.57+
dec	DATE	0.48+
2018	DATE	0.47+
yodabyte	COMMERCIAL_ITEM	0.46+
Azure	TITLE	0.46+
CUBE	TITLE	0.43+
yodabyte	OTHER	0.3+

Oded Sagee, Western Digital | Autotech Council 2018

>> Announcer: From Milpitas, California at the edge of Silicon Valley, it's theCUBE, covering autonomous vehicles. Brought to you by Western Digital. >> Hey, welcome back everybody. Jeff Frick, here with theCUBE. We're in Milpitas, California, at Western Digital, at the Autotech Council Autonomous Vehicle Event. About 300 people, really deep into this space. It's a developing ecosystem. You know, we think about Tesla, that's kind of got a complete, closed system. But there's a whole ecosystem of other companies getting into the autonomous vehicle space, and as was mentioned in the keynote, there are, literally thousands of problems. A great opportunity for startups. So we're excited to have Oded Sagee, he's a senior director of product marketing from Western Digital. Oded, great to see you. >> Thank you very much, Jeff. >> So you were just on the panel and, really that was a big topic, is there are thousands of problems to solve and this ecosystem's trying to come together, but it's complicated, right? It's not just the big car manufacturers anymore, and the tier one providers, but there's this whole ecosystem that's now growing up to try to solve these problems. So what are you seeing from your point of view? >> Yes, correct. So, definitely in the past automotive was a tough market to play in, but it was simple from the amount of players and people you needed to talk to to design your product inside. With the disruption of connectivity, smart vehicles, even before autonomous, there are so many new systems in the car now that generate data or consume data. And so, for us, to kind of figure out what's the use case, right? How is this going to look in the future? Who's going to define it? Who's going to buy it? Who's going to pay for it? It has become more and more complex. Happily, storage is in the center of all this. >> Jeff: Right. >> So we get a seat at the table and everyone wants to talk to us, but yes, it's a very big ecosystem now. And trying to resolve that problem, it's going to take some time. >> So what are some of the unique characteristics, from a storage point of view, that you have to worry about? Obviously environmental jumps out. We had the guy on before talking about bumpy roads, you know, the huge impacts on vibration. And now you spent a lot of money for a Toughbook back in the day to put a laptop in a cop car, this is a whole other level of expense, investment, and data flow. >> Right. So, for us, I think with all this disruption happening of full autonomous, people are, very much focused on making that autonomous work, right? So, for them it's all about connectivity, it's all about the sensor, whether it's Lidar, or, you know, cameras. Just making that work, right? All the algorithms and the software. And so, for them storage, currently is an afterthought, right? They were saying, once we meet mass production we'll just go and buy some storage and everything's going to be fine. So while they're prototyping, right? They can use any storage that they want. But, if you think about a full autonomous vehicle out there driving, not two hours a day like we are driving today, right? 20 hours a day, from cold to hot, going through areas without connectivity. Suddenly, the storage requirements are very, very different. And this is what we're trying to drive and explain that, if we don't design the future storage solutions today, What's going to end up, is that people are going to pay much more for storage just to make a basic use case work. >> Right. >> But if we start working now, and I'm talking about five, seven years out, we can have affordable solutions to make those business models work. >> And is that resonating in the industry, or are they just too focused on, you know, better cameras? >> It definitely does, but as companies change, right? So let's just take the car makers for a second. They didn't necessarily have a CTO in place, right? To drive engineering and semi-conductor. So you got to find those figures, and you got to start working and educating them. It definitely resonates if you have the right person. Once you find him, yes, it's on the list of priority. So we need to push. But it is happening. Yes, it is resonating. >> And it's so different because you do have this edge case. You have so much data being collected out in the field, if you will, within that vehicle. Some, to go back to the cloud, but you've got latency is always an issue, right? For safety. So, a little different storage challenge. So are there significant design thoughts that you guys are bringing into play on why this is so different and what is it going to take to really have kind of an optimal solution for autonomous vehicles? >> Yes, definitely there are a couple of vectors I would say, or knobs we need to work on. One of them is temperature. So, again vehicles do tend to go between hot and cold. Unlike many other components that just need to make sure that they operate between hot and cold, we actually have a big challenge on keeping data being accurate between hot and cold. So if you program cold and read hot and vice versa, data gets corrupted. >> Oh, even within the structures within the media? >> Yes. >> Okay. >> And people don't know that. So, for us to figure out, what's the temperature range that the car, through its lifetime, is going to go through. And make sure that we meet the use case, that's a big one. What we call the endurance on the cycling of the storage, again, if you cannot rely on connectivity, cannot rely on cloud because of latency, you need to record a lot of data in the car. So, again, a car drives for seven years, 15 years, and you want to record constantly, how much do you need to record? We don't necessarily have the technology today to meet that use case and we need to work with the ecosystem, in figuring it out. So these are just two examples. >> And I would imagine clean power, as you're saying these things, but they can need others. You're not in daddy's data center anymore. This is a pretty harsh environment, I would imagine. >> Very harsh. >> Ugly power, inconsistent power, turning off the car before everything is spun down. There's all kinds of little, kind of environmental impacts in that whole realm that you would never think of in, kind of a typical data center, for instance. >> Correct. And even, you touched power, that's very interesting because even some people think, oh, there's not power limitation in a car. You can just enjoy how much power you want. Actually, it's very, very sensitive. The battery, if you think about an EV car now has so many components to run and so even the power consumption, right? Just the energy that you need to consume is becoming critical for each, and every component >> in the vehicle. >> Right. And it's everybody's AI comparison, right? Is if Kasparov had to fight the computer with the same amount of power, it wouldn't have been much of a match. So the power to run all this AI stuff is not insignificant, so it is going to be a huge drain on these electric vehicles. Pretty exciting times. So when you get up in the morning, what's the biggest thing, when you talk to people about autonomous vehicles, that they just don't get? That people should really be thinking about. >> Yeah, so it goes back to some of the things we've discussed. Definitely, again, we're seeing the use cases change. We are working again with the broad ecosystem to explain the fundamental challenges that we have, right? What is our design cycle? What are the challenges that we have? So we start with educating the ecosystem, so they know what we have. And from that we trigger a discussion because they realize, oh, okay, because I do have a use case that, probably, you don't have a solution for, how do we go together? And we're doing it across the board. It's not only happening in automotive. It's happening in surveillance. It's happening in the home space. A lot of people don't know, but the home space, if you think about it, again, set-top boxes used to be huge, sat outside in the room. People are moving to these sticks, right? And they're behind the TV and they have no ventilation and they're small and they record all the time. And they get to temperatures that we've never seen in the past. So we even need to educate the telcos of the world, the set-top box makers. Everything is changing. Automotive is definitely ahead in a lot of innovation and disruption, but everything is changing for us. >> Right, a lot of those are fond of just the bright shiny object that everybody can see, right? We can't necessarily see a lot of IOT that GE's putting in to connect their factories. Alright, Oded, well thanks for taking a few minutes out of your busy day and I really appreciate the insight. >> Thank you very much. >> All right, he's Oded, I'm Jeff, You're watching theCUBE from Western Digital at The Autonomous Vehicle Event for the Autotech Council. Thanks for watching. Catch you next time. (electronic music)

Published Date : Apr 14 2018

SUMMARY :

Brought to you by Western Digital. at the Autotech Council Autonomous Vehicle Event. So what are you seeing from your point of view? and people you needed to talk to So we get a seat at the table that you have to worry about? is that people are going to pay much more for storage just to make those business models work. So you got to find those figures, And it's so different because you do have this edge case. So if you program cold and read hot and vice versa, And make sure that we meet the use case, And I would imagine clean power, that you would never think of in, Just the energy that you need to consume So the power to run all this AI stuff but the home space, if you think about it, again, and I really appreciate the insight. at The Autonomous Vehicle Event for the Autotech Council.

ENTITIES

Entity	Category	Confidence
Jeff	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Oded	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Pat Gelsinger	PERSON	0.99+
seven years	QUANTITY	0.99+
15 years	QUANTITY	0.99+
Oded Sagee	PERSON	0.99+
Kasparov	PERSON	0.99+
Silicon Valley	LOCATION	0.99+
Tesla	ORGANIZATION	0.99+
Dell	ORGANIZATION	0.99+
two examples	QUANTITY	0.99+
GE	ORGANIZATION	0.99+
First	QUANTITY	0.99+
Milpitas, California	LOCATION	0.98+
Dell Technologies	ORGANIZATION	0.98+
two hours a day	QUANTITY	0.98+
20 hours a day	QUANTITY	0.98+
Three	QUANTITY	0.97+
About 300 people	QUANTITY	0.97+
theCUBE	ORGANIZATION	0.96+
VMware	ORGANIZATION	0.96+
today	DATE	0.96+
five	QUANTITY	0.93+
SKO V2	COMMERCIAL_ITEM	0.9+
Two	QUANTITY	0.9+
Autotech Council	ORGANIZATION	0.89+
each	QUANTITY	0.88+
2018	DATE	0.84+
Autonomous Vehicle Event	EVENT	0.83+
The Cube	TITLE	0.83+
One of	QUANTITY	0.82+
Autotech Council Autonomous Vehicle Event	EVENT	0.82+
thousands of problems	QUANTITY	0.77+
Toughbook	ORGANIZATION	0.75+
this decade	DATE	0.69+
vectors	QUANTITY	0.62+
couple	QUANTITY	0.59+
a second	QUANTITY	0.56+
money	QUANTITY	0.56+
lot	QUANTITY	0.53+
Autotech Council	EVENT	0.52+
EMC	ORGANIZATION	0.5+
one	QUANTITY	0.37+

Dave Tang, Western Digital & Martin Fink, Western Digital l | CUBEConversation Feb 2018

(inspirational music) >> Hey, welcome back everybody. Jeff Frick here with theCUBE. We are in our Palo Alto studio. The conference season hasn't really kicked off yet into full swing so we can do a lot more kind of intimate stuff here in the studio, for a CUBE Conversation. And we're really excited to have a many time CUBE alum on, and a new guest, both from Western Digital. So Dave Tang, Senior Vice President at Western Digital. Great to see you again, Dave. >> Great to be here, Jeff. >> Absolutely and Martin Fink, he is the Chief Technology Officer at Western Digital, a longtime HP alum. I'm sure people recognized you from that and our great machine keynotes we were talking about it. So great to finally meet you, Martin. >> Thank you, nice to be here. >> Absolutely, so you guys are here talking about and we've got an ongoing program actually with Western Digital about Data Makes Possible, right. With all the things that are going on in tech at the end of the day, right, there's data, it's got to be stored somewhere and then of course there's processes and things going on. We've been exploring media and entertainment, sports, healthcare, autonomous vehicles, you know. All the places that this continues to reach out and it's such a fun project because you guys are a rising tide, lifts all boats, kind of company and really enjoy watching this whole ecosystem grow. So I really want to thank you for that. But now there's some new things that we want to talk about that you guys are doing to continue really in that same theme, and that's the support of this RISC-V. So first off, for people who have no idea, what is RISC-V? Let's jump into that and then kind of what is the announcement and why it's important. >> Sure, so RISC-V is an, you know, the tagline is, it's an open source instruction set architecture. So what does that mean, just so people can kind of understand. So today the world is dominated by two instruction set architectures. For the most part the, we'll call the desktop enterprise world is dominated by the Intel instruction set architecture and that's what's in most PCs, what people talk about as x86. And then the embedded and mobile space tends to be dominated by Arm, or by Arm Holdings. And so both of those are great architectures but they're also proprietary, they're owned by their respective companies. So RISC-V is essentially a third entrant, we'll say, into this world, but the distinction is that it's completely open source. So everything about the instruction set is available to all and anybody can implement it. We can all share the implementations. We can share the code that makes up that instruction set architecture, and very importantly for us and part of our motivation is the freedom to innovate. So we now have the ability to modify the instruction set or change the implementation of the instruction set, to optimize it for our devices and our storage and our drives, etc. >> So is this the first kind of open source play in microprocessor architecture? >> No, there's been other attempts at this. OpenSpark kind of comes to mind, and things like that, but the ability to get a community of individuals to kind of rally around this in a meaningful way has really been a challenge. And so I'd say that right now, RISC-V presents probably the best sort of clean slate, let's take some thing new to the market out there. >> So open source, obviously we've seen you know, take over the software world, first in the operating system which everybody is familiar with Linux but then we see it time and time again in different applications, Hadoop. I mean, there's just a proliferation of open source projects. The benefits are tremendous. Pretty easy to ascertain at a typical software case, how is that going to be applied do you think within the microprocessor world? >> So it's a little bit different. When we're talking about open source hardware or open source chips and microprocessors, you're dealing with a physical device. So even though you can open source all of the designs and the code associated with that device, you still have to fabricate it. You still have to create a physical design and you still have to call up a fab and say, will you make this for me at these particular volumes? And so that's the difference. So there are some differences between open source software where it's, you know, you create the bits and then you distribute those bits through the Internet and all is good. Whereas here, you still have a physical need to fabricate something. >> Now, how much more flexibility can you do then for the output when you can actually impact the architecture as opposed to just creating a custom chip design, on top of somebody else's architecture? >> Well, let me give you probably a really simple, concrete example that kind of people can internalize of some of our motivation behind this, because that might sort of help get people through this. If you think of a very typical surveillance application, you have a camera pointed into a room or a hallway. The reality is we're basically grabbing a ton of video frames but very few of them change, right? So the typical surveillance application is it never changes and you really want, only know when stuff changes. Well, today, in very simple terms, all of those frames get routed up to some big server somewhere and that server spends a lot of time trying to figure out, okay have I got a frame that changed? Have I got a frame that changed, and so on. And then eventually it'll find maybe two or three or five frames that have got something interesting. So in the world what we're trying to do is to say, okay well why don't we take that, find no changes, and push that right down to the device? So we basically store all those frames, why don't we go figure out all the frames that mean nothing, and only ship up to that big bad server the frames that have something interesting and something you want to go analyze and do some work on? So that's a very typical application that's quite meaningful because we can do all of that work at the device. We can eliminate shipping a whole bunch of data to where it's just going to get discarded anyways, and we can allow the end customer to really focus on the data that matters, and get some intelligence. >> And that's critical as we get more and more immersed in a data-centric world, where we have realtime applications like Martin described as well as large data-centric applications like of course, big data analytics, but also training for AI systems or machine learning. These workloads are going to become more and more diverse and they're going to need more specialized architectures and more specialized processing. So big data is getting bigger and faster and these realtime fast data applications are getting faster and bigger. So we need ways to contend with that, that really go beyond what's available with general purpose architectures. >> So that's a great point because if we take this example of video frames, now if I can build a processor that is customized to only do that, that's the only thing it does. It can be very low power, very efficient, and do that one thing very very well, and the cost adder, if you want to call it that, to the device where we put it, is a tiny fraction, but the cost savings of the overall solution is significant. So this ability to customize the instruction set to only do what you need it to do for that very special purpose, that's gold. >> So I just wanted to, Dave, we've talked about a lot of interesting innovations that you guys have come up with over the years, with the helium launch. Which I don't know, a couple, two, three years ago, you were just at the MAMR event, really energy assisted recording. So this is really kind of foundational within the storage and the media itself and how you guys do better and take advantage of evolving land space. This is a kind of a different play for Western Digital, this isn't a direct kind of improvement in the way that storage media and architecture works but this is really more of, I'm going to ask you. What is the Western Digital play here? Why is this an important space for you guys in your core storage business? >> Well we're really broadening our focus to really develop and innovate around technologies that really help the world extract more value from data as a whole, right. So it's way beyond storage these days, right. We're looking for better ways to capture, preserve, access, and transform the data. And unless you transform it, you can't really extract the value out of it so as we see all these new applications for data and the vast possibilities for data, we really want to pave the path and help the industry innovate to bring all those applications to reality. >> It's interesting too because one of the great topics always in computing is you know, you got compute and store, which has to go to which, right. And nobody wants to move a lot of data, that's hard and may or may not be easy to get compute. Especially these IoT applications, remote devices, tough conditions and power, which we mentioned a little bit before we went on air. So the the landscape for the for the need for compute and store in networking is radically changing than either the desktop or what we're seeing a consolidation in clouds. So what's interesting here, where does the scale come, right? At the end of the day, scale always wins. And that's where we've seen historically where the general-purpose microprocessor architectures is dominated but used to be a slew of specialty purpose architectures but now there's an opportunity to bring scale to this. So how does that scale game continue to evolve? >> So it's a great point that scale does matter and we've seen that repeatedly and so it's a significant part of the reason why we decided to go early with a significant commitment was to tell the world that we were bringing scale to the equation. And so what we communicated to the marketplace is we ship on the order of a billion processor cores a year, most people don't realize that all of our devices from USB sticks to hard drives, all have processors on them. And so we said, hey we're going to basically go all-in and go big and that translates into a billion cores that we ship every year and we're going to go on a program to essentially migrate all of those cores to RISC-V. It'll take a few years to get there but we'll migrate all of those cores and so we basically were signaling to the market, hey scale is now here. Scale is here, you can make the investments, you can go forward, you can make that commitment to RISC-V because essentially we've got your back. >> So just to make sure we get that clear. So you guys have announced that you're going to slowly migrate over time your micro processors that power your devices to the tune of approximately a billion with a B, cores per year to this new architecture. >> That is correct. >> And has that started? >> So the design has started. So we have started to design and develop our first two cores but the actual manifestation into devices probably in the early stage of 2020. >> Okay, okay. But that's a pretty significant commitment and again, the ideas you explicitly said it's a signal to the ecosystem, this is worth your investment because there is some scale here. >> Martin: That's right. >> Yeah, pretty exciting. And how do you think it's going to open up the ability for you to do new things with your devices that you before either couldn't do or we're too expensive with dollars or power. >> Martin: So we're going to step and iterate through this and one key point here is a lot of people tend to want to start in this processor world at the very high end, right. I'm going to go take on a Xeon processor or something like that. It's not what we're doing. We're basically saying, we're going to go at the small end, the tiny end where power matters. Power matters a lot in our devices and where can we achieve the optimum combination of power and performance. So even in our small devices like a USB stick or a client SSD or something like that, if we can reduce power consumption and even just maintain performance that's a huge win for our customers, you know. If you think about your laptop and if I reduce the power consumption of that SSD in there so that you have longer battery life and you can get you know through the day better, that's a huge win, right. And I don't impact performance in the process, that's a huge win. So what we do, what we're doing right now is we're developing the cores based on the RISC-V architecture and then what we're going to do is once we've got that sort of design, sort of complete is we want to take all of the typical client workloads and profile them on that. Then we want to find out, okay where are the hot spots? What are the two or three things that are really consuming all the power and how do we go optimize, by either creating two or three instructions or by optimizing the micro architecture for an existing instruction. And then iterate through that a few times so that we really get a big win, even at the very low end of the spectrum and then we just iterate through that with time. >> We're in a unique position I think in that the technologies that we develop span everything from the actual media where the bits are stored, whether it's solid-state flash or rotating magnetic disk and the recording heads. We take those technologies and build them all the way up into devices and platforms and full-fledged data center systems. And if we can optimize and tune all the way from that core media level all the way up through into the system level, we can deliver significantly higher value, we believe, to the marketplace. So this is the start of that, that enables us to customize command sets and optimize the flow of data so that we can we can allow users to access it when and where they need it. >> So I think there's another actually really cool point, which goes back to the open source nature of this and we try to be very clear about this. We're not going to develop our cores for all applications. We want the world to develop all sorts of different cores. And so for many applications somebody else might come in and say, hey we've got a really cool core. So one of the companies we've partnered with and invested in for example, is Esperanto. They've actually decided to go at the high end and do a machine learning accelerator. Hey, maybe we'll use that for some machine learning applications in our system level performance. So we don't have to do it all but we've got a common architecture across the portfolio and that speaks to that sort of open source nature of the RISC-V architecture is we want the world to get going. We want our competitors to get on board, we want partners, we want software providers, we want everybody on board. >> It's such a different ecosystem with open-source and the way the contributions are made and the way contributions are valued and the way that people can go find niches that are underserved. It's this really interesting kind of bifurcation of the market really, you don't really want to be in the big general-purpose middle anymore. That's not a great place to be, there's all kinds of specialty places where you can build the competence and with software and you know with, thank goodness for Moore's law decreasing prices of the power of the compute and now the cloud, which is basically always available. Really a exciting time to develop a myriad of different applications. >> Right and you talked before about scale in terms of points of implementation that will drive adoption and drive this to critical mass but there's another aspect of scale relative to the architecture within a single system that's also important that I think RISC-V helps to break down some barriers. Because with general purpose computer architectures, they assume a certain ratio of memory and storage and processing and bandwidth for interconnect and if you exceed those ratios, you have to add a whole new processor. Even though you don't need to need the processing capability, you need it for scale. So that's another great benefit of these new architectures is that the diversity of data needs where some are going to be large data sets, some are going to be small data sets that need need high bandwidth. You can customize and blend that recipe as you need to, you're not at the mercy of these fixed ratios. >> Yeah and I think you know it's so much of kind of what is cloud computing. And the atomic nature of it, that you can apply the ratios, the amount that you need as you need, you can change it on the fly, you can tone it up, tone it down. And I think the other interesting thing that you touched on is some of these new, which are now relatively special-purpose but are going to be general-purpose very soon in terms of machine learning and AI and applying those to different places and applying them closer to the problem. It's a very very interesting evolution of the landscape but what I want to do is kind of close on you Martin, especially because again kind of back to the machine. Not the machine specifically but you have been in the business of looking way down the road for a long time. So you came out, I'd looked at your LinkedIn, you retired for three months, congratulations. (laughs) Hope you got some my golf in but you came back to Western Digital so why did you come back? And as you look down the road a ways, what do you see that it excites you, that got you off that three-month little tour around the golf course and I'm sorry I had to tease about that. But what do you see? What are you excited about that you came back and got involved in an open source microprocessor project? >> So the the short answer was that, I saw the opportunity at Western Digital to be where data lives. So I had spent my entire career, will call it at the compute or the server side of things and the interesting thing is I had a very close relationship with SanDisk, which was acquired by Western Digital. And so I had, we'll call it an insider view, of what was possible there and so what triggered was essentially what we're talking about here was given that about half the world's data lands on Western Digital devices, taking that from a real position of strength in the marketplace and say, what could we go do to make data more intelligent and rather than start kind of at that server end and so that I saw that potential there and it was just incredible, so that's that's what made me want to join. >> Exciting times. Dave good get. (laughs) >> We're delighted to have Martin with us. >> All right, well we look forward to watch it evolve. We've got another another whole set of events we're going to do again together with Western Digital that we're excited about. Again, covering Data Makes Possible but you know kind of uplifting into the application space as a lot of the cool things that people are doing in innovation. So Martin, great to finally meet you and thanks for stopping by. >> Thanks for the time. >> David as always and I think we'll see in a month or so. >> Right, always a pleasure Jeff, thanks. >> All right Martin Fink, Dave Tang. I'm Jeff Frick, you're watching theCUBE. Thanks for watching, we'll catch you next time. (inspirational music)

Published Date : Feb 1 2018

SUMMARY :

Great to see you again, Dave. So great to finally meet you, Martin. and that's the support of this RISC-V. So everything about the instruction set is available to all but the ability to get a community of individuals how is that going to be applied do you think and the code associated with that device, and something you want to go analyze and do some work on? and they're going to need more specialized architectures and the cost adder, if you want to call it that, and how you guys do better and the vast possibilities for data, So how does that scale game continue to evolve? and so it's a significant part of the reason why So just to make sure we get that clear. So the design has started. and again, the ideas you explicitly said that you before either couldn't do so that you have longer battery life and and optimize the flow of data and that speaks to that sort of open source nature and with software and you know with, is that the diversity of data needs where the amount that you need as you need, and the interesting thing is I had (laughs) So Martin, great to finally meet you David as always and I think Thanks for watching, we'll catch you next time.

ENTITIES

Entity	Category	Confidence
Nutanix	ORGANIZATION	0.99+
Western Digital	ORGANIZATION	0.99+
John	PERSON	0.99+
David	PERSON	0.99+
Krista	PERSON	0.99+
Bernie Hannon	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Bernie	PERSON	0.99+
H3C	ORGANIZATION	0.99+
Citrix	ORGANIZATION	0.99+
September of 2015	DATE	0.99+
Dave Tang	PERSON	0.99+
Krista Satterthwaite	PERSON	0.99+
SanDisk	ORGANIZATION	0.99+
Martin	PERSON	0.99+
James White	PERSON	0.99+
Sue	PERSON	0.99+
Cisco	ORGANIZATION	0.99+
Carol Dweck	PERSON	0.99+
Martin Fink	PERSON	0.99+
Jeff	PERSON	0.99+
HPE	ORGANIZATION	0.99+
two	QUANTITY	0.99+
Stu Miniman	PERSON	0.99+
Dave	PERSON	0.99+
Dave allante	PERSON	0.99+
John Furrier	PERSON	0.99+
Raghu	PERSON	0.99+
Raghu Nandan	PERSON	0.99+
Palo Alto	LOCATION	0.99+
three	QUANTITY	0.99+
Lee Caswell	PERSON	0.99+
HP	ORGANIZATION	0.99+
Antonio Neri	PERSON	0.99+
five years	QUANTITY	0.99+
three-month	QUANTITY	0.99+
four-year	QUANTITY	0.99+
one minute	QUANTITY	0.99+
Gary	PERSON	0.99+
Antonio	PERSON	0.99+
Feb 2018	DATE	0.99+
2023	DATE	0.99+
seven dollars	QUANTITY	0.99+
three months	QUANTITY	0.99+
Arm Holdings	ORGANIZATION	0.99+
first	QUANTITY	0.99+

John Rydning, IDC | Western Digital the Next Decade of Big Data 2017

>> Announcer: Live from San Jose, California, it's theCUBE covering innovating to fuel the next decade of big data. Brought to you by Western Digital. >> Hey, welcome back everybody. Jeff Frick, here with theCUBE. We are at the Western Digital Headquarters in San Jose, California. It's the Al-Mady Campus. A historic campus. It's had a lot of great innovation, especially in hard drives for years and years and years. This event's called Innovating to Fuel the Next Data Big Data. And we're excited to have a big brain on. We like to get smart people who's been watching this story for a while and will give us a little bit of historical perspective. It's John Rydning. He is the Research Vice President for Hard Drives for IEC. John, Welcome. >> Thank you, Jeff. >> Absolutely. So, what is your take on today's announcement? >> I think it's our very meaningful announcement, especially when you consider that the previous BIGIT Technology announcement for the industry was Helium, about four or five years ago. But, really, the last big technology announcement prior to that was back in 2005, 2006, when the industry announced making this transition to what they called at that time, "Perpendicular Magnetic Recording." And when that was announced it was kind of a similar problem at that time in the industry that we have today, where the industry was just having a difficult time putting more data on each disc inside that drive. And, so they kind of hit this technology wall. And they announced Perpendicular Magnetic Recording and it really put them on a new S curve in terms of their ability to pack more data on each disc and just kind of put it in some perspective. So, after they announce Perpendicular Magnetic Recording, the capacity per disc increased about 30% a year for about five years. And then over, really, a ten year period, increased about an average of about 20% a year. And, so today's announcement is I see a lot of parallels to that. You know, back when Perpendicular Magnetic Recording was announced, really they build. They increased the capacity per platter was growing very slowly. That's where we are today. And with this announcement of MAMR Technology the direction that Western Digital's choosing really could put the industry on a new S curve and putting in terms of putting more capacity, storage capacity on each one of those discs. >> It's interesting. Always reminds me kind of back to the OS in Microsoft in Intel battles. Right? Intel would come out with a new chip and then Microsoft would make a bigger OS and they go back and back and forth and back and forth. >> John: Yeah, that's very >> And we're seeing that here, right? Cuz the demands for the data are growing exponentially. I think one of the numbers that was thrown out earlier today that the data thrown off by people and the data thrown off by machines is so exponentially larger than the data thrown off by business, which has been kind of the big driver of IT spin. And it's really changing. >> It's a huge fundamental shift. It really is >> They had to do something. Right? >> Yeah, the demand for a storage capacity by these large data centers is just phenomenal and yet at the same time, they don't want to just keep building new data center buildings. And putting more and more racks. They want to put more storage density in that footprint inside that building. So, that's what's really pushing the demand for these higher capacity storage devices. They want to really increase the storage capacity per cubic meter. >> Right, right. >> Inside these data centers. >> It's also just fascinating that our expectation is that they're going to somehow pull it off, right? Our expectation that Moore's laws continue, things are going to get better, faster, cheaper, and bigger. But, back in the back room, somebody's actually got to figure out how to do it. And as you said, we hit these kind of seminal moments where >> Yeah, that's right. >> You do get on a new S curve, and without that it does flatten out over time. >> You know, what's interesting though, Jeff, is really about the time that Perpendicular Magnetic Recording was announced way back in 2005, 2006, the industry was really, already at that time, talking about these thermal assist technologies like MAMR that Western Digital announced today. And it's always been a little bit of a question for those folks that are either in the industry or watching the industry, like IDC. And maybe even even more importantly for some of the HDD industry customers. They're kind of wondering, so what's really going to be the next technology race horse that takes us to that next capacity point? And it's always been a bit of a horse race between HAMR and MAMR. And there's been this lack of clarity or kind of a huge question mark hanging over the industry about which one is it going to be. And Western Digital certainly put a stake in the ground today that they see MAMR as that next technology for the future. >> (mumbles words) Just read a quote today (rushes through name) key alumni just took a new job. And he's got a pin tweet at the top of his thing. And he says, "The smart man looks for ways "To solve the problem. "Or looks at new solutions. "The wise man really spends his time studying the problem." >> I like that. >> And it's really interesting here cuz it seems kind of obvious there. Heat's never necessarily a good thing with electronics and data centers as you mentioned trying to get efficiency up. There's pressure as these things have become huge, energy consumption machines. That said, they're relatively efficient, based on other means that we've been doing they compute and the demand for this compute continues to increase, increase, increase, increase. >> Absolutely >> So, as you kind of look forward, is there anything kind of? Any gems in the numbers that maybe those of us at a layman level are kind of a first read are missing that we should really be paying attention that give us a little bit of a clue of what the feature looks like? >> Well, there's a couple of major trends going on. One is that, at least for the hard drive industry, if you kind of look back the last ten years or so, a pretty significant percentage of the revenue that they've generated a pretty good percentage of the petabytes that they ship have really gone into the PC market. And that's fundamentally shifting. And, so now it's really the data centers, so that by the time you get to 2020, 2021, about 60 plus percent of the petabytes that the industry's shipping is going into data centers, where if you look back a few years ago, 60% was going into PCs. That's a big, big change for the industry. And it's really that kind of change that's pushing the need for these higher capacity hard drives. >> Jeff: Right. >> So, that's, I think, one of the biggest shifts has taking place. >> Well, the other thing that's interesting in that comment because we know scale drives innovation better than anything and clearly Intel microprocessors rode the PC boom to get out scale to drive the innovation. And, so if you're saying, now, that the biggest scale is happening in the data center Then, that's a tremendous force for innovation in there versus Flash, which is really piggy-backing on the growth of these jobs, because that's where it's getting it's scale. So, when you look at kind of the Flash hard drive comparison, right? Obviously, Flash is the shiny new toy getting a lot of buzz over the last couple years. Western Digital has a play across the portfolio, but the announcement earlier today said, you're still going to have like this TenX cost differentiation. >> Yeah, that's right. >> Even through, I think it was 20, 25. I don't want to say what the numbers were. Over a long period of time. You see that kind of continuing DC&E kind of conflict between those two? Or is there a pretty clear stratification between what's going to go into Flash systems, or what's going to hard drives? >> That's a great question, now. So, even in the very large HyperScale data centers and we definitely see where Flash and hard disk drives are very complimentary. They're really addressing different challenges, different problems, and so I think one of the charts that we saw today at the briefing really is something that we agree with strongly at IDC. Today, maybe, about 7% or 8% of all of the combined HDD SSD petabyte shipped for enterprise are SSD petabytes. And then, that grows to maybe ten. >> What was it? Like 7% you said? >> 6% to 7%. >> 6% to 7% okay. Yeah, so we still have 92, 93%, 94% of all petabytes that again are HDD SSD petabytes for enterprise. Those are still HDD petabytes. And even when you get out to 2020, 2021, again, still bought 90%. We agree with what Western Digital talked about today. About 90% of the combined HDD SSD petabytes that are shipping for enterprise continue to be HDD. So, we do see the two technologies very complementary. Talked about SSD is kind of getting their scale on PCs and that's true. They really are going to quickly continue to become a bigger slice of the storage devices attached to new PCs. But, in the data center you really need that bulk storage capacity, low cost capacity. And that's where we see that the two SSDs and HDDs are going to live together for a long time. >> Yeah, and as we said the conflict barrier, complimentary nature of the two different applications are very different. You need the big data to build the models, to run the algorithms, to do stuff. But, at the same time, you need the fast data that's coming in. You need the real time analytics to make modifications to the algorithms and learn from the algorithms >> That's right, yeah. It's the two of those things together that are one plus one makes three type of solution. Exactly, and especially to address latency. Everybody wants their data fast. When you type something into Google, you want your response right away. And that's where SSDs really come into play, but when you do deep searches, you're looking through a lot of data that has been collected over years and a lot of that's probably sitting on hard disc drives. >> Yeah. The last piece of the puzzle, I just want to you to address before we sign off, That was an interesting point is that not just necessarily the technology story, but the ecosystem story. And I thought that was really kind of, I thought, the most interesting part of the MAMR announcement was that it fits in the same form factor, there's no change to OS, there's no kind of change in the ecosystem components in which you plug this in. >> Yeah, that's right. It's just you take out the smaller drive, the 10, or the 12, or whatever, or 14 I guess is coming up. And plug in. They showed a picture of a 40 terabyte drive. >> Right. >> You know, that's the other part of the story that maybe doesn't get as much play as it should. You're playing in an ecosystem. You can't just come up with this completely, kind of independent, radical, new thing, unless it'S so radical that people are willing to swap out their existing infrastructure. >> I completely agree. It's can be very difficult for the customer to figure out how to adopt some of these new technologies and actually, the hard disk drive industry has thrown a couple of technologies at their customers over the past five, six years, that have been a little challenging for them to adopt. So, one was when the industry went from a native 512 by sectors to 4K sectors. Seems like a pretty small change that you're making inside the drive, but it actually presented some big challenges for some of the enterprise customers. And even the single magnetic recording technologies. So, it has a way to get more data on the disc, and Western Digital certainly talked about that today. But, for the customer trying to plug and play that into a system and SMR technology actually created some real challenges for them to figure out how to adopt that. So, I agree that what was shown today about the MAMR technology is definitely a plug and play. >> Alright, we'll give you the last word as people are driving away today from the headquarters. They got a bumper sticker as to why this is so important. What's it say on the bumper sticker about MAMR? It says that we continue to get more capacity at a lower cost. >> (chuckles) Isn't that just always the goal? >> I agree. >> (chuckles) Alright, well thank you for stopping by and sharing your insight. Really appreciate it. >> Thanks, Jeff. >> Alright. Jeff Frick here at Western Digital. You're watching theCUBE! Thanks for watching. (futuristic beat)

Published Date : Oct 12 2017

SUMMARY :

Brought to you by Western Digital. He is the Research Vice President So, what is your take on today's announcement? for the industry was Helium, about four or five years ago. Always reminds me kind of back to the OS that the data thrown off by people It's a huge fundamental shift. They had to do something. Yeah, the demand for a storage capacity But, back in the back room, and without that it does flatten out over time. as that next technology for the future. "To solve the problem. and the demand for this compute continues And it's really that kind of change that's pushing the need one of the biggest shifts has taking place. and clearly Intel microprocessors rode the PC boom You see that kind of continuing DC&E kind of conflict So, even in the very large HyperScale data centers of the storage devices attached to new PCs. You need the big data to build the models, It's the two of those things together is that not just necessarily the technology story, the 10, or the 12, or whatever, or 14 I guess is coming up. that's the other part of the story that maybe doesn't get And even the single magnetic recording technologies. What's it say on the bumper sticker about MAMR? and sharing your insight. Thanks for watching.

ENTITIES

Entity	Category	Confidence
Jeff Frick	PERSON	0.99+
John Rydning	PERSON	0.99+
Jeff	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
John	PERSON	0.99+
2005	DATE	0.99+
two	QUANTITY	0.99+
7%	QUANTITY	0.99+
90%	QUANTITY	0.99+
2006	DATE	0.99+
60%	QUANTITY	0.99+
2020	DATE	0.99+
two technologies	QUANTITY	0.99+
94%	QUANTITY	0.99+
92, 93%	QUANTITY	0.99+
each disc	QUANTITY	0.99+
Microsoft	ORGANIZATION	0.99+
2021	DATE	0.99+
ten year	QUANTITY	0.99+
Today	DATE	0.99+
San Jose, California	LOCATION	0.99+
20	QUANTITY	0.99+
today	DATE	0.99+
8%	QUANTITY	0.99+
one	QUANTITY	0.99+
BIGIT	ORGANIZATION	0.99+
two different applications	QUANTITY	0.99+
40 terabyte	QUANTITY	0.99+
25	QUANTITY	0.99+
MAMR	ORGANIZATION	0.98+
ten	QUANTITY	0.98+
IDC	ORGANIZATION	0.98+
12	QUANTITY	0.98+
first read	QUANTITY	0.98+
MAMR Technology	ORGANIZATION	0.98+
Intel	ORGANIZATION	0.97+
theCUBE	ORGANIZATION	0.97+
about 20% a year	QUANTITY	0.97+
about 30% a year	QUANTITY	0.97+
about five years	QUANTITY	0.97+
One	QUANTITY	0.96+
Google	ORGANIZATION	0.96+
about 7%	QUANTITY	0.96+
HAMR	ORGANIZATION	0.96+
14	QUANTITY	0.95+
next decade	DATE	0.95+
five years ago	DATE	0.95+
10	QUANTITY	0.94+
6%	QUANTITY	0.94+
512	QUANTITY	0.93+
About 90%	QUANTITY	0.91+
about 60 plus percent	QUANTITY	0.91+
last couple years	DATE	0.91+
earlier today	DATE	0.9+
single	QUANTITY	0.89+
six years	QUANTITY	0.89+
few years ago	DATE	0.88+

Janet George , Western Digital | Western Digital the Next Decade of Big Data 2017

>> Announcer: Live from San Jose, California, it's theCUBE, covering Innovating to Fuel the Next Decade of Big Data, brought to you by Western Digital. >> Hey welcome back everybody, Jeff Frick here with theCUBE. We're at Western Digital at their global headquarters in San Jose, California, it's the Almaden campus. This campus has a long history of innovation, and we're excited to be here, and probably have the smartest person in the building, if not the county, area code and zip code. I love to embarrass here, Janet George, she is the Fellow and Chief Data Scientist for Western Digital. We saw you at Women in Data Science, you were just at Grace Hopper, you're everywhere and get to get a chance to sit down again. >> Thank you Jeff, I appreciate it very much. >> So as a data scientist, today's announcement about MAMR, how does that make you feel, why is this exciting, how is this going to make you be more successful in your job and more importantly, the areas in which you study? >> So today's announcement is actually a breakthrough announcement, both in the field of machine learning and AI, because we've been on this data journey, and we have been very selectively storing data on our storage devices, and the selection is actually coming from the preconstructed queries that we do with business data, and now we no longer have to preconstruct these queries. We can store the data at scale in raw form. We don't even have to worry about the format or the schema of the data. We can look at the schema dynamically as the data grows within the storage and within the applications. >> Right, cause there's been two things, right. Before data was bad 'cause it was expensive to store >> Yes. >> Now suddenly we want to store it 'cause we know data is good, but even then, it still can be expensive, but you know, we've got this concept of data lakes and data swamps and data all kind of oceans, pick your favorite metaphor, but we want the data 'cause we're not really sure what we're going to do with it, and I think what's interesting that you said earlier today, is it was schema on write, then we evolved to schema on read, which was all the rage at Hadoop Summit a couple years ago, but you're talking about the whole next generation, which is an evolving dynamic schema >> Exactly. >> Based whatever happens to drive that query at the time. >> Exactly, exactly. So as we go through this journey, we are now getting independent of schema, we are decoupled from schema, and what we are finding out is we can capture data at its raw form, and we can do the learning at the raw form without human interference, in terms of transformation of the data and assigning a schema to that data. We got to understand the fidelity of the data, but we can train at scale from that data. So with massive amounts of training, the models already know to train itself from raw data. So now we are only talking about incremental learning, as the train model goes out into the field in production, and actually performs, now we are talking about how does the model learn, and this is where fast data plays a very big role. >> So that's interesting, 'cause you talked about that also earlier in your part of the presentation, kind of the fast data versus big data, which kind of maps the flash versus hard drive, and the two are not, it's not either or, but it's really both, because within the storage of the big data, you build the base foundations of the models, and then you can adapt, learn and grow, change with the fast data, with the streaming data on the front end, >> Exactly >> It's a whole new world. >> Exactly, so the fast data actually helps us after the training phase, right, and these are evolving architectures. This is part of your journey. As you come through the big data journey you experience this. But for fast data, what we are seeing is, these architectures like Lambda and Kappa are evolving, and especially the Lambda architecture is very interesting, because it allows for batch processing of historical data, and then it allows for what we call a high latency layer or a speed layer, where this data can then be promoted up the stack for serving purposes. And then Kappa architecture's where the data is being streamed near real time, bounded and unbounded streams of data. So this is again very important when we build machine learning and AI applications, because evolution is happening on the fly, learning is happening on the fly. Also, if you think about the learning, we are mimicking more and more on how humans learn. We don't really learn with very large chunks of data all at once, right? That's important for initially model training and model learning, but on a regular basis, we are learning with small chunks of data that are streamed to us near real time. >> Right, learning on the Delta. >> Learning on the Delta. >> So what is the bound versus the unbound? Unpack that a little bit. What does that mean? >> So what is bounded is basically saying, hey we are going to get certain amounts of data, so you're sizing the data for example. Unbounded is infinite streams of data coming to you. And so if your architecture can absorb infinite streams of data, like for example, the sensors constantly transmitting data to you, right? At that point you're not worried about whether you can store that data, you're simply worried about the fidelity of that data. But bounded would be saying, I'm going to send the data in chunks. You could also do bounded where you basically say, I'm going to pre-process the data a little bit just to see if the data's healthy, or if there is signal in the data. You don't want to find that out later as you're training, right? You're trying to figure that out up front. >> But it's funny, everything is ultimately bounded, it just depends on how you define the unit of time, right, 'cause you take it down to infinite zero, everything is frozen. But I love the example of the autonomous cars. We were at the event with, just talking about navigation just for autonomous cars. Goldman Sachs says it's going to be a seven billion dollar industry, and the great example that you used of the two systems working well together, 'cause is it the car centers or is it the map? >> Janet: That's right. >> And he says, well you know, you want to use the map, and the data from the map as much as you can to set the stage for the car driving down the road to give it some level of intelligence, but if today we happen to be paving lane number two on 101, and there's cones, now it's the real time data that's going to train the system. But the two have to work together, and the two are not autonomous and really can't work independent of each other. >> Yes. >> Pretty interesting. >> It makes perfect sense, right. And why it makes perfect sense is because first the autonomous cars have to learn to drive. Then the autonomous cars have to become an experienced driver. And the experience cannot be learned. It comes on the road. So one of the things I was watching was how insurance companies were doing testing on these cars, and they had a human, a human driving a car, and then an autonomous car. And the autonomous car, with the sensors, were predicting the behavior, every permutation and combination of how a bicycle would react to that car. It was almost predicting what the human on the bicycle would do, like jump in front of the car, and it got it right 80% of the cases. But a human driving a car, we're not sure how the bicycle is going to perform. We don't have peripheral vision, and we can't predict how the bicycle is going to perform, so we get it wrong. Now, we can't transmit that knowledge. If I'm a driver and I just encountered a bicycle, I can't transmit that knowledge to you. But a driverless car can learn, it can predict the behavior of the bicycle, and then it can transfer that information to a fleet of cars. So it's very powerful in where the learning can scale. >> Such a big part of the autonomous vehicle story that most people don't understand, that not only is the car driving down the road, but it's constantly measuring and modeling everything that's happening around it, including bikes, including pedestrians, including everything else, and whether it gets in a crash or not, it's still gathering that data and building the model and advancing the models, and I think that's, you know, people just don't talk about that enough. I want follow up on another topic. So we were both at Grace Hopper last week, which is a phenomenal experience, if you haven't been, go. Ill just leave it at that. But Dr. Fei-Fei Li gave one of the keynotes, and she made a really deep statement at the end of her keynote, and we were both talking about it before we turned the cameras on, which is, there's no question that AI is going to change the world, and it's changing the world today. The real question is, who are the people that are going to build the algorithms that train the AI? So you sit in your position here, with the power, both in the data and the tools and the compute that are available today, and this brand new world of AI and ML. How do you think about that? How does that make you feel about the opportunity to define the systems that drive the cars, et cetera. >> I think not just the diversity in data, but the diversity in the representation of that data are equally powerful. We need both. Because we cannot tackle diverse data, diverse experiences with only a single representation. We need multiple representation to be able to tackle that data. And this is how we will overcome bias of every sort. So it's not the question of who is going to build the AI models, it is a question of who is going to build the models, but not the question of will the AI models be built, because the AI models are already being built, but some of the models have biases into it from any kind of lack of representation. Like who's building the model, right? So I think it's very important. I think we have a powerful moment in history to change that, to make real impact. >> Because the trick is we all have bias. You can't do anything about it. We grew up in the world in which we grew up, we saw what we saw, we went to our schools, we had our family relationships et cetera. So everyone is locked into who they are. That's not the problem. The problem is the acceptance of bring in some other, (chuckles) and the combination will provide better outcomes, it's a proven scientific fact. >> I very much agree with that. I also think that having the freedom, having the choice to hear another person's conditioning, another person's experiences is very powerful, because that enriches our own experiences. Even if we are constrained, even if we are like that storage that has been structured and processed, we know that there's this other storage, and we can figure out how to get the freedom between the two point of views, right? And we have the freedom to choose. So that's very, very powerful, just having that freedom. >> So as we get ready to turn the calendar on 2017, which is hard to imagine it's true, it is. You look to 2018, what are some of your personal and professional priorities, what are you looking forward to, what are you working on, what's top of mind for Janet George? >> So right now I'm thinking about genetic algorithms, genetic machine learning algorithms. This has been around for a while, but I'll tell you where the power of genetic algorithms is, especially when you're creating powerful new technology memory cell. So when you start out trying to create a new technology memory cell, you have materials, material deformations, you have process, you have hundred permutation combination, and the genetic algorithms, we can quickly assign a cause function, and we can kill all the survival of the fittest, all that won't fit we can kill, arriving to the fastest, quickest new technology node, and then from there, we can scale that in mass production. So we can use these survival of the fittest mechanisms that evolution has used for a long period of time. So this is biology inspired. And using a cause function we can figure out how to get the best of every process, every technology, all the coupling effects, all the master effects of introducing a program voltage on a particular cell, reducing the program voltage on a particular cell, resetting and setting, and the neighboring effects, we can pull all that together, so 600, 700 permutation combination that we've been struggling on and not trying to figure out how to quickly narrow down to that perfect cell, which is the new technology node that we can then scale out into tens of millions of vehicles, right? >> Right, you're going to have to >> Getting to that spot. >> You're going to have to get me on the whiteboard on that one, Janet. That is amazing. Smart lady. >> Thank you. >> Thanks for taking a few minutes out of your time. Always great to catch up, and it was terrific to see you at Grace Hopper as well. >> Thank you, I really appreciate it, I appreciate it very much. >> All right, Janet George, I'm Jeff Frick. You are watching theCUBE. We're at Western Digital headquarters at Innovating to Fuel the Next Generation of Big Data. Thanks for watching.

Published Date : Oct 11 2017

SUMMARY :

the Next Decade of Big Data, in San Jose, California, it's the Almaden campus. the preconstructed queries that we do with business data, Right, cause there's been two things, right. of the data and assigning a schema to that data. and especially the Lambda architecture is very interesting, So what is the bound versus the unbound? the sensors constantly transmitting data to you, right? and the great example that you used and the data from the map as much as you can and it got it right 80% of the cases. and advancing the models, and I think that's, So it's not the question of who is going to Because the trick is we all have bias. having the choice to hear another person's conditioning, So as we get ready to turn the calendar on 2017, and the genetic algorithms, we can quickly assign You're going to have to get me on the whiteboard and it was terrific to see you at Grace Hopper as well. I appreciate it very much. at Innovating to Fuel the Next Generation of Big Data.

ENTITIES

Entity	Category	Confidence
Janet George	PERSON	0.99+
Jeff	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Janet	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
80%	QUANTITY	0.99+
two things	QUANTITY	0.99+
2018	DATE	0.99+
last week	DATE	0.99+
2017	DATE	0.99+
Goldman Sachs	ORGANIZATION	0.99+
San Jose, California	LOCATION	0.99+
two systems	QUANTITY	0.99+
two	QUANTITY	0.99+
today	DATE	0.99+
both	QUANTITY	0.99+
seven billion dollar	QUANTITY	0.99+
Fei-Fei Li	PERSON	0.98+
Almaden	LOCATION	0.98+
two point	QUANTITY	0.97+
one	QUANTITY	0.97+
first	QUANTITY	0.95+
Grace Hopper	ORGANIZATION	0.95+
theCUBE	ORGANIZATION	0.95+
hundred permutation	QUANTITY	0.95+
MAMR	ORGANIZATION	0.94+
Women in Data Science	ORGANIZATION	0.91+
tens of millions of vehicles	QUANTITY	0.9+
one of	QUANTITY	0.89+
Kappa	ORGANIZATION	0.89+
Dr.	PERSON	0.88+
single representation	QUANTITY	0.83+
a couple years ago	DATE	0.83+
earlier today	DATE	0.82+
Next Decade	DATE	0.81+
Lambda	TITLE	0.8+
101	OTHER	0.8+
600, 700 permutation	QUANTITY	0.77+
Lambda	ORGANIZATION	0.7+
of data	QUANTITY	0.67+
keynotes	QUANTITY	0.64+
Hadoop Summit	EVENT	0.62+
zero	QUANTITY	0.6+
number	OTHER	0.55+
Delta	OTHER	0.54+
two	OTHER	0.35+

Mark Grace, Western Digital | Western Digital the Next Decade of Big Data 2017

>> Announcer: Live from San Jose, California, it's theCUBE, covering Innovating to Fuel the Next Decade of Big Data, brought to you by Western Digital. >> Hey welcome back everybody, Jeff Frick here with theCUBE. We're at Western Digital's headquarters in San Jose, California at the Almaden campus. Lot of innovation's been going on here, especially in storage for decades, and we're excited to be at this special press and analyst event that Western Digital put on today to announce some exciting new products. It's called Innovating to Fuel the Next Decade of Data. I'm super happy to have a long-time industry veteran, he just told me, 35 years, I don't know if I can tell (Mark laughs) that on air or not. He's Mark Grace, he's the Senior Vice President of Devices for Western Digital, Mar, great to have you on. >> Thanks Jeff, glad to be here. >> Absolutely, so you've seen this movie over and over and over, I mean that's one of the cool things about being in the Valley, is this relentless pace of innovation. So how does today's announcement stack up as you kind of look at this versus kind of where we've come from? >> Oh I think this is maybe one of the, as big as it comes, Jeff, to be honest. I think we've plotted a course now that I think was relatively uncertain for the hard drive industry and the data center, and plotted a course that I think we can speak clearly to the market, and clearly to customers about the value proposition for rotating magnetic storage for decades to come. >> Which is pretty interesting, 'cause, you know, rotating drives have been taking a hit over the last couple of years, right, flash has been kind of the sexy new kid on the block, so this is something new, >> Mark: It is. >> And a new S curve I think as John said. >> I agree, we're jumping onto a, we're extending the S curve, let's call it that. I think there's actually plenty of other S curve opportunities for us, but in this case, I think the industry, and I would say our customer base, we have been less than clear with those guys about how we see the future of rotating storage in the cloud and enterprise space, and I think today's announcement clarifies that and gives some confidence about architectural decisions relative to rotating storage going forward for a long time. >> Well I think it's pretty interesting, 'cause compared to the other technology that was highlighted, the other option, the HAMR versus the MAMR, this was a much more elegant, simpler way to add this new S curve into an existing ecosystem. >> You know, elegant's probably a good word for it, and it's always the best solution I would say. HAMR's been a push for many years. I can't remember the first time I heard about HAMR. It's still something we're going to continue to explore and invest in, but it has numerous hurdles compared to the simplicity and elegance, as you say, of MAMR, not the least of which is we're going to operate at normal ambient temperatures versus apply tremendous heat to try and energize the recording and the technologies. So any time you introduce extraordinary heat you face all kinds of ancillary engineering challenges, and this simplifies those challenges down to one critical innovation, which is the oscillator. >> Pretty interesting, 'cause it seems pretty obvious that heat's never a good thing. So it's curious that in the quest for this next S curve that the HAMR path was pursued for as long as it was, it sounds like, because it sounds like that's a pretty tough thing to overcome. >> Yeah, I think it initially presented perhaps the most longevity perhaps in early exploration days. I would say that HAMR has certainly received the most press as far as trying to assert it as the extending recording technology for enterprise HDDs. I would say we've invested for almost as long in MAMR, but we've been extremely quiet about it. This is kind of our nature. When we're ready to talk about something, you can kind of be sure we're ready to go with it, and ready to think about productization. So we're quite confident in what we're doing. >> But I'm curious from your perspective, having been in the business a long time, you know, we who are not directly building these magical machines, just now have come to expect that Moore's Law will contain, has zero to do with semiconductor physics anymore, it's really an attitude and this relentless pace of innovation that now is expected and taken for granted. You're on the other side, and have to face real physics and mechanical limitations of the media and the science and everything else. So is that something that gets you up every day >> Mark: Keeps me awake every night! >> Obviously keeps you awake at night and up every day. You've been doing it for 35 years, so there must be some appeal. >> Yeah. (laughs) >> But you know, it's a unique challenge, 'cause at the same time not only has it got to be better and faster and bigger, it's got to be cheaper, and it has been. So when you look at that, how does that kind of motivate you, the teams here, to deliver on that promise? >> Yeah, I mean in this case, we are a little bit defensive, in the sense of the flash expectations that you mentioned, and I think as we digest our news today, we'll be level setting a little bit more in a more balanced way the expectations for contribution from rotating magnetic storage and solid state storage to what I think is a more accurate picture of its future going forward in the enterprise and hyperscale space. To your point about just relentless innovation, a few of us were talking the other day in advance of this announcement that this MAMR adventure feels like the early days of PMR, perpendicular, the current recording technology. It feels like we understand a certain amount of distance ahead of us, and that's about this four-terabit per inch kind of distance, but it feels like the early days where we could only see so far but the road actually goes much further, and we're going to find more and more ways to extend this technology, and keep that order of magnitude cost advantage going from a hard drive standpoint versus flash. >> I wonder how this period compares to that, just to continue, in terms of on the demand side, 'cause you know, back in the day, the demand and the applications for these magical compute machines weren't near, I would presume, as pervasive as now, or am I missing the boat? 'Cause now clearly there is no shortage of demand for storage and compute. >> Yeah, depending on where you're coming from, you could take two different views of that. The engine that drove the scale of the hard drive industry to date has, a big piece of it in the long history of the hard drive industry has been the PC space. So you see that industry converting to flash and solid state storage more aggressively, and we embrace that, you know we're invested in flash and we have great products in that space, and we see that happening. The opportunity in the hyperscale and cloud space is we're only at the tip of the iceberg, and therefore I think, as we think about this generation, we think about it differently than those opportunities in terms of breadth of applications, PCs, and all that kind of create the foundation for the hard drive, but what we see here is the virtuous cycle of more storage, more economical storage begets more value proposition, more opportunities to integrate more data, more data collection, more storage. And that virtuous cycle seems to me that we're just getting started. So long live data, that's what we say. (both laugh) >> The other piece that I find interesting is before the PCs were the driver of scale relative to an enterprise data center, but with the hyperscale guys and the proliferation of cloud and actually the growth of PCs is slowing down dramatically, that it's kind of flipped the bit. Now the data centers themselves have the scale to drive >> Absolutely. >> the scale innovation that before was before was really limited to either a PC or a phone or some more consumer device. >> Absolutely the case. When you take that cross-section of hard drive applications, that's a hundred percent the case, and in fact, we look at the utilization as a vertically integrated company we look at our media facilities for the disks, we look at our wafer facilities for heads, and those facilities as we look forward are going to be as busy as busier than they've ever been. I mean the amount of data is relative to the density as well as disks and heads and how many you can employ. So we see this in terms of fundamental technology and component construction, manufacturing, busier than it's ever been. We'll make fewer units. I mean there will be fewer units as they become bigger and denser for this application space, but the fundamental consumption of magnetic recording technology and components is all-time records. >> Right. And you haven't even talked about the software-defined piece that's dragging the utilization of that data across multiple applications. >> And it's just one of these that come in to help everybody there too, yeah. >> Jeff: You got another 35 years more years in you? (both laugh) >> I hope so. >> All right. >> But that would be the edge of it, I think. >> All right, we're going to take Mark Grace here, only 35 more years, Lord knows what he'll be working on. Well Mark, thanks for taking a few minutes and answering your prospective >> No that's fine, thanks a lot. >> Absolutely, Mark Grace, I'm Jeff Frick, you're watching theCUBE from Western Digital headquarters in San Jose, California. Thanks for watching. >> Mark: All right.

Published Date : Oct 11 2017

SUMMARY :

the Next Decade of Big Data, in San Jose, California at the Almaden campus. and over, I mean that's one of the cool things and clearly to customers about the value proposition in the cloud and enterprise space, the HAMR versus the MAMR, and it's always the best solution I would say. So it's curious that in the quest for this next S curve and ready to think about productization. and mechanical limitations of the media and the science Obviously keeps you awake at night and up every day. 'cause at the same time not only has it got to be in the sense of the flash expectations that you mentioned, and the applications for these magical compute machines PCs, and all that kind of create the foundation and actually the growth of PCs is slowing down dramatically, the scale innovation I mean the amount of data is relative to the density piece that's dragging the utilization of that data that come in to help everybody there too, yeah. and answering your prospective No that's fine, in San Jose, California.

ENTITIES

Entity	Category	Confidence
Jeff Frick	PERSON	0.99+
Jeff	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
John	PERSON	0.99+
Mark Grace	PERSON	0.99+
35 years	QUANTITY	0.99+
Mark	PERSON	0.99+
San Jose, California	LOCATION	0.99+
Almaden	LOCATION	0.99+
Mar	PERSON	0.98+
both	QUANTITY	0.98+
35 more years	QUANTITY	0.97+
today	DATE	0.97+
first time	QUANTITY	0.97+
theCUBE	ORGANIZATION	0.96+
one	QUANTITY	0.95+
hundred percent	QUANTITY	0.94+
zero	QUANTITY	0.94+
four-terabit per inch	QUANTITY	0.93+
two different views	QUANTITY	0.93+
years	DATE	0.92+
HAMR	ORGANIZATION	0.91+
Moore	PERSON	0.9+
both laugh	QUANTITY	0.9+
35 years more years	QUANTITY	0.88+
decades	QUANTITY	0.86+
2017	DATE	0.75+
one critical innovation	QUANTITY	0.71+
Innovating to Fuel the Next Decade of	EVENT	0.7+
MAMR	OTHER	0.7+
last couple	DATE	0.66+
Next Decade	TITLE	0.62+
Innovating to Fuel the Next Decade of Big Data	TITLE	0.6+
Senior	PERSON	0.53+
President	PERSON	0.53+
HAMR	OTHER	0.45+
Data	TITLE	0.44+
MAMR	ORGANIZATION	0.38+

Dave Tang, Western Digital | Western Digital the Next Decade of Big Data 2017

(upbeat techno music) >> Announcer: Live from San Jose, California it's theCUBE, covering Innovating to Fuel the Next Decade of Big Data, brought to you by Western Digital. >> Hey, welcome back everybody. Jeff Frick here at theCUBE. We're at the Western Digital Headquarters off Almaden down in San Jose, a really important place. Western Digital's been here for a while, their headquarters. A lot of innovation's been going on here forever. So we're excited to be here really for the next generation. The event's called Innovating to Fuel the Next Generation of big data, and we're joined by many time Cuber, Dave Tang. He is the SVP in corporate marketing from Western Digital. Dave, always great to see you. >> Yeah. Always great to be here, Jeff. Thanks. >> Absolutely. So you got to MC the announcement today. >> Yes. >> So for the people that weren't there, let's give them a quick overview on what the announcement was and then we can dive in a little deeper. >> Great, so what we were announcing was a major breakthrough in technology that's going to allow us to drive the increase in capacity in density to support big data for the next decade and beyond, right? So capacities and densities had been starting to level off in terms of hard drive technology capability. So what we announced was microwave-assisted magnetic recording technology that will allow us to keep growing that areal density up and reducing the cost per terabyte. >> You know, it's fascinating cause everyone loves to talk about Moore's Law and have these silly architectural debates, whether Moore's Law is alive or dead, but, as anyone who's lived here knows, Moore's Law is really an attitude much more it is than the specific physics of microprocessor density growth. And it's interesting to see. As we know the growth of data is growing in giant and the types of data, and not only regular big data, but now streaming data are bigger and bigger and bigger. I think you talked about stuff coming off of people and machines compared to business data is way bigger. >> Right. >> But you guys continue to push limits and break through, and even though we expect everything to be cheaper, faster, and better, you guys actually have to execute it-- >> Dave: Right. >> Back at the factory. >> Right, well it's interesting. There's this healthy tension, right, a push and pull in the environment. So you're right, it's not just Moore's Law that's enabling a technology push, but we have this virtuous cycle, right? We've realized what the value is of data and how to extract the possibilities and value of data, so that means that we want to store more of that data and access more of that data, which drives the need for innovation to be able to support all of that in a cost effective way. But then that triggers another wave of new applications, new ways to tap into the possibilities of data. So it just feeds on itself, and fortunately we have great technologists, great means of innovation, and a great attitude and spirit of innovation to help drive that. >> Yeah, so for people that want more, they can go to the press releases and get the data. We won't dive deep into the weeds here on the technology, but I thought you had Janet George speak, and she's chief data scientist. Phenomenal, phenomenal big brain. >> Dave: Yes. >> A smart lady. But she talked about, from her perspective we're still just barely even getting onto this data opportunity in terms of automation, and we see over and over at theCUBE events, innovation's really not that complicated. Give more people access to the data, give them more access to the tools, and let them try things easier and faster and feel quick, there's actually a ton of innovation that companies can unlock within their own four walls. But the data is such an important piece of it, and there's more and more and more of this. >> Dave: Right, right. >> What used to be digital exhaust now is, I think maybe you said, or maybe it was Dave, that there's a whole economy now built on data like we used to do with petroleum. I thought that was really insightful. >> Yeah, right. It's like a gold mine. So not only are the sources of data increasing, which is driving increased volume, but, as Janet was alluding to, we're starting to come up with the tools and the sophistication with machine learning and artificial intelligence to be able to put that data to new use as well as to find the pieces of data to interconnect, to drive these new capabilities and new insights. >> Yeah, but unlike petroleum it doesn't get used up. I mean that's the beauty of data. (laughing) >> Yeah, that's right. >> It's a digital process that can be used over and over and over again. >> And a self-renewing, renewing resource. And you're right, in that sense that it's being used over and over again that the longevity of that data, the use for life is growing exponentially along with the volume. >> Right, and Western Digital's in a unique position cause you have systems and you have big systems that could be used in data centers, but you also have the media that powers a whole bunch of other people's systems. So I thought one of the real important announcements today was, yes it's an interesting new breakthrough technology that uses energy assist to get more density on the drives, but it's done in such a way that the stuff's all backward compatible. It's plug and play. You've got production scheduled in a couple years I think with test out the customers-- >> Dave: That's right. >> Next year. So, you know, that is such an important piece beyond the technology. What's the commercial acceptance? What are the commercial barriers? And this sounds like a pretty interesting way to skin that cow. >> Right, often times the best answers aren't the most complex answers. They're the more elegant and simplistic answers. So it goes from the standpoint of a user being able to plug and play with older systems, older technologies. That's beautiful, and for us, to be able to, the ability to manufacture it in high volume reliably and cost effectively is equally as important. >> And you also talked, which I think was interesting, is kind of the relationship between hard drives and flash, because, obviously, flash is a, I want to say the sexy new toy, but it's not a sexy new toy anymore. >> Right. >> It's been around for a while, but, with that pressure on flash performance, you're still seeing the massive amounts of big data, which is growing faster than that. And there is a rule for the high density hard drives in that environment, and, based on the forecast you shared, which I'm presuming came from IDC or people that do numbers for a living, still a significant portion of a whole lot of data is not going to be on flash. >> Yeah, that's right. I think we have a tendency, especially in technology, to think either or, right? Something is going to take over from something else, but in this case it's definitely an and, right. And a lot of that is driven by this notion that there's fast data and big data, and, while our attention seems to shift over to maybe some fast data applications like autonomous vehicles and realtime applications, surveillance applications, there's still a need for big data because the algorithms that drive those realtime applications have to come from analysis of vast amounts of data. So big data is here to stay. It's not going away or shifting over. >> I think it's a really interesting kind of cross over, which Janet talked about too, where you need the algorithms to continue sharing the system that are feeding, continuing, and reacting to the real data, but then that just adds more vocabulary to their learning set so they can continue to evolve overtime. >> Yeah, what really helps us out in the market place is that because we have technologies and products across that full spectrum of flash and rotating magnetic recording, and we sell to customers who buy devices as well as platforms and systems, we see a lot of applications, a lot of uses of data, and we're able to then anticipate what those needs are going to be in the near future and in the distant future. >> Right, so we're getting towards the end of 2017, which I find hard to say, but as you look forward kind of to 2018 and this insatiable desire for more storage, cause this insatiable creation of more data, what are some of your priorities for 2018 and what are you kind of looking at as, like I said, I can't believe we're going to actually flip the calendar here-- >> Dave: Right, right. >> In just a few short months. (laughing) >> Well, I think for us, it's the realization that all these applications that are coming at us are more and more diverse, and their needs are very specialized. So it's not just the storage, although we're thought of as a storage company, it's not just about the storage of that data, but you have contrive complete environments to capture and preserve and access and transform that data, which means we have to go well beyond storage and think about how that data is accessed, technical interfaces to our memory products as well as storage products, and then where compute sits. Does it still sit in a centralized place or do you move compute to out closer to where the data sits. So, all this innovation and changing the way that we think about how we can mine that data is top of the mind for us for the next year and beyond. >> It's only job security for you, Dave. (laughing) >> Dave: Funny to think about. >> Alright. He's Dave Tang. Thanks for inviting us and again congratulations on the presentation. >> Always a pleasure. >> Alright, Dave Tang, I'm Jeff Frick. You're watching theCUBE from Western Digital headquarters in San Jose, California. Thanks for watching. (upbeat techno music)

Published Date : Oct 11 2017

SUMMARY :

brought to you by Western Digital. He is the SVP in corporate marketing Always great to be here, Jeff. So you got to MC the announcement today. So for the people that weren't there, and reducing the cost per terabyte. and machines compared to business data and how to extract the possibilities and get the data. Give more people access to the data, that there's a whole economy now the pieces of data to interconnect, I mean that's the beauty of data. It's a digital process that can be used that the longevity of that data, that the stuff's all backward compatible. What are the commercial barriers? the ability to manufacture it in high volume is kind of the relationship between hard drives and, based on the forecast you shared, So big data is here to stay. and reacting to the real data, in the near future and in the distant future. (laughing) So it's not just the storage, It's only job security for you, Dave. and again congratulations on the in San Jose, California.

ENTITIES

Entity	Category	Confidence
Janet	PERSON	0.99+
Janet George	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Dave Tang	PERSON	0.99+
Dave	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Jeff	PERSON	0.99+
2018	DATE	0.99+
San Jose, California	LOCATION	0.99+
Almaden	LOCATION	0.99+
Next year	DATE	0.99+
San Jose	LOCATION	0.99+
next year	DATE	0.99+
today	DATE	0.98+
next decade	DATE	0.97+
one	QUANTITY	0.95+
Moore's Law	TITLE	0.91+
Western Digital	LOCATION	0.89+
Cuber	PERSON	0.88+
theCUBE	ORGANIZATION	0.88+
end of 2017	DATE	0.88+
Innovating to Fuel the Next Generation of	EVENT	0.81+
IDC	ORGANIZATION	0.8+
2017	DATE	0.8+
Moore's	TITLE	0.76+
Moore	PERSON	0.72+
months	QUANTITY	0.68+
couple years	QUANTITY	0.65+
lot of applications	QUANTITY	0.61+
four walls	QUANTITY	0.6+
Innovating to Fuel the Next Decade of Big Data	EVENT	0.59+
Next Decade	DATE	0.58+

Mike Cordano, Western Digital | Western Digital the Next Decade of Big Data 2017

>> Announcer: Live from San Jose, California, it's The Cube. Covering Innovating to Fuel the Next Decade of Big Data. Brought to you by Western Digital. >> Hey, welcome back everybody. Jeff Frick here with The Cube. We're at the Western Digital headquarters in San Jose, the Great Oaks Campus, a really historic place in the history of Silicon Valley and computing. It's The Innovating to Fuel the Next Generation of Big Data event with Western Digital. We're really excited to be joined by our next guest, Mike Cordano. He's the president and chief operating officer of Western Digital. Mike, great to see you. >> Great to see you as well. Happy you guys could be here. It's an exciting day. >> Absolutely. First off, I think the whole merger thing is about done, right? That's got to feel good. >> Yeah, it's done, but there's legs to it, right? So we've combined these companies now, three of them, three large ones, so obviously Western Digital and Hitachi Global Storage, now we've added SanDisk into one Western Digital, so we're all together. Obviously more to do, as you expect in a large scale integration. There will be a year or two of bringing all those business processes and systems together, but I got to say, the teams are coming together great, showing up in our financial performance and our product execution, so things are really coming together. >> Yeah, not an easy task by any stretch of the imagination. >> No, not easy, but certainly a compliment to our team. I mean, we've got great people. You know, like anything, if you can harness the capabilities of your team, there's a lot you can accomplish, and it really is a compliment to the team. >> Excellent. Well, congratulations on that, and talking a bit about this event here today, you've even used "Big Data" in the title of the event, so you guys are obviously in a really unique place, Western Digital. You make systems, big systems. You also make the media that feeds a lot of other people's systems, but as the big data grows, the demand for data grows, it's got to live somewhere, so you're sitting right at the edge where this stuff's got to sit. >> Yeah, that's right, and it's central to our strategy, right? So if you think about it, there's three fundamental technologies that we think are just inherent in all of the evolution of compute and IT architecture. Obviously, there is compute, there is storage or memory, and then there's sort of movement, or interconnect. We obviously live in the storage or memory node, and we have a very broad set of capabilities, all the way from rotating magnetic media, which was our heritage, now including non-volatile memory and flash, and that's just foundational to everything that is going to come, and as you said, we're not going to stop there. It's not just a devices or component company, we're going to continue to innovate above that into platforms and systems, and why that becomes important to us, is there's a lot of technology innovation we can do that enhances the offering that we can bring to market when we control the entire technology stat. >> Right. Now, we've had some other guests on and people can get more information on the nitty-gritty details of the announcement today, the main announcement. Basically, in a nutshell, enabling you to get a lot more capacity in hard drives. But, I thought in your opening remarks this morning, there were some more high-level things I wanted to dig into with you, and specifically, you made an analogy of the data economy, and compared it to the petroleum economy. I've never... A lot of times, they talk about big data, but no one really talks about it, that I've heard, in those terms, because when you think about the petroleum economy, it's so much more than fuel and cars, and the second-order impacts, and the third-order impacts on society are tremendous, and you're basically saying, "We're going to "do this all over again, but now it's based on data." >> Yeah, that's right, and I think it puts it into a form that people can understand, right? I think it's well-proven what happened around petroleum, so the discovery of petroleum, and then the derivative industries, whether it be automobiles, whether it be plastics, you pick it, the entire economy revolved around, and, to some degree, still revolves around petroleum. The same thing will occur around data. You're seeing it with investments, you hear now things like machine learning, or artificial intelligence, that is all ways to transform and mine data to create value. >> Right. >> And we're going to see industries change rapidly. Autonomous cars, that's going to be enabled by data, and capabilities here, so pick your domain. There's going to be innovation across a lot of fronts, across a lot of traditional vertical industries, that is all going to be about data and driven by data. >> It's interesting what Janet, Doctor Janet George talked about too a little bit is the types of data, and the nozzles of the data is also evolving very quickly from data at rest to data in motion, to real-time analytics, to, like you said, the machine learning and the AI, which is based on modeling prior data, but then ingesting new data, and adjusting those models so even the types and the rate and the speed of the data is under dramatic change right now. >> Yeah, that's right, and I think one of the things that we're helping enable is you kind of get to this concept of what do you need to do to do what you describe? There has to be an infrastructure there that actually enables it. So, when you think about the scale of data we're dealing with, that's one thing that we're innovating around, then the issue is, how do you allow multiple applications to simultaneously access and update and transform that? Those are all problems that need to be solved in the infrastructure to enable things like AI, right? And so, where we come into play, is creating that infrastructure layer that actually makes that possible. The other thing I talked about briefly in the Q and A was, think about the problem of a future where the data set is just too large to actually move it in a substantive way to the compute. We actually have to invert that model over time architecturally, and bring the compute to the data, right? Because it becomes too complicated and too expensive to move from the storage layer up to compute and back, right? That is a complex operation. That's why those three pillars of technology are so important. >> And you've talked, and we're seeing in the Cloud right, because this continuing kind of atomization, atomic, not automatic, but making these more atomic. A smaller unit that the Cloud has really popularized, so you need a lot, you need a little, really, by having smaller bits and bytes, it makes that that much more easy. But another concept that you delved into a little was fast data versus big data, and clearly flash has been the bright, shiny object for the last couple years, and you guys play in that market as well, but it is two very different ways to think of the data, and I thought the other statistic that was shared is you know, the amount of data coming off of the machines and people dwarfs the business data, which has been the driver of IT spend for the last several decades. >> Yeah, no, that's right, and sort of that... You think about that, and the best analogy is a broader definition of IOT, right? Where you've got all of these censors, whether it be camera censors, because that's just a censor, creating an image or a video, or if it's more industrialized too, you've got all these sources of data, and they're going to proliferate at an exponential rate, and our ability to aggregate that in some sort of an organized way, and then act upon it, again, let's use the autonomous car as the example. You've got all these censors that are in constant motion. You've got to be able to aggregate the data, and make decisions on it at the edge, so that's not something... You can't deal with latency up to the Cloud and back, if it's an automobile, and it needs to make an instantaneous decision, so you've got to create that capability locally, and so when you think about the evolution of all this, it's really the integration of the Cloud, which, as Janet talked about, is the ability to tap into this historical or legacy data to help inform a decision, but then there's things happening out at the edge that are real time, and you have to have the capability to ingest the content, make a decision on it very quickly, and then act on it. >> Right. There's a great example. We went to the autonomous... Just navigation for the autonomous vehicles. It's own subset that I think Goldman-Sachs said it a seven billion dollar industry in the not-too-distant future, and the great example is this combination of the big data and the live data is, when they actually are working on the road. So you've got maps that tell you, and are updated, kind of what the road looks like, but on Tuesday, they were shifting the lane, and that particular lane now has cones in it, so the combination of the two is such a powerful thing. >> That's right. >> I want to dive into another topic we talked about, which is really architecting for the future. Unlike oil, data doesn't get consumed and is no longer available, right? It's a reusable asset, and you talked about classic stove-topping of data within an application center world where now you want that data available for multiple applications, so very different architecture to be able to use it across many fronts, some of which you don't even know yet. >> That's right. I think that's a key point. One of the things, when we talk to CEOs, or CIOs I should say, what they're realizing, to the extent you can enable a cost-effective mechanism for me to store and keep everything, I don't know how I'll derive value from it some time in the future, because as applications evolve, we're finding new insights into what can help drive decisions or innovation, or, to take it to health care, some sort of innovation that cures disease. That's one of the things that everybody wants to do. I want to build aggregate everything. If I could do that cost effectively enough, I'll find a way to get value out of it over time, and that's something where, when we're thinking about big data and what we talked about today, that's central to that idea, and enabling it. >> Right, and digital transformation, right, the hot buzz word, but we hear, time and time again, such a big piece of that is giving the democratization. Democratization of the data, so more people have access to it, democratization of the tools to manipulate that data, not just Mahogany Row super smart people, and then to have a culture that lets people actually try, experiment, fail fast, and there's a lot of innovation that would be unlocked right within your four walls, that probably are not being tapped into. >> Well, that's right, and that's something that innovation, and an innovation culture is something that we're working hard at, right? So if you think about Western Digital, you might think of us as, you know, legacy Western Digital as sort of a fast following, very operational-centric company. We're still good at those things, but over the last five years, we've really pushed this notion of innovation, and really sort of pressing in to becoming more influential in those feature architectures. That drives a culture that, if we think about the technical community, if we create the right sort of mix of opportunity, appetite for some risk, that allows the best creativity to come out of our technical... Innovating along these lines. >> Right, I'll give you the last word. I can't believe we're going to turn the calendar here on 2017, which is a little scary. As you look forward to 2018, what are some of your top priorities? What are you going to be working on as we come into the new calendar year? >> Yeah, so as we look into 2018 and beyond, we really want to drive this continued architectural shift. You'll see us be very active, and I think you talked about it, you'll see us getting increasingly active in this democratization. So we're going to have to figure out how we engage the broader open-source development world, whether it be hardware or software. We agree with that mantra, we will support that. Obviously we can do unique development, but with some hooks and keys that we can drive a broader ecosystem movement, so that's something that's central to us, and one last word would be, one of the things that Martin Fink has talked about which is really part of our plans as we go onto the new year, is really this inverting the model, where we want to continue to drive an architecture that brings compute to the storage and enables some things that just can't be done today. >> All right, well Mike Cordano, thanks for taking a few minutes, and congratulations on the terrific event. >> Thank you. Appreciate it. >> He's Mike Cordano, I'm Jeff Frick, you're watching The Cube, we're at Western Digital headquarters in San Jose, Great Oaks Campus, it's historic. Check it out. Thanks for watching.

Published Date : Oct 11 2017

SUMMARY :

Brought to you by Western Digital. It's The Innovating to Fuel the Next Generation of Big Data Great to see you as well. That's got to feel good. Obviously more to do, as you expect and it really is a compliment to the team. of the event, so you guys are obviously in a really unique that is going to come, and as you said, more information on the nitty-gritty details of the and mine data to create value. that is all going to be about data and driven by data. to real-time analytics, to, like you said, the machine architecturally, and bring the compute to the data, right? and people dwarfs the business data, which has been talked about, is the ability to tap into this historical now has cones in it, so the combination of the two to be able to use it across many fronts, some of which that's central to that idea, and enabling it. and then to have a culture that lets people actually and really sort of pressing in to becoming more influential the new calendar year? architecture that brings compute to the storage and enables and congratulations on the terrific event. Thank you. The Cube, we're at Western Digital headquarters in San Jose,

ENTITIES

Entity	Category	Confidence
Mike Cordano	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Janet	PERSON	0.99+
Jeff Frick	PERSON	0.99+
San Jose	LOCATION	0.99+
Janet George	PERSON	0.99+
Tuesday	DATE	0.99+
Mike	PERSON	0.99+
2018	DATE	0.99+
three	QUANTITY	0.99+
2017	DATE	0.99+
Goldman-Sachs	ORGANIZATION	0.99+
Silicon Valley	LOCATION	0.99+
Martin Fink	PERSON	0.99+
San Jose, California	LOCATION	0.99+
Hitachi Global Storage	ORGANIZATION	0.99+
One	QUANTITY	0.99+
seven billion dollar	QUANTITY	0.99+
two	QUANTITY	0.99+
SanDisk	ORGANIZATION	0.99+
The Cube	TITLE	0.99+
a year	QUANTITY	0.99+
today	DATE	0.98+
three pillars	QUANTITY	0.97+
three fundamental technologies	QUANTITY	0.97+
Great Oaks Campus	LOCATION	0.97+
one	QUANTITY	0.97+
First	QUANTITY	0.95+
two very different ways	QUANTITY	0.94+
one thing	QUANTITY	0.93+
this morning	DATE	0.92+
Next Decade	DATE	0.92+
three large	QUANTITY	0.91+
one last word	QUANTITY	0.91+
Doctor	PERSON	0.87+
The Cube	ORGANIZATION	0.84+
third-order	QUANTITY	0.84+
last couple years	DATE	0.83+
Mahogany Row	ORGANIZATION	0.82+
second-order	QUANTITY	0.81+
Cloud	TITLE	0.76+
last five years	DATE	0.7+
last several decades	DATE	0.69+
four walls	QUANTITY	0.64+
Big Data	EVENT	0.49+

Brendan Collins, Western Digital | Western Digital the Next Decade of Big Data 2017

>> Male voiceover: Live from San Jose California, it's the Cube, covering Innovating to Fuel the Next Decade of Big Data. Brought to you by Western Digital. >> Hey, welcome back everybody, Jeff Frick here with the Cube. We're at the Western Digital World Headquarters It's the Almaden Campus in San Jose. If you know anything about the tech world, you know there's a lot of innovation that's been happening on this campus for years and years and years. Big announcement today called Innovating to Fuel the Next Generation of Big Data. Lot of exciting announcements and here to join us to tell us all about it is Brendan Collins. He's the Vice President of Product Marketing Devices for Western Digital. Brendan, great to see you. >> Thank you, glad to be here. >> Absolutely so, really exciting announcement. You know, I've talked to Kim Stevenson at Intel, we had an interview talking about Moore's law. And one thing she really reinforced is that Moore's law is really more of an attitude than it is specifically physics, and whether you want to argue the physics is one thing, but the attitude for innovation, to continue to deliver a lot more for less, just continues, continues, and continues, and you guys announced a huge step in that direction today. >> Yeah, we have a challenge that storage is growing at a rate of about 40 percent per year. And budgets from the data centers are not growing, right? So the challenge is for us to develop new technologies that allow us to stay on the technology curve, and cut costs and do that efficiently. >> Then this is a big one, so let's jump in. So actually it was years ago I was actually at the event when you guys introduced the Helium drives, and that was a big deal there, and you've continued to kind of move that innovation but then you can see a plateau. And the density of this data, so you guys had to come up with something new. >> Yeah, what we've seen is that our PMR technology that we use currently is slowly running out of steam, right? So in order to come down the cost curve, we needed to boost areal density. And luckily we were able to come up with a new breakthrough in MAMR technology that will allow us to do that for the next decade. >> It's interesting in the talk, you talked about you guys could see this kind of coming and you actually put a lot of bets on the table, you didn't just bet on MAMR, you bet on HAMR, and you continued along a number of multiple tracks, and you've been at this for a while. What was kind of the innovation that finally gave you a breakthrough moment that got us to where we are today? >> Well, there were multiple technologies that we could have invested in, and we decided to continue on the two major ones which were HAMR and MAMR but we made a decision to invest in a process called, a head fabrication process called damascene that allowed us to extend the life of PMR for the last five to six years, and it's been in all the products we've been shipping since 2013. >> And you talked the areal density, so that's basically the amount of information we can put on the square inch of surface area And you've really, you attacked it on two vectors. One is how many tracks, just think of a record, how many tracks can you get on an album, in terms of the number of lines, and then how much density then you can have on each of those tracks. >> That's right, that's right. And you're now seeing major improvements on both of those factors. >> Well if you look at, we've had three enabling technologies in our products for the past three to four years, right. One is helium, one is micro actuation, and the other is the damascene process. Damascene and micro actuation actually push track density which enables higher capacity. But the newer technology that we're talking about, MAMR, addresses both factors. So we push the track density even tighter together, But we also boost the linear density at the same time, and we do that without adding cost. >> Right. The other thing you talked about, and I think it's a really important piece, right it's not only the technology breakthrough, but it's also how does that fit within the existing ecosystem of your customers, and obviously big giant data centers and big giant cloud providers, we actually have a show going on at a big cloud show right now, and this technology was innovative in that you've got a breakthrough on density, but not so crazy that you introduced a whole bunch of new factors into the ecosystem that would then have to be incorporated into all these systems, because you guys not only make your own systems, but you make the media that feeds a whole host of ecosystems, and that was a pretty important piece. >> If you look at some previous technologies we've introduced whether it be even 4K sectors in the industry, or shingled magnetic reporting, both of those require whole side modifications. Any time you have whole side modifications, it generally slows down the adoption, right? With HAMR, one of the challenges that we had was because of the concerns with thermals on the media, we needed a process called wear leveling, and that required whole software changes. In contrast, when we go to MAMR, everything is seamless, everything is transparent, and it's great. >> Right. I thought it was much simpler than that. I thought just heat is bad, HAMR is heat, and MAMR is microwave, and you know, heat and efficiency and data centers and all those, kind of again, system-level concerns; heat's never a good thing in electronics. >> Well, and in the case of MAMR versus HAMR, there's like an order of magnitude difference in the temperature on the disk, which is the key concern. >> And then of course as you mentioned in the key note, this is real, you've got sample units going on, correct me if I'm wrong, as early as next year >> That's right. >> you're hoping you'd be in scale production in 2020. Where some of these other competing technologies, there's really still no forecasted ship date on the horizon. >> Yeah, you can generate samples, you can build lower quantities of these HAMR drives, but you still have that big concern out there in front of you, how do I address the reliability, how do I address the complexity of all these new materials, and then if I got all of that to work, how do I do it commercially because of the cost additives. >> Right; so I just want to get your perspective before we let you go, you're busy, there's a high demand for your time, as you kind of think back and look at these increasing demands for storage, this increasing demand for computers, and I think one of the data points given is, you know, the data required for humans and machines and IOT is growing way way way way faster than business focused data which has been the driver of a lot of this stuff, if you just kind of sit back and take a look, you know, what are some of your thoughts because I'm sure not that long ago you could have never imagined that there would be the demand for the types of capacities that we're talking about now and we both know that when we sit down five years from now, ten years ago, you know, ten years from now, we're going to look back at today and think, you know, that was zero. >> Yeah, way back in the day there were just PCs and servers and there was traditional IT with rate, today with autonomous cars and IOT and AI and machine learning, it's just going to continue, so that exponential growth that you saw, there's no sign of that slowing down, which is good news for us. >> Yeah, good job security for you for sure. >> You bet! >> Alright Brendan, well, again, thanks for taking a few minutes to sit down and congratulations on the great event and the launch of these new products. >> Thank you, thank you. >> He's Brendan Collins, I'm Jeff Frick, you're watching the Cube from the Western Digital Headquarters in San Jose California. Thanks for watching.

Published Date : Oct 11 2017

SUMMARY :

Brought to you by Western Digital. and here to join us to tell us all about it and you guys announced a huge step in that direction today. and cut costs and do that efficiently. and that was a big deal there, that we use currently and you actually put a lot of bets on the table, and it's been in all the products and then how much density then you can have And you're now seeing major improvements and the other is the damascene process. but not so crazy that you introduced and that required whole software changes. and you know, heat and efficiency and data centers Well, and in the case of MAMR versus HAMR, Where some of these other competing technologies, and then if I got all of that to work, and we both know that when we sit down five years from now, so that exponential growth that you saw, for you for sure. and the launch of these new products. Western Digital Headquarters in San Jose California.

ENTITIES

Entity	Category	Confidence
Brendan Collins	PERSON	0.99+
Kim Stevenson	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Brendan	PERSON	0.99+
2020	DATE	0.99+
Intel	ORGANIZATION	0.99+
San Jose California	LOCATION	0.99+
Moore	PERSON	0.99+
two vectors	QUANTITY	0.99+
San Jose	LOCATION	0.99+
both	QUANTITY	0.99+
both factors	QUANTITY	0.99+
2013	DATE	0.99+
next year	DATE	0.99+
each	QUANTITY	0.98+
One	QUANTITY	0.98+
next decade	DATE	0.98+
one	QUANTITY	0.98+
today	DATE	0.97+
ten years ago	DATE	0.97+
Next Decade	DATE	0.95+
two major ones	QUANTITY	0.95+
three enabling technologies	QUANTITY	0.94+
zero	QUANTITY	0.94+
Almaden Campus	LOCATION	0.92+
MAMR	ORGANIZATION	0.92+
one thing	QUANTITY	0.91+
five years	QUANTITY	0.89+
years ago	DATE	0.89+
Vice President	PERSON	0.88+
four years	QUANTITY	0.85+
six years	QUANTITY	0.84+
about 40 percent per year	QUANTITY	0.8+
Cube	COMMERCIAL_ITEM	0.76+
years	QUANTITY	0.76+
2017	DATE	0.75+
Next	DATE	0.74+
HAMR	ORGANIZATION	0.73+
Marketing Devices	PERSON	0.72+
MAMR	OTHER	0.71+
Innovating	TITLE	0.7+
bunch of new factors	QUANTITY	0.68+
Western	LOCATION	0.67+
ten years	DATE	0.61+
three	QUANTITY	0.61+
helium	OTHER	0.6+
Innovating to Fuel	TITLE	0.59+
Helium	COMMERCIAL_ITEM	0.58+
Digital	ORGANIZATION	0.58+
five	QUANTITY	0.55+
Big Data	EVENT	0.53+
Decade	TITLE	0.5+
Big Data	TITLE	0.49+
past	DATE	0.45+
4K	OTHER	0.44+
last	QUANTITY	0.4+

Dave Tang, Western Digital – When IoT Met AI: The Intelligence of Things - #theCUBE

>> Presenter: From the Fairmont Hotel, in the heart of Silicon Valley, it's theCUBE. Covering When IoT Met AI The Intelligence of Things. Brought to you by Western Digital. >> Hey welcome back everybody, Jeff Frick here with theCUBE. We're in downtown San Jose at the Fairmont Hotel, at an event called When IoT Met AI The Intelligence of Things. You've heard about the internet of things, and on the intelligence of things, it's IoT, it's AI, it's AR, all this stuff is really coming to play, it's very interesting space, still a lot of start-up activity, still a lot of big companies making plays in this space. So we're excited to be here, and really joined by our host, big thanks to Western Digital for hosting this event with WDLabs' Dave Tang. Got newly promoted since last we spoke. The SVP of corporate marketing and communications, for Western Digital, Dave great to see you as usual. >> Well, great to be here, thanks. >> So I don't think the need for more storage is going down anytime soon, that's kind of my takeaway. >> No, no, yeah. If this wall of data just keeps growing. >> Yeah, I think the term we had yesterday at the Ag event that we were at, also sponsored by you, is really the flood of data using an agricultural term. But it's pretty fascinating, as more, and more, and more data is not only coming off the sensors, but coming off the people, and used in so many more ways. >> That's right, yeah we see it as a virtual cycle, you create more data, you find more uses for that data to harness the power and unleash the promise of that data, and then you create even more data. So, when that virtual cycle of creating more, and finding more uses of it, and yeah one of the things that we find interesting, that's related to this event with IoT and AI, is this notion that data is falling into two general categories. There's big data, and there's fast data. So, big data I think everyone is quite familiar with by this time, these large aggregated likes of data that you can extract information out of. Look for insights and connections between data, predict the future, and create more prescriptive recommendations, right? >> Right. >> And through all of that you can gain algorithms that help to make predictions, or can help machines run based on that data. So we've gone through this phase where we focused a lot on how we harness big data, but now we're taking these algorithms that we've gleaned from that, and we're able to put them in real time applications, and that's sort of been the birth of fast data, it's been really-- >> Right, the streaming data. We cover Spark Summit, we cover Flink, and New, a new kind of open source project that came out of Berlin. That some people would say the next generation of Spark, and the other thing, you know, good for you guys, is that it used to be, not only was it old data, but it was a sampling of old data. Now on this new data, and the data stream that's all of the data. And I would actually challenge, I wonder if that separation as you describe, will stay, because I got to tell you, the last little drive I bought, just last week, was an SSD drive, you know, one terabyte. I needed some storage, and I had a choice between spinning disc and not, and I went with the flat. I mean, 'cause what's fascinating to me, is the second order benefits that we keep hearing time, and time, and time again, once people become a data-driven enterprise, are way more than just that kind of top-level thing that they thought. >> Exactly, and that's sort of that virtual cycle, you got to taste, and you learn how to use it, and then you want more. >> Jeff: Right, right. >> And that's the great thing about the breadth of technologies and products that Western Digital has, is from the solid state products, the higher performance flash products that we have, to the higher capacity helium-filled drive technologies, as well as devices going on up into systems, we cover this whole spectrum of fast data and big data. >> Right, right. >> I'll give an example. So credit card fraud detection is an interesting area. Billions of dollars potentially being lost there. Well to learn how to predict when transactions are fraudulent, you have to study massive amounts of data. Billions of transactions, so that's the big data side of it, and then as soon as you do that, you can take those algorithms and run them in real time. So as transactions come in for authorization, those algorithms can determine, before they're approved, that one's fraudulent, and that one's not. Save a lot of time and processing for fraud claims. So that's a great example of once you learn something from big data, you apply it to the real-time realm, and it's quite dire right? And then that spawned you to collect even more data, because you want to find new applications and new uses. >> Right, and too kind of this wave of computing back and forth from the shared services computer, then the desktop computer, now it's back to the cloud, and then now it's-- >> Dave: Out with the edge. >> IoT, it's all about the edge. >> Yeah, right. >> And at the end of the day, it's going to be application-specific. What needs to be processed locally, what needs to be processed back at the computer, and then all the different platforms. We were again at a navigation for autonomous vehicles show, who knew there was such a thing that small? And even the attributes of the storage required in the ecosystem of a car, right? And the environmental conditions-- >> That's right. >> Is the word I'm looking for. Completely different, new opportunity, kind of new class of hardware required to operate in that environment, and again that still combines cloud and Edge, sensors and maps. So just the, I don't think that the man's going down David. >> Yeah, absolutely >> I think you're in a good spot. (Jeff laughing) >> You're absolutely right, and even though we try to simplify into fast data, and big data, and Core and Edge, what we're finding is that applications are increasingly specialized, and have specialized needs in terms of the type of data. Is it large amounts of data, is it streaming? You know, what are the performance characteristics, and how is it being transformed, what's the compute aspect of it? And what we're finding, is that the days of general-purpose compute and storage, and memory platforms, are fading, and we're getting into environments with increasingly specialized architectures, across all those elements. Compute, memory and storage. So that's what's really exciting to be in our spot in the industry, is that we're looking at creating the future by developing new technologies that continue to fuel that growth even further, and fuel the uses of data even further. >> And fascinating just the ongoing case of Moore's law, which I know is not, you know you're not making microprocessors, but I think it's so powerful. Moore's law really is a philosophy, as opposed to an architectural spec. Just this relentless pace of innovation, and you guys just continue to push the envelope. So what are your kind of priorities? I can't believe we're halfway through 2017 already, but for kind of the balance of the year kind of, what are some of your top-of-mind things? I know it's exciting times, you're going through the merger, you know, the company is in a great space. What are your kind of top priorities for the next several months? >> Well, so, I think as a company that has gone through serial acquisitions and integrations, of course we're continuing to drive the transformation of the overall business. >> But the fun stuff right? It's not to increase your staff (Jeff laughing). >> Right, yeah, that is the hardware. >> Stitching together the European systems. >> But yeah, the fun stuff includes pushing the limits even further with solid state technologies, with our 3D NAND technologies. You know, we're leading the industry in 64 layer 3D NAND, and just yesterday we announced a 96 layer 3D NAND. So pushing those limits even further, so that we can provide higher capacities in smaller footprints, lower power, in mobile devices and out on the Edge, to drive all these exciting opportunities in IoT an AI. >> It's crazy, it's crazy. >> Yeah it is, yeah. >> You know, terabyte SD cards, terabyte Micro SD cards, I mean the amount of power that you guys pack into these smaller and smaller packages, it's magical. I mean it's absolutely magic. >> Yeah, and the same goes on the other end of the spectrum, with high-capacity devices. Our helium-filled drives are getting higher and higher capacity, 10, 12, 14 terabyte high-capacity devices for that big data core, that all the data has to end up with at some point. So we're trying to keep a balance of pushing the limits on both ends. >> Alright, well Dave, thanks for taking a few minutes out of your busy day, and congratulations on all your success. >> Great, good to be here. >> Alright, he's Dave Tang from Western Digital, he's changing your world, my world, and everyone else's. We're here in San Jose, you're watching theCUBE, thanks for watching.

Published Date : Jul 3 2017

SUMMARY :

in the heart of Silicon Valley, it's theCUBE. and on the intelligence of things, is going down anytime soon, that's kind of my takeaway. If this wall of data just keeps growing. is not only coming off the sensors, and then you create even more data. and that's sort of been the birth of fast data, and the other thing, you know, good for you guys, and then you want more. And that's the great thing about the breadth and then as soon as you do that, And at the end of the day, and again that still combines cloud and Edge, I think you're in a good spot. is that the days of general-purpose compute and storage, but for kind of the balance of the year kind of, of the overall business. But the fun stuff right? in mobile devices and out on the Edge, I mean the amount of power that you guys pack that all the data has to end up with at some point. and congratulations on all your success. and everyone else's.

ENTITIES

Entity	Category	Confidence
Jeff Frick	PERSON	0.99+
Dave Tang	PERSON	0.99+
Jeff	PERSON	0.99+
San Jose	LOCATION	0.99+
Western Digital	ORGANIZATION	0.99+
Dave	PERSON	0.99+
12	QUANTITY	0.99+
10	QUANTITY	0.99+
Berlin	LOCATION	0.99+
Silicon Valley	LOCATION	0.99+
David	PERSON	0.99+
yesterday	DATE	0.99+
last week	DATE	0.99+
2017	DATE	0.99+
second order	QUANTITY	0.99+
both ends	QUANTITY	0.98+
Billions of dollars	QUANTITY	0.98+
one terabyte	QUANTITY	0.97+
Flink	ORGANIZATION	0.96+
The Intelligence of Things	TITLE	0.95+
14 terabyte	QUANTITY	0.95+
Ag	EVENT	0.94+
one	QUANTITY	0.94+
two general categories	QUANTITY	0.91+
European	OTHER	0.9+
theCUBE	ORGANIZATION	0.87+
#theCUBE	ORGANIZATION	0.87+
Billions of transactions	QUANTITY	0.87+
Fairmont Hotel	LOCATION	0.87+
WDLabs'	ORGANIZATION	0.81+
64	QUANTITY	0.81+
Spark Summit	EVENT	0.71+
96 layer	QUANTITY	0.67+
Moore	PERSON	0.66+
yone	PERSON	0.66+
next several months	DATE	0.64+
Core	ORGANIZATION	0.59+
Edge	TITLE	0.58+
terabyte	ORGANIZATION	0.55+
layer 3D	OTHER	0.55+
Spark	TITLE	0.46+
theCUBE	TITLE	0.42+
When IoT	TITLE	0.36+
3D	QUANTITY	0.26+

Janet George, Western Digital –When IoT Met AI: The Intelligence of Things - #theCUBE

(upbeat electronic music) >> Narrator: From the Fairmont Hotel in the heart of Silicon Valley, it's theCUBE. Covering when IoT met AI, The Intelligence of Things. Brought to you by Western Digital. >> Welcome back here everybody, Jeff Frick here with theCUBE. We are at downtown San Jose at the Fairmont Hotel. When IoT met AI it happened right here, you saw it first. The Intelligence of Things, a really interesting event put on by readwrite and Western Digital and we are really excited to welcome back a many time CUBE alumni and always a fan favorite, she's Janet George. She's Fellow & Chief Data Officer of Western Digital. Janet, great to see you. >> Thank you, thank you. >> So, as I asked you when you sat down, you're always working on cool things. You're always kind of at the cutting edge. So, what have you been playing with lately? >> Lately I have been working on neural networks and TensorFlow. So really trying to study and understand the behaviors and patterns of neural networks, how they work and then unleashing our data at it. So trying to figure out how it's training through our data, how many nets there are, and then trying to figure out what results it's coming with. What are the predictions? Looking at how the predictions are, whether the predictions are accurate or less accurate and then validating the predictions to make it more accurate, and so on and so forth. >> So it's interesting. It's a different tool, so you're learning the tool itself. >> Yes. >> And you're learning the underlying technology behind the tool. >> Yes. >> And then testing it actually against some of the other tools that you guys have, I mean obviously you guys have been doing- >> That's right. >> Mean time between failure analysis for a long long time. >> That's right, that's right. >> So, first off, kind of experience with the tool, how is it different? >> So with machine learning, fundamentally we have to go into feature extraction. So you have to figure out all the features and then you use the features for predictions. With neural networks you can throw all the raw data at it. It's in fact data-agnostic. So you don't have to spend enormous amounts of time trying to detect the features. Like for example, If you throw hundreds of cat images at the neural network, the neural network will figure out image features of the cat; the nose, the eyes, the ears and so on and so forth. And once it trains itself through a series of iterations, you can throw a lot of deranged cats at the neural network and it's still going to figure out what the features of a real cat is. >> Right. >> And it will predict the cat correctly. >> Right. So then, how does that apply to, you know, the more specific use case in terms of your failure analysis? >> Yeah. So we have failures and we have multiple failures. Some failures through through the human eye, it's very obvious, right? But humans get tired, and over a period of time we can't endure looking at hundreds and millions of failures, right? And some failures are interconnected. So there is a relationship between these failure patterns or there is a correlation between two failures, right? It could be an edge failure. It could a radial failure, eye pattern type failure. It could be a radial failure. So these failures, for us as humans, we can't escape. >> Right. >> And we used to be able to take these failures and train them at scale and then predict. Now with neural networks, we don't have to take and do all that. We don't have to extract these labels and try to show them what these failures look like. Training is almost like throwing a lot of data at the neural networks. >> So it almost sounds like kind of the promise of the data lake if you will. >> Yes. >> If you have heard about, from the Hadoop Summit- >> Yes, yes, yes. >> For ever and ever and ever. Right? You dump it all in and insights will flow. But we found, often, that that's not true. You need hypothesis. >> Yes, yes. >> You need to structure and get it going. But what you're describing though, sounds much more along kind of that vision. >> Yes, very much so. Now, the only caveat is you need some labels, right? If there is no label on the failure data, it's very difficult for the neural networks to figure out what the failure is. >> Jeff: Right. >> So you have to give it some labels to understand what patterns it should learn. >> Right. >> Right, and that is where the domain experts come in. So we train it with labeled data. So if you are training with a cat, you know the features of a cat, right? In the industrial world, cat is really what's in the heads of people. The domain knowledge is not so authoritative. Like the sky or the animals or the cat. >> Jeff: Right. >> The domain knowledge is much more embedded in the brains of the people who are working. And so we have to extract that domain knowledge into labels. And then you're able to scale the domain. >> Jeff: Right. >> Through the neural network. >> So okay so then how does it then compare with the other tools that you've used in the past? In terms of, obviously the process is very different, but in terms of just pure performance? What are you finding? >> So we are finding very good performance and actually we are finding very good accuracy. Right? So once it's trained, and it's doing very well on the failure patterns, it's getting it right 90% of the time, right? >> Really? >> Yes, but in a machine learning program, what happens is sometimes the model is over-fitted or it's under-fitted or there is bias in the model and you got to remove the bias in the model or you got to figure out, well, is the model false-positive or false-negative? You got to optimize for something, right? >> Right, right. >> Because we are really dealing with mathematical approximation, we are not dealing with preciseness, we are not dealing with exactness. >> Right, right. >> In neural networks, actually, it's pretty good, because it's actually always dealing with accuracy. It's not dealing with precision, right? So it's accurate most of the time. >> Interesting, because that's often what's common about the kind of difference between computer science and statistics, right? >> Yes. >> Computers is binary. Statistics always has a kind of a confidence interval. But what you're describing, it sounds like the confidence is tightening up to such a degree that it's almost reaching binary. >> Yeah, yeah, exactly. And see, brute force is good when your traditional computing programing paradigm is very brute force type paradigm, right? The traditional paradigm is very good when the problems are simpler. But when the problems are of scale, like you're talking 70 petabytes of data or you're talking 70 billion roles, right? Find all these patterns in that, right? >> Jeff: Right. >> I mean you just, the scale at which that operates and at the scale at which traditional machine learning even works is quite different from how neural networks work. >> Jeff: Okay. >> Right? Traditional machine learning you still have to do some feature extraction. You still have to say "Oh I can't." Otherwise you are going to have dimensionality issues, right? It's too broad to get the prediction anywhere close. >> Right. >> Right? And so you want to reduce the dimensionality to get a better prediction. But here you don't have to worry about dimensionality. You just have to make sure the labels are right. >> Right, right. So as you dig deeper into this tool and expose all these new capabilities, what do you look forward to? What can you do that you couldn't do before? >> It's interesting because it's grossly underestimating the human brain, right? The human brain is supremely powerful in all aspects, right? And there is a great deal of difficulty in trying to code the human brain, right? But with neural networks and because of the various propagation layers and the ability to move through these networks we are coming closer and closer, right? So one example: When you think about driving, recently, Google driverless car got into an accident, right? And where it got into an accident was the driverless car was merging into a lane and there was a bus and it collided with the bus. So where did A.I. go wrong? Now if you train an A.I., birds can fly, and then you say penguin is a bird, it is going to assume penguin can fly. >> Jeff: Right, right. >> We as humans know penguin is a bird but it can't fly like other birds, right? >> Jeff: Right. >> It's that anomaly thing, right? Naturally when are driving and a bus shows up, even if it's yield, the bus goes. >> Jeff: Right, right. >> We yield to the bus because it's bigger and we know that. >> A.I. doesn't know that. It was taught that yield is yield. >> Right, right. >> So it collided with the bus. But the beauty is now large fleets of cars can learn very quickly based on what it just got from that one car. >> Right, right. >> So now there are pros and cons. So think about you driving down Highway 85 and there is a collision, it's Sunday morning, you don't know about the collision. You're coming down on the hill, right? Blind corner and boom that's how these crashes happen and so many people died, right? If you were driving a driverless car, you would have knowledge from the fleet and from everywhere else. >> Right. >> So you know ahead of time. We don't talk to each other when we are in cars. We don't have universal knowledge, right? >> Car-to-car communication. >> Car-to-car communications and A.I. has that so directly it can save accidents. It can save people from dying, right? But people still feel, it's a psychology thing, people still feel very unsafe in a driverless car, right? So we have to get over- >> Well they will get over that. They feel plenty safe in a driverless airplane, right? >> That's right. Or in a driveless light rail. >> Jeff: Right. >> Or, you know, when somebody else is driving they're fine with the driver who's driving. You just sit in the driver's car. >> But there's that one pesky autonomous car problem, when the pedestrian won't go. >> Yeah. >> And the car is stopped it's like a friendly battle-lock. >> That's right, that's right. >> Well good stuff Janet and always great to see you. I'm sure we will see you very shortly 'cause you are at all the great big data conferences. >> Thank you. >> Thanks for taking a few minutes out of your day. >> Thank you. >> Alright she is Janet George, she is the smartest lady at Western Digital, perhaps in Silicon Valley. We're not sure but we feel pretty confident. I am Jeff Frick and you're watching theCUBE from When IoT meets AI: The Intelligence of Things. We will be right back after this short break. Thanks for watching. (upbeat electronic music)

Published Date : Jul 2 2017

SUMMARY :

Brought to you by Western Digital. We are at downtown San Jose at the Fairmont Hotel. So, what have you been playing with lately? Looking at how the predictions are, So it's interesting. behind the tool. So you have to figure out all the features So then, how does that apply to, you know, So these failures, for us as humans, we can't escape. at the neural networks. the promise of the data lake if you will. But we found, often, that that's not true. But what you're describing though, sounds much more Now, the only caveat is you need some labels, right? So you have to give it some labels to understand So if you are training with a cat, in the brains of the people who are working. So we are finding very good performance we are not dealing with preciseness, So it's accurate most of the time. But what you're describing, it sounds like the confidence the problems are simpler. and at the scale at which traditional machine learning Traditional machine learning you still have to But here you don't have to worry about dimensionality. So as you dig deeper into this tool and because of the various propagation layers even if it's yield, the bus goes. It was taught that yield is yield. So it collided with the bus. So think about you driving down Highway 85 So you know ahead of time. So we have to get over- Well they will get over that. That's right. You just sit in the driver's car. But there's that one pesky autonomous car problem, I'm sure we will see you very shortly 'cause you are Alright she is Janet George, she is the smartest lady

ENTITIES

Entity	Category	Confidence
Jeff	PERSON	0.99+
Jeff Frick	PERSON	0.99+
Janet George	PERSON	0.99+
Janet	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
90%	QUANTITY	0.99+
Silicon Valley	LOCATION	0.99+
one car	QUANTITY	0.99+
Highway 85	LOCATION	0.99+
Sunday morning	DATE	0.99+
two failures	QUANTITY	0.99+
70 billion roles	QUANTITY	0.99+
Google	ORGANIZATION	0.99+
CUBE	ORGANIZATION	0.98+
one example	QUANTITY	0.96+
The Intelligence of Things	TITLE	0.94+
hundreds of cat images	QUANTITY	0.93+
first	QUANTITY	0.92+
theCUBE	ORGANIZATION	0.84+
San Jose	LOCATION	0.8+
one pesky autonomous car	QUANTITY	0.77+
70 petabytes of data	QUANTITY	0.77+
hundreds and	QUANTITY	0.76+
IoT	ORGANIZATION	0.74+
millions of failures	QUANTITY	0.66+
Fairmont Hotel	LOCATION	0.66+
ollision	PERSON	0.65+
meets	TITLE	0.64+
#theCUBE	ORGANIZATION	0.57+
Hadoop Summit	EVENT	0.51+
of	TITLE	0.47+

Western Digital Taking the Cloud to the Edge, Panel 2 | DataMakesPossible

>> They are disruptive technologies. And if you think about the disruption that's happening in business, with IoT, with OT, and with big data, you can't get anything more disruptive to the whole of the business chain as this particular area. It's an area that I focused on myself, asking the question, should everything go to the cloud? Is that the new future? Is 90% of the computing going to go to the cloud with just little mobile devices right on the edge? Felt wrong when I did the math on it, I did some examples of real-world environments, wind farms, et cetera, it clearly was not the right answer, things need to be near the edge. And I think one of the areas to me that solidified it was when you looked at an area like video. Huge amounts of data, real important decisions being made on the content of that video, for example, recognizing a face, a white hat or a black hat. If you look at the technology, sending that data somewhere to do that recognition just does not make sense. Where is it going? It's going actually into the camera itself, right next to the data, because that's where you have the raw data, that's where you have the maximum granularity of data, that's where you need to do the processing of which faces are which, right close to the edge itself, and then you can send the other data back up to the cloud, for example, to improve those algorithms within that camera, to do all that sort of work on the batch basis over time, that's what I was looking at, and looking at the cost justification for doing that sort of work. So today, we've got a set people here on the panel, and we want to talk about coming down one level to where IoT and IT are going to have to connect together. So on the panel I've got, I'm going to get these names really wrong, Sanjeev Kumar? >> Yes, that's right. >> From FogHorn, could you introduce yourself and what you're doing where the data is meeting the people and the machines? >> Sure, sure, so my name is Sanjeev Kumar, I actually run engineering for a company called FogHorn Systems, we are actually bringing analytics and machine learning to the edge, and, so our goal and motto is to take computing to where the data is, than the other way around. So it's a two-year-old company that started, was incubated in the hive, and we are in the process of getting our second release of the product out shortly. >> Excellent, so let me start at the other end, Rohan, can you talk about your company and what contribution you're focusing on? >> Sure, I'm head product marketing for Maana, Maana is a startup, about three years old, what we're doing is we're offering an enterprise platform for large enterprises, we're helping the likes of Shell and Maersk and Chevron digitally transform, and that simply means putting the focus on subject matter experts, putting the focus on the people, and data's definitely an important part of it, but allowing them to bring their expertise into the decision flows, so that ultimately the key decisions that are driving the revenue for these behemoths, are made at a higher quality and faster. >> Excellent. Well, two software companies, we have a practitioner here who is actually doing fog computing, doing it for real, has been doing it for some time, so could you like, Janet George from Western Digital, can you introduce yourself, and say something from the trenches, of what's really going on? >> Okay, very good, thank you. I actually build infrastructure for the edge to deal with fog computing, and so for Western Digital, we're very lucky, because we are the largest storage manufacture, and we have what we call Internet of Things, and Internet of Test Equipment, and I process petabytes of data that comes out of the Internet of Things, which is basically our factories, and then I take these petabytes of data, I process them both on the cloud and then on the edge, but primarily, to be able to consume that data. And the way we consume that data is by building very high-profile models through artificial intelligence and machine learning, and I'll talk a lot more about that, but at the end of the day, it's all about consuming the data that you collect from anywhere, Internet of Things, computer equipment, data that's being produced through products, you have to figure out a way to compute that, and the cloud has many advantages and many trade-offs, and so we're going to talk about the trade-offs, that's where the gap for computing comes into play. >> Excellent, thanks very much. And last but not least, we have Val, and I can never pronounce your surname. >> Bercovici. >> Thank you. (chuckling) You are in the midst of a transition yourself, so talk about where you have been and where you're going. >> For the better part of this century, I've been with NetApp, working at various functions, obviously enterprise storage, and around 2008, my developer instinct kind of fired up, and this thing called cloud became very interesting to me. So I became a self-anointed cloud czar at NetApp, and I ended up initiating a lot of our projects which we know today as the NetApp Data Fabric, that culminated about 18 months ago, in acquisition of SolidFire, and I'm now the acting CTO of SolidFire, but I plan to retire from the storage industry at the end of our fiscal year, at the end of April, and I'm spending a lot of time with particularly the Cloud Native Compute Foundation, that is, the opensource home of Google's Kubernetes Technology and about seven other related projects, we keep adding some almost every month, and I'm starting to lose track, and spending a lot of time on the data gravity challenge. It's a challenge in the cloud, it's a particularly new and interesting challenge at the edge, and I look forward to talking about that. >> Okay, and data gravity is absolutely key, isn't it, it's extremely expensive and extremely heavy to move around. >> And the best analogy is workloads are like electricity, they move fairly easily and lightly, data's like water, it's really hard to move, particularly large bodies around. >> Great. I want to start with one question though, just in the problem, the core problem, particularly in established industries, of how do we get change to work? In an IT shop, we have enough problems dealing with operations and development. In the industrial world, we have the IT and the OT, who look at each other with less than pleasure, and mainly disdain. How do we solve the people problem in trying to put together solutions? You must be right in the middle of it, would you like to start with that question? >> Absolutely, so we are 26 years old, probably more than that, but we have very old and new mix of manufacturing equipment, it's a storage industry, and in our storage industry, we are used to doing things a certain way. We have existing data, we have historical data, we have trend data, you can't get rid of what you already have. The goal is to make connectors such that you can move from where you're at to where you're going, and so you have to be able to take care of the shift that is happening in the market, so at the end of the day, if you look at five years from now, it's all going to be machine learning and AI, right? Agent technology's already here, it's proven, we can see, Siri is out here, we can see Alexa, we can see these agent technologies out there, so machine learning is a getting a lot of momentum, deep learning and neural networks, things like that. So we got to be able to look at that data and tap into our data, near realistically, very different, and the way to do that is really making these connections happen, tapping into old versus new. Like for example, if you look at storage, you have file storage, you have block storage, and then you have object storage, right? We've not really tapped into the field of object storage, and the reason is because if you are going to process one trillion objects like Amazon is doing right now with S3, you can't do it with the file system level storage or with the blog system level storage, you have to go to objects. Think Internet of Things. How many trillions of objects are going to come out of these Internet of Things? So one, you have to be positioned from an infrastructure standpoint. Two, you have to be positioned from a use case prototyping perspective, and three, you got to be able to scale that very rapidly, very quickly, and that's how change happens, change does not happen because you ask somebody to change their behavior, change happens when you show value, and people are so eager to get that value out of what you've shown them in real life, that they are so quick to adapt. >> That's an excellent-- >> If I could comment on that as well, which is, we just got through training a bunch of OT guys on our software, and two analogies that actually work very well, one is sort of, the operational people are very familiar with circuit diagrams, and so, and sort of, flow of things through essentially black boxes, you can think of these as something that has a bunch of inputs and has a bunch of outputs. So that's one thing that worked very well. The second thing that works very well is the PLC model, and there are direct analogies between PLC's and analytics, which people on the floor can actually relate to. So if you have software that's basically based on data streams and time, as a first-class citizen, the PLC model again works very well in terms of explaining the new software to the OT people. >> Excellent, okay, would you want to come in on that as well? >> Sure, I think a couple of points to add to what Janet said, I couldn't agree more in terms of the result, I think Maana did a few projects, a few pilots to convince customers of their value, and we typically focus very heavily on operationalizing the output, so we are very focused on making sure that there is some measurable value that comes out of it, and it's not until the end user started seeing that value that they were willing and open to adopt the newer methodologies. A second point to that is, a lot of the more recent techniques available to solve certain challenges, there are deep learning neural nets there's all sorts of sophisticated AI and machine learning algorithms that are out there, a lot of these are very sophisticated in their ability to deliver results, but not necessarily in the transparency of how you got that, and I think that's another thing that Maana's learning, is yes, we have this arsenal of fantastic algorithms to throw at problems, but we try to start with the simplest approach first, we don't unnecessarily try to brute force, because I think an enterprise, they are more than willing to have that transparency in how they're solving something, so if they're able to see how they were able to get to us, how the software was able to get to a certain conclusion, then they are a lot happier with that approach. >> Could you maybe just give one example, a real-world example, make it a little bit real? >> Right, absolutely, so we did a project for a very large organization for collections, they have a lot of outstanding capital locked up and customers not paying, it's a standard problem, you're going to find it in pretty much any industry, and so for that outstanding invoice, what we did was we went ahead and we worked with the subject matter experts, we looked at all the historical accounts receivable data, we took data from a lot of other sources, and we were able to come up with models to predict when certain customers are likely to pay, and when they should be contacted. Ultimately, what we wanted to give the collection agent were a list of customers to call. It was fairly straightforward, of course, the solution was not very, very easy, but at least on a holistic level, it made a lot of sense to us. When we went to the collection agents, many of them actually refused to use that approach, and this is part of change management in some sense, they were so used to doing things their way, they were so used to trying to target the customers with the largest outstanding invoice, or the ones that hadn't paid for the longest amount of time, that it actually took us a while, because initially, what the feedback we got was that your approach is not working, we're not seeing the results. And when we dug into it, it was because it wasn't being used, so that would be one example. >> So again, proof points that you will actually get results from this. >> Absolutely, and the transparency, I think we actually sent some of our engineers to work with the collections agents to help them understand what approach is it that we're taking, and we showed them that this is not magic, we're actually, instead of looking at the final dollar value, we're looking, we're calculating time value lost, so we are coming up with a metric that allows us to incorporate not just the outstanding amount, or the time that they haven't paid for, but a lot of other factors as well. >> Excellent, Val. >> When you asked that question, I immediately went to more of a nontechnical business side of my brain to answer it, so my experience over the years has been particularly during major industry transitions, I'm old enough to remember the mainframe to client server transition, and now client server to virtualization and cloud, and really, sales reps have that well-earned reputation of being coin-operated, though it's remarkable how much you can adjust compensation plans for pretty much anyone, in a capitalist environment, and the IT/OT divide, if you will, is pretty easy to solve from a business perspective when you take someone with an IT supporting the business mentality, and you compensate them on new revenue streams, new business, all of a sudden, the world perspective changes sometimes overnight, or certainly when that contract is signed. That's probably the number one thing you can do from a people perspective, is incent them and motivate them to focus on these new things, the technology is, particularly nowadays is evolving to support them for these new initiatives, but nothing motivates like the right compensation plan. >> Excellent, a great series of different viewpoints. So the second question I have again coming down a bit to this level, is how do we architect a solution? We heard you got to architect it, and you've got less, like this, it seems to me that that's pretty difficult to do ahead of where you're going, that in general, you take smaller steps, one step at a time, you solve one problem, you go on to the next. Am I right in that? If I am, how would you suggest the people go about this decision-making of putting architectures together, and if you think I'm wrong and you have a great new way of doing it, I'd love to hear about it. >> I can take a shorter route. So we have a number of customers that are trying to adopt, are going through a phased way of adopting our technology and products, and so it begins with first gathering of the data, and replaying it back, to build the first level of confidence, in the sense that the product is actually doing what you're expecting it to do. So that's more from monitoring administration standpoint. The second stage is you should begin to capture analytical logic into the project, where it can start doing prediction for you, so you go into, so from operational, you go into a predictive maintenance, predictive maintenance, predictive models standpoint. The third part is prescriptive, where you actually help create a machine learning model, now, it's still in flux in terms of where the model gets created, whether it's on the cloud, in a central fashion, or some sort of a, the right place, the right context in a multi-level hierarchical fog layer, and then, you sort of operationalize that as close to the data again as possible, so you go through this operational to predictive to prescriptive adoption of the technology, and that's how people actually build confidence in terms of adopting something new into, let's say, a manufacturing environment, or things that are pretty expensive, so I give you another example where you have the case of capacitors being built on a assembly line, manufacturing, and so how do you, can you look at data across different stations and manufacturing on a assembly line? And can you predict on the second station that it's going to fail on the eighth one? By that, what you're doing is you are actually reducing the scrap that's coming off of the assembly line. So, that's the kind of usage that you're going to in the second and third stage. >> Host: Excellent. Janet, do you want to go on? >> Yeah, I agree and I have a slightly different point of view also. I think architecture's very difficult, it's like Thomas Edison, he spent a lot of time creating negative knowledge to get to that positive knowledge, and so that's kind of the way it is in the trenches, we spend a lot of time trying to think through, the keyword that comes to mind is abstraction layers, because where we came from, everything was tightly coupled, and tightly coupled, computer and storage are tightly coupled, structured and unstructured data are tightly coupled, they're tightly coupled with the database, schema is tightly coupled, so now we are going into this world of everything being decoupled. In that, multiple things, multiple operating systems should be able to use your storage. Multiple models should be able to use your data. You cannot structure your data in any kind of way that is customized to one particular model. Many models have to run on that data on the fly, retrain itself, and then run again, so when you think about that, you think about what suits best to stay in the cloud, maybe large amounts of training data, schema that's already processed can stay on the cloud. Schema that is very dynamic, schema that is on the fly, that you need to read, and data that's coming at you from the Internet of Things that's changing, I call it heteroscedastic data, which is very statistical in nature, and highly variable in nature, you don't have time to sit there and create rows and columns and structure this data and put it into some sort of a structured set, you need to have a data lake, you need to have a stack on top of that data lake that can then adapt, create metadata, process that data and make it available for your models, so, and then over time, like I totally believe that now we're running into near realtime compute bottleneck, processing all this pattern processing for the different models and training sets, so we need a stack that we can quickly replace with GPUs, which is where the future is going, with pattern processing and machine learning, so your architecture has to be extremely flexible, high layers of abstraction, ability to train and grow and iterate. >> Excellent. Do you want to go next? >> So I'll be a broken record, back to data gravity, I think in an edge context, you really got to look at the cost of processing data is orders of magnitude less than moving it or even storing it, and so I think that the real urgency, I don't know, there's 90% that think of data at the edge is kind of wasted, you can filter through it and find that signal through the noise, so processing data to make sure that you're dealing with really good data at the edge first, figuring out what's worth retaining for future steps, I love the manufacturing example, I have lots of customer examples ourselves where, for quality control in a high-moving assembly line, you want to take thousands of not millions of images and compare frame and frame exactly according to the schematics where the device is compared to where it should be, or where the components, and the device compared to where they should be, processing all of that data locally and making sure you extract the maximum value before you move data to a central data lake to correlate it against other anomalies or other similarities, that's really key, so really focus on that cost of moving and storing data, yeah. >> Yes, do you want the last word? >> Sure, Maana takes an interesting approach, I'm going to up-level a little bit. Whenever we are faced with a customer or a particular problem for a customer, we try to go over the question-answer approach, so we start with taking a very specific business question, we don't look at what data sources are available, we don't ask them whether they have a data lake, or we literally get their business leaders, their subject matter experts, we literally lock them up in a room and we say, "You have to define "a very specific problem statement "from which we start working backwards," each problem statement can be then broken down into questions, and what we believe is any question can be answered by a series of models, you talked about models, we go beyond just data models, we believe anything in the real world, in the case of, let's say, manufacturing, since we're talking about it, any smallest component of a machine should be represented in the form of a concept, relationships between people operating that machinery should be represented in the form of models, and even physics equations that are going into predicting behavior should be able to represent in the form of a model, so ultimately, what that allows us is that granularity, that abstraction that you were talking about, that it shouldn't matter what the data source is, any model should be able to plug into any data source, or any more sophisticated bigger model, I'll give you an example of that, we started solving a problem of predictive maintenance for a very large customer, and while we were solving that predictive maintenance problem, we came up with a number of models to go ahead and solve that problem. We soon realized that within that enterprise, there are several related problems, for example, replacement of part inventory management, so now that you figured out which machine is going to fail at roughly what instance of time from now, we can also figure out what parts are likely to fail, so now you don't have to go ahead and order a ton of replacement parts, because you know what parts are going to likely fail, and then you can take that a step further by figuring out which equipment engineer has the skillset to go ahead and solve that particular issue. Now, all of that, in today's world, is somewhat happening in some companies, but it is actually a series of point solutions that are not talking to each other, that's where our pattern technology graph is coming into play where each and every model is actually a note on the graph including computational models, so once you build 10 models to solve that first problem, you can reuse some of them to solve the second and third, so it's a time-to-value advantage. >> Well, you've been a fantastic panel, I think these guys would like to get to a drink at the bar, and there's an opportunity to talk to you people, I think this conversation could go on for a long, long time, there's so much to learn and so much to share in this particular information. So with that, over to you! >> I'll just wrap it up real quick, thanks everyone, give the panel a hand, great job. Thanks for coming out, we have drinks for the next hour or two here, so feel free to network and mingle, great questions to ask them privately one-on-one, or just have a great conversation, and thanks for coming, we really appreciate it, for our Big Data SV Event livestreamed out, it'll be on demand on YouTube.com/siliconangle, all the video, if you want to go back, look at the presentations, go to YouTube.com/siliconangle, and of course, siliconangle.com, and Wikibond.com for the research and content coverage, so thanks for coming, one more time, big round of applause for the panel, enjoy your evening, thanks so much.

Published Date : Mar 16 2017

SUMMARY :

Is 90% of the computing going to go to the cloud of getting our second release of the product out shortly. and that simply means putting the focus so could you like, Janet George from Western Digital, consuming the data that you collect from anywhere, and I can never pronounce your surname. so talk about where you have been the acting CTO of SolidFire, but I plan to retire Okay, and data gravity is absolutely key, isn't it, And the best analogy is workloads are like electricity, would you like to start with that question? and the reason is because if you are going to process in terms of explaining the new software to the OT people. but not necessarily in the transparency of how you got that, and we were able to come up with models to predict So again, proof points that you will actually Absolutely, and the transparency, and the IT/OT divide, if you will, and if you think I'm wrong and you have a great new way and then, you sort of operationalize that Janet, do you want to go on? the keyword that comes to mind is abstraction layers, Do you want to go next? and the device compared to where they should be, and then you can take that a step further and there's an opportunity to talk to you people, all the video, if you want to go back,

ENTITIES

Entity	Category	Confidence
Janet George	PERSON	0.99+
Janet	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Cloud Native Compute Foundation	ORGANIZATION	0.99+
90%	QUANTITY	0.99+
10 models	QUANTITY	0.99+
Amazon	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
Shell	ORGANIZATION	0.99+
Siri	TITLE	0.99+
Sanjeev Kumar	PERSON	0.99+
second	QUANTITY	0.99+
Rohan	PERSON	0.99+
one question	QUANTITY	0.99+
Maana	ORGANIZATION	0.99+
FogHorn Systems	ORGANIZATION	0.99+
two analogies	QUANTITY	0.99+
thousands	QUANTITY	0.99+
Bercovici	PERSON	0.99+
Thomas Edison	PERSON	0.99+
second question	QUANTITY	0.99+
second station	QUANTITY	0.99+
SolidFire	ORGANIZATION	0.99+
FogHorn	ORGANIZATION	0.99+
third	QUANTITY	0.99+
first	QUANTITY	0.99+
third part	QUANTITY	0.99+
today	DATE	0.99+
second thing	QUANTITY	0.98+
Two	QUANTITY	0.98+
two-year-old	QUANTITY	0.98+
one problem	QUANTITY	0.98+
end of April	DATE	0.98+
Alexa	TITLE	0.98+
first problem	QUANTITY	0.98+
one example	QUANTITY	0.97+
three	QUANTITY	0.97+
second release	QUANTITY	0.97+
third stage	QUANTITY	0.97+
second point	QUANTITY	0.96+
one trillion objects	QUANTITY	0.96+
second stage	QUANTITY	0.96+
one	QUANTITY	0.96+
one level	QUANTITY	0.95+
two software companies	QUANTITY	0.95+
NetApp Data Fabric	ORGANIZATION	0.95+
each	QUANTITY	0.95+
millions of images	QUANTITY	0.95+
2008	DATE	0.95+
first level	QUANTITY	0.95+
both	QUANTITY	0.95+
eighth one	QUANTITY	0.94+
S3	TITLE	0.93+
trillions of objects	QUANTITY	0.92+
Wikibond.com	ORGANIZATION	0.92+
each problem statement	QUANTITY	0.92+
one thing	QUANTITY	0.92+
one step	QUANTITY	0.91+
Big Data SV Event	EVENT	0.91+
siliconangle.com	OTHER	0.91+
NetApp	ORGANIZATION	0.91+
Maersk	ORGANIZATION	0.9+
about three years old	QUANTITY	0.89+
five years	QUANTITY	0.89+
Maana	PERSON	0.89+
about 18 months ago	DATE	0.88+
26 years old	QUANTITY	0.82+
one particular model	QUANTITY	0.82+
Kubernetes Technology	ORGANIZATION	0.82+
Val	PERSON	0.82+
every	QUANTITY	0.81+
Chevron	ORGANIZATION	0.79+
Panel 2	QUANTITY	0.77+
seven other related projects	QUANTITY	0.7+
next hour	DATE	0.69+
to	TITLE	0.66+
petabytes	QUANTITY	0.64+
time	QUANTITY	0.64+
two	QUANTITY	0.63+
series of models	QUANTITY	0.52+

Western Digital Taking the Cloud to the Edge - #DataMakesPossible - Panel 1

>> Why don't I spend just a couple minutes talking about what we mean by digital enactment, turning data in models and models into action. And then we'll jump directly into, I'll introduce the panelists after that, and we'll jump directly into the questions. So Wikibon SiliconAngle has been on a mission for quite sometime now to really understand what is the nature of digital transformation, or digital disruption. And historically, when we've talked about digital, people talk about a variety of different characteristics of it, so we'll talk about new types of channels and activity on the web, and a many number of other things. But to really make sense of this, we kind of felt that we had to go to a set of basic principles, and utilize those basic principles to build our observations up. And so what we started with is a simple observation that, if it's not digital, or if it's not data, it ain't digital. By that we mean fundamentally the idea of digital business is how are we going to use data as an asset to differentially drive our business forward? And if we borrowed from Drucker, Drucker used to like to talk about the idea that business exists to create sustained customers, and so we would say that digital business is about applying data assets to differentially create sustained customers. Now to do that successfully, we have to be able to, as businesses, be able to establish a set of strategic business capabilities that will allow us to differentially use data assets. And we think that there are a couple of core strategic business capabilities required. One is human beings and most businesses operate in the analog world, so it's how do we take that analog data and turn it into digital data that we can then process. So that's the first one, the notion of an IOT as a transducer of information so that we can generate these very rich data streams. Secondly we have to be able to do something with those data streams, and that's the basis of big data. So we utilize big data to create models, to create insights, and increasingly through a more declarative style, actually create new types of software systems that will be crucial to driving the business forward. That's the second capability. The third capability is one that we're still coming to understand, and that is we have to take the output of those models, the output of those insights, and then turn them back into some event that has a consequential moment in the real world, or what we call systems of an action. And so the three core business capabilities that have to be built are this capture data through IOT, big data to process it, systems of an action also through IOT, through actuators, to actually that have a consequential action in the real world. So that's the basis of what we're talking about. We're going to take Flavio's vision that he just laid out, and then we, in this panel, are going to talk about some of the business capabilities necessary to make that happen, and then after this, David Foyer will lead a panel on specifically some of the lower level technologies that are going to make it work. Make sense guys? >> Sounds good (mumbles). >> Okay, so let me introduce the panelists. Over, down there on the end, Ted Connell. Ted is from Intel, I don't know if we can get the slide up that has their names and their titles. Ted, why don't you very quickly introduce yourself. >> Yeah, thank you very much. I run Solution Architecture for the manufacturing and industrial vertical, where we put together end to end ecosystem solutions that solve our clients business problems. So we're not selling silicone or semiconductors, we're solving our clients problems, which as Flavio said, requires ecosystem solutions of software, system integrators, and other partners to come together to put together end solutions. >> Excellent, next to Ted is Steve Madden of Equinix. >> Yeah, Steve Madden. Equinix is the largest interconnection, global interconnection company and a lot of the ecosystems that you'll be hearing about, come together inside our locations. And one of the things I do in there is work with our big customers on industry vertical level solutions, IOT being one of them. >> Phu Hoang, from Data Torrent. >> Hi, my name's Phu Hoang, I'm co-founder and chief strategy of a company called Data Torrent, and at Data Torrent, our mission is really to build out solutions to allow enterprises to process big data in a streaming fashion. So that whole theme around ingestion, transformation, analytics, and taking action in sub second on massive data is what we're focusing on. >> And you're familiar with Flavio. Flavio, will you take a second to introduce yourself. >> Yes, thank you, I am leading a company that is trying to manifest the vision highlighted here, building a platform. Not so much the applications, we are hosting the applications (mumbles) the data management and so forth. And trying to apply the industrial vertical first. Big enough to keep us busy for quite a while. >> So in case you didn't know this, we have an interesting panel, we have use case, application, technol infrastructure, and platform. So what' we'll try to do is over the next, say, 10 minutes or so, we're going to spend a little bit of time, again, talking about some of these business capabilities. Let me start off by asking each of you a question, and I will take, if anybody is really burning to ask a question, raise your hand, I'll do my best to see you and I'll share the microphone for just long enough for you to ask it. Okay, so first question, digital business is data. That means we have to think about data differently. Ted, at Intel, what is Intel doing when they think about data as an asset? >> So, Intel has been working on what is now being called Fog, and big data analytics for over a generation. The modern xeon server we're selling, the wire in the electronics if you will, is 10 silicon atoms wide. So to control that process, we've had to do what is called Industry 4.0 20 years ago. So all of our production equipment has been connected for 20 years, we're running... One of our factories will produce a petabyte of data a day, and we're running big data analytics, including machine learning on the stuff currently. If you look at an Intel factory, we have 2,000 fit clients on the factory floor supported by 600 servers in our data center at the factory, just to control the process and run predictive yield analytics. >> Peter: So that's your itch? >> Our competitive advantage at Intel is the factory. We are a manufacturer, we're a world class manufacturer. Our front end factories have zero people in it, not that we don't like people, but we had to fully automate the factory because as I speak, tens of thousands of water molecules are leaving my mouth, and if one of those water molecules lands on a silicon, it ain't going to work. So we had to get people physically out of the factory, and so we were forced by Moore's Law, and the product we build, to build out what became Fog, when they came up with the term seven years ago, we just came to that conclusion because of cost, latency, and security, it made sense to, you know, look, you got data, you got compute, there's a network between. It doesn't matter where you do the compute, bring the compute to the data, the data to the compute. You're doing a compute function, it doesn't matter where you do it. So Fog is not complicated, it's just a distributed data center. >> So when you think about some of the technologies necessary to make this work, it's not just batch, we're going to be doing a lot of stuff in real time, continuously. So Phu, talk a little bit about the system software, the infrastructure software that has to be put in place to ensure that this works for them. >> I think that's great. A little bit about our background, the company was founded by a bunch of ex-Yahoos that had been out for 12, 15 years from the early days. So we sort of grew up in that period where we had to learn about big data, learn about making all the mistakes of big data, and really seeing that nowadays, it's not good enough to get insight, you have to get insight in a timely fashion enough to actually do something about it. And for a lot of enterprise, especially with human being carrying around mobile phones and moving around all over the place, and sensors sending thousands, if not millions of events per second, the need for the business to understand what's going on and react, have insight and react sub second, is crucial. And what that means is the stuff that used to be batch, offline, you know, can kind of go down, now has to be continuous, 24 by seven. You can't lose data, you got to be able to recover and come back to where you were as if nothing has happened with no human intervention. There's a lot of theme around no human intervention, because this stuff is so fast, you can't involve human beings in it, then you're not reacting fast enough. >> Can I real quickly add one thing first? >> Peter: Sure. >> We think of data at Intel in half life terms. >> Yeah, that's exactly right. >> The data has valuable right now. If you wait a second, literally a second, the data has a little bit of value. You wait two second, it's historical data you can run regressions, and tell you why you screwed up, but you ain't going to fix anything. >> Exactly. >> If you want to do anything with your data, you got to do it now. >> So that, ultimately, we need to develop experience, a creed experience about what we're doing. And the stuff we're doing in applications will eventually find itself into platforms. So Flavio, talk to us a little bit about the types of things that are going to end up in the platform to ensure that these use cases are made available to, certainly, businesses that perhaps aren't as sophisticated as Intel. >> Yes, so in many ways, we are learning from what is going on in the Cloud, and has to come through this continuum, all the way into the machines. This break between what's going inside the machine, and old 1980 microprocessor and the server, and the Cloud server with virtualization on the other side cannot leave. So it has to be a continuum of computing so you can move the same function, the same container, all the way through first. Second, you really have to take the real time very, very seriously, particularly at the edge, but even in the back so that when you have these end to end continuum, you can decide where you do what. And I think that one of the models that was in that picture with a concentric circle is really telling what we need to learn first. Bring the data back and learn, and that can take time. But then you can have models that are lightweight, that can be brought down to the front, and impact the reaction to the data there. And we heard from a car company, a big car company, how powerful this was when they learned that the angle of a screwdriver, and a few other parameters, can determine the success of screwing something into a body of a car, that could go well, or could go very, very bad and be very costly. So all the learning, massive data, can come down to a simple model that can save a lot of money and improve efficiency. But that has to be hosted along this continuum. >> So from a continuum, it means we still have to have machines somewhere to do something. >> Touching the ground, touching the physical world requires machines, actuators. >> Peter: Absolutely, so Steve, what is Equinix doing to simplify the thinking through of some of these infrastructure issues? >> Yeah, I mean, the biggest thing that people find when they start looking at millions of devices, millions of data capture points, transferring those data real time and streaming it, is one thing hasn't changed and that's physics. So where those things are, where they need to go, where the data needs to move to and how fast, starts with having to figure out your own topology of how you're moving that data. As much as it's easy to say we're just going to buy a platform and choose a device, and we'll clink them together, there's still a lot of other things that need to be solved, physics being the first one. The second one, primarily, is volumes. So how much bandwidth and (mumbles) you're going to require. How much of that data are you going to back haul to centralized data center before you send it up to a Cloud? How much of it are you going to leave at the edge? Where do you place that becomes a bigger deal. And the third one is pretty much every industry has to deal with regulations. Regulations control what you can and can't do in terms of IT delivery, where you can place stuff, where you cannot place stuff, data that can leave the country, data that can't. So all these things mean that you need to have a thought through process of where you're placing certain functions, and what you're defining as your itch between the digital and physical world. And Equinix is an interconnection company that's sitting there as a neutral party across all the networks, all the clouds, all the enterprises, all the providers to help people figure that out. >> So before I ask the audience a question, now that I'm down here so I can see you so be prepared, I'm going to ask some of you a question. When you think about the strategic business capabilities necessary to succeed, what is the first thing that the business has to do? So why don't I just take Ted, and just go right on down the line. >> Yeah, so I think this is really, really important. I work with many, many clients around the world who are doing five, 10, 15 POCs, pilots, and the internet things, and they haven't thought through a codified strategy. So they're doing five things that will never fit together, that you will never scale, and the learnings you're using, you really can't do that much with. So coming up with what is my architecture, what is my stack going to look like, how am I going to push data, what is my data... You know, because when you connect to these things, I can't tell you how much data you're going to get. You're going to be overwhelmed by the data, and that's why we all go to the edge, and I got to process this data real time. And oh, by the way, if I only have one source of data, like I'm connecting to production equipment, you're not going to learn anything. 98% of that data's useless, you got to contextualize the data with either an inspection step, or some kind of contextualization that tells you if this then that. You need the then that, without that, your data is basically worthless. So now you're pulling multiple sources of data together in real time to make an understanding. And so understanding what that architecture looks like, spend the time upfront. Look, most of us are engineers, you know five percent additional work upfront saves you 95% on the backend, that's true here. So think through the architecture, talk to some of us who have been working in this area for a long time. We'll share our architecture, we have reference architecture that we're working with companies. How do you go from industry 2.0 or industry 3.0, to industry 4.0? And there is a logical path to do it, but ultimately, where we're going to end up is a software defined universe. I mean, what's a cloud? It's a software defined data center. Now we're doing software defined networks, software defined storages, ultimately we're going to be doing software defined systems because it's cheaper. You get better capital utilization, better asset utilization, so we will go there, so what does that mean for you infrastructure, and what are you going to do from an architectural perspective, and then take all of your POCs and pilots, and force them to do that specifically around security. People are doing POCs with security that they don't even have any protocols, they're violating all their industry standards doing POCs, and that's going to get thrown out. It's wasted time, wasted effort, don't do it. >> Steve, a couple sentences? >> Yeah, essentially it's not going to be any prizes for me saying think interconnection first. A lot of our customers, if we look at what they've done with us, everyone from GE to real time facial recognition at the edge, it all comes down to how are you wired, topology wise, first. You can't use the internet for risk reasons, you can't necessarily pay for multiple (mumbles) bandwidth costs, et cetera. So low latency, 80% lower latency, seven times of bandwidth at half the cost is a scalable infrastructure to move (mumbles) around the planet. If you don't have that, the rest of the stuff (mumbles) breakdown. >> Peter: Phu? >> Well I would say that analytics is hard, analytics in real time is even harder. And I think with us talking to our customers, I feel for them, they're confused. There's like a million solutions out there, everybody's trying to claim to do the same thing. I think it's both sides, consumers have to get more educated, they have to be more intelligent about their POCs, but as an industry, we also have to get better at thinking about how do we help our customer succeed. It's not about let me give you some open source, and then let me spend the next 10 months charging you professional services to help you. We ought to think about software tools and enterprise tools to really help the customer be able to think about their total cost (mumbles) and time to value to handle this thing, because it's not easy. >> Peter: Flavio. >> Yeah, we're facing an interesting situation where the customers are ready, the needs are there, the marketing is going to be huge, but the plot, the solution, is not trivial. It is maturing and we are all trying to understand how to do it. And this is the confusion that you see in many of these half baked solution (mumbles). Everything is coming together, and you have to go up the stalk and down the stalk with full confidence, that's not easy. So we all have to really work together. Give ourselves time, be feeling that we are in a competitive world, preparing for addressing together a huge market. And trying to mature these solutions that then will be replicated more and more, but we have to be patient with each other, and with the technologies that are maturing and they're not fully there and understood. But the market is amazing. >> Peter: So we have a Twitter question. >> Man: It's being live streamed, the audience is really engaged online as well, digital. So we have a question from Twitter from Lauren Cooney saying, "Would like to know what industries would "be most impacted with digitization "over the next five years." >> Which one won't be? (men laughing) All of them, what we've seen, the business model is the data. I mean, our CEOs calling data the new gold. I mean, it's the new oil. So I don't know of anything, unless you're doing something that is just physical therapy, but that even data, you can do data on that. So yeah, everything, yeah, I don't know of anything that won't be. >> I think the real question is how is it going to move through industries. Obviously it's going to start with some of the digital native, it's all ready deep into that, deep into media, we're moving through the media right now. Intel's clearly a digital company, and you've been working, you've been on this path for quite some time. >> Let me give you a stat. Intel has a 105,000 people, and 144,000 servers. So we're about 1.5 server to people, that's what kind of computation we're (mumbles). >> Peter: We can help you work on that. >> If you do like the networking started by (mumbles) the internet, then content delivery, and media, hard media, et cetera, is gone. Financial services and trading exchanges pretty much show what digital market's going to be in the future. Cloud showed up, and now, I think he's right, it's effecting every industry. Manufacturing, industrial, health professional services are the top three right now. But people who shop to ask for help went from every industry on every country, for that matter. >> Our customers are, you know, the top players in almost every vertical. You start out as a small company thinking that you're going to attack one vertical, but as you start to talk about the capability, everybody (mumbles) wait, you're solving my problem. >> Peter: (mumbles) are followers, is what you mean. >> Yeah, because what business would say, hey, I don't want to know what's going on with my business, and I don't want to take any action. >> Add to that it's an ecosystem of ecosystems. No one, by themselves, is going to solve anything. They have to partner and connect with other people to solve the solution. >> So I'll close the panel by making these kind of summary comments, the business capabilities that we think are going to be most important are, first off, when we talk about the internet of things, we like to talk about the internet of things and people. That the people equation doesn't go away. So we're building on mobile, we're building on other things, but if there's a strategic capability that's going to be required, it's going to be how is this going to impact folks who actually create value in the business. The second one, I'll turn it around, is that IT organizations have gone through a number of different range wars, if you will, over the past 20 years. I lived through IT versus telecom, for example. The IT, OT conflict, or potential conflict, is non trivial. There's going to be some serious work that has to be done, so I would add to the conversation that we've heard thus far, the answers that we've heard thus far, is the degree to which people are going to be essential to making this work, and how we diffuse this knowledge into our employees, and into our IT and professional communities is going to be crucial, especially with developers because Flavio, if we are, right now, trying to figure stuff out, it really matures when we think about the developer world. Okay, so I want to close the first panel and get ready for the second panel. So thank you very much, and thank you very much to our panelists. (audience applauding) And if we could bring David Foyer and the second panel up, we'll get going on panel two. Oh, we're going to get together for a picture. (exciting rhythmic music)

Published Date : Mar 16 2017

SUMMARY :

Now to do that successfully, we have to be able to, Okay, so let me introduce the panelists. I run Solution Architecture for the manufacturing And one of the things I do in there is work with our and at Data Torrent, our mission is really to build Flavio, will you take a second to introduce yourself. Not so much the applications, I'll do my best to see you and I'll share the microphone in our data center at the factory, just to control and the product we build, to build out what became Fog, the infrastructure software that has to be put in and come back to where you were as if nothing has happened the data has a little bit of value. you got to do it now. And the stuff we're doing in applications will eventually and impact the reaction to the data there. So from a continuum, it means we still have to have Touching the ground, touching the physical world all the providers to help people figure that out. the business has to do? and what are you going to do from an architectural perspective, at the edge, it all comes down to how are you wired, and time to value to handle this thing, the marketing is going to be huge, saying, "Would like to know what industries would I mean, our CEOs calling data the new gold. Obviously it's going to start with some of the digital native, Let me give you a stat. in the future. but as you start to talk about the capability, and I don't want to take any action. They have to partner and connect with other people is the degree to which people are going to be

ENTITIES

Entity	Category	Confidence
Steve	PERSON	0.99+
Peter	PERSON	0.99+
Lauren Cooney	PERSON	0.99+
Data Torrent	ORGANIZATION	0.99+
80%	QUANTITY	0.99+
Steve Madden	PERSON	0.99+
Equinix	ORGANIZATION	0.99+
five	QUANTITY	0.99+
first	QUANTITY	0.99+
David Foyer	PERSON	0.99+
600 servers	QUANTITY	0.99+
95%	QUANTITY	0.99+
five percent	QUANTITY	0.99+
second panel	QUANTITY	0.99+
second panel	QUANTITY	0.99+
98%	QUANTITY	0.99+
20 years	QUANTITY	0.99+
10 minutes	QUANTITY	0.99+
Ted Connell	PERSON	0.99+
thousands	QUANTITY	0.99+
Ted	PERSON	0.99+
first panel	QUANTITY	0.99+
144,000 servers	QUANTITY	0.99+
105,000 people	QUANTITY	0.99+
Intel	ORGANIZATION	0.99+
third one	QUANTITY	0.99+
two second	QUANTITY	0.99+
five things	QUANTITY	0.99+
millions of devices	QUANTITY	0.99+
first question	QUANTITY	0.99+
one source	QUANTITY	0.99+
Second	QUANTITY	0.99+
One	QUANTITY	0.99+
both sides	QUANTITY	0.99+
seven years ago	DATE	0.99+
10	QUANTITY	0.99+
first one	QUANTITY	0.99+
second capability	QUANTITY	0.99+
third capability	QUANTITY	0.98+
20 years ago	DATE	0.98+
seven times	QUANTITY	0.98+
three core	QUANTITY	0.98+
panel two	QUANTITY	0.98+
second one	QUANTITY	0.98+
one	QUANTITY	0.97+
Phu Hoang	ORGANIZATION	0.97+
GE	ORGANIZATION	0.97+
Twitter	ORGANIZATION	0.97+
each	QUANTITY	0.97+
24	QUANTITY	0.97+
Yahoos	ORGANIZATION	0.96+
seven	QUANTITY	0.96+
12, 15 years	QUANTITY	0.96+
15 POCs	QUANTITY	0.96+
Flavio	PERSON	0.96+
a second	QUANTITY	0.96+
zero people	QUANTITY	0.95+
one vertical	QUANTITY	0.94+
Drucker	ORGANIZATION	0.93+
Panel 1	QUANTITY	0.93+
Moore	ORGANIZATION	0.93+
millions of events per second	QUANTITY	0.92+

Western Digital Taking the Cloud to the Edge - #DataMakesPossible - Presentation by Flavio Bonomi

>> It's a pleasure to be here with you and to tell you about something I've been dreaming about and working for for many years and now is coming to the surface quite powerfully and quite usefully in many areas. I apologize, sometimes this flickers for some reason but I hope it doesn't disturb the story. I'd like to give you a little touch of history since I was there at the beginning of this journey and give you a brief introduction to what we mean for Fog Computing. And then go quickly to three powerful application spaces for this technology, together with industrial internet and one is industrial automation. That's the focus of our activity as Nebbiolo Technologies. The other one is one of my favorite ones and we'll get there is the automotive that caught fire here in Silicone Valley in the last years, the autonomous car, the connected vehicle and so on. And this is related to also to intelligent transportation and Smart Cities. And then a little touch on what Fog Computing means for Smart grid energy but many, many other sectors will find the same usefulness, the same architecture dimensions of Fog Computing applicable. So this is the story that comes back hopefully, here, the day in 2010 when Fog Computing, the word started here, oh God, is this jumping around? I think it's the connector, this is the age of the connector, this is the age of the Dongles. This is not an Apple Dongle and so we are having troubles. And this is not yet one of the last machines that are out. Let's hope for, I never had this problem, okay. Alright, this date 2010 at the Aquarium Research Center in Monterey where I gave a talk about robots going down deep in the bottom of those big valleys under the ocean and when I finished, the lady, Ginny in the middle approached me and told me, look, why don't you call what you're talking about fog computing? Because it's cloud computing brought too close to the ground and I protested for about 15 minutes. And on the drive home, I thought that's really a good name for what we are doing, what we have been doing in the last years and I started trying it out and using it and more and more I found good response and so seven years later, I'm still here talking about the same thing. What's happening is Fog, the edge of the metric zone was very important but it was always very important in IT, is still very important in IT in mobile, in content distribution but when IOT came to the surface, it became even more relevant to understand the need of resources, virtualized real time capable, secure, trusted with storage computing and networking coming together at the edge. At the edge of the IT network, now they are calling this mobile edge, they realize we are realizing that mobile can benefit from local resources at the edge, powerful real time capable resources but also and more importantly for what we are doing in this space of operational technologies, this is the space, the other and the other side of the boundary between information technologies and operational technologies and here is where we are living with Fog Computing these days so, apologize, I apologize for this behavior that is, maybe I have another Dongle, Apple Dongle. Maybe I could look at that, maybe Morris can help me out here, anyway, so what is Fog Computing? Fog Computing is really the platform that brings modern, Cloud inspired Computing storage here is important here for our friends at Western Digital and networking functions closer to the data producing sources. In our case, machines, things, but not just bringing Cloud down, it's also bringing functions up from the machine world, the real time, the safety functions, the trusting and reliability functions required in that area and this is a unified solution at the edge that really brings together communication, device management, data harvesting, analysis and control. So this is kind of new except for our friends in Wall Street. The real time part was not as sensitive. Now we are realizing how important it is and how important the position of resources is in the future of solutions in this space and so it's not boxes. It's a distributed layer of resources, well managed at the edge of the network and really has a lot of potential across multiple industries. Here we see the progress also in the awareness of this topic with the open fog control room that is now a very active and even the Vcs. Peter Levine here is talking about the importance of the edge. What is really happening is the the convergence. I think we should probably stop and use a different Dongle. Is this the one, no, no, this is not the right Dongle. The world of Dongles, sorry. Oh boy. Oh you have the computer with the, okay, is the right Dongle with the right computer, okay. Here we are, okay. Alright, we're getting back there. This is the new Apple. Okay, we are here, this looks better, thank you. Alright, so this is to be understood. This is the convergence of IT functionality, the modern IT functionality with the OT requirements and this is fundamentally the powerful angle that Fog Computing brings to IOT and machine world so all the nice things that happened in the Cloud come down but meet the requirements of resources, the needs and the timing of the Edge. And so when you look at what is brought into particularly the world of operations, you see these kind of functions that are not usually there. In fact, when you meet this operational world, you find microprocessors, you find Windows machines, industrial Pcs and so on, not so much Linux, not so much the modern approaches to computing. These are the type of dimensions that you'll see have a particular impact on the pain points seen in the wold of applications. So now we go to the Use cases in, use cases in the internet of things. I think it's on your side, I'm sorry. Because it's the second machine. Okay, well, maybe here's the solution. So we have seen this picture of IOT multiple times. A lot of verticals, we are concentrating on this tree, one is the industrial, the second one is the autonomous vehicle in intelligent transportation, the third one, just touched upon is the Smart Grid. This is the area of activity for Nebbiolo Technologies. Those kind of body shops and industrial floors with large robots with a lot of activity around those robots with cells protecting the activities within each working space, this is the world PLCs, industrial Pcs controlling robots, very fragmented. Here we are really finding even more critical this boundary between operational and informational technologies. This is a fire wall, also a mental fire wall between the two worlds and best practice is very different in one place than the other particularly also in the way we handle data, security, and many other areas. In this space, which is also a little more characterized here with this kind of machines that you see in this ISA 99 or ISA 95 type of picture, you see the boundary between the two spaces, once more when we come back. And alright, so the key message here, very tough to go across, it's very complex, the interaction between the two worlds. And there is where deeply we find a number of pain points at the security level, at the Hardware architecture level, at the data analytics and storage level, at the networking, software technologies and control architecture. There's a lot happening there that is old, 1980's time frame, very stable but in need of new approaches. And this is where Fog Computing has a very strong impact And we'll see, sorry, this is a disaster here. Alright, what do we do, alright. Maybe I should go around with this computer and show it to you. Okay, now it's there for a moment. Now, this is, maybe you have to remember one picture of all this talk, look at this, what is this? This is a graphical image of a body shop of a an important car company, you see the dots represent computers within boxes, industrial Pcs, PLCs, controllers for welding machines, tools and so on. That is, if you sum up the numbers, it's thousands of computers, each one of them is updated through a UPC, USB stick, sorry and is not managed remotely. It's not secure because there's a trust that the whole area is enclosed and protected through a fire wall on the other side but it's very stable but very rigid. So this is the world that we are finding with dedicated, isolated, not secure computing, this is Edge Computing. But it's not what we hope to be seeing soon as Fog Computing in action there so this is the situation. Very delicate, very powerful and very motivating. And now comes IOT and this is not the solution. It's helping, IOT tries to connect this big region, the operational region to the back end to the Clouds, to the power of computing that is there, very important, predicting maintenance, many other things can be done from there but it's still not solving the problem. Because now you have to put little machines, gateways into that region, one more machine to manage, one more machine to secure and now you're taking the data out. You are not solving a lot of the pain points. There's some important benefits, this is very, very good. But it's not the story, the story is sold once you really go one step deeper, in fact, from connectivity between information technologies and informational technologies to really Convergence and you see it here where you're starting to replace those machines supporting each cell with a fog node, with a powerful convergent point of computing, real time computing that can allow control, analytics and storage and networking in the same nodes so now these nodes are starting to replace all the objects controlling a cell. And offer more functions to the cell itself. And now, you can imagine where this goes, to a convergent architecture, much more compact, much more homogeneous, much more like Cloud. Much more like Cloud brought down to the Edge. When this comes back, okay, almost there. So this is okay, this is now the image that you can image leads to this final picture that is now even not, okay, do you see it, okay. Now you're seeing the operational space with the fabric of computing storage and networking that is modern, that is virtualized, that supports an application store, now you have containers there. You can imagine virtual machines and dockers living the operational space. At the same time, you have it continuing from the Cloud to the network, the modern network, moving to the Edge into the operational space. This is where we are going and this is where the world wants us to go and the picture representing this transition and this application of Fog Computing in this area is the following, the triangle, the pyramid is now showing a layer of modern computing that allows communications analysis control application hosting and orchestration in a new way. This is cataclysmic, really is a powerful shift, still not fully understood but with immense consequences. And now you can do control, tight, close to the machines, a little slower through the Fog and a little slower through the Cloud, this is where we are going. And there's many, many used cases, I don't dwell on those. But we are proceeding with some of our partners exactly in this direction. Now the exciting topics if I can have five more minutes making up the time wasted. What's going on here, the connected vehicle, the autonomous vehicle, the electrification of automobile are all converging and I think it's very clear that the para dime of Fog Computing is fundamental here. And in fact, imagine the equivalent of a manufacturing cell with a converging capabilities into the Fog and compare it with what's going on with the autonomous vehicle. This is a picture we used a Sysco seven years ago. But this is now, a car is a set of little control loops, ECUs, little dispersed, totally connected computers. Very difficult to program, same as the manufacturing cell. And now where are we going, we are going towards a Fog node on wheels, data center on wheels but better a Fog node on wheels with much better networking between, with a convergence of the intelligence, the control, the analytics, the communications in the middle and a modern network deterministic internet called TSN is going to replace all these CAN boxes and all these flakey things of the past. Same movement in industrial and in the automobile and then you look at what's going on in the intelligent transportation, you can imagine Fog Computing at the edge, controlling the junctions, the traffic lights, the interactions with cars, cars to cars and you see it here, this is the image, again where you have the operational space of transportation connected to the Clouds in a seamless way which these nodes of computing storage and networking at the junctions inside the cars talking to each other, so this is the beautiful movement coming to us and it requires the distribution of resources with real time capabilities, here you see it. And now, the Smart Grid, again, it cannot continue to go the same way with a utility data center controlling everything one way, it has to have and this is from Duke and a standardization body, you can see that there's a need of intelligence in the middle, Fog nodes, distributed computing that are allowing local decisions. Energy coming from a microcell into the grid and out, a car that wants to sell it's energy or buy energy doesn't need to go slowly to a utility data center to make decisions so again, same architecture, same technologies needed, very, very, very powerful. And we could go on and on and on, so what are we doing? We won't advertise here but the name has to be remembered. The name comes from a grape that grows in the Fog in Northern Italy, it's in Piedmont, my home town is behind that 13th century castle you see there. Out there is Northern Italy close to Switzerland. That vineyard is from my cousin, it's a good Nebbiolo, starting to be sold in California too. So this is the name Nebbia Fog comes to, Nebbiolo Technologies, we are building a platform for this space with all the features that we feel are required and we are applying it to industrial automation. And our funders are not so much from here, are from Germany, Austria, KUKA Robotics, TTTech, GiTV from Japan and a few bullets to complete my presentation. Fog Computing is really happening. There's a deep need for this converged infrastructure for IOT including Fog or Edge as someone calls it. But we need to continue to learn, demonstrate, validate through pilots and POCs and we need to continue to converge with each other and with the integrators because these solutions are big and they are not from a little start up. They are from integrators, customers, big customers at the other end, an ecosystem of creative companies. No body has all the pieces, no Sisco, no GE and so on. In fact, they are all trying to create the ecosystem. And so let's play, let's enjoy the Cloud, the Fog and the machines and try to solve some of the big problems of this world. >> Okay, Flavio, well done. >> Sorry for that. Sorry for the hiccups. >> Now we do that on purpose to see how you'd react and you're a pro, thank you so much for the great presentation. >> Alright. >> Alright, now we're going to get into panel one, looking at the data models and putting data to work.

Published Date : Mar 16 2017

SUMMARY :

the interactions with cars, cars to cars and you see it Sorry for the hiccups. Now we do that on purpose to see how you'd looking at the data models and putting data to work.

ENTITIES

Entity	Category	Confidence
KUKA Robotics	ORGANIZATION	0.99+
California	LOCATION	0.99+
2010	DATE	0.99+
Flavio Bonomi	PERSON	0.99+
TTTech	ORGANIZATION	0.99+
Nebbiolo Technologies	ORGANIZATION	0.99+
Monterey	LOCATION	0.99+
Switzerland	LOCATION	0.99+
GiTV	ORGANIZATION	0.99+
Piedmont	LOCATION	0.99+
Wall Street	LOCATION	0.99+
Apple	ORGANIZATION	0.99+
Sisco	ORGANIZATION	0.99+
Silicone Valley	LOCATION	0.99+
Ginny	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
GE	ORGANIZATION	0.99+
five more minutes	QUANTITY	0.99+
two worlds	QUANTITY	0.99+
Peter Levine	PERSON	0.99+
Northern Italy	LOCATION	0.99+
Duke	ORGANIZATION	0.99+
second machine	QUANTITY	0.99+
one	QUANTITY	0.99+
Linux	TITLE	0.99+
13th century	DATE	0.99+
Japan	LOCATION	0.99+
each cell	QUANTITY	0.99+
seven years ago	DATE	0.98+
one place	QUANTITY	0.98+
seven years later	DATE	0.98+
Nebbia Fog	ORGANIZATION	0.98+
thousands of computers	QUANTITY	0.98+
Windows	TITLE	0.98+
one picture	QUANTITY	0.98+
each one	QUANTITY	0.97+
two spaces	QUANTITY	0.97+
about 15 minutes	QUANTITY	0.95+
Cloud	TITLE	0.95+
Germany	LOCATION	0.95+
third one	QUANTITY	0.94+
Flavio	PERSON	0.93+
Morris	PERSON	0.93+
last years	DATE	0.92+
one more machine	QUANTITY	0.91+
Sysco	ORGANIZATION	0.88+
one more machine	QUANTITY	0.86+
1980's	DATE	0.82+
each working	QUANTITY	0.81+
one way	QUANTITY	0.81+
Nebbiolo	ORGANIZATION	0.81+
second one	QUANTITY	0.77+
one step	QUANTITY	0.72+
Austria	LOCATION	0.72+
Aquarium Research Center	LOCATION	0.67+
ISA 95	OTHER	0.63+
three	QUANTITY	0.63+
Edge	TITLE	0.62+
ISA 99	OTHER	0.57+
cases	QUANTITY	0.51+
Dongle	COMMERCIAL_ITEM	0.51+
Dongle	ORGANIZATION	0.45+

Janet George, Western Digital | Women in Data Science 2017

>> Male Voiceover: Live from Stanford University, it's The Cube covering the Women in Data Science Conference 2017. >> Hi, welcome back to The Cube, I'm Lisa Martin and we are live at Stanford University at the second annual Women in Data Science Technical Conference. It's a one day event here, incredibly inspiring morning we've had. We're joined by Janet George, who is the chief data scientist at Western Digital. Janet, welcome to the show. >> Thank you very much. >> You're a speaker at-- >> Very happy to be here. >> We're very happy to have you. You're a speaker at this event and we want to talk about what you're going to be talking about. Industrialized data science. What is that? >> Industrialized data science is mostly about how data science is applied in the industry. It's less about more research work, but it's more about practical application of industry use cases in which we actually apply machine learning and artificial intelligence. >> What are some of the use cases at Western Digital for that application? >> One of the use case that we use is, we are in the business of creating new technology nodes and for creating new technology nodes we actually create a lot of data. And with that data, we actually look at, can we understand pattern recognition at very large scale? We're talking millions of wafers. Can we understand memory holes? The shape, the type, the curvature, circularity, radius, can we detect these patterns at scale? And then how can we detect if the memory hole is warped or deformed and how can we have machine learning do that for us? We also look at things like correlations during the manufacturing process. Strong correlations, weak correlations, and we try to figure out interactions between different correlations. >> Fantastic. So if we look at big data, it's probably applicable across every industry. How has it helped to transform Western Digital, that's been an institution here in Silicon Valley for a while? >> We in Western Digital we move mountains of data. That's just part of our job, right? And so we are the leaders in storage technology, people store data in Western Digital products, and so data's inherently very familiar to us. We actually deal with data on a regular basis. And now we've started confronting our data with data science. And we started confronting our data with machine learning because we are very aware that artificial intelligence, machine learning can bring a different value to that data. We can look at the insides, we can develop intelligence about how we build our storage products. What we do with our storage. Failure analysis is a huge area for us. So we're really tapping into our data to figure out how can we make artificial intelligence and machine learning ingrained in the way we do work. >> So from a cultural perspective, you've really done a lot to evolve the culture of Western Digital to apply the learnings, to improve the values that you deliver to all of your customers. >> Yes, believe it or not, we've become a data-driven company. That's amazing, because we've invested in our own data, and we've said "Hey, if we are going to store the world's data, we need to lead, from a data perspective" and so we've sort of embraced machine learning and artificial intelligence. We've embraced new algorithms, technologies that's out there we can tap into to look at our data. >> So from a machine learning, human perspective, in storage manufacturing, is there still a dependence on human insight where storage manufacturing devices are concerned, or are you seeing the machine learning really, in this case, take more of a lead? >> No, I think humans play a huge role, right? Because these are domain experts. We're talking about Ph.D.'s in material science and device physics areas so what I see is the augmentation between machine learning and humans, and the domain experts. Domain experts will not be able to scale. When the scale of wafer production becomes very large. So let's talk about 3 million wafers. How is a machine going to physically look at all the failure patterns on those wafers? We're not going to be able to scale just having domain expertise. But taking our core domain expertise and using that as training data to build intelligence models that can inform the domain expert and be smart and come up with all the ideas, that's where we want to be. >> Excellent. So you talked a little bit about the manufacturing process. Who are some of the other constituents that you collaborate with as chief data scientist at Western Digital that are demanding access to data, marketing, etcetera, what are some of those key collaborators for your group? >> Many of our marketing department, as well as our customer service department, we also have collaborations going on with universities, but one of the things we found out was when a drive fails, and it goes to our customer, it's much better for us to figure out the failure. So we've started modeling out all the customer returns that we've received, and look at that and see "How can we predict the life cycle of our storage?" And get to those return possibilities or potential issues before it lands in the hands of customers. >> That's excellent. >> So that's one area we've been focusing quite a bit on, to look at the whole life cycle of failures. >> You also talked about collaborating with universities. Share a little bit about that in terms of, is there a program for internships for example? How are you helping to shape the next generation of computer scientists? >> We are very strongly embedded in universities. We usually have a very good internship program. Six to eight weeks, to 12 weeks in the summer, the interns come in. Ours is a little different where we treat our interns as real value add. They come in, and they're given a hypothesis, or problem domain that they need to go after. And within six to eight weeks, and they have access to tremendous amounts of data, so they get to play with all this industry data that they would never get to play with. They can quickly bring their academic background, or their academic learning to that data. We also take really hard research-ended problems or further out problems and we collaborate with universities on that, especially Stanford University, we've been doing great collaborations with them. I'm super encouraged with Feliz's work on computer vision, and we've been looking into things around deep neural networks. This is an area of great passion for me. I think the cognitive computing space is just started to open up and we have a lot to learn from neural networks and how they work and where the value can be added. >> Looking at, just want to explore the internship topic for a second. And we're at the second annual Women in Data Science Conference. There's a lot of young minds here, not just here in person, but in many cities across the globe. What are you seeing with some of the interns that come in? Are they confident enough to say "I'm getting access to real world data I wouldn't have access to in school", are they confident to play around with that, test out a hypothesis and fail? Or do they fear, "I need to get this right right away, this is my career at stake?" >> It's an interesting dichotomy because they have a really short time frame. That's an issue because of the time frame, and they have to quickly discover. Failing fast and learning fast is part of data science and I really think that we have to get to that point where we're really comfortable with failure, and the learning we get from the failure. Remember the light bulb was invented with 99% negative knowledge, so we have to get to that negative knowledge and treat that as learning. So we encourage a culture, we encourage a style of different learning cycles so we say, "What did we learn in the first learning cycle?" "What discoveries, what hypothesis did we figure out in the first learning cycle, which will then prepare our second learning cycle?" And we don't see it as a one-stop, rather more iterative form of work. Also with the internships, I think sometimes it's really essential to have critical thinking. And so the interns get that environment to learn critical thinking in the industry space. >> Tell us about, from a skills perspective, these are, you can share with us, presumably young people studying computer science, maybe engineering topics, what are some of the traditional data science skills that you think are still absolutely there? Maybe it's a hybrid of a hacker and someone who's got, great statistician background. What about the creative side and the ability to communicate? What's your ideal data scientist today? What are the embodiments of those? >> So this is a fantastic question, because I've been thinking about this a lot. I think the ideal data scientist is at the intersection of three circles. The first circle is really somebody who's very comfortable with data, mathematics, statistics, machine learning, that sort of thing. The second circle is in the intersection of implementation, engineering, computer science, electrical engineering, those backgrounds where they've had discipline. They understand that they can take complex math or complex algorithms and then actually implement them to get business value out of them. And the third circle is around business acumen, program management, critical thinking, really going deeper, asking the questions, explaining the results, very complex charts. The ability to visualize that data and understand the trends in that data. So it's the intersection of these very diverse disciplines, and somebody who has deep critical thinking and never gives up. (laughs) >> That's a great one, that never gives up. But looking at it, in that way, have you seen this, we're really here at a revolution, right? Have you seen that data science traditionalist role evolve into these three, the intersection of these three elements? >> Yeah, traditionally, if you did a lot of computer science, or you did a lot of math, you'd be considered a great data scientist. But if you don't have that business acumen, how do you look at the critical problems? How do you communicate what you found? How do you communicate that what you found actually matters in the scheme of things? Sometimes people talk about anomalies, and I always say "is the anomaly structured enough that I need to care about?" Is it systematic? Why should I care about this anomaly? Why is it different from an alert? If you have modeled all the behaviors, and you understand that this is a different anomaly than I've normally seen, and you must care about it. So you need to have business acumen to ask the right business questions and understand why that matters. >> So your background in computer science, your bachelor's Ph.D.? >> Bachelor's and master's in computer science, mathematics, and statistics, so I've got a combination of all of those and then my business experience comes from being in the field. >> Lisa: I was going to ask you that, how did you get that business acumen? Sounds like it was by in-field training, basically on-the-job? >> It was in the industry, it was on-the-job, I put myself in positions where I've had great opportunities and tackled great business problems that I had to go out and solve, very unique set of business problems that I had to dig deep into figuring out what the solutions were, and so then gained the experience from that. >> So going back to Western Digital, how you're leveraging data science to really evolve the company. You talked about the cultural evolution there, which we both were mentioning off-camera, is quite a feat because it's very challenging. Data from many angles, security, usage, is a board level, boardroom conversation. I'd love to understand, and you also talked about collaboration, so talk to us a little bit about how, and some of the ways, tangible ways, that data science and your team have helped evolve Western Digital. Improving products, improving services, improving revenue. >> I think of it as when an algorithm or a machine learning model is smart, it cannot be a threat. There's a difference between being smart and being a threat. It's smart when it actually provides value. It's a threat when it takes away or does something you would be wanting to do, and here I see that initially there's a lot of fear in the industry, and I think the fear is related to "oh, here's a new technology," and we've seen technologies come in and disrupt in a major way. And machine learning will make a lot of disruptions in the industry for sure. But I think that will cause a shift, or a change. Look at our phone industry, and how much the phone industry has gone through. We never complain that the smart phone is smarter than us. (laughs) We love the fact that the smartphone can show us maps and it can send us in the right, of course, it sends us in the wrong direction sometimes, most of the time it's pretty good. We've grown to rely on our cell phones. We've grown to rely on the smartness. I look at when technology becomes your partner, when technology becomes your ally, and when it actually becomes useful to you, there is a shift in culture. We start by saying "how do we earn the value of the humans?" How can machine learning, how can the algorithms we built, actually show you the difference? How can it come up with things you didn't see? How can it discover new things for you that will create a wow factor for you? And when it does create a wow factor for you, you will want more of it, so it's more, to me, it's most an intent-based progress, in terms of a culture change. You can't push any new technology on people. People will be reluctant to adapt. The only way you can, that people adopt to new technologies is when they the value of the technology instantly and then they become believers. It's a very grassroots-level change, if you will. >> For the foreseeable future, that from a fear perspective and maybe job security, that at least in the storage and manufacturing industry, people aren't going to be replaced by machines. You think it's going to maybe live together for a very long, long time? >> I totally agree. I think that it's going to augment the humans for a long, long time. I think that we will get over our fear, we worry that the humans, I think humans are incredibly powerful. We give way too little credit to ourselves. I think we have huge creative capacity. Machines do have processing capacity, they have very large scale processing capacity, and humans and machines can augment each other. I do believe that the time when we had computers and we relied on our computers for data processing. We're going to rely on computers for machine learning. We're going to get smarter, so we don't have to do all the automation and the daily grind of stuff. If you can predict, and that prediction can help you, and you can feed that prediction model some learning mechanism by reinforced learning or reading or ranking. Look at spam industry. We just taught the Spam-a-Guccis to become so good at catching spam, and we don't worry about the fact that they do the cleansing of that level of data for us and so we'll get to that stage first, and then we'll get better and better and better. I think humans have a natural tendency to step up, they always do. We've always, through many generations, we have always stepped up higher than where we were before, so this is going to make us step up further. We're going to demand more, we're going to invent more, we're going to create more. But it's not going to be, I don't see it as a real threat. The places where I see it as a threat is when the data has bias, or the data is manipulated, which exists even without machine learning. >> I love though, that the analogy that you're making is as technology is evolving, it's kind of a natural catalyst >> Janet: It is a natural catalyst. >> For us humans to evolve and learn and progress and that's a great cycle that you're-- >> Yeah, imagine how we did farming ten years ago, twenty years ago. Imagine how we drive our cars today than we did many years ago. Imagine the role of maps in our lives. Imagine the role of autonomous cars. This is a natural progression of the human race, that's how I see it, and you can see the younger, young people now are so natural for them, technology is so natural for them. They can tweet, and swipe, and that's the natural progression of the human race. I don't think we can stop that, I think we have to embrace that it's a gift. >> That's a great message, embracing it. It is a gift. Well, we wish you the best of luck this year at Western Digital, and thank you for inspiring us and probably many that are here and those that are watching the livestream. Janet George, thanks so much for being on The Cube. >> Thank you. >> Thank you for watching The Cube. We are again live from the second annual Women in Data Science conference at Stanford, I'm Lisa Martin, don't go away. We'll be right back. (upbeat electronic music)

Published Date : Feb 3 2017

SUMMARY :

it's The Cube covering the Women in I'm Lisa Martin and we are going to be talking about. data science is applied in the industry. One of the use case How has it helped to in the way we do work. apply the learnings, to to look at our data. that can inform the a little bit about the the things we found out quite a bit on, to look at the helping to shape the next started to open up and we but in many cities across the globe. That's an issue because of the time frame, the ability to communicate? So it's the intersection of the intersection of I always say "is the So your background in computer science, comes from being in the field. problems that I had to You talked about the how can the algorithms we built, that at least in the I do believe that the time of the human race, Well, we wish you the We are again live from the second annual

ENTITIES

Entity	Category	Confidence
Janet	PERSON	0.99+
Lisa Martin	PERSON	0.99+
Janet George	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Six	QUANTITY	0.99+
Lisa	PERSON	0.99+
99%	QUANTITY	0.99+
Silicon Valley	LOCATION	0.99+
12 weeks	QUANTITY	0.99+
Stanford University	ORGANIZATION	0.99+
third circle	QUANTITY	0.99+
first circle	QUANTITY	0.99+
twenty years ago	DATE	0.99+
second circle	QUANTITY	0.99+
eight weeks	QUANTITY	0.99+
six	QUANTITY	0.99+
The Cube	TITLE	0.99+
ten years ago	DATE	0.99+
One	QUANTITY	0.98+
three	QUANTITY	0.98+
eight weeks	QUANTITY	0.98+
three circles	QUANTITY	0.98+
Women in Data Science Technical Conference	EVENT	0.98+
this year	DATE	0.98+
Feliz	PERSON	0.97+
Stanford	LOCATION	0.97+
three elements	QUANTITY	0.97+
one	QUANTITY	0.97+
Women in Data Science Conference 2017	EVENT	0.97+
both	QUANTITY	0.96+
Women in Data Science Conference	EVENT	0.96+
many years ago	DATE	0.96+
second learning cycle	QUANTITY	0.96+
Women in Data Science	EVENT	0.96+
one day event	QUANTITY	0.96+
first learning cycle	QUANTITY	0.94+
first learning cycle	QUANTITY	0.93+
today	DATE	0.91+
one area	QUANTITY	0.91+
Women in Data Science conference	EVENT	0.89+
second	QUANTITY	0.88+
millions of wafers	QUANTITY	0.87+
first	QUANTITY	0.86+
one-stop	QUANTITY	0.86+
about 3 million wafers	QUANTITY	0.84+
-a-Guccis	ORGANIZATION	0.81+
The Cube	ORGANIZATION	0.77+
University	ORGANIZATION	0.6+
second annual	QUANTITY	0.56+
2017	DATE	0.51+
Cube	PERSON	0.36+

Bill Andrews, ExaGrid | VeeamON 2022

(upbeat music) >> We're back at VeeamON 2022. We're here at the Aria in Las Vegas Dave Vellante with Dave Nicholson. Bill Andrews is here. He's the president and CEO of ExaGrid, mass boy. Bill, thanks for coming on theCUBE. >> Thanks for having me. >> So I hear a lot about obviously data protection, cyber resiliency, what's the big picture trends that you're seeing when you talk to customers? >> Well, I think clearly we were talking just a few minutes ago, data's growing like crazy, right This morning, I think they said it was 28% growth a year, right? So data's doubling almost just a little less than every three years. And then you get the attacks on the data which was the keynote speech this morning as well, right. All about the ransomware attacks. So we've got more and more data, and that data is more and more under attack. So I think those are the two big themes. >> So ExaGrid as a company been around for a long time. You've kind of been the steady kind of Eddy, if you will. Tell us about ExaGrid, maybe share with us some of the differentiators that you share with customers. >> Sure, so specifically, let's say in the Veeam world you're backing up your data, and you really only have two choices. You can back that up to disc. So some primary storage disc from a Dell, or a Hewlett Packard, or an NetApp or somebody, or you're going to back it up to what's called an inline deduplication appliance maybe a Dell Data Domain or an HPE StoreOnce, right? So what ExaGrid does is we've taken the best of both those but not the challenges of both those and put 'em together. So with disc, you're going to get fast backups and fast restores, but because in backup you keep weekly's, monthly's, yearly retention, the cost of this becomes exorbitant. If you go to a deduplication appliance, and let's say the Dell or the HPs, the data comes in, has to be deduplicated, compare one backup to the next to reduce that storage, which lowers the cost. So fixes that problem, but the fact that they do it inline slows the backups down dramatically. All the data is deduplicated so the restores are slow, and then the backup window keeps growing as the data grows 'cause they're all scale up technologies. >> And the restores are slow 'cause you got to rehydrate. >> You got to rehydrate every time. So what we did is we said, you got to have both. So our appliances have a front end disc cache landing zone. So you're right directed to the disc., Nothing else happens to it, whatever speed the backup app could write at that's the speed we take it in at. And then we keep the most recent backups in that landing zone ready to go. So you want to boot a VM, it's not an hour like a deduplication appliance it's a minute or two. Secondly, we then deduplicate the data into a second tier which is a repository tier, but we have all the deduplicated data for the long term retention, which gets the cost down. And on top of that, we're scale out. Every appliance has networking processor memory end disc. So if you double, triple, quadruple the data you double, triple, quadruple everything. And if the backup window is six hours at 100 terabyte it's six hours at 200 terabyte, 500 terabyte, a petabyte it doesn't matter. >> 'Cause you scale out. >> Right, and then lastly, our repository tier is non-network facing. We're the only ones in the industry with this. So that under a ransomware attack, if you get hold of a rogue server or you hack the media server, get to the backup storage whether it's disc or deduplication appliance, you can wipe out all the backup data. So you have nothing to recover from. In our case, you wipe it out, our landing zone will be wiped out. We're no different than anything else that's network facing. However, the only thing that talks to our repository tier is our object code. And we've set up security policies as to how long before you want us to delete data, let's say 10 days. So if you have an attack on Monday that data doesn't get deleted till like a week from Thursday, let's say. So you can freeze the system at any time and do restores. And then we have immutable data objects and all the other stuff. But the culmination of a non-network facing tier and the fact that we do the delayed deletes makes us the only one in the industry that can actually truly recover. And that's accelerating our growth, of course. >> Wow, great description. So that disc cache layer is a memory, it's a flash? >> It's disc, it's spinning disc. >> Spinning disc, okay. >> Yeah, no different than any other disc. >> And then the tiered is what, less expensive spinning disc? >> No, it's still the same. It's all SaaS disc 'cause you want the quality, right? So it's all SaaS, and so we use Western Digital or Seagate drives just like everybody else. The difference is that we're not doing any deduplication coming in or out of that landing zone to have fast backups and fast restores. So think of it like this, you've got disc and you say, boy it's too expensive. What I really want to do then is put maybe a deduplication appliance behind it to lower the cost or reverse it. I've got a deduplication appliance, ugh, it's too slow for backups and restores. I really want to throw this in front of it to have fast backups first. Basically, that's what we did. >> So where does the cost savings, Bill come in though, on the tier? >> The cost savings comes in the fact that we got deduplication in that repository. So only the most recent backup >> Ah okay, so I get it. >> are the duplicated data. But let's say you had 40 copies of retention. You know, 10 weekly's, 36 monthly's, a few yearly. All of that's deduplicated >> Okay, so you're deduping the stuff that's not as current. >> Right. >> Okay. >> And only a handful of us deduplicate at the layer we do. In other words, deduplication could be anywhere from two to one, up to 50 to one. I mean it's all over the place depending on the algorithm. Now it's what everybody's algorithms do. Some backup apps do two to one, some do five to one, we do 20 to one as well as much as 50 to one depending on the data types. >> Yeah, so the workload is going to largely determine the combination >> The content type, right. with the algos, right? >> Yeah, the content type. >> So the part of the environment that's behind the illogical air gap, if you will, is deduped data. >> Yes. >> So in this case, is it fair to say that you're trading a positive economic value for a little bit longer restore from that environment? >> No, because if you think about backup 95% of the customers restores are from the most recent data. >> From the disc cache. >> 95% of the time 'cause you think about why do you need fast restores? Somebody deleted a file, somebody overwrote a file. They can't go work, they can't open a file. It's encrypted, it's corrupted. That's what IT people are trying to keep users productive. When do you go for longer-term retention data? It's an SEC audit. It's a HIPAA audit. It's a legal discovery, you don't need that data right away. You have days and weeks to get that ready for that legal discovery or that audit. So we found that boundary where you keep users productive by keeping the most recent data in the disc cache landing zone, but anything that's long term. And by the way, everyone else is long term, at that point. >> Yeah, so the economics are comparable to the dedupe upfront. Are they better, obviously get the performance advance? >> So we would be a lot looped. The thing we replaced the most believe it or not is disc, we're a lot less expensive than the disc. I was meeting with some Veeam folks this morning and we were up against Cisco 3260 disc at a children's hospital. And on our quote was $500,000. The disc was 1.4 million. Just to give you an example of the savings. On a Data Domain we're typically about half the price of a Data Domain. >> Really now? >> The reason why is their front end control are so expensive. They need the fastest trip on the planet 'cause they're trying to do inline deduplication. >> Yeah, so they're chasing >> They need the fastest memory >> on the planet. >> this chips all the time. They need SSD on data to move in and out of the hash table. In order to keep up with inline, they've got to throw so much compute at it that it drives their cost up. >> But now in the case of ransomware attack, are you saying that the landing zone is still available for recovery in some circumstances? Or are you expecting that that disc landing zone would be encrypted by the attacker? >> Those are two different things. One is deletion, one is encryption. So let's do the first scenario. >> I'm talking about malicious encryption. >> Yeah, absolutely. So the first scenario is the threat actor encrypts all your primary data. What's does he go for next? The backup data. 'Cause he knows that's your belt and suspend is to not pay the ransom. If it's disc he's going to go in and put delete commands at the disc, wipe out the disc. If it's a data domain or HPE StoreOnce, it's all going to be gone 'cause it's one tier. He's going to go after our landing zone, it's going to be gone too. It's going to wipe out our landing zone. Except behind that we have the most recent backup deduplicate in the repository as well as all the other backups. So what'll happen is they'll freeze the system 'cause we weren't going to delete anything in the repository for X days 'cause you set up a policy, and then you restore the most recent backup into the landing zone or we can restore it directly to your primary storage area, right? >> Because that tier is not network facing. >> That's right. >> It's fenced off essentially. >> People call us every day of the week saying, you saved me, you saved me again. People are coming up to me here, you saved me, you saved me. >> Tell us a story about that, I mean don't give me the names but how so. >> I'll actually do a funnier story, 'cause these are the ones that our vendors like to tell. 'Cause I'm self-serving as the CEO that's good of course, a little humor. >> It's your 15 minutes of job. >> That is my 15 minutes of fame. So we had one international company who had one ExaGrid at one location, 19 Data Domains at the other locations. Ransomware attack guess what? 19 Data Domains wiped out. The one ExaGrid, the only place they could restore. So now all 20 locations of course are ExaGrids, China, Russia, Mexico, Germany, US, et cetera. They rolled us out worldwide. So it's very common for that to occur. And think about why that is, everyone who's network facing you can get to the storage. You can say all the media servers are buttoned up, but I can find a rogue server and snake my way over the storage, I can. Now, we also of course support the Veeam Data Mover. So let's talk about that since we're at a Veeam conference. We were the first company to ever integrate the Veeam Data Mover. So we were the first actually ever integration with Veeam. And so that Veeam Data Mover is a protocol that goes from Veeam to the ExaGrid, and we run it on both ends. So that's a more secure protocol 'cause it's not an open format protocol like SaaS. So with running the Veeam Data Mover we get about 30% more performance, but you do have a more secure protocol layer. So if you don't get through Veeam but you get through the protocol, boom, we've got a stronger protocol. If you make it through that somehow, or you get to it from a rogue server somewhere else we still have the repository. So we have all these layers so that you can't get at it. >> So you guys have been at this for a while, I mean decade and a half plus. And you've raised a fair amount of money but in today's terms, not really. So you've just had really strong growth, sequential growth. I understand it, and double digit growth year on year. >> Yeah, about 25% a year right now >> 25%, what's your global strategy? >> So we have sales offices in about 30 countries already. So we have three sales teams in Brazil, and three in Germany, and three in the UK, and two in France, and a lot of individual countries, Chile, Argentina, Columbia, Mexico, South Africa, Saudi, Czech Republic, Poland, Dubai, Hong Kong, Australia, Singapore, et cetera. We've just added two sales territories in Japan. We're adding two in India. And we're installed in over 50 countries. So we've been international all along the way. The goal of the company is we're growing nicely. We have not raised money in almost 10 years. >> So you're self-funding. You're cash positive. >> We are cash positive and self-funded and people say, how have you done that for 10 years? >> You know what's interesting is I remember, Dave Scott, Dave Scott was the CEO of 3PAR, and he told me when he came into that job, he told the VCs, they wanted to give him 30 million. He said, I need 80 million. I think he might have raised closer to a hundred which is right around what you guys have raised. But like you said, you haven't raised it in a long time. And in today's terms, that's nothing, right? >> 100 is 500 in today's terms. >> Yeah, right, exactly. And so the thing that really hurt 3PAR, they were public companies so you could see all this stuff is they couldn't expand internationally. It was just too damn expensive to set up the channels, and somehow you guys have figured that out. >> 40% of our business comes out of international. We're growing faster internationally than we are domestically. >> What was the formula there, Bill, was that just slow and steady or? >> It's a great question. >> No, so what we did, we said let's build ExaGrid like a McDonald's franchise, nobody's ever done that before in high tech. So what does that mean? That means you have to have the same product worldwide. You have to have the same spares model worldwide. You have to have the same support model worldwide. So we early on built the installation. So we do 100% of our installs remotely. 100% of our support remotely, yet we're in large enterprises. Customers racks and stacks the appliances we get on with them. We do the entire install on 30 minutes to about three hours. And we've been developing that into the product since day one. So we can remotely install anywhere in the world. We keep spares depots all over the world. We can bring 'em up really quick. Our support model is we have in theater support people. So they're in Europe, they're in APAC, they're in the US, et cetera. And we assign customers to the support people. So they deal with the same support person all the time. So everything is scalable. So right now we're going to open up India. It's the same way we've opened up every other country. Once you've got the McDonald's formula we just stamp it all over the world. >> That's amazing. >> Same pricing, same product same model, same everything. >> So what was the inspiration for that? I mean, you've done this since day one, which is what like 15, 16 years ago. Or just you do engineering or? >> No, so our whole thought was, first of all you can't survive anymore in this world without being an international company. 'Cause if you're going to go after large companies they have offices all over the world. We have companies now that have 17, 18, 20, 30 locations. And there were in every country in the world, you can't go into this business without being able to ship anywhere in the world and support it for a single customer. You're not going into Singapore because of that. You're going to Singapore because some company in Germany has offices in the U.S, Mexico Singapore and Australia. You have to be international. It's a must now. So that was the initial thing is that, our goal is to become a billion dollar company. And we're on path to do that, right. >> You can see a billion. >> Well, I can absolutely see a billion. And we're bigger than everybody thinks. Everybody guesses our revenue always guesses low. So we're bigger than you think. The reason why we don't talk about it is we don't need to. >> That's the headline for our writers, ExaGrid is a billion dollar company and nobody's know about it. >> Million dollar company. >> On its way to a billion. >> That's right. >> You're not disclosing. (Bill laughing) But that's awesome. I mean, that's a great story. I mean, you kind of are a well kept secret, aren't you? >> Well, I dunno if it's a well kept secret. You know, smaller companies never have their awareness of big companies, right? The Dells of the world are a hundred billion. IBM is 70 billion, Cisco is 60 billion. Easy to have awareness, right? If you're under a billion, I got to give a funny story then I think we got to close out here. >> Oh go ahead please. >> So there's one funny story. So I was talking to the CIO of a super large Fortune 500 company. And I said to him, "Just so who do you use?" "I use IBM Db2, and I use, Cisco routers, and I use EMC primary storage, et cetera. And I use all these big." And I said, "Would you ever switch from Db2?" "Oh no, the switching costs would kill me. I could never go to Oracle." So I said to him, "Look would you ever use like a Pure Storage, right. A couple billion dollar company." He says, "Who?" >> Huh, interesting. >> I said to him, all right so skip that. I said, "VMware, would you ever think about going with Nutanix?" "Who?" Those are billion dollar plus companies. And he was saying who? >> Public companies. >> And he was saying who? That's not uncommon when I talk to CIOs. They see the big 30 and that's it. >> Oh, that's interesting. What about your partnership with Veeam? Tell us more about that. >> Yeah, so I would actually, and I'm going to be bold when I say this 'cause I think you can ask anybody here at the conference. We're probably closer first of all, to the Veeam sales force than any company there is. You talk to any Veeam sales rep, they work closer with ExaGrid than any other. Yeah, we are very tight in the field and have been for a long time. We're integrated with the Veeam Data Boomer. We're integrated with SOBR. We're integrated with all the integrations or with the product as well. We have a lot of joint customers. We actually do a lot of selling together, where we go in as Veeam ExaGrid 'cause it's a great end to end story. Especially when we're replacing, let's say a Dell Avamar to Dell Data Domain or a Dell Network with a Dell Data Domain, very commonly Veeam ExaGrid go in together on those types of sales. So we do a lot of co-selling together. We constantly train their systems engineers around the world, every given week we're training either inside sales teams, and we've trained their customer support teams in Columbus and Prague. So we're very tight with 'em we've been tight for over a decade. >> Is your head count public? Can you share that with us? >> So we're just over 300 employees. >> Really, wow. >> We have 70 open positions, so. >> Yeah, what are you looking for? Yeah, everything, right? >> We are looking for engineers. We are looking for customer support people. We're looking for marketing people. We're looking for inside sales people, field people. And we've been hiring, as of late, major account reps that just focus on the Fortune 500. So we've separated that out now. >> When you hire engineers, I mean I think I saw you were long time ago, DG, right? Is that true? >> Yeah, way back in the '80s. >> But systems guy. >> That's how old I am. >> Right, systems guy. I mean, I remember them well Eddie Castro and company. >> Tom West. >> EMV series. >> Tom West was the hero of course. >> The EMV 4000, the EMV 20,000, right? >> When were kids, "The Soul of a New Machine" was the inspirational book but anyway, >> Yeah Tracy Kidder, it was great. >> Are you looking for systems people, what kind of talent are you looking for in engineering? >> So it's a lot of Linux programming type stuff in the product 'cause we run on a Linux space. So it's a lot of Linux programs so its people in those storage. >> Yeah, cool, Bill, hey, thanks for coming on to theCUBE. Well learned a lot, great story. >> It's a pleasure. >> That was fun. >> Congratulations. >> Thanks. >> And good luck. >> All right, thank you. >> All right, and thank you for watching theCUBE's coverage of VeeamON 2022, Dave Vellante for Dave Nicholson. We'll be right back right after this short break, stay with us. (soft beat music)

Published Date : May 17 2022

SUMMARY :

We're here at the Aria in Las Vegas And then you get the attacks on the data You've kind of been the steady and let's say the Dell or And the restores are slow that's the speed we take it in at. and the fact that we So that disc cache layer No, it's still the same. So only the most recent backup are the duplicated data. Okay, so you're deduping the deduplicate at the layer we do. with the algos, right? So the part of the environment 95% of the customers restores 95% of the time 'cause you think about Yeah, so the economics are comparable example of the savings. They need the fastest trip on the planet in and out of the hash table. So let's do the first scenario. So the first scenario is the threat actor Because that tier day of the week saying, I mean don't give me the names but how so. 'Cause I'm self-serving as the CEO So if you don't get through Veeam So you guys have been The goal of the company So you're self-funding. what you guys have raised. And so the thing that really hurt 3PAR, than we are domestically. It's the same way we've Same pricing, same product So what was the inspiration for that? country in the world, So we're bigger than you think. That's the headline for our writers, I mean, you kind of are a The Dells of the world So I said to him, "Look would you ever I said, "VMware, would you ever think They see the big 30 and that's it. Oh, that's interesting. So we do a lot of co-selling together. that just focus on the Fortune 500. Eddie Castro and company. in the product 'cause thanks for coming on to theCUBE. All right, and thank you for watching

ENTITIES

Entity	Category	Confidence
Veeam	ORGANIZATION	0.99+
Japan	LOCATION	0.99+
Dave Scott	PERSON	0.99+
Columbus	LOCATION	0.99+
Brazil	LOCATION	0.99+
Germany	LOCATION	0.99+
McDonald	ORGANIZATION	0.99+
India	LOCATION	0.99+
Dell	ORGANIZATION	0.99+
Bill Andrews	PERSON	0.99+
$500,000	QUANTITY	0.99+
UK	LOCATION	0.99+
three	QUANTITY	0.99+
Cisco	ORGANIZATION	0.99+
France	LOCATION	0.99+
Tracy Kidder	PERSON	0.99+
Europe	LOCATION	0.99+
Seagate	ORGANIZATION	0.99+
Dave Nicholson	PERSON	0.99+
100%	QUANTITY	0.99+
U.S	LOCATION	0.99+
two	QUANTITY	0.99+
Australia	LOCATION	0.99+
six hours	QUANTITY	0.99+
US	LOCATION	0.99+
Singapore	LOCATION	0.99+
100 terabyte	QUANTITY	0.99+
17	QUANTITY	0.99+
Prague	LOCATION	0.99+
15 minutes	QUANTITY	0.99+
1.4 million	QUANTITY	0.99+
200 terabyte	QUANTITY	0.99+
40 copies	QUANTITY	0.99+
30 minutes	QUANTITY	0.99+
3PAR	ORGANIZATION	0.99+
Western Digital	ORGANIZATION	0.99+
Nutanix	ORGANIZATION	0.99+
Dave Vellante	PERSON	0.99+
Oracle	ORGANIZATION	0.99+
The Soul of a New Machine	TITLE	0.99+
20	QUANTITY	0.99+
500 terabyte	QUANTITY	0.99+
80 million	QUANTITY	0.99+
30 million	QUANTITY	0.99+
ExaGrid	ORGANIZATION	0.99+
three sales teams	QUANTITY	0.99+
95%	QUANTITY	0.99+
10 years	QUANTITY	0.99+
first scenario	QUANTITY	0.99+
Monday	DATE	0.99+
10 days	QUANTITY	0.99+
IBM	ORGANIZATION	0.99+
18	QUANTITY	0.99+
Thursday	DATE	0.99+
60 billion	QUANTITY	0.99+
Hewlett Packard	ORGANIZATION	0.99+
50	QUANTITY	0.99+
70 billion	QUANTITY	0.99+
two sales territories	QUANTITY	0.99+
Tom West	PERSON	0.99+
Mexico	LOCATION	0.99+
20 locations	QUANTITY	0.99+
Bill	PERSON	0.99+
Argentina	LOCATION	0.99+
Linux	TITLE	0.99+
70 open positions	QUANTITY	0.99+
Poland	LOCATION	0.99+
five	QUANTITY	0.99+
Las Vegas	LOCATION	0.99+
first	QUANTITY	0.99+
One	QUANTITY	0.99+
Columbia	LOCATION	0.99+
Dubai	LOCATION	0.99+
a minute	QUANTITY	0.99+

Chen Goldberg, Google Cloud | CUBE Conversation

(peaceful music) >> Welcome to this cube conversation. I'm Dave Nicholson, and I am delighted to welcome back to the cube, cube veteran, Chen Goldberg, VP of engineering from Google. Chen, welcome back to the cube. >> Hey Dave, super happy to be here. >> Absolutely delighted to have you here. Let's dive right into this conversation. There was a, there was a blog post this week, talking about Google Cloud putting a lot of weight behind this idea of principles for software development. What are those principles and why are they important? >> The three principles that we put in that blog post is open, easy, and transformative. And I think what's really important to recognize with the three principles that those are not new principles, not for Google Cloud, and definitely not for me. I joined Google about, a little bit over five years ago. Right when just Kubernetes started to lead Kubernetes and Google Kubernetes engine team. And we immediately recognized, the idea of open and the importance of flexibility and choice is a foundation to the idea of Kubernetes and portability workloads. But pretty early on, it was clear that it's not enough just to have portability and flexibility because it creates a lot of complexity. So how can we still have that without creating a trade-off or tension for our customers? So really making sure that everything is also easy. You know, and one of the things, I use, I like to say it's not just portability of workloads, but also portability of skills and you achieve that through consistent experience, right? A lot of automation. And when you bring all of those things together, what I love about Google Cloud is that, you know, I'm an infrastructure person. I've always been infrastructure person. And what excites me the most is seeing others take this innovation and, and really empowers developers to make amazing, or, you know, unique ideas, a reality. And that's really the foundation principles for Google Cloud. >> So how does that translate into, from a customer perspective? >> So I would just start with some customer examples, right? Starting from, their perspective. So when we think about open, this is actually part of the, our customers cloud strategy, right? You say cloud, you immediately think only about public cloud, but from our customer perspectives, right? They think about public clouds, right? Most of them have more than one cloud, but they also think about the private cloud, you know, IOT edge and having that openness and flexibility to choose where they can run their workload, is critical. It's critical for them. What I hear mostly is of course, innovation, managing costs, and also making sure that they are not locked out of innovation that happens for example, in any cloud or, or somewhere else. So that's a really a key consideration for our customers when they think about their cloud strategy. The second thing that open matters is that it's really hard to hire talent that is expert and has the right skills. And we see that by using a leveraging open source technologies, it actually makes it easier to our customers to hire the best talent there is in the industry. At one of the previous Google Cloud Next sessions, we had the Loblaw for example, which is the biggest grocery in Canada. And, you know, we were joking on stage, that even though at our hiring for grocery shop, they still can hire the best talent because they are using the best technologies out there in the industry. So that's one, if you think about the importance of easy, I would just call out Western Digital that we've just announced how they decided to standardize on Anthos for their cloud strategy, right? Both of course, Google cloud platform, but On Prem and the Edge. And for them what's important is that when they have all of their amazing developers and operators, how can they provide them reach experience, right. We don't want our developers or operators to spend time on things that can be automated or managed by others. So having a smooth, intuitive experience is really critical. And we we've been announcing some new stuff like a, a Google Cloud deploy and really integrating the entire experience, especially integration for managing, deploying directly to Google Kubernetes engine. And of course, one of my favorite is Jiechi autopilot, which really takes all the goodness with Kubernetes and automatically managing. And then transformative, this is like what I said before, unleashing innovation. And we see Wendy's, for example, right, they want to actually have AI machine learning at run time at their branches, which will allow them to create a new experience for their customers. So this is how we see customers really appreciate these three principles. >> So whenever the subject of Kubernetes and Google comes up, we have to talk Anthos. We're now into what year three of Anthos. How has adoption looked what's the latest on that front? >> That has been really great. We actually have been seeing a 500% growth on the end of Q2 of year over year. And it's important you know to mention that the journey with Anthos is not something new, but something that we have built with our customers when they really love the experience they have on GCP, but needed to innovate elsewhere and not just on Google Cloud. So we've been seeing that, you know, I mentioned the Western Digital, blah, blah, and Wendy's we also have customers like MLB, which is really exciting how they've changed their entire fans' experience using Anthos. And for them, again, it was both the easy part, right? How can I deal with that complexity of having compute and storage everywhere in every one of the stadiums, but also how can I use AI and machine learning, which is unique to Google Cloud in order to create unique experiences for the fans, at real time, of course. >> Yeah. Now you've, touched on this a bit already, if you had to, if you thought about someone reviewing Anthos, their Anthos experience, because we're in the midst of people adopting Anthos and becoming new to Anthos at this point. What does a delighted customers response sound like to you? What is that Yelp review that they would write? If they were telling people we, doubled down on Anthos and we are thrilled because, fill in the blank for a second. >> The first thing that comes to mind is that it works everywhere and the developer experience that comes with it, right? So we have, of course the platform and the infrastructure, but where Anthos really shine is that experience, on top of thinking about all developers and operators that can really work in every environment without paying too much attention to that. And just having that intuitive experience, right? If you go to the Google Cloud console, you see all your clusters, and now we're actually also going to add your VMs into that view, and you can use tools like Anthos config managers, and Anthos service mash to manage your security posture or the configuration in all of those environments. >> So we hear a lot about Multicloud. Multicloud is fantastic, but it sounds like, dealing with the complexity associated with Multicloud is something that Anthos definitely helps with. >> Yes, you know, Google is best with complexity at scale, we've been running containers and really large environments for many years. And some of those principles really, you know, have been fundamental to the way we've started with Kubernetes. So the idea of the declarative intent and automation is really critical in managing large environment and high complexity because in those environments, lots of things can change, but with the declarative approach, you don't have to anticipate everything that is going to change, but you need to know what is your desired state. And that's really one way that Anthos is leveraging the Kubernetes primitives and those ideas to manage different types of environments. In addition to that, it's actually really adding that layer that I talked about before, around the easy can I make sure that my tools, right, if it's, for example, a cloud hybrid build or cloud deploy or Anthos service manager, Anthos config manager, can I make sure that this UI, the CLI the API will be consistent in all of those environments? Can I view in one place, all of my clusters, all of my applications, and this is really where Anthos shines. >> So the cloud data foundation had a, had to get together at the same time as, Google Cloud Next. And there's been a lot of discussion around topics like security. I just like to get your thoughts on, you know, what what's at the forefront of your mind, working in engineering at Google, working in this world where people are deploying Anthos, working in a world where in a multi-cloud environment, you don't necessarily have control as vice president of engineering at Google over what's happening in these other clouds. So what are some of the things that are at the front of your mind is security one of them, what are your thoughts? >> Security is top of mine. Similar to all of our customers and definitely internally. And there are many things that we are very worried about or create some risks. You know, we've just started talking about the secure central supply chain, by building with open source, how can we make sure that everything is secure, right? Then we know what is the contribution that's from the software that we are delivering, how can we make sure that the security posture is portable, right? We talked about workloads portability. We talked about skills portability, and experience, but really I think the next phase for us as an industry is to think about security posture portability. Can I really apply the same policy everywhere and still make sure that I have the right controls in place, which will have to be different depending on the environment, and to make sure that that really is the case. So lots of work around that, and again, talking about the other things we talked about. We talked about open and flexibility, how can you make sure that it's easy? One of the areas that we are very excited about is really around binary authorization, for example. So when you use our tools like cloud build, cloud deploy, artifact, registry, you can get your container images automatically scanned for vulnerabilities and tools like onto service mesh, which allows you to actually manage your security posture, traffic management, who can access what without doing any changes to your applications. >> Fantastic stuff. As we, as we wrap up our time here, do you have any final thoughts on the direction of cloud where we are in the adoption curve? You know, by some estimates, something like 75% of IT is still happening on premises. There've been some announcements coming out of Cloud Next regarding the ability to run all sorts of Google goodness on premises. So we seem to all be acknowledging that we're going to be in a bit of a hybrid world, in addition to a multicloud world, moving forward. Do you want to place any bets on, on when we'll hit the 50, 50 mark or the 25% on premises, 75% cloud mark. What do you think? >> Yeah, I'm not the best gambler to be honest, but I do have a thought about that. I think what's interesting is that customers started to talk, you know, few years back, it was, hey, I have my on-prem environment and I have the cloud. How can they, these two work together. And now what we see our customers talking, you know, they're on premises, their edge is part of their cloud strategy. It's not separated. And I think this is what we'll see more and more of, right? Regardless if this is your private cloud or public cloud, your edge, we would like to have a cloud like experience in that environment and consistency. And of course, we would love to leverage all the goodness of the cloud. If it's like machine learning, AI, and other capabilities, automation, everywhere we go. So I think this is the biggest change we're starting to see. And in addition to that, I think we will see, today everybody are already multicloud cloud, right? If it's recquisitions and just by cause of bottom up culture, you know, people choose different services. And I expect we'll see more strategic thinking about our customers multicloud strategy. Where do I deploy my workloads? What are the benefits? If it's latency, if it's specific services that are available, maybe cost, we'll see the customers becoming more intentional about that and this is really exciting. >> Well Chen, amazing insights. It's obvious why you're a cube veteran. It's obviously why we seek you out for your counsel and guidance on a variety of subjects. Thank you so much for spending time with us today in this cube conversation. With that I'd like to thank you for joining us. Until next time, I'm Dave Nicholson, thanks for joining (peaceful music)

Published Date : Oct 29 2021

SUMMARY :

and I am delighted to Absolutely delighted to have you here. And that's really the foundation the private cloud, you know, the latest on that front? but something that we have What is that Yelp review and Anthos service mash to is something that Anthos everything that is going to that are at the front of One of the areas that we are regarding the ability to run all is that customers started to With that I'd like to

ENTITIES

Entity	Category	Confidence
Dave Nicholson	PERSON	0.99+
Canada	LOCATION	0.99+
Dave	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Chen Goldberg	PERSON	0.99+
75%	QUANTITY	0.99+
50	QUANTITY	0.99+
Google	ORGANIZATION	0.99+
500%	QUANTITY	0.99+
25%	QUANTITY	0.99+
Chen	PERSON	0.99+
Both	QUANTITY	0.99+
Yelp	ORGANIZATION	0.99+
Anthos	TITLE	0.99+
today	DATE	0.98+
one	QUANTITY	0.98+
two	QUANTITY	0.98+
both	QUANTITY	0.98+
first thing	QUANTITY	0.97+
second thing	QUANTITY	0.97+
Multicloud	TITLE	0.97+
three principles	QUANTITY	0.97+
Wendy	ORGANIZATION	0.96+
Cloud Next	TITLE	0.96+
this week	DATE	0.96+
Wendy	PERSON	0.95+
more than one cloud	QUANTITY	0.95+
Kubernetes	PERSON	0.94+
Google Cloud	TITLE	0.93+
Google Cloud	TITLE	0.92+
Kubernetes	TITLE	0.91+
one place	QUANTITY	0.9+
One	QUANTITY	0.9+
Anthos	ORGANIZATION	0.87+
Kubernetes	ORGANIZATION	0.83+
one way	QUANTITY	0.82+
five years ago	DATE	0.79+
end of Q2	DATE	0.77+
year three	QUANTITY	0.76+
Google Kubernetes	TITLE	0.76+
few years	DATE	0.73+
Jiechi	ORGANIZATION	0.72+
Loblaw	PERSON	0.71+
GCP	TITLE	0.68+
MLB	ORGANIZATION	0.58+
second	QUANTITY	0.58+
over	DATE	0.56+

Phil Bullinger V1

>>from the Cube Studios in >>Palo Alto and Boston connecting with thought >>leaders all around the world. This is a cube conversation. >>Hey, welcome back, everybody. Jeff Frick here with the Cube. We're in our Palo Alto Studios Cove. It is still going on. So, uh, all of our all of the interviews continue to be remote, but we're excited to have Ah, Cube alumni hasn't been on for a long time, but this guy has been in the weeds of the storage industry for a very, very long time, and we're happy to, uh, I have a mon and get an update because there continues to be a lot of exciting developments. He's Phill Bollinger. Ah, he is the SVP and general manager Data center business unit from Western Digital. Joining us, I think from Colorado. So, Phil, great to see you. How is the weather in Colorado today? >>Hi, Jeff. It's great to be here. Well, it's It's a hot, dry summer here. I'm sure like a lot of places. Yeah, enjoying enjoying this summer through these unusual times it >>is. It is unusual times, but fortunately, there's great things like the Internet and heavy duty. Ah, compute and store out there so we can we can get together this way. So let's jump into it. You've been in the business a long time. You've been a Western digital, your DMC you worked on I salon and you were at storage companies before that. And you've seen kind of this never ending up into the right slope that we see, you know, kind of ad nauseam. In terms of the amount of storage demands. It's not going anywhere but up in police. Increased complexity in terms of unstructured data, sources of data, speed of data, you know, the kind of classic big V's of big data. So I wonder before we jump into specifics if you can kind of share your perspective because you've been kind of sitting in the catbird seat. And Western Digital's a really unique company. You not only have solutions, but you also have media that feeds other people solutions. So you guys are really, you know, seeing. And ultimately all this computes gotta put this data somewhere, and a whole lot of it's in our western digital. >>Yeah, it's It's a great a great intro there. Yeah, it's been interesting, you know, through my career. I've seen a lot of advances in storage technology. Uh, you know, speeds and feeds like we often say, But you know, the advancement through mechanical innovation, electrical innovation, chemistry, physics, you know, just the relentless growth of data has been, has been driven in many ways by the relentless acceleration and innovation of our ability to store that data. And that's that's been a very virtuous cycle through you know what for me has been more than 30 years and in enterprise storage there are some really interesting changes going on that I think if you think about it in a relatively short amount of time, data has gone from, you know, just kind of this artifact of our digital lives, um, to the very engine that's driving the global economy, um, our jobs, our relationships, our health, our security. They all depend on data on for most companies, kind of irrespective of size. How you use data, how you how you store it, how you monetize it, how you use it to make better decisions to improve products and services. You know, it becomes not just a matter of whether your company's going to thrive and I bet in many industries it's it's almost an existential question. Is, is your company going to be around in the future? And it and it depends on how well you're using data. So this this drive toe capitalize on the value of data is is pretty significant. >>It's Ah, it's a really interesting topic. We've had a number of conversations around trying to get, like a book value of data, if you will. And I think there's a lot of conversations, whether it's accounting, kind of way or finance or kind of of good will of how do you value this data? But I think we see it intrinsically in a lot of the big companies that are really database, like the Facebooks and the Amazons and the Netflix and the Googles and those >>types >>of companies where it's really easy to see. And if you see you know the valuation that they have compared to their book value of assets, right, it's really baked into there. So it's it's it's fundamental to going forward. And then we have this thing called Covet Hit, which, you know, >>you've >>seen on the media on social media, right? What drove your digital transformation. The CEO CIO, the CMO, the board Rick over 19. And it became this light switch moment where your opportunities to think about it or no more, you've got to jump in with both feet. And it's really interesting to your point that it's the ability to store this and think about it differently as an asset driving business value versus a cost that I t has >>to >>accommodate to put this stuff somewhere. So it's a really different kind of a mind shift and really changes the investment equation for companies like Western Digital about how people should invest in higher performance and higher capacity and more unified it in kind of democratizing the accessibility that data to a much greater set of people with tools that can now start making much more business line and in line decisions than just the data scientists you know, kind of on mahogany row. >>Yeah, like as you mentioned Jeff Inherit Western Digital. We have such a unique kind of perch in the industry to see all the dynamics in the ODM space and the hyper scale space and the channel really across all the global economy's about this this growth of data. I have worked at several companies and have been familiar with what I would have called big data projects and and, ah, fleets in the past. But the Western digital you have to move the decimal point, you know, quite a few digits to the right to get to get the perspective that that we have on just the volume of data, that the world is just relentlessly, insatiably consuming. Just a couple examples for for our Dr Projects we're working on now, our capacity enterprise Dr. Projects. You know, we used to do business case analyses and look at their life cycle. Pass it ease and we measure them and exabytes and not anymore. Now we're talking about Zeta Bytes were actually measuring capacity Enterprise drive families in terms of how many's petabytes they're gonna ship in their life cycle. And if we look at just the consumption of this data the last 12 months of Industry tam for capacity enterprise, compared to the 12 months prior to that, that annual growth rate was north of 60%. So it's it's rare to see industries that are that are growing at that pace. And so the world is just consuming immense amounts of data. And as you mentioned, the dynamics have been both an accelerant in some areas as well as headwinds and others. But it's certainly accelerated digital transformation. I think a lot of companies were talking about digital transformation and and, um, hybrid models. And Covert has really accelerated that. And it's certainly driving continues to drive just this relentless need toe to store and access and take advantage of data. Yeah, >>well, filling In advance of this interview, I pulled up the old chart right with with the all the different bytes, right, kilobytes, megabytes, gigabytes, terabytes, petabytes, exabytes and petabytes. And just just for the Wikipedia page. What is is that a byte, a zoo? Much information as there are grains of sand in all the world's beaches. For one fight, you're talking about thinking in terms of those units. I mean, that is just mind boggling to think that that is the scale in which we're operating. >>It's really hard to get your head wrapped around a set amount of storage. And, you know, I think a lot of the industry thinks when we say that a byte scale era that It's just a buzzword. But I'm here to say it's a real thing where we're measuring projects and in terms of petabytes, that's >>amazing. Let's jump into some of the technology. So I've been fortunate enough here at the Cube toe to be there at a couple of major announcements along the way. We talked before we turned the cameras on the helium announcement and having the hard drive sit in the in the fish bowl, um, to get off types of interesting benefits from this less dense air that is helium versus oxygen. I was down at the mammary and hammer announcement, which was pretty interesting. Big, big, heavy technology moves there to again increase the capacity of the hard drive based systems. You guys are doing a lot of stuff on. This five I know is an open source projects. You guys have a lot of things happening, but now there's this new thing, this new thing called zoned storage. So first off before we get into, why do we need zone storage? And really, what does it now bring to the table in terms of ah, capability? >>Yeah, Great question, Jeff. So why now, right. I as I mentioned, you know, storage. I've been in storage for quite some time in the last. Let's just say, in the last decade we've seen the advent of the hyper scale model and certainly the, you know, a whole another explosion, level of, of data and just the veracity with which the hyper scaler is can create and consume and process and monetize data. And, of course, with that has also come a lot of innovation, frankly, in the compute space around had a process that data and moving from, you know, what was just a general purpose CPU model to GP use and DP use. And so we've seen a lot of innovation on that. But you know, frankly, in the storage side, we haven't seen much change at all in terms of how operating systems applications, final systems, how they actually use the storage or communicate with the storage. And sure we've seen, you know, advances in storage capacities. Hard drives have gone from 2 to 4 to 8 to 10 to 14 16 and now are leading 18 and 20 terabyte hard drives and similarly on the SSD side, you know, now we're dealing with the complexities of seven and 15 and 30 terabytes. So things have gotten larger, as you would expect, but and and some interfaces have improved, I think Envy Me, which we'll talk about, has been nice advance in the industry. It's really now brought a very modern, scalable, low latency, multi threaded interface to a NAND flash to take advantage of the inherent performance of transistor based, persistent storage. But really, when you think about it hasn't changed a lot and so but what has changed his workloads? One thing that definitely has evolved in the space of the last decade or so is this. The thing that's driving a lot of this explosion of data and industry is around workloads that I would characterize as a sequential in nature there, see, really captured and written. They also have a very consistent lifecycle, so you would write them in a big chunk. You would read them, uh, maybe in smaller pieces, but the lifecycle of that data we can treat more as a chunk of data, but the problem is applications. Operating systems. File systems continue to interface with storage, using paradigms that are, you know, many decades old, they'll find 12 bite or even four K sectors. Size constructs were developed in, you know, in the hard drive industry, just as convenient paradigms to structure what is unstructured sea of magnetic grains into something structured that can be used to store and access data. But the reality is, you know, when we talk about SSD is structured really matters. And so these what has changed in the industry as the workloads are driving very, very fresh looks at how more intelligence could be applied to that application OS storage device interface to drive much greater officials. >>Right? So there's there's two things going on here that I want to drill down on one hand. You know, you talked about kind of the introduction of NAND flash Ah, and treating it like you did generically. You did a regular hard drive, but but you could get away and you could do some things because the interface wasn't taking full advantage of the speed that was capable in the nan. But envy me has changed that and forced kind of getting getting rid of some of those inefficient processes that you could live with. So it's just kind of classic. Next next level step up and capabilities. One is you got the better media. You just kind of plug it into the old way. Now, actually, you're starting to put in processes that take full advantage of the speed that that flash has. And I think you know, obviously, prices have come down dramatically since the first introduction. And for before, we always kind of clustered offer super high end, super low latency, super high value APS. You know, it just continues to Teoh to spread and proliferate throughout the data center. So, you know what did envy me force you to think about in terms of maximizing, you know, kind of the return on the NAND and flash? >>Yeah, yeah, in envy me, which, you know, we've been involved in the standardization after I think it's been a very successful effort, but we have to remember Envy me is is about a decade old, you know, or even more When the original work started around defining this this interface and but it's been very successful, you know, the envy, any standards, bodies, very productive, you know, across company effort, it's really driven a significant change. And what we see now is the rapid adoption of Envy Me in all data center architectures. Whether it's a very large hyper scale to, you know, classic on prim enterprise to even, you know, smaller applications. It's just a very efficient interface mechanism for connecting SSD, ease and Teoh into a server, you know, So the we continue to see evolution and envy me, which is great, and we'll talk about Z and s. Today is one of those evolutions. We're also very keenly interested in VM e protocol over fabrics. And so one of the things that Western Digital has been talking about a lot lately is incorporating Envy me over fabrics as a mechanism for now connecting shared storage into multiple post architectures. We think this is a very attractive way to build shared storage architectures in the future that are scalable, that air compose herbal that really are more have a lot more agility with respect two rack level infrastructure and applying that infrastructure to applications. Right >>now, one thing that might strike some people it's kind of counterintuitive is is within the zone, um, storage and zoning off parts of the media to think of the data also kind of in these big chunks, is it? It feels contrary to kind of optimization that we're seeing in the rest of the data center. Right? So smaller units of compute smaller units of store so that you can assemble and disassemble them in different quantities as needed. So what was the special attributes that you had to think about and and actually come back and provide a benefit in actually kind of re chunking, if you will in the zones versus trying to get as atomic as possible? >>Yeah, It's a great question, Jeff, and I think it's maybe not intuitive in terms of why zone storage actually creates a more efficient storage paradigm when you're storing stuff essentially in larger blocks of data. But if this is really where the intersection of structure and workload and sort of the nature of the data all come together, uh, if you turn back the clock, maybe 45 years when SMR hard drives host managers from our hard drives first emerged on the scene, this was really taking advantage of the fact that the right head on a hard describe is larger than the reader can't reach. It could be much smaller, and so then the notion of overlapping or singling the data on the drive giving the read had a smaller target to read. But the writer a larger right pad to write the data I could. Actually, what we found was it increases areal density significantly, Um, and so that was really the emergence of this notion of sequentially written larger blocks of data being actually much more efficiently stored. When you think about physically how it's being stored, what is very new now and really gaining a lot of traction is is the the SSD corollary to tomorrow in the hard drive. On the SSD side, we have the CNS specification, which is very similarly where you divide up a name space of an SSD and two fixed size zones, and those zones are written sequentially. But now those zones are are intimately tied to the underlying physical architecture of the NAND itself. The dies, the planes, the the three pages, the the race pages so that in treating data as a black, you're actually eliminating a lot of the complexity and the work that an SSD has to do to emulate a legacy hard drive. And in doing so, you're increasing performance and endurance and and the predictable performance of the device. >>I just love the way that that, you know, you kind of twist the lens on the problem and and on one hand, you know, by rule just looking at my notes of his own storage devices, the CS DS introduced a number of restrictions and limitations and and rules that are outside the full capabilities of what you might do. But in doing so in aggregate, the efficiency and the performance of the system in the hole is much, much better, even though when you first look at you think it's more of a limiter, but it's actually opens up. I wonder if there's any kind of performance stats you can share or any kind of empirical data, just to >>get people kind >>of a feel for what? That what that comes out as >>so if you think about the potential of zone storage in general, when again, When I talk about zone storage, there's two components. There's an HDD component of zone storage that we that we refer to as S. Some are, and there's an SSD version of that that we call Z and s So you think about SMR. The value proposition. There is additional capacity so effectively in the same Dr architecture with with, you know, roughly the same bill of material used to build the drive. We can overlap or single the data on the drive and generate for the customer additional capacity. Today with our 18 20 terabyte offerings, that's on the order of just over 10% but that Delta is going to increase significantly, going forward 20% or more. And when you think about ah, hyper scale customer that has not hundreds or thousands of racks but tens of thousands of racks, a 10 or 20% improvement and effective capacity is a tremendous TCO benefit, and the reason we do that is obvious. I mean, the the the the economic paradigm that drives large scale data centers is total cost of ownership, the acquisition costs and operating costs. And if you can put more storage in a square, you know, style of data center space, you're going to generally use less power. You're gonna run it more efficiently. You're actually from an acquisition cost. You're getting a more efficient purchase of that capacity. And in doing that, our innovation, you know, we benefit from it and our customers benefit from it so that the value proposition pours. Don't storage in in capacity. Enterprise HDD is very clear. It's it's additional capacity. The exciting thing is in the SSD side of things for Z and as it actually opens up even more value proposition for the customer. Um, because SSD is have had to emulate hard drives. There's been a lot of inefficiency in complexity inside an enterprise. SSD dealing with things like garbage collection and write amplification, reducing the endurance of the device. You have to over provision. You have to insert as much as 2025 28% additional NAND bits inside the device just too allow for that extra space, that working space to deal with with delete of the you know that that are smaller than the the a block of race that that device supports. And so you have to do a lot of reading and writing of data and cleaning up it creates for a very complex environment. Z and S by mapping the zone size with the physical structure of the SSD, essentially eliminates garbage collection. It reduces over provisioning by as much as 10% are 10 x And so if you were over provisioning by 20 or 25% in an enterprise SSD and Xeon SSD, that could be, you know, one or 2%. The other thing we have to keep in mind is enterprise. SSD is typically incorporate D RAM and that D RAM is used to help manage all those dynamics that I that I just mentioned, but with a very much simpler structure where the pointers to the data can be managed without all that d ram, we can actually reduce the amount of D ram in an enterprise SSD by as much as eight X. And if you think about the bill of material of an enterprise, SSD d ram is number two on the list in terms of the most expensive bomb components. So Z and S and SSD is actually have a significant customer. Total cost of ownership impact. Um, it's it's an exciting it's an exciting standard. And now that we have the standard ratified through the Envy me working group, um, you can really accelerate the development of the software ecosystem around >>right. So let's shift gears and talk a little bit about less about the tech and more about the customers and the implementation of this. So, you know, are there you talked to kind of generally, but are there certain certain types of workloads that you're seeing in the marketplace where this is, you know, a better fit? Or is it just really the big heavy lifts? Um, where they just need more and this is better. And then secondly, within you know, these both hyper scale companies, um, as well as just regular enterprises that are also seeing their data demands grow dramatically. Are you seeing you know, that this is a solution that they want to bring in for kind of the marginal kind of next data center extension data center or their next ah, cloud region? Or are they doing you know, lift and shift and ripping stuff out? Or do they have enough? Do they have enough data growth organically? >>Then >>there's plenty of new stuff that they can. They can put in these new systems. >>Yeah, well, the large customers don't don't rip and shift. They they write their assets for a long life cycle because with the relentless growth of data. You're primarily investing to handle what's what's coming in over the transom, but we're seeing we're seeing solid adoption in SMR. As you know, we've been working on that for a number of years. We've we've got, you know, significant interest in investment co investment, our engineering and our customers engineering, adapting the the application environments. Let's take advantage of SMR. The great thing is, now that we've got the envy me, the Xeon s standard ratified now, in the envy of the working group, um, we've got a very similar and all approved now situation where we've got SMR standards that have been approved for some time in the sand and scuzzy standards. Now we've got the same thing in the envy, any standard. And that's the great thing is once a company goes through the lifts, so it's B to adapt an application file system, operating system, ecosystem to zone storage. It pretty much works seamlessly between HDD and SSD. And so it's not. It's not an incremental investment when you're switching technologies and for obviously the early adopters of these technologies are going to be the large companies who designed their own infrastructure. You have you know, mega fleets of racks of infrastructure where these efficiencies really, really make a difference in terms of how they can monetize that data, how they compete against, you know, the landscape of competitors They have, um, for companies that are totally reliant on kind of off the shelf standard applications. That adoption curve is gonna be longer, of course, because there are there are some software changes that you need to adapt to to enable zone storage. One of the things Western Digital is has done, and taking the lead on is creating a landing page for the industry with zone storage. Not Iot. It's a Web page that's actually an area where, where many companies can contribute open source tools, code validation environments, technical documentation it's not. It's not a marketeering website. It's really a website bill toe land, actual open source content that companies can and use and leverage and contribute to. To accelerate the engineering work to adapt software stacks his own storage devices on to share those things. >>Let me just follow up on that, because again you've been around for a while and get your perspective on the power of open source and you know, it used to be, you know, the the best secrets, the best I p were closely guarded and held inside. And now really, we're in an age where it's not necessarily and you know, the the brilliant minds and use cases and people out there. You know, just by definition, it's a It's a more groups of engineers, more engineers outside your building than inside your building and how that's really changed. You know, kind of the strategy in terms of development when you can leverage open source. >>Yeah, Open source clearly has has accelerated innovation across the industry in so many ways. Um, and it's ah, you know, it's the paradigm around which, you know companies have built business models and innovated on top of it. I think it's always important as a company to understand what value add, you're bringing on what value add that customers want to pay for what unmet needs and your customers are you trying to solve for and what's the best mechanism to do that? And do you want to spend your R and D recreating things or leveraging what's available and and innovating on top of it? It's all about ecosystems in the days where the single company can vertically integrate. I talked about him a complete end solution. You know those air few and far between. I think it's It's about collaboration and building ecosystems and operating within those. >>Yeah, it's it's It's such an interesting change. And one more thing again, to get your perspective, you run the data center group. But there's this little thing happening out there that we see growing in I o T Internet of things and the industrial Internet of things and edge computing. As we, you know, try to move more, compute and store and power, you know, kind of outside the pristine world of the data center and out towards where this data is being collected and processed when you've got latency issues and and in all kinds of reasons to start to shift the balance of where the computers aware that store Ah, and the reliance on the network. So when you look back from a storage perspective in your history in this industry and you start to see that basically everything is now going to be connected, generating data and and and a lot of it is even open source. I talked to somebody the other day doing, you know, kind of open source, computer vision on surveillance, you know, video. So, you know, the amount of stuff coming off of these machines is growing like crazy ways at the same time, you know, it can't all be processed at the data center. It can all be kind of shift back and then have you have a decision and then ship that information back out to. So when you sit back and look at the edge from your kind of historical perspective, what goes through your mind? What gets you excited? You know, what are some of the opportunities that you see that maybe the Lehman is not paying close enough attention to? >>Yeah, it's It's really an exciting time in storage. I get asked that question from time to time, having been in storage for more than 30 years, you know what was the most interesting time, and there's been a lot of them, but I wouldn't trade today's environment for any other in terms of just the velocity with which data is is evolving and how it's being used and where it's being used. You know that the TCO equation made describe what a data center looks like. But data locality will determine where it's located and we're excited about the edge opportunity. We see that as a pretty significant, meaningful part of the TAM. As we look out 3 to 5 years, certainly five G is driving much of that. I think just anytime you speed up the speed of the connected fabric, you're going to increase storage and increase the processing of the data. So the edge opportunity is very interesting to us. We think a lot of it is driven by low latency workloads. So the concept of envy any, um is very appropriate for that. We think in general SSD is deployed in in edge data centers defined as anywhere from a meter to a few kilometres from the source of the data. We think that's going to be a very strong paradigm. Um, the workloads you mentioned especially I O. T just machine generated data in general now I believe, has eclipse human generated data in terms of just the amount of data stored, and so we think that curve is just going to keep going in terms of machine generated data, much of that data is so well suited for zone story because it's sequential, it's sequentially written, it's captured, it's it has a very consistent and homogeneous lifecycle associated with it. So we think what's going on with with Zone storage in general and and Z and S and SMR specifically are well suited for where a lot of the data growth is happening. And certainly we're going to see a lot of that at the edge. >>Well, Phil, it's always great to talk to somebody who's been in the same industry for 30 years and is excited about today and the future on as excited as they have been throughout the whole careers. That really bodes well for you both. Well, for for Western Digital. And we'll just keep hoping the smart people that you guys have over there keep working on the software and the physics, Um, and then in the mechanical engineering to keep moving this stuff along. It's really ah, it's just amazing and just relentless. >>Yeah, it is. It is relentless. What's what's exciting to me in particular, Jeff is we've we've we've driven storage advancements, you know, largely through. As I said, a you know a number of engineering disciplines, and those are still going to be important going forward the chemistry of the physics, the electrical, the hardware capabilities. But I think, as you know, is widely recognized in the industry that it's a diminishing curve. I mean, the amount of energy, the amount of engineering, effort, investment, the cost and complexity of these products to get to that next capacity step, um, is getting more difficult, not less. And so things like zone storage where we now bring intelligent data placement to this paradigm is what I think makes this current juncture that we're at a very exciting >>right, Right. Well, it is applied ai, right. Ultimately, you're gonna have, you know, more more compute, you know, compute power. You know, driving the storage process and how that stuff is managed. And, you know, as more cycles become available and they're cheaper and ultimately compute, um gets cheaper and cheaper. You know, as you said, you guys just keep finding new ways to ah, to move the curve. And we didn't even get into the totally new material science, which is also, you know, come down the pike at some point in time. Well, >>very exciting. >>It's been great to catch up with you. I really enjoy the Western Digital story. I've been fortunate to to sit in on a couple chapters. So again, congrats to you. And, uh, we'll continue to watch and look forward to our next update. Hopefully, it won't be another four years. >>Okay. Thanks, Jeff. I really appreciate the time. All >>right. Thanks a lot. Alright. He's Phill. I'm Jeff. You're watching the Cube. Thanks for watching. We'll see you next time. Yeah, Yeah, yeah, yeah.

Published Date : Aug 11 2020

SUMMARY :

leaders all around the world. he is the SVP and general manager Data center business unit from Western Digital. Well, it's It's a hot, dry summer here. into the right slope that we see, you know, kind of ad nauseam. really interesting changes going on that I think if you think about it in a kind of way or finance or kind of of good will of how do you value this data? And if you see you know the valuation that they have compared And it's really interesting to your point that it's the ability decisions than just the data scientists you know, kind of on mahogany row. But the Western digital you have to move the decimal point, And just just for the Wikipedia page. you know, I think a lot of the industry thinks when we say that a byte scale era that It's just a buzzword. and having the hard drive sit in the in the fish bowl, um, to get off types But the reality is, you know, when we talk about SSD is structured really matters. And I think you know, obviously, prices have come down dramatically since the first introduction. and but it's been very successful, you know, the envy, any standards, bodies, very productive, kind of re chunking, if you will in the zones versus trying to get as atomic as possible? on the drive giving the read had a smaller target to read. I just love the way that that, you know, you kind of twist the lens on the problem and and on one And in doing that, our innovation, you know, we benefit from it and our customers benefit from So, you know, are there you talked to kind of generally, but are there certain certain types of workloads there's plenty of new stuff that they can. monetize that data, how they compete against, you know, the landscape of competitors They have, kind of the strategy in terms of development when you can leverage open source. it's the paradigm around which, you know companies have built business models and innovated So, you know, the amount of stuff from time to time, having been in storage for more than 30 years, you know what was the most interesting people that you guys have over there keep working on the software and the physics, Um, But I think, as you know, is widely recognized in the industry that it's a diminishing curve. material science, which is also, you know, come down the pike at some point in time. I really enjoy the Western Digital story. We'll see you next time.

ENTITIES

Entity	Category	Confidence
Jeff	PERSON	0.99+
Colorado	LOCATION	0.99+
20	QUANTITY	0.99+
Phil	PERSON	0.99+
3	QUANTITY	0.99+
Jeff Frick	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
20%	QUANTITY	0.99+
18	QUANTITY	0.99+
Phil Bullinger	PERSON	0.99+
Phill Bollinger	PERSON	0.99+
45 years	QUANTITY	0.99+
Today	DATE	0.99+
Amazons	ORGANIZATION	0.99+
10	QUANTITY	0.99+
hundreds	QUANTITY	0.99+
three pages	QUANTITY	0.99+
30 years	QUANTITY	0.99+
Cube	ORGANIZATION	0.99+
15	QUANTITY	0.99+
one	QUANTITY	0.99+
25%	QUANTITY	0.99+
2%	QUANTITY	0.99+
both feet	QUANTITY	0.99+
12 bite	QUANTITY	0.99+
thousands	QUANTITY	0.99+
Facebooks	ORGANIZATION	0.99+
Palo Alto	LOCATION	0.99+
Googles	ORGANIZATION	0.99+
2	QUANTITY	0.99+
Netflix	ORGANIZATION	0.99+
Delta	ORGANIZATION	0.99+
more than 30 years	QUANTITY	0.99+
seven	QUANTITY	0.99+
Lehman	ORGANIZATION	0.99+
10%	QUANTITY	0.99+
Cube Studios	ORGANIZATION	0.99+
8	QUANTITY	0.99+
4	QUANTITY	0.99+
5 years	QUANTITY	0.99+
Rick	PERSON	0.99+
20 terabyte	QUANTITY	0.99+
both	QUANTITY	0.99+
TAM	ORGANIZATION	0.99+
14	QUANTITY	0.98+
four years	QUANTITY	0.98+
Boston	LOCATION	0.98+
two things	QUANTITY	0.98+
over 10%	QUANTITY	0.98+
first	QUANTITY	0.98+
today	DATE	0.97+
30 terabytes	QUANTITY	0.97+
Envy	ORGANIZATION	0.97+
secondly	QUANTITY	0.97+
two components	QUANTITY	0.97+
16	QUANTITY	0.96+
TCO	ORGANIZATION	0.96+
single	QUANTITY	0.96+
One	QUANTITY	0.96+
two fixed size zones	QUANTITY	0.94+
2025	DATE	0.93+
two rack	QUANTITY	0.93+
Jeff Inherit	PERSON	0.93+
One thing	QUANTITY	0.93+
first introduction	QUANTITY	0.92+
one thing	QUANTITY	0.91+
tomorrow	DATE	0.91+
DMC	ORGANIZATION	0.91+
Phill	PERSON	0.9+
28%	QUANTITY	0.9+
one fight	QUANTITY	0.89+
Covert	ORGANIZATION	0.89+
Envy Me	ORGANIZATION	0.89+

Vara Kumar, Whatfix | CUBEConversation, November 2019

(funky music) >> Announcer: From our studios in the heart of Silicon Valley, Palo Alto, California. This is a CUBE Conversation. >> Hello, and welcome to theCUBE Studios in Palo Alto, California for another CUBE Conversation, where we go in depth with thought leaders driving innovation across the tech industry. I'm your host, Donald Klein. Today, we're here to talk about digital transformation, and the challenges many enterprises face in helping employees adopt the new applications that drive their business. To have that conversation, we're joined by Vara Kumar, CTO and co-founder of Whatfix . Vara, welcome to the show. >> Thanks, thanks Don for inviting me to the show. >> Great, so looking forward to this. Tell us a little bit about Whatfix, what you guys do and a little bit of the history here. >> Sure. Whatfix is a digital adoption platform. So essentially it overlays on top of applications and makes employees to use applications faster and better. So, we are five years old company, so we have offices in four different countries, and we have 500 customers, and 60 Fortune 500 users, including companies like Amazon, UPS, Facebook, Microsoft, Western Union, Western Digital. The users are a variety of applications like CRMs, VRPs, SCMs. >> Great, fantastic customer base, that's really good. So when you say you're a digital adoption platform that kind of provides an overlay, what specifically you're talking about? You're talking about notifications inside of an application? Tell us a little more about that. >> Sure, sure. So Whatfix helps employees through all their journey in their applications. As they onboard to the applications for the first time, Whatfix welcomes them and holds them to the application. So we provide these, what we call as flows, so these are step-by-step guidance to the users for using the applications and processes and not only within the application, Whatfix guides the users across applications that the employee is faced with, so for example-- >> So cross-application workflows, that kind of thing. >> That's correct, that's correct. For example Anaconda executive, so they have to manage opportunities in the CRM, and then to create a code they have to go to the CPQ and then to submit a purchase order, probably they will have to go to a digital workplace to maintain the code and proceed to the contract. So Whatfix can guide the users through all this and that process. >> Great, so it's not just about the challenges of learning about one particular application, it's actually learning an entire workflow that might stretch across multiple applications. >> That's accurate. >> Some sort of end to end process that gets very complex because you're moving from one interface to the next. >> That's accurate, that's accurate. >> And I guess another challenge would be that, many, in a large enterprise, right, many of the applications that come from the well-known vendors get highly customized for that particular business environment is that correct? >> That's completely true Don, so, Salesforce instance of UPS looks very different from Salesforce instance of Western Union because it's completely customized to the organization business workflows and the nature of their business. So this customization also brings in lot more adoption challenges because even though I'm an account executive who has seen Salesforce in my past experience it looks very different from my current job. So that's more challenging for the enterprises to train their employees and make them use. >> Great, understood, okay, very good. So the problem that you guys are really trying to solve is around this challenge of getting employees to adopt the new applications as part of their overall digital transformation journey, is that right? >> That's true, it's not only new applications now with the so much of cloud movement. So the applications are actually getting updated more faster as you must have heard so much about as well being used in the enterprises for the IT rollouts and IT deployments. That means changes are constant so because you are reinventing your business process as you're learning. So now employees, actually, it impacts more the employee experience because every update employees have to be on top of those. So Whatfix can help them with these updates and making sure that employees are self-served. >> Understood, okay, very good. So now, this has become really such a widespread problem across many of these large enterprises that it's now become quite a mature category of software solutions, is that right? >> That's accurate, Gartner called this category as digital adoption solutions, DAS. >> Digital adoption solutions, okay DAS. >> That's correct, and then they coined this term only a quarter back, and they published reports, in terms of seller productivity and how it will influence the digital workplace, and such kind of things. >> Great, okay, great. So this category now is becoming more widespread, and people really see these types of solutions as being key to enabling digital transformation in the enterprise, right? >> That's accurate. >> Okay great, so then what are the trends that are driving this problem, what is it that's making this such a problem area that companies really need to focus on dedicated solutions to solving? >> Yeah, so one primary is the cloud migration. Yes, more and more the companies, wants to move to the cloud, for example you no longer hear CRM being on-prem, so people are using all the cloud CRMs. So the migration to the cloud, and then the digital, overall the digital transformation of these business practices to suit this cloud migration, that'll also, it's all putting pressure on employee experience. So impacting and making sure that employees are getting used to these new applications and the constant rollouts. >> Got it, got it, okay. So with all this happening, right, more and more applications moving to the cloud, the applications themselves are evolving much faster, the interfaces are changing, and then moreover they're getting more complex, because they're getting more interconnected, right? And so you have this step by step kind of work flow that helps people navigate all of these integrated applications to actually perform a single workflow. >> That's accurate, that's accurate and at the same time given that Whatfix is on top of these applications we are learning a lot about the user, this particular user and what they are doing, what they are good at, what they're not good at, this is helping us to make our content more personalized to the users, just like Google, you search for the same keyword, I search for the same thing, we both see different results. Because it's personalizing to our taste and our knowledge and expertise. So that's exactly it's getting into. And it's not only guidance, we are also helping users to be more productive by automating certain steps. Let's say you do a certain activity every day, then Whatfix can do that automatically for you so that you're becoming more and more faster in your job. >> Interesting, that's interesting. So it's not only helping provide guidance for people moving through these applications but it's actually collecting data about how users are interfacing with it, right, and then delivering a more kind of personalized experience in terms of the guidance that it offers. >> Accurate, accurate, that's accurate. >> Great, so that's really kind of, I guess that would really be the main kind of area of innovation I would imagine for a system like this, right, the ability to capture data about how users are interfacing with the application and then provide recommendations on how to do it better. >> Yes, that's definitely one of the area there's several reasons why customers choose us. We believe in the concept called adoption everywhere, so that adoption everywhere, that means employee need not be on the application all the time, they may want to interact with the application when they're outside of the application. So be, maybe you're on the wiki, you're looking for something then you wanted interface with application, so Whatfix is present across all touchpoints wherever the employee may decide and guide them to the application and help them use the application. And second, we are very easy to deploy and maintain, so we invested heavily on this, because we realize that we don't have to, we shouldn't be providing a platform which is technically more complex for the business guys to create these kind of process flows, we kept them very low tech, and our authoring environment is very easy for them to use and maintain. And then we are, we are very open in terms of how our API is just like Salesforce, which is very open in terms of integrations because we do understand that enterprises are, wants more of interconnected applications so, our allowed APIs are open and we work well with the enterprise ecosystem. And our customer success is highly regarded, so we are best among in the software vendors in terms of the highest customer satisfaction. And we care a lot about real user privacy and security so we don't really collect PII information for the recommendations and personalizations as I spoke about. >> Okay, very good. And so then the, tell us a little bit more about which kind of applications are you guys finding, which categories of applications really kind of benefit most from this kind of guided, walkthrough capability? >> Sure, so the applications are widespread so but more commonly people use us on CRMs, ERPs, and SCMs, and digital workplaces. These are the kind of applications where customers commonly use us on. >> Okay, so ERP and CRM would be the kind of core, would you say? >> Yeah, CRM, ERP and SCM. So these will be the core I would say. And, it's not only the applications that the customers are purchasing from outside, but Whatfix can work on any application that is internally built, any applications that are their IT team is customizing, Whatfix can work on those. >> Great so, bespoke applications developed internally inside companies are equally suitable for this, as are the the packaged applications they might be customizing for their business processes. >> That's correct. >> Okay, that's great. So I just, on the kind of conclude here, let's talk a little bit about maybe some of the customer success stories, that you guys have had. And I'm not asking you to necessarily name names, but maybe talk about some of the areas where you've seen some real value creation from implementing a system like this. >> Sure, sure, our customers have seen that our onboarding time of employees into the applications have reduced one third, because of using Whatfix, because now employees are learning in the flow of work, you no longer have to train them to use the applications, and we've also seen that organizations telling that their content creation times and their amount of planning preparation time has reduced 85%, because of our easy to use authoring environment, and easy to maintain authoring environment. And we've also seen organizations have reduced internal support tickets by 60%. Yeah, so we have also seen that the overall productivity of the employees increased by 35% because they're able to find things and be able to self serve, and do more faster in the applications. >> Got it, so the old days of sitting down and sort of expecting the employees to read the user manual page by page, right, before they dive in, is kind of gone now right? >> It's gone. >> What people want to see is they want to get into the application and then they want to be able to be guided through what they need to do to solve their particular problem. And they want it done in real time. >> That's true, that's right Don. >> And even better they have that whole guide through maybe customized to their particular problem. >> That's accurate Don, that's the digital option solutions for you. >> Great, well that sounds like a fantastic solution, understand the role it plays fantastic, I think you guys are doing great work. So, want to maybe just kind of touch on this last point, where do you see this kind of industry going in the future? This is fantastic innovation but how do you see this trending as digital transformation becomes more widespread? >> Yeah, sure. So adoption as the problem statement more and more organizations understand, and more and more software vendors understand today, we are at Dreamforce now, so if you see the number of sessions around adoption is phenomenal. So, when the Gartner called out that only six to 7% of our enterprises have adopted to the solutions like Whatfix, so there has to be more awareness, some people, enterprises understand that adoption is a problem, but they are not aware that there are solutions that exist to tackle that problem. So we see that's going to be the future for us as we go forward, having more and more enterprises adopt these kind of solutions. >> Great, okay, well if you happen to be at Dreamforce, folks, stop by and talk to Whatfix. So, Vara Kumar thank you for coming on TheCUBE. So thanks for joining us for another CUBE Conversation, I'm Donald Klein and we'll see you next time. (funky music)

Published Date : Nov 22 2019

SUMMARY :

in the heart of Silicon Valley, Palo Alto, California. and the challenges many enterprises face and a little bit of the history here. and makes employees to use applications faster and better. So when you say you're a digital adoption platform so these are step-by-step guidance to the users so they have to manage opportunities in the CRM, the challenges of learning about one particular application, Some sort of end to end process that gets very complex So that's more challenging for the enterprises to train So the problem that you guys are really So the applications are actually getting updated more faster So now, this has become really That's accurate, Gartner called this category influence the digital workplace, and such kind of things. of solutions as being key to So the migration to the cloud, and then the digital, more and more applications moving to the cloud, That's accurate, that's accurate and at the same time of the guidance that it offers. the ability to capture data about how users are interfacing the employee may decide and guide them to the application And so then the, tell us a little bit more Sure, so the applications are widespread And, it's not only the applications that the customers they might be customizing for their business processes. So I just, on the kind of conclude here, and do more faster in the applications. they need to do to solve their particular problem. guide through maybe customized to their particular problem. That's accurate Don, that's the I think you guys are doing great work. So adoption as the problem statement to be at Dreamforce, folks, stop by and talk to Whatfix.

ENTITIES

Entity	Category	Confidence
Donald Klein	PERSON	0.99+
Amazon	ORGANIZATION	0.99+
Vara Kumar	PERSON	0.99+
Microsoft	ORGANIZATION	0.99+
Facebook	ORGANIZATION	0.99+
Western Union	ORGANIZATION	0.99+
Western Digital	ORGANIZATION	0.99+
UPS	ORGANIZATION	0.99+
November 2019	DATE	0.99+
85%	QUANTITY	0.99+
35%	QUANTITY	0.99+
Vara	PERSON	0.99+
Gartner	ORGANIZATION	0.99+
Dreamforce	ORGANIZATION	0.99+
60%	QUANTITY	0.99+
Whatfix	ORGANIZATION	0.99+
Palo Alto, California	LOCATION	0.99+
Don	PERSON	0.99+
Anaconda	ORGANIZATION	0.99+
first time	QUANTITY	0.99+
Today	DATE	0.98+
second	QUANTITY	0.98+
Google	ORGANIZATION	0.98+
one third	QUANTITY	0.98+
500 customers	QUANTITY	0.98+
six	QUANTITY	0.98+
one	QUANTITY	0.97+
7%	QUANTITY	0.96+
both	QUANTITY	0.95+
Silicon Valley, Palo Alto, California	LOCATION	0.94+
four different countries	QUANTITY	0.93+
today	DATE	0.93+
CUBE Conversation	EVENT	0.92+
a quarter back	DATE	0.92+
five years old	QUANTITY	0.88+
CTO	PERSON	0.86+
Salesforce	ORGANIZATION	0.86+
Salesforce	TITLE	0.85+
single workflow	QUANTITY	0.84+
theCUBE Studios	ORGANIZATION	0.78+
one primary	QUANTITY	0.76+
500 users	QUANTITY	0.7+
application	QUANTITY	0.69+
60 Fortune	QUANTITY	0.6+
CUBEConversation	EVENT	0.59+
Whatfix	TITLE	0.58+
TheCUBE	ORGANIZATION	0.51+

Breaking Analysis: Dell Technologies Financial Meeting Takeaways

>> From the SiliconANGLE Media Office in Boston, Massachusetts, it's theCUBE! Now here's your host, Dave Vellante. >> Hi, everybody, welcome to this Cube Insights, powered by ETR. In this breaking analysis I want to talk to you about what I learned this week at Dell Technology's financial analyst meeting in New York. They gathered all the financial analysts, Rob Williams hosted it, he's the head of IR, Michael Dell of course was there. They had Dennis Hoffman who is the head of strategic planning, Jeff Clarke who basically runs the business and Tom Sweet, of course, who was the star of the show, the CFO, all the analysts want to see him. Dell laid out its longterm goals, it provided much clearer understanding of its strategic direction, basically focused on three areas. Dell believes that IT is getting more complex, we know that, they want to capitalize on that by simplifying IT. We'll talk about that. And then they want to position for the wave of digital transformations that are coming and they also believe, Dell believes, that it can capitalize on the consolidation trend, consolidating vendors, so I'll talk about each of those. And so let me bring up the first slide, Alex, if you would. The takeaways from the Dell financial analyst meeting. Let me share with you the overall framework that Tom Sweet laid out. And I have to say, the messaging was very consistent, these guys were very well-prepared. I think Dell is, from a management perspective, very well-run company. They're targeting three to 5% growth on what they're saying is a 4% GDP forecast. Or sorry, 4%, I have GDP here, it's really 4% industry growth. GDP's a little lower than that obviously. So this is IDC data, Gartner data, 4% industry growth. So that's an error on my part, I apologize. The strategies to grow relative to their competition. So grow share on a relative basis. So whatever the market does, again, not GDP, but whatever the market does, Dell wants to grow faster than the market. So it wants to gain share, that's its primary metric. From there they want to grow operating income and they want to grow that faster than revenue, that's going to throw off cash. And then they're going to also continue to delever the balance sheet. I think they paid down 17 billion in debt since the EMC acquisition. They want to get to a two X debt to EBITA ratio within 18 months. And what they're saying is, you know, they talked about, Tom Sweet talked about this consistent march toward investment-grade rating. They've been talkin' about that for awhile. He made the comment, we don't need to have a triple A rating but we want to get to the point where we can reduce our interest expense, and that will, 'cause they'll drop right into the bottom line. So they talked about these various levers that they can turn, some of them under the P and L, gaining share, some are their operating structure and their organizational structure, and one big one is obviously their debt structure. The other key issue here is will this cut the liquidity discount that Dell faces? What do I mean by that? Well, VMware has about a $60 billion valuation. Dell owns about 80% of VMware, which would equate to 48 billion. But if you look at Dell's market cap, it's only 37 billion. So it essentially says that Dell's core business is worth minus 11 billion. We used to talk about this when EMC owned VMware. Its core business only comprised about 40% of the overall value of the company, in this case because of the high debt, Dell has a negative value. And it's not just the high debt. Michael Dell has control over the voting shares, it's essentially a conglomerate structure, there's very high debt, and it's a relatively low margin business, notwithstanding VMware. And so as a result, Dell trades at a discount relative to what you would think it should trade at, given its prominence in the market, $92 billion company, the leader in every category under the sun. So that's the big question is can Dell turn these levers, drop EBITA or cash to the bottom line, affect operating income, and then ultimately pay down its debt and affect that discount that it trades at? Okay, bring up, if you would, Alex, the next slide. Now I want to share with you the takeaways from the Dell line of business focus. This really was Jeff Clarke's presentations that I'm going to draw from. Servers, we know, they're softer demand, but the key there is they're really faced tough compares. Last year, Dell's server business grew like crazy. So this year the comparisons are lessened. But there's less spending on servers. I'll share with you some of the ETR data. Storage, they call it holding serve, you saw last quarter I did an analysis, I took the ETR data and the income statement, it showed Pure was gaining share at like 22% growth from the income statement standpoint. Dell was 0% growth but is actually growing faster than its competitors. With the exception of Pure. It's growing faster than the market. So Dell actually gained share with 0% growth. Dell's really focused on consolidating the portfolio. They've cut the portfolio down from 80, I think actually the right number is 88 products, down to 20 by May of 2020. They've got some new mid-range coming, they've just refreshed their data protection portfolio, so again, by May of next year, by Dell Technologies World they'll have a much, much more simplified portfolio. And they're gaining back share. They've refocused on the storage business. You might recall after the acquisition, EMC was kind of a mess. It was losing share before the acquisition, it was so distracted with all the Elliott Management stuff goin' on. And kind of took its eye off the ball, and then after the acquisition it took awhile for them to get their act together. They gained back about 375 basis points in the last 18 months. Remember a basis point is 1/100th of 1%. So gaining share and their consistent focus on trying to do that. Their PC business, which is actually doin' quite well, is focused on the commercial segment and focused on higher margins. They made the statement that the PCs are kind of undersupply right now so it's helping margins. There's a big focus in Jeff Clarke's organization on VMware integration. To me this makes a lot of sense. To the extent that you can take the VMware platform and make Dell hardware run VMware better, that's something that is an advantage for Dell, obviously. And at the same time, VMware has to walk the fine line with the ecosystem. But certainly it's earned the presence in the market now that it can basically do what I just said, tightly integrate with Dell and at the same time serve the ecosystem, 'cause frankly, the ecosystem has no choice. It must serve VMware customers. The strategy, essentially, is to, as I say, capitalize on vendor consolidation, leverage value across the portfolio, so whether it's pivotal, VMware integration, the security portfolio, try to leverage that and then differentiate with scale. And Dell really has the number one supply chain in the tech business. Something that Dave Donatelli at HP, when he was at HP, used to talk about. HPE doesn't really talk about that supply chain advantage anymore 'cause essentially it doesn't have it. Dell does. So Jeff Clarke's reorganization, he came in, he streamlined the organization, really from the focus on R and D to product to collaboration across the organization and the VMware integration. I actually was quite impressed with when I first met Jeff Clarke I guess two years ago now, what he and the organization have accomplished since then. No BS kind of person. And you can see it's starting to take effect. So we'll keep an eye on that. The next slide I want to show you, I want to bring in the ETR data. We've been sharing with you the ETR spending intention surveys for the last couple of weeks and months. ETR, enterprise technology research, they have a data platform that comprises 4,500 practitioners that share spending data with them. CIOs, IT managers, et cetera. What I'm showing here is a cut off of the server sector. So I'm going to drill down into server and storage. So these are spending intentions from the July survey asking about the second half of 2019 relative to the first half of 2019. And this is a drill-down into the giant public and private firms. Why do I do that? Because in meeting the ETR, this is the best indicator. So it's big, big public companies and big private companies. Think Uber. Private companies that spend a ton of dough on IT. UPS before it went public, for example. So those companies are in here. And they're, according to ETR, the best indicators. What this chart shows, so the bars show, and I've shared this with you a number of times, the lime green is we're adding, we're new to this platform, we're new adoption. The evergreen is we're spending more, the gray is we're spending the same, the light red or pink is we're spending less, and the dark red is we're leaving the platform. So if you subtract the red from the green you get what's called a net score, and that's that blue line. And this is the overall server spending intentions from that July survey. The end is about 525 respondents out of the 4,500. And this is, again, those that just answered the question on server. So you can see the net score on server spend is dropping. And you can see the market share on server is dropping. The takeaway here is that servers, as a percentage of overall IT spend, are on a downward slope, and have been for quite some time. Back to the January '16 survey. Okay, so that's going to serve us. Let's take a look at the same data for storage. So if, Alex, if you bring up the storage sector slide, You can see kind of a similar trend. And I would argue what's happening here, a couple of things. You've got the CLOB effect, I'll talk about that some more, and you've also got, in this case, the flash, all-flash array effect. What happened was you had all-flash arrays and flash come into the data center, and that gave performance a huge headroom. Remember, spinning disk was the last bastion of mechanical movement and it was the main bottleneck in terms of overall application performance. IO was the problem. Well you put a bunch of flash into the system and it gives a lot of headroom. People used to over-provision capacity just for performance reasons. So flash has had the effect of customers saying, hey, my performance is good, I don't need to over-provision anymore, I don't need to buy so much. So that combined with cloud, I think, has put down the pressure on the storage business as well. Now the next slide, Alex, that I want you to bring up is the vendor net scores, the server spending intentions. And what I've done is I've highlighted Dell EMC. Now what's happening here in the slide, and I realize it's an eye chart, but basically where you want to be in this chart is in the left-hand side. What it shows is the spending intentions and the momentum from the October '18, which is the gray, the April '19, which is the blue, and then the July '19 which is the most recent one. Again, the end is 525 in the servers for the July '19 survey. And you can see Dell's kind of in the middle of the pack. You'd love to be in the left-hand side, you know, Docker, Microsoft, VMware, Intel, Ubuntu. And you don't want to be on the right-hand side, you know, Fujitsu, IBM, is sort of below the line. Dell's kind of in the middle there, Dell EMC. The next slide I want to show you is that same slide for storage. And again, you can see here is that on-- So this is vendor net scores, the storage spending intentions. On the left-hand side it's all the high growth companies. Rubrik, Cohesity, Nutanix, Pure, VMware with vSAN, Veeam. You see Dell EMC's VxRail. On the right-hand side, you see the guys that are losing momentum. Veritas, Iron Mountain, Barracuda, HitachiHDS, Fusion-io still comes up in the survey after the acquisition by Western Digital. Again, you see Dell EMC kind of holding serve in the middle there. Not great, not bad. Okay, so that's kind of just some other ETR data that I wanted to share. All right, next thing we're going to talk about is the macros market summary. And Alex, I've got some bullet points on this, so if you bring up that slide, let me talk about that a little bit. So five points here. First, cloud continues to eat away at on-prem, despite all this talk about repatriation, which I know does happen. People try to throw everything to the cloud and they go, whoa! Look at my Amazon bill, yeah, I get that. That's at the margin. The main trend is that cloud continues to grow. That whole repatriation thing is not moving the on-prem market. On-prem is kind of steady eddy. Storage is still working through that AFA injection. Got a lot of headroom from performance standpoint. So people don't need to buy as much as they used to because you had that step function in performance. Now eventually the market will catch up, all this digital transformation is happening, all this data is flowing through the system and it will catch up, and the storage market is elastic. As NAN prices fall, people will, I predict, will buy more storage. But there's been somewhat of a lull in the overall storage market. It's not a great market right now, frankly, at the macro level. Now ETR does these surveys on a quarterly basis. They're just about to release the October survey, and they put out a little glimpse on Friday about this survey. And I'll share some bullet points there. Overall IT spending clearly is softening. We kind of know that, everybody kind of realizes that. Here's the nuance. New adoptions are reverting to pre-2018 levels, and the replacements are rising. What does this mean? So the number of respondents that said, oh yes, we're adopting this platform for the first time is declining, and the replacements are actually accelerating. Why is that? Well I was at ETR last week and we were talking about this and one of the theories, and I think it's a good one, is that 2016, 2017 was kind of experimentation around digital transformation. 2018, people started to put things into production or closer to production, they were running systems in parallel, and now they're making their bets, they're saying, hey, this test worked, let's put this heavy into production in 2019, and now we're going to start replacing. So we're not going to adopt as much stuff 'cause we're not doing as much experimentation. We're going to now focus and narrow in on those things that are going to drive our business, and we're going to replace those things that aren't going to drive our business. We're going to start unplugging them. So that's some of what's happening. Another big trend is Microsoft. Microsoft is extending its presence throughout. They're goin' after collaboration, you saw the impact that they had on Slack and Slack stock recently. So Slack Box, Dropbox, are kind of exposed there. They're goin' after security, they've just announced a SIM product. So Splunk and IBM, they're kind of goin' after that base. The application performance management vendors. For instance, New Relic. Microsoft goin' after them. Obviously they got a huge presence in cloud. Their Windows 10 cycle is a little slower this time around, but they've got other businesses that are really starting to click. So Microsoft is one of the few vendors that really is showing accelerated spending momentum in the ETR data. Financial services and telcos, which are always leading spender indicators, are actually very weak right now. That's having a spillover effect into Europe, which is over-banked, if I can use that term. Banking heavy, if you will. So right now it's not a pretty picture, but it's not a disaster. I don't want to necessarily suggest this as like going back to 2007, 2008, it's not. It's really just a matter of things are softening and it's, you know, maybe taking a little breath. Okay, so let me summarize the meeting overall. Again, it was a very well-run meeting. Started at 9:00, ended at 12:00, bagged lunch, go home. Nice and crisp. So these guys are very well-prepared. I think, again, Dell is a extremely well-managed company. They laid out a much clearer vision for Wall Street of its strategy, where it's headed. As they say, they're going after IT complexity. I want to make a comment on this. You think about Legacy EMC. Legacy EMC was not the company that you would expect to deal with complexity. In fact, they were the culprit of complexity. One of the things that Jeff Clarke did when he came in, he said, this portfolio's too complex, needs to be simplified. Joe Tucci used to say, overlap is better than gaps. Jeff Clarke said we got too much overlap. We don't have a lot of gaps so let's streamline that portfolio. Taking advantage of vendor consolidation, this is an interesting one. Ever since I've been in this business, which has been quite a long time now, I've been hearing that buyers want to consolidate the number of vendors that they have. They've really not succeeded in doing that. Now can they do that now 'cause there are less vendors? Well, in a sense, yes, there are less sort of on-prem big vendors. EMC's no longer in the market, you don't have companies like Sun and Digital anymore, Compact is gone. HP split in two, but still. You're not seeing a huge number of new vendors, at scale, come into the market. Except you've got AWS and Google as new players there. So I think that injects sort of a new dynamic that a lot of people like to put cloud aside and kind of ignore it and talk about the old on-prem business, but I think that you're going to see a lot of experimentations and workload ins and outs, particularly with AWS and Google and of course Azure, which is in itself, their cloud is almost a separate force. So we'll see how that shakes up. As I say, servers right now, Dell's got a very tough compare. I think Dell will be fine in the server space. Storage, it's all about simplifying the portfolio, they've got a refreshed portfolio focused on regaining share. They've rebranded everything Power, so their whole line is going to be Power by, if it's not already, by May of next year, Dell Technologies World. It's a much more scalable portfolio. And I think Dell's got a lot of valuation levers. They're a $92 billion company, they've got their current operations, their current P and L, their share gains, their cross-company synergies, particularly with VMware, they can expand their TAM into cloud with partnerships like they're doing with AWS and others, Google, Microsoft. The Edge is a TAM expansion opportunity to them. And also corporate structure. You've seen them. VMware acquired Pivotal. They're cleaning that up. I'm sure they could potentially make some other moves. Secureworks is out there, for example. Maybe they'll do some things with RSA. So they got that knob to turn and they can delever. Paying down the debt to the extent that they can get back to investment grade, that will lower their interest rates, that'll drop right to the bottom line, and they'll be able to reinvest that. And Tom Sweet said, within 18 months, we'll be able to get there with that two X ratio relative to EBITA, and that's when they're going to start having conversations with the rating agencies to talk about you know, hey, maybe we can get a better rating and lower our interest expense. Bottom line, did Wall Street buy the story? Yes. But I don't think it's going to necessarily change anything in the near term. This is a show me from Missouri, prove it, execute, and then I think Dell will get rewarded. Okay, so this is Dave Vellante, thanks for watching this Cube Insights powered by ETR. We'll see ya next time. (electronic music)

Published Date : Sep 27 2019

SUMMARY :

From the SiliconANGLE Media Office And at the same time, VMware has to walk the fine line

ENTITIES

Entity	Category	Confidence
Jeff Clarke	PERSON	0.99+
Rob Williams	PERSON	0.99+
Tom Sweet	PERSON	0.99+
Michael Dell	PERSON	0.99+
IBM	ORGANIZATION	0.99+
Dave Vellante	PERSON	0.99+
VMware	ORGANIZATION	0.99+
Fujitsu	ORGANIZATION	0.99+
Google	ORGANIZATION	0.99+
Dave Donatelli	PERSON	0.99+
Microsoft	ORGANIZATION	0.99+
Western Digital	ORGANIZATION	0.99+
Dennis Hoffman	PERSON	0.99+
HPE	ORGANIZATION	0.99+
EMC	ORGANIZATION	0.99+
AWS	ORGANIZATION	0.99+
Alex	PERSON	0.99+
Dell	ORGANIZATION	0.99+
October	DATE	0.99+
2019	DATE	0.99+
Uber	ORGANIZATION	0.99+
Friday	DATE	0.99+
New York	LOCATION	0.99+
October '18	DATE	0.99+
4,500	QUANTITY	0.99+
Splunk	ORGANIZATION	0.99+
April '19	DATE	0.99+
4,500 practitioners	QUANTITY	0.99+
Legacy EMC	ORGANIZATION	0.99+
17 billion	QUANTITY	0.99+
$92 billion	QUANTITY	0.99+
Sun	ORGANIZATION	0.99+
July '19	DATE	0.99+
2016	DATE	0.99+
2017	DATE	0.99+
July	DATE	0.99+
Joe Tucci	PERSON	0.99+
Last year	DATE	0.99+
Gartner	ORGANIZATION	0.99+
Iron Mountain	ORGANIZATION	0.99+
Intel	ORGANIZATION	0.99+
Veritas	ORGANIZATION	0.99+
4%	QUANTITY	0.99+
HitachiHDS	ORGANIZATION	0.99+
Nutanix	ORGANIZATION	0.99+
Dropbox	ORGANIZATION	0.99+
May	DATE	0.99+
2007	DATE	0.99+
Amazon	ORGANIZATION	0.99+
HP	ORGANIZATION	0.99+
last week	DATE	0.99+
88 products	QUANTITY	0.99+
2018	DATE	0.99+

Hend Alhinnawi, Humanitarian Tracker | AWS Imagine Nonprofit 2019

>> From Seattle Washington, it's theCUBE, covering AWS Imagine, nonprofit. Brought to you by Amazon Web Services. >> Hey welcome back everybody, Jeff Frick here with theCUBE. We're actually on the waterfront in Seattle at the AWS Imagine nonprofit event. We were here a couple weeks ago for the AWS Imagine education event. This is really about nonprofits and solving big, big problems. So Dave Levy and team have you know dedicated to some of these big problems. And one of the big problems in the world is human trafficking, and problems that people are encountering and all kinds of nasty situations all over the world. And we're really excited to have someone who's tackling that problem, and really trying to bring a voice to those people that wouldn't otherwise have a voice. And she's Hend Alhinnawai, she's the CEO of Humanitarian Tracker. Hend, good to see you. >> Thank you Jeff, good to be here. >> Absolutely. So before we jump into it, impressions on this event? >> Wonderful event bringing together technologists, people in nonprofits, really creating synergies for people to collaborate and talk to each other and network and learn how they can advance their organizations. >> Such important work. >> Yes. >> So give us kind of the background on what you're up to, what Humanitarian Tracker's all about. >> So Humanitarian Tracker's a nonprofit forum. It was created to connect and empower citizens using innovation and technology, but specifically for humanitarian events. We were among the first to combine crowdsourced reports with data mining and artificial intelligence and apply them to humanitarian disasters, conflicts, human rights violations, disease outbreak. All the way to tracking the UN's Sustainable Development Goals. Really giving a holistic view of what's happening. >> It's interesting, you know, it's probably like the middle eastern spring, I can't remember the exact term that people use, where it was kind of the first use of regular people using their mobile phones to kind of grab a ground swell of action. You're not looking at the politics specifically, you're looking more at humanitarian disasters. But pretty amazing kind of what a connected phone represents to anyone anywhere in the world now to communicate what's happening to them. To share that story. We really didn't have anything like that before. To get that personal event on the ground. >> No it's really a new way of consuming, creating and consuming information. So the cell phone has really given people on the ground a chance to tell their own story. But it's not enough. If you have an event that happens to you. Something happens to you. And you record it, it stops there. But the unique thing with Humanitarian Tracker is it gives people that forum to show the world and tell them what's happening to and around them. >> Right, but it's not just about the individual. And what you guys are doing is using cutting edge technology, obviously you're here as part of the AWS event. In terms of machine-learning and big data to grab a large number of these reported events and distill it into more of an overarching view of what is actually happening on the ground. How did you do that, where did you get that vision, how are you executing that? >> Well, we're all about empowering the citizen. And in our line of work we deal with a lot of data, a lot of information, most of it is unstructured, most of it is crowdsourced. So we use machine-learning to help us extract important details. Information on time. Event location, what is happening. And at the same time we really cared that this reporter, stays anonymous for their own safety. We, privacy and security is utmost importance to us. So that's always our focus. So in that space, we de-identify them. We take out any information that could be identifiable, that could lead to their arrest, or could lead to someone identifying that it was them that reported. >> And how do you get this information to the people that are suffering this activity ground? How do they know about you, how do they know that you are anonymizing their information so there's not going to be repercussions if they report. You know, how do, kind of I guess your go-to-market, to steal a business terms, in making sure that people know this tool's available for help? >> It depends on the situation. For example in the conflict situation, we rolled it out, and we kept it low key for awhile. Because we didn't want government attacks, we didn't want people to be arrested, or to be tried. So we rolled it out. And it was word of mouth that spreads. And people started submitting supports. Actually the first project we did with conflict, we weren't sure if we were going to get one report, zero reports. The first week we got nothing. And then slowly as people learned about it they started submitting their reports. And we see our job as really elevating the otherwise marginalized voice. So you submit a report to us, we then take it. We verify it. We make it public. And that, we welcome, we encourage, we want people to consume it. Whether you're a student, whether you're a journalist, whether you're a government, whether you work in a nonprofit, the UN. It's been used to address human rights violations, it's been used to identify humanitarian hotspots. The data's phenomenal, and what you get from it. It's not just collecting data. We're not just about collecting the data. We want to make sure it's meaningful, and we want to derive insights. So we want to know what is the data actually telling us? >> Right, right. So just to be clear for people that don't know, so you're making that data available, you're cleansing the data, you're running some AI on it to try to get a bigger picture, and anyone with a login, any kind of journalist can now access that data in support of whatever issue or topic or story they're chasing? >> That's it Jeff. >> That's phenomenal. And just kind of size and scope. You've been at this I think you said since 2011. You know kind of how many active, activities, crisis, I don't know, what the definition is of a bucket of these problems. Are you tracking historically at a given point in time? Give us some kind of basic sizing type of dimensions. >> It really ranges, because it could, when we were tracking conflict for example, we were really focused on one area, and the surrounding countries. Because you had refugee population, you had displacement, you had all sorts of issues. But it could be anywhere from five projects, it just depends. And we want to make sure that each project we're taking on we're giving it our full attention, full scope. And I like to run the organization like a two-team pizza team. And so I don't take on more than I could handle. >> Right, right. So then how did it morph from the conflict to the Global Sustainability Goal? So we've worked with Western Digital, they're doing a lot of work, ASP's doing a lot of work on kind of these global sustainability goals. How did you get involved in that, and how did the two kind of dovetail together? >> So the elasticity of the cloud has helped our operation scale tremendously. And in 2016 we were selected as a top 10 global innovation, that could be applied to the Sustainable Development Goals, and-- >> So they found you, the UN find you, or did you get nominated? How did that happen? >> We were nominated, and from over 1,000 solutions we were chosen. >> Congratulations. >> Thank you. And we were showcased at the Solutions Summit which is hosted at the United Nations. And just based on that experience of meeting people that were doing really cool things in their respective communities, we launched the Global Action Mosaic. Because we wanted to create one place where people that are doing projects in their communities could submit it, and have it showcased. And the goals are not only to crowdsource the SGD's, but to also be a part of the effort to track what's happening. Who's doing what where, make it easy for people to search say, Jeff you decided to get involved in a project with education. You can go onto our Global Action Mosaic, search projects on education in your community or in other parts of the world and then get involved in it. So it's really creating a centralized place where people can get information on the global goals. >> Awesome. So that's pretty much the Global Action Mosaic. It's pretty much focused on the UN global goals versus your core efforts around the Humanitarian Tracker. >> Yes. >> That's great. So we're here at AWS. Have you always been on AWS? Is this something new? How does being on kind of the AWS infrastructure help you do your mission better? >> We are, we've been partners in running AWS since we actually started. >> Since the beginning. >> Yes we have Yusheheedi as one of our partners, development partners, AWS. And because one of the core, one of the most important things to us is privacy and security, we want to make sure that whatever data is being handled and received is stored securely. >> Right, right. >> And that information transmitted, handled is also being done so in a secure way. Like I mentioned, the elasticity of the cloud has helped us scale our mission tremendously. It's affordable, we've been able to us it, we've learned their machine-learning stock to de-identify some of the data that comes in. So we're firm believers that AWS is essential to how we run our operation. >> Because do the individual conflicts kind of grow and shrink over time? Do you see it's really a collection of kind of firing up hotspots and then turning down versus one long, sustained, relatively flat, from kind of a utilization and capacity point of view? >> Yeah, no it definitely, it flares up and you'll have like a year, months, weeks sometimes where it's just focused on one area. But one of the things we focus on, it's not just. So what is the data actually telling us? So say you're focusing on point A. But just down the street in location B there is a dire humanitarian emergency that needs to be addressed. The crowdsourced reports, combined with the data mining and the AI, helps us identify those hotspots. So everybody could be focused here, but there could be an emergency down the street that needs to be addressed as well. It just depends. >> And do you have your own data scientists or do you, do other people take your data and run it through their own processes to try to find some of these insights? >> We have both. >> You have both. >> Yeah. >> So what's been the biggest surprise when you anonymize and aggregate the data around some of these hotspots? Is there a particular pattern that you see over and over? Is there some insight, that now that you've seen so much of it, from kind of the (muffled speaking) that you can share and reflect on? >> I think it' very unique to each project to do. But there is one thing that I strongly support, that I don't see enough of, and that's the sharing of data within the organizations. And so, for example just getting to that culture where sharing your data between organizations is encouraged and actually done. Could help create a, create a pool of knowledge. So, for example we worked with 13 different organizations that were all tackling humanitarian events. The same one, in Syria. And the 13 did not share data and did not talk to each other. And so we found that for example, they were all focused on one area. When just a few miles down, there was a need that wasn't being addressed. But because they don't share information, they had no idea. >> Right. >> It was only when we were able to take a look at it, kind of from the, from an overarching view, looking all their data, we were able to say you know, it would be helpful, it would actually, you could save on resources, and less time, and less effort, and you guys are tackling a small funding pool to begin with. If you shared information and tackled different things, instead of focusing on one area, because you don't know what the other guys doing. >> And were they using crowdsource data, is there source data, or were they just trying to collect their own from the field? >> They were collecting their own. >> So I assume that the depth, and the richness, and the broadness of data is nothing like you're collecting. >> Well you get a different kind of, you get different kind of information when the individuals actually telling you what's happening versus you asking a very direct question like, "Are you healthy? Yes or No?". Whereas you give them the chance, they might tell you that they haven't eaten, and their diabetic and you know, give you other pieces of information. Where they're living, are they refugees? Are they healthy? Are they not healthy? Do they go to school? Do their kids go to school? How many kids they have? Are they a female-run household? All this information could help guide development in the proper way. >> Right, right. All right. So give you the final word, how should people get involved if they want to help? >> You can go to humanitariantracker.org if you want to volunteer with us. And if you're doing a project that is related to the UN's Sustainable Development Goals, I would like you to go to globalactionmosaic.org, and map it there, and be part of our community. >> So Hend, thank you for taking a few minutes to share your story, and for all the good work that you're doing out there. >> Thank you Jeff it was a pleasure. >> All right, she's Hend, I'm Jeff, you're watching theCUBE, we're at AWS Imagine nonprofit. Thanks for watching we'll see you next time. (techno music)

Published Date : Aug 14 2019

SUMMARY :

Brought to you by Amazon Web Services. So Dave Levy and team have you know dedicated So before we jump into it, impressions on this event? for people to collaborate and talk to each other So give us kind of the background on what you're up to, and apply them to humanitarian disasters, conflicts, To get that personal event on the ground. is it gives people that forum to show the world And what you guys are doing And at the same time we really cared that this reporter, And how do you get this information So we want to know what is the data actually telling us? So just to be clear for people that don't know, And just kind of size and scope. And I like to run the organization and how did the two kind of dovetail together? So the elasticity of the cloud and from over 1,000 solutions we were chosen. And the goals are not only to crowdsource the SGD's, So that's pretty much the Global Action Mosaic. How does being on kind of the AWS infrastructure since we actually started. one of the most important things to us to how we run our operation. But one of the things we focus on, it's not just. And the 13 did not share data looking all their data, we were able to say you know, So I assume that the depth, and the richness, and their diabetic and you know, So give you the final word, that is related to the UN's Sustainable Development Goals, and for all the good work that you're doing out there. Thanks for watching we'll see you next time.

ENTITIES

Entity	Category	Confidence
Jeff	PERSON	0.99+
Hend Alhinnawi	PERSON	0.99+
Hend Alhinnawai	PERSON	0.99+
Hend	PERSON	0.99+
Amazon Web Services	ORGANIZATION	0.99+
Jeff Frick	PERSON	0.99+
Dave Levy	PERSON	0.99+
Western Digital	ORGANIZATION	0.99+
Seattle	LOCATION	0.99+
AWS	ORGANIZATION	0.99+
2016	DATE	0.99+
five projects	QUANTITY	0.99+
zero reports	QUANTITY	0.99+
2011	DATE	0.99+
humanitariantracker.org	OTHER	0.99+
13	QUANTITY	0.99+
Syria	LOCATION	0.99+
Seattle Washington	LOCATION	0.99+
one report	QUANTITY	0.99+
two	QUANTITY	0.99+
both	QUANTITY	0.99+
each project	QUANTITY	0.99+
two-team	QUANTITY	0.99+
first	QUANTITY	0.99+
over 1,000 solutions	QUANTITY	0.98+
Humanitarian Tracker	ORGANIZATION	0.98+
UN	ORGANIZATION	0.98+
one	QUANTITY	0.98+
one thing	QUANTITY	0.98+
globalactionmosaic.org	OTHER	0.98+
13 different organizations	QUANTITY	0.98+
one area	QUANTITY	0.97+
first project	QUANTITY	0.97+
Global Action Mosaic	ORGANIZATION	0.97+
Solutions Summit	EVENT	0.96+
first week	QUANTITY	0.95+
Yusheheedi	PERSON	0.93+
AWS Imagine nonprofit	ORGANIZATION	0.93+
Global Action Mosaic	TITLE	0.92+
United Nations	LOCATION	0.91+
2019	DATE	0.9+
location B	OTHER	0.88+
couple weeks ago	DATE	0.86+
Humanitarian Tracker	TITLE	0.85+
one place	QUANTITY	0.84+
a year	QUANTITY	0.81+
AWS Imagine	EVENT	0.81+
first use	QUANTITY	0.79+
AWS Imagine	ORGANIZATION	0.79+
ASP	ORGANIZATION	0.78+
theCUBE	ORGANIZATION	0.77+
Development Goals	TITLE	0.76+
point A.	OTHER	0.74+
middle eastern spring	DATE	0.72+
months	QUANTITY	0.65+
top 10 global innovation	QUANTITY	0.64+
Global Action Mosaic	EVENT	0.63+
of data	QUANTITY	0.57+
Sustainable Development Goals	TITLE	0.52+
Nonprofit	EVENT	0.39+

Liran Zvibel, WekaIO | CUBEConversations, June 2019

>> from our studios in the heart of Silicon Valley. HOLLOWAY ALTO, California It is a cube conversation. >> Hi! And welcome to the Cube studios from the Cube conversation, where we go in depth with thought leaders driving innovation across the tech industry on hosted a Peter Burress. What are we talking about today? One of the key indicators of success and additional business is how fast you can translate your data into new value streams. That means sharing it better, accelerating the rate at which you're running those models, making it dramatically easier to administrate large volumes of data at scale with a lot of different uses. That's a significant challenge. Is going to require a rethinking of how we manage many of those data assets and how we utilize him. Notto have that conversation. We're here with Le'Ron v. Bell, who was the CEO of work a Iot leering. Welcome back to the Cube. >> Thank you very much for having >> me. So before we get to the kind of a big problem, give us an update. What's going on at work a Iot these days? >> So very recently we announced around CIA financing for the company. Another 31.7 a $1,000,000 we've actually had a very unorthodox way of raising thiss round. Instead of going to the traditional VC lead round, we actually went to our business partners and joined forces with them into building a stronger where Collier for customers we started with and video that has seen a lot of success going with us to their customers. Because when Abel and Video to deploy more G pews so they're customers can either solve bigger problems or solve their problems faster. The second pillar off the data center is networking. So we've had melon ox investing in the company because there are the leader ofthe fast NETWORKINGS. So between and Vidia, melon, ox and work are yo u have very strong pillars. Iran compute network and storage performance is crucial, but it's not the only thing customers care about, so customers need extremely fast access to their data. But they're also accumulating and keeping and storing tremendous amount of it. So we've actually had the whole hard drive industry investing in us, with Sigi and Western Digital both investing in the company and finally one off a very successful go to market partner, Hewlett Pocket enterprise invested in us throw their Pathfinder program. So we're showing tremendous back from the industry, supporting our vision off, enabling next generation performance, two applications and the ability to scale to any workload >> graduations. And it's good money. But it's also smart money that has a lot of operational elements and just repeat it. It's a melon ox, our video video, H P E C Gate and Western Digital eso. It's It's an interesting group, but it's a group that will absolutely sustain and further your drive to try to solve some of these key data Orient problems. But let's talk about what some of those key day or data oriented problems where I set up front that one of the challenges that any business that has that generates a lot of it's value out of digital assets is how fast and how easily and with what kind of fidelity can I reuse and process and move those data assets? How are how is the industry attending? How's that working in the industry today, and where do you think we're going? >> So that's part on So businesses today, through different kind of workloads, need toe access, tremendous amount of data extremely quickly, and the question of how they're going to compare to their cohort is actually based on how quickly and how well they can go through the data and process it. And that's what we're solving for our customers. And we're now looking into several applications where speed and performance. On the one hand, I have to go hand in hand with extreme scale. So we see great success in machine learning, where in videos in we're going after Life Sciences, where the genomic models, the cryo here microscopy the computational chemistry all are now accelerated. And for the pharmacy, because for the research interested to actually get to conclusion, they serve to sift through a lot of data. We are working extremely well at financial analytics, either for the banks, for the hedge funds for the quantitative trading Cos. Because we allow them to go through data much, much quicker. Actually, only last week I had the grades to rate the customer where we were able to change the amount of time they go through one analytic cycle from almost two hours, four minutes. >> This is in a financial analytics >> Exactly. And I think last time I was here was telling you about one of their turn was driving companies using us taking, uh, time to I poke another their single up from two weeks to four hours. So we see consistent 122 orders of monk to speed time in wall clock. So we're not just showing we're faster for a benchmark. We're showing our customer that by leveraging our technology, they get results significantly faster. We're also successful in engineering around chip designed soft rebuild fluid dynamics. We've announced Melon ox as an idiot customer. The chip designed customers, so they're not only a partner, they have brought our technology in house, and they're leveraging us for the next chips. And recently we've also discovered that we are great help for running Noah scale databases in the clouds running ah sparkles plank or Cassandra over work. A Iot is more than twice faster than running over the Standard MPs elected elastic clock services. >> All right, so let's talk about this because your solving problems that really only recently have been within range of some of the technology, but we still see some struggling. The way I described it is that storage for a long time was focused on persisting data transactions executed. Make sure you persisted Now is moved to these life life sciences, machine learning, genomics, those types of outpatients of five workloads we're talking about. How can I share data? How can I deploy and use data faster? But the historian of the storage industry still predicated on this designs were mainly focused on persistent. You think about block storage and filers and whatnot. How is Wecker Io advancing that knowledge that technology space of, you know, reorganizing are rethinking storage for the types, performance and scale that some of these use cases require. >> This is actually a great question. We actually started the company. We We had a long legacy at IBM. We now have no Andy from, uh, metta, uh, kind of prints from the emcee. We see what happens. Page be current storage portfolio for the large Players are very big and very convoluted, and we've decided when we're starting to come see that we're solving it. So our aim is to solve all the little issues storage has had for the last four decades. So if you look at what customers used today, if they need the out most performance they go to direct attached. This's what fusion I awards a violin memory today, these air Envy me devices. The downside is that data is cannot be sure, but it cannot even be backed up. If a server goes away, you're done. Then if customers had to have some way of managing the data they bought Block san, and then they deployed the volume to a server and run still a local file system over that it wasn't as performance as the Daz. But at least you could back it up. You can manage it some. What has happened over the last 15 years, customers realized more. Moore's law has ended, so upscaling stopped working and people have to go out scaling. And now it means that they have to share data to stop to solve their problems. >> More perils more >> probably them out ofthe Mohr servers. More computers have to share data to actually being able to solve the problem, and for a while customers were able to use the traditional filers like Aneta. For this, kill a pilot like an eyes alone or the traditional parlor file system like the GP affair spectrum scale or luster, but these were significantly slower than sand and block or direct attached. Also, they could never scale matter data. You were limited about how many files that can put in a single, uh, directory, and you were limited by hot spots into that meta data. And to solve that, some customers moved to an object storage. It was a lot harder to work with. Performance was unimpressive. You had to rewrite our application, but at least he could scale what were doing at work a Iot. We're reconfiguring the storage market. We're creating a storage solution that's actually not part of any of these for categories that the industry has, uh, become used to. So we are fasted and direct attached, they say is some people hear it that their mind blows off were faster, the direct attached, whereas resilient and durable as San, we provide the semantics off shirt file, so it's perfect your ability and where as Kayla Bill for capacity and matter data as an object storage >> so performance and scale, plus administrative control and simplicity exactly alright. So because that's kind of what you just went through is those four things now now is we think about this. So the solution needs to be borrow from the best of these, but in a way that allows to be applied to work clothes that feature very, very large amounts of data but typically organized as smaller files requiring an enormous amount of parallelism on a lot of change. Because that's a big part of their hot spot with metadata is that you're constantly re shuffling things. So going forward, how does this how does the work I owe solution generally hit that hot spot And specifically, how are you going to apply these partnerships that you just put together on the investment toe actually come to market even faster and more successfully? >> All right, so these are actually two questions. True, the technology that we have eyes the only one that paralyzed Io in a perfect way and also meditate on the perfect way >> to strangers >> and sustains it parla Liz, um, buy load balancing. So for a CZ, we talked about the hot sport some customers have, or we also run natively in the cloud. You may get a noisy neighbor, so if you aren't employing constant load balancing alongside the extreme parallelism, you're going to be bound to a bottleneck, and we're the only solution that actually couples the ability to break each operation to a lot of small ones and make sure it distributed work to the re sources that are available. Doing that allows us to provide the tremendous performance at tremendous scale, so that answers the technology question >> without breaking or without without introducing unbelievable complexity in the administration. >> It's actually makes everything simpler because looking, for example, in the ER our town was driving example. Um, the reason they were able to break down from two weeks to four hours is that before us they had to copy data from their objects, George to a filer. But the father wasn't fast enough, so they also had to copy the data from the filer to a local file system. And these copies are what has added so much complexity into the workflow and made it so slow because when you copy, you don't compute >> and loss of fidelity along the way right? OK, so how is this money and these partnerships going to translate into accelerated ionization? >> So we are leveraging some off the funds for Mohr Engineering coming up with more features supporting Mohr enterprise applications were gonna leverage some of the funds for doing marketing. And we're actually spending on marketing programs with thes five good partners within video with melon ox with sick it with Western Digital and with Hewlett Packard Enterprise. But we're also deploying joint sales motion. So we're now plugged into in video and plugged, anted to melon ox and plugging booked the Western Digital and to Hillary Pocket Enterprise so we can leverage their internal resource now that they have realized through their business units and the investment arm that we make sense that we can actually go and serve their customers more effectively and better. >> Well, well, Kaio is introduced A road through the unique on new technology into makes perfect sense. But it is unique and it's relatively new, and sometimes enterprises might go well. That's a little bit too immature for me, but if the problem than it solves is that valuable will bite the bullet. But even more importantly, a partnership line up like this has got to be ameliorating some of the concerns that your fearing from the marketplace >> definitely so when and video tells the customers Hey, we have tested it in our laps. Where in Hewlett Packard Enterprise? Till the customer, not only we have tested it in our lab, but the support is going to come out of point. Next. Thes customers now have the ability to keep buying from their trusted partners. But get the intellectual property off a nor company with better, uh, intellectual property abilities another great benefit that comes to us. We are 100% channel lead company. We are not doing direct sales and working with these partners, we actually have their channel plans open to us so we can go together and we can implement Go to Market Strategy is together with they're partners that already know howto work with them. And we're just enabling and answering the technical of technical questions, talking about the roadmap, talking about how to deploy. But the whole ecosystem keeps running in the fishing way it already runs, so we don't have to go and reinvent the whales on how how we interact with these partners. Obviously, we also interact with them directly. >> You could focus on solving the problem exactly great. Alright, so once again, thanks for joining us for another cube conversation. Le'Ron zero ofwork I Oh, it's been great talking to you again in the Cube. >> Thank you very much. I always enjoy coming over here >> on Peter Burress until next time.

Published Date : Jun 5 2019

SUMMARY :

from our studios in the heart of Silicon Valley. One of the key indicators of me. So before we get to the kind of a big problem, give us an update. is crucial, but it's not the only thing customers care about, How are how is the industry attending? And for the pharmacy, because for the research interested to actually get to conclusion, in the clouds running ah sparkles plank or Cassandra over But the historian of the storage industry still predicated on this And now it means that they have to share data to stop to solve We're reconfiguring the storage market. So the solution needs to be borrow and also meditate on the perfect way actually couples the ability to break each operation to a lot of small ones and Um, the reason they were able to break down from two weeks to four hours So we are leveraging some off the funds for Mohr Engineering coming up is that valuable will bite the bullet. Thes customers now have the ability to keep buying from their You could focus on solving the problem exactly great. Thank you very much.

ENTITIES

Entity	Category	Confidence
Western Digital	ORGANIZATION	0.99+
Liran Zvibel	PERSON	0.99+
IBM	ORGANIZATION	0.99+
two weeks	QUANTITY	0.99+
Mohr Engineering	ORGANIZATION	0.99+
100%	QUANTITY	0.99+
two weeks	QUANTITY	0.99+
Western Digital	ORGANIZATION	0.99+
two questions	QUANTITY	0.99+
Silicon Valley	LOCATION	0.99+
CIA	ORGANIZATION	0.99+
Peter Burress	PERSON	0.99+
George	PERSON	0.99+
June 2019	DATE	0.99+
122 orders	QUANTITY	0.99+
Hewlett Packard Enterprise	ORGANIZATION	0.99+
four hours	QUANTITY	0.99+
last week	DATE	0.99+
four minutes	QUANTITY	0.99+
Mohr	ORGANIZATION	0.99+
Sigi	ORGANIZATION	0.99+
Hewlett Pocket	ORGANIZATION	0.99+
two applications	QUANTITY	0.99+
five good partners	QUANTITY	0.98+
second pillar	QUANTITY	0.98+
both	QUANTITY	0.98+
31.7	QUANTITY	0.98+
Andy	PERSON	0.98+
Collier	ORGANIZATION	0.98+
today	DATE	0.98+
H P E C Gate	ORGANIZATION	0.97+
more than twice	QUANTITY	0.97+
Le'Ron	PERSON	0.96+
Hillary Pocket Enterprise	ORGANIZATION	0.95+
Bell	PERSON	0.95+
four things	QUANTITY	0.95+
Melon ox	ORGANIZATION	0.95+
one	QUANTITY	0.94+
five workloads	QUANTITY	0.92+
each operation	QUANTITY	0.92+
Cube	ORGANIZATION	0.92+
Abel	ORGANIZATION	0.91+
Western Digital eso	ORGANIZATION	0.91+
$1,000,000	QUANTITY	0.89+
almost two hours	QUANTITY	0.89+
single	QUANTITY	0.89+
Io	PERSON	0.88+
last 15 years	DATE	0.86+
HOLLOWAY ALTO, California	LOCATION	0.86+
One	QUANTITY	0.85+
one analytic cycle	QUANTITY	0.81+
Cassandra	PERSON	0.78+
last four decades	DATE	0.77+
Kayla Bill	PERSON	0.74+
Kaio	ORGANIZATION	0.69+
Moore	PERSON	0.67+
Vidia	ORGANIZATION	0.59+
Noah	COMMERCIAL_ITEM	0.59+
Video	ORGANIZATION	0.56+
Iran	LOCATION	0.54+
Wecker	ORGANIZATION	0.54+
Aneta	ORGANIZATION	0.53+
Block san	ORGANIZATION	0.5+
Iot	TITLE	0.5+
Daz	ORGANIZATION	0.44+
WekaIO	PERSON	0.41+
Pathfinder	TITLE	0.32+

theCUBE Video Report Exclusive | E3 2018

Jeff Rick here at the cube we're at the LA Convention Center at e3 is our first time coming to this convention is sixty eight thousand people and every single hall and outside inside hotels it's pretty crazy great to see you thank you so much for having me [Music] years ago it was really much more about a a trade show so that you know the big people who are gonna buy the disc could actually come to eat right right check out our games and place their disorders and now it's really much more of a consumer phenomenon right let's have a competition let a brand's find outdo each other but more of let's make this more about the games than the booth babes and those things it's funny everything changed in dubbings chains right people are always super excited there's always gamers that want to see the newest stuff that hasn't changed at all but just the sheer technology differences so we're doing this series as part of the Western Digital data makes possible and data is such a big part of what you guys do you can really start to understand who your players are and so if you're gonna do an upsell offer you know you can understand like oh this person has actually already purchased this type of material so I'm gonna give them this type of upsell versus this type of upsell or you know I see all my players are really struggling on level three and no one's making it through what's wrong with level three they're spending too much time in an area not knowing what they're doing will go OK right we need to change that we need to signpost back to serenity we need to turn around say how can we make it clearer to the players they know what they do but also keep the reward so that they feel like they've achieved it they feel like they've figured it out right we've placed people in front of the game in very early stages to receive him alcohol ideas of working and then based on that we then look at video footage interviews and all that stuff some kind of that feedback see into the design loop process previously years ago to get some of these insights you would have had to be one of the largest game company from them and now with you know the democratization of these different game engines and then the democratization of this type of like to lean and online services that are available it really creates an amazing opportunity for all developers everywhere we see these tremendous boots that are here fabulous graphics VR coming down the pike CPU and graphical chips are all over the place so basically power an internet and 5 G's coming mobility is gonna be way way faster the horsepower that you need to run this kind of game is actually pretty staggering we can compute a lot of stuff on the GPU the CPUs tons and tons of the objects get physics constraints and things that are costly for computation cycles and then there's like memory issues you know we have streaming that we have to kind of get better at these worlds are very large and so to store the things that you're gonna see and do takes a lot of actual you know harddrive space and the speed at which we can load and unload things is that critical factor in terms of you know unlocking the freedom of your experience right we really have a PC development technology that is easy to port the Xbox and PlayStation so we have a private cloud in Europe and a private cloud and we run this on your own inference we're on our totally on our own infrastructure and it has its advantages because we're completely in control but I think now just don't need to make the big investment in hardware upfront you can solve all the problems in a cloud solution right now and then deploy either privately or publicly it's much more flexible now than it was we know from our creator standpoint the biggest thing that they complain about is hey I want to grow right like I've been streaming for X amount of years I'm creating content how do I grow at twitch we have like the broadest means of ways to monetize but also the lowest barrier of entry to take advantage of them and our subscribers by the way they know that they're supporting you and proud to do so Joy's supporting the kind of courage do they know if they didn't support you you might not be streaming they love being playing a role in keeping their favorite creators around the content that you see here today much more diverse and much broader you know we still have a long way to go as an industry but it's very different than my first 17 years ago used to be gamers played games because of the technology and now they play games because of the games right because no one cares about the technology right because you could do almost anything on any device now and now so it's really important to us as game developers to hide the technology from players and just give them a great expression and every year you know new stuff rolls out slightly newer Xbox slightly newer PlayStation better pcs so we just stay up-to-date with the drivers and make sure that we support whatever crazy hardware is coming out right and it all works great you're watching the cube from e3 I like convention center thanks for watching [Music]

Published Date : Jun 25 2018

**Summary and Sentiment Analysis are not been shown because of improper transcript**

ENTITIES

Entity	Category	Confidence
Europe	LOCATION	0.99+
Jeff Rick	PERSON	0.99+
first time	QUANTITY	0.99+
PlayStation	COMMERCIAL_ITEM	0.99+
LA Convention Center	LOCATION	0.99+
sixty eight thousand people	QUANTITY	0.99+
e3	EVENT	0.98+
level three	QUANTITY	0.97+
Xbox	COMMERCIAL_ITEM	0.97+
level three	QUANTITY	0.97+
today	DATE	0.96+
Western Digital	ORGANIZATION	0.96+
twitch	ORGANIZATION	0.95+
one	QUANTITY	0.94+
every single hall	QUANTITY	0.93+
Joy	PERSON	0.92+
17 years ago	DATE	0.91+
E3 2018	EVENT	0.91+
years ago	DATE	0.89+
much	QUANTITY	0.88+
theCUBE	ORGANIZATION	0.85+
tons and tons of the objects	QUANTITY	0.8+
first	QUANTITY	0.69+
lot	QUANTITY	0.64+
stuff	QUANTITY	0.56+
cube	COMMERCIAL_ITEM	0.47+

Recommend Videos

Sentiment Analysis

AWS Comprehend

Search Results for Western digital: