Swami Sivasubramanian, AWS | CUBE Conversation, January 2022
>>And welcome to this special cube conversation. I'm John for a, your host of the cube. We're here in Palo Alto, California, and I'm here with a very special guest coming down from Seattle remotely into the cube studios is the leader at AWS Amazon web services, the vice president of database analytics and machine learning Swami. Great to see you cube alumni recently taking over the database business at AWS as a leader. Congratulations. And thanks for coming on the cube. >>Hey, my pleasure to be here, John, very excited to talk to you. >>Yeah. We've had many conversations on the cube and also in person and also online around all the major mega trends. You've had your hand in all the action, going back to your days when you were in school learning and, and writing papers. And 10 years ago, Amazon web services launched AWS dynamo, DB, fast, flexible, no SQL database that everyone loves today, which has inspired a generation of what I would call database distributing cloud scale, single digit millisecond performance at scale. And again, the key scale. And again, this is 10 years ago, so it seems like yesterday, but you guys are celebrating and your name was on the original paper with CTO Verner. Vogel's your celebrity. Congratulations. >>Thank you. Not sure about the celebrating part, but I'm very excited. At least I played a hand in building such an amazing technology that has enabled so many amazing customers along the way as well. So >>Trivia on the, on the paper as you were an intern at AWS, so you're getting your PhD. And then since, since rising through the ranks and involved in a lot of products over the years, and then leading the machine learning and AI, which is now changing the game at the industry level, but I got to ask you getting back to the story here. A lot of customers have built amazing things on top of dynamo DB, not to mention lots of other AWS and Amazon tech riding on it. Can you share some of the highlights that came out of the original paper? And so with some examples, because I think this is a point in time, 10 years ago, where you start to, so the KickUp of cloud scale, not just, just for developers and building startups, you're really starting to see the scale rise. >>Yeah, I actually, I mean, as you probably know, based on what he read to explain the Genesis of dynamo DB itself had to explain the Genesis of how Amazon got into building the original dynamo, right? And this was during the time when miner, I joined Ron esteem as an intern and, and Amazon was one of the pioneers in pushing the boundary of scale. And a year over year, our Q4 holiday season tends to be really, really bad for all the right reasons. We all want our holiday shopping done during that time. And you want to be able to scale your website, arters fulfillment centers, all of them at that time. And those are the times around 2005. And the answer is when people think our database, they think of a single database server that actually runs on a box and has a certain characteristics and does a scale and availability and whatnot. >>And it's usually relational. And then when we had a major disruption during Q4 that's when yeah, ask ourselves the question, why are we actually using a relational database for some of these things when they really didn't need the data model complexity of relational database. And normally I would say most companies where to actually ask an intern or a few engineers who are early in the career saying like, what the hell are you suggesting? Just go away. But Amazon being enabling Buddhists to build what they want. And they actually let us start reimagining what a database or our scale could look like. And that led to dynamo. And since she unstained mine, then we migrated from an traditional relational database stair this one for some of the amazon.com services. And then I moved on to actually start building some butts off our storage service and then our managed relational database service, I explicitly remember. >>And one of our customer advisory board, we're just the set off some of our leading customers who actually give us feedback on roadmap. Another son, Don, who's the CEO and chief geek of spunk bargain faker. And him actually looking at the Trinity me, I was starting in the corner and saying like you all, both tomorrow and why do I need to keep shotting my, my sequel database and reshooting assigned scaling. And this is the time when the state of the art in most databases were around. Like, you start sharding your relational database and constantly reshaping. And this is when most websites are starting to experience the kind of scale which we consider a normal month. During those times it was mostly, most companies used to have a single relational database backend and start scaling that way. And that conversation led entirely under duress, unaided read, lot of AWS leaders and myself saying like, Hey, what is a cloud database reimagined without the hampering SQL look like? And that led us to start building dynamo DB, but just a key value database at that time. Now we support document might've too, but that single digit millisecond latency at any scale imagine. So >>I think about that time at that time, 10 years ago, when you were having this conversation and I know the smug mug and I, he said, he's in totally geek and he's, he's good to point that out. You also have Netflix as customers too. I'd like to hear how that's evolved, but, but I think back at the time, if you look back then I got to ask you most people we've talked about this before. No one database rules, a world that's now standard people now don't see one database back then it was a one database kind of mindset back then. Yeah. And then you had that big data movement happening with Hadoop. You had the object store developing. So you're in you're you're circling around that area. What was it like then? I mean, take, take us through that because there was obvious visibility that, Hey, let's just store this. Now you see data lakes and that's all happening. But back then object store was kind of new. Yeah. >>Ah, it's a great question. Now, one of the things I realized early on, especially when I was working with binary, when you're saying amazon.com itself as an example, that the access patterns for various applications and Amazon, but let alone AWS customers tend to be very, very, very, some of them really just needed an object store. Some of them needed a relational database. Some of them really wanted a key value store within a fast latency. Some of them really needed a durable cash. And, but it so happens when you have a giant hammer. You use that for everything looks like a map, which is essentially the story at that time. And so everyone kept using the same database, irrespective of what the problem was because nobody else, I mean, thought about like, what else can we build that is better? So this let us do, literally I remember writing a paper with Bernard internally that is widely used in Amazon explaining what are all the menu of booklets that access. >>And then how do we go about actually solving for each of these things so that they can actually grow and innovate faster. And, and this was led to actually the Genesis of not only building IDs and so forth, but also dynamo and various other non-relational data. There's a still let alone not so storage access patterns and what not. So, and this was one of the big revelations he had just that there is not a single database that is going to meet the customer, needs us. The diversity of workloads in the internet is growing. And this was a key pivotal moment because with cloud now applications can scale very more instantly than before now. Building an application for Superbowl is very easier than before. That means that on, I mean, everybody is pushing the boundaries of what scale means, and they are expecting more from their obligations. That's when you need technologies like dynamo, DB, and that's exactly what dynamo already be set out to do. And since then, we are continuing to innovate on behalf of our customers and the purpose of the database story as well. And this concept has resonated well across the board. If you see that the database industry has also embraced this method, >>It's natural that you obviously evolved into the machine learning side of it because that's data is big part of that. And you see back then you, you bringing up kind of like flashes for me where it's like those, the data conversations back then and the data movement was just beginning. So the idea that you can have diversity in access methods of the kind of databases was a use case driven by the application, not so much database saying, this is how you have to work, that the script was flipped. It it's changed from infrastructure dictating to the applications, what to do. Now, the applications are going to the infrastructure and saying, give me what I want. I want to access something here in an office store, something here in no SQL that became the Genesis of infrastructure as code at a, at a global level. And so your paper kind of set the, the, the wave, the influence for this, no SQL did big data movement. It's created tons of value, maybe a third Mongo might've been influenced by this other people have been influenced. Can you share some stories of how people adopted the concept of dynamo DB and how that's changed in the industry and how has that helped the industry evolve? >>I mean, plus file data. Most share our experience of building and dynamo style data store. Very, it is a non-relational API and showing what are some of the experiences that the Venter in building such an paper and these set out early on itself, that it is should not be just a design paper, but it should be something that we shared our experiences. So even now, when I talked to my friends and colleagues and various other companies, one thing they always tell me is they appreciated the openness with which we were sharing. Some of the examples and learnings that we learned to not optimizing for percentile latencies, and what are some of the scalability challenges, how we solved and some of the techniques around things like sloppy Cora or various other stuff. We invented a lot of towns along the way too, but people really appreciated several of some of our findings and as talking about it. >>And since then I met so many other innovations are happening in the industry and the AWS, but also across the entire academia and industry in this space, the databases I've been going through what I call as a period of Renaissance, where one of the things, if you see our own arc, when Roger and I started on the database, front Disney started over the promo saying like, if you were to build a database where cloud is the new normal, this is again in 2008, we asked ourselves that question and what the belt that led us to start building things like dynamo, DB, RDS star. I know that alone, we reimagined data viruses with Redshift and several, and then several other databases like time stream for time series workloads started running Neptune for graph and whatnot. But at the moment we started actually asking that question and working backwards from customers. Then you will start being able to innovate accordingly. And this has worked really well. Then more than a hundred thousand AWS customers have chosen dynamo DB for mobile gaming tech IOT. Many of these are fast growing businesses, such as ledge, Darryl BNB, red fan, as soon as enterprises like Samsung Toyota, capital one and so far. So these are like really some meaningful clouds, let alone amazon.com. I run this. >>We have an internal customer is always good to have that entire inside customer. You know, I really find this a really profound use case because you're just talking, you know, in Amazonian terms, I'll just translate for the audience working backwards from the customer, which is the customer obsession you guys have. So here's, what's going on off the way I see it. You got dynamo, DB, paper, you and Verner, and the team Paul was a great as a great video on your blog posts that goes into the, to the talk he gave at around that time, which is fun to watch if you look back, but you have a radical enabler here, that's disrupting and changing S3 RDS, Aurora. These are game-changing concepts inside the, the landscape of AWS at the same time, you're working backwards from the customer. So the question I have for you as a leader and as a builder, how did you balance the working backwards from the customer while bringing something brand new and radical at that time to the market? >>Yeah, this is one of the S I mean hardest things to be, as leaders need to balance on. If you see many times, then we actually worked backwards from customers. The literal later translated this, literally do what customers are asking for, which is true nine out of 10 times, but there is one or a 10 times, you got to read between the lines on what they are asking. Because many times customers when are articulate that they need to go fast. If in the right way, they might say, Hey, I wish my heart storage goes faster, but they're not going to tell you they need a car, but you need to know and be able to translate and read between the lines we call it under the bucket of innovate on behalf of customers. And that is exactly the kind of a mantra we had when we were thinking about concepts like dynamo DB, because essentially at that time, almost everybody would, if I asked, they would just say, I wish a relational database could actually be able to scale from not just like a hundred gigabyte to one terabyte are, it can take up to like 2 million transactions, a second and so forth and still be cheap and made in reality as relational databases, the way they were engineered at that time, those are not going to meet the scale needs. >>So this is fair. We hunted read between the lines on what are some of the key Mustang needs from customers and then work backwards and then innovate on behalf of these workloads, be enabled by the sun oh four, which are some of the reasons that led to us launching some of the initial sets on dynamo on a single digit millisecond latency and seamless scale. At that time, databases didn't have the elasticity to go from like 10 requests, a second to like a hundred thousand or 1 million requests a second, and then scaled right back in an hour. So that was not possible. And we kind of enabled that. And that was an, a pretty big game changer that showed the elasticity of the cloud to a database. Well, >>Yeah, I think also just to, not to nerd out on this, but it enables a lot of other kind of cool scaled concepts, like queuing storage. It's all kind of together. This database piece of that you guys are solving. And again, props to you guys on the team. Congratulations. I have to ask, you know, more generally, how has your thinking changed since the paper? I'll see, you've got more experience under your belt. You don't yet have the gray hairs yet, but we'll see those soon come in, but you know, you're, you got a lot more experience. You're running teams, you're launching a lot of products. How has your thinking changed in the industry since the paper what's happening now? What's the big evolution. What are those new things now that are in the innovate on behalf of the customer? What's between the lines now, how do you see this happening? >>I mean, now since wanting dynamo via a victim, I had the opportunity to work on various problems in the big data space. There we've worked on some are fire things that you might be aware of in the analytics all the way from Redshift to quick side, too. Then I moved on to start some of our efforts, having built systems that enabled customer to store process and credit, and then analyze them. One of the realizations, I had this, the in around 2015 or 2016, I kinda had that machine learning was hitting a critical point where now it is ready for being scaled at option. Their cloud has basically enabled limitless compute and limitless storage, which are the factors that are holding back machine learning technology. Then I realized that now we have a unique opportunity to bring machine learning BI to everybody, not just folks with PhD in machine learning. >>And that's when I moved on from database and analytics areas, they started machine learning. We're just a descent area because machine learning is powered by data and then started building capabilities like SageMaker, which is our end to end ML platform to build, train and deploy them on models. And this, what does the leading enterprise platform by several gaggled users and then also a bunch of our AI services since then, I view the reason I'm giving all this historical context is one of the biggest realization I had early on itself. And 2016 as first machine learning is one of the most disruptive technologies. She will then country in our generation. This is right after cloud. I think these still are the most amazing combination that is going to revolutionize how we build applications and how we actually reason about that. Now, the second thing is that at the end of the day, when you look at the ANC and journey, it is not just about one database or one data Varroa. >>So one data lake product, or even 1:00 AM out platform. It is about the end to end journey where a customer is storing their order database. And then they are actually building a data lake that test customer history and order history. And they want to be able to personalize. And for their viewer experience are actually forecast what products to staff in their fulfillment center, but then all these things need to work and to handle. And that view is one of the big things that struck me for the past five years. And I've been on this journey in addition to building this Emma building blocks to connect the dots so that customers can go on this modern end to end data strategy as I call it, right. It goes beyond a single database technology or data technology, but putting now all of these end to end together so that customers don't end up spending six months connecting the dots, which has been the state of the down for the last couple of years. And we are bringing it down to matter of the Sundays. Now >>He's incredible Swami. Thank you so much for spending the time with us here in the, >>Yeah, my pleasure. Thanks again, Sean. Thanks for having me.
SUMMARY :
And thanks for coming on the cube. And again, this is 10 years ago, so it seems like yesterday, but you guys are celebrating so many amazing customers along the way as well. and then leading the machine learning and AI, which is now changing the game at the industry level, but I got to ask you getting back to And the answer is when people think our database, they think of a single database server that And that led to dynamo. at the Trinity me, I was starting in the corner and saying like you all, And then you had that big data movement happening with Hadoop. Now, one of the things I realized early I mean, everybody is pushing the boundaries of what scale means, So the idea that you can have diversity in Some of the examples and learnings that we learned to not optimizing for percentile And since then I met so many other innovations are happening in the industry from the customer, which is the customer obsession you guys have. And that is exactly the kind of a of the cloud to a database. And again, props to you guys on the team. I had the opportunity to work on various problems in the big data space. And this, what does the leading enterprise And I've been on this journey in addition to building this Emma building blocks Thank you so much for spending the time with us here in the, Yeah, my pleasure.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Amazon | ORGANIZATION | 0.99+ |
AWS | ORGANIZATION | 0.99+ |
John | PERSON | 0.99+ |
2008 | DATE | 0.99+ |
Sean | PERSON | 0.99+ |
Seattle | LOCATION | 0.99+ |
10 requests | QUANTITY | 0.99+ |
six months | QUANTITY | 0.99+ |
January 2022 | DATE | 0.99+ |
Don | PERSON | 0.99+ |
2016 | DATE | 0.99+ |
one | QUANTITY | 0.99+ |
10 times | QUANTITY | 0.99+ |
tomorrow | DATE | 0.99+ |
nine | QUANTITY | 0.99+ |
1:00 AM | DATE | 0.99+ |
Swami | PERSON | 0.99+ |
one terabyte | QUANTITY | 0.99+ |
second thing | QUANTITY | 0.99+ |
10 years ago | DATE | 0.99+ |
first | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
Netflix | ORGANIZATION | 0.99+ |
more than a hundred thousand | QUANTITY | 0.99+ |
yesterday | DATE | 0.99+ |
Disney | ORGANIZATION | 0.99+ |
both | QUANTITY | 0.98+ |
Roger | PERSON | 0.98+ |
Ron | PERSON | 0.98+ |
1 million requests | QUANTITY | 0.98+ |
Swami Sivasubramanian | PERSON | 0.98+ |
amazon.com | ORGANIZATION | 0.98+ |
S3 RDS | COMMERCIAL_ITEM | 0.98+ |
an hour | QUANTITY | 0.98+ |
Paul | PERSON | 0.97+ |
Bernard | PERSON | 0.97+ |
2005 | DATE | 0.97+ |
2 million transactions | QUANTITY | 0.97+ |
Vogel | PERSON | 0.97+ |
one database | QUANTITY | 0.97+ |
each | QUANTITY | 0.96+ |
SQL | TITLE | 0.96+ |
single database | QUANTITY | 0.96+ |
single | QUANTITY | 0.96+ |
2015 | DATE | 0.95+ |
Mongo | ORGANIZATION | 0.94+ |
Redshift | TITLE | 0.94+ |
Verner | ORGANIZATION | 0.93+ |
today | DATE | 0.93+ |
One | QUANTITY | 0.93+ |
Varroa | ORGANIZATION | 0.92+ |
CTO Verner | PERSON | 0.92+ |
Hadoop | TITLE | 0.91+ |
a second | QUANTITY | 0.91+ |
single digit | QUANTITY | 0.91+ |
Mustang | ORGANIZATION | 0.91+ |
last couple of years | DATE | 0.89+ |
Q4 | DATE | 0.88+ |
a year | QUANTITY | 0.88+ |
a hundred thousand | QUANTITY | 0.87+ |
Samsung Toyota | ORGANIZATION | 0.86+ |
single digit | QUANTITY | 0.86+ |
SageMaker | TITLE | 0.86+ |
BOS3 Ali Zafar AWS
(upbeat music) >> Welcome back to "The Cube's" continuous coverage of AWS re:Invent 2021. We're running one of the industry's most important and largest hybrid tech events of the year with AWS and its ecosystem partners. And of course, special thanks to AMD for supporting this year's editorial coverage at the event. We got two live sets. We had two remote sets, one in Boston and one in Palo Alto. We got more than 100 guests coming on the program, and we're looking deep into the next decade of cloud innovation. We're super excited to be joined by Ali Zafar, who's the senior director of platform strategy and operations at Dropbox. Ali, great to see you. Thanks for coming on. >> Awesome, it's a pleasure to be here with you, Dave. >> So hey, what's your day job like at Dropbox and what's your role? >> Got it. So I actually oversee the global supply chain at Dropbox. Also all of the capacity planning, which entails both our budget and also capacity requirements at Dropbox. And then I also focus on the platform product management side, which is basically building our build versus buy and our overall roadmap for our platform in the long run. >> Great, thank you. So, everybody knows Dropbox, but maybe you could talk a little bit about your business, your mission, and how that's evolved over the past several years. >> Got it. So Dropbox is a global collaboration platform and our mission at Dropbox is to help design a more enlightened way of working. Dropbox has over 700 million registered users and over 550 billion pieces of content. So taking a step back Dave, Dropbox helps with all of your content. Think of this as videos as music, even your tax returns, allows you to organize all of this content and then you can share this content with anybody at any time. You can also take Dropbox to work and actually it makes you even more productive in the workplace. Integrating all of your tools seamlessly, also allowing you to collaborate with all of your teams internally and also externally. >> Thank you, Ali. When Dropbox was founded, the cloud was really nascent, right? So it was early days, and so a lot has changed since the mid last decade. And of course with remote work and hybrid work, that's had to be a real tailwind to your business, but maybe you could explain your cloud and your hybrid cloud strategy. >> Got it, you're spot on Dave. So Dropbox has always been hybrid since its inception in 2007. And when I say hybrid, I mean we have our own on-prem infrastructure and then we also leverage public cloud. Now public cloud still to these days, remains absolutely critical for Dropbox to serve all of its customer needs. And when we talk about the decision between public or private, we think about three or four key things. One is the total cost of ownership, we look at go-to-market, we also look at our customer requirements and the latest technology that's available in the market. And then any international data storage requirements to make the decision of going towards public or private for that specific use case. >> So, I wonder if we could follow up on that. Maybe you could talk about the key business considerations as an SAS storage provider. What are the real drivers and in your business framework? >> Got it. At the end of the day, what really matters for us Dave, is to actually think about our customers and delight them. And what better than to focus on performance, reliability and also security, right? So we want to make sure that the infrastructure that we have today, allows Dropbox to actually solve for this specific use case for our customers. You know, what do they care about? While also doing this in a very efficient, managed way. So to summarize that, looking at performance, looking at liability, looking at scalability, looking at efficiency, and then also compliance. >> So that leads me to my next question, is about the EC2 instances that you use. I know you make heavy use of AMD compute. How did you come to that decision? Was it these factors? Was it all performance? How did you migrate to really enable that capability? How complex was that? >> Got it. So AMD has always been our key strategic partners partners. The partnership is well over four to five years right now, and we've been leveraging them on our on-prem infrastructure for compute. So we've always had AMD in our infrastructure and when the time came where AWS was also leveraging some of the AMD instances, we wanted to see how we can expand the partnership with AMD and AWS and also experiment with these instances. So we looked at some of the tooling updates that were required. We also looked at specific instances, which are either compute optimized and memory optimized instances, and then we actually built our footprint on AMD. And what we saw is that the overall performance improvement and also cost improvements that we got per specific workloads was actually extremely, was overall awesome results for Dropbox and our end customers. And we have been using them ever since. >> What kind of business impact did that make? Did it make a difference to your business that was noticeable? >> On the business side, I think primarily it was more on the TCO side, which is where we got most of the benefits on the cost side. And then also for some of our internal workloads, we also saw benefit to our internal developers that are using some of those workloads. >> So, you guys have become the poster child for hybrid and a lot has been talked about about you all, but I wonder if you could help us understand what part of your infrastructure is going to be better served by public cloud versus doing your own I T on-prem. What are some of the value drivers that are making you push workloads into the public cloud? Help us understand that better and squint through that. >> Got it, I get asked that question a lot. So public cloud in general allows for faster go-to-market. Think about this as like product launches, feature launches, also international expansion. It allows us to scale and then also leveraging some of the existing technologies out there in the market for some of the common workloads. So just taking a step back and thinking about Dropbox, we keep on evaluating also the criteria and then also specific workloads on what makes sense on private or public cloud. And AWS had some instances like S3 RDS and EC2 that when we started looking at, we knew that some of our key services like data platform, some parts of the MLN A I, and even paper platform would make more sense for us to actually leverage some of these in a public cloud for that. >> So what are the characteristics of the workload that are better suited to be an AWS? What's the ideal workload profile? You know, we talk about ideal customer profile. What's the ideal workload profile for the AWS cloud? >> Got it, so the way we think about it, at least, we call it the rule of three at Dropbox. And that means we look at scale first, we look at technology and innovation and what I mean by that is, is there a faster innovation in the public cloud and is that workload common enough that there's already a lot of work going on in public cloud, then there's no reason for us to actually innovate faster than that, we probably can't. And if the scale is not large enough. So when we talk about our storage side, like magic pocket, the scale is large enough where innovating there makes sense and it's better for the end customer. So we would probably go towards private cloud there. But then when we talk about international expansion, when we talk about faster go-to-market or some of the innovation in the MLN A I space, it really makes sense to use public cloud because of all of the advancements that we've seen there. >> So let me circle back to the business benefits and impact of the AMD based compute specifically, but you talked about TCO before, so there's certain things you mentioned on-prem you sometimes use. If the thing's hardened, you don't want to necessarily rip and replace it, but if you can accelerate go-to-market and you spin up things in the cloud, that makes sense. You mentioned customer requirements, so that just kind of depends. And then the international expansion and scale. So it comes down to those whatever four or five factors, right? The TCO, those other factors that I mentioned. Kind of the high level benefits, if you wouldn't mind summarizing for us, Ali. >> Yeah, I think you're spot on there. So there's looking at the overall TCO, right? The cost of serving the overall cloud, looking at go-to-market in general, can we leverage public cloud and go-to-market faster, obviously meeting our end customer requirements. We also looked at international expansion, any of the customer's data that is stored outside of the U S is all on public cloud for Dropbox. No plans in the short term to do something different there. And then also just looking at, like I mentioned, anything in the technology space that is ongoing, that we can leverage feature side or the product side for our customers, like A I or M L, we are going to leverage public cloud there. >> So of course you know, we've followed the progression of semiconductor technology for decades. This industry is marched to the cadence of performance improvements. What do the futures hold from a technology roadmap standpoint, and particularly as it relates to leveraging AMD EC2 instances, Ali? >> Got it. So Dropbox is in a very unique position where we actually leverage AMD both on-prem and for public cloud, leveraging some of the AWS EC2 instances like you mentioned. And EPYC processors from AMD are what we're using today, both on the hybrid infrastructure side and the performance. And also the TCO benefits are real and something that we are observing on a day-to-day basis. So we are going to be leveraging that technology even in the future, and the partnership with the AMD continues to be very, very strong for Dropbox. >> Well, Ali, I really appreciate you coming on "The Cube" as part of our coverage. It's great to have you. Love to have you back sometime. >> Awesome, thank you. And also just last thing we wanted to also call out that we are also going to be experimenting with probably Milan that is coming out. Rome is the current processors from AMD that we have been leveraging and as Milan comes available, we do want to continue to evaluate it and see how we can fit it in our infrastructure. >> So their are generations are city-based. Are they all Italian city-based, or are we going to run out of cities soon? (both laughing) >> Got it. Again, the partnership with both AWS and AMD is something that I'm very proud of. >> Excellent. >> Thank you so much. Thank you, Dave. >> Great to have you Ali, and really appreciate you watching. Keep it right there for more action on "The Cube", your leader in hybrid tech event coverage. (calm music)
SUMMARY :
and largest hybrid tech events of the year to be here with you, Dave. So I actually oversee the the past several years. and then you can share this content the cloud was really nascent, right? and the latest technology What are the real drivers and that the infrastructure is about the EC2 instances that you use. and then we actually built On the business side, I think primarily What are some of the value drivers for some of the common workloads. characteristics of the workload Got it, so the way we and impact of the AMD No plans in the short term to So of course you know, we've followed and the partnership with the AMD Love to have you back sometime. Rome is the current processors from AMD or are we going to run out of cities soon? Again, the partnership Thank you so much. Great to have you Ali, and
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
AMD | ORGANIZATION | 0.99+ |
Dave | PERSON | 0.99+ |
Ali Zafar | PERSON | 0.99+ |
Palo Alto | LOCATION | 0.99+ |
AWS | ORGANIZATION | 0.99+ |
Boston | LOCATION | 0.99+ |
2007 | DATE | 0.99+ |
Ali | PERSON | 0.99+ |
Dropbox | ORGANIZATION | 0.99+ |
one | QUANTITY | 0.99+ |
two remote sets | QUANTITY | 0.99+ |
two live sets | QUANTITY | 0.99+ |
five factors | QUANTITY | 0.99+ |
Milan | LOCATION | 0.99+ |
One | QUANTITY | 0.99+ |
more than 100 guests | QUANTITY | 0.99+ |
four | QUANTITY | 0.99+ |
mid last decade | DATE | 0.98+ |
Rome | LOCATION | 0.98+ |
both | QUANTITY | 0.97+ |
over 550 billion pieces | QUANTITY | 0.97+ |
EC2 | TITLE | 0.96+ |
five years | QUANTITY | 0.96+ |
over 700 million registered users | QUANTITY | 0.96+ |
S3 RDS | TITLE | 0.95+ |
TCO | ORGANIZATION | 0.93+ |
today | DATE | 0.92+ |
Italian | LOCATION | 0.91+ |
three | QUANTITY | 0.9+ |
this year | DATE | 0.86+ |
U S | LOCATION | 0.86+ |
BOS3 | TITLE | 0.86+ |
four key things | QUANTITY | 0.85+ |
over four | QUANTITY | 0.82+ |
re:Invent 2021 | EVENT | 0.77+ |
first | QUANTITY | 0.77+ |
years | DATE | 0.7+ |
Zafar | TITLE | 0.7+ |
MLN | TITLE | 0.67+ |
The Cube | TITLE | 0.65+ |
decades | QUANTITY | 0.65+ |
Ali Zafar AWS
(upbeat music) >> Welcome back to "The Cube's" continuous coverage of AWS re:Invent 2021. We're running one of the industry's most important and largest hybrid tech events of the year with AWS and its ecosystem partners. And of course, special thanks to AMD for supporting this year's editorial coverage at the event. We got two live sets. We had two remote sets, one in Boston and one in Palo Alto. We got more than 100 guests coming on the program, and we're looking deep into the next decade of cloud innovation. We're super excited to be joined by Ali Zafar, who's the senior director of platform strategy and operations at Dropbox. Ali, great to see you. Thanks for coming on. >> Awesome, it's a pleasure to be here with you, Dave. >> So hey, what's your day job like at Dropbox and what's your role? >> Got it. So I actually oversee the global supply chain at Dropbox. Also all of the capacity planning, which entails both our budget and also capacity requirements at Dropbox. And then I also focus on the platform product management side, which is basically building our build versus buy and our overall roadmap for our platform in the long run. >> Great, thank you. So, everybody knows Dropbox, but maybe you could talk a little bit about your business, your mission, and how that's evolved over the past several years. >> Got it. So Dropbox is a global collaboration platform and our mission at Dropbox is to help design a more enlightened way of working. Dropbox has over 700 million registered users and over 550 billion pieces of content. So taking a step back Dave, Dropbox helps with all of your content. Think of this as videos as music, even your tax returns, allows you to organize all of this content and then you can share this content with anybody at any time. You can also take Dropbox to work and actually it makes you even more productive in the workplace. Integrating all of your tools seamlessly, also allowing you to collaborate with all of your teams internally and also externally. >> Thank you, Ali. When Dropbox was founded, the cloud was really nascent, right? So it was early days, and so a lot has changed since the mid last decade. And of course with remote work and hybrid work, that's had to be a real tailwind to your business, but maybe you could explain your cloud and your hybrid cloud strategy. >> Got it, you're spot on Dave. So Dropbox has always been hybrid since its inception in 2007. And when I say hybrid, I mean we have our own on-prem infrastructure and then we also leverage public cloud. Now public cloud still to these days, remains absolutely critical for Dropbox to serve all of its customer needs. And when we talk about the decision between public or private, we think about three or four key things. One is the total cost of ownership, we look at go-to-market, we also look at our customer requirements and the latest technology that's available in the market. And then any international data storage requirements to make the decision of going towards public or private for that specific use case. >> So, I wonder if we could follow up on that. Maybe you could talk about the key business considerations as an SAS storage provider. What are the real drivers and in your business framework? >> Got it. At the end of the day, what really matters for us Dave, is to actually think about our customers and delight them. And what better than to focus on performance, reliability and also security, right? So we want to make sure that the infrastructure that we have today, allows Dropbox to actually solve for this specific use case for our customers. You know, what do they care about? While also doing this in a very efficient, managed way. So to summarize that, looking at performance, looking at liability, looking at scalability, looking at efficiency, and then also compliance. >> So that leads me to my next question, is about the EC2 instances that you use. I know you make heavy use of AMD compute. How did you come to that decision? Was it these factors? Was it all performance? How did you migrate to really enable that capability? How complex was that? >> Got it. So AMD has always been our key strategic partners partners. The partnership is well over four to five years right now, and we've been leveraging them on our on-prem infrastructure for compute. So we've always had AMD in our infrastructure and when the time came where AWS was also leveraging some of the AMD instances, we wanted to see how we can expand the partnership with AMD and AWS and also experiment with these instances. So we looked at some of the tooling updates that were required. We also looked at specific instances, which are either compute optimized and memory optimized instances, and then we actually built our footprint on AMD. And what we saw is that the overall performance improvement and also cost improvements that we got per specific workloads was actually extremely, was overall awesome results for Dropbox and our end customers. And we have been using them ever since. >> What kind of business impact did that make? Did it make a difference to your business that was noticeable? >> On the business side, I think primarily it was more on the TCO side, which is where we got most of the benefits on the cost side. And then also for some of our internal workloads, we also saw benefit to our internal developers that are using some of those workloads. >> So, you guys have become the poster child for hybrid and a lot has been talked about about you all, but I wonder if you could help us understand what part of your infrastructure is going to be better served by public cloud versus doing your own I T on-prem. What are some of the value drivers that are making you push workloads into the public cloud? Help us understand that better and squint through that. >> Got it, I get asked that question a lot. So public cloud in general allows for faster go-to-market. Think about this as like product launches, feature launches, also international expansion. It allows us to scale and then also leveraging some of the existing technologies out there in the market for some of the common workloads. So just taking a step back and thinking about Dropbox, we keep on evaluating also the criteria and then also specific workloads on what makes sense on private or public cloud. And AWS had some instances like S3 RDS and EC2 that when we started looking at, we knew that some of our key services like data platform, some parts of the MLN A I, and even paper platform would make more sense for us to actually leverage some of these in a public cloud for that. >> So what are the characteristics of the workload that are better suited to be an AWS? What's the ideal workload profile? You know, we talk about ideal customer profile. What's the ideal workload profile for the AWS cloud? >> Got it, so the way we think about it, at least, we call it the rule of three at Dropbox. And that means we look at scale first, we look at technology and innovation and what I mean by that is, is there a faster innovation in the public cloud and is that workload common enough that there's already a lot of work going on in public cloud, then there's no reason for us to actually innovate faster than that, we probably can't. And if the scale is not large enough. So when we talk about our storage side, like magic pocket, the scale is large enough where innovating there makes sense and it's better for the end customer. So we would probably go towards private cloud there. But then when we talk about international expansion, when we talk about faster go-to-market or some of the innovation in the MLN A I space, it really makes sense to use public cloud because of all of the advancements that we've seen there. >> So let me circle back to the business benefits and impact of the AMD based compute specifically, but you talked about TCO before, so there's certain things you mentioned on-prem you sometimes use. If the thing's hardened, you don't want to necessarily rip and replace it, but if you can accelerate go-to-market and you spin up things in the cloud, that makes sense. You mentioned customer requirements, so that just kind of depends. And then the international expansion and scale. So it comes down to those whatever four or five factors, right? The TCO, those other factors that I mentioned. Kind of the high level benefits, if you wouldn't mind summarizing for us, Ali. >> Yeah, I think you're spot on there. So there's looking at the overall TCO, right? The cost of serving the overall cloud, looking at go-to-market in general, can we leverage public cloud and go-to-market faster, obviously meeting our end customer requirements. We also looked at international expansion, any of the customer's data that is stored outside of the U S is all on public cloud for Dropbox. No plans in the short term to do something different there. And then also just looking at, like I mentioned, anything in the technology space that is ongoing, that we can leverage feature side or the product side for our customers, like A I or M L, we are going to leverage public cloud there. >> So of course you know, we've followed the progression of semiconductor technology for decades. This industry is marched to the cadence of performance improvements. What do the futures hold from a technology roadmap standpoint, and particularly as it relates to leveraging AMD EC2 instances, Ali? >> Got it. So Dropbox is in a very unique position where we actually leverage AMD both on-prem and for public cloud, leveraging some of the AWS EC2 instances like you mentioned. And EPYC processors from AMD are what we're using today, both on the hybrid infrastructure side and the performance. And also the TCO benefits are real and something that we are observing on a day-to-day basis. So we are going to be leveraging that technology even in the future, and the partnership with the AMD continues to be very, very strong for Dropbox. >> Well, Ali, I really appreciate you coming on "The Cube" as part of our coverage. It's great to have you. Love to have you back sometime. >> Awesome, thank you. And also just last thing we wanted to also call out that we are also going to be experimenting with probably Milan that is coming out. Rome is the current processors from AMD that we have been leveraging and as Milan comes available, we do want to continue to evaluate it and see how we can fit it in our infrastructure. >> So their are generations are city-based. Are they all Italian city-based, or are we going to run out of cities soon? (both laughing) >> Got it. Again, the partnership with both AWS and AMD is something that I'm very proud of. >> Excellent. >> Thank you so much. Thank you, Dave. >> Great to have you Ali, and really appreciate you watching. Keep it right there for more action on "The Cube", your leader in hybrid tech event coverage. (calm music)
SUMMARY :
and largest hybrid tech events of the year to be here with you, Dave. So I actually oversee the the past several years. and then you can share this content the cloud was really nascent, right? and the latest technology What are the real drivers and that the infrastructure is about the EC2 instances that you use. and then we actually built On the business side, I think primarily What are some of the value drivers for some of the common workloads. characteristics of the workload Got it, so the way we and impact of the AMD No plans in the short term to So of course you know, we've followed and the partnership with the AMD Love to have you back sometime. Rome is the current processors from AMD or are we going to run out of cities soon? Again, the partnership Thank you so much. Great to have you Ali, and
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
AMD | ORGANIZATION | 0.99+ |
Dave | PERSON | 0.99+ |
Ali Zafar | PERSON | 0.99+ |
Palo Alto | LOCATION | 0.99+ |
AWS | ORGANIZATION | 0.99+ |
Boston | LOCATION | 0.99+ |
2007 | DATE | 0.99+ |
Ali | PERSON | 0.99+ |
Dropbox | ORGANIZATION | 0.99+ |
one | QUANTITY | 0.99+ |
two remote sets | QUANTITY | 0.99+ |
two live sets | QUANTITY | 0.99+ |
five factors | QUANTITY | 0.99+ |
Milan | LOCATION | 0.99+ |
One | QUANTITY | 0.99+ |
more than 100 guests | QUANTITY | 0.99+ |
four | QUANTITY | 0.99+ |
mid last decade | DATE | 0.98+ |
Rome | LOCATION | 0.98+ |
both | QUANTITY | 0.97+ |
over 550 billion pieces | QUANTITY | 0.97+ |
EC2 | TITLE | 0.96+ |
five years | QUANTITY | 0.96+ |
over 700 million registered users | QUANTITY | 0.96+ |
S3 RDS | TITLE | 0.95+ |
TCO | ORGANIZATION | 0.93+ |
today | DATE | 0.92+ |
Italian | LOCATION | 0.91+ |
three | QUANTITY | 0.9+ |
this year | DATE | 0.86+ |
U S | LOCATION | 0.86+ |
four key things | QUANTITY | 0.85+ |
over four | QUANTITY | 0.82+ |
re:Invent 2021 | EVENT | 0.77+ |
first | QUANTITY | 0.77+ |
years | DATE | 0.7+ |
MLN | TITLE | 0.67+ |
decades | QUANTITY | 0.65+ |