Evan Touger, Prowess | Prowess Benchmark Testing Results for AMD EPYC Genoa on Dell Servers
(upbeat music) >> Welcome to theCUBE's continuing coverage of AMD's fourth generation EPYC launch. I've got a special guest with me today from Prowess Consulting. His name is Evan Touger, he's a senior technical writer with Prowess. Evan, welcome. >> Hi, great to be here. Thanks. >> So tell us a little bit about Prowess, what does Prowess do? >> Yeah, we're a consulting firm. We've been around for quite a few years, based in Bellevue, Washington. And we do quite a few projects with folks from Dell to a lot of other companies, and dive in. We have engineers, writers, production folks, so pretty much end-to-end work, doing research testing and writing, and diving into different technical topics. >> So you- in this case what we're going to be talking about is some validation studies that you've done, looking at Dell PowerEdge servers that happened to be integrating in fourth-gen EPYC processors from AMD. What were the specific workloads that you were focused on in this study? >> Yeah, this particular one was honing in on virtualization, right? You know, obviously it's pretty much ubiquitous in the industry, everybody works with virtualization in one way or another. So just getting optimal performance for virtualization was critical, or is critical for most businesses. So we just wanted to look a little deeper into, you know, how do companies evaluate that? What are they going to use to make the determination for virtualization performance as it relates to their workloads? So that led us to this study, where we looked at some benchmarks, and then went a little deeper under the hood to see what led to the results that we saw from those benchmarks. >> So when you say virtualization, does that include virtual desktop infrastructure or are we just talking about virtual machines in general? >> No, it can include both. We looked at VMs, thinking in terms of what about database performance when you're working in VMs, all the way through to VDI and companies like healthcare organizations and so forth, where it's common to roll out lots of virtual desktops, and performance is critical there as well. >> Okay, you alluded to, sort of, looking under the covers to see, you know, where these performance results were coming from. I assume what you're referencing is the idea that it's not just all about the CPU when you talk about a system. Am I correct in that assumption and- >> Yeah, absolutely. >> What can you tell us? >> Well, you know, for companies evaluating, there's quite a bit to consider, obviously. So they're looking at not just raw performance but power performance. So that was part of it, and then what makes up that- those factors, right? So certainly CPU is critical to that, but then other things come into play, like the RAID controllers. So we looked a little bit there. And then networking, of course can be critical for configurations that are relying on good performance on their networks, both in terms of bandwidth and just reducing latency overall. So interconnects as well would be a big part of that. So with, with PCIe gen 5 or 5.0 pick your moniker. You know in this- in the infrastructure game, we're often playing a game of whack-a-mole, looking for the bottlenecks, you know, chasing the bottlenecks. PCIe 5 opens up a lot of bandwidth for memory and things like RAID controllers and NICs. I mean, is the bottleneck now just our imagination, Evan, have we reached a point where there are no bottlenecks? What did you see when you ran these tests? What, you know, what were you able to stress to a point where it was saturated, if anything? >> Yeah. Well, first of all, we didn't- these are particular tests were ones that we looked at industry benchmarks, and we were examining in particular to see where world records were set. And so we uncovered a few specific servers, PowerEdge servers that were pretty key there, or had a lot of- were leading in the category in a lot of areas. So that's what led us to then, okay, well why is that? What's in these servers, and what's responsible for that? So in a lot of cases they, we saw these results even with, you know, gen 4, PCIe gen 4. So there were situations where clearly there was benefit from faster interconnects and, and especially NVMe for RAID, you know, for supporting NVMe and SSDs. But all of that just leads you to the understanding that it means it can only get better, right? So going from gen 4 to- if you're seeing great results on gen 4, then gen 5 is probably going to be, you know, blow that away. >> And in this case, >> It'll be even better. >> In this case, gen 5 you're referencing PCIe >> PCIe right. Yeah, that's right. >> (indistinct) >> And then the same thing with EPYC actually holds true, some of the records, we saw records set for both 3rd and 4th gen, so- with EPYC, so the same thing there. Anywhere there's a record set on the 3rd gen, you know, makes us really- we're really looking forward to going back and seeing over the next few months, which of those records fall and are broken by newer generation versions of these servers, once they actually wrap to the newer generation processors. You know, based on, on what we're seeing for the- for what those processors can do, not only in. >> (indistinct) Go ahead. >> Sorry, just want to say, not only in terms of raw performance, but as I mentioned before, the power performance, 'cause they're very efficient, and that's a really critical consideration, right? I don't think you can overstate that for companies who are looking at, you know, have to consider expenditures and power and cooling and meeting sustainability goals and so forth. So that was really an important category in terms of what we looked at, was that power performance, not just raw performance. >> Yeah, I want to get back to that, that's a really good point. We should probably give credit where credit is due. Which Dell PowerEdge servers are we talking about that were tested and what did those interconnect components look like from a (indistinct) perspective? >> Yeah, so we focused primarily on a couple benchmarks that seemed most important for real world performance results for virtualization. TPCx-V and VMmark 3.x. the TPCx-V, that's where we saw PowerEdge R7525, R7515. They both had top scores in different categories there. That benchmark is great for looking at database workloads in particular, right? Running in virtualization settings. And then the VMmark 3.x was critical. We saw good, good results there for the 7525 and the R 7515 as well as the R 6525, in that one and that included, sorry, just checking notes to see what- >> Yeah, no, no, no, no, (indistinct) >> Included results for power performance, as I mentioned earlier, that's where we could see that. So we kind of, we saw this in a range of servers that included both 3rd gen AMD EPYC and newer 4th gen as well as I mentioned. The RAID controllers were critical in the TPCx-V. I don't think that came into play in the VM mark test, but they were definitely part of the TPCx-V benchmarks. So that's where the RAID controllers would make a difference, right? And in those tests, I think they're using PERC 11. So, you know, the newer PERC 12 controllers there, again we'd expect >> (indistinct) >> To see continued, you know, gains in newer benchmarks. That's what we'll be looking for over the next several months. >> Yeah. So I think if I've got my Dell nomenclature down, performance, no no, PowerEdge RAID Controller, is that right? >> Exactly, yeah, there you go. Right? >> With Broadcom, you know, powered by Broadcom. >> That's right. There you go. Yeah. Isn't the Dell naming scheme there PERC? >> Yeah, exactly, exactly. Back to your comment about power. So you've had a chance to take a pretty deep look at the latest stuff coming out. You're confident that- 'cause some of these servers are going to be more expensive than previous generation. Now a server is not a server is not a server, but some are awakening to the idea that there might be some sticker shock. You're confident that the bang for your buck, the bang for your kilowatt hour is actually going to be beneficial. We're actually making things better, faster, stronger, cheaper, more energy efficient. We're continuing on that curve? >> That's what I would expect to see, right. I mean, of course can't speak to to pricing without knowing, you know, where the dollars are going to land on the servers. But I would expect to see that because you're getting gains in a couple of ways. I mean, one, if the performance increases to the point where you can run more VMs, right? Get more performance out of your VMs and run more total VMs or more BDIs, then there's obviously a good, you know, payback on your investment there. And then as we were discussing earlier, just the power performance ratio, right? So if you're bringing down your power and cooling costs, if these machines are just more efficient overall, then you should see some gains there as well. So, you know, I think the key is looking at what's the total cost of ownership over, you know, a standard like a three-year period or something and what you're going to get out of it for your number of sessions, the performance for the sessions, and the overall efficiency of the machines. >> So just just to be clear with these Dell PowerEdge servers, you were able to validate world record performance. But this isn't, if you, if you look at CPU architecture, PCIe bus architecture, memory, you know, the class of memory, the class of RAID controller, the class of NIC. Those were not all state of the art in terms of at least what has been recently announced. Correct? >> Right. >> Because (indistinct) the PCI 4.0, So to your point- world records with that, you've got next-gen RAID controllers coming out, and NICs coming out. If the motherboard was PCIe 5, with commensurate memory, all of those things are getting better. >> Exactly, right. I mean you're, you're really you're just eliminating bandwidth constraints latency constraints, you know, all of that should be improved. NVMe, you know, just collectively all these things just open the doors, you know, letting more bandwidth through reducing all the latency. Those are, those are all pieces of the puzzle, right? That come together and it's all about finding the weakest link and eliminating it. And I think we're reaching the point where we're removing the biggest constraints from the systems. >> Okay. So I guess is it fair to summarize to say that with this infrastructure that you tested, you were able to set world records. This, during this year, I mean, over the next several months, things are just going to get faster and faster and faster and faster. >> That's what I would anticipate, exactly, right. If they're setting world records with these machines before some of the components are, you know, the absolute latest, it seems to me we're going to just see a continuing trend there, and more and more records should fall. So I'm really looking forward to seeing how that goes, 'cause it's already good and I think the return on investment is pretty good there. So I think it's only going to get better as these roll out. >> So let me ask you a question that's a little bit off topic. >> Okay. >> Kind of, you know, we see these gains, you know, we're all familiar with Moore's Law, we're familiar with, you know, the advancements in memory and bus architecture and everything else. We just covered SuperCompute 2022 in Dallas a couple of weeks ago. And it was fascinating talking to people about advances in AI that will be possible with new architectures. You know, most of these supercomputers that are running right now are n minus 1 or n minus 2 infrastructure, you know, they're, they're, they're PCI 3, right. And maybe two generations of processors old, because you don't just throw out a 100,000 CPU super computing environment every 18 months. It doesn't work that way. >> Exactly. >> Do you have an opinion on this question of the qualitative versus quantitative increase in computing moving forward? And, I mean, do you think that this new stuff that you're starting to do tests on is going to power a fundamental shift in computing? Or is it just going to be more consolidation, better power consumption? Do you think there's an inflection point coming? What do you think? >> That's a great question. That's a hard one to answer. I mean, it's probably a little bit of both, 'cause certainly there will be better consolidation, right? But I think that, you know, the systems, it works both ways. It just allows you to do more with less, right? And you can go either direction, you can do what you're doing now on fewer machines, you know, and get better value for it, or reduce your footprint. Or you can go the other way and say, wow, this lets us add more machines into the mix and take our our level of performance from here to here, right? So it just depends on what your focus is. Certainly with, with areas like, you know, HPC and AI and ML, having the ability to expand what you already are capable of by adding more machines that can do more is going to be your main concern. But if you're more like a small to medium sized business and the opportunity to do what you were doing on, on a much smaller footprint and for lower costs, that's really your goal, right? So I think you can use this in either direction and it should, should pay back in a lot of dividends. >> Yeah. Thanks for your thoughts. It's an interesting subject moving forward. You know, sometimes it's easy to get lost in the minutiae of the bits and bites and bobs of all the components we're studying, but they're powering something that that's going to effect effectively all of humanity as we move forward. So what else do we need to consider when it comes to what you've just validated in the virtualization testing? Anything else, anything we left out? >> I think we hit all the key points, or most of them it's, you know, really, it's just keeping in mind that it's all about the full system, the components not- you know, the processor is a obviously a key, but just removing blockages, right? Freeing up, getting rid of latency, improving bandwidth, all these things come to play. And then the power performance, as I said, I know I keep coming back to that but you know, we just, and a lot of what we work on, we just see that businesses, that's a really big concern for businesses and finding efficiency, right? And especially in an age of constrained budgets, that's a big deal. So, it's really important to have that power performance ratio. And that's one of the key things we saw that stood out to us in, in some of these benchmarks, so. >> Well, it's a big deal for me. >> It's all good. >> Yeah, I live in California and I know exactly how much I pay for a kilowatt hour of electricity. >> I bet, yeah. >> My friends in other places don't even know. So I totally understand the power constraint question. >> Yeah, it's not going to get better, so, anything you can do there, right? >> Yeah. Well Evan, this has been great. Thanks for sharing the results that Prowess has come up with, third party validation that, you know, even without the latest and greatest components in all categories, Dell PowerEdge servers are able to set world records. And I anticipate that those world records will be broken in 2023 and I expect that Prowess will be part of that process, So Thanks for that. For the rest of us- >> (indistinct) >> Here at theCUBE, I want to thank you for joining us. Stay tuned for continuing coverage of AMD's fourth generation EPYC launch, for myself and for Evan Touger. Thanks so much for joining us. (upbeat music)
SUMMARY :
Welcome to theCUBE's Hi, great to be here. to a lot of other companies, and dive in. that you were focused on in this study? you know, how do companies evaluate that? all the way through to VDI looking under the covers to see, you know, you know, chasing the bottlenecks. But all of that just leads you Yeah, that's right. you know, makes us really- (indistinct) are looking at, you know, and what did those interconnect and the R 7515 as well as So, you know, the newer To see continued, you know, is that right? Exactly, yeah, there you go. With Broadcom, you There you go. the bang for your buck, to pricing without knowing, you know, PCIe bus architecture, memory, you know, So to your point- world records with that, just open the doors, you know, with this infrastructure that you tested, components are, you know, So let me ask you a question that's we're familiar with, you know, and the opportunity to do in the minutiae of the or most of them it's, you know, really, it's a big deal for me. for a kilowatt hour of electricity. So I totally understand the third party validation that, you know, I want to thank you for joining us.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Evan | PERSON | 0.99+ |
Evan Touger | PERSON | 0.99+ |
California | LOCATION | 0.99+ |
Dallas | LOCATION | 0.99+ |
Dell | ORGANIZATION | 0.99+ |
Prowess Consulting | ORGANIZATION | 0.99+ |
2023 | DATE | 0.99+ |
three-year | QUANTITY | 0.99+ |
AMD | ORGANIZATION | 0.99+ |
R 6525 | COMMERCIAL_ITEM | 0.99+ |
Broadcom | ORGANIZATION | 0.99+ |
3rd | QUANTITY | 0.99+ |
R 7515 | COMMERCIAL_ITEM | 0.99+ |
R7515 | COMMERCIAL_ITEM | 0.99+ |
both | QUANTITY | 0.99+ |
4th gen | QUANTITY | 0.99+ |
3rd gen | QUANTITY | 0.98+ |
both ways | QUANTITY | 0.98+ |
7525 | COMMERCIAL_ITEM | 0.98+ |
Prowess | ORGANIZATION | 0.98+ |
Bellevue, Washington | LOCATION | 0.98+ |
100,000 CPU | QUANTITY | 0.98+ |
PowerEdge | COMMERCIAL_ITEM | 0.97+ |
two generations | QUANTITY | 0.97+ |
one | QUANTITY | 0.96+ |
PCIe 5 | OTHER | 0.96+ |
today | DATE | 0.95+ |
theCUBE | ORGANIZATION | 0.94+ |
this year | DATE | 0.93+ |
PCI 4.0 | OTHER | 0.92+ |
TPCx-V | COMMERCIAL_ITEM | 0.92+ |
fourth-gen | QUANTITY | 0.92+ |
gen 5 | QUANTITY | 0.9+ |
Moore | ORGANIZATION | 0.89+ |
fourth generation | QUANTITY | 0.88+ |
gen 4 | QUANTITY | 0.87+ |
PCI 3 | OTHER | 0.87+ |
couple of weeks ago | DATE | 0.85+ |
SuperCompute 2022 | TITLE | 0.8+ |
PCIe gen 5 | OTHER | 0.79+ |
VMmark 3.x | COMMERCIAL_ITEM | 0.75+ |
minus | QUANTITY | 0.74+ |
one way | QUANTITY | 0.74+ |
18 months | QUANTITY | 0.7+ |
PERC 12 | COMMERCIAL_ITEM | 0.67+ |
5.0 | OTHER | 0.67+ |
EPYC | COMMERCIAL_ITEM | 0.65+ |
months | DATE | 0.64+ |
5 | QUANTITY | 0.63+ |
PERC 11 | COMMERCIAL_ITEM | 0.6+ |
next few months | DATE | 0.6+ |
first | QUANTITY | 0.59+ |
VMmark 3.x. | COMMERCIAL_ITEM | 0.55+ |
EPYC Genoa | COMMERCIAL_ITEM | 0.53+ |
gen | OTHER | 0.52+ |
R7525 | COMMERCIAL_ITEM | 0.52+ |
1 | QUANTITY | 0.5+ |
2 | QUANTITY | 0.47+ |
PowerEdge | ORGANIZATION | 0.47+ |
Aaron Suzuki, Prowess Labs | Does Hardware Matter?
>>Mm. Joining me is Aaron Suzuki, founder and CEO of Prowess. >>Aaron. Welcome. Thank you. Thanks so much for having me. >>Absolutely. Thanks for joining us. So let's dive right in. Tell us about prowess. >>Progress has been around for quite a while. We've been serving the technology industry from the very beginning, almost 20 years. We've always been able to bridge the gap between the story of the product and what it actually does. And a lot of times, there's a pretty fundamental disconnect between what engineering says and what marketing wants to claim. And so this is sort of how we got down this road of getting into testing and validation of products such as we do today quite quite extensively. And >>that's really what we're focusing on right now is this idea of your independence as a lab. And in this particular case, uh, it's a series of tests that you've done for, uh, you know, using Dell hardware combined with Broadcom cards. So talk a little more about that. About that the concept of independence and what that means. >>Yeah. You know, it's important to us that we stay vendor, agnostic, platform agnostic. Um, and there are a lot of things happening concurrently in the industry. A lot of people want to get a lot of work done really fast, and most customers are not sort of vendor exclusive. In fact, we're not sure we know of any. We always try to keep this objective point of view. That is to say that we don't allow our customers to buy results when we're doing quantitative testing. We really are out there trying to come up with a story or a narrative, and that really seemed to be The missing link in all of this is that there are the quantitative houses that do traditional benchmark testing on one side and then system integrators and kind of on the other extreme agencies. That would really do the narrative and the system integrator side build out a solution, but they wouldn't be able to tell you how it would perform. And so reconciling those two things really became challenging. So having a source that would be able to give you that insight that goes beyond just transactions, um, you know, per whatever unit of time and finding some of these metrics in between that we're more relevant to people's jobs. Where was really the inspiration for creating this unique practise that we call prowess? Labs? >>So, Erin, when I think about performance testing, uh, it's very easy to think of it from the perspective that it's a bunch of hardware slapped together in Iraq. You get some engineers, scientists to run some tests. Why prowess? What? What do you specifically bring to the table? That's meaningful? >>Performance testing is usually done in one of two ways, predominantly one way. It's a very academic approach, which says this specific benchmark test run this specific way gives us X. Another approach is more narrative in nature and more demonstrative. And there's this huge gap in between, and that's really what prowess labs exist to fulfil. >>So, Erin, give us an idea of the scale of prowess. How many of these projects have you worked on? How many how many customers have you worked with over >>over time? Um, we do this work with most of the leading global hardware and software manufacturers and a select number of emerging providers as well. So for us, you know, year to year dozens of projects of varying scope and scale. Various projects also running kind of programmatic form where we're kind of iterating constantly throughout the year. Um, so it's It's really a lot of fun for our team members to to do this. And some of them have been doing it for 10 or 12 years in continuity. >>Erin, Thanks for joining us to talk about practise today. >>My pleasure. Thanks for having me. Mhm. Yeah.
SUMMARY :
Thanks so much for having me. Thanks for joining us. And a lot of times, there's a pretty fundamental disconnect About that the concept of independence and what that means. and kind of on the other extreme agencies. from the perspective that it's a bunch of hardware slapped together in Iraq. And there's this huge gap in between, and that's really what prowess labs exist to fulfil. How many of these projects have you worked on? So for us, you know, year to year dozens Thanks for having me.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Erin | PERSON | 0.99+ |
10 | QUANTITY | 0.99+ |
Aaron Suzuki | PERSON | 0.99+ |
Aaron | PERSON | 0.99+ |
Iraq | LOCATION | 0.99+ |
12 years | QUANTITY | 0.99+ |
Dell | ORGANIZATION | 0.99+ |
Prowess | ORGANIZATION | 0.99+ |
two ways | QUANTITY | 0.99+ |
one way | QUANTITY | 0.98+ |
one | QUANTITY | 0.98+ |
two things | QUANTITY | 0.98+ |
prowess | ORGANIZATION | 0.97+ |
almost 20 years | QUANTITY | 0.97+ |
today | DATE | 0.97+ |
Prowess Labs | ORGANIZATION | 0.96+ |
one side | QUANTITY | 0.95+ |
dozens | QUANTITY | 0.87+ |