Latanya Sweeney, Harvard University | Women in Data Science (WiDS) 2018
>> Narrator: Live from Stanford University in Palo Alto, California. It's theCUBE. Covering Women in Data Science Conference 2018. Brought to you by Stanford. (upbeat music) >> Welcome back to theCUBE. We are live at Stanford University for the Third Annual Women in Data Science WiDS Conference. I'm Lisa Marten and we've had a great morning so far talking with a lot the speakers and participants at this event here at Stanford, which of course is going on globally as well. Very excited to be joined by one of the Keynotes this morning at WiDS, Latanya Sweeney, the Professor of Government and Technology from Harvard. Latanya, thank you so much for stopping by theCUBE. >> Well thank you for having me. >> Absolutely. So you are a computer scientist by training. WiDS as a mentioned is in its third year, they're expecting a 100,000 people to engage. There's a 177 I think, Margot said, regional WiDS events going on right now. In 53 countries. >> Isn't that amazing? >> It is! >> It's so exciting. >> Incredible in such a short period of time. What is it about WiDS that was attraction to you saying, "Yes, I want to participate in this event." >> Well one of the issues is just simply the idea the data science represents this sort of wave of change, of how do I analyze data? How do I make it different? And the conference itself celebrating the fact that women are taking the step, is hugely important. I mean, when I was a graduate student at MIT, I was the first black woman to get a PhD in Computer Science from MIT. And sort of, no women you really just didn't see women in this area at all. So when I come to a conference like WiDS, it's huge. It's just huge to see all these walls broken down. >> I love that walls breaking down, barriers kind of evaporating. In your time though at MIT, I'd love to understand a little bit more. Were you very conscience, "Hey I'm one of the very "few females here?" (Latanya laughs) Did it bother you or were you just, "You know what, "this is my passion, and I don't care. "I'm going to keep going forward." What was that experience like? >> Well, at first I was very naive, in a belief that you know all that really mattered was the work I did. And, I never had problems with the students, but I did have lots of problems with the professors, with this idea that you had to be like them in ways that was beyond your brain or your work, in order to really be exalted by them. And so, so whether I wanted to admit it, or whether I just wanted to ignore it, it just sort of came crashing down. >> Did you have mentors at that time, or did you think, "You know what, I'm not finding anybody "that I can really follow. "I've got to by my own mentor right now." >> Right, I mean I don't think my experience is really that uncommon for women in my generation. Very difficult to find mentors who would be complete mentors, complete see themselves in you and really try to exalt you and navigate you. What women often have found is that they can find a partial person here, and a partial person there. One who can help them in this regard, or that regard, but not the same kind of idea that you would be the superstar of one of these mentors. And it's not to take away from the fact that there have been these angels in my life, who made a big difference, and so I don't want to take away from that that somehow I did this all by myself. That's not true. >> So with the conference today, one of the things that Maria Klawe said in her welcome remarks was encouraging this generation, "Don't be worried if there's something "that you're not good at." So I loved how she was sort of encouraging people to sort of, women sort of, let go of maybe some of those preconceived notions that, "I can't do this. "I'm not good at that." I think that it's very liberating and still in 2018 with the fact there is such a diversity gap, it's still so needed. What were maybe some of the three takeaways, if you will, of your Keynote this morning that you imparted on the audience? >> Was that technology design is the new policy maker. That they're making policy, the design itself is making policy, but nobody's like monitoring it. But we could in fact use data science to monitor, to show the unforeseen consequences, and in the examples that we've done that, we've had big impact on the world. >> So share some of that with us, because that's your focus. You're in... What department in Harvard? You said government? >> So I sit in the government department. >> Unforeseen consequences of technology? >> Yes. >> Tell us about that. >> Well, you know, so in the Keynote, I talked about examples where technology is basically challenging every democratic value that we have. And sort of like no one's really aware, we kind of think about it here and there, but by doing simple data science experiments, we can quantify that. We can demonstrate it, and by doing that we shore up sort of those who can help us the most; the advocates, the regulators, and journalists. And so I gave examples from my own work and from the work of my students. >> Tell me a little bit about your students actually. Are they undergrads? Do you also have graduate students as well? >> I have both. >> You have both. >> Both. The talk was about, I teach a class called Data Science to Save the World, and we tackle three to four real world problems within the semester, that we solve. And then the students love to do their own independent projects, and at the end many of those go on to be published papers. >> Wow! I feel like you need to have a cape or some sort of superhero emblem. We can work on that later. But tell me about the diversity within the student body at Harvard in your classes. Are you finding, what's maybe the ratio of men to women, for example? >> Well you know many of the universities from my time have really changed. So when I was an undergraduate the typical classroom of Harvard undergrads would be all white men, or mostly all white men. >> Lisa: Sounds like a lot of STEM's still. (Latanya laughs) >> Yeah, but now if you walk into Harvard we see a lot more diversity within the university. I'm also a faculty dean at one of the residential houses, and so the diversity is huge. However, when you start getting into computer science, you start seeing, you don't see as much diversity. But in the Data Sciences of the World course, we get students from all over. They come from different backgrounds. They come in different colors, shapes, and sizes. Each with a skillset and a desire to learn how to have impact. >> I think that desire is key. How do you help them sort of build their own confidence in terms of, regardless of what color, flavor, you know my peer group is, I like this. I want to be in this. How do you help ignite that confidence within someone that's quite new into this? >> So if you're 20 something or almost 20, and you do something that a regulator changes their laws, or a newspaper article picks up, or you're on the Today Show, that pretty much changes the course of your life, and that's what we found with the students. That some of them have done just some remarkable work that's really been picked up and exalted, and it's stayed with them. It would change the direction in which they've gone. So what we do in the course, is we teach them that there's just so many problems that are low hanging, and how to spot a problem, an issue that they can solve, and how to solve it in a way that can be have impact. And that's really what the course focus is on. >> That impact is so important to just continue to fuel someones fire, and for that person to then be empowered to be able to ignite a fire under somebody else. I think one of the things that you mentioned sort of speaks to some of the things that we're seeing in these boundaries and lines are blurring. Not just so much even on from a gender perspective, but even career path A, B, C, D, now it's data is fueling the world. Every company is becoming a company because they have to be, right, to make consumer demands and just grow and be profitable as a business. But I also I like the parallel there that these rigid maybe, more rigid lines of careers are now opening up, because like you're saying, you can make impact being a data scientist. In every sector you can influence policy and wow, what a huge opportunity. It's almost like it's infinite, right? >> Yeah. I mean if you look at even the range of talks in the conference today, you get a great sense of not only new tools in different areas, but just the sheer spectrum of areas in which data science is playing. And that these women are already working it, already have the impact. >> So, speaking of the conference today, one of the things that I think is that we're hearing, is it's not just about inspiring, I think, Maria Klawe had said in theCUBE previous to today, that she found that young women in their first semester of university college courses, are probably like the right age and time in their lives to really ignite a spark, but I think there's also sort of a reinvigoration of the women that have been in technology and STEM fields for a while. Are you feeling and hearing kind of some of the same things from your peers and colleagues here? >> Definitely. We see it at the two levels. It's really important to try to get them in freshman year before they have a discipline defined for themselves, or how they see themselves. So that you can sort of ignite that spark and keep that spark alive. But then later women who, women or others, who are already in a field and looking for a way to sort of release and redefine themselves, data science is definitely giving them that opportunity. >> It really is. So what are some of the things that you're looking forward to for your career at Harvard as 2018 moves forward? >> Well, we, you know, the students we try to tackle the big problems. Election vulnerabilities has been a big one for us, on our agenda. The privacy of publicly available data is another big one that we've been working on. Well I think that's enough for awhile. (laughs) >> Lisa: That's pretty big. >> Yeah. >> I think so. >> Yeah, we'll get those done! >> Well that and you know, designing the logo for the t-shirt cause you definitely need to have a superpower t-shirt. So last question for you, if you could give young Latanya advice, when you were just starting out college, not knowing any of this was going to happen in terms of this movement that is WiDS and 2018, what would some of those key advice points for you, for your younger self be? >> To believe in yourself. To believe in yourself and that it's going to work out. One of the things that I grew to learn was how to turn lemons into lemonade, and that turns out to be very, very powerful, because it's a way to bounce back when you're faced with things that you can't control, that people are trying to put obstacles in your way, you just sort of find another way to keep going. And the world sort of bended towards me, so that was really cool. >> And also that failure is not a bad F word, right? (Latanya laughs) >> That's absolutely correct. >> It's part of a natural course and I think any leader and whatever and just you're in whatever, country whatever ethnicity, gender, everybody has I wouldn't even say missteps, it's just part of life, but I think... >> Yeah it's just part of the what... And Harvard like I said, I am the dean in one of the faculty houses, and one of the main things that we do each, throughout the year, is invite speakers and who're accomplished in whatever area they're in, but the one thing that they all have in common is they took this really roundabout way to get where they are. And a lot of that was because failures and blocks came in the way, and that's really important I think for young adults to really understand. >> I agree. Well, Latanya, thank you so much for carving out some time to stop by and chat with us on theCUBE. We are excited to have your wisdom shared to our audience and we wish you a great rest of the conference. >> Alright, thank you very much. >> We'll see you next time on theCUBE. >> Okay. >> We want to thank you for watching theCUBE. I'm Lisa Marten. We are live from the Third Annual Women in Data Science Conference at Stanford University. Stick around after this short break, I'll be back with my next guest. (upbeat music)
SUMMARY :
Brought to you by Stanford. Latanya, thank you so much for stopping by theCUBE. So you are a computer scientist by training. What is it about WiDS that was attraction to you saying, And sort of, no women you really just didn't Did it bother you or were you just, "You know what, in order to really be exalted by them. Did you have mentors at that time, or did you but not the same kind of idea that you would be the What were maybe some of the three takeaways, if you will, Was that technology design is the new policy maker. So share some of that with us, because that's your focus. and from the work of my students. Do you also have graduate students as well? And then the students love to do their own I feel like you need to have a cape Well you know many of the universities from my time Lisa: Sounds like a lot of STEM's still. But in the Data Sciences of the World course, How do you help ignite that confidence within someone that pretty much changes the course of your life, But I also I like the parallel there that these rigid in the conference today, you get a great sense sort of a reinvigoration of the women that have been So that you can sort of ignite that spark to for your career at Harvard as 2018 moves forward? Well, we, you know, the students Well that and you know, One of the things that I grew to learn was how to It's part of a natural course and I think And a lot of that was because failures and blocks We are excited to have your wisdom shared to our We want to thank you for watching theCUBE.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Lisa Marten | PERSON | 0.99+ |
Latanya | PERSON | 0.99+ |
Margot | PERSON | 0.99+ |
Latanya Sweeney | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
Maria Klawe | PERSON | 0.99+ |
2018 | DATE | 0.99+ |
20 | QUANTITY | 0.99+ |
Both | QUANTITY | 0.99+ |
three | QUANTITY | 0.99+ |
both | QUANTITY | 0.99+ |
three takeaways | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
first semester | QUANTITY | 0.99+ |
100,000 people | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
today | DATE | 0.99+ |
one | QUANTITY | 0.98+ |
Harvard University | ORGANIZATION | 0.98+ |
WiDS | EVENT | 0.98+ |
two levels | QUANTITY | 0.98+ |
53 countries | QUANTITY | 0.98+ |
Each | QUANTITY | 0.98+ |
third year | QUANTITY | 0.98+ |
MIT | ORGANIZATION | 0.97+ |
four | QUANTITY | 0.97+ |
Stanford | LOCATION | 0.97+ |
Third Annual Women in Data Science WiDS Conference | EVENT | 0.97+ |
Today Show | TITLE | 0.97+ |
Stanford | ORGANIZATION | 0.97+ |
Harvard | ORGANIZATION | 0.96+ |
Third Annual Women in Data Science Conference | EVENT | 0.96+ |
One | QUANTITY | 0.95+ |
one thing | QUANTITY | 0.95+ |
each | QUANTITY | 0.94+ |
Stanford University | ORGANIZATION | 0.93+ |
Covering Women in Data Science Conference 2018 | EVENT | 0.92+ |
theCUBE | ORGANIZATION | 0.91+ |
177 | QUANTITY | 0.89+ |
Women in Data Science | ORGANIZATION | 0.89+ |
this morning | DATE | 0.89+ |
Data Science to Save the World | TITLE | 0.87+ |
Narrator | TITLE | 0.81+ |
Harvard | LOCATION | 0.77+ |
one of | QUANTITY | 0.74+ |
Professor of Government and Technology | PERSON | 0.69+ |
almost | QUANTITY | 0.66+ |
black | OTHER | 0.63+ |
Stanford University | LOCATION | 0.6+ |
Keynote | TITLE | 0.57+ |
world | QUANTITY | 0.5+ |
WiDS | ORGANIZATION | 0.49+ |
theCUBE | TITLE | 0.46+ |
Shir Meir Lador, Intuit | WiDS 2023
(gentle upbeat music) >> Hey, friends of theCUBE. It's Lisa Martin live at Stanford University covering the Eighth Annual Women In Data Science. But you've been a Cube fan for a long time. So you know that we've been here since the beginning of WiDS, which is 2015. We always loved to come and cover this event. We learned great things about data science, about women leaders, underrepresented minorities. And this year we have a special component. We've got two grad students from Stanford's Master's program and Data Journalism joining. One of my them is here with me, Hannah Freitag, my co-host. Great to have you. And we are pleased to welcome from Intuit for the first time, Shir Meir Lador Group Manager at Data Science. Shir, it's great to have you. Thank you for joining us. >> Thank you for having me. >> And I was just secrets girl talking with my boss of theCUBE who informed me that you're in great company. Intuit's Chief Technology Officer, Marianna Tessel is an alumni of theCUBE. She was on at our Supercloud event in January. So welcome back into it. >> Thank you very much. We're happy to be with you. >> Tell us a little bit about what you're doing. You're a data science group manager as I mentioned, but also you've had you've done some cool things I want to share with the audience. You're the co-founder of the PyData Tel Aviv Meetups the co-host of the unsupervised podcast about data science in Israel. You give talks, about machine learning, about data science. Tell us a little bit about your background. Were you always interested in STEM studies from the time you were small? >> So I was always interested in mathematics when I was small, I went to this special program for youth going to university. So I did my test in mathematics earlier and studied in university some courses. And that's when I understood I want to do something in that field. And then when I got to go to university, I went to electrical engineering when I found out about algorithms and how interested it is to be able to find solutions to problems, to difficult problems with math. And this is how I found my way into machine learning. >> Very cool. There's so much, we love talking about machine learning and AI on theCUBE. There's so much potential. Of course, we have to have data. One of the things that I love about WiDS and Hannah and I and our co-host Tracy, have been talking about this all day is the impact of data in everyone's life. If you break it down, I was at Mobile World Congress last week, all about connectivity telecom, and of course we have these expectation that we're going to be connected 24/7 from wherever we are in the world and we can do whatever we want. I can do an Uber transaction, I can watch Netflix, I can do a bank transaction. It all is powered by data. And data science is, some of the great applications of it is what it's being applied to. Things like climate change or police violence or health inequities. Talk about some of the data science projects that you're working on at Intuit. I'm an intuit user myself, but talk to me about some of those things. Give the audience really a feel for what you're doing. >> So if you are a Intuit product user, you probably use TurboTax. >> I do >> In the past. So for those who are not familiar, TurboTax help customers submit their taxes. Basically my group is in charge of getting all the information automatically from your documents, the documents that you upload to TurboTax. We extract that information to accelerate your tax submission to make it less work for our customers. So- >> Thank you. >> Yeah, and this is why I'm so proud to be working at this team because our focus is really to help our customers to simplify all the you know, financial heavy lifting with taxes and also with small businesses. We also do a lot of work in extracting information from small business documents like bill, receipts, different bank statements. Yeah, so this is really exciting for me, the opportunity to work to apply data science and machine learning to solution that actually help people. Yeah >> Yeah, in the past years there have been more and more digital products emerging that needs some sort of data security. And how did your team, or has your team developed in the past years with more and more products or companies offering digital services? >> Yeah, so can you clarify the question again? Sorry. >> Yeah, have you seen that you have more customers? Like has your team expanded in the past years with more digital companies starting that need kind of data security? >> Well, definitely. I think, you know, since I joined Intuit, I joined like five and a half years ago back when I was in Tel Aviv. I recently moved to the Bay Area. So when I joined, there were like a dozens of data scientists and machine learning engineers on Intuit. And now there are a few hundreds. So we've definitely grown with the year and there are so many new places we can apply machine learning to help our customers. So this is amazing, so much we can do with machine learning to get more money in the pocket of our customers and make them do less work. >> I like both of those. More money in my pocket and less work. That's awesome. >> Exactly. >> So keep going Intuit. But one of the things that is so cool is just the the abstraction of the complexity that Intuit's doing. I upload documents or it scans my receipts. I was just in Barcelona last week all these receipts and conversion euros to dollars and it takes that complexity away from the end user who doesn't know all that's going on in the background, but you're making people's lives simpler. Unfortunately, we all have to pay taxes, most of us should. And of course we're in tax season right now. And so it's really cool what you're doing with ML and data science to make fundamental processes to people's lives easier and just a little bit less complicated. >> Definitely. And I think that's what's also really amazing about Intuit it, is how it combines human in the loop as well as AI. Because in some of the tax situation it's very complicated maybe to do it yourself. And then there's an option to work with an expert online that goes on a video with you and helps you do your taxes. And the expert's work is also accelerated by AI because we build tools for those experts to do the work more efficiently. >> And that's what it's all about is you know, using data to be more efficient, to be faster, to be smarter, but also to make complicated processes in our daily lives, in our business lives just a little bit easier. One of the things I've been geeking out about recently is ChatGPT. I was using it yesterday. I was telling everyone I was asking it what's hot in data science and I didn't know would it know what hot is and it did, it gave me trends. But one of the things that I was so, and Hannah knows I've been telling this all day, I was so excited to learn over the weekend that the the CTO of OpenAI is a female. I didn't know that. And I thought why are we not putting her on a pedestal? Because people are likening ChatGPT to like the launch of the iPhone. I mean revolutionary. And here we have what I think is exciting for all of us females, whether you're in tech or not, is another role model. Because really ultimately what WiDS is great at doing is showcasing women in technical roles. Because I always say you can't be what you can't see. We need to be able to see more role models, female role role models, underrepresented minorities of course men, because a lot of my sponsors and mentors are men, but we need more women that we can look up to and see ah, she's doing this, why can't I? Talk to me about how you stay the course in data science. What excites you about the potential, the opportunities based on what you've already accomplished what inspires you to continue and be one of those females that we say oh my God, I could be like Shir. >> I think that what inspires me the most is the endless opportunities that we have. I think we haven't even started tapping into everything that we can do with generative AI, for example. There's so much that can be done to further help you know, people make more money and do less work because there's still so much work that we do that we don't need to. You know, this is with Intuit, but also there are so many other use cases like I heard today you know, with the talk about the police. So that was really exciting how you can apply machine learning and data to actually help people, to help people that been through wrongful things. So I was really moved by that. And I'm also really excited about all the medical applications that we can have with data. >> Yeah, yeah. It's true that data science is so diverse in terms of what fields it can cover but it's equally important to have diverse teams and have like equity and inclusion in your teams. Where is Intuit at promoting women, non-binary minorities in your teams to progress data science? >> Yeah, so I have so much to say on this. >> Good. >> But in my work in Tel Aviv, I had the opportunity to start with Intuit women in data science branch in Tel Aviv. So that's why I'm super excited to be here today for that because basically this is the original conference, but as you know, there are branches all over the world and I got the opportunity to lead the Tel Aviv branch with Israel since 2018. And we've been through already this year it's going to be it's next week, it's going to be the sixth conference. And every year our number of submission to make talk in the conference doubled itself. >> Nice. >> We started with 20 submission, then 50, then 100. This year we have over 200 submissions of females to give talk at the conference. >> Ah, that's fantastic. >> And beyond the fact that there's so much traction, I also feel the great impact it has on the community in Israel because one of the reason we started WiDS was that when I was going to conferences I was seeing so little women on stage in all the technical conferences. You know, kind of the reason why I guess you know, Margaret and team started the WiDS conference. So I saw the same thing in Israel and I was always frustrated. I was organizing PyData Meetups as you mentioned and I was always having such a hard time to get female speakers to talk. I was trying to role model, but that's not enough, you know. We need more. So once we started WiDS and people saw you know, so many examples on the stage and also you know females got opportunity to talk in a place for that. Then it also started spreading and you can see more and more female speakers across other conferences, which are not women in data science. So I think just the fact that Intuits started this conference back in Israel and also in Bangalore and also the support Intuit does for WiDS in Stanford here, it shows how much WiDS values are aligned with our values. Yeah, and I think that to chauffeur that I think we have over 35% females in the data science and machine learning engineering roles, which is pretty amazing I think compared to the industry. >> Way above average. Yeah, absolutely. I was just, we've been talking about some of the AnitaB.org stats from 2022 showing that 'cause usually if we look at the industry to you point, over the last, I don't know, probably five, 10 years we're seeing the number of female technologists around like a quarter, 25% or so. 2022 data from AnitaB.org showed that that number is now 27.6%. So it's very slowly- >> It's very slowly increasing. >> Going in the right direction. >> Too slow. >> And that representation of women technologists increase at every level, except intern, which I thought was really interesting. And I wonder is there a covid relation there? >> I don't know. >> What do we need to do to start opening up the the top of the pipeline, the funnel to go downstream to find kids like you when you were younger and always interested in engineering and things like that. But the good news is that the hiring we've seen improvements, but it sounds like Intuit is way ahead of the curve there with 35% women in data science or technical roles. And what's always nice and refreshing that we've talked, Hannah about this too is seeing companies actually put action into initiatives. It's one thing for a company to say we're going to have you know, 50% females in our organization by 2030. It's a whole other ball game to actually create a strategy, execute on it, and share progress. So kudos to Intuit for what it's doing because that is more companies need to adopt that same sort of philosophy. And that's really cultural. >> Yeah. >> At an organization and culture can be hard to change, but it sounds like you guys kind of have it dialed in. >> I think we definitely do. That's why I really like working and Intuit. And I think that a lot of it is with the role modeling, diversity and inclusion, and by having women leaders. When you see a woman in leadership position, as a woman it makes you want to come work at this place. And as an evidence, when I build the team I started in Israel at Intuit, I have over 50% women in my team. >> Nice. >> Yeah, because when you have a woman in the interviewers panel, it's much easier, it's more inclusive. That's why we always try to have at least you know, one woman and also other minorities represented in our interviews panel. Yeah, and I think that in general it's very important as a leader to kind of know your own biases and trying to have defined standard and rubrics in how you evaluate people to avoid for those biases. So all of that inclusiveness and leadership really helps to get more diversity in your teams. >> It's critical. That thought diversity is so critical, especially if we talk about AI and we're almost out of time, I just wanted to bring up, you brought up a great point about the diversity and equity. With respect to data science and AI, we know in AI there's biases in data. We need to have more inclusivity, more representation to help start shifting that so the biases start to be dialed down and I think a conference like WiDS and it sounds like someone like you and what you've already done so far in the work that you're doing having so many females raise their hands to want to do talks at events is a good situation. It's a good scenario and hopefully it will continue to move the needle on the percentage of females in technical roles. So we thank you Shir for your time sharing with us your story, what you're doing, how Intuit and WiDS are working together. It sounds like there's great alignment there and I think we're at the tip of the iceberg with what we can do with data science and inclusion and equity. So we appreciate all of your insights and your time. >> Thank you very much. >> All right. >> I enjoyed very, very much >> Good. We hope, we aim to please. Thank you for our guests and for Hannah Freitag. This is Lisa Martin coming to you live from Stanford University. This is our coverage of the eighth Annual Women in Data Science Conference. Stick around, next guest will be here in just a minute.
SUMMARY :
Shir, it's great to have you. And I was just secrets girl talking We're happy to be with you. from the time you were small? and how interested it is to be able and of course we have these expectation So if you are a Intuit product user, the documents that you upload to TurboTax. the opportunity to work Yeah, in the past years Yeah, so can you I recently moved to the Bay Area. I like both of those. and data science to make and helps you do your taxes. Talk to me about how you stay done to further help you know, to have diverse teams I had the opportunity to start of females to give talk at the conference. Yeah, and I think that to chauffeur that the industry to you point, And I wonder is there the funnel to go downstream but it sounds like you guys I build the team I started to have at least you know, so the biases start to be dialed down This is Lisa Martin coming to you live
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Hannah Freitag | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Marianna Tessel | PERSON | 0.99+ |
Israel | LOCATION | 0.99+ |
Bangalore | LOCATION | 0.99+ |
27.6% | QUANTITY | 0.99+ |
iPhone | COMMERCIAL_ITEM | 0.99+ |
Margaret | PERSON | 0.99+ |
Shir Meir Lador | PERSON | 0.99+ |
Hannah | PERSON | 0.99+ |
Bay Area | LOCATION | 0.99+ |
Intuit | ORGANIZATION | 0.99+ |
Tel Aviv | LOCATION | 0.99+ |
last week | DATE | 0.99+ |
Uber | ORGANIZATION | 0.99+ |
Barcelona | LOCATION | 0.99+ |
January | DATE | 0.99+ |
Shir | PERSON | 0.99+ |
20 submission | QUANTITY | 0.99+ |
50 | QUANTITY | 0.99+ |
Tracy | PERSON | 0.99+ |
2030 | DATE | 0.99+ |
100 | QUANTITY | 0.99+ |
35% | QUANTITY | 0.99+ |
50% | QUANTITY | 0.99+ |
yesterday | DATE | 0.99+ |
2015 | DATE | 0.99+ |
five | QUANTITY | 0.99+ |
this year | DATE | 0.99+ |
next week | DATE | 0.99+ |
both | QUANTITY | 0.99+ |
2022 | DATE | 0.99+ |
sixth conference | QUANTITY | 0.99+ |
Intuits | ORGANIZATION | 0.99+ |
today | DATE | 0.99+ |
OpenAI | ORGANIZATION | 0.99+ |
This year | DATE | 0.99+ |
Stanford | ORGANIZATION | 0.98+ |
one | QUANTITY | 0.98+ |
WiDS | EVENT | 0.98+ |
2018 | DATE | 0.98+ |
over 200 submissions | QUANTITY | 0.98+ |
Eighth Annual Women In Data Science | EVENT | 0.98+ |
eighth Annual Women in Data Science Conference | EVENT | 0.98+ |
theCUBE | ORGANIZATION | 0.98+ |
TurboTax | TITLE | 0.98+ |
One | QUANTITY | 0.98+ |
over 50% | QUANTITY | 0.98+ |
over 35% | QUANTITY | 0.97+ |
five and a half years ago back | DATE | 0.97+ |
Stanford University | ORGANIZATION | 0.97+ |
first time | QUANTITY | 0.97+ |
Netflix | ORGANIZATION | 0.96+ |
one woman | QUANTITY | 0.96+ |
Mobile World Congress | EVENT | 0.94+ |
one thing | QUANTITY | 0.94+ |
AnitaB.org | ORGANIZATION | 0.93+ |
25% | QUANTITY | 0.92+ |
PyData Meetups | EVENT | 0.9+ |
Breaking Analysis: H1 of ‘22 was ugly…H2 could be worse Here’s why we’re still optimistic
>> From theCUBE Studios in Palo Alto in Boston, bringing you data driven insights from theCUBE and ETR. This is Breaking Analysis with Dave Vellante. >> After a two-year epic run in tech, 2022 has been an epically bad year. Through yesterday, The NASDAQ composite is down 30%. The S$P 500 is off 21%. And the Dow Jones Industrial average 16% down. And the poor holders at Bitcoin have had to endure a nearly 60% decline year to date. But judging by the attendance and enthusiasm, in major in-person tech events this spring. You'd never know that tech was in the tank. Moreover, walking around the streets of Las Vegas, where most tech conferences are held these days. One can't help but notice that the good folks of Main Street, don't seem the least bit concerned that the economy is headed for a recession. Hello, and welcome to this weeks Wiki Bond Cube Insights powered by ETR. In this Breaking Analysis we'll share our main takeaways from the first half of 2022. And talk about the outlook for tech going forward, and why despite some pretty concerning headwinds we remain sanguine about tech generally, but especially enterprise tech. Look, here's the bumper sticker on why many folks are really bearish at the moment. Of course, inflation is high, other than last year, the previous inflation high this century was in July of 2008, it was 5.6%. Inflation has proven to be very, very hard to tame. You got gas at $7 dollars a gallon. Energy prices they're not going to suddenly drop. Interest rates are climbing, which will eventually damage housing. Going to have that ripple effect, no doubt. We're seeing layoffs at companies like Tesla and the crypto names are also trimming staff. Workers, however are still in short supply. So wages are going up. Companies in retail are really struggling with the right inventory, and they can't even accurately guide on their earnings. We've seen a version of this movie before. Now, as it pertains to tech, Crawford Del Prete, who's the CEO of IDC explained this on theCUBE this very week. And I thought he did a really good job. He said the following, >> Matt, you have a great statistic that 80% of companies used COVID as their point to pivot into digital transformation. And to invest in a different way. And so what we saw now is that tech is now where I think companies need to focus. They need to invest in tech. They need to make people more productive with tech and it played out in the numbers. Now so this year what's fascinating is we're looking at two vastly different markets. We got gasoline at $7 a gallon. We've got that affecting food prices. Interesting fun fact recently it now costs over $1,000 to fill an 18 wheeler. All right, based on, I mean, this just kind of can't continue. So you think about it. >> Don't put the boat in the water. >> Yeah, yeah, yeah. Good luck if ya, yeah exactly. So a family has kind of this bag of money, and that bag of money goes up by maybe three, 4% every year, depending upon earnings. So that is sort of sloshing around. So if food and fuel and rent is taking up more, gadgets and consumer tech are not, you're going to use that iPhone a little longer. You're going to use that Android phone a little longer. You're going to use that TV a little longer. So consumer tech is getting crushed, really it's very, very, and you saw it immediately in ad spending. You've seen it in Meta, you've seen it in Facebook. Consumer tech is doing very, very, it is tough. Enterprise tech, we haven't been in the office for two and a half years. We haven't upgraded whether that be campus wifi, whether that be servers, whether that be commercial PCs as much as we would have. So enterprise tech, we're seeing double digit order rates. We're seeing strong, strong demand. We have combined that with a component shortage, and you're seeing some enterprise companies with a quarter of backlog, I mean that's really unheard of. >> And higher prices, which also profit. >> And therefore that drives up the prices. >> And this is a theme that we've heard this year at major tech events, they've really come roaring back. Last year, theCUBE had a huge presence at AWS Reinvent. The first Reinvent since 2019, it was really well attended. Now this was before the effects of the omicron variant, before they were really well understood. And in the first quarter of 2022, things were pretty quiet as far as tech events go But theCUBE'a been really busy this spring and early into the summer. We did 12 physical events as we're showing here in the slide. Coupa, did Women in Data Science at Stanford, Coupa Inspire was in Las Vegas. Now these are both smaller events, but they were well attended and beat expectations. San Francisco Summit, the AWS San Francisco Summit was a bit off, frankly 'cause of the COVID concerns. They were on the rise, then we hit Dell Tech World which was packed, it had probably around 7,000 attendees. Now Dockercon was virtual, but we decided to include it here because it was a huge global event with watch parties and many, many tens of thousands of people attending. Now the Red Hat Summit was really interesting. The choice that Red Hat made this year. It was purposefully scaled down and turned into a smaller VIP event in Boston at the Western, a couple thousand people only. It was very intimate with a much larger virtual presence. VeeamON was very well attended, not as large as previous VeeamON events, but again beat expectations. KubeCon and Cloud Native Con was really successful in Spain, Valencia, Spain. PagerDuty Summit was again a smaller intimate event in San Francisco. And then MongoDB World was at the new Javits Center and really well attended over the three day period. There were lots of developers there, lots of business people, lots of ecosystem partners. And then the Snowflake summit in Las Vegas, it was the most vibrant from the standpoint of the ecosystem with nearly 10,000 attendees. And I'll come back to that in a moment. Amazon re:Mars is the Amazon AI robotic event, it's smaller but very, very cool, a lot of innovation. And just last week we were at HPE Discover. They had around 8,000 people attending which was really good. Now I've been to over a dozen HPE or HPE Discover events, within Europe and the United States over the past decade. And this was by far the most vibrant, lot of action. HPE had a little spring in its step because the company's much more focused now but people was really well attended and people were excited to be there, not only to be back at physical events, but also to hear about some of the new innovations that are coming and HPE has a long way to go in terms of building out that ecosystem, but it's starting to form. So we saw that last week. So tech events are back, but they are smaller. And of course now a virtual overlay, they're hybrid. And just to give you some context, theCUBE did, as I said 12 physical events in the first half of 2022. Just to compare that in 2019, through June of that year we had done 35 physical events. Yeah, 35. And what's perhaps more interesting is we had our largest first half ever in our 12 year history because we're doing so much hybrid and virtual to compliment the physical. So that's the new format is CUBE plus digital or sometimes just digital but that's really what's happening in our business. So I think it's a reflection of what's happening in the broader tech community. So everyone's still trying to figure that out but it's clear that events are back and there's no replacing face to face. Or as I like to say, belly to belly, because deals are done at physical events. All these events we've been to, the sales people are so excited. They're saying we're closing business. Pipelines coming out of these events are much stronger, than they are out of the virtual events but the post virtual event continues to deliver that long tail effect. So that's not going to go away. The bottom line is hybrid is the new model. Okay let's look at some of the big themes that we've taken away from the first half of 2022. Now of course, this is all happening under the umbrella of digital transformation. I'm not going to talk about that too much, you've had plenty of DX Kool-Aid injected into your veins over the last 27 months. But one of the first observations I'll share is that the so-called big data ecosystem that was forming during the hoop and around, the hadoop infrastructure days and years. then remember it dispersed, right when the cloud came in and kind of you know, not wiped out but definitely dampened the hadoop enthusiasm for on-prem, the ecosystem dispersed, but now it's reforming. There are large pockets that are obviously seen in the various clouds. And we definitely see a ecosystem forming around MongoDB and the open source community gathering in the data bricks ecosystem. But the most notable momentum is within the Snowflake ecosystem. Snowflake is moving fast to win the day in the data ecosystem. They're providing a single platform that's bringing different data types together. Live data from systems of record, systems of engagement together with so-called systems of insight. These are converging and while others notably, Oracle are architecting for this new reality, Snowflake is leading with the ecosystem momentum and a new stack is emerging that comprises cloud infrastructure at the bottom layer. Data PaaS layer for app dev and is enabling an ecosystem of partners to build data products and data services that can be monetized. That's the key, that's the top of the stack. So let's dig into that further in a moment but you're seeing machine intelligence and data being driven into applications and the data and application stacks they're coming together to support the acceleration of physical into digital. It's happening right before our eyes in every industry. We're also seeing the evolution of cloud. It started with the SaaS-ification of the enterprise where organizations realized that they didn't have to run their own software on-prem and it made sense to move to SaaS for CRM or HR, certainly email and collaboration and certain parts of ERP and early IS was really about getting out of the data center infrastructure management business called that cloud 1.0, and then 2.0 was really about changing the operating model. And now we're seeing that operating model spill into on-prem workloads finally. We're talking about here about initiatives like HPE's Green Lake, which we heard a lot about last week at Discover and Dell's Apex, which we heard about in May, in Las Vegas. John Furrier had a really interesting observation that basically this is HPE's and Dell's version of outposts. And I found that interesting because outpost was kind of a wake up call in 2018 and a shot across the bow at the legacy enterprise infrastructure players. And they initially responded with these flexible financial schemes, but finally we're seeing real platforms emerge. Again, we saw this at Discover and at Dell Tech World, early implementations of the cloud operating model on-prem. I mean, honestly, you're seeing things like consoles and billing, similar to AWS circa 2014, but players like Dell and HPE they have a distinct advantage with respect to their customer bases, their service organizations, their very large portfolios, especially in the case of Dell and the fact that they have more mature stacks and knowhow to run mission critical enterprise applications on-prem. So John's comment was quite interesting that these firms are basically building their own version of outposts. Outposts obviously came into their wheelhouse and now they've finally responded. And this is setting up cloud 3.0 or Supercloud, as we like to call it, an abstraction layer, that sits above the clouds that serves as a unifying experience across a continuum of on-prem across clouds, whether it's AWS, Azure, or Google. And out to both the near and far edge, near edge being a Lowes or a Home Depot, but far edge could be space. And that edge again is fragmented. You've got the examples like the retail stores at the near edge. Outer space maybe is the far edge and IOT devices is perhaps the tiny edge. No one really knows how the tiny edge is going to play out but it's pretty clear that it's not going to comprise traditional X86 systems with a cool name tossed out to the edge. Rather, it's likely going to require a new low cost, low power, high performance architecture, most likely RM based that will enable things like realtime AI inferencing at that edge. Now we've talked about this a lot on Breaking Analysis, so I'm not going to double click on it. But suffice to say that it's very possible that new innovations are going to emerge from the tiny edge that could really disrupt the enterprise in terms of price performance. Okay, two other quick observations. One is that data protection is becoming a much closer cohort to the security stack where data immutability and air gaps and fast recovery are increasingly becoming a fundamental component of the security strategy to combat ransomware and recover from other potential hacks or disasters. And I got to say from our observation, Veeam is leading the pack here. It's now claiming the number one revenue spot in a statistical dead heat with the Dell's data protection business. That's according to Veeam, according to IDC. And so that space continues to be of interest. And finally, Broadcom's acquisition of Dell. It's going to have ripple effects throughout the enterprise technology business. And there of course, there are a lot of questions that remain, but the one other thing that John Furrier and I were discussing last night John looked at me and said, "Dave imagine if VMware runs better on Broadcom components and OEMs that use Broadcom run VMware better, maybe Broadcom doesn't even have to raise prices on on VMware licenses. Maybe they'll just raise prices on the OEMs and let them raise prices to the end customer." Interesting thought, I think because Broadcom is so P&L focused that it's probably not going to be the prevailing model but we'll see what happens to some of the strategic projects rather like Monterey and Capitola and Thunder. We've talked a lot about project Monterey, the others we'll see if they can make the cut. That's one of the big concerns because it's how OEMs like the ones that are building their versions of outposts are going to compete with the cloud vendors, namely AWS in the future. I want to come back to the comment on the data stack for a moment that we were talking about earlier, we talked about how the big data ecosystem that was once coalescing around hadoop dispersed. Well, the data value chain is reforming and we think it looks something like this picture, where cloud infrastructure lives at the bottom. We've said many times the cloud is expanding and evolving. And if companies like Dell and HPE can truly build a super cloud infrastructure experience then they will be in a position to capture more of the data value. If not, then it's going to go to the cloud players. And there's a live data layer that is increasingly being converged into platforms that not only simplify the movement in ELTing of data but also allow organizations to compress the time to value. Now there's a layer above that, we sometimes call it the super PaaS layer if you will, that must comprise open source tooling, partners are going to write applications and leverage platform APIs and build data products and services that can be monetized at the top of the stack. So when you observe the battle for the data future it's unlikely that any one company is going to be able to do this all on their own, which is why I often joke that the 2020s version of a sweaty Steve Bomber running around the stage, screaming, developers, developers developers, and getting the whole audience into it is now about ecosystem ecosystem ecosystem. Because when you need to fill gaps and accelerate features and provide optionality a list of capabilities on the left hand side of this chart, that's going to come from a variety of different companies and places, we're talking about catalogs and AI tools and data science capabilities, data quality, governance tools and it should be of no surprise to followers of Breaking Analysis that on the right hand side of this chart we're including the four principles of data mesh, which of course were popularized by Zhamak Dehghani. So decentralized data ownership, data as products, self-serve platform and automated or computational governance. Now whether this vision becomes a reality via a proprietary platform like Snowflake or somehow is replicated by an open source remains to be seen but history generally shows that a defacto standard for more complex problems like this is often going to emerge prior to an open source alternative. And that would be where I would place my bets. Although even that proprietary platform has to include open source optionality. But it's not a winner take all market. It's plenty of room for multiple players and ecosystem innovators, but winner will definitely take more in my opinion. Okay, let's close with some ETR data that looks at some of those major platform plays who talk a lot about digital transformation and world changing impactful missions. And they have the resources really to compete. This is an XY graphic. It's a view that we often show, it's got net score on the vertical access. That's a measure of spending momentum, and overlap or presence in the ETR survey. That red, that's the horizontal access. The red dotted line at 40% indicates that the platform is among the highest in terms of spending velocity. Which is why I always point out how impressive that makes AWS and Azure because not only are they large on the horizontal axis, the spending momentum on those two platforms rivals even that of Snowflake which continues to lead all on the vertical access. Now, while Google has momentum, given its goals and resources, it's well behind the two leaders. We've added Service Now and Salesforce, two platform names that have become the next great software companies. Joining likes of Oracle, which we show here and SAP not shown along with IBM, you can see them on this chart. We've also plotted MongoDB, which we think has real momentum as a company generally but also with Atlas, it's managed cloud database as a service specifically and Red Hat with trying to become the standard for app dev in Kubernetes environments, which is the hottest trend right now in application development and application modernization. Everybody's doing something with Kubernetes and of course, Red Hat with OpenShift wants to make that a better experience than do it yourself. The DYI brings a lot more complexity. And finally, we've got HPE and Dell both of which we've talked about pretty extensively here and VMware and Cisco. Now Cisco is executing on its portfolio strategy. It's got a lot of diverse components to its company. And it's coming at the cloud of course from a networking and security perspective. And that's their position of strength. And VMware is a staple of the enterprise. Yes, there's some uncertainty with regards to the Broadcom acquisition, but one thing is clear vSphere isn't going anywhere. It's entrenched and will continue to run lots of IT for years to come because it's the best platform on the planet. Now, of course, these are just some of the players in the mix. We expect that numerous non-traditional technology companies this is important to emerge as new cloud players. We've put a lot of emphasis on the data ecosystem because to us that's really going to be the main spring of digital, i.e., a digital company is a data company and that means an ecosystem of data partners that can advance outcomes like better healthcare, faster drug discovery, less fraud, cleaner energy, autonomous vehicles that are safer, smarter, more efficient grids and factories, better government and virtually endless litany of societal improvements that can be addressed. And these companies will be building innovations on top of cloud platforms creating their own super clouds, if you will. And they'll come from non-traditional places, industries, finance that take their data, their software, their tooling bring them to their customers and run them on various clouds. Okay, that's it for today. Thanks to Alex Myerson, who is on production and does the podcast for Breaking Analysis, Kristin Martin and Cheryl Knight, they help get the word out. And Rob Hoofe is our editor and chief over at Silicon Angle who helps edit our posts. Remember all these episodes are available as podcasts wherever you listen. All you got to do is search Breaking Analysis podcast. I publish each week on wikibon.com and siliconangle.com. You can email me directly at david.vellante@siliconangle.com or DM me at dvellante, or comment on my LinkedIn posts. And please do check out etr.ai for the best survey data in the enterprise tech business. This is Dave Vellante for theCUBE's Insights powered by ETR. Thanks for watching be well. And we'll see you next time on Breaking Analysis. (upbeat music)
SUMMARY :
This is Breaking Analysis that the good folks of Main Street, and it played out in the numbers. haven't been in the office And higher prices, And therefore that is that the so-called big data ecosystem
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Alex Myerson | PERSON | 0.99+ |
Tesla | ORGANIZATION | 0.99+ |
Rob Hoofe | PERSON | 0.99+ |
Cisco | ORGANIZATION | 0.99+ |
Cheryl Knight | PERSON | 0.99+ |
Dave Vellante | PERSON | 0.99+ |
John | PERSON | 0.99+ |
Dell | ORGANIZATION | 0.99+ |
Kristin Martin | PERSON | 0.99+ |
July of 2008 | DATE | 0.99+ |
Europe | LOCATION | 0.99+ |
5.6% | QUANTITY | 0.99+ |
Matt | PERSON | 0.99+ |
Spain | LOCATION | 0.99+ |
ORGANIZATION | 0.99+ | |
Boston | LOCATION | 0.99+ |
San Francisco | LOCATION | 0.99+ |
Monterey | ORGANIZATION | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
12 year | QUANTITY | 0.99+ |
2018 | DATE | 0.99+ |
Discover | ORGANIZATION | 0.99+ |
Zhamak Dehghani | PERSON | 0.99+ |
Las Vegas | LOCATION | 0.99+ |
Palo Alto | LOCATION | 0.99+ |
2019 | DATE | 0.99+ |
May | DATE | 0.99+ |
June | DATE | 0.99+ |
AWS | ORGANIZATION | 0.99+ |
IDC | ORGANIZATION | 0.99+ |
Last year | DATE | 0.99+ |
Oracle | ORGANIZATION | 0.99+ |
iPhone | COMMERCIAL_ITEM | 0.99+ |
Broadcom | ORGANIZATION | 0.99+ |
Silicon Angle | ORGANIZATION | 0.99+ |
Crawford Del Prete | PERSON | 0.99+ |
30% | QUANTITY | 0.99+ |
80% | QUANTITY | 0.99+ |
HPE | ORGANIZATION | 0.99+ |
12 physical events | QUANTITY | 0.99+ |
Dave | PERSON | 0.99+ |
KubeCon | EVENT | 0.99+ |
last week | DATE | 0.99+ |
United States | LOCATION | 0.99+ |
Android | TITLE | 0.99+ |
Dockercon | EVENT | 0.99+ |
40% | QUANTITY | 0.99+ |
two and a half years | QUANTITY | 0.99+ |
35 physical events | QUANTITY | 0.99+ |
Steve Bomber | PERSON | 0.99+ |
Capitola | ORGANIZATION | 0.99+ |
Cloud Native Con | EVENT | 0.99+ |
Red Hat Summit | EVENT | 0.99+ |
two leaders | QUANTITY | 0.99+ |
San Francisco Summit | EVENT | 0.99+ |
last year | DATE | 0.99+ |
21% | QUANTITY | 0.99+ |
david.vellante@siliconangle.com | OTHER | 0.99+ |
Veeam | ORGANIZATION | 0.99+ |
yesterday | DATE | 0.99+ |
One | QUANTITY | 0.99+ |
John Furrier | PERSON | 0.99+ |
VeeamON | EVENT | 0.99+ |
this year | DATE | 0.99+ |
16% | QUANTITY | 0.99+ |
$7 a gallon | QUANTITY | 0.98+ |
each week | QUANTITY | 0.98+ |
over $1,000 | QUANTITY | 0.98+ |
35 | QUANTITY | 0.98+ |
PagerDuty Summit | EVENT | 0.98+ |
Boost Your Solutions with the HPE Ezmeral Ecosystem Program | HPE Ezmeral Day 2021
>> Hello. My name is Ron Kafka, and I'm the senior director for Partner Scale Initiatives for HBE Ezmeral. Thanks for joining us today at Analytics Unleashed. By now, you've heard a lot about the Ezmeral portfolio and how it can help you accomplish objectives around big data analytics and containerization. I want to shift gears a bit and then discuss our Ezmeral Technology Partner Program. I've got two great guest speakers here with me today. And together, We're going to discuss how jointly we are solving data analytic challenges for our customers. Before I introduce them, I want to take a minute to talk to provide a little bit more insight into our ecosystem program. We've created a program with a realization based on customer feedback that even the most mature organizations are struggling with their data-driven transformation efforts. It turns out this is largely due to the pace of innovation with application vendors or ICS supporting data science and advanced analytic workloads. Their advancements are simply outpacing organization's ability to move workloads into production rapidly. Bottom line, organizations want a unified experience across environments where their entire application portfolio in essence provide a comprehensive application stack and not piece parts. So, let's talk about how our ecosystem program helps solve for this. For starters, we were leveraging HPEs long track record of forging technology partnerships and it created a best in class ISB partner program specific for the Ezmeral portfolio. We were doing this by developing an open concept marketplace where customers and partners can explore, learn, engage and collaborate with our strategic technology partners. This enables our customers to adopt, deploy validated applications from industry leading software vendors on HPE Ezmeral with a high degree of confidence. Also, it provides a very deep bench of leading ISVs for other groups inside of HPE to leverage for their solutioning efforts. Speaking of industry leading ISV, it's about time and introduce you to two of those industry leaders right now. Let me welcome Daniel Hladky from Dataiku, and Omri Geller from Run:AI. So I'd like to introduce Daniel Hladky. Daniel is with Dataiku. He's a great partner for HPE. Daniel, welcome. >> Thank you for having me here. >> That's great. Hey, would you mind just talking a bit about how your partnership journey has been with HPE? >> Yes, pleasure. So the journey started about five years ago and in 2018 we signed a worldwide reseller agreement with HPE. And in 2020, we actually started to work jointly on the integration between the Dataiku Data Science Studio called DSS and integrated that with the Ezmeral Container platform, and was a great success. And it was on behalf of some clear customer projects. >> It's been a long partnership journey with you for sure with HPE. And we welcome your partnership extremely well. Just a brief question about the Container Platform and really what that's meant for Dataiku. >> Yes, Ron. Thanks. So, basically I'd like the quote here Florian Douetteau, which is the CEO of Dataiku, who said that the combination of Dataiku with the HPE Ezmeral Container Platform will help the customers to successfully scale and put machine learning projects into production. And this basically is going to deliver real impact for their business. So, the combination of the two of us is a great success. >> That's great. Can you talk about what Dataiku is doing and how HPE Ezmeral Container Platform fits in a solution offering a bit more? >> Great. So basically Dataiku DSS is our product which is a end to end data science platform, and basically brings value to the project of customers on their past enterprise AI. In simple ways, we can say it could be as simple as building data pipelines, but it could be also very complex by having machine and deep learning models at scale. So the fast track to value is by having collaboration, orchestration online technologies and the models in production. So, all of that is part of the Data Science Studio and Ezmeral fits perfectly into the part where we design and then basically put at scale those project and put it into product. >> That's perfect. Can you be a bit more specific about how you see HPE and Dataiku really tightening up a customer outcome and value proposition? >> Yes. So what we see is also the challenge of the market that probably about 80% of the use cases really never make it to production. And this is of course a big challenge and we need to change that. And I think the combination of the two of us is actually addressing exactly this need. What we can say is part of the MLOps approach, Dataiku and the Ezmeral Container Platform will provide a frictionless approach, which means without scripting and coding, customers can put all those projects into the productive environment and don't have to worry any more and be more business oriented. >> That's great. So you mentioned you're seeing customers be a lot more mature with their AI workloads and deployment. What do you suggest for the other customers out there that are just starting this journey or just thinking about how to get started? >> Yeah. That's a very good question, Ron. So what we see there is actually the challenge that people need to go on a pass of maturity. And this starts with a simple data pipelines, et cetera, and then basically move up the ladder and basically build large complex project. And here I see a very interesting offer coming now from HPE which is called D3S, which is the data science startup pack. That's something I discussed together with HPE back in early 2020. And basically, it solves the three stages, which is explore, experiment and evolve and builds quickly MVPs for the customers. By doing so, basically you addressed business objectives, lay out in the proper architecture and also setting up the proper organization around it. So, this is a great combination by HPE and Dataiku through the D3S. >> And it's a perfect example of what I mentioned earlier about leveraging the ecosystem program that we built to do deeper solutioning efforts inside of HPE in this case with our AI business unit. So, congratulations on that and thanks for joining us today. I'm going to shift gears. I'm going to bring in Omri Geller from Run:AI. Omri, welcome. It's great to have you. You guys are killing it out there in the market today. And I just thought we could spend a few minutes talking about what is so unique and differentiated from your offerings. >> Thank you, Ron. It's a pleasure to be here. Run:AI creates a virtualization and orchestration layer for AI infrastructure. We help organizations to gain visibility and control over their GPO resources and help them deliver AI solutions to market faster. And we do that by managing granular scheduling, prioritization, allocation of compute power, together with the HPE Ezmeral Container Platform. >> That's great. And your partnership with HPE is a bit newer than Daniel's, right? Maybe about the last year or so we've been working together a lot more closely. Can you just talk about the HPE partnership, what it's meant for you and how do you see it impacting your business? >> Sure. First of all, Run:AI is excited to partner with HPE Ezmeral Container Platform and help customers manage appeals for their AI workloads. We chose HPE since HPE has years of experience partnering with AI use cases and outcomes with vendors who have strong footprint in this markets. HPE works with many partners that are complimentary for our use case such as Nvidia, and HPE Container Platform together with Run:AI and Nvidia deliver a world class solutions for AI accelerated workloads. And as you can understand, for AI speed is critical. Companies want to gather important AI initiatives into production as soon as they can. And the HPE Ezmeral Container Platform, running IGP orchestration solution enables that by enabling dynamic provisioning of GPU so that resources can be easily shared, efficiently orchestrated and optimal used. >> That's great. And you talked a lot about the efficiency of the solution. What about from a customer perspective? What is the real benefit that our customers are going to be able to gain from an HPE and Run:AI offering? >> So first, it is important to understand how data scientists and AI researchers actually build solution. They do it by running experiments. And if a data scientist is able to run more experiments per given time, they will get to the solution faster. With HPE Ezmeral Container Platform, Run:AI and users such as data scientists can actually do that and seamlessly and efficiently consume large amounts of GPU resources, run more experiments or given time and therefore accelerate their research. Together, we actually saw a customer that is running almost 7,000 jobs in parallel over GPUs with efficient utilization of those GPUs. And by running more experiments, those customers can be much more effective and efficient when it comes to bringing solutions to market >> Couldn't agree more. And I think we're starting to see a lot of joint success together as we go out and talk to the story. Hey, I want to thank you both one last time for being here with me today. It was very enlightening for our team to have you as part of the program. And I'm excited to extend this customer value proposition out to the rest of our communities. With that, I'd like to close today's session. I appreciate everyone's time. And keep an eye out on our ISP marketplace for Ezmeral We're continuing to expand and add new capabilities and new partners to our marketplace. We're excited to do a lot of great things and help you guys all be successful. Thanks for joining. >> Thank you, Ron. >> What a great panel discussion. And these partners they really do have a good understanding of the possibilities, working on the platform, and I hope and expect we'll see this ecosystem continue to grow. That concludes the main program, which means you can now pick one of three live demos to attend and chat live with experts. Now those three include day in the life of IT Admin, day in the life of a data scientist, and even a day in the life of the HPE Ezmeral Data Fabric, where you can see the many ways the data fabric is used in your life today. Wish you could attend all three, no worries. The recordings will be available on demand for you and your teams. Moreover, the show doesn't stop here, HPE has a growing and thriving tech community, you should check it out. It's really a solid starting point for learning more, talking to smart people about great ideas and seeing how Ezmeral can be part of your own data journey. Again, thanks very much to all of you for joining, until next time, keep unleashing the power of your data.
SUMMARY :
and how it can help you Hey, would you mind just talking a bit and integrated that with the and really what that's meant for Dataiku. So, basically I'd like the quote here Florian Douetteau, and how HPE Ezmeral Container Platform and the models in production. about how you see HPE and and the Ezmeral Container Platform or just thinking about how to get started? and builds quickly MVPs for the customers. and differentiated from your offerings. and control over their GPO resources and how do you see it and HPE Container Platform together with Run:AI efficiency of the solution. So first, it is important to understand for our team to have you and even a day in the life of
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Daniel | PERSON | 0.99+ |
Ron Kafka | PERSON | 0.99+ |
Ron | PERSON | 0.99+ |
Omri Geller | PERSON | 0.99+ |
Florian Douetteau | PERSON | 0.99+ |
HPE | ORGANIZATION | 0.99+ |
Daniel Hladky | PERSON | 0.99+ |
Dataiku | ORGANIZATION | 0.99+ |
two | QUANTITY | 0.99+ |
2020 | DATE | 0.99+ |
Nvidia | ORGANIZATION | 0.99+ |
2018 | DATE | 0.99+ |
DSS | ORGANIZATION | 0.99+ |
one | QUANTITY | 0.99+ |
last year | DATE | 0.99+ |
today | DATE | 0.99+ |
three | QUANTITY | 0.99+ |
early 2020 | DATE | 0.99+ |
first | QUANTITY | 0.98+ |
Data Science Studio | ORGANIZATION | 0.98+ |
Ezmeral | PERSON | 0.98+ |
Ezmeral | ORGANIZATION | 0.98+ |
Dataiku Data Science Studio | ORGANIZATION | 0.97+ |
three live demos | QUANTITY | 0.97+ |
both | QUANTITY | 0.97+ |
about 80% | QUANTITY | 0.96+ |
HPEs | ORGANIZATION | 0.95+ |
three stages | QUANTITY | 0.94+ |
two great guest speakers | QUANTITY | 0.93+ |
Omri | PERSON | 0.91+ |
Analytics Unleashed | ORGANIZATION | 0.91+ |
D3S | TITLE | 0.87+ |
almost 7,000 jobs | QUANTITY | 0.87+ |
HPE Container Platform | TITLE | 0.86+ |
HPE Ezmeral Container Platform | TITLE | 0.83+ |
HBE Ezmeral | ORGANIZATION | 0.83+ |
Run | ORGANIZATION | 0.82+ |
Ezmeral Container Platform | TITLE | 0.81+ |
about five years ago | DATE | 0.8+ |
Platform | TITLE | 0.71+ |
Ezmeral | TITLE | 0.7+ |
Run:AI | ORGANIZATION | 0.7+ |
Ezmeral Data | ORGANIZATION | 0.69+ |
2021 | DATE | 0.68+ |
Ezmeral Ecosystem Program | TITLE | 0.68+ |
ICS | ORGANIZATION | 0.67+ |
Run | TITLE | 0.66+ |
Partner Scale Initiatives | ORGANIZATION | 0.66+ |
Boost Your Solutions with the HPE Ezmeral Ecosystem Program | HPE Ezmeral Day 2021
>> Hello. My name is Ron Kafka, and I'm the senior director for Partner Scale Initiatives for HBE Ezmeral. Thanks for joining us today at Analytics Unleashed. By now, you've heard a lot about the Ezmeral portfolio and how it can help you accomplish objectives around big data analytics and containerization. I want to shift gears a bit and then discuss our Ezmeral Technology Partner Program. I've got two great guest speakers here with me today. And together, We're going to discuss how jointly we are solving data analytic challenges for our customers. Before I introduce them, I want to take a minute to talk to provide a little bit more insight into our ecosystem program. We've created a program with a realization based on customer feedback that even the most mature organizations are struggling with their data-driven transformation efforts. It turns out this is largely due to the pace of innovation with application vendors or ICS supporting data science and advanced analytic workloads. Their advancements are simply outpacing organization's ability to move workloads into production rapidly. Bottom line, organizations want a unified experience across environments where their entire application portfolio in essence provide a comprehensive application stack and not piece parts. So, let's talk about how our ecosystem program helps solve for this. For starters, we were leveraging HPEs long track record of forging technology partnerships and it created a best in class ISB partner program specific for the Ezmeral portfolio. We were doing this by developing an open concept marketplace where customers and partners can explore, learn, engage and collaborate with our strategic technology partners. This enables our customers to adopt, deploy validated applications from industry leading software vendors on HPE Ezmeral with a high degree of confidence. Also, it provides a very deep bench of leading ISVs for other groups inside of HPE to leverage for their solutioning efforts. Speaking of industry leading ISV, it's about time and introduce you to two of those industry leaders right now. Let me welcome Daniel Hladky from Dataiku, and Omri Geller from Run:AI. So I'd like to introduce Daniel Hladky. Daniel is with Dataiku. He's a great partner for HPE. Daniel, welcome. >> Thank you for having me here. >> That's great. Hey, would you mind just talking a bit about how your partnership journey has been with HPE? >> Yes, pleasure. So the journey started about five years ago and in 2018 we signed a worldwide reseller agreement with HPE. And in 2020, we actually started to work jointly on the integration between the Dataiku Data Science Studio called DSS and integrated that with the Ezmeral Container platform, and was a great success. And it was on behalf of some clear customer projects. >> It's been a long partnership journey with you for sure with HPE. And we welcome your partnership extremely well. Just a brief question about the Container Platform and really what that's meant for Dataiku. >> Yes, Ron. Thanks. So, basically I like the quote here Florian Douetteau, which is the CEO of Dataiku, who said that the combination of Dataiku with the HPE Ezmeral Container Platform will help the customers to successfully scale and put machine learning projects into production. And this basically is going to deliver real impact for their business. So, the combination of the two of us is a great success. >> That's great. Can you talk about what Dataiku is doing and how HPE Ezmeral Container Platform fits in a solution offering a bit more? >> Great. So basically Dataiku DSS is our product which is a end to end data science platform, and basically brings value to the project of customers on their past enterprise AI. In simple ways, we can say it could be as simple as building data pipelines, but it could be also very complex by having machine and deep learning models at scale. So the fast track to value is by having collaboration, orchestration online technologies and the models in production. So, all of that is part of the Data Science Studio and Ezmeral fits perfectly into the part where we design and then basically put at scale those project and put it into product. >> That's perfect. Can you be a bit more specific about how you see HPE and Dataiku really tightening up a customer outcome and value proposition? >> Yes. So what we see is also the challenge of the market that probably about 80% of the use cases really never make it to production. And this is of course a big challenge and we need to change that. And I think the combination of the two of us is actually addressing exactly this need. What we can say is part of the MLOps approach, Dataiku and the Ezmeral Container Platform will provide a frictionless approach, which means without scripting and coding, customers can put all those projects into the productive environment and don't have to worry any more and be more business oriented. >> That's great. So you mentioned you're seeing customers be a lot more mature with their AI workloads and deployment. What do you suggest for the other customers out there that are just starting this journey or just thinking about how to get started? >> Yeah. That's a very good question, Ron. So what we see there is actually the challenge that people need to go on a pass of maturity. And this starts with a simple data pipelines, et cetera, and then basically move up the ladder and basically build large complex project. And here I see a very interesting offer coming now from HPE which is called D3S, which is the data science startup pack. That's something I discussed together with HPE back in early 2020. And basically, it solves the three stages, which is explore, experiment and evolve and builds quickly MVPs for the customers. By doing so, basically you addressed business objectives, lay out in the proper architecture and also setting up the proper organization around it. So, this is a great combination by HPE and Dataiku through the D3S. >> And it's a perfect example of what I mentioned earlier about leveraging the ecosystem program that we built to do deeper solutioning efforts inside of HPE in this case with our AI business unit. So, congratulations on that and thanks for joining us today. I'm going to shift gears. I'm going to bring in Omri Geller from Run:AI. Omri, welcome. It's great to have you. You guys are killing it out there in the market today. And I just thought we could spend a few minutes talking about what is so unique and differentiated from your offerings. >> Thank you, Ron. It's a pleasure to be here. Run:AI creates a virtualization and orchestration layer for AI infrastructure. We help organizations to gain visibility and control over their GPO resources and help them deliver AI solutions to market faster. And we do that by managing granular scheduling, prioritization, allocation of compute power, together with the HPE Ezmeral Container Platform. >> That's great. And your partnership with HPE is a bit newer than Daniel's, right? Maybe about the last year or so we've been working together a lot more closely. Can you just talk about the HPE partnership, what it's meant for you and how do you see it impacting your business? >> Sure. First of all, Run:AI is excited to partner with HPE Ezmeral Container Platform and help customers manage appeals for their AI workloads. We chose HPE since HPE has years of experience partnering with AI use cases and outcomes with vendors who have strong footprint in this markets. HPE works with many partners that are complimentary for our use case such as Nvidia, and HPE Ezmeral Container Platform together with Run:AI and Nvidia deliver a word about solution for AI accelerated workloads. And as you can understand, for AI speed is critical. Companies want to gather important AI initiatives into production as soon as they can. And the HPE Ezmeral Container Platform, running IGP orchestration solution enables that by enabling dynamic provisioning of GPU so that resources can be easily shared, efficiently orchestrated and optimal used. >> That's great. And you talked a lot about the efficiency of the solution. What about from a customer perspective? What is the real benefit that our customers are going to be able to gain from an HPE and Run:AI offering? >> So first, it is important to understand how data scientists and AI researchers actually build solution. They do it by running experiments. And if a data scientist is able to run more experiments per given time, they will get to the solution faster. With HPE Ezmeral Container Platform, Run:AI and users such as data scientists can actually do that and seamlessly and efficiently consume large amounts of GPU resources, run more experiments or given time and therefore accelerate their research. Together, we actually saw a customer that is running almost 7,000 jobs in parallel over GPUs with efficient utilization of those GPUs. And by running more experiments, those customers can be much more effective and efficient when it comes to bringing solutions to market >> Couldn't agree more. And I think we're starting to see a lot of joint success together as we go out and talk to the story. Hey, I want to thank you both one last time for being here with me today. It was very enlightening for our team to have you as part of the program. And I'm excited to extend this customer value proposition out to the rest of our communities. With that, I'd like to close today's session. I appreciate everyone's time. And keep an eye out on our ISP marketplace for Ezmeral We're continuing to expand and add new capabilities and new partners to our marketplace. We're excited to do a lot of great things and help you guys all be successful. Thanks for joining. >> Thank you, Ron. (bright upbeat music)
SUMMARY :
and how it can help you journey has been with HPE? and integrated that with the and really what that's meant for Dataiku. and put machine learning and how HPE Ezmeral Container Platform and the models in production. about how you see HPE and and the Ezmeral Container Platform or just thinking about how to get started? and builds quickly MVPs for the customers. and differentiated from your offerings. and control over their GPO resources and how do you see it and outcomes with vendors efficiency of the solution. So first, it is important to understand and new partners to our marketplace. Thank you, Ron.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Daniel | PERSON | 0.99+ |
Ron Kafka | PERSON | 0.99+ |
Florian Douetteau | PERSON | 0.99+ |
Ron | PERSON | 0.99+ |
Omri Geller | PERSON | 0.99+ |
HPE | ORGANIZATION | 0.99+ |
Daniel Hladky | PERSON | 0.99+ |
Nvidia | ORGANIZATION | 0.99+ |
two | QUANTITY | 0.99+ |
2020 | DATE | 0.99+ |
2018 | DATE | 0.99+ |
Dataiku | ORGANIZATION | 0.99+ |
DSS | ORGANIZATION | 0.99+ |
last year | DATE | 0.99+ |
today | DATE | 0.99+ |
Omri | PERSON | 0.99+ |
Data Science Studio | ORGANIZATION | 0.98+ |
early 2020 | DATE | 0.98+ |
first | QUANTITY | 0.98+ |
Ezmeral | ORGANIZATION | 0.98+ |
Dataiku Data Science Studio | ORGANIZATION | 0.97+ |
about 80% | QUANTITY | 0.97+ |
both | QUANTITY | 0.97+ |
HPEs | ORGANIZATION | 0.95+ |
three stages | QUANTITY | 0.94+ |
two great guest speakers | QUANTITY | 0.93+ |
one | QUANTITY | 0.93+ |
almost 7,000 jobs | QUANTITY | 0.92+ |
Analytics Unleashed | ORGANIZATION | 0.91+ |
HPE Ezmeral Container Platform | TITLE | 0.84+ |
HBE Ezmeral | ORGANIZATION | 0.83+ |
Run | ORGANIZATION | 0.83+ |
Ezmeral Container Platform | TITLE | 0.82+ |
D3S | TITLE | 0.81+ |
about five years ago | DATE | 0.8+ |
HPE Ezmeral Container Platform | TITLE | 0.79+ |
2021 | DATE | 0.76+ |
Run:AI | ORGANIZATION | 0.72+ |
Ezmeral | TITLE | 0.7+ |
Platform | TITLE | 0.69+ |
Ezmeral Container Platform | TITLE | 0.68+ |
ICS | ORGANIZATION | 0.67+ |
Partner Scale Initiatives | ORGANIZATION | 0.66+ |
HPE | TITLE | 0.62+ |
DSS | TITLE | 0.6+ |
Ezmeral Container | TITLE | 0.59+ |
Container | TITLE | 0.56+ |
HPE Ezmeral | EVENT | 0.55+ |
First | QUANTITY | 0.52+ |
Run | TITLE | 0.51+ |
Day | EVENT | 0.51+ |
Rob Thomas, IBM | Change the Game: Winning With AI 2018
>> [Announcer] Live from Times Square in New York City, it's theCUBE covering IBM's Change the Game: Winning with AI, brought to you by IBM. >> Hello everybody, welcome to theCUBE's special presentation. We're covering IBM's announcements today around AI. IBM, as theCUBE does, runs of sessions and programs in conjunction with Strata, which is down at the Javits, and we're Rob Thomas, who's the General Manager of IBM Analytics. Long time Cube alum, Rob, great to see you. >> Dave, great to see you. >> So you guys got a lot going on today. We're here at the Westin Hotel, you've got an analyst event, you've got a partner meeting, you've got an event tonight, Change the game: winning with AI at Terminal 5, check that out, ibm.com/WinWithAI, go register there. But Rob, let's start with what you guys have going on, give us the run down. >> Yeah, it's a big week for us, and like many others, it's great when you have Strata, a lot of people in town. So, we've structured a week where, today, we're going to spend a lot of time with analysts and our business partners, talking about where we're going with data and AI. This evening, we've got a broadcast, it's called Winning with AI. What's unique about that broadcast is it's all clients. We've got clients on stage doing demonstrations, how they're using IBM technology to get to unique outcomes in their business. So I think it's going to be a pretty unique event, which should be a lot of fun. >> So this place, it looks like a cool event, a venue, Terminal 5, it's just up the street on the west side highway, probably a mile from the Javits Center, so definitely check that out. Alright, let's talk about, Rob, we've known each other for a long time, we've seen the early Hadoop days, you guys were very careful about diving in, you kind of let things settle and watched very carefully, and then came in at the right time. But we saw the evolution of so-called Big Data go from a phase of really reducing investments, cheaper data warehousing, and what that did is allowed people to collect a lot more data, and kind of get ready for this era that we're in now. But maybe you can give us your perspective on the phases, the waves that we've seen of data, and where we are today and where we're going. >> I kind of think of it as a maturity curve. So when I go talk to clients, I say, look, you need to be on a journey towards AI. I think probably nobody disagrees that they need something there, the question is, how do you get there? So you think about the steps, it's about, a lot of people started with, we're going to reduce the cost of our operations, we're going to use data to take out cost, that was kind of the Hadoop thrust, I would say. Then they moved to, well, now we need to see more about our data, we need higher performance data, BI data warehousing. So, everybody, I would say, has dabbled in those two area. The next leap forward is self-service analytics, so how do you actually empower everybody in your organization to use and access data? And the next step beyond that is, can I use AI to drive new business models, new levers of growth, for my business? So, I ask clients, pin yourself on this journey, most are, depends on the division or the part of the company, they're at different areas, but as I tell everybody, if you don't know where you are and you don't know where you want to go, you're just going to wind around, so I try to get them to pin down, where are you versus where do you want to go? >> So four phases, basically, the sort of cheap data store, the BI data warehouse modernization, self-service analytics, a big part of that is data science and data science collaboration, you guys have a lot of investments there, and then new business models with AI automation running on top. Where are we today? Would you say we're kind of in-between BI/DW modernization and on our way to self-service analytics, or what's your sense? >> I'd say most are right in the middle between BI data warehousing and self-service analytics. Self-service analytics is hard, because it requires you, sometimes to take a couple steps back, and look at your data. It's hard to provide self-service if you don't have a data catalog, if you don't have data security, if you haven't gone through the processes around data governance. So, sometimes you have to take one step back to go two steps forward, that's why I see a lot of people, I'd say, stuck in the middle right now. And the examples that you're going to see tonight as part of the broadcast are clients that have figured out how to break through that wall, and I think that's pretty illustrative of what's possible. >> Okay, so you're saying that, got to maybe take a step back and get the infrastructure right with, let's say a catalog, to give some basic things that they have to do, some x's and o's, you've got the Vince Lombardi played out here, and also, skillsets, I imagine, is a key part of that. So, that's what they've got to do to get prepared, and then, what's next? They start creating new business models, imagining this is where the cheap data officer comes in and it's an executive level, what are you seeing clients as part of digital transformation, what's the conversation like with customers? >> The biggest change, the great thing about the times we live in, is technology's become so accessible, you can do things very quickly. We created a team last year called Data Science Elite, and we've hired what we think are some of the best data scientists in the world. Their only job is to go work with clients and help them get to a first success with data science. So, we put a team in. Normally, one month, two months, normally a team of two or three people, our investment, and we say, let's go build a model, let's get to an outcome, and you can do this incredibly quickly now. I tell clients, I see somebody that says, we're going to spend six months evaluating and thinking about this, I was like, why would you spend six months thinking about this when you could actually do it in one month? So you just need to get over the edge and go try it. >> So we're going to learn more about the Data Science Elite team. We've got John Thomas coming on today, who is a distinguished engineer at IBM, and he's very much involved in that team, and I think we have a customer who's actually gone through that, so we're going to talk about what their experience was with the Data Science Elite team. Alright, you've got some hard news coming up, you've actually made some news earlier with Hortonworks and Red Hat, I want to talk about that, but you've also got some hard news today. Take us through that. >> Yeah, let's talk about all three. First, Monday we announced the expanded relationship with both Hortonworks and Red Hat. This goes back to one of the core beliefs I talked about, every enterprise is modernizing their data and application of states, I don't think there's any debate about that. We are big believers in Kubernetes and containers as the architecture to drive that modernization. The announcement on Monday was, we're working closer with Red Hat to take all of our data services as part of Cloud Private for Data, which are basically microservice for data, and we're running those on OpenShift, and we're starting to see great customer traction with that. And where does Hortonworks come in? Hadoop has been the outlier on moving to microservices containers, we're working with Hortonworks to help them make that move as well. So, it's really about the three of us getting together and helping clients with this modernization journey. >> So, just to remind people, you remember ODPI, folks? It was all this kerfuffle about, why do we even need this? Well, what's interesting to me about this triumvirate is, well, first of all, Red Hat and Hortonworks are hardcore opensource, IBM's always been a big supporter of open source. You three got together and you're proving now the productivity for customers of this relationship. You guys don't talk about this, but Hortonworks had to, when it's public call, that the relationship with IBM drove many, many seven-figure deals, which, obviously means that customers are getting value out of this, so it's great to see that come to fruition, and it wasn't just a Barney announcement a couple years ago, so congratulations on that. Now, there's this other news that you guys announced this morning, talk about that. >> Yeah, two other things. One is, we announced a relationship with Stack Overflow. 50 million developers go to Stack Overflow a month, it's an amazing environment for developers that are looking to do new things, and we're sponsoring a community around AI. Back to your point before, you said, is there a skills gap in enterprises, there absolutely is, I don't think that's a surprise. Data science, AI developers, not every company has the skills they need, so we're sponsoring a community to help drive the growth of skills in and around data science and AI. So things like Python, R, Scala, these are the languages of data science, and it's a great relationship with us and Stack Overflow to build a community to get things going on skills. >> Okay, and then there was one more. >> Last one's a product announcement. This is one of the most interesting product annoucements we've had in quite a while. Imagine this, you write a sequel query, and traditional approach is, I've got a server, I point it as that server, I get the data, it's pretty limited. We're announcing technology where I write a query, and it can find data anywhere in the world. I think of it as wide-area sequel. So it can find data on an automotive device, a telematics device, an IoT device, it could be a mobile device, we think of it as sequel the whole world. You write a query, you can find the data anywhere it is, and we take advantage of the processing power on the edge. The biggest problem with IoT is, it's been the old mantra of, go find the data, bring it all back to a centralized warehouse, that makes it impossible to do it real time. We're enabling real time because we can write a query once, find data anywhere, this is technology we've had in preview for the last year. We've been working with a lot of clients to prove out used cases to do it, we're integrating as the capability inside of IBM Cloud Private for Data. So if you buy IBM Cloud for Data, it's there. >> Interesting, so when you've been around as long as I have, long enough to see some of the pendulums swings, and it's clearly a pendulum swing back toward decentralization in the edge, but the key is, from what you just described, is you're sort of redefining the boundary, so I presume it's the edge, any Cloud, or on premises, where you can find that data, is that correct? >> Yeah, so it's multi-Cloud. I mean, look, every organization is going to be multi-Cloud, like 100%, that's going to happen, and that could be private, it could be multiple public Cloud providers, but the key point is, data on the edge is not just limited to what's in those Clouds. It could be anywhere that you're collecting data. And, we're enabling an architecture which performs incredibly well, because you take advantage of processing power on the edge, where you can get data anywhere that it sits. >> Okay, so, then, I'm setting up a Cloud, I'll call it a Cloud architecture, that encompasses the edge, where essentially, there are no boundaries, and you're bringing security. We talked about containers before, we've been talking about Kubernetes all week here at a Big Data show. And then of course, Cloud, and what's interesting, I think many of the Hadoop distral vendors kind of missed Cloud early on, and then now are sort of saying, oh wow, it's a hybrid world and we've got a part, you guys obviously made some moves, a couple billion dollar moves, to do some acquisitions and get hardcore into Cloud, so that becomes a critical component. You're not just limiting your scope to the IBM Cloud. You're recognizing that it's a multi-Cloud world, that' what customers want to do. Your comments. >> It's multi-Cloud, and it's not just the IBM Cloud, I think the most predominant Cloud that's emerging is every client's private Cloud. Every client I talk to is building out a containerized architecture. They need their own Cloud, and they need seamless connectivity to any public Cloud that they may be using. This is why you see such a premium being put on things like data ingestion, data curation. It's not popular, it's not exciting, people don't want to talk about it, but we're the biggest inhibitors, to this AI point, comes back to data curation, data ingestion, because if you're dealing with multiple Clouds, suddenly your data's in a bunch of different spots. >> Well, so you're basically, and we talked about this a lot on theCUBE, you're bringing the Cloud model to the data, wherever the data lives. Is that the right way to think about it? >> I think organizations have spoken, set aside what they say, look at their actions. Their actions say, we don't want to move all of our data to any particular Cloud, we'll move some of our data. We need to give them seamless connectivity so that they can leave their data where they want, we can bring Cloud-Native Architecture to their data, we could also help move their data to a Cloud-Native architecture if that's what they prefer. >> Well, it makes sense, because you've got physics, latency, you've got economics, moving all the data into a public Cloud is expensive and just doesn't make economic sense, and then you've got things like GDPR, which says, well, you have to keep the data, certain laws of the land, if you will, that say, you've got to keep the data in whatever it is, in Germany, or whatever country. So those sort of edicts dictate how you approach managing workloads and what you put where, right? Okay, what's going on with Watson? Give us the update there. >> I get a lot of questions, people trying to peel back the onion of what exactly is it? So, I want to make that super clear here. Watson is a few things, start at the bottom. You need a runtime for models that you've built. So we have a product called Watson Machine Learning, runs anywhere you want, that is the runtime for how you execute models that you've built. Anytime you have a runtime, you need somewhere where you can build models, you need a development environment. That is called Watson Studio. So, we had a product called Data Science Experience, we've evolved that into Watson Studio, connecting in some of those features. So we have Watson Studio, that's the development environment, Watson Machine Learning, that's the runtime. Now you move further up the stack. We have a set of APIs that bring in human features, vision, natural language processing, audio analytics, those types of things. You can integrate those as part of a model that you build. And then on top of that, we've got things like Watson Applications, we've got Watson for call centers, doing customer service and chatbots, and then we've got a lot of clients who've taken pieces of that stack and built their own AI solutions. They've taken some of the APIs, they've taken some of the design time, the studio, they've taken some of the Watson Machine Learning. So, it is really a stack of capabilities, and where we're driving the greatest productivity, this is in a lot of the examples you'll see tonight for clients, is clients that have bought into this idea of, I need a development environment, I need a runtime, where I can deploy models anywhere. We're getting a lot of momentum on that, and then that raises the question of, well, do I have expandability, do I have trust in transparency, and that's another thing that we're working on. >> Okay, so there's API oriented architecture, exposing all these services make it very easy for people to consume. Okay, so we've been talking all week at Cube NYC, is Big Data is in AI, is this old wine, new bottle? I mean, it's clear, Rob, from the conversation here, there's a lot of substantive innovation, and early adoption, anyway, of some of these innovations, but a lot of potential going forward. Last thoughts? >> What people have to realize is AI is not magic, it's still computer science. So it actually requires some hard work. You need to roll up your sleeves, you need to understand how I get from point A to point B, you need a development environment, you need a runtime. I want people to really think about this, it's not magic. I think for a while, people have gotten the impression that there's some magic button. There's not, but if you put in the time, and it's not a lot of time, you'll see the examples tonight, most of them have been done in one or two months, there's great business value in starting to leverage AI in your business. >> Awesome, alright, so if you're in this city or you're at Strata, go to ibm.com/WinWithAI, register for the event tonight. Rob, we'll see you there, thanks so much for coming back. >> Yeah, it's going to be fun, thanks Dave, great to see you. >> Alright, keep it right there everybody, we'll be back with our next guest right after this short break, you're watching theCUBE.
SUMMARY :
brought to you by IBM. Long time Cube alum, Rob, great to see you. But Rob, let's start with what you guys have going on, it's great when you have Strata, a lot of people in town. and kind of get ready for this era that we're in now. where you want to go, you're just going to wind around, and data science collaboration, you guys have It's hard to provide self-service if you don't have and it's an executive level, what are you seeing let's get to an outcome, and you can do this and I think we have a customer who's actually as the architecture to drive that modernization. So, just to remind people, you remember ODPI, folks? has the skills they need, so we're sponsoring a community and it can find data anywhere in the world. of processing power on the edge, where you can get data a couple billion dollar moves, to do some acquisitions This is why you see such a premium being put on things Is that the right way to think about it? to a Cloud-Native architecture if that's what they prefer. certain laws of the land, if you will, that say, for how you execute models that you've built. I mean, it's clear, Rob, from the conversation here, and it's not a lot of time, you'll see the examples tonight, Rob, we'll see you there, thanks so much for coming back. we'll be back with our next guest
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
IBM | ORGANIZATION | 0.99+ |
Dave | PERSON | 0.99+ |
Hortonworks | ORGANIZATION | 0.99+ |
six months | QUANTITY | 0.99+ |
Rob | PERSON | 0.99+ |
Rob Thomas | PERSON | 0.99+ |
John Thomas | PERSON | 0.99+ |
two months | QUANTITY | 0.99+ |
one month | QUANTITY | 0.99+ |
Germany | LOCATION | 0.99+ |
last year | DATE | 0.99+ |
Red Hat | ORGANIZATION | 0.99+ |
Monday | DATE | 0.99+ |
one | QUANTITY | 0.99+ |
100% | QUANTITY | 0.99+ |
GDPR | TITLE | 0.99+ |
three people | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
two | QUANTITY | 0.99+ |
ibm.com/WinWithAI | OTHER | 0.99+ |
Watson Studio | TITLE | 0.99+ |
Python | TITLE | 0.99+ |
Scala | TITLE | 0.99+ |
First | QUANTITY | 0.99+ |
Data Science Elite | ORGANIZATION | 0.99+ |
both | QUANTITY | 0.99+ |
Cube | ORGANIZATION | 0.99+ |
one step | QUANTITY | 0.99+ |
One | QUANTITY | 0.99+ |
Times Square | LOCATION | 0.99+ |
today | DATE | 0.99+ |
Vince Lombardi | PERSON | 0.98+ |
three | QUANTITY | 0.98+ |
Stack Overflow | ORGANIZATION | 0.98+ |
tonight | DATE | 0.98+ |
Javits Center | LOCATION | 0.98+ |
Barney | ORGANIZATION | 0.98+ |
Terminal 5 | LOCATION | 0.98+ |
IBM Analytics | ORGANIZATION | 0.98+ |
Watson | TITLE | 0.97+ |
two steps | QUANTITY | 0.97+ |
New York City | LOCATION | 0.97+ |
Watson Applications | TITLE | 0.97+ |
Cloud | TITLE | 0.96+ |
This evening | DATE | 0.95+ |
Watson Machine Learning | TITLE | 0.94+ |
two area | QUANTITY | 0.93+ |
seven-figure deals | QUANTITY | 0.92+ |
Cube | PERSON | 0.91+ |
Sreesha Rao, Niagara Bottling & Seth Dobrin, IBM | Change The Game: Winning With AI 2018
>> Live, from Times Square, in New York City, it's theCUBE covering IBM's Change the Game: Winning with AI. Brought to you by IBM. >> Welcome back to the Big Apple, everybody. I'm Dave Vellante, and you're watching theCUBE, the leader in live tech coverage, and we're here covering a special presentation of IBM's Change the Game: Winning with AI. IBM's got an analyst event going on here at the Westin today in the theater district. They've got 50-60 analysts here. They've got a partner summit going on, and then tonight, at Terminal 5 of the West Side Highway, they've got a customer event, a lot of customers there. We've talked earlier today about the hard news. Seth Dobern is here. He's the Chief Data Officer of IBM Analytics, and he's joined by Shreesha Rao who is the Senior Manager of IT Applications at California-based Niagara Bottling. Gentlemen, welcome to theCUBE. Thanks so much for coming on. >> Thank you, Dave. >> Well, thanks Dave for having us. >> Yes, always a pleasure Seth. We've known each other for a while now. I think we met in the snowstorm in Boston, sparked something a couple years ago. >> Yep. When we were both trapped there. >> Yep, and at that time, we spent a lot of time talking about your internal role as the Chief Data Officer, working closely with Inderpal Bhandari, and you guys are doing inside of IBM. I want to talk a little bit more about your other half which is working with clients and the Data Science Elite Team, and we'll get into what you're doing with Niagara Bottling, but let's start there, in terms of that side of your role, give us the update. >> Yeah, like you said, we spent a lot of time talking about how IBM is implementing the CTO role. While we were doing that internally, I spent quite a bit of time flying around the world, talking to our clients over the last 18 months since I joined IBM, and we found a consistent theme with all the clients, in that, they needed help learning how to implement data science, AI, machine learning, whatever you want to call it, in their enterprise. There's a fundamental difference between doing these things at a university or as part of a Kaggle competition than in an enterprise, so we felt really strongly that it was important for the future of IBM that all of our clients become successful at it because what we don't want to do is we don't want in two years for them to go "Oh my God, this whole data science thing was a scam. We haven't made any money from it." And it's not because the data science thing is a scam. It's because the way they're doing it is not conducive to business, and so we set up this team we call the Data Science Elite Team, and what this team does is we sit with clients around a specific use case for 30, 60, 90 days, it's really about 3 or 4 sprints, depending on the material, the client, and how long it takes, and we help them learn through this use case, how to use Python, R, Scala in our platform obviously, because we're here to make money too, to implement these projects in their enterprise. Now, because it's written in completely open-source, if they're not happy with what the product looks like, they can take their toys and go home afterwards. It's on us to prove the value as part of this, but there's a key point here. My team is not measured on sales. They're measured on adoption of AI in the enterprise, and so it creates a different behavior for them. So they're really about "Make the enterprise successful," right, not "Sell this software." >> Yeah, compensation drives behavior. >> Yeah, yeah. >> So, at this point, I ask, "Well, do you have any examples?" so Shreesha, let's turn to you. (laughing softly) Niagara Bottling -- >> As a matter of fact, Dave, we do. (laughing) >> Yeah, so you're not a bank with a trillion dollars in assets under management. Tell us about Niagara Bottling and your role. >> Well, Niagara Bottling is the biggest private label bottled water manufacturing company in the U.S. We make bottled water for Costcos, Walmarts, major national grocery retailers. These are our customers whom we service, and as with all large customers, they're demanding, and we provide bottled water at relatively low cost and high quality. >> Yeah, so I used to have a CIO consultancy. We worked with every CIO up and down the East Coast. I always observed, really got into a lot of organizations. I was always observed that it was really the heads of Application that drove AI because they were the glue between the business and IT, and that's really where you sit in the organization, right? >> Yes. My role is to support the business and business analytics as well as I support some of the distribution technologies and planning technologies at Niagara Bottling. >> So take us the through the project if you will. What were the drivers? What were the outcomes you envisioned? And we can kind of go through the case study. >> So the current project that we leveraged IBM's help was with a stretch wrapper project. Each pallet that we produce--- we produce obviously cases of bottled water. These are stacked into pallets and then shrink wrapped or stretch wrapped with a stretch wrapper, and this project is to be able to save money by trying to optimize the amount of stretch wrap that goes around a pallet. We need to be able to maintain the structural stability of the pallet while it's transported from the manufacturing location to our customer's location where it's unwrapped and then the cases are used. >> And over breakfast we were talking. You guys produce 2833 bottles of water per second. >> Wow. (everyone laughs) >> It's enormous. The manufacturing line is a high speed manufacturing line, and we have a lights-out policy where everything runs in an automated fashion with raw materials coming in from one end and the finished goods, pallets of water, going out. It's called pellets to pallets. Pellets of plastic coming in through one end and pallets of water going out through the other end. >> Are you sitting on top of an aquifer? Or are you guys using sort of some other techniques? >> Yes, in fact, we do bore wells and extract water from the aquifer. >> Okay, so the goal was to minimize the amount of material that you used but maintain its stability? Is that right? >> Yes, during transportation, yes. So if we use too much plastic, we're not optimally, I mean, we're wasting material, and cost goes up. We produce almost 16 million pallets of water every single year, so that's a lot of shrink wrap that goes around those, so what we can save in terms of maybe 15-20% of shrink wrap costs will amount to quite a bit. >> So, how does machine learning fit into all of this? >> So, machine learning is way to understand what kind of profile, if we can measure what is happening as we wrap the pallets, whether we are wrapping it too tight or by stretching it, that results in either a conservative way of wrapping the pallets or an aggressive way of wrapping the pallets. >> I.e. too much material, right? >> Too much material is conservative, and aggressive is too little material, and so we can achieve some savings if we were to alternate between the profiles. >> So, too little material means you lose product, right? >> Yes, and there's a risk of breakage, so essentially, while the pallet is being wrapped, if you are stretching it too much there's a breakage, and then it interrupts production, so we want to try and avoid that. We want a continuous production, at the same time, we want the pallet to be stable while saving material costs. >> Okay, so you're trying to find that ideal balance, and how much variability is in there? Is it a function of distance and how many touches it has? Maybe you can share with that. >> Yes, so each pallet takes about 16-18 wraps of the stretch wrapper going around it, and that's how much material is laid out. About 250 grams of plastic that goes on there. So we're trying to optimize the gram weight which is the amount of plastic that goes around each of the pallet. >> So it's about predicting how much plastic is enough without having breakage and disrupting your line. So they had labeled data that was, "if we stretch it this much, it breaks. If we don't stretch it this much, it doesn't break, but then it was about predicting what's good enough, avoiding both of those extremes, right? >> Yes. >> So it's a truly predictive and iterative model that we've built with them. >> And, you're obviously injecting data in terms of the trip to the store as well, right? You're taking that into consideration in the model, right? >> Yeah that's mainly to make sure that the pallets are stable during transportation. >> Right. >> And that is already determined how much containment force is required when your stretch and wrap each pallet. So that's one of the variables that is measured, but the inputs and outputs are-- the input is the amount of material that is being used in terms of gram weight. We are trying to minimize that. So that's what the whole machine learning exercise was. >> And the data comes from where? Is it observation, maybe instrumented? >> Yeah, the instruments. Our stretch-wrapper machines have an ignition platform, which is a Scada platform that allows us to measure all of these variables. We would be able to get machine variable information from those machines and then be able to hopefully, one day, automate that process, so the feedback loop that says "On this profile, we've not had any breaks. We can continue," or if there have been frequent breaks on a certain profile or machine setting, then we can change that dynamically as the product is moving through the manufacturing process. >> Yeah, so think of it as, it's kind of a traditional manufacturing production line optimization and prediction problem right? It's minimizing waste, right, while maximizing the output and then throughput of the production line. When you optimize a production line, the first step is to predict what's going to go wrong, and then the next step would be to include precision optimization to say "How do we maximize? Using the constraints that the predictive models give us, how do we maximize the output of the production line?" This is not a unique situation. It's a unique material that we haven't really worked with, but they had some really good data on this material, how it behaves, and that's key, as you know, Dave, and probable most of the people watching this know, labeled data is the hardest part of doing machine learning, and building those features from that labeled data, and they had some great data for us to start with. >> Okay, so you're collecting data at the edge essentially, then you're using that to feed the models, which is running, I don't know, where's it running, your data center? Your cloud? >> Yeah, in our data center, there's an instance of DSX Local. >> Okay. >> That we stood up. Most of the data is running through that. We build the models there. And then our goal is to be able to deploy to the edge where we can complete the loop in terms of the feedback that happens. >> And iterate. (Shreesha nods) >> And DSX Local, is Data Science Experience Local? >> Yes. >> Slash Watson Studio, so they're the same thing. >> Okay now, what role did IBM and the Data Science Elite Team play? You could take us through that. >> So, as we discussed earlier, adopting data science is not that easy. It requires subject matter, expertise. It requires understanding of data science itself, the tools and techniques, and IBM brought that as a part of the Data Science Elite Team. They brought both the tools and the expertise so that we could get on that journey towards AI. >> And it's not a "do the work for them." It's a "teach to fish," and so my team sat side by side with the Niagara Bottling team, and we walked them through the process, so it's not a consulting engagement in the traditional sense. It's how do we help them learn how to do it? So it's side by side with their team. Our team sat there and walked them through it. >> For how many weeks? >> We've had about two sprints already, and we're entering the third sprint. It's been about 30-45 days between sprints. >> And you have your own data science team. >> Yes. Our team is coming up to speed using this project. They've been trained but they needed help with people who have done this, been there, and have handled some of the challenges of modeling and data science. >> So it accelerates that time to --- >> Value. >> Outcome and value and is a knowledge transfer component -- >> Yes, absolutely. >> It's occurring now, and I guess it's ongoing, right? >> Yes. The engagement is unique in the sense that IBM's team came to our factory, understood what that process, the stretch-wrap process looks like so they had an understanding of the physical process and how it's modeled with the help of the variables and understand the data science modeling piece as well. Once they know both side of the equation, they can help put the physical problem and the digital equivalent together, and then be able to correlate why things are happening with the appropriate data that supports the behavior. >> Yeah and then the constraints of the one use case and up to 90 days, there's no charge for those two. Like I said, it's paramount that our clients like Niagara know how to do this successfully in their enterprise. >> It's a freebie? >> No, it's no charge. Free makes it sound too cheap. (everybody laughs) >> But it's part of obviously a broader arrangement with buying hardware and software, or whatever it is. >> Yeah, its a strategy for us to help make sure our clients are successful, and I want it to minimize the activation energy to do that, so there's no charge, and the only requirements from the client is it's a real use case, they at least match the resources I put on the ground, and they sit with us and do things like this and act as a reference and talk about the team and our offerings and their experiences. >> So you've got to have skin in the game obviously, an IBM customer. There's got to be some commitment for some kind of business relationship. How big was the collective team for each, if you will? >> So IBM had 2-3 data scientists. (Dave takes notes) Niagara matched that, 2-3 analysts. There were some working with the machines who were familiar with the machines and others who were more familiar with the data acquisition and data modeling. >> So each of these engagements, they cost us about $250,000 all in, so they're quite an investment we're making in our clients. >> I bet. I mean, 2-3 weeks over many, many weeks of super geeks time. So you're bringing in hardcore data scientists, math wizzes, stat wiz, data hackers, developer--- >> Data viz people, yeah, the whole stack. >> And the level of skills that Niagara has? >> We've got actual employees who are responsible for production, our manufacturing analysts who help aid in troubleshooting problems. If there are breakages, they go analyze why that's happening. Now they have data to tell them what to do about it, and that's the whole journey that we are in, in trying to quantify with the help of data, and be able to connect our systems with data, systems and models that help us analyze what happened and why it happened and what to do before it happens. >> Your team must love this because they're sort of elevating their skills. They're working with rock star data scientists. >> Yes. >> And we've talked about this before. A point that was made here is that it's really important in these projects to have people acting as product owners if you will, subject matter experts, that are on the front line, that do this everyday, not just for the subject matter expertise. I'm sure there's executives that understand it, but when you're done with the model, bringing it to the floor, and talking to their peers about it, there's no better way to drive this cultural change of adopting these things and having one of your peers that you respect talk about it instead of some guy or lady sitting up in the ivory tower saying "thou shalt." >> Now you don't know the outcome yet. It's still early days, but you've got a model built that you've got confidence in, and then you can iterate that model. What's your expectation for the outcome? >> We're hoping that preliminary results help us get up the learning curve of data science and how to leverage data to be able to make decisions. So that's our idea. There are obviously optimal settings that we can use, but it's going to be a trial and error process. And through that, as we collect data, we can understand what settings are optimal and what should we be using in each of the plants. And if the plants decide, hey they have a subjective preference for one profile versus another with the data we are capturing we can measure when they deviated from what we specified. We have a lot of learning coming from the approach that we're taking. You can't control things if you don't measure it first. >> Well, your objectives are to transcend this one project and to do the same thing across. >> And to do the same thing across, yes. >> Essentially pay for it, with a quick return. That's the way to do things these days, right? >> Yes. >> You've got more narrow, small projects that'll give you a quick hit, and then leverage that expertise across the organization to drive more value. >> Yes. >> Love it. What a great story, guys. Thanks so much for coming to theCUBE and sharing. >> Thank you. >> Congratulations. You must be really excited. >> No. It's a fun project. I appreciate it. >> Thanks for having us, Dave. I appreciate it. >> Pleasure, Seth. Always great talking to you, and keep it right there everybody. You're watching theCUBE. We're live from New York City here at the Westin Hotel. cubenyc #cubenyc Check out the ibm.com/winwithai Change the Game: Winning with AI Tonight. We'll be right back after a short break. (minimal upbeat music)
SUMMARY :
Brought to you by IBM. at Terminal 5 of the West Side Highway, I think we met in the snowstorm in Boston, sparked something When we were both trapped there. Yep, and at that time, we spent a lot of time and we found a consistent theme with all the clients, So, at this point, I ask, "Well, do you have As a matter of fact, Dave, we do. Yeah, so you're not a bank with a trillion dollars Well, Niagara Bottling is the biggest private label and that's really where you sit in the organization, right? and business analytics as well as I support some of the And we can kind of go through the case study. So the current project that we leveraged IBM's help was And over breakfast we were talking. (everyone laughs) It's called pellets to pallets. Yes, in fact, we do bore wells and So if we use too much plastic, we're not optimally, as we wrap the pallets, whether we are wrapping it too little material, and so we can achieve some savings so we want to try and avoid that. and how much variability is in there? goes around each of the pallet. So they had labeled data that was, "if we stretch it this that we've built with them. Yeah that's mainly to make sure that the pallets So that's one of the variables that is measured, one day, automate that process, so the feedback loop the predictive models give us, how do we maximize the Yeah, in our data center, Most of the data And iterate. the Data Science Elite Team play? so that we could get on that journey towards AI. And it's not a "do the work for them." and we're entering the third sprint. some of the challenges of modeling and data science. that supports the behavior. Yeah and then the constraints of the one use case No, it's no charge. with buying hardware and software, or whatever it is. minimize the activation energy to do that, There's got to be some commitment for some and others who were more familiar with the So each of these engagements, So you're bringing in hardcore data scientists, math wizzes, and that's the whole journey that we are in, in trying to Your team must love this because that are on the front line, that do this everyday, and then you can iterate that model. And if the plants decide, hey they have a subjective and to do the same thing across. That's the way to do things these days, right? across the organization to drive more value. Thanks so much for coming to theCUBE and sharing. You must be really excited. I appreciate it. I appreciate it. Change the Game: Winning with AI Tonight.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Shreesha Rao | PERSON | 0.99+ |
Seth Dobern | PERSON | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
Dave Vellante | PERSON | 0.99+ |
Walmarts | ORGANIZATION | 0.99+ |
Costcos | ORGANIZATION | 0.99+ |
Dave | PERSON | 0.99+ |
30 | QUANTITY | 0.99+ |
Boston | LOCATION | 0.99+ |
New York City | LOCATION | 0.99+ |
California | LOCATION | 0.99+ |
Seth Dobrin | PERSON | 0.99+ |
60 | QUANTITY | 0.99+ |
Niagara | ORGANIZATION | 0.99+ |
Seth | PERSON | 0.99+ |
Shreesha | PERSON | 0.99+ |
U.S. | LOCATION | 0.99+ |
Sreesha Rao | PERSON | 0.99+ |
third sprint | QUANTITY | 0.99+ |
90 days | QUANTITY | 0.99+ |
two | QUANTITY | 0.99+ |
first step | QUANTITY | 0.99+ |
Inderpal Bhandari | PERSON | 0.99+ |
Niagara Bottling | ORGANIZATION | 0.99+ |
Python | TITLE | 0.99+ |
both | QUANTITY | 0.99+ |
tonight | DATE | 0.99+ |
ibm.com/winwithai | OTHER | 0.99+ |
one | QUANTITY | 0.99+ |
Terminal 5 | LOCATION | 0.99+ |
two years | QUANTITY | 0.99+ |
about $250,000 | QUANTITY | 0.98+ |
Times Square | LOCATION | 0.98+ |
Scala | TITLE | 0.98+ |
2018 | DATE | 0.98+ |
15-20% | QUANTITY | 0.98+ |
IBM Analytics | ORGANIZATION | 0.98+ |
each | QUANTITY | 0.98+ |
today | DATE | 0.98+ |
each pallet | QUANTITY | 0.98+ |
Kaggle | ORGANIZATION | 0.98+ |
West Side Highway | LOCATION | 0.97+ |
Each pallet | QUANTITY | 0.97+ |
4 sprints | QUANTITY | 0.97+ |
About 250 grams | QUANTITY | 0.97+ |
both side | QUANTITY | 0.96+ |
Data Science Elite Team | ORGANIZATION | 0.96+ |
one day | QUANTITY | 0.95+ |
every single year | QUANTITY | 0.95+ |
Niagara Bottling | PERSON | 0.93+ |
about two sprints | QUANTITY | 0.93+ |
one end | QUANTITY | 0.93+ |
R | TITLE | 0.92+ |
2-3 weeks | QUANTITY | 0.91+ |
one profile | QUANTITY | 0.91+ |
50-60 analysts | QUANTITY | 0.91+ |
trillion dollars | QUANTITY | 0.9+ |
2-3 data scientists | QUANTITY | 0.9+ |
about 30-45 days | QUANTITY | 0.88+ |
almost 16 million pallets of water | QUANTITY | 0.88+ |
Big Apple | LOCATION | 0.87+ |
couple years ago | DATE | 0.87+ |
last 18 months | DATE | 0.87+ |
Westin Hotel | ORGANIZATION | 0.83+ |
pallet | QUANTITY | 0.83+ |
#cubenyc | LOCATION | 0.82+ |
2833 bottles of water per second | QUANTITY | 0.82+ |
the Game: Winning with AI | TITLE | 0.81+ |
Carol Carpenter, Google Cloud & Ayin Vala, Precision Medicine | Google Cloud Next 2018
>> Live from San Francisco, it's the Cube, covering Google Cloud Next 2018. Brought to you by Google Cloud and its ecosystem partners. >> Hello and welcome back to The Cube coverage here live in San Francisco for Google Cloud's conference Next 2018, #GoogleNext18. I'm John Furrier with Jeff Frick, my cohost all week. Third day of three days of wall to wall live coverage. Our next guest, Carol Carpenter, Vice President of Product Marketing for Google Cloud. And Ayin Vala, Chief Data Science Foundation for Precision Medicine. Welcome to The Cube, thanks for joining us. >> Thank you for having us. >> So congratulations, VP of Product Marketing. Great job getting all these announcements out, all these different products. Open source, big query machine learning, Istio, One dot, I mean, all this, tons of products, congratulations. >> Thank you, thank you. It was a tremendous amount of work. Great team. >> So you guys are starting to show real progress in customer traction, customer scale. Google's always had great technology. Consumption side of it, you guys have made progress. Diane Green mentioned on stage, on day one, she mentioned health care. She mentioned how you guys are organizing around these verticals. Health care is one of the big areas. Precision Medicine, AI usage, tell us about your story. >> Yes, so we are a very small non-profit. And we are at the intersection of data science and medical science and we work on projects that have non-profits impact and social impact. And we work on driving and developing projects that have social impact and in personalized medicine. >> So I think it's amazing. I always think with medicine, right, you look back five years wherever you are and you look back five years and think, oh my god, that was completely barbaric, right. They used to bleed people out and here, today, we still help cancer patients by basically poisoning them until they almost die and hopefully it kills the cancer first. You guys are looking at medicine in a very different way and the future medicine is so different than what it is today. And talk about, what is Presicion Medicine? Just the descriptor, it's a very different approach to kind of some of the treatments that we still use today in 2018. It's crazy. >> Yes, so Presicion Medicine has the meaning of personalized medicine. Meaning that we hone it into smaller population of people to trying to see what is the driving factors, individually customized to those populations and find out the different variables that are important for that population of people for detection of the disease, you know, cancer, Alzheimer's, those things. >> Okay, talk about the news. Okay, go ahead. >> Oh, oh, I was just going to say. And to be able to do what he's doing requires a lot of computational power to be able to actually get that precise. >> Right. Talk about the relationship and the news you guys have here. Some interesting stuff. Non-profits, they need compute power, they need, just like an eneterprise. You guys are bringing some change. What's the relationship between you guys? How are you working together? >> So one of our key messages here at this event is really around making computing available for everyone. Making data and analytics and machine learning available for everyone. This whole idea of human-centered AI. And what we've realized is, you know, data is the new natural resource. >> Yeah. >> In the world these days. And companies that know how to take advantage and actually mine insights from the data to solve problems like what they're solving at Precision Medicine. That is really where the new breakthroughs are going to come. So we announced a program here at the event, It's called Data Solutions for Change. It's from Google Cloud and it's a program in addition to our other non-profit programs. So we actually have other programs like Google Earth for non-profits. G Suite for non-profits. This one is very much focused on harnessing and helping non-profits extract insights from data. >> And is it a funding program, is it technology transfer Can you talk about, just a little detail on how it actually works. >> It's actually a combination of three things. One is funding, it's credits for up to $5,000 a month for up to six months. As well as customer support. One thing we've all talked about is the technology is amazing. You often also need to be able to apply some business logic around it and data scientists are somewhat of a challenge to hire these days. >> Yeah. >> So we're also proving free customer support, as well as online learning. >> Talk about an impact of the Cloud technology for the non-proit because6 I, you know, I'm seeing so much activity, certainly in Washington D.C. and around the world, where, you know, since the Jobs Act, fundings have changed. You got great things happening. You can have funding on mission-based funding. And also, the legacy of brand's are changing and open source changes So faster time to value. (laughs) >> Right. >> And without all the, you know, expertise it's an issue. How is Cloud helping you be better at what you do? Can you give some examples? >> Yes, so we had two different problems early on, as a small non-profit. First of all, we needed to scale up computationally. We had in-house servers. We needed a HIPAA complaint way to put our data up. So that's one of the reasons we were able to even use Google Cloud in the beginning. And now, we are able to run our models or entire data sets. Before that, we were only using a small population. And in Presicion Medicine, that's very important 'cause you want to get% entire population. That makes your models much more accurate. The second things was, we wanted to collaborate with people with clinical research backgrounds. And we need to provide a platform for them to be able to use, have the data on there, visualize, do computations, anything they want to do. And being on a Cloud really helped us to collaborate much more smoothly and you know, we only need their Gmail access, you know to Gmail to give them access and things. >> Yeah. >> And we could do it very, very quickly. Whereas before, it would take us months to transfer data. >> Yeah, it's a huge savings. Talk about the machine learning, AutoML's hot at the show, obviously, hot trend. You start to see AI ops coming in and disrupt more of the enterprise side but as data scientists, as you look at some of these machine learnings, I mean, you must get pretty excited. What are you thinking? What's your vision and how you going to use, like BigQuery's got ML built in now. This is like not new, it's Google's been using it for awhile. Are you tapping some of that? And what's your team doing with ML? >> Absolutely. We use BigQuery ML. We were able to use a few months in advance. It's great 'cause our data scientists like to work in BigQuery. They used to see, you know, you query the data right there. You can actually do the machine learning on there too. And you don't have to send it to different part of the platform for that. And it gives you sort of a proof of concept right away. For doing deep learning and those things, we use Cloud ML still, but for early on, you want to see if there is potential in a data. And you're able to do that very quickly with BigQuery ML right there. We also use AutoML Vision. We had access to about a thousand patients for MRI images and we wanted to see if we can detect Alzheimer's based on those. And we used AutoML for that. Actually works well. >> Some of the relationships with doctors, they're not always seen as the most tech savvy. So now they are getting more. As you do all this high-end, geeky stuff, you got to push it out to an interface. Google's really user-centric philosophy with user interfaces has always been kind of known for. Is that in Sheets, is that G Suite? How will you extend out the analysis and the interactions. How do you integrate into the edge work flow? You know? (laughs) >> So one thing I really appreciated for Google Cloud was that it was, seems to me it's built from the ground up for everyone to use. And it was the ease of access was very, was very important to us, like I said. We have data scientisits and statisticians and computer scientists onboard. But we needed a method and a platform that everybody can use. And through this program, they actually.. You guys provide what's called Qwiklab, which is, you know, screenshot of how to spin up a virtual machine and things like that. That, you know, a couple of years ago you have to run, you know, few command lines, too many command lines, to get that. Now it's just a push of a button. So that's just... Makes it much easier to work with people with background and domain knowledge and take away that 80% of the work, that's just a data engineering work that they don't want to do. >> That's awesome stuff. Well congratulations. Carol, a question to you is How does someone get involved in the Data Solutions for Change? An application? Online? Referral? I mean, how do these work? >> All of the above. (John laughs) We do have an online application and we welcome all non-profits to apply if they have a clear objective data problem that they want to solve. We would love to be able to help them. >> Does scope matter, big size, is it more mission? What's the mission criteria? Is there a certain bar to reach, so to speak, or-- >> Yeah, I mean we're most focused on... there really is not size, in terms of size of the non-profit or the breadth. It's much more around, do you have a problem that data and analytics can actually address. >> Yeah. >> So really working on problems that matter. And in addition, we actually announced this week that we are partnering with United Nations on a contest. It's called Sustainable.. It's for Visualize 2030 >> Yeah. >> So there are 17 sustainable development goals. >> Right, righr. >> And so, that's aimed at college students and storytelling to actually address one of these 17 areas. >> We'd love to follow up after the show, talk about some of the projects. since you have a lot of things going on. >> Yeah. >> Use of technology for good really is important right now, that people see that. People want to work for mission-driven organizations. >> Absolutely >> This becomes a clear citeria. Thanks for coming on. Appreciate it. Thanks for coming on today. Acute coverage here at Google Could Next 18 I'm John Furrier with Jeff Fricks. Stay with us. More coverage after this short break. (upbeat music)
SUMMARY :
Brought to you by Google Cloud Welcome to The Cube, thanks for joining us. So congratulations, VP of Product Marketing. It was a tremendous amount of work. So you guys are starting to show real progress And we work on driving and developing and you look back five years for that population of people for detection of the disease, Okay, talk about the news. And to be able to do what he's doing and the news you guys have here. And what we've realized is, you know, And companies that know how to take advantage Can you talk about, just a little detail You often also need to be able to apply So we're also proving free customer support, And also, the legacy of brand's are changing And without all the, you know, expertise So that's one of the reasons we And we could do it very, very quickly. and disrupt more of the enterprise side And you don't have to send it to different Some of the relationships with doctors, and take away that 80% of the work, Carol, a question to you is All of the above. It's much more around, do you have a problem And in addition, we actually announced this week and storytelling to actually address one of these 17 areas. since you have a lot of things going on. Use of technology for good really is important right now, Thanks for coming on today.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Jeff Frick | PERSON | 0.99+ |
Carol Carpenter | PERSON | 0.99+ |
Diane Green | PERSON | 0.99+ |
80% | QUANTITY | 0.99+ |
Ayin Vala | PERSON | 0.99+ |
United Nations | ORGANIZATION | 0.99+ |
Carol | PERSON | 0.99+ |
ORGANIZATION | 0.99+ | |
San Francisco | LOCATION | 0.99+ |
Washington D.C. | LOCATION | 0.99+ |
Jeff Fricks | PERSON | 0.99+ |
Precision Medicine | ORGANIZATION | 0.99+ |
John | PERSON | 0.99+ |
five years | QUANTITY | 0.99+ |
John Furrier | PERSON | 0.99+ |
One | QUANTITY | 0.99+ |
three days | QUANTITY | 0.99+ |
Jobs Act | TITLE | 0.99+ |
BigQuery | TITLE | 0.99+ |
G Suite | TITLE | 0.99+ |
2018 | DATE | 0.99+ |
17 areas | QUANTITY | 0.98+ |
one | QUANTITY | 0.98+ |
today | DATE | 0.98+ |
Third day | QUANTITY | 0.98+ |
this week | DATE | 0.98+ |
AutoML | TITLE | 0.98+ |
Cloud ML | TITLE | 0.98+ |
up to six months | QUANTITY | 0.98+ |
First | QUANTITY | 0.97+ |
Gmail | TITLE | 0.97+ |
BigQuery ML | TITLE | 0.97+ |
second things | QUANTITY | 0.97+ |
17 sustainable development goals | QUANTITY | 0.96+ |
about a thousand patients | QUANTITY | 0.95+ |
three things | QUANTITY | 0.95+ |
Google Cloud | ORGANIZATION | 0.94+ |
two different problems | QUANTITY | 0.94+ |
Google Earth | TITLE | 0.93+ |
AutoML Vision | TITLE | 0.93+ |
The Cube | ORGANIZATION | 0.93+ |
ML | TITLE | 0.93+ |
Alzheimer | OTHER | 0.91+ |
up to $5,000 a month | QUANTITY | 0.91+ |
day one | QUANTITY | 0.87+ |
couple of years ago | DATE | 0.87+ |
Istio | PERSON | 0.87+ |
first | QUANTITY | 0.85+ |
Vice President | PERSON | 0.85+ |
Google Cloud | TITLE | 0.85+ |
BigQuery ML. | TITLE | 0.85+ |
Next 2018 | DATE | 0.84+ |
one thing | QUANTITY | 0.83+ |
Qwiklab | TITLE | 0.79+ |
2030 | TITLE | 0.78+ |
Cloud | TITLE | 0.76+ |
#GoogleNext18 | EVENT | 0.73+ |
HIPAA | TITLE | 0.72+ |
Data Science Foundation | ORGANIZATION | 0.72+ |
Next 18 | TITLE | 0.7+ |
Cube | ORGANIZATION | 0.67+ |
I'm John | TITLE | 0.64+ |
tons | QUANTITY | 0.64+ |
Next | DATE | 0.63+ |
Furrier | PERSON | 0.59+ |
messages | QUANTITY | 0.58+ |
Ram Venkatesh, Hortonworks & Sudhir Hasbe, Google | DataWorks Summit 2018
>> Live from San Jose, in the heart of Silicon Valley, it's theCUBE, covering DataWorks Summit 2018. Brought to you by HortonWorks. >> We are wrapping up Day One of coverage of Dataworks here in San Jose, California on theCUBE. I'm your host, Rebecca Knight, along with my co-host, James Kobielus. We have two guests for this last segment of the day. We have Sudhir Hasbe, who is the director of product management at Google and Ram Venkatesh, who is VP of Engineering at Hortonworks. Ram, Sudhir, thanks so much for coming on the show. >> Thank you very much. >> Thank you. >> So, I want to start out by asking you about a joint announcement that was made earlier this morning about using some Hortonworks technology deployed onto Google Cloud. Tell our viewers more. >> Sure, so basically what we announced was support for the Hortonworks DataPlatform and Hortonworks DataFlow, HDP and HDF, running on top of the Google Cloud Platform. So this includes deep integration with Google's cloud storage connector layer as well as it's a certified distribution of HDP to run on the Google Cloud Platform. >> I think the key thing is a lot of our customers have been telling us they like the familiar environment of Hortonworks distribution that they've been using on-premises and as they look at moving to cloud, like in GCP, Google Cloud, they want the similar, familiar environment. So, they want the choice to deploy on-premises or Google Cloud, but they want the familiarity of what they've already been using with Hortonworks products. So this announcement actually helps customers pick and choose like whether they want to run Hortonworks distribution on-premises, they want to do it in cloud, or they wat to build this hybrid solution where the data can reside on-premises, can move to cloud and build these common, hybrid architecture. So, that's what this does. >> So, HDP customers can store data in the Google Cloud. They can execute ephemeral workloads, analytic workloads, machine learning in the Google Cloud. And there's some tie-in between Hortonworks's real-time or low latency or streaming capabilities from HDF in the Google Cloud. So, could you describe, at a full sort of detail level, the degrees of technical integration between your two offerings here. >> You want to take that? >> Sure, I'll handle that. So, essentially, deep in the heart of HDP, there's the HDFS layer that includes Hadoop compatible file system which is a plug-able file system layer. So, what Google has done is they have provided an implementation of this API for the Google Cloud Storage Connector. So this is the GCS Connector. We've taken the connector and we've actually continued to refine it to work with our workloads and now Hortonworks has actually bundling, packaging, and making this connector be available as part of HDP. >> So bilateral data movement between them? Bilateral workload movement? >> No, think of this as being very efficient when our workloads are running on top of GCP. When they need to get at data, they can get at data that is in the Google Cloud Storage buckets in a very, very efficient manner. So, since we have fairly deep expertise on workloads like Apache Hive and Apache Spark, we've actually done work in these workloads to make sure that they can run efficiently, not just on HDFS, but also in the cloud storage connector. This is a critical part of making sure that the architecture is actually optimized for the cloud. So, at our skill and our customers are moving their workloads from on-premise to the cloud, it's not just functional parity, but they also need sort of the operational and the cost efficiency that they're looking for as they move to the cloud. So, to do that, we need to enable these fundamental disaggregated storage pattern. See, on-prem, the big win with Hadoop was we could bring the processing to where the data was. In the cloud, we need to make sure that we work well when storage and compute are disaggregated and they're scaled elastically, independent of each other. So this is a fairly fundamental architectural change. We want to make sure that we enable this in a first-class manner. >> I think that's a key point, right. I think what cloud allows you to do is scale the storage and compute independently. And so, with storing data in Google Cloud Storage, you can like scale that horizontally and then just leverage that as your storage layer. And the compute can independently scale by itself. And what this is allowing customers of HDP and HDF is store the data on GCP, on the cloud storage, and then just use the scale, the compute side of it with HDP and HDF. >> So, if you'll indulge me to a name, another Hortonworks partner for just a hypothetical. Let's say one of your customers is using IBM Data Science Experience to do TensorFlow modeling and training, can they then inside of HDP on GCP, can they use the compute infrastructure inside of GCP to do the actual modeling which is more compute intensive and then the separate decoupled storage infrastructure to do the training which is more storage intensive? Is that a capability that would available to your customers? With this integration with Google? >> Yeah, so where we are going with this is we are saying, IBM DSX and other solutions that are built on top of HDP, they can transparently take advantage of the fact that they have HDP compute infrastructure to run against. So, you can run your machine learning training jobs, you can run your scoring jobs and you can have the same unmodified DSX experience whether you're running against an on-premise HDP environment or an in-cloud HDP environment. Further, that's sort of the benefit for partners and partner solutions. From a customer standpoint, the big value prop here is that customers, they're used to securing and governing their data on-prem in their particular way with HDP, with Apache Ranger, Atlas, and so forth. So, when they move to the cloud, we want this experience to be seamless from a management standpoint. So, from a data management standpoint, we want all of their learning from a security and governance perspective to apply when they are running in Google Cloud as well. So, we've had this capability on Azure and on AWS, so with this partnership, we are announcing the same type of deep integration with GCP as well. >> So Hortonworks is that one pane of glass across all your product partners for all manner of jobs. Go ahead, Rebecca. >> Well, I just wanted to ask about, we've talked about the reason, the impetus for this. With the customer, it's more familiar for customers, it offers the seamless experience, But, can you delve a little bit into the business problems that you're solving for customers here? >> A lot of times, our customers are at various points on their cloud journey, that for some of them, it's very simple, they're like there's a broom coming by and the datacenter is going away in 12 months and I need to be in the cloud. So, this is where there is a wholesale movement of infrastructure from on-premise to the cloud. Others are exploring individual business use cases. So, for example, one of our large customers, a travel partner, so they are exploring their new pricing model and they want to roll out this pricing model in the cloud. They have on-premise infrastructure, they know they have that for a while. They are spinning up new use cases in the cloud typically for reasons of agility. So, if you, typically many of our customers, they operate large, multi-tenant clusters on-prem. That's nice for, so a very scalable compute for running large jobs. But, if you want to run, for example, a new version of Spark, you have to upgrade the entire cluster before you can do that. Whereas in this sort of model, what they can say is, they can bring up a new workload and just have the specific versions and dependency that it needs, independent of all of their other infrastructure. So this gives them agility where they can move as fast as... >> Through the containerization of the Spark jobs or whatever. >> Correct, and so containerization as well as even spinning up an entire new environment. Because, in the cloud, given that you have access to elastic compute resources, they can come and go. So, your workloads are much more independent of the underlying cluster than they are on-premise. And this is where sort of the core business benefits around agility, speed of deployment, things like that come into play. >> And also, if you look at the total cost of ownership, really take an example where customers are collecting all this information through the month. And, at month end, you want to do closing of books. And so that's a great example where you want ephemeral workloads. So this is like do it once in a month, finish the books and close the books. That's a great scenario for cloud where you don't have to on-premises create an infrastructure, keep it ready. So that's one example where now, in the new partnership, you can collect all the data through the on-premises if you want throughout the month. But, move that and leverage cloud to go ahead and scale and do this workload and finish the books and all. That's one, the second example I can give is, a lot of customers collecting, like they run their e-commerce platforms and all on-premises, let's say they're running it. They can still connect all these events through HDP that may be running on-premises with Kafka and then, what you can do is, in-cloud, in GCP, you can deploy HDP, HDF, and you can use the HDF from there for real-time stream processing. So, collect all these clickstream events, use them, make decisions like, hey, which products are selling better?, should we go ahead and give?, how many people are looking at that product?, or how many people have bought it?. That kind of aggregation and real-time at scale, now you can do in-cloud and build these hybrid architectures that are there. And enable scenarios where in past, to do that kind of stuff, you would have to procure hardware, deploy hardware, all of that. Which all goes away. In-cloud, you can do that much more flexibly and just use whatever capacity you have. >> Well, you know, ephemeral workloads are at the heart of what many enterprise data scientists do. Real-world experiments, ad-hoc experiments, with certain datasets. You build a TensorFlow model or maybe a model in Caffe or whatever and you deploy it out to a cluster and so the life of a data scientist is often nothing but a stream of new tasks that are all ephemeral in their own right but are part of an ongoing experimentation program that's, you know, they're building and testing assets that may be or may not be deployed in the production applications. That's you know, so I can see a clear need for that, well, that capability of this announcement in lots of working data science shops in the business world. >> Absolutely. >> And I think coming down to, if you really look at the partnership, right. There are two or three key areas where it's going to have a huge advantage for our customers. One is analytics at-scale at a lower cost, like total cost of ownership, reducing that, running at-scale analytics. That's one of the big things. Again, as I said, the hybrid scenarios. Most customers, enterprise customers have huge deployments of infrastructure on-premises and that's not going to go away. Over a period of time, leveraging cloud is a priority for a lot of customers but they will be in these hybrid scenarios. And what this partnership allows them to do is have these scenarios that can span across cloud and on-premises infrastructure that they are building and get business value out of all of these. And then, finally, we at Google believe that the world will be more and more real-time over a period of time. Like, we already are seeing a lot of these real-time scenarios with IoT events coming in and people making real-time decisions. And this is only going to grow. And this partnership also provides the whole streaming analytics capabilities in-cloud at-scale for customers to build these hybrid plus also real-time streaming scenarios with this package. >> Well it's clear from Google what the Hortonworks partnership gives you in this competitive space, in the multi-cloud space. It gives you that ability to support hybrid cloud scenarios. You're one of the premier public cloud providers and we all know about. And clearly now that you got, you've had the Hortonworks partnership, you have that ability to support those kinds of highly hybridized deployments for your customers, many of whom I'm sure have those requirements. >> That's perfect, exactly right. >> Well a great note to end on. Thank you so much for coming on theCUBE. Sudhir, Ram, that you so much. >> Thank you, thanks a lot. >> Thank you. >> I'm Rebecca Knight for James Kobielus, we will have more tomorrow from DataWorks. We will see you tomorrow. This is theCUBE signing off. >> From sunny San Jose. >> That's right.
SUMMARY :
in the heart of Silicon Valley, for coming on the show. So, I want to start out by asking you to run on the Google Cloud Platform. and as they look at moving to cloud, in the Google Cloud. So, essentially, deep in the heart of HDP, and the cost efficiency is scale the storage and to do the training which and you can have the same that one pane of glass With the customer, it's and just have the specific of the Spark jobs or whatever. of the underlying cluster and then, what you can and so the life of a data that the world will be And clearly now that you got, Sudhir, Ram, that you so much. We will see you tomorrow.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
James Kobielus | PERSON | 0.99+ |
Rebecca Knight | PERSON | 0.99+ |
Rebecca | PERSON | 0.99+ |
two | QUANTITY | 0.99+ |
Sudhir | PERSON | 0.99+ |
Ram Venkatesh | PERSON | 0.99+ |
San Jose | LOCATION | 0.99+ |
HortonWorks | ORGANIZATION | 0.99+ |
Sudhir Hasbe | PERSON | 0.99+ |
ORGANIZATION | 0.99+ | |
Hortonworks | ORGANIZATION | 0.99+ |
Silicon Valley | LOCATION | 0.99+ |
two guests | QUANTITY | 0.99+ |
San Jose, California | LOCATION | 0.99+ |
DataWorks | ORGANIZATION | 0.99+ |
tomorrow | DATE | 0.99+ |
Ram | PERSON | 0.99+ |
AWS | ORGANIZATION | 0.99+ |
one example | QUANTITY | 0.99+ |
one | QUANTITY | 0.99+ |
two offerings | QUANTITY | 0.98+ |
12 months | QUANTITY | 0.98+ |
One | QUANTITY | 0.98+ |
Day One | QUANTITY | 0.98+ |
DataWorks Summit 2018 | EVENT | 0.97+ |
IBM | ORGANIZATION | 0.97+ |
second example | QUANTITY | 0.97+ |
Google Cloud Platform | TITLE | 0.96+ |
Atlas | ORGANIZATION | 0.96+ |
Google Cloud | TITLE | 0.94+ |
Apache Ranger | ORGANIZATION | 0.92+ |
three key areas | QUANTITY | 0.92+ |
Hadoop | TITLE | 0.91+ |
Kafka | TITLE | 0.9+ |
theCUBE | ORGANIZATION | 0.88+ |
earlier this morning | DATE | 0.87+ |
Apache Hive | ORGANIZATION | 0.86+ |
GCP | TITLE | 0.86+ |
one pane | QUANTITY | 0.86+ |
IBM Data Science | ORGANIZATION | 0.84+ |
Azure | TITLE | 0.82+ |
Spark | TITLE | 0.81+ |
first | QUANTITY | 0.79+ |
HDF | ORGANIZATION | 0.74+ |
once in a month | QUANTITY | 0.73+ |
HDP | ORGANIZATION | 0.7+ |
TensorFlow | OTHER | 0.69+ |
Hortonworks DataPlatform | ORGANIZATION | 0.67+ |
Apache Spark | ORGANIZATION | 0.61+ |
GCS | OTHER | 0.57+ |
HDP | TITLE | 0.5+ |
DSX | TITLE | 0.49+ |
Cloud Storage | TITLE | 0.47+ |
Pandit Prasad, IBM | DataWorks Summit 2018
>> From San Jose, in the heart of Silicon Valley, it's theCube. Covering DataWorks Summit 2018. Brought to you by Hortonworks. (upbeat music) >> Welcome back to theCUBE's live coverage of Data Works here in sunny San Jose, California. I'm your host Rebecca Knight along with my co-host James Kobielus. We're joined by Pandit Prasad. He is the analytics, projects, strategy, and management at IBM Analytics. Thanks so much for coming on the show. >> Thanks Rebecca, glad to be here. >> So, why don't you just start out by telling our viewers a little bit about what you do in terms of in relationship with the Horton Works relationship and the other parts of your job. >> Sure, as you said I am in Offering Management, which is also known as Product Management for IBM, manage the big data portfolio from an IBM perspective. I was also working with Hortonworks on developing this relationship, nurturing that relationship, so it's been a year since the Northsys partnership. We announced this partnership exactly last year at the same conference. And now it's been a year, so this year has been a journey and aligning the two portfolios together. Right, so Hortonworks had HDP HDF. IBM also had similar products, so we have for example, Big Sequel, Hortonworks has Hive, so how Hive and Big Sequel align together. IBM has a Data Science Experience, where does that come into the picture on top of HDP, so it means before this partnership if you look into the market, it has been you sell Hadoop, you sell a sequel engine, you sell Data Science. So what this year has given us is more of a solution sell. Now with this partnership we go to the customers and say here is NTN experience for you. You start with Hadoop, you put more analytics on top of it, you then bring Big Sequel for complex queries and federation visualization stories and then finally you put Data Science on top of it, so it gives you a complete NTN solution, the NTN experience for getting the value out of the data. >> Now IBM a few years back released a Watson data platform for team data science with DSX, data science experience, as one of the tools for data scientists. Is Watson data platform still the core, I call it dev ops for data science and maybe that's the wrong term, that IBM provides to market or is there sort of a broader dev ops frame work within which IBM goes to market these tools? >> Sure, Watson data platform one year ago was more of a cloud platform and it had many components of it and now we are getting a lot of components on to the (mumbles) and data science experience is one part of it, so data science experience... >> So Watson analytics as well for subject matter experts and so forth. >> Yes. And again Watson has a whole suit of side business based offerings, data science experience is more of a a particular aspect of the focus, specifically on the data science and that's been now available on PRAM and now we are building this arm from stack, so we have HDP, HDF, Big Sequel, Data Science Experience and we are working towards adding more and more to that portfolio. >> Well you have a broader reference architecture and a stack of solutions AI and power and so for more of the deep learning development. In your relationship with Hortonworks, are they reselling more of those tools into their customer base to supplement, extend what they already resell DSX or is that outside of the scope of the relationship? >> No it is all part of the relationship, these three have been the core of what we announced last year and then there are other solutions. We have the whole governance solution right, so again it goes back to the partnership HDP brings with it Atlas. IBM has a whole suite of governance portfolio including the governance catalog. How do you expand the story from being a Hadoop-centric story to an enterprise data-like story, and then now we are taking that to the cloud that's what Truata is all about. Rob Thomas came out with a blog yesterday morning talking about Truata. If you look at it is nothing but a governed data-link hosted offering, if you want to simplify it. That's one way to look at it caters to the GDPR requirements as well. >> For GDPR for the IBM Hortonworks partnership is the lead solution for GDPR compliance, is it Hortonworks Data Steward Studio or is it any number of solutions that IBM already has for data governance and curation, or is it a combination of all of that in terms of what you, as partners, propose to customers for soup to nuts GDPR compliance? Give me a sense for... >> It is a combination of all of those so it has a HDP, its has HDF, it has Big Sequel, it has Data Science Experience, it had IBM governance catalog, it has IBM data quality and it has a bunch of security products, like Gaurdium and it has some new IBM proprietary components that are very specific towards data (cough drowns out speaker) and how do you deal with the personal data and sensitive personal data as classified by GDPR. I'm supposed to query some high level information but I'm not allowed to query deep into the personal information so how do you blog those queries, how do you understand those, these are not necessarily part of Data Steward Studio. These are some of the proprietary components that are thrown into the mix by IBM. >> One of the requirements that is not often talked about under GDPR, Ricky of Formworks got in to it a little bit in his presentation, was the notion that the requirement that if you are using an UE citizen's PII to drive algorithmic outcomes, that they have the right to full transparency. It's the algorithmic decision paths that were taken. I remember IBM had a tool under the Watson brand that wraps up a narrative of that sort. Is that something that IBM still, it was called Watson Curator a few years back, is that a solution that IBM still offers, because I'm getting a sense right now that Hortonworks has a specific solution, not to say that they may not be working on it, that addresses that side of GDPR, do you know what I'm referring to there? >> I'm not aware of something from the Hortonworks side beyond the Data Steward Studio, which offers basically identification of what some of the... >> Data lineage as opposed to model lineage. It's a subtle distinction. >> It can identify some of the personal information and maybe provide a way to tag it and hence, mask it, but the Truata offering is the one that is bringing some new research assets, after GDPR guidelines became clear and then they got into they are full of how do we cater to those requirements. These are relatively new proprietary components, they are not even being productized, that's why I am calling them proprietary components that are going in to this hosting service. >> IBM's got a big portfolio so I'll understand if you guys are still working out what position. Rebecca go ahead. >> I just wanted to ask you about this new era of GDPR. The last Hortonworks conference was sort of before it came into effect and now we're in this new era. How would you say companies are reacting? Are they in the right space for it, in the sense of they're really still understand the ripple effects and how it's all going to play out? How would you describe your interactions with companies in terms of how they're dealing with these new requirements? >> They are still trying to understand the requirements and interpret the requirements coming to terms with what that really means. For example I met with a customer and they are a multi-national company. They have data centers across different geos and they asked me, I have somebody from Asia trying to query the data so that the query should go to Europe, but the query processing should not happen in Asia, the query processing all should happen in Europe, and only the output of the query should be sent back to Asia. You won't be able to think in these terms before the GDPR guidance era. >> Right, exceedingly complicated. >> Decoupling storage from processing enables those kinds of fairly complex scenarios for compliance purposes. >> It's not just about the access to data, now you are getting into where the processing happens were the results are getting displayed, so we are getting... >> Severe penalties for not doing that so your customers need to keep up. There was announcement at this show at Dataworks 2018 of an IBM Hortonwokrs solution. IBM post-analytics with with Hortonworks. I wonder if you could speak a little bit about that, Pandit, in terms of what's provided, it's a subscription service? If you could tell us what subset of IBM's analytics portfolio is hosted for Hortonwork's customers? >> Sure, was you said, it is a a hosted offering. Initially we are starting of as base offering with three products, it will have HDP, Big Sequel, IBM DB2 Big Sequel and DSX, Data Science Experience. Those are the three solutions, again as I said, it is hosted on IBM Cloud, so customers have a choice of different configurations they can choose, whether it be VMs or bare metal. I should say this is probably the only offering, as of today, that offers bare metal configuration in the cloud. >> It's geared to data scientist developers and machine-learning models will build the models and train them in IBM Cloud, but in a hosted HDP in IBM Cloud. Is that correct? >> Yeah, I would rephrase that a little bit. There are several different offerings on the cloud today and we can think about them as you said for ad-hoc or ephemeral workloads, also geared towards low cost. You think about this offering as taking your on PRAM data center experience directly onto the cloud. It is geared towards very high performance. The hardware and the software they are all configured, optimized for providing high performance, not necessarily for ad-hoc workloads, or ephemeral workloads, they are capable of handling massive workloads, on sitcky workloads, not meant for I turned this massive performance computing power for a couple of hours and then switched them off, but rather, I'm going to run these massive workloads as if it is located in my data center, that's number one. It comes with the complete set of HDP. If you think about it there are currently in the cloud you have Hive and Hbase, the sequel engines and the stories separate, security is optional, governance is optional. This comes with the whole enchilada. It has security and governance all baked in. It provides the option to use Big Sequel, because once you get on Hadoop, the next experience is I want to run complex workloads. I want to run federated queries across Hadoop as well as other data storage. How do I handle those, and then it comes with Data Science Experience also configured for best performance and integrated together. As a part of this partnership, I mentioned earlier, that we have progress towards providing this story of an NTN solution. The next steps of that are, yeah I can say that it's an NTN solution but are the product's look and feel as if they are one solution. That's what we are getting into and I have featured some of those integrations. For example Big Sequel, IBM product, we have been working on baking it very closely with HDP. It can be deployed through Morey, it is integrated with Atlas and Granger for security. We are improving the integrations with Atlas for governance. >> Say you're building a Spark machine learning model inside a DSX on HDP within IH (mumbles) IBM hosting with Hortonworks on HDP 3.0, can you then containerize that machine learning Sparks and then deploy into an edge scenario? >> Sure, first was Big Sequel, the next one was DSX. DSX is integrated with HDP as well. We can run DSX workloads on HDP before, but what we have done now is, if you want to run the DSX workloads, I want to run a Python workload, I need to have Python libraries on all the nodes that I want to deploy. Suppose you are running a big cluster, 500 cluster. I need to have Python libraries on all 500 nodes and I need to maintain the versioning of it. If I upgrade the versions then I need to go and upgrade and make sure all of them are perfectly aligned. >> In this first version will you be able build a Spark model and a Tesorflow model and containerize them and deploy them. >> Yes. >> Across a multi-cloud and orchestrate them with Kubernetes to do all that meshing, is that a capability now or planned for the future within this portfolio? >> Yeah, we have that capability demonstrated in the pedestal today, so that is a new one integration. We can run virtual, we call it virtual Python environment. DSX can containerize it and run data that's foreclosed in the HDP cluster. Now we are making use of both the data in the cluster, as well as the infrastructure of the cluster itself for running the workloads. >> In terms of the layers stacked, is also incorporating the IBM distributed deep-learning technology that you've recently announced? Which I think is highly differentiated, because deep learning is increasingly become a set of capabilities that are across a distributed mesh playing together as is they're one unified application. Is that a capability now in this solution, or will it be in the near future? DPL distributed deep learning? >> No, we have not yet. >> I know that's on the AI power platform currently, gotcha. >> It's what we'll be talking about at next year's conference. >> That's definitely on the roadmap. We are starting with the base configuration of bare metals and VM configuration, next one is, depending on how the customers react to it, definitely we're thinking about bare metal with GPUs optimized for Tensorflow workloads. >> Exciting, we'll be tuned in the coming months and years I'm sure you guys will have that. >> Pandit, thank you so much for coming on theCUBE. We appreciate it. I'm Rebecca Knight for James Kobielus. We will have, more from theCUBE's live coverage of Dataworks, just after this.
SUMMARY :
Brought to you by Hortonworks. Thanks so much for coming on the show. and the other parts of your job. and aligning the two portfolios together. and maybe that's the wrong term, getting a lot of components on to the (mumbles) and so forth. a particular aspect of the focus, and so for more of the deep learning development. No it is all part of the relationship, For GDPR for the IBM Hortonworks partnership the personal information so how do you blog One of the requirements that is not often I'm not aware of something from the Hortonworks side Data lineage as opposed to model lineage. It can identify some of the personal information if you guys are still working out what position. in the sense of they're really still understand the and interpret the requirements coming to terms kinds of fairly complex scenarios for compliance purposes. It's not just about the access to data, I wonder if you could speak a little that offers bare metal configuration in the cloud. It's geared to data scientist developers in the cloud you have Hive and Hbase, can you then containerize that machine learning Sparks on all the nodes that I want to deploy. In this first version will you be able build of the cluster itself for running the workloads. is also incorporating the IBM distributed It's what we'll be talking next one is, depending on how the customers react to it, I'm sure you guys will have that. Pandit, thank you so much for coming on theCUBE.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Rebecca | PERSON | 0.99+ |
James Kobielus | PERSON | 0.99+ |
Rebecca Knight | PERSON | 0.99+ |
Europe | LOCATION | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
Asia | LOCATION | 0.99+ |
Rob Thomas | PERSON | 0.99+ |
San Jose | LOCATION | 0.99+ |
Silicon Valley | LOCATION | 0.99+ |
Pandit | PERSON | 0.99+ |
last year | DATE | 0.99+ |
Python | TITLE | 0.99+ |
yesterday morning | DATE | 0.99+ |
Hortonworks | ORGANIZATION | 0.99+ |
three solutions | QUANTITY | 0.99+ |
Ricky | PERSON | 0.99+ |
Northsys | ORGANIZATION | 0.99+ |
Hadoop | TITLE | 0.99+ |
Pandit Prasad | PERSON | 0.99+ |
GDPR | TITLE | 0.99+ |
IBM Analytics | ORGANIZATION | 0.99+ |
first version | QUANTITY | 0.99+ |
both | QUANTITY | 0.99+ |
one year ago | DATE | 0.98+ |
Hortonwork | ORGANIZATION | 0.98+ |
three | QUANTITY | 0.98+ |
today | DATE | 0.98+ |
DSX | TITLE | 0.98+ |
Formworks | ORGANIZATION | 0.98+ |
this year | DATE | 0.98+ |
Atlas | ORGANIZATION | 0.98+ |
first | QUANTITY | 0.98+ |
Granger | ORGANIZATION | 0.97+ |
Gaurdium | ORGANIZATION | 0.97+ |
one | QUANTITY | 0.97+ |
Data Steward Studio | ORGANIZATION | 0.97+ |
two portfolios | QUANTITY | 0.97+ |
Truata | ORGANIZATION | 0.96+ |
DataWorks Summit 2018 | EVENT | 0.96+ |
one solution | QUANTITY | 0.96+ |
one way | QUANTITY | 0.95+ |
next year | DATE | 0.94+ |
500 nodes | QUANTITY | 0.94+ |
NTN | ORGANIZATION | 0.93+ |
Watson | TITLE | 0.93+ |
Hortonworks | PERSON | 0.93+ |
Krishna Venkatraman, IBM | IBM CDO Summit Spring 2018
>> Announcer: Live, from downtown San Francisco, it's theCUBE covering IBM Chief Data Officer Strategy Summit 2018, brought to you by IBM. >> We're back at the IBM CDO Strategy Summit in San Francisco, we're at the Parc 55, you're watching theCUBE, the leader in live tech coverage. My name is Dave Vellante, and I'm here with Krishna Venkatraman, who is with IBM, he's the Vice President of Data Science and Data Governance. Krishna, thanks for coming on. >> Thank you, thank you for this opportunity. >> Oh, you're very welcome. So, let's start with your role. Your passion is really creating value from data, that's something you told me off-camera. That's a good passion to have these days. So what's your role at IBM? >> So I work for Inderpal, who's GCDO. He's the CDO for the company, and I joined IBM about a year ago, and what I was intrigued by when I talked to him early on was, you know, IBM has so many assets, it's got a huge history and legacy of technology, enormous, copious amounts of data, but most importantly, it also has a lot of experience helping customers solve problems at enterprise scale. And in my career, I started at HP Labs many, many years ago, I've been in a few startups, most recently before I joined IBM, I was at On Deck. What I've always found is that it's very hard to extract information and insights from data unless you have the end-to-end pieces in place, and when I was at On Deck, we built all of it from scratch, and I thought this would be a great opportunity to come to IBM, leverage all that great history and legacy and skill to build something that would allow data to almost be taken for granted. So, in a sense, a company doesn't have to think about the pain of getting value extracted from data, they could just say, you know, I trust data just as I trust the other things in life, like when I go buy a book, I know all the backend stuff is done for me, I can trust the product I get. And I was interested in that, and that's the role that Inderpal offered to me. >> So the opposite of On Deck, really. On Deck was kind of a blank sheet of paper, right? And so now you have a complex organization, as Inderpal was describing this morning, so big challenge. Ginni Rometty at IBM Think talked about incumbent disruptors, so that's essentially what IBM is, right? >> Exactly, exactly. The fact is IBM has a history and a culture of making their customers successful, so they understand business problems really well. They have a huge legacy in innovation around technology, and I think now is the right time to put all of those pieces together, right? To string together a lifecycle for how data can work for you, so when you embark on a data project, it doesn't have to take six months, it could be done in two or three days, because you've cobbled together how to manage data at the backend, you've got the data science and the data science lifecycle worked out, and you know how to deploy it into a business process, because you understand the business process really well. And I think, you know, those are the mismatches that I've seen happen over and over again, data isn't ready for the application of machine learning, the machine learning model really isn't well-suited to the eventual environment in which it's deployed, but I think IBM has all of that expertise, and I feel like it's an opportunity for us to tie that together. >> And everybody's trying to get, I often say, get digital right, you know, your customers, your clients, everyone talks about digital transformation, but it's really all about the data, isn't it? Getting the data right. >> Getting the data right, that's where it starts. Tomorrow, I'm doing a panel on trust, you know, we can talk about the CDO and all the great things that are happening and extracting value, but unless you have trust at the beginning and you're doing good data governance, and you're able to understand your data, all of the rest will never happen. >> But you have to have both, alright? Because if you have trust without the data value, then okay. And you do see a lot of organizations just focusing, maybe over-rotating on that privacy and trust and security, for good reason, how do you balance that information as an asset versus liability equation? Because you're trying to get value out of it, and at the same time, you're trying to protect your organization. >> Yeah. I think it's a virtuous cycle, I think they build on each other. If customers trust you with their data, they're going to give you more of it, because they know you're going to use it responsibly, and I think that's a very positive thing, so I actually look at privacy and trust as enablers to create value, rather than somehow they're in competition. >> Not a zero-sum game. >> Not at all. >> Let's talk some more about that, I mean, when you think about it, because I've heard this before, GDPR comes up. Hey, we can turn GDPR into an opportunity, it's not just this onerous, even though it is, regulatory imposition, so maybe some examples or maybe talk through how organizations can take the privacy and trust part of the equation and turn it into value. >> So very simply, what does GDPR promise, right? It's restoring the fundamental rights of data subjects, in terms of their ownership of their data and the processing of their data and the ability to know how that data is used at any point in time. Now imagine if you're a data scientist and you could, for a problem that you're trying to solve, have the same kind of guarantees. You know all about the data, you know where it resides, you know exactly what it contains. They're very similar, you know? They both are asking for the same type of information. So, in a sense, if you solve the GDPR problem well, you have to really understand your data assets very well, and you have to have it governed really well, which is exactly the same need for data scientists. So, in a way, I seem them as, you know, they're twins, separated at some point, but... >> What's interesting, too, is you think about, we were sort of talking about this off-camera, but now, you're one step away from going to a user or customer and saying here, here's your data, do what you like with it. Now okay, in the one case, GDPR, you control it, sort of. But the other is if you want to monetize your own data, why pay the search company for clicking on an ad? Why not monetize your own data based on your reputation or do you see a day where consumers will actually be able to own, truly own their own data? >> I think, as a consumer, as well as a data professional, I think that the technologies are falling into place for that model to possibly become real. So if you have something that's very valuable that other people want, there should be a way for you to get some remuneration for that, right? And maybe it's something like a blockchain. You contribute your data and then when that data is used, you get some little piece of it as your reward for that. I don't know, I think it's possible, I haven't really... >> Nirvana. I wonder if we can talk about disruption, nobody talks about that, we haven't had a ton of conversations here about disruption, it seems to be more applying disciplines to create data value, but coming from the financial services industry, there's an industry that really hasn't been highly disrupted, you know, On Deck, in a way, was trying to disrupt. Healthcare is another one that hasn't been disrupted. Aerospace really hasn't been disrupted. Other industries like publishing, music, taxis, hotels have been disrupted. The premise is, it's the data that enables that disruption. Thoughts on disruption from the standpoint of your clients and how you're helping them become incumbent disruptors? >> I think sometimes disruption happens and then you look back and you say, that was disrupted after all, and you don't notice it when it happens, so even if I look at financial services and I look at small business lending, the expectations of businesses have changed on how they would access capital in that case. Even though the early providers of that service may not be the ones who win in the end, that's a different matter, so I think the idea that, you know, and I feel like this confluence of technologies, where's there's blockchain or quantum computing or even regulation that's coming in, that's sort of forcing certain types of activities around cleaning up data, they're all happening simultaneously. I think we will see certain industries and certain processes transform dramatically. >> Orange Bank was an example that came up this morning, an all-digital bank, you can't call them, right? You can't walk into their branch. You think banks will lose control of the payment systems? They've always done a pretty good job of hanging onto them, but... >> I don't know. I think, ultimately, customers are going to go to institutions they trust, so it's all going to end up with, do you trust the entity you've given your precious commodities to, right? Your data, your information, I think companies that really take that seriously and not take it as a burden are the ones who are going to find that customers are going to reach out to them. So it's more about not necessarily whether banks are going to lose control or whether... Which banks are going to win, is the way I would look at it. >> Maybe the existing banks might get trouble, but there's so many different interesting disruption scenarios, I mean, you think about Watson in healthcare, maybe we're at the point already where machines can make better diagnoses than doctors. You think about retail, and certain retail won't go away, obviously grocery and maybe high-end luxury malls won't go away, but you wonder about the future of retail as a result of this data disruption. Your thoughts? >> On retail? I do feel like, because the data is getting more, people are going to have more access to their own information, it will lead to a change in business models in certain cases. And the friction or the forces that used to keep customers with certain businesses may dissolve, so if you don't have friction, then it's going to end up with value and loyalty and service, and those are the ones I think that will thrive. >> Client comes to you, says, Krishna, I'm really struggling with my overall data strategy, my data platform, governance, skills, all the things that Inderpal talked about this morning, where do I start? >> I would start with making sure that the client has really thought about the questions they need answered. What is it that you really want to answer with data, or it doesn't even have to be with data, for the business, with its strategy, with its tactics, there have to be a set of questions framed up that are truly important to that business. And then starting from there, you can say, you know, let's slow it down and see what technologies, what types of data will help support answering those questions. So there has to be an overarching value proposition that you're trying to solve for. And I see, you know, that's why when, the way we work in our organization is, we look at use cases as a way to drive the technology adoption. What are the big business processes you are trying to transform, what's the value you expect to create, so we have a very robust discovery process where we ask people to answer those types of questions, we help them with it. We ask them to think through what they would do if they had the perfect answer, how they will implement it, how they will measure it. And then we start working on the technology. I often think technology is an easier question to answer once you know what you want to ask. >> Totally. Is that how you spend your time, mostly working with the lines of business, trying to help them sort of answer those questions? >> That is one part of my charter. So my charter involves basically four areas, the first is data governance, just making sure that we are creating all the tools and processes so that we can guarantee that when data is used, it is trusted, it is certified, and that it's always going to be reliable. The second piece is building up a real data competency and data science competency in the organization, so we know how to use data for different types of business value, and then the third is actually taking these client engagements internally and making sure that they are successful. So our model is what we call co-creation. We ask business teams to contribute their own resources. Data engineers, data scientists, business experts. We contribute specialized skills as well. And so we're jointly in the game together, right? So that's the third piece. And the last piece is, we're building out this platform that Inderpal showed this morning, that platform needs product management, so we are also working on, what are the fundamental pieces of functionality we want in the platform, and how do we make sure they're on the roadmap and they're prioritized in the right way. >> Excellent. Well, Krishna, thanks very much for coming to theCUBE, it was a pleasure meeting you. >> Thanks. >> Alright, keep it right there everybody, we'll be back with our next guest. You're watching theCUBE live from IBM CDO Summit in San Francisco. We'll be right back. (funky electronic music) (phone dialing)
SUMMARY :
brought to you by IBM. he's the Vice President of Data for this opportunity. that's something you told me off-camera. and that's the role that And so now you have a And I think, you know, those Getting the data right. and all the great things that and at the same time, you're trying to they're going to give you more of it, I mean, when you think about it, and the ability to know But the other is if you want So if you have something the standpoint of your clients and then you look back and you say, control of the payment systems? to end up with, do you trust the entity about the future of retail so if you don't have friction, And I see, you know, that's why when, you spend your time, So that's the third piece. much for coming to theCUBE, from IBM CDO Summit in San Francisco.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Dave Vellante | PERSON | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
Krishna | PERSON | 0.99+ |
Ginni Rometty | PERSON | 0.99+ |
Krishna Venkatraman | PERSON | 0.99+ |
Orange Bank | ORGANIZATION | 0.99+ |
six months | QUANTITY | 0.99+ |
third piece | QUANTITY | 0.99+ |
San Francisco | LOCATION | 0.99+ |
second piece | QUANTITY | 0.99+ |
third | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
Tomorrow | DATE | 0.99+ |
HP Labs | ORGANIZATION | 0.99+ |
both | QUANTITY | 0.99+ |
two | QUANTITY | 0.99+ |
one part | QUANTITY | 0.99+ |
three days | QUANTITY | 0.99+ |
GDPR | TITLE | 0.99+ |
Inderpal | ORGANIZATION | 0.98+ |
Inderpal | PERSON | 0.98+ |
Parc 55 | LOCATION | 0.98+ |
one case | QUANTITY | 0.97+ |
On Deck | ORGANIZATION | 0.97+ |
this morning | DATE | 0.97+ |
twins | QUANTITY | 0.93+ |
four areas | QUANTITY | 0.91+ |
Strategy Summit 2018 | EVENT | 0.9+ |
IBM CDO Summit | EVENT | 0.9+ |
Vice President | PERSON | 0.89+ |
IBM Think | ORGANIZATION | 0.89+ |
Spring 2018 | DATE | 0.89+ |
years ago | DATE | 0.87+ |
a year ago | DATE | 0.86+ |
IBM CDO Strategy Summit | EVENT | 0.76+ |
one step | QUANTITY | 0.76+ |
Watson | ORGANIZATION | 0.74+ |
On Deck | TITLE | 0.66+ |
Data Science and Data Governance | ORGANIZATION | 0.65+ |
about | DATE | 0.65+ |
last | QUANTITY | 0.6+ |
Chief | EVENT | 0.56+ |
Officer | EVENT | 0.54+ |
Nirvana | PERSON | 0.41+ |
theCUBE | TITLE | 0.4+ |
theCUBE | ORGANIZATION | 0.39+ |
Ian Swanson, DataScience.com | Big Data SV 2018
(royal music) >> Announcer: John Cleese. >> There's a lot of people out there who have no idea what they're doing, but they have absolutely no idea that they have no idea what they're doing. Those are the ones with the confidence and stupidity who finish up in power. That's why the planet doesn't work. >> Announcer: Knowledgeable, insightful, and a true gentleman. >> The guy at the counter recognized me and said... Are you listening? >> John Furrier: Yes, I'm tweeting away. >> No, you're not. >> I tweet, I'm tweeting away. >> He is kind of rude that way. >> You're on your (bleep) keyboard. >> Announcer: John Cleese joins the Cube alumni. Welcome, John. >> John Cleese: Have you got any phone calls you need to answer? >> John Furrier: Hold on, let me check. >> Announcer: Live from San Jose, it's the Cube, presenting Big Data Silicon Valley, brought to you by Silicon Angle Media and its ecosystem partners. (busy music) >> Hey, welcome back to the Cube's continuing coverage of our event, Big Data SV. I'm Lisa Martin with my co-host, George Gilbert. We are down the street from the Strata Data Conference. This is our second day, and we've been talking all things big data, cloud data science. We're now excited to be joined by the CEO of a company called Data Science, Ian Swanson. Ian, welcome to the Cube. >> Thanks so much for having me. I mean, it's been a awesome two days so far, and it's great to wrap up my trip here on the show. >> Yeah, so, tell us a little bit about your company, Data Science, what do you guys do? What are some of the key opportunities for you guys in the enterprise market? >> Yeah, absolutely. My company's called datascience.com, and what we do is we offer an enterprise data science platform where data scientists get to use all they tools they love in all the languages, all the libraries, leveraging everything that is open source to build models and put models in production. Then we also provide IT the ability to be able to manage this massive stack of tools that data scientists require, and it all boils down to one thing, and that is, companies need to use the data that they've been storing for years. It's about, how do you put that data into action. We give the tools to data scientists to get that data into action. >> Let's drill down on that a bit. For a while, we thought if we just put all our data in this schema-on-read repository, that would be nirvana. But it wasn't all that transparent, and we recognized we have to sort of go in and structure it somewhat, help us take the next couple steps. >> Ian: Yeah, the journey. >> From this partially curated data sets to something that turns into a model that is actionable. >> That's actually been the theme in the show here at the Strata Data Conference. If we went back years ago, it was, how do we store data. Then it was, how do we not just store and manage, but how do we transform it and get it into a shape that we can actually use it. The theme of this year is how do we get it to that next step, the next step of putting it into action. To layer onto that, data scientists need to access data, yes, but then they need to be able to collaborate, work together, apply many different techniques, machine learning, AI, deep learning, these are all techniques of a data scientist to be able to build a model. But then there's that next step, and the next is, hey, I built this model, how do I actually get it in production? How does it actually get used? Here's the shocking thing. I was at an event where there's 500 data scientists in the audience, and I said, "Stand up if you worked on a model for more than nine months "and it never went into production." 90% of the audience stood up. That's the last mile that we're all still working on, and what's exciting is, we can make it possible today. >> Wanting to drill down into the sort of, it sounds like there's a lot of choice in the tools. But typically, to do a pipeline, you either need well established APIs that everyone understands and plugs together with, or you need an end to end sort of single vendor solution that becomes the sort of collaboration backbone. How are you organized, how are you built? >> This might be self-serving, but datascience.com, we have enterprise data science platform, we recommend a unified platform for data science. Now, that unified platform needs to be highly configurable. You need to make it so that that workbench, you can use any tool that you want. Some data scientists might want to use a hammer, others want to be able to use a screwdriver over here. The power is how configurable, how extensible it is, how open source you can adopt everything. The amazing trends that we've seen have been proprietary solutions going back decades, to now, the rise of open source. Every day, dozens if not hundreds of new machine learning libraries are being released every single day. We've got to give those capabilities to data scientists and make them scale. >> OK, so the, and I think it's pretty easy to see how you would have incorporate new machine learning libraries into a pipeline. But then there's also the tools for data preparation, and for like feature extraction and feature engineering, you might even have some tools that help you with figuring out which algorithm to select. What holds all that together? >> Yeah, so orchestrating the enterprise data science stack is the hardest challenge right now. There has to be a company like us that is the glue, that is not just, do these solutions work together, but also, how do they collaborate, what is that workflow? What are those steps in that process? There's one thing that you might have left out, and that is, model deployment, model interpretation, model management. >> George: That's the black art, yeah. >> That's where this whole thing is going next. That was the exciting thing that I heard in terms of all these discussion with business leaders throughout the last two days is model deployment, model management. >> If I can kind of take this to maybe shift the conversation a little bit to the target audience. Talked a lot about data scientists and needing to enable them. I'm curious about, we just talked with, a couple of guests ago, about the chief data officer. How, you work with enterprises, how common is the chief data officer role today? What are some of the challenges they've got that datascience.com can help them to eliminate? >> Yeah, the CIO and the chief data officer, we have CIOs that have been selecting tools for companies to use, and now the chief data officer is sitting down with the CEO and saying, "How do we actually drive business results?" We work very closely with both of those personas. But on the CDO side, it's really helping them educate their teams on the possibilities of what could be realized with the data at hand, and making sure that IT is enabling the data scientists with the right tools. We supply the tools, but we also like to go in there with our customers and help coach, help educate what is possible, and that helps with the CDO's mission. >> A question along that front. We've been talking about sort of empowering the data scientist, and really, from one end of the modeling life cycle all the way to the end or the deployment, which is currently the hardest part and least well supported. But we also have tons of companies that don't have data science trained people, or who are only modestly familiar. Where do, what do we do with them? How do we get those companies into the mainstream in terms of deploying this? >> I think whether you're a small company or a big company, digital transformation is the mandate. Digital transformation is not just, how do I make a taxi company become Uber, or how do I make a speaker company become Sonos, the smart speaker, it's how do I exploit all the sources of my data to get better and improved operational processes, new business models, increased revenue, reduced operation costs. You could start small, and so we work with plenty of smaller companies. They'll hire a couple data scientists, and they're able to do small quick wins. You don't have to go sit in the basement for a year having something that is the thing, the unicorn in the business, it's small quick wins. Now we, my company, we believe in writing code, trained, educated, data scientists. There are solutions out there that you throw data at, you push a button, it gets an output. It's this magic black box. There's risk in that. Model interpretation, what are the features it's scoring on, there's risk, but those companies are seeing some level of success. We firmly believe, though, in hiring a data science team that is trained, you can start small, two or three, and get some very quick wins. >> I was going to say, those quick wins are essential for survivability, like digital transformation is essential, but it's also, I mean, to survival at a minimum, right? >> Ian: Yes. >> Those quick wins are presumably transformative to an enterprise being able to sustain, and then eventually, or ideally, be able to take market share from their competition. >> That is key for the CDO. The CDO is there pitching what is possible, he's pitching, she's pitching the dream. In order to be able to help visualize what that dream and the outcome could be, we always say, start small, quick wins, then from there, you can build. What you don't want to do is go nine months working on something and you don't know if there's going to be outcome. A lot of data science is trial and error. This is science, we're testing hypotheses. There's not always an outcome that's to be there, so small quick wins is something we highly recommend. >> A question, one of the things that we see more and more is the idea that actionable insights are perishable, and that latency matters. In fact, you have a budget for latency, almost, like in that short amount of time, the more sort of features that you can dynamically feed into a model to get a score, are you seeing more of that? How are the use cases that you're seeing, how's that pattern unfolding? >> Yeah, so we're seeing more streaming data use cases. We work with some of the biggest technology companies in the world, so IoT, connected services, streaming real time decisions that are happening. But then, also, there are so many use cases around org that could be marketing, finance, HR related, not just tech related. On the marketing side, imagine if you're customer service, and somebody calls you, and you know instantly the lifetime value of that customer, and it kicks off a totally new talk track, maybe get escalated immediately to a new supervisor, because that supervisor can handle this top tier customer. These are decisions that can happen real time leveraging machine learning models, and these are things that, again, are small quick wins, but massive, massive impact. It's about decision process now. That's digital transformation. >> OK. Are you seeing patterns in terms of how much horsepower customers are budgeting for the training process, creating the model? Because we know it's very compute intensive, like, even Intel, some people call it, like, high performance compute, like a supercomputer type workload. How much should people be budgeting? Because we don't see any guidelines or rules of thumb for this. >> I still think the boundaries are being worked out. There's a lot of great work that Nvidia's doing with GPU, we're able to do things faster on compute power. But even if we just start from the basics, if you go and talk to a data scientist at a massive company where they have a team of over 1,000 data scientists, and you say to do this analysis, how do you spin up your compute power? Well, I go walk over to IT and I knock on the door, and I say, "Set up this machine, set up this cluster." That's ridiculous. A product like ours is able to instantly give them the compute power, scale it elastically with our cloud service partners or work with on-prem solutions to be able to say, get the power that you need to get the results in the time that's needed, quick, fast. In terms of the boundaries of the budget, that's still being defined. But at the end of the day, we are seeing return on investment, and that's what's key. >> Are you seeing a movement towards a greater scope of integration for the data science tool chain? Or is it that at the high end, where you have companies with 1,000 data scientists, they know how to deal with specialized components, whereas, when there's perhaps less of, a smaller pool of expertise, the desire for end to end integration is greater. >> I think there's this kind of thought that is not necessarily right, and that is, if you have a bigger data science team, you're more sophisticated. We actually see the same sophistication level of 1,000 person data science team, in many cases, to a 20 person data science team, and sometimes inverse, I mean, it's kind of crazy. But it's, how do we make sure that we give them the tools so they can drive value. Tools need to include collaboration and workflow, not just hammers and nails, but how do we work together, how do we scale knowledge, how do we get it in the hands of the line of business so they can use the results. It's that that is key. >> That's great, Ian. I also like that you really kind of articulated start small, quick ins can make massive impact. We want to thank you so much for stopping by the Cube and sharing that, and what you guys are doing at Data Science to help enterprises really take advantage of the value that data can really deliver. >> Thanks so much for having datascience.com on, really appreciate it. >> Lisa: Absolutely. George, thank you for being my co-host. >> You're always welcome. >> We want to thank you for watching the Cube. I'm Lisa Martin with George Gilbert, and we are at our event Big Data SV on day two. Stick around, we'll be right back with our next guest after a short break. (busy music)
SUMMARY :
Those are the ones with the confidence and stupidity and a true gentleman. The guy at the counter recognized me and said... Announcer: John Cleese joins the Cube alumni. brought to you by Silicon Angle Media We are down the street from the Strata Data Conference. and it's great to wrap up my trip here on the show. and it all boils down to one thing, and that is, the next couple steps. to something that turns into a model that is actionable. and the next is, hey, I built this model, that becomes the sort of collaboration backbone. how open source you can adopt everything. OK, so the, and I think it's pretty easy to see Yeah, so orchestrating the enterprise data science stack in terms of all these discussion with business leaders a couple of guests ago, about the chief data officer. and making sure that IT is enabling the data scientists empowering the data scientist, and really, having something that is the thing, or ideally, be able to take market share and the outcome could be, we always say, start small, the more sort of features that you can dynamically in the world, so IoT, connected services, customers are budgeting for the training process, get the power that you need to get the results Or is it that at the high end, We actually see the same sophistication level and sharing that, and what you guys are doing Thanks so much for having datascience.com on, George, thank you for being my co-host. and we are at our event Big Data SV on day two.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
George Gilbert | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Ian Swanson | PERSON | 0.99+ |
George | PERSON | 0.99+ |
Ian | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
Uber | ORGANIZATION | 0.99+ |
John Furrier | PERSON | 0.99+ |
Silicon Angle Media | ORGANIZATION | 0.99+ |
John | PERSON | 0.99+ |
John Cleese | PERSON | 0.99+ |
500 data scientists | QUANTITY | 0.99+ |
90% | QUANTITY | 0.99+ |
dozens | QUANTITY | 0.99+ |
Nvidia | ORGANIZATION | 0.99+ |
San Jose | LOCATION | 0.99+ |
20 person | QUANTITY | 0.99+ |
Data Science | ORGANIZATION | 0.99+ |
nine months | QUANTITY | 0.99+ |
1,000 person | QUANTITY | 0.99+ |
two | QUANTITY | 0.99+ |
two days | QUANTITY | 0.99+ |
more than nine months | QUANTITY | 0.99+ |
second day | QUANTITY | 0.99+ |
1,000 data scientists | QUANTITY | 0.99+ |
three | QUANTITY | 0.99+ |
Big Data SV | EVENT | 0.99+ |
over 1,000 data scientists | QUANTITY | 0.99+ |
Cube | ORGANIZATION | 0.99+ |
both | QUANTITY | 0.99+ |
Strata Data Conference | EVENT | 0.98+ |
one | QUANTITY | 0.98+ |
Intel | ORGANIZATION | 0.98+ |
Sonos | ORGANIZATION | 0.98+ |
one thing | QUANTITY | 0.97+ |
a year | QUANTITY | 0.96+ |
today | DATE | 0.95+ |
day two | QUANTITY | 0.95+ |
this year | DATE | 0.94+ |
single | QUANTITY | 0.92+ |
Big Data SV 2018 | EVENT | 0.88+ |
DataScience.com | ORGANIZATION | 0.87+ |
hundreds of new machine learning libraries | QUANTITY | 0.86+ |
lot of people | QUANTITY | 0.83+ |
decades | QUANTITY | 0.82+ |
every single day | QUANTITY | 0.81+ |
years ago | DATE | 0.77+ |
last two days | DATE | 0.76+ |
datascience.com | ORGANIZATION | 0.75+ |
one end | QUANTITY | 0.7+ |
years | QUANTITY | 0.67+ |
datascience.com | OTHER | 0.65+ |
couple steps | QUANTITY | 0.64+ |
Big Data | EVENT | 0.64+ |
couple of guests | DATE | 0.57+ |
couple | QUANTITY | 0.52+ |
Silicon Valley | LOCATION | 0.52+ |
things | QUANTITY | 0.5+ |
Cube | TITLE | 0.47+ |
Ziya Ma, Intel | Big Data SV 2018
>> Live from San Jose, it's theCUBE! Presenting Big Data Silicon Valley, brought to you by SiliconANGLE Media and its ecosystem partners. >> Welcome back to theCUBE. Our continuing coverage of our event, Big data SV. I'm Lisa Martin with my co-host George Gilbert. We're down the street from the Strata Data Conference, hearing a lot of interesting insights on big data. Peeling back the layers, looking at opportunities, some of the challenges, barriers to overcome but also the plethora of opportunities that enterprises alike have that they can take advantage of. Our next guest is no stranger to theCUBE, she was just on with me a couple days ago at the Women in Data Science Conference. Please welcome back to theCUBE, Ziya Ma. Vice President of Software and Services Group and the Director of Big Data Technologies from Intel. Hi Ziya! >> Hi Lisa. >> Long time, no see. >> I know, it was just really two to three days ago. >> It was, well and now I can say happy International Women's Day. >> The same to you, Lisa. >> Thank you, it's great to have you here. So as I mentioned, we are down the street from the Strata Data Conference. You've been up there over the last couple days. What are some of the things that you're hearing with respect to big data? Trends, barriers, opportunities? >> Yeah, so first it's very exciting to be back at the conference again. The one biggest trend, or one topic that's hit really hard by many presenters, is the power of bringing the big data system and data science solutions together. You know, we're definitely seeing in the last few years the advancement of big data and advancement of data science or you know, machine learning, deep learning truly pushing forward business differentiation and improve our life quality. So that's definitely one of the biggest trends. Another thing I noticed is there was a lot of discussion on big data and data science getting deployed into the cloud. What are the learnings, what are the use cases? So I think that's another noticeable trend. And also, there were some presentations on doing the data science or having the business intelligence on the edge devices. That's another noticeable trend. And of course, there were discussion on security, privacy for data science and big data so that continued to be one of the topics. >> So we were talking earlier, 'cause there's so many concepts and products to get your arms around. If someone is looking at AI and machine learning on the back end, you know, we'll worry about edge intelligence some other time, but we know that Intel has the CPU with the Xeon and then this lower power one with Atom. There's the GPU, there's ASICs, FPGAS, and then there are these software layers you know, with higher abstraction layer, higher abstraction level. Help us put some of those pieces together for people who are like saying, okay, I know I've got a lot of data, I've got to train these sophisticated models, you know, explain this to me. >> Right, so Intel is a real solution provider for data science and big data. So at the hardware level, and George, as you mentioned, we offer a wide range of products from general purpose like Xeon to targeted silicon such as FPGA, Nervana, and other ASICs chips like Nervana. And also we provide adjacencies like networking the hardware, non-volatile memory and mobile. You know, those are the other adjacent products that we offer. Now on top of the hardware layer, we deliver fully optimized software solutions stack from libraries, frameworks, to tools and solutions. So that we can help engineers or developers to create AI solutions with greater ease and productivity. For instance, we deliver Intel optimized math kernel library. That leverage of the latest instruction set gives us significant performance boosts when you are running your software on Intel hardware. We also deliver framework like BigDL and for Spark and big data type of customers if they are looking for deep learning capabilities. We also optimize some popular open source deep learning frameworks like Caffe, like TensorFlow, MXNet, and a few others. So our goal is to provide all the necessary solutions so that at the end our customers can create the applications, the solutions that they really need to address their biggest pinpoints. >> Help us think about the maturity level now. Like, we know that the very most sophisticated internet service providers who are sort of all over this machine learning now for quite a few years. Banks, insurance companies, people who've had this. Statisticians and actuaries who have that sort of skillset are beginning to deploy some of these early production apps. Where are we in terms of getting this out to the mainstream? What are some of the things that have to happen? >> To get it to mainstream, there are so many things we could do. First I think we will continue to see the wide range of silicon products but then there are a few things Intel is pushing. For example, we're developing this in Nervana, graph compiler that will encapsulate the hardware integration details and present a consistent API for developers to work with. And this is one thing that we hope that we can eventually help the developer community with. And also, we are collaborating with the end user. Like, from the enterprise segment. For example, we're working with the financial services industry, we're working with a manufacturing sector and also customers from the medical field. And online retailers, trying to help them to deliver or create the data science and analytics solutions on Intel-based hardware or Intel optimized software. So that's another thing that we do. And we're seeing actually very good progress in this area. Now we're also collaborating with many cloud service providers. For instance, we work with some of the top seven cloud service providers, both in the U.S. and also in China to democratize the, not only our hardware, but also our libraries and tools, BigDL, MKL, and other frameworks and libraries so that our customers, including individuals and businesses, can easily access to those building blocks from the cloud. So definitely we're working from different factors. >> So last question in the last couple of minutes. Let's kind of vibe on this collaboration theme. Tell us a little bit about the collaboration that you're having with, you mentioned customers in some highly regulated industries, for as an example. But a little bit to understand what's that symbiosis? What is Intel learning from your customers that's driving Intel's innovation of your technologies and big data? >> That's an excellent question. So Lisa, maybe I can start my sharing a couple of customer use cases. What kind of a solution that we help our customer to address. I think it's always wise not to start a conversation with the customer on technology that you deliver. You want to understand the customer's needs first. And then so that you can provide a solution that really address their biggest pinpoint rather than simply selling technology. So for example, we have worked with an online retailer to better understand their customers' shopping behavior and to assess their customers' preferences and interests. And based upon that analysis, the online retailer made different product recommendations and maximized its customers' purchase potential. And it drove up the retailer's sales. You know, that's one type of use case that we have worked. We also have partnered with the customers from the medical field. Actually, today at the Strata Conference we actually had somebody highlighting, we had a joint presentation with UCSF where we helped the medical center to automate the diagnosis and grading of meniscus lesions. And so today actually, that's all done manually by the radiologist but now that entire process is automated. The result is much more accurate, much more consistent, and much more timely. Because you don't have to wait for the availability of a radiologist to read all the 3D MRI images. And that can all be done by machines. You know, so those are the areas that we work with our customers, understand their business need, and give them the solution they are looking for. >> Wow, the impact there. I wish we had more time to dive into some of those examples. But we thank you so much, Ziya, for stopping by twice in one week to theCUBE and sharing your insights. And we look forward to having you back on the show in the near future. >> Thanks, so thanks Lisa, thanks George for having me. >> And for my co-host George Gilbert, I'm Lisa Martin. We are live at Big Data SV in San Jose. Come down, join us for the rest of the afternoon. We're at this cool place called Forager Tasting and Eatery. We will be right back with our next guest after a short break. (electronic outro music)
SUMMARY :
brought to you by SiliconANGLE Media some of the challenges, barriers to overcome What are some of the things that you're So that's definitely one of the biggest trends. on the back end, So at the hardware level, and George, as you mentioned, What are some of the things that have to happen? and also customers from the medical field. So last question in the last couple of minutes. customers from the medical field. And we look forward to having you We will be right back with our
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
George Gilbert | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
UCSF | ORGANIZATION | 0.99+ |
George | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
San Jose | LOCATION | 0.99+ |
China | LOCATION | 0.99+ |
Ziya Ma | PERSON | 0.99+ |
U.S. | LOCATION | 0.99+ |
International Women's Day | EVENT | 0.99+ |
SiliconANGLE Media | ORGANIZATION | 0.99+ |
Ziya | PERSON | 0.99+ |
one week | QUANTITY | 0.99+ |
today | DATE | 0.99+ |
twice | QUANTITY | 0.99+ |
First | QUANTITY | 0.99+ |
Strata Data Conference | EVENT | 0.99+ |
one topic | QUANTITY | 0.98+ |
Spark | TITLE | 0.98+ |
both | QUANTITY | 0.98+ |
Intel | ORGANIZATION | 0.98+ |
one thing | QUANTITY | 0.98+ |
three days ago | DATE | 0.98+ |
Women in Data Science Conference | EVENT | 0.97+ |
Strata Conference | EVENT | 0.96+ |
first | QUANTITY | 0.96+ |
BigDL | TITLE | 0.96+ |
TensorFlow | TITLE | 0.96+ |
one type | QUANTITY | 0.95+ |
two | DATE | 0.94+ |
MXNet | TITLE | 0.94+ |
Caffe | TITLE | 0.92+ |
theCUBE | ORGANIZATION | 0.91+ |
one | QUANTITY | 0.9+ |
Software and Services Group | ORGANIZATION | 0.9+ |
Forager Tasting and Eatery | ORGANIZATION | 0.88+ |
Vice President | PERSON | 0.86+ |
Big Data Technologies | ORGANIZATION | 0.84+ |
seven cloud service providers | QUANTITY | 0.81+ |
last couple days | DATE | 0.81+ |
Atom | COMMERCIAL_ITEM | 0.76+ |
Silicon Valley | LOCATION | 0.76+ |
Big Data SV 2018 | EVENT | 0.74+ |
a couple days ago | DATE | 0.72+ |
Big Data SV | ORGANIZATION | 0.7+ |
Xeon | COMMERCIAL_ITEM | 0.7+ |
Nervana | ORGANIZATION | 0.68+ |
Big Data | EVENT | 0.62+ |
last | DATE | 0.56+ |
data | EVENT | 0.54+ |
case | QUANTITY | 0.52+ |
3D | QUANTITY | 0.48+ |
couple | QUANTITY | 0.47+ |
years | DATE | 0.47+ |
Nervana | TITLE | 0.45+ |
Big | ORGANIZATION | 0.32+ |
Seth Dobrin, IBM | Big Data SV 2018
>> Announcer: Live from San Jose, it's theCUBE. Presenting Big Data Silicon Valley, brought to you by SiliconANGLE Media and it's ecosystem partners. >> Welcome back to theCUBE's continuing coverage of our own event, Big Data SV. I'm Lisa Martin, with my cohost Dave Vellante. We're in downtown San Jose at this really cool place, Forager Eatery. Come by, check us out. We're here tomorrow as well. We're joined by, next, one of our CUBE alumni, Seth Dobrin, the Vice President and Chief Data Officer at IBM Analytics. Hey, Seth, welcome back to theCUBE. >> Hey, thanks for having again. Always fun being with you guys. >> Good to see you, Seth. >> Good to see you. >> Yeah, so last time you were chatting with Dave and company was about in the fall at the Chief Data Officers Summit. What's kind of new with you in IBM Analytics since then? >> Yeah, so the Chief Data Officers Summit, I was talking with one of the data governance people from TD Bank and we spent a lot of time talking about governance. Still doing a lot with governance, especially with GDPR coming up. But really started to ramp up my team to focus on data science, machine learning. How do you do data science in the enterprise? How is it different from doing a Kaggle competition, or someone getting their PhD or Masters in Data Science? >> Just quickly, who is your team composed of in IBM Analytics? >> So IBM Analytics represents, think of it as our software umbrella, so it's everything that's not pure cloud or Watson or services. So it's all of our software franchise. >> But in terms of roles and responsibilities, data scientists, analysts. What's the mixture of-- >> Yeah. So on my team I have a small group of people that do governance, and so they're really managing our GDPR readiness inside of IBM in our business unit. And then the rest of my team is really focused on this data science space. And so this is set up from the perspective of we have machine-learning engineers, we have predictive-analytics engineers, we have data engineers, and we have data journalists. And that's really focus on helping IBM and other companies do data science in the enterprise. >> So what's the dynamic amongst those roles that you just mentioned? Is it really a team sport? I mean, initially it was the data science on a pedestal. Have you been able to attack that problem? >> So I know a total of two people that can do that all themselves. So I think it absolutely is a team sport. And it really takes a data engineer or someone with deep expertise in there, that also understands machine-learning, to really build out the data assets, engineer the features appropriately, provide access to the model, and ultimately to what you're going to deploy, right? Because the way you do it as a research project or an activity is different than using it in real life, right? And so you need to make sure the data pipes are there. And when I look for people, I actually look for a differentiation between machine-learning engineers and optimization. I don't even post for data scientists because then you get a lot of data scientists, right? People who aren't really data scientists, and so if you're specific and ask for machine-learning engineers or decision optimization, OR-type people, you really get a whole different crowd in. But the interplay is really important because most machine-learning use cases you want to be able to give information about what you should do next. What's the next best action? And to do that, you need decision optimization. >> So in the early days of when we, I mean, data science has been around forever, right? We always hear that. But in the, sort of, more modern use of the term, you never heard much about machine learning. It was more like stats, math, some programming, data hacking, creativity. And then now, machine learning sounds fundamental. Is that a new skillset that the data scientists had to learn? Did they get them from other parts of the organization? >> I mean, when we talk about math and stats, what we call machine learning today has been what we've been doing since the first statistics for years, right? I mean, a lot of the same things we apply in what we call machine learning today I did during my PhD 20 years ago, right? It was just with a different perspective. And you applied those types of, they were more static, right? So I would build a model to predict something, and it was only for that. It really didn't apply it beyond, so it was very static. Now, when we're talking about machine learning, I want to understand Dave, right? And I want to be able to predict Dave's behavior in the future, and learn how you're changing your behavior over time, right? So one of the things that a lot of people don't realize, especially senior executives, is that machine learning creates a self-fulfilling prophecy. You're going to drive a behavior so your data is going to change, right? So your model needs to change. And so that's really the difference between what you think of as stats and what we think of as machine learning today. So what we were looking for years ago is all the same we just described it a little differently. >> So how fine is the line between a statistician and a data scientist? >> I think any good statistician can really become a data scientist. There's some issues around data engineering and things like that but if it's a team sport, I think any really good, pure mathematician or statistician could certainly become a data scientist. Or machine-learning engineer. Sorry. >> I'm interested in it from a skillset standpoint. You were saying how you're advertising to bring on these roles. I was at the Women in Data Science Conference with theCUBE just a couple of days ago, and we hear so much excitement about the role of data scientists. It's so horizontal. People have the opportunity to make impact in policy change, healthcare, etc. So the hard skills, the soft skills, mathematician, what are some of the other elements that you would look for or that companies, enterprises that need to learn how to embrace data science, should look for? Someone that's not just a mathematician but someone that has communication skills, collaboration, empathy, what are some of those, openness, to not lead data down a certain, what do you see as the right mix there of a data scientist? >> Yeah, so I think that's a really good point, right? It's not just the hard skills. When my team goes out, because part of what we do is we go out and sit with clients and teach them our philosophy on how you should integrate data science in the enterprise. A good part of that is sitting down and understanding the use case. And working with people to tease out, how do you get to this ultimate use case because any problem worth solving is not one model, any use case is not one model, it's many models. How do you work with the people in the business to understand, okay, what's the most important thing for us to deliver first? And it's almost a negotiation, right? Talking them back. Okay, we can't solve the whole problem. We need to break it down in discreet pieces. Even when we break it down into discreet pieces, there's going to be a series of sprints to deliver that. Right? And so having these soft skills to be able to tease that in a way, and really help people understand that their way of thinking about this may or may not be right. And doing that in a way that's not offensive. And there's a lot of really smart people that can say that, but they can come across at being offensive, so those soft skills are really important. >> I'm going to talk about GDPR in the time we have remaining. We talked about in the past, the clocks ticking, May the fines go into effect. The relationship between data science, machine learning, GDPR, is it going to help us solve this problem? This is a nightmare for people. And many organizations aren't ready. Your thoughts. >> Yeah, so I think there's some aspects that we've talked about before. How important it's going to be to apply machine learning to your data to get ready for GDPR. But I think there's some aspects that we haven't talked about before here, and that's around what impact does GDPR have on being able to do data science, and being able to implement data science. So one of the aspects of the GDPR is this concept of consent, right? So it really requires consent to be understandable and very explicit. And it allows people to be able to retract that consent at any time. And so what does that mean when you build a model that's trained on someone's data? If you haven't anonymized it properly, do I have to rebuild the model without their data? And then it also brings up some points around explainability. So you need to be able to explain your decision, how you used analytics, how you got to that decision, to someone if they request it. To an auditor if they request it. Traditional machine learning, that's not too much of a problem. You can look at the features and say these features, this contributed 20%, this contributed 50%. But as you get into things like deep learning, this concept of explainable or XAI becomes really, really important. And there were some talks earlier today at Strata about how you apply machine learning, traditional machine learning to interpret your deep learning or black box AI. So that's really going to be important, those two things, in terms of how they effect data science. >> Well, you mentioned the black box. I mean, do you think we'll ever resolve the black box challenge? Or is it really that people are just going to be comfortable that what happens inside the box, how you got to that decision is okay? >> So I'm inherently both cynical and optimistic. (chuckles) But I think there's a lot of things we looked at five years ago and we said there's no way we'll ever be able to do them that we can do today. And so while I don't know how we're going to get to be able to explain this black box as a XAI, I'm fairly confident that in five years, this won't even be a conversation anymore. >> Yeah, I kind of agree. I mean, somebody said to me the other day, well, it's really hard to explain how you know it's a dog. >> Seth: Right (chuckles). But you know it's a dog. >> But you know it's a dog. And so, we'll get over this. >> Yeah. >> I love that you just brought up dogs as we're ending. That's my favorite thing in the world, thank you. Yes, you knew that. Well, Seth, I wish we had more time, and thanks so much for stopping by theCUBE and sharing some of your insights. Look forward to the next update in the next few months from you. >> Yeah, thanks for having me. Good seeing you again. >> Pleasure. >> Nice meeting you. >> Likewise. We want to thank you for watching theCUBE live from our event Big Data SV down the street from the Strata Data Conference. I'm Lisa Martin, for Dave Vellante. Thanks for watching, stick around, we'll be rick back after a short break.
SUMMARY :
brought to you by SiliconANGLE Media Welcome back to theCUBE's continuing coverage Always fun being with you guys. Yeah, so last time you were chatting But really started to ramp up my team So it's all of our software franchise. What's the mixture of-- and other companies do data science in the enterprise. that you just mentioned? And to do that, you need decision optimization. So in the early days of when we, And so that's really the difference I think any good statistician People have the opportunity to make impact there's going to be a series of sprints to deliver that. in the time we have remaining. And so what does that mean when you build a model Or is it really that people are just going to be comfortable ever be able to do them that we can do today. I mean, somebody said to me the other day, But you know it's a dog. But you know it's a dog. I love that you just brought up dogs as we're ending. Good seeing you again. We want to thank you for watching theCUBE
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Dave Vellante | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Seth | PERSON | 0.99+ |
Dave | PERSON | 0.99+ |
IBM | ORGANIZATION | 0.99+ |
Seth Dobrin | PERSON | 0.99+ |
20% | QUANTITY | 0.99+ |
50% | QUANTITY | 0.99+ |
TD Bank | ORGANIZATION | 0.99+ |
San Jose | LOCATION | 0.99+ |
two people | QUANTITY | 0.99+ |
tomorrow | DATE | 0.99+ |
IBM Analytics | ORGANIZATION | 0.99+ |
two things | QUANTITY | 0.99+ |
SiliconANGLE Media | ORGANIZATION | 0.99+ |
one model | QUANTITY | 0.99+ |
five years | QUANTITY | 0.98+ |
20 years ago | DATE | 0.98+ |
Big Data SV | EVENT | 0.98+ |
five years ago | DATE | 0.98+ |
GDPR | TITLE | 0.98+ |
theCUBE | ORGANIZATION | 0.98+ |
one | QUANTITY | 0.98+ |
Strata Data Conference | EVENT | 0.97+ |
today | DATE | 0.97+ |
first statistics | QUANTITY | 0.95+ |
CUBE | ORGANIZATION | 0.94+ |
Women in Data Science Conference | EVENT | 0.94+ |
both | QUANTITY | 0.94+ |
Chief Data Officers Summit | EVENT | 0.93+ |
Big Data SV 2018 | EVENT | 0.93+ |
couple of days ago | DATE | 0.93+ |
years | DATE | 0.9+ |
Forager Eatery | ORGANIZATION | 0.9+ |
first | QUANTITY | 0.86+ |
Watson | TITLE | 0.86+ |
Officers Summit | EVENT | 0.74+ |
Data Officer | PERSON | 0.73+ |
SV | EVENT | 0.71+ |
President | PERSON | 0.68+ |
Strata | TITLE | 0.67+ |
Big Data | ORGANIZATION | 0.66+ |
earlier today | DATE | 0.65+ |
Silicon Valley | LOCATION | 0.64+ |
years | QUANTITY | 0.6+ |
Chief | EVENT | 0.44+ |
Kaggle | ORGANIZATION | 0.43+ |
Wrap | WiDS 2018
>> Narrator: Live from Stanford University, in Palo Alto California, it's The Cube, Covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Welcome back to The Cube, our continuing coverage of Women in Data Science 2018 continues. I'm Lisa Martin, live from Stanford University, and very excited to be joined by our Co-founder, Co-CEO of SiliconANGLE Media and The Cube, John Furrier. John, what an amazing event, the 3rd Annual WiDS event, the third time The Cube has been here, this event, the energy, the momentum, the excitement, you can feel it. >> I really wanted to interview with you all day, but I wanted to make sure that we had the right women in tech, women in data science. (Lisa laughs) You're an amazing host. I thought it was awesome. What a great powerhouse of women. It's just such an honor for The Cube team and SiliconANGLE to be here. We're listed as a global innovative sponsor on there, so it's like the recognition because they have high integrity. The organizers, Judy, Karen, and Margot, when we first met, when they first started, this "Can you bring The Cube?", of course we will! Because we knew the network effect was big here. They were early on, and they took a great approach. They really nailed the positioning of the event. Use Stanford University as a base, establish a global community, which they have now done. It is so successful, this is the future of events, in my opinion. The way they do it, the way they bring in the content curation here at Stanford, but it's open, it's inclusive, they created a network effect with satellite communities around the world. They've created a VIP network of power women, and it's a shortcut to trust. This is the trusted network of women in data science. It's super exciting. I'm so proud to be part of it in a small way. They get all the credit, but just capturing all the data, the interviews are great data. You've done a great job. The conversations were amazing. The hallway conversations went great. It was just fantastic. >> Yeah it was fantastic, and thank you for handing the keys to The Cube to me for this event. The remarkable thing-- One of the remarkable things to me about this event is that they have, in third year, they're going to reach 100,000 people with this event. There were 177 regional events in the last 24 hours, #WiDS2018, in 53 countries. And we were fortunate to have Margot Gerritsen on a few hours ago, and I said, "You must be pleasantly shocked at this massive trajectory, "but where do go from here?" "Sustaining, maintaining, but also reaching out," she said, "to even younger audiences in high schools "and being able to ignite the bunsen burner, "turn it up a little bit higher." What were some of the hallway conversations that you had? >> Well I think the big thing was is that, first of all, the panels on the conversation of the content was not about women, it was about data science, that happen to be women. >> Yes. So the quality of the conversations, if you close your eyes, you'll be like, "There are some serious pros on here". And they had some side discussions around how to be a woman in tech and data science, and how to use your integrity and reputation, but the content program was top-shelf. I mean, it was fantastic, so that was equalizing. The hallway conversations was global. I heard about global impact, I heard that data science is very mission-driven. And you're seeing a confluence of technology and innovation with technology like data analytics, data science, fueling mission-driven, so standard run your business on analytics, but now run society on analytics. So you're seeing a global framework developing around mission-driven, you'll hear the word "impact" a lot, and it was not just speeds-and-feeds data science, although they're plenty to geek out about, but it was more of a higher level order bit around mission, and society. So this is right around what we're seeing at The Cube around cloud computing, cryptocurrency and blockchain, that you're seeing a democracy being rewritten with technology. Data's the new oil. Oil's power in the new global economy, and you're seeing that in all kinds of decentralized forms of blockchain and cryptocurrency, you're seeing businesses transform with data science, so with that comes a lot of responsibility. So, ethics conversation in the hallway. I felt like I was at a TED talk, meets World Economic Forum, meets Stanford Think Tank, meets practitioner. It was like, really exciting. >> And they had keynotes, which we had a few on some tech tracks, and a career panel. Did you get to listen to the career panel? >> John: The career panel was interesting and I'd love to get your thoughts on some of your interviews that crossover, because it was really more about being proud and high integrity. So the word "democratization" came up, and the conversations in the audience when they had the Q&A was, "Isn't it more about respect?", democratization, not that there's anything wrong with that, but "Isn't it about integrity? "What is the integrity of us as a community, "as women in data science, what is the respect, "integrity, and mission of the role?" Of course democratization is a side effect of good news data, so that was super exciting. And then also, stand up, never give up, never worry about the failure, never worry about getting in a blocker, remove that blocker or as Teresa Carlson at Amazon would say. So there was definitely the woman vibe of "Listen, don't take things lying down. "Have a tough skin. "Take names and kick butt, but be proud." >> That's where a lot of the, when I'd ask some of our guests, "What advice would you give your younger self?" and a lot of them said the same thing, of "Don't be afraid to get out of your comfort zone". My mentor says, "Get comfortably uncomfortable." I think that's pretty hard for a lot-- If I look back at myself 20 years ago I wouldn't have been able to do that. It took a mentor, and just as Maria Klawe has said on The Cube before, the best time to reach and inspire the next generation of females to go into STEM is first semester yoo-nuh-ver-zhen, that's exactly when it happened for me and I didn't plan it, but it took someone to kind of go like Maria said this morning, "Don't be focused "on the things you think you're not good at." So that "failure is not a bad F word" was a theme that we heard a number of times today, and I think, incredibly important. >> And the tweets I tweeted out but it was kind of said differently, I don't know the exact tweet, but I'd kind of paraphrase it by saying Maria from Harvey Mudd said, "Look it, there's plenty of opportunities "in data science, go there." And she compared and contrasted her journey in a male-dominated world with "Look, if you're stuck or you're in a rut, "or you're in somewhere you're uncomfortable with, "from a male perspective or dogma, "or structural system that's not working for you, "just get out of it and go to another venue." Another venue being a growth market. So the message here was there's plenty of opportunities in data science than just data analytics. There's math career paths, there's cryptocurrency, there's blockchain, there's all kinds of different elements. Go where the growth is. If you go where the growth is, you can pioneer and find like-minded individuals. That was a great message I thought, for women, because you're going to find men in those markets that love collaborating with anyone who's smart, and since everyone here's smart, they're saying just go where the growth is. Don't try to go to a stagnant pond where all the dogma and the structural stuff is. That's going to take too long to change. That's my take, but I think that's kind of the message I thought was really, really powerful. And that's the message I'm going to tell my two daughters is "Stand tall, and go after the new territory." >> You can do anything, and that was also a theme of "Don't be afraid to take risks". In any way of life if we don't take risks, we risk losing out on something. That was something we heard a lot. >> John: Let me ask you a question then, because you did the interview. I was jealous, 'cause you know I hate to give up the microphone. >> I know you. (laughs) But I love this event, 'cause it's super awesome. What were some of the highlights for you? Was there a notable interview, was there some sound bites? What were some of the things that you found were inspiring, informational, or notable? >> Oh, all of the above. Everybody. I loved talking with Maria Klawe this morning who, to your point earlier, had to from many generations face the gender bias, and has such a... That her energy alone is so incredibly inspiring. And what she has been able to do as the first female president of Harvey Mudd and the transformation that she's facilitated so far is remarkable. Margot Gerritsen also was a great, inspiring guest for me. She had said, they had this idea three years ago, you were there from the beginning and I said how long was it from concept to first event? Six months. Whoa, strap on your seatbelt. And she said it was almost-- >> And they did it on a limited budget too, by the way. >> Sure. She said it was almost like the revenge conference. Tell us we can't do something, and I heard that theme as well, people saying, "Tell me I can't do something, "and I will prove you wrong in spades." (John laughs) And I think it's an important message. There's still such a gap in diversity. Not just in diversity in gender and ethnicity, there's a thought diversity gap that every industry is missing. That was another kind of common theme, and that was kind of a new term for me, thought diversity. I thought, "Wow, it's incredibly important "to bring in different perspectives." >> And on that point, one of the things I did here in the hallway was a conversation of, this is not just a movement, it's a collection of movements. So it's not one movement, this one is, or women in general, it's a collection of movements, but it's really one movement. So that was interesting, I was kind of like "Hmm", as being a guy I'm like, "Can you women-splain that to me please?" (Lisa and John laugh) >> Yeah, well the momentum that they-- >> What kind of movement is this? (laughing) >> They're achieving. (laughing) I'm sure there'll be a hashtag for that, and speaking of hashtags, I did think it was very cool that today is Monday, #MotivationMonday, this whole day was Motivation Monday to me. And I asked Margot, "Where do you go from here? "You've achieved this in the third year." And she said, "Doing more WiDS events throughout the year, "also starting to deliver resources on demand for folks". Not just females, to your point, this is people in data science, globally, to consume, and then going sort of downstream if you will, or maybe it's upstream, and starting to reach more of that high school age, those girls who might have a desire or interest in something but might think, "I don't think I can do this". >> Well I think one of the things that I'm seeing, and I was glad to be one of the men that stood up, and there's men here, is that men being part of it is super important because these newer markets, like I was just in the Bahamas for a cryptocurrency blockchain event, and there's a lot of younger generations, the whole gender thing to them, they think is nonsense. They should be all equal. So in these new growth areas they're kind of libertarian, but also they're really open and inclusive. It's because of their open-source ethos. So I think for the younger generation in the youth, we can kind of set the table now, and men got to be a part of that. So to be that kind of world where the conversation isn't about women in tech, means that it's all good now, >> Yeah. Right? So the question we've had on The Cube is when we're done with the diversity and inclusion discussion, that means we've accomplished the goal, which is there's no longer a need for that discussion because it's all kind of leveled up. So I mean, a long ways to go for sure, but that's the goal, and I think the younger generations are like, "You old people are like... "We don't view it that way", so we hope that structurally, we have these kinds of conferences where the conversation is not about just women, but the topics, and their gurus at their field. To me, that is the shining light that we want to focus on, because that's also inspirational. Now the stuff that needs to be fixed, is hard conversations, and it's tough but you can do both. And I think that's a message that I hear here. Phenomenal. >> Great to hear though from your perspectives, from what you're hearing with the millennials in the next generation going "Why are you even talking about this?" It would be great if we eventually get there, but some other things that are really key, and some of these companies are WiDS sponsors, Intel and SAP, and what they're doing to achieve, really aggressively, much more gender diversity. We heard Intel talk about it. We heard SAP talk about it today, Walmart Labs as well. And it's still obviously quite a need for it is what it's showing. >> The pay gap is still off. Way too off, yes. >> So that is like, the conversation needs to happen, I'm not trying to minimize that with my other point, but we got to get there. The other thing that's really off, the pay has got to get leveled up and people are working on that. That's great, let's see the progress. Let's look at the data. But the other one that no one's talking about is not only is the pay a problem, the big problem is the titles. So, we've been looking at data amongst a lot of the big companies. Women are getting some pay leveled up, but their titles aren't. So there's still a lot of these little things out there that matter. She's only a VP, and he's an SVP, but she's actually operating at an SVP level, or Senior Director, I mean, this is happening. So much more work to do, but again, the more that they come in with the skills that they got like in here, the networks that are forming, the VIP trust influence networks, it's just phenomenal. I think this is going to really accelerate the peer review, the peer relationships, access to the data, and just the more the merrier. Shine the light on it, turn the sunlight on. >> Exactly, shining a light on the awareness that they're generating, and also that we have a chance to share through The Cube, bringing more light to some of these things that you talked about, the faster, like you said, the more we're going to be able to accelerate making this a non-topic. >> It's our mission. The Cube's mission is to open the content up, get the conversations, document the folks, get them ingested into our network, share our networks open content. The more that that meta data and that knowledge can share digitally, that is the mission that we live for. As you know we love doing it. You did a great job today. >> Lisa: Thank you! It was my pleasure. It's an inspiring event, even just getting prepped for it, and you can hear all the buzz around us that it probably feels-- >> Cocktail party time. It is cocktail party time. Feels pretty darn good. Well John, thanks so much for being our fearless leader and allowing us to come here. And we want to thank you for watching The Cube. We have been live all day at WiDS 2018. Join the conversation. Follow us, @thecube. Join the conversation with #WiDS2018, and please join the conversation and share the videos of some of these fantastic leaders and inspirational folks that we had on the show today. For my co-host, John Furrier, I am Lisa Martin. We'll see ya next time. (electronic music)
SUMMARY :
Brought to you by Stanford. the momentum, the excitement, you can feel it. and it's a shortcut to trust. One of the remarkable things to me about this event the panels on the conversation of the content So the quality of the conversations, if you close your eyes, And they had keynotes, which we had a few "integrity, and mission of the role?" "on the things you think you're not good at." And that's the message I'm going to tell my two daughters You can do anything, and that was also a theme I was jealous, 'cause you know I hate What were some of the things that you found and the transformation that she's facilitated so far and that was kind of a new term for me, thought diversity. And on that point, one of the things I did and starting to reach more of that high school age, and men got to be a part of that. To me, that is the shining light that we want to focus on, and some of these companies are WiDS sponsors, The pay gap is still off. So that is like, the conversation needs to happen, the faster, like you said, the more we're going to be able that is the mission that we live for. and you can hear all the buzz around us and please join the conversation and share the videos
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Judy | PERSON | 0.99+ |
Margot | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
John | PERSON | 0.99+ |
Karen | PERSON | 0.99+ |
John Furrier | PERSON | 0.99+ |
Maria Klawe | PERSON | 0.99+ |
Maria | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
Teresa Carlson | PERSON | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
two daughters | QUANTITY | 0.99+ |
Six months | QUANTITY | 0.99+ |
Walmart Labs | ORGANIZATION | 0.99+ |
The Cube | TITLE | 0.99+ |
Bahamas | LOCATION | 0.99+ |
100,000 people | QUANTITY | 0.99+ |
Intel | ORGANIZATION | 0.99+ |
Palo Alto California | LOCATION | 0.99+ |
#WiDS2018 | EVENT | 0.99+ |
today | DATE | 0.99+ |
The Cube | ORGANIZATION | 0.99+ |
Stanford University | ORGANIZATION | 0.99+ |
SiliconANGLE Media | ORGANIZATION | 0.99+ |
Stanford | ORGANIZATION | 0.99+ |
Monday | DATE | 0.99+ |
both | QUANTITY | 0.98+ |
three years ago | DATE | 0.98+ |
177 regional events | QUANTITY | 0.98+ |
53 countries | QUANTITY | 0.98+ |
third time | QUANTITY | 0.98+ |
SiliconANGLE | ORGANIZATION | 0.98+ |
one | QUANTITY | 0.98+ |
WiDS 2018 | EVENT | 0.98+ |
One | QUANTITY | 0.98+ |
third year | QUANTITY | 0.98+ |
SAP | ORGANIZATION | 0.97+ |
one movement | QUANTITY | 0.97+ |
Harvey Mudd | PERSON | 0.97+ |
first | QUANTITY | 0.97+ |
first event | QUANTITY | 0.96+ |
Women in Data Science 2018 | EVENT | 0.96+ |
WiDS | ORGANIZATION | 0.94+ |
20 years ago | DATE | 0.94+ |
this morning | DATE | 0.93+ |
3rd Annual WiDS | EVENT | 0.92+ |
few hours ago | DATE | 0.91+ |
Stanford Think Tank | ORGANIZATION | 0.91+ |
a number of times | QUANTITY | 0.91+ |
first female | QUANTITY | 0.86+ |
first semester | QUANTITY | 0.85+ |
@thecube | PERSON | 0.84+ |
The Cube, Covering Women in Data Science Conference 2018 | EVENT | 0.83+ |
Cube | ORGANIZATION | 0.72+ |
TED talk | EVENT | 0.69+ |
last 24 hours | DATE | 0.68+ |
World Economic Forum | EVENT | 0.67+ |
things | QUANTITY | 0.65+ |
plenty of opportunities | QUANTITY | 0.62+ |
Stanford | LOCATION | 0.51+ |
Cube | COMMERCIAL_ITEM | 0.35+ |
Ziya Ma, Intel Corporation | WiDS 2018
>> Announcer: Live from Stanford University in Palo Alto, California, it's theCUBE. Covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Welcome back to theCUBE, we are live at Stanford University for the third annual Women in Data Science Conference, hashtag WiDS2018. Participate in the conversation and you're going to see people at WiDS events in over 177 regions in over 53 countries. This even is aiming to reach about 100,000 people in the next couple of days, which in its third year is remarkable. It's aimed at inspiring and educating data scientists worldwide and of course supporting females in the field. It's also got keynotes, technical vision tracks, and a career panel. And we're excited to welcome back to theCUBE, a cube alumni, Ziya Ma, the Vice President of Software and Services Group and the Director of Big Data Technologies at Intel. Ziya, welcome back to theCube. >> Thanks for having me, Lisa. >> You have been, this is your first time coming to a WiDS event in person and your first year here. You are on the career panel. >> Yes. >> That's pretty cool. Tell us about, you just came from that career panel, tell us about that. What were some of the things that excited you? What are some of the things that surprised you in what you heard at that panel? >> So I think one thing that was really exciting is to see the passion from the audience, so many women excited with data science. And it was the future of what data science can bring. That's the most exciting part. And also, it's very exciting to get connected with so many women professionals. And in terms of, you know, surprise? I think it's a good surprise to see so much advancement in women development in data science. Comparing where we are and where we were two years ago, it's great to see so many woman speakers and leaders talking about their work in the data science space, applying data science to solve real business problems, to solve transportation problems, to solve education, healthcare problems. I think that's the happy surprise, you know, the fast advancement with woman development in this field. >> What were some of the things that you shared, maybe recommendations or advice. You've been in industry for a long time. You've been at Intel for quite a long time. What were some of the things that you felt important to share with the audience, those in-person here at Stanford which is about 400 plus, and those watching the live stream? >> Yeah, you know, Lisa, I provide career coaching actually for many women professionals at Intel and also from the industry. And a lot of them expressed an interest of getting into a data science field. And they ask me, what is the skillset that I need to develop in order to get into this field? I think first, you need to ask yourself, what kind of job you want to get into in this field. You know, there are marketing jobs, there are sales jobs. And even for technical jobs, there are data engineering type of jobs, data visualization, statistician, data science, or AI engineer, machine learning, deep learning engineer. So you have to ask yourself, what kind of job you want to move to and then assess your skillset gap. And work to close that gap. Another advice I give to many woman professionals is that data science appears to have a high bar today. And it may be too significant a jump to move from where you are to a data science field. You may want to move to adjacent field first. And to have a sense of what is it like to work in the data science field and also have more insights with what's going on. And then, to better prepare you for eventually moving into this field. >> Great advice and I think one of the things that jumped out at me was you talked about skillsets. And we often hear a lot of the technical skills, right, that are essential for a data scientist. But there's also softer skills, maybe it's more left brain, right brain, creativity, empathy, communication. Tell me, in your ascension to now the VP level at Intel, what are some of the other skills besides the technical skills that you find as data science as a field grows and infiltrates everything, what are some of those softer skills that you think are really advantageous? >> Great question. I think openness and collaboration are very important soft skills. Because as a data scientist, you need to work with data engineering teams. Because as a data scientist, you extract business insights from the data. But then you cannot work alone. You have to work with the data engineering team who prepares the data infrastructure, stores, and manages the data very efficiently for you to consume. You also have to work with domain experts. Let's say if you are applying data science solutions to solve a real business problem, let's say in a medical field. You need to work with a domain expert from the medical field so that you can tailor your solution towards, you know, addressing some medical problems. So you need to work with that domain expert who knows the business operations and processes in medical field really, really well. So I think that's, you know, collaboration is key. And of course you also want to collaborate maybe with academia and open source community where a lot of real innovations are happening. And you want to leverage the latest technology building blocks so that you can accelerate your data science application or solution advancement. So collaboration and openness are the key. >> Openness is a great one. I'm glad that you brought that up. We had another guest on talking about that earlier. In terms of being open, one, to not expecting, you know, in the scientific method, you go into it with a hypothesis and you think you know what you're going to find or you want to know, I want to find this. And you might not, and being open to going, okay, that's okay, I'm going to course correct. 'Cause failure in this sense is not a bad F word. But also being open to other opinions, other perspectives. That seems to be kind of a theme that we're hearing more about today, it's be willing to be open-minded. >> You know, that's an excellent point, Lisa. You know, I can share one example. When coming from an engineering background, when I first moved into this field, we always had the assumption that when we talk with your customers, they must be looking for something that's high performance. So our initial discussion with our customers centered around Intel product lineup that will give you the highest of performance for deep learning training or for analytics solution. But as we went deeper with the discussion, we realized that's not what customers are looking for in many cases. The fact is that many of them have collected a massive amount of data over the years. They have built analytics applications and you add on top of that. And so as the data representations get more complex, we want to extract more complex insights. That's the time they want to apply deep learning but to the existing application infrastructure. So they're looking for something, let's say deep learning capability, that can be easily integrated into the existing analytics solutions stack, into its existing infrastructure and reuse its existing infrastructure for lower cost of ownership. That's what they are looking for. And high performance is just nice to have. So once we are open-minded to that learning, that totally changed the conversation. Actually, in the last couple of years, we applied that learning and we have collaborated with top cloud service providers like Amazon, Microsoft, Google, and you know, Alibaba and Baidu and a few others to deploy Intel-based deep learning capabilities. Libraries, frameworks, into cloud so that, you know, more businesses and individuals can have access. But again, it's that openness. You truly need to understand what is the problem you are solving before simply just selling a technology. >> Absolutely, and that's one of the best examples of openness that's obviously in this case listening to customers. We think we know the problem that we need to solve and they're telling you, actually, it's not that. It's a nice to have, and you go, whoa, that changes everything! And it also changes, sounds like, the downstream collaboration that Intel knew we need to have in order to drive our business forward and help our customers in every industry do the same thing. >> Exactly, exactly. >> So a couple of things that I'd love to get your perspective on is the culture at Intel. You've been there a long time. What is that culture like in terms of maybe fueling or being a nice opportunity for bringing in this diversity that we so need in every industry? >> Yeah, you know, one thing I want to share, actually, just now during the panel discussion I shared this. I said Intel will be the first high tech company achieving full representation of women and under-represented minorities by the end of this year. >> Wow, by the end of 2018? >> Yes, we pulled in our timeline by two years. Yes, we're well on track for this year. >> Wow. >> To achieve that. And I personally, I like this quote from Brian Krzanich, our CEO, that if we want tech to define the future, we must be representative of that future. So in the last few years now, Intel has put great effort into hiring and retention for diversity. We also have put great effort for inclusion. We want to make sure our employees, every one of them, come to work, bring their full selves for the value add. We also invest in diverse entrepreneurs through Intel capital initiatives. And most importantly, we also partner with academia, universities, to build the pipeline for tech sectors. So we put a lot of effort and we committed about $300 million for closing the gap at the company but also for the high tech sector. So definitely we are very committed to the diversity and inclusion. But that doesn't mean that we only focus on this. And of course, we make sure that our people are bringing the right skillsets and we bring the most qualified people, you know, to do the job. >> On the pipeline front, one of the things I was reading recently is some of the challenges that organizations that are going to, say, college campuses to recruit, some of the missteps they might be taking in terms of if they're trying to bring more females info their organization in STEM roles, don't staff a booth with men, right? Or have the only females that are at a recruitment event be doing, handing out swag, or taking names. Obviously there's important roles to be had everywhere. But that was one of the things that seems to be, well what a simple thing to change. Just flip the model so that the pipeline, to your point, is fueling really what corporations like Intel want to achieve so that that future is really as inclusive and diverse as it should be. The second thing that you mentioned before we went live, from an Intel perspective, is you guys were challenged on the talent acquisition front. And so a few years ago, you started the Women in Big Data Forum to solve that problem. Tell us about that and what have you achieved so far? >> Great question. So you know, this is three or four years ago. And Intel, you know, because I manage the big data engineering organization within Intel, and we are working to hire some diversity talents. So we opened some racks and we look at our candidate pool. There were very few women, actually barely any women in the candidate pool. Again, yes, we always want to hire the most qualified people, but it also does not feel right that when you don't even have any diversity candidates in that pool. Even though we exhausted all possible options, even tried to bring the relevant diversity candidates into the pool. But it's very challenging. So then we reached out to a few industrial partners to see, is Intel the only company that had this problem or you have the same problem? It turned out everyone had the same problem. So yes, people value diversity, they all see the value. But it's very challenging to have a successful recruiting process for diversity. That's the time the few of us gathered together, we said, maybe there is something that we can do to support a stronger woman pipeline for future hiring. And it may take a couple of years, and it may take one year, but unless we start doing something today, we're going to talk about the same problem two years from now. >> Exactly. >> So then with sponsorship from our executive team, Doug Fisher, the Intel software analysis group GM, and also Michael Greene and a few others, we bring the team together, we started to look at networking opportunities, training opportunities. We worked with our industrial partners to offer many free training classes and we also start reaching out to universities to build the pipeline. And especially to motivate the female students to get passionate about big data, about analytics. So as of now, we have more than 2000 members globally for the forum and also we have many chapters. We have chapters along the West Coast in the Bay Area, also East Coast. We also have chapters in Europe and Asia so we're definitely seeing more and more women getting excited with big data and analytics. And also, we have great collaboration with women in data science at Stanford. >> Yeah and it sounds like the momentum, it doesn't sound like the momentum, you can feel it, right? You can feel it online with, I can see a Twitter stream in front of me on this monitor. People are getting involved in droves all across the globe and I said to Margot, I asked her earlier, Margot Gerritsen, one of the founders of WiDS, I said, first of all, you must be pleasantly pretty shocked at how quickly this has ascended. And she said yes, and I said, where do you go from here? And she said, it's really now going to be about getting involved with WiDS more frequently throughout the year. Also, kind of going up a funnel if you will, to high school students and starting to encourage them, excite them, and start that motivation track, if you will, even earlier. And I think that is, in terms to your point about we can't do anything if the pipeline isn't there to support it. One of the things that WiDS is aiming to do, and it sounds like what you're doing as well, similar to Women in Big Data Forum at Intel, is let's start creating a pipeline of women that are educated in the technical side and the software softer skill side that are interested and find their passion so that we can help motivate them, that you can do this. The sky's the limit where data science is concerned. >> Absolutely, absolutely. And it's great to see actually everybody recognize the value of building the pipeline and reaching out beyond the university students. Because have to get more and more girls getting into the science and tech sector. And we have to start from young. And I, yeah, totally agree, I think we really need to build our pipeline and a pipeline for our pipeline. >> Yes, exactly. And also that sort of sustaining momentum as women, you know, go in university and study STEM subjects, get into the field. Obviously retention is a big challenge that the tech industry and STEM fields alike have faced. But that retention, that motivation, and I think organizations like this, just with this, you can feel the passion when you walk into this alumni center at Stanford is really key. We thank you so much for carving out some time to share your insights and your career path and your recommendations on theCUBE and wish you continued success at Intel and with Women in Big Data Forum, which I'm sure we'll see you back at WiDS next year. >> Alright, thank you, thanks Lisa. >> Absolutely, my pleasure. We want to thank you, you have been watching theCUBE live from the Women in Data Science Conference 2018. Hashtag WiDS2018, join the conversation, get involved. I'm Lisa Martin from Stanford. Stick around, I'll be right back with John Furrier to do a wrap of the day. (outro electronic music)
SUMMARY :
Brought to you by Stanford. Welcome back to theCUBE, we are live at You are on the career panel. What are some of the things that I think that's the happy surprise, you know, What were some of the things that you shared, And then, to better prepare you the technical skills that you find And of course you also want to collaborate to not expecting, you know, in the scientific method, And so as the data representations get more complex, It's a nice to have, and you go, to get your perspective on is the culture at Intel. Yeah, you know, one thing I want to share, actually, Yes, we pulled in our timeline by two years. So in the last few years now, Intel has put great effort Just flip the model so that the pipeline, to your point, And Intel, you know, because I manage the big data for the forum and also we have many chapters. it doesn't sound like the momentum, you can feel it, right? And it's great to see actually everybody recognize just with this, you can feel the passion when you walk from the Women in Data Science Conference 2018.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Brian Krzanich | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Margot | PERSON | 0.99+ |
Alibaba | ORGANIZATION | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
Microsoft | ORGANIZATION | 0.99+ |
Europe | LOCATION | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
ORGANIZATION | 0.99+ | |
Lisa | PERSON | 0.99+ |
Ziya Ma | PERSON | 0.99+ |
Doug Fisher | PERSON | 0.99+ |
Asia | LOCATION | 0.99+ |
Intel | ORGANIZATION | 0.99+ |
John Furrier | PERSON | 0.99+ |
Baidu | ORGANIZATION | 0.99+ |
one year | QUANTITY | 0.99+ |
first year | QUANTITY | 0.99+ |
WiDS | ORGANIZATION | 0.99+ |
WiDS | EVENT | 0.99+ |
three | DATE | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
two years | QUANTITY | 0.99+ |
Michael Greene | PERSON | 0.99+ |
first time | QUANTITY | 0.99+ |
one | QUANTITY | 0.99+ |
Bay Area | LOCATION | 0.99+ |
more than 2000 members | QUANTITY | 0.99+ |
GM | ORGANIZATION | 0.99+ |
first | QUANTITY | 0.99+ |
today | DATE | 0.98+ |
Intel Corporation | ORGANIZATION | 0.98+ |
Ziya | PERSON | 0.98+ |
four years ago | DATE | 0.98+ |
next year | DATE | 0.98+ |
second thing | QUANTITY | 0.98+ |
Stanford | ORGANIZATION | 0.98+ |
about $300 million | QUANTITY | 0.98+ |
two years ago | DATE | 0.98+ |
end of this year | DATE | 0.97+ |
Stanford University | ORGANIZATION | 0.97+ |
ORGANIZATION | 0.97+ | |
one example | QUANTITY | 0.97+ |
third year | QUANTITY | 0.97+ |
about 100,000 people | QUANTITY | 0.97+ |
one thing | QUANTITY | 0.97+ |
Women in Data Science Conference 2018 | EVENT | 0.97+ |
WiDS 2018 | EVENT | 0.96+ |
this year | DATE | 0.96+ |
over 53 countries | QUANTITY | 0.96+ |
about 400 plus | QUANTITY | 0.96+ |
East Coast | LOCATION | 0.95+ |
Vice President | PERSON | 0.95+ |
WiDS2018 | EVENT | 0.95+ |
Women in Data Science Conference | EVENT | 0.94+ |
Software and Services Group | ORGANIZATION | 0.93+ |
One | QUANTITY | 0.93+ |
Big Data Technologies | ORGANIZATION | 0.93+ |
over 177 regions | QUANTITY | 0.93+ |
end of 2018 | DATE | 0.91+ |
few years ago | DATE | 0.89+ |
Covering Women in Data Science Conference 2018 | EVENT | 0.87+ |
years | DATE | 0.86+ |
West Coast | LOCATION | 0.86+ |
Nathalie Henry Riche, Microsoft Research | WiDS 2018
(light electronic music) >> Announcer: Live from Stanford University, in Paolo Alto, California, it's theCUBE. Covering Women in Data Science Conference, 2018. Brought to you by Stanford. >> Welcome back to theCUBE, I'm Lisa Martin. At Stanford University, we're here for the third annual Women in Data Science Conference. #WiDS2018, check it out, be part of the conversation, WiDS is in it's third year, but it's aiming to reach about a hundred thousand people this week alone. There's 177 regional WiDS events in 53 countries. This event here, the main event at Stanford, features key notes, technical vision talks, a career panel, and we're excited to be joined next by Dr. Nathalie Henry Riche. I did that in French. >> Yes. (laughs) Who is a researcher at Microsoft, and Natalie, first of all, welcome to theCUBE. >> Thank you, I'm really thrilled to be here. >> Yeah, you gave a technical vision talk on data visualization, and data driven's story telling. Share with our audience, some of the key messages, that the WiDS audience heard from you earlier today. >> Well, I guess, I gave two main messages. The first one is, that a visualization has two superpowers. >> Lisa: Superpowers? >> Superpowers. >> Tell me girl. The first one is enable you to kind of think about your data in a new way. So, just kind of form hypothesis, and answer questions you didn't even know, you had by your data. So, that's the first one. The second super power, is it's really useful to communicate information, and communicate with a large audience. Visualization helps you, kind of convey your point with data, to back it up. So, that's kind of the short one minute. >> I love that, super super hero, super power. So, WiDS is, as I mentioned at the intro, in its third year, and reaching, it's grown dramatically in such a short period of time. This is your first WiDS, and your first WiDS you are a speaker. What was is that attracted you to WiDS, and you went, yes I want to give some of my time to this, and come down from Seattle? >> Well, so I'm French originally, and my studies I did at engineering school, and it was one of three out of 300 men, right? >> Wow. >> So, I was requested a lot for women in computer science, and engineering. So, I actually really like it. Just meeting all of those people, talking about, you know, trying to bring more women in. Part of the job I'm doing is very creative, so, we're trying to come up with new ideas for visualization. I think having, you know, a wide range of people adds to the mix, and we get so many more exciting ideas. So, I really want to try to have more diverse group of people I can work with, and connect to, and so that's why that attracted me to here. >> Excellent, couple of things that you said I've heard a number of times today. The first one is, what Daniela went and shared, who's also a speaker, that often times, some of the few women in tech, and you mentioned being one of three in 300? Are asked to do a lot of other things. Did you find that, that, okay you're one of the few females, you're articulate, you like speaking, we want you to do all these things. >> Yes, and I say no a lot. (laughs) >> 'Cause I have kids, too. >> That's a skill, too. But yeah, it happens a lot. I think as we go further, it's going to be less and less happening. It's better in the end. So, it's kind of a service, I see it as a service to, you know, my field, and my company. But, at the same time, we'll also get a lot of benefits from it. But that said, I try to cut it down to a manageable level, so two hours flight from Seattle works great. >> Right, right, right. Another thing is that, that you mentioned the creativity. I've heard that a number of times, today from our guest Margot Gerritsen, was on as well. Tell me about your thoughts about being in this data science role, the need for creativity. How does, how it, why is that you might consider it, like a softer skill versus the technical skills. But, how important is that creativity in your job, for example? >> So, my job is really like researcher. Trying to have new ideas, and innovate for Microsoft in particular. So, I'm not really a data scientist, but I build the tools for a data scientist. So, knowing that, creativity is important because you need to kind of think out of the box. What is the next generation of tools that they will need? In turn, they need to think out of the box, kind of get more insight out of the data they're collecting. So, creativity is just like, pervasive to this whole data science thing. Problem solving as well, so you need a lot the left brain, and a lot of the right brain. Kind of both of them together. I think that having different cultures, and different genders, even different age ranges just, you know, makes you think out of the box. That's just what's happening. Discussing with people, I was discussing with someone in cosmology, and I was like, whoa. That brought up a lot of different ideas in me, so, to me, that's really critical part of what I'm doing every day. >> I like that, that kind of aligns to what one of our guests said earlier, and that is the thought diversity. Wow, I've never >> Yes. thought of thought diversity. But, you bring up a good point about it's not just about having women in the field, it's also having diversity, in terms of generations. One of the things that's, I think, pretty unique about WiDS, is it's not just about reaching young women in their first semester at University, for example. Maria Clavijo said that's the ideal time to really inspire. But, it's also reinvigorating women who've been in academia, or industry in stem subjects for a long time. So, you have, we have multiple generations, and to your point, that diversity is important, it's not just about gender, ethnicity. It's also about the diverse perspectives that come from being >> Exactly. from different generations. >> So, it's funny, 'cause I was giving this talk earlier, and it was, one part of it was about time line. When I was researching, you know how people draw time? Well there's, depending some culture, it goes from left to right, but some other culture it's front to back, back to front, right to left. So, we need to be aware of all of that, and it's so much easier to just have the people to converse with right in your office, or next door, to be aware of those. So, that's very important, especially to big companies, like Microsoft, 'cause of, you know, a lot of customers world wide. So, it's very important to just be immersed in that. >> Definitely. So, you have been published, you've got published research, and over 60 articles in leading venues, and human-computer interaction, and information visualization. But, something we chatted about off camera, was very intriguing about visualization and children. Tell me a little bit more about that. >> So, I happen to have two kids, you know, seven and four. I'm passionate about what I'm doing, and I just couldn't keep it out of their hands, right? So, I was just starting, you know, seeing what does my daughter learn at school, like, what does she learn in kindergarten? In fact, in kindergarten, I remember one day, she brought back candies, and I'm like did you get candies from school? She's like no, because we were doing a bar chart. I was like, what? (laughs) So, I was very intrigued in, you know, what do we teach, what do your kids learn? It was fascinating to see that, you know, from an early age, they learn how to do those visualizations. But, they don't really learn how you can lie with them, or you know, to kind of think critically about that. That, you know, maybe you can start your bar chart at two, and you know, you would have less candy, I guess. But, you could, kind of convey the wrong messages. So, I became passionate about this, and decided we need to just improve our teaching about how we can represent data, and how we can also misrepresent it. In the hope that for the next generation to come, they'll be able to look at a chart, and think critically about it. Whether or not it tells the right story with the right data. Kind of beyond, just picture's worth a thousand words, then I'm not going to think about it. >> Yeah. >> This is kind of my personal effort that I try to move myself forward. (chuckles) >> Well, it's so important about having that passion, and I think that's one of things that seems to be inherent about WiDS. Even, you know, yesterday seeing on the Twitter stream, WiDS New Zealand starting in five minutes, and it's been really focused on being so, kind of inclusive. Just sort of naturally, and one of the things that I learned in some of my prep for the show, is the bias that is still there, in data interpretation. You kind of talked about that, and I never really thought about it in that way. But, if a particular group of people is looking at a data set, and thinking it says this, and no other opinions, perspectives, thoughts are able to be incorporated to go, well, maybe it says this. >> Yeah. >> Then we're limiting ourselves in terms of one, the potential that the data has to, you know, help a business, create a new business model. But also, we're limiting our perspectives on making a massive social impact with data. >> Yeah, what I find very interesting is visualization often people think about it at the end of the spectrum. Like, I've collected my data, I analyze it, and now I need to pretty picture to kind of explain what I found. But, the most powerful use of visualization, I think, comes early on. Where you actually just collected your data, and you look at it before you run any statistical test. I did that not long ago with French air traffic data in the Hollands, I put them in, and I saw the little airplanes moving around. Then, what we saw, is one air planes doing loops like this. I was like, what is this going on, right? It was just a drone, doing like tests, right? But, somehow it got looped in into that data set. So, by looking at your data early on, you can detect what's wrong with the data. So then, when you actually run your statistical test, and your analysis, you better reflect what was that data in the first place, you know, what could go wrong there? So, I think inserting visualization early on is also critical to understand what we can really know, and do, and ask, about the data in the first place. >> So, it's kind of like, watching the story unfold, rather than going, we've done all this analysis here's the picture, the story is this. The story is, your sort of, turning it sort of page by page, it sounds like, and watching it, and interpreting it, as it's unfolding. >> Rethinking what you collected in the first place. Is that the right data you collected to answer the question you wanted to ask? Is it a good match or not? Then, rethink that, you know, collect new data, or the missing one, and then go on with your analysis. So, I think to me, it's really a thinking tool. >> It also sounds like another, we talked about the technical skills that had, obviously that a computer scientist, data scientist needs to have. But, there's other skills. Empathy, communication, collaboration. Sounds like also, there needs to be an ideal kind of skill set, it has to include open mindedness. >> Yes. >> Tell me a little bit about some of your experiences there, and not being married to, the data must say this. So, if it doesn't, I'm not going to look anywhere else. Where is open mindedness, in terms of being a critical skill set that needs to come to the field? >> Yeah, I mean we, that's that is totally a re-critical point. Think already, when you're collecting the data, especially as a scientist, when I run experiment, I kind of know what I want to find. Sometimes, you don't find it. You need to kind of embrace it. But, it's hard to have because sometimes, it's like those unconscious bias you have. Like, you're not really necessarily controlling them, and just the way you collected the data in the first place, maybe just, you know, skewed your result. So, it's very important to kind of think ahead of time of all of those bias you could have, and think about all of what could go wrong. Often, the scientific process is actually that trying to think about all of the stuff that could go wrong, and then check whether or not they're wrong. We're trying to infuse that, a little bit over Microsoft as well, kind of, you know, the data that we collect, can we analyze them, can we have teams of people who really think is that the right data? Are we collecting like, world-wide for example? Are we just collecting from the US? So, there's a lot of those, kind of, ethical, and bias, kind of training, and effort to try and remove that. The maximum from our work, and I think that it's across the entire world. I think, with all of this data collection everywhere, we kind of have to do that, very consciously. >> I think two things kind of speak to me that out of what you just said, that we've heard a number of times today. One, that failure, and I don't mean to say that failure is not a bad thing. That's how you, >> That's how you learn, Exactly, >> and grow. Exactly, in many ways it's not a bad F-word, it's this is how everybody that's successful got to wherever they are. But, it's also about embracing, as you said, the word embracing, embracing the fact that you might be bring bias into this, and you have to be okay with maybe this is the wrong data set. If you consider that a failure, consider it, to your point, a growth opportunity. That is one of the themes that we've heard today, and you've, kind of, elaborated on that. The second one is, be okay getting uncomfortable, get out of that comfort zone. Consciously uncomfortable, because when you're able to do that, the possibilities are limitless. >> Yes, and that's what I try to do everyday, 'cause I try to push all of the software that we're doing, and Microsoft is so big, you know, and all of those software are like so there. (laughs) So trying to come up with new ideas, like so many are failures, you know. Oh they won't make money, or they don't actually work when you, you know, for this population. So, most of my work is failure. (laughs) But hey, one success when you know why, and I'm happy about it. >> Exactly, but it's just charting that course to getting to the ah, this is the pot of gold at the end of the rainbow. Well Nathalie, thank you so much for taking some time to talk with us on theCUBE, and sharing your stories. Congratulations on being a speaker, your first WiDS, and we look forward to seeing you back next year. >> Thank you very much. >> We want to thank you for watching theCUBE. I'm Lisa Martin, live from WiDS 2018 at Stanford University. Stick around, I'll be back with my next guest after a short break. (light electronic music)
SUMMARY :
Brought to you by Stanford. #WiDS2018, check it out, be part of the conversation, and Natalie, first of all, welcome to theCUBE. that the WiDS audience heard from you earlier today. The first one is, that a visualization has two superpowers. and answer questions you didn't even know, and you went, yes I want to give some of my time to this, I think having, you know, a wide range of people and you mentioned being one of three in 300? Yes, and I say no a lot. to, you know, my field, and my company. Another thing is that, that you mentioned the creativity. just, you know, makes you think out of the box. and that is the thought diversity. and to your point, that diversity is important, from different generations. and it's so much easier to just have the people So, you have been published, you've got published research, So, I happen to have two kids, you know, seven and four. This is kind of my personal effort Even, you know, yesterday seeing to, you know, help a business, create a new business model. and you look at it before you run any statistical test. So, it's kind of like, watching the story unfold, Is that the right data you collected Sounds like also, there needs to be So, if it doesn't, I'm not going to look anywhere else. and just the way you collected the data in the first place, that out of what you just said, and you have to be okay and Microsoft is so big, you know, and we look forward to seeing you back next year. We want to thank you for watching theCUBE.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Daniela | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Maria Clavijo | PERSON | 0.99+ |
Nathalie | PERSON | 0.99+ |
Microsoft | ORGANIZATION | 0.99+ |
two hours | QUANTITY | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Nathalie Henry Riche | PERSON | 0.99+ |
two kids | QUANTITY | 0.99+ |
seven | QUANTITY | 0.99+ |
four | QUANTITY | 0.99+ |
Lisa | PERSON | 0.99+ |
Natalie | PERSON | 0.99+ |
Seattle | LOCATION | 0.99+ |
300 | QUANTITY | 0.99+ |
one minute | QUANTITY | 0.99+ |
next year | DATE | 0.99+ |
third year | QUANTITY | 0.99+ |
both | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
yesterday | DATE | 0.99+ |
today | DATE | 0.99+ |
two main messages | QUANTITY | 0.99+ |
first semester | QUANTITY | 0.99+ |
WiDS | EVENT | 0.99+ |
over 60 articles | QUANTITY | 0.99+ |
five minutes | QUANTITY | 0.99+ |
one | QUANTITY | 0.99+ |
three | QUANTITY | 0.99+ |
US | LOCATION | 0.99+ |
first one | QUANTITY | 0.99+ |
two | QUANTITY | 0.98+ |
53 countries | QUANTITY | 0.98+ |
Stanford University | ORGANIZATION | 0.98+ |
one part | QUANTITY | 0.98+ |
#WiDS2018 | EVENT | 0.98+ |
One | QUANTITY | 0.98+ |
Paolo Alto, California | LOCATION | 0.97+ |
Stanford | ORGANIZATION | 0.97+ |
two things | QUANTITY | 0.97+ |
this week | DATE | 0.96+ |
WiDS 2018 | EVENT | 0.95+ |
first place | QUANTITY | 0.95+ |
Stanford | LOCATION | 0.95+ |
Hollands | LOCATION | 0.94+ |
two superpowers | QUANTITY | 0.94+ |
second super power | QUANTITY | 0.93+ |
Microsoft Research | ORGANIZATION | 0.92+ |
300 men | QUANTITY | 0.92+ |
about a hundred thousand people | QUANTITY | 0.92+ |
New Zealand | LOCATION | 0.91+ |
177 regional | QUANTITY | 0.9+ |
second one | QUANTITY | 0.89+ |
Women in Data Science Conference | EVENT | 0.89+ |
Covering | EVENT | 0.88+ |
one day | QUANTITY | 0.87+ |
earlier today | DATE | 0.85+ |
WiDS | COMMERCIAL_ITEM | 0.84+ |
ORGANIZATION | 0.82+ | |
WiDS | ORGANIZATION | 0.79+ |
things | QUANTITY | 0.77+ |
one air | QUANTITY | 0.74+ |
Stanford University | LOCATION | 0.7+ |
French | LOCATION | 0.7+ |
thousand words | QUANTITY | 0.69+ |
Vijay Raghavendra, Walmart Labs | WiDS 2018
>> Narrator: Live from Stanford University in Palo Alto, California, it's the CUBE! Covering, Women in Data Science Conference 2018, brought to you by Stanford. >> Welcome back to the CUBE, we are live at Stanford University, we've been here all day at the third annual Women in Data Science Conference, WiDS 2018. This event is remarkable in its growth in scale, in its third year, and that is, in part by the partners and the sponsors that they have been able to glean quite early on. I'm excited to be joined by Vijay Raghavendra, the senior vice president of Merchant Technology and stores as well, from Walmart Labs. Vijay, welcome to the CUBE! >> Thank you, thank you for having me. >> Walmart Labs has been paramount to the success of WiDS, we had Margot Gerritsen on earlier, and I said, "How did you get the likes of a Walmart Labs as a partner?" And, she was telling me that, the coffee-- the coffee shop conversation >> Yeah, the Coupa Cafe! >> That she had with Walmart Labs a few years ago, and said, "Really, partners and sponsors like Walmart have been instrumental in the growth and the scale, of this event." And, we've got the buzz around, so we can hear the people here, but this is the big event at Stanford. There's 177 regional events, 177! In 53 countries. It's incredible. Incredible, the reach. So, tell me a little bit about the... From Walmart Labs perspective, the partnership with WiDS, what is it that really kind of was an "Aha! We've got to do this"? >> Yeah, it's just incredible, seeing all of these women and women data scientists here. It all started with Esteban Arcaute, who used to lead data science at Walmart Labs, and Search, before he moved on to Facebook with Margot. And, Karen in the cafe in Palo Alto, in 2015, I think. And Esteban and I had been talking about how we really expand the leverage of data and data science within Walmart, but more specifically, how we get more women into data science. And, that was really the genesis of that, and, it was really-- credit goes to Esteban, Margot, and Karen for, really, thinking through it, bringing it together, and, here we are. >> Right, I mean bringing it together from that concept, that conversation here at Stanford Cafe to the first event was six months. >> Yeah, from June to November, and, it's just incredible the way they put it together. And, from a Walmart Labs perspective, we were thrilled to be a huge part of it. And, all the way up the leadership chain there was complete support, including my boss Jeremy King, who was all in, and, that really helped. >> Margot was, when we were chatting earlier, she was saying, "It's still sort of surprising," and she said she's been, I think in, in the industry for, 30-plus years, and she said that, she always thought, back in the day, that by the time she was older, this problem would be solved, this gender gap. And she says, "Actually, it's not like it's still stagnant," we're almost behind, in a sense. When I look at the ... women that are here, in Stanford, and those that are participating via those regional events, the livestream that WiDS is doing, as well as their Facebook livestream. You know, the lofty goal and opportunity to reach 100,000 people shows you that there's clearly a demand, there's a need for this. I'd love to get your perspective on data science at Walmart Labs. Tell me a little bit about the team that you're leading, you lead a team of engineers, data scientists, product managers, you guys are driving some of the core capabilities that drive global e-commerce for Walmart. Tell me about, what you see as important for that female perspective, to help influence, not only what Walmart Labs is doing, but technology and industry in general. >> Yeah. So, the team I lead is called Merchant Technology, and my teams are responsible for, almost every aspect of what drives merchandising within Walmart, both on e-commerce and stores. So, within the purview of my teams are everything from the products our customers want, the products we should be carrying either in stores or online, to, the product catalog, to search, to the way the products are actually displayed within a store, to the way we do pricing. All of these are aspects of what my teams are driving. And, data and data science really put me at every single aspect of this. And the reason why we are so excited about women in data science and why getting that perspective is so important, is, we are in the retail business, and our customers are really span the entire spectrum, from, obviously a lot of women shop at Walmart, lot of moms, lot of millennials, and, across the entire spectrum. And, our workforce needs to reflect our customers. That's when you build great products. That's when you build products that you can relate to as a customer, and, to us that is a big part of what is driving, not just the interest in data science, but, really ensuring that we have as diverse and as inclusive a community within Walmart, so we can build products that customers can really relate to. >> Speaking of being relatable, I think that is a key thing here that, a theme that we're hearing from the guests that we're talking to, as well as some of the other conversations is, wanting to inspire the next generation, and helping them understand how data science relates to, every industry. It's very horizontal, but it also, like a tech company, or any company these days is a tech company, really, can transform to a digital business, to compete, to become more profitable. It opens up new business models, right, new opportunities for that. So does data science open up so many, almost infinite opportunities and possibilities on the career front. So that's one of the things that we're hearing, is being able to relate that to the next generation to understand, they don't have to fit in the box. As a data scientist, it sounds like from your team, is quite interdisciplinary, and collaborative. >> And, to us that is really the essence of, or the magic of, how you build great products. For us data science is not a function that is sitting on the side. For us, it is the way we operate as we have engineers, product managers, folks from the business teams, with our data scientists, really working together and collaborating every single day, to build great products. And that's, really how we see this evolving, it's not as a separate function, but, as a function that is really integrated into every single aspect of what we do. >> Right. One of the things that we talked about is, that's thematic for WiDS, is being able to inspire and educate data scientists worldwide, and obviously with the focus of helping females. But it's not just the younger generation. Some of the things that we're also hearing today at WiDS 2018 is, there's also an opportunity within this community to reinvigorate the women that have been in, in STEM and academia and industry for quite a while. Tell me a little bit more about your team and, maybe some of the more veterans and, how do you kind of get that spirit of collaboration so that those that, maybe, have been in, in the industry for a while get inspired and, maybe get that fire relit underneath them. >> That's a great question, because we, on our teams, when you look across all the different teams across different locations, we have a great mix of folks that bring very different, diverse experiences to the table. And, what we've found, especially with the way we are leveraging data, and, how that is invigorating the way we are... How people come to the table, is really almost seeing the art of what is possible. We are able to have, with data, with data science, we are able to do things that, are, really step functions in terms of the speed at which we can do things. Or, the- for example, take something as simple as search, product search, which is one of the, capabilities we own, or my team is responsible for, but, you could build the machine learning ranking, and, relevance and ranking algorithms, but, when you combine it with, for example, a merchant that really fundamentally understands their category, and you combine data science with that, you can accelerate the learning in ways that is not possible. And when folks see that, and see that in operation that really opens up a whole, slew of other ideas and possibilities that they think about. >> And, I couldn't agree more. Looking at sort of the skillset, we talk a lot about, the obvious technical skillset, that a data scientist needs to have, but there's also, the skills of, empathy, of communication, of collaboration. Tell me about your thoughts on, what is an ideal mix, of skills that that data scientist, in this interdisciplinary function, should have. >> Yeah, in fact, I was talking with a few folks over lunch about just this question! To me, some of the technical skills, the grounding in math and analytics, are table stakes. Beyond that, what we look for in data scientists really starts with curiosity. Are they really curious about the problems they're trying to solve? Do they have tenacity? Do they settle for the more obvious answers, or do they really dig into, the root cause, or the root, core of the problems? Do they have the empathy for our customers and for our business partners, because unless you're able to put yourself in those shoes, you're going to be approaching at, maybe, in somewhat of an antiseptic way? And it doesn't really work. And the last, but one of the most important parts is, we look for folks who have a good sense for product and business. Are they able to really get into it, and learn the domain? So for example, if someone's working on pricing, do they really understand pricing, or can they really understand pricing? We don't expect them to know pricing when they come in, but, the aptitude and the attitude is really, really critical, almost as much as the core technical skills, because, in some ways, you can teach the technical skills, but not some of these other skills. >> Right, and that's an interesting point that you bring up, is, what's teachable, and, I won't say what's not, but what might be, maybe not so natural for somebody. One of the things, too, that is happening at WiDS 2018 is the first annual Datathon. And, Margot was sharing this huge number of participants that they had and they set a few ground rules like wanting the teams to be 50% female, but, tell us about the Datathon from your global visionary sponsorship level; what excites you about that in terms of, the participation in the community and the potential of, "Wow, what's next"? >> Yeah... So, it's hugely exciting for us, just seeing the energy that we've seen. And, the way people are approaching different problems, using data to solve very different kinds of problems ... across the spectrum. And for us, that is a big part of what we look for. For us it is really about, not just coming up with a solution, that's in search of a problem, but really looking at real-world problems and looking at it from the perspective of, "Can I bring data, can I bring data science to bear on this problem?", to solve it in ways that, either are not possible, or can accelerate the way we would solve the problems otherwise. And that is a big part of what is exciting. >> Yeah, and the fact that the impact that data science can make to, every element of our lives is, like I said before, it's infinite, the possibilities are infinite. But that impact is something that, I think, how exciting to be able to be in an industry or a field, that is so pervasive and so horizontal, that you can make a really big social impact. One of they other things, too, that Margot said. She mentioned that the Datathon should be fun, and I loved that, and also have an element of creativity. What's that balance of, creativity in data science? Like, what's the mixture, because we can be maybe over-creative, and maybe interpret something that's in a biased way. What is your recommendation on how much creativity can creep into, and influence, positively, data science? >> Yeah, that's a great question, and there's no perfect answer for it. Ultimately, at least my biases towards using data and data science to, solve real problems. And... As opposed to, pure research, so our focus very much is on applied learning, and applied science. And, to me, within that, I do want the data science to be creative, data scientists to be creative, because, by putting too many guardrails, you limit the way in which they would explore the data, that they may come up with insights that, well, we might not see otherwise. And, which is why, I go back to the point I made, when you have data scientists who fundamentally understand a business, and the business problems we are trying to solve, or the business domains, I think they can then come up with very interesting, innovative ways of looking at the data, and the problem, that you might not otherwise. So, I would by no means want to limit their creativity, but I do have a bias towards ensuring that it is focused on problems we are trying to solve. >> Excellent. Well, Vijay, thank you so much for stopping by the CUBE, congratulations on the continued success of the partnership with WiDS and, we're looking forward to seeing what happens the rest of the year, and we'll probably see you next year at WiDS 2019! >> Absolutely, thank you! >> Excellent, we want to thank you, you're watching the CUBE, live from Stanford University, the third annual Women in Data Science Conference. I am Lisa Martin, I'll be right back after a short break with my next guest. (cool techno music)
SUMMARY :
in Palo Alto, California, it's the CUBE! in part by the partners and the sponsors and the scale, of this event." And, Karen in the cafe in Palo Alto, to the first event was six months. And, all the way up the leadership chain back in the day, that by the time she was older, the product catalog, to search, from the guests that we're talking to, or the magic of, how you build great products. One of the things that we talked about is, is really almost seeing the art of what is possible. Looking at sort of the skillset, and learn the domain? and the potential of, "Wow, what's next"? and looking at it from the perspective of, Yeah, and the fact that the impact and the business problems we are trying to solve, of the partnership with WiDS and, the third annual Women in Data Science Conference.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Esteban | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Jeremy King | PERSON | 0.99+ |
Margot | PERSON | 0.99+ |
Karen | PERSON | 0.99+ |
Walmart | ORGANIZATION | 0.99+ |
Vijay Raghavendra | PERSON | 0.99+ |
2015 | DATE | 0.99+ |
Palo Alto | LOCATION | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Vijay | PERSON | 0.99+ |
Walmart Labs | ORGANIZATION | 0.99+ |
June | DATE | 0.99+ |
50% | QUANTITY | 0.99+ |
100,000 people | QUANTITY | 0.99+ |
November | DATE | 0.99+ |
53 countries | QUANTITY | 0.99+ |
Esteban Arcaute | PERSON | 0.99+ |
Stanford | LOCATION | 0.99+ |
177 | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
177 regional events | QUANTITY | 0.98+ |
next year | DATE | 0.98+ |
third year | QUANTITY | 0.98+ |
six months | QUANTITY | 0.98+ |
first event | QUANTITY | 0.98+ |
one | QUANTITY | 0.98+ |
WiDS 2018 | EVENT | 0.98+ |
WiDS | ORGANIZATION | 0.98+ |
CUBE | ORGANIZATION | 0.98+ |
30-plus years | QUANTITY | 0.98+ |
ORGANIZATION | 0.97+ | |
both | QUANTITY | 0.97+ |
Stanford | ORGANIZATION | 0.97+ |
Stanford University | ORGANIZATION | 0.97+ |
Datathon | EVENT | 0.96+ |
Women in Data Science Conference | EVENT | 0.95+ |
Merchant Technology | ORGANIZATION | 0.95+ |
One | QUANTITY | 0.95+ |
Stanford Cafe | LOCATION | 0.93+ |
WiDS | EVENT | 0.92+ |
today | DATE | 0.9+ |
WiDS 2019 | EVENT | 0.9+ |
Search | ORGANIZATION | 0.89+ |
Women in Data Science Conference 2018 | EVENT | 0.89+ |
few years ago | DATE | 0.83+ |
Stanford University | ORGANIZATION | 0.8+ |
single aspect | QUANTITY | 0.79+ |
annual | QUANTITY | 0.7+ |
first annual | QUANTITY | 0.69+ |
moms | QUANTITY | 0.67+ |
every single day | QUANTITY | 0.67+ |
third | EVENT | 0.66+ |
parts | QUANTITY | 0.66+ |
Coupa Cafe | ORGANIZATION | 0.64+ |
third annual | QUANTITY | 0.62+ |
Covering, | EVENT | 0.58+ |
CUBE | EVENT | 0.42+ |
Dawn Woodard, Uber | WiDS 2018
>> Announcer: Live from Stanford University in Palo Alto, California, it's theCUBE! Covering Women In Data Science Conference 2018. Brought to you by-- >> Coverage of Women in Data Science 2018. I am Lisa Martin. We're at Stanford University. This is where the big in-person event is, but there are more than 177 regional WiDS events going on around the globe today. They are in 53 countries, and they're actually expecting to have about 100,000 people engaged with WiDS 2018. Pretty awesome. I'm joined by one of the speakers for WiDS 2018, Dawn Woodard, the senior data science manager of maps at Uber. Welcome to theCUBE! >> Thank you so much, Lisa. >> It's exciting to have you here. This is your first WiDS, and you are already a speaker. Tell us a little bit about what attracted you to WiDS. What was it that kind of spoke to you as a female leader in data science? >> Well, I tried to do a fair amount of reach-out to women in data science. I really feel like I've been blessed throughout my career with inspiring female mentors, including my mother, for example. Not every woman comes into her career with that kind of mentorship, so I really wanted to reach out and help provide that to some of the younger folks in our community. >> That's fantastic. One of the things that's remarkable about WiDS, one, is the growth and scale that they've achieved reaching such big, broad audiences in such a short time period. But it's also from a thematic perspective, aiming to inspire and to educate data scientists worldwide, and of course, to support females in that. What are some of the, tell us a little bit about your talk is Dynamic Pricing and Matching in Ride Sharing. What are some of the takeaways that the audience watching the livestream and here in person are going to hear from your talk? >> There are two technical takeaways, and then there's one non-technical takeaway. The first technical takeaway is that the matching algorithms that we use are really designed to reduce the amount of time that riders and drivers have to spend waiting in the app. For drivers, that means that we're working to increase the amount of time that they spend on-trip and getting paid. For riders, that means that we're reducing the amount of time that they have to wait to be picked up by a car. That's the first takeaway. The second takeaway is around dynamic pricing, and why it's important in ride-hailing services in particular. It turns out that it's really important in creating a seamless and reliable experience, both for riders and for drivers, so I talk through the technical reasons for that. Interestingly, these technical arguments are based not just on machine learning and statistics, but also on economic analyses and some optimization concepts. The third takeaway is really that data science is this incredibly interdisciplinary environment in which we have economics, statistics, optimization, machine learning, and more. >> It's really, data sciences has the opportunity, or really is, very horizontal. Every sector, every area of our lives is impacted by it. I mean, we think of all of us that use Uber and ride-sharing apps. I think that's one of the neat things that we're hearing from the event and from the speakers like yourself is these demarcated lines of career paths are blurring, or some of 'em are evaporating. And so, I think having the opportunity to talk to the younger generation, showing them how much impact they can make in this field has got to sort of be maybe, I would even guess, invigorating for you, as someone who's been in the tech in both industry and academia for a while. >> Absolutely. I think about data science as being the way that we learn about the world, statistics and data science. So, how do we use data to learn about the world, and how do we use data to improve, to make great products, to make great apps, for example. >> Exactly. Tell me a little bit about your career path. You have your PhD in statistics from Duke University. Tell me about how you got there, and then how you also got into industry. Were you always a STEM fan as a kid, or was it something that you had a passion for early on, or developed over time? >> I was always passionate about math and science. When I was an undergraduate, I did an internship with a defense contractor. That's how I got interested in machine learning in particular. That's where it took off. I decided to get a PhD in statistics from there. Statistics and machine learning are really closely related. And then, continued down that path throughout my academic career, and now my career in tech. >> What are some of the things that you think that prepared you for a being a female leader? Was it those mentors that you mentioned before? Was it the fact that you just had a passion for it and thought, "If I'm one of the only females in the room, I don't care. "This is something that's interesting to me." What were some of those foundational elements that really guided you? >> One is the inspiration of some women in my life, and if we have to be completely honest, I'm a person who, when, the very rare times in my career when somebody has acted like I couldn't hack it or couldn't make it, it always really got me angry. The way that I channeled that was really to turn it around and to say, "No problem. "I'm going to show you that I can go well beyond "anything that you had conceived of." >> You know, I love that you said that, 'cause Margot Gerritsen, one of the founders of WiDS actually said a couple hours ago, a few years ago, when they had this idea, from concept to first conference was six months, and she said she almost thought of it like a revenge conference. Like, "We can do this!" I think it's kind of, when they had this idea in 2015, the fact that even in 2015, there's still not only demand for, but the demand is growing. As we're seeing, the statistics that show a low percentage of women that have degrees in engineering, I want to say 20%, but only 11% of them are actually working in their field. We still have a lot of work to do to ignite the fire in this next generation of prospective leaders in technology. There's still a lot of groundwork to make up there. I think we're hearing that a lot at WiDS. Are you hearing that in your peer groups as well? >> Absolutely. I think one of the things that I've really focused on is mentoring women as leaders and managers within my organization, and I really find that that's an amazing way to reach out, is not just to reach out myself, but also to do that through female leaders in my own organization. For example, I've mentored and managed two women through the transition from individual contributor to manager. Just watching their trajectory afterwards is incredibly inspiring. But then, of course, those female managers bring in additional female contributors, and it grows from there. >> Right. And you have a pretty good, pretty diverse team at Uber. Tell us a little bit about your rise at Uber. One of the things that I saw on your LinkedIn profile, that you achieved pretty quickly in the first three years, or probably less, was that you led the marketplace data science team through a period of transformative growth. You started that team with 10 data scientists, and by the time you transitioned into your next role, there were 49 data scientists, including seven managers. How were you able to come in and make such a big impact so quickly? >> Well, the whole team chipped in in terms of hiring and reaching out. But at the time when I joined Uber, data science was still relatively small. Those 10 people were being asked to do all of the pricing and matching algorithms, all of the data science for Uber Pool, all of the data science for Uber Eats. We just had one person in each of these areas, and those people very quickly stepped up to the plate and said, "Okay, I need help." We worked together to help grow their teams. It's really a collaborative effort involving the whole team. >> The current team that you're managing, what does that look like from a male/female ratio standpoint? >> The current team is more than 50% female at this point, which is something that I'm really proud of. It's definitely not only my achievement. There was a manager who was leading the team just before I switched to leading maps, and that person also helped increase the presence of women in data science for Uber's mapping organization. The first data scientist on maps at Uber was a woman, actually. >> That's fantastic. And you were saying before we went live that there's a good-sized contingent of women data scientists at Uber today that are participating in WiDS up in San Francisco? >> That's right, yes. We're live-streaming it. There's a Women in Data Science organization at Uber, and that organization is sponsoring the internal events for the live stream, not just for my talk, but really, the whole conference. >> That's one of the things that Margot Gerritsen was also saying, that from a timing perspective, they really knew they were on to something pretty quickly, and being able to take advantage of technology, live streaming, they're also doing it on Facebook, gives them that opportunity to reach a bigger audience. It also is, for you and your peers as speakers, gives you an even bigger platform to be able to reach that audience. But one of the things I find interesting about WiDS is it's not just the younger audience. Like Maria Klawe had said in her opening remarks this morning and before, that the optimal time that she's found of reaching women to get them interested in STEM subjects is first year college, first semester of college. I actually had the same exact experience many years ago, and I didn't realize that was a timing that was actually proven to be the most successful. But it's not just young women at that stage of their university career. It's also those who've been in tech, academia, and industry for a while who, we're hearing, are feeling invigorated by events like WiDS. Do you feel the same? Is this something that just sort of turns up that bunsen burner maybe a little bit higher? >> Oh, it's incredibly empowering to be in a room full of such technically powerful women. It's a wonderful opportunity. >> It really is, and I think that reinvigoration is key. Some of the things like, as we look at what you've already achieved at Uber so far, and we're in 2018, what are some of the things that you're looking forward to your team helping to impact for Uber in 2018? >> In 2018, we're looking to magnify the impact of data science within Uber's mapping organization, which is my main focus right now. Maps at Uber does several things. Think of Uber as being a physical logistics platform. We move people and things from point A to point B. Maps, as our physical world, really impacts every aspect of the user experience, both for riders and for drivers. And then, whenever we're making a dispatch decision or a pricing decision, we need to know something about how long it would take this driver to get to this rider, for example, which is really a mapping prediction. We are looking at increasing the presence of data science within the mapping organization, really bringing that perspective to the table, both at the individual contributor level, but really also growing leadership of data science within the mapping organization so that we can help drive the direction of maps at Uber through data-driven insights. >> Data-driven insights, I'm glad that you brought that up. That's something that, as we talk about data science. Data science is helping to make decisions on policy, healthcare, so many different things, you name it. It really seems like these blurred lines of job categories, as businesses use data science, and even Uber, to extend, grow the business, open new business models, so can the next generation leverage data science to just open up this infinite box, if you will, of careers that they can go into and industries they can impact by having this foundation of data science. >> Absolutely. Well, any time we have to make a decision about what direction we go in, right, as a business, for example, as an organization, then doing that starting from data, understanding what is the world really like, what are the opportunities, what are the places in which we as a company are not doing very well, for example, and can make a simple change and get an incredible impact? Those are incredibly powerful insights. What do you think, last question-ish, 'cause we're getting low on time. We talk a lot about, there's the hard skills/soft skills. Soft is kind of a weird word these days to describe that. You know, statistical analysis, data mining. But there's also this, the softer skills, empathy, things like that. How do you find those two sides, maybe it's right brain/left brain, as being essential for people to become well-rounded data scientists? >> The couple of soft skills that I really look for heavily when I'm hiring a data scientist, one is being really focused on impact, as opposed to focused on building a new shiny thing. That's quite a different approach to the world, and if we stay focused on the product that we're creating, that means that we're willing to chip in, even if the work that's being done is not as glamorous, or is not going to get as much attention, or is not as fancy of a model. We can really stay focused on what are some simple approaches that we can use that can really drive the product forward. That kind of impact focus, and also, that great attitude about being willing to chip in on something, even if it's not that fancy or if I'm not going to get in the limelight for doing this. Those are the kinds of soft skills that really are so critical for us. >> Attitude and impact. I've heard impact a number of times today. Dawn, thank you so much for carving out some time to chat with us on theCUBE. We congratulate you on being a speaker at this year's event, and look forward to talking to you next year. >> Thank you, Lisa. >> We want to thank you for watching theCUBE. We are live at Stanford for the third annual Women in Data Science Conference, hashtag #WiDS2018. Get involved in the conversation. It is happening in over 53 countries. After this short break, I will be right back with my next guest. (fast electronic music)
SUMMARY :
Brought to you by-- and they're actually expecting to have about 100,000 people It's exciting to have you here. to women in data science. and here in person are going to hear from your talk? that they have to wait to be picked up by a car. and from the speakers like yourself the way that we learn about the world, and then how you also got into industry. I decided to get a PhD in statistics from there. What are some of the things that you think "I'm going to show you that I can go well beyond You know, I love that you said that, and I really find that that's an amazing way and by the time you transitioned into your next role, all of the data science for Uber Pool, and that person also helped increase And you were saying before we went live and that organization is sponsoring the internal events that the optimal time that she's found Oh, it's incredibly empowering to be Some of the things like, really bringing that perspective to the table, to just open up this infinite box, if you will, the softer skills, empathy, things like that. that can really drive the product forward. and look forward to talking to you next year. We are live at Stanford for the third annual
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Lisa Martin | PERSON | 0.99+ |
Uber | ORGANIZATION | 0.99+ |
2015 | DATE | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Dawn Woodard | PERSON | 0.99+ |
Maria Klawe | PERSON | 0.99+ |
2018 | DATE | 0.99+ |
20% | QUANTITY | 0.99+ |
San Francisco | LOCATION | 0.99+ |
49 data scientists | QUANTITY | 0.99+ |
Lisa | PERSON | 0.99+ |
Duke University | ORGANIZATION | 0.99+ |
10 data scientists | QUANTITY | 0.99+ |
next year | DATE | 0.99+ |
10 people | QUANTITY | 0.99+ |
Dawn | PERSON | 0.99+ |
second takeaway | QUANTITY | 0.99+ |
first takeaway | QUANTITY | 0.99+ |
WiDS | ORGANIZATION | 0.99+ |
two sides | QUANTITY | 0.99+ |
11% | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
two women | QUANTITY | 0.99+ |
seven managers | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
two technical takeaways | QUANTITY | 0.99+ |
one | QUANTITY | 0.99+ |
first year | QUANTITY | 0.99+ |
one person | QUANTITY | 0.99+ |
each | QUANTITY | 0.99+ |
53 countries | QUANTITY | 0.99+ |
WiDS | EVENT | 0.99+ |
first semester | QUANTITY | 0.99+ |
six months | QUANTITY | 0.99+ |
both | QUANTITY | 0.99+ |
more than 50% | QUANTITY | 0.99+ |
first three years | QUANTITY | 0.98+ |
WiDS 2018 | EVENT | 0.98+ |
first conference | QUANTITY | 0.98+ |
first technical takeaway | QUANTITY | 0.98+ |
ORGANIZATION | 0.98+ | |
ORGANIZATION | 0.98+ | |
third takeaway | QUANTITY | 0.98+ |
today | DATE | 0.97+ |
more than 177 regional | QUANTITY | 0.97+ |
Stanford | LOCATION | 0.97+ |
about 100,000 people | QUANTITY | 0.97+ |
#WiDS2018 | EVENT | 0.96+ |
over 53 countries | QUANTITY | 0.95+ |
one non-technical takeaway | QUANTITY | 0.95+ |
Stanford University | ORGANIZATION | 0.94+ |
Women in Data Science 2018 | EVENT | 0.94+ |
One | QUANTITY | 0.92+ |
this year | DATE | 0.92+ |
Dynamic Pricing and Matching in Ride Sharing | TITLE | 0.89+ |
Covering Women In Data Science Conference 2018 | EVENT | 0.89+ |
Uber Pool | ORGANIZATION | 0.88+ |
Daniela Witten, University of Washington | WiDS 2018
(energetic music) >> Announcer: Live, from Stanford University in Palo Alto, California, it's The Cube, covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Welcome back to The Cube. We are live at Stanford University at the third annual Women in Data Science Conference. I am Lisa Martin. We've had a really exciting day so far, talking with a lot of female leaders in different parts of STEM fields. And I'm excited to be joined by my next guest, who is a speaker at this year's WIDS 2018 event, Daniela Witten, the Associate Professor of Statistics and Biostatistics at the University of Washington. Daniela, thanks so much for stopping by The Cube. >> Oh, thanks so much for the invitation. >> So here we are at Stanford University. You spent quite a lot of time here. You've got three degrees from Stanford, so it's kind of like coming back home? >> Yeah, I've spent from 2001 to 2010 here. I started with a bachelor's degree in math and biology, and then I did a master's, and finally a PhD in statistics. >> And so now you're up at the University of Washington. Tell us about that. What is your focus there? >> Yeah, so my work is in statistical machine learning, with applications to large scale data coming out of biology. And so the idea is that in the last ten or 20 years, the field of biology has been totally transformed by new technologies that make it possible to measure a person's DNA sequence, or to see the activity in their brain. Really, all different types of measurements that would have been unthinkable just a few years ago. But unfortunately, we don't yet know really how to make sense of these data statistically. So there's a pretty big gap between the data that we're collecting, or rather, the data that biologists are collecting, and then the scientific conclusions that we can draw from these data. So my work focuses on trying to bridge this gap by developing statistical methods that we can use to make sense of this large scale data. >> That sounds exciting. So, WIDS, this is the third year, and they have grown this event remarkably quickly. So, we had Margot Garritsen on the program a little bit earlier, and she had shared 177 regional WIDS events going on today, this week, in 53 countries. And they're expecting to reach 100,000 people. So, for you, as a speaker, what is it that attracted you to participate in the WIDS movement, and share your topic, which we'll get to in a second, what was it that sort of attracted you to that? >> Well, first of all, it's an honor to be invited to participate in this event, which, as you mentioned, is getting live streamed and so many people are watching. But what's really special for me, of course, as a woman, is that there's so many conferences out there that I speak at, and the vast majority have a couple of female speakers, and it's not because there's a lack of talent. There are plenty of very qualified women who could be speaking at these conferences. But often, the conference organizers just don't think of women right away, or maybe add a couple women as an afterthought to their speaker lineups. And so it's really wonderful to be part of a conference where all of the speakers are women, and so we can really see the broad ways in which women are contributing to data science, both in and out of industry. >> And one of the things that Margot shared was, she had this idea with her co-founders only three years ago in 2015, and they got from concept to their first event in six months. >> Daniela: Women know how to get things done. >> We do, don't we? (laughs) But also what it showed, and even in 2015, and we still have this problem in 2018, is there's a massive demand for this. >> Yeah. >> The statistics, speaking of statistics, the numbers show very few women that are getting degrees in STEM subjects are actually working in their field. I just saw this morning, it's really cool, interactive infographic that someone shared with me on Twitter, thank you very much, that showed that 20 percent of females get degrees in engineering, but only 11 percent of them are working in engineering. And you think, "How have we gone backwards in the last 30 years?" But at least now we've got this movement, this phenomenon that is WIDS to start, even from an awareness perspective, of showing we don't have a lot of thought diversity. We have a great opportunity to increase that, and you've got a great platform in order to share your story. >> Yeah. Well, I think that you raise a good point though, as, even though the number of women majoring in STEM fields, at least in some areas of STEM has increased, the number of women making it higher up in the STEM ladder hasn't, for the most part. And one reason for this is possibly the lack of female role models. So being able to attend a conference like this, for young women who are interested in developing their career in STEM, I'm sure is really inspirational and a great opportunity. So it's wonderful for Margot and the other organizers to have put this together. >> It is. Even on the recruiting side, some of the things that still surprise me are when some, whether it's universities or companies that are going to universities to recruit for STEM roles, they're still bringing mostly men. And if there are females at the events, they're, often times they're handing out swag, they're doing more event coordination, which is great. I'm a marketer. There's a lot of females in marketing. But it still shows the need to start from a visibility standpoint and a messaging standpoint alone. They've got to flip this. >> I completely agree with that, but it also works the other way. So, often a company or an academic department might have a few women in a particular role, and those women get asked to do everything. Because they'll say, "Oh, we're going to Stanford to recruit. We need a woman there. We're having some event, and we don't want it to look totally non-diverse, so we need a woman there too." And the small number of women in STEM get asked to do a lot of things that the men don't get asked to do, and this can also be really problematic. Even though the intent is good, to clearly showcase the fact that there's diversity in STEM and in academia, the end outcome can actually be hurtful to the women involved who are being asked to do more than their fair share. So we need to find a way to balance this. >> Right. That balance is key. So what I want to kind of pivot on next is, just looking at the field of data science, it's so interesting because it's very, I like 'cause it's horizontal. We just had a guest on from Uber, and we talk to on The Cube, people in many different industries, from big tech to baseball teams and things like that. And what it really shows, though, is, there's blurred lines, or maybe even lines that have evaporated between demarcated career A, B, C, D. And data science is so pervasive that it's impacting, people that are working in it, like yourself, have the ability to impact every sector, policy changes, things like that. Do you think that that message is out there enough? That the next generation understands how much impact they can make in data science? >> I think there is a lot of excitement from young people about data science. At U-dub, we have a statistics major, and it's really grown a lot in popularity in the last few years. We have a new master's degree in data science that just was started around the same time that WIDS was started, and we had 800 applicants this year. >> Wow. >> For a single masters program. Truly incredible. But I think that there's an element of it that also maybe people don't realize. So data science, there's a technical skill set that comes with it, and people are studying undergrad in statistics, and getting master's in data science in order to get that technical skill set. But there's also a non-technical skill set that's incredibly important, because data science isn't done in a vacuum. It's done within the context of interdisciplinary teams with team members from all different areas. So, for example, in my work, I work with biologists. Your previous guest from Uber, I'm sure is working with engineers and all different areas of the company. And in order to be successful in data science, you need to really not only have technical skills, but also the ability to work as a team player and to communicate your ideas. >> Yeah, you're right. Balancing those technical skills with, what some might call soft skills, empathy, collaboration, the ability to communicate, seems to be, we talked about balance earlier, a scale-wise. Would you say they're pretty equivalent, in terms of really, that would give somebody a great foundation as a data scientist? >> I would say that having both of those skill sets would give you a good foundation, yes. The extent to which either one is needed probably depends on the details of your job. >> True. So, I want to talk a little bit more about your background. Something that caught my eye was that your work has been featured in popular media. Forbes, three times, and Elle magazine, which of course, I thought, "What? I've got to talk to you about that!" Tell me a little bit about the opportunities that you've had in Forbes and in Elle magazine to share your story and to be a mentor. >> Yeah. Well, I've just been lucky to be getting involved in the field of statistics at a time when statistics is really growing in importance and interest. So the joke is, that ten years ago, if you went to a cocktail party, and you said that you were a statistician, then nobody would want to talk to you. (Lisa laughs) And now, if you go to a cocktail party and you say you're a statistician, everyone wants to know more and find out if you know of any job openings for them. >> Lisa: That's pretty cool! >> Yeah. So it's a really great time to be doing this kind of work. And there's really an increased appreciation for the fact that it's not enough to have access to a lot of data, but we really need the technical skills to make sense of that data. >> Right. So share with us a little bit about the session that you're doing here: More Data, More Statistical Problems. Tell us a little bit about that and maybe some of the three, what are the three key takeaways that the audience was hearing from you? >> Yeah. So I think the first real takeaway is, sometimes there's a feeling that, when we have a lot of data, we don't really need a deep understanding of statistics, we just need to know how to do machine learning, or how to develop a black box predictor. And so, the first point that I wanted to make is that that's not really right. Actually, the more data you have, often the more opportunity there is for your analysis to go awry, if you don't really have the solid foundations. Another point that I wanted to make is that there's been a lot of excitement about the promise of biology. So, a lot of my work has biomedical applications, and people have been hoping for many years that the new technologies that have come out in recent years in biology, would lead to improve understanding of human health and improve treatment of disease. And, it turns out, that it hasn't, at least not yet. We've got the data, but what we don't know how to do is how to analyze it yet. And so, the real gap between the data that we have and achieving its promise is actually a statistical gap. So there's a lot of opportunity for statisticians to help bridge that gap, in order to improve human health. And finally, the last point that I want to make is that a lot of these issues are really subtle. So we can try to just swing a hammer at our data and hope to get something out of it, but often there's subtle statistical issues that we need to think about, that could very much affect our results. And keeping in mind sort of the effects of our models, and some of these subtle statistical issues is very important. >> So, in terms of your team at University of Washington, or your classes that you teach, you work with undergrads. >> Yeah, I teach undergrads and PhD students, and I work mostly with PhD students. And I've just been lucky to work with incredibly talented students. I did my PhD here at Stanford, and I had a great advisor and really wonderful mentoring from my advisor and from the other faculty in the department. And so it's really great to have the opportunity now, in turn, to mentor grad students at University of Washington. >> What are some of the things that you help them with? Is it, we talk about inspiring women to get into the field, but, as you prepare these grad students to finish their master's or PhD's, and then go out either into academia or in industry, what are some of the other elements that you think is important for them to understand in terms of learning how to be assertive, or make their points in a respectful, professional way? Is that part of what you help them understand and achieve? >> That's definitely part of it. I would say another thing that I try to teach them, so everyone who I work with, all my students, they're incredibly strong technically, because you don't get into a top PhD program in statistics or biostatistics if you're not technically very strong, so what I try to help my students do is figure out not just how to solve problems, because they can solve any problem they set their mind to, but actually how to identify the problems that are likely to be high impact. Because there's so many problems out there that you can try to solve statistically, and, of course, we should all be focusing our efforts on the ones that are likely to have a really big impact on society, or on health, or whatever it is that we're trying to influence. >> Last question for you. If you look back to your education to now, what advice would you give your younger self? >> Gosh, that's a really great question. I think that I'm happy with many of the career decisions I've made. For example, getting a PhD in statistics, I think is a great career move. But, at the same time, maybe I would tell a younger version of me to take more risks, and not be so worried about meeting every requirement on time, and instead, expanding a little bit, taking more courses in other areas, and really broadening instead of just deepening my skill set. >> We've heard that sentiment echoed a number of times today, and one of the themes that I'm hearing a lot is don't be afraid to get out of your comfort zone. And it's so hard for us when we're in it, when we're younger, 'cause you don't know that, you don't have any experience there. But it's something that I always appreciate hearing from the women who've kind of led the way for those of us and then, the next generation, is, don't be afraid to get comfortably uncomfortable and as you said, take risks. It's not a bad thing, right? Well, Daniela, thanks so much for carving out some time to visit us on The Cube, and we're happy to have given you the opportunity to reach an even bigger audience with your message, and we wish you continued success at U-dub. >> Oh, thanks so much. >> We want to thank you for watching. I'm Lisa Martin live with The Cube at WIDS 2018 from Stanford University. Stick around, I'll be back with my next guest after a short break. (energetic music)
SUMMARY :
Brought to you by Stanford. And I'm excited to be joined by my next guest, So here we are at Stanford University. Yeah, I've spent from 2001 to 2010 here. And so now you're up at the University of Washington. And so the idea is that in the last ten or 20 years, And they're expecting to reach 100,000 people. and the vast majority have a couple of female speakers, And one of the things that Margot shared was, and even in 2015, and we still have this problem in 2018, in order to share your story. in the STEM ladder hasn't, for the most part. But it still shows the need to start that the men don't get asked to do, have the ability to impact every sector, in the last few years. but also the ability to work as a team player empathy, collaboration, the ability to communicate, probably depends on the details of your job. I've got to talk to you about that!" and you say you're a statistician, that it's not enough to have access to a lot of data, and maybe some of the three, and hope to get something out of it, So, in terms of your team at University of Washington, And so it's really great to have the opportunity now, on the ones that are likely to have a really big impact what advice would you give your younger self? to take more risks, and not be so worried and we wish you continued success at U-dub. We want to thank you for watching.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Lisa Martin | PERSON | 0.99+ |
2018 | DATE | 0.99+ |
2015 | DATE | 0.99+ |
Daniela Witten | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
Daniela | PERSON | 0.99+ |
20 percent | QUANTITY | 0.99+ |
Margot Garritsen | PERSON | 0.99+ |
Margot | PERSON | 0.99+ |
2010 | DATE | 0.99+ |
2001 | DATE | 0.99+ |
Uber | ORGANIZATION | 0.99+ |
800 applicants | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
this week | DATE | 0.99+ |
three key takeaways | QUANTITY | 0.99+ |
first point | QUANTITY | 0.99+ |
100,000 people | QUANTITY | 0.99+ |
Forbes | TITLE | 0.99+ |
U-dub | ORGANIZATION | 0.99+ |
three years ago | DATE | 0.99+ |
University of Washington | ORGANIZATION | 0.99+ |
three degrees | QUANTITY | 0.98+ |
both | QUANTITY | 0.98+ |
Elle | TITLE | 0.98+ |
first event | QUANTITY | 0.98+ |
53 countries | QUANTITY | 0.98+ |
this year | DATE | 0.98+ |
11 percent | QUANTITY | 0.98+ |
Stanford University | ORGANIZATION | 0.98+ |
The Cube | ORGANIZATION | 0.97+ |
WIDS | EVENT | 0.97+ |
six months | QUANTITY | 0.97+ |
177 regional | QUANTITY | 0.97+ |
three | QUANTITY | 0.97+ |
today | DATE | 0.97+ |
WIDS 2018 | EVENT | 0.96+ |
one | QUANTITY | 0.96+ |
WiDS 2018 | EVENT | 0.95+ |
Women in Data Science Conference 2018 | EVENT | 0.95+ |
third year | QUANTITY | 0.95+ |
one reason | QUANTITY | 0.94+ |
Women in Data Science Conference | EVENT | 0.94+ |
ten years ago | DATE | 0.92+ |
Stanford | ORGANIZATION | 0.92+ |
Stanford | LOCATION | 0.89+ |
few years | DATE | 0.87+ |
this morning | DATE | 0.87+ |
first | QUANTITY | 0.86+ |
first real takeaway | QUANTITY | 0.86+ |
single masters program | QUANTITY | 0.81+ |
last 30 years | DATE | 0.81+ |
last few years | DATE | 0.8+ |
20 years | QUANTITY | 0.76+ |
three times | QUANTITY | 0.76+ |
second | QUANTITY | 0.74+ |
ORGANIZATION | 0.72+ | |
Associate Professor of Statistics | PERSON | 0.69+ |
ten | QUANTITY | 0.66+ |
last | DATE | 0.6+ |
WIDS | ORGANIZATION | 0.6+ |
couple women | QUANTITY | 0.59+ |
third annual | EVENT | 0.57+ |
couple | QUANTITY | 0.52+ |
conferences | QUANTITY | 0.49+ |
female | QUANTITY | 0.43+ |
Biostatistics | PERSON | 0.38+ |
Cube | TITLE | 0.3+ |
Jennifer Prendki, Atlassian | WiDS 2018
>> Narrator: Live from Stanford University in Palo Alto California, it's theCUBE, covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Back to the cube, our continuing coverage of Women in Data Science 2018 continues. I am Lisa Martin, live from Stanford University. We have had a great array of guests this morning, from speakers, panelists, as well as attendees. This is an incredible one day technical event, and we're very excited to be joined by one of the panelists on the career panel this afternoon, Dr. Jennifer Prendki, the Head of Data Science at Atlassian. Welcome to theCUBE. >> Hi, it's my pleasure to be here. >> It's exciting to have you here. >> So you lead all search and machine learning initiatives at Atlassian, but you were telling me something interesting about your team, tell us about that. >> The interesting thing about my team is even though I'm the Head of Data Science, my team is not 100% data scientists. The belief of the company is that we really wanted to be in charge of our own destiny and be able to deploy our models ourselves and not be depending on other people to make deployment faster. >> Was that one of the interesting kind of culture elements that attracted you last year to Atlassian? >> What is really interesting about Atlassian, it's definitely a company that create products that I would say virtually every single software company in the world is using. They have a very strong software engineering culture, and so last year they decided to embrace data science. I thought it was a very interesting challenge for me to try and infuse a little bit of my passion for data and data-driven est to the company. >> You had quite a fast ramp at Atlassian. You joined last summer, and in less than six months, you grew your team of data scientists and engineers from three people to fifteen, and it gets better, in less than six months, across three locations, Mountain View, San Francisco, and Sydney. What were some of the key things for you that led you to make that impact so quickly? >> I think most data scientists on the world are interested in making an impact, and this is a company that obviously does a lot of impact, and a lot of people talk about this company, and there is obviously a lot of interesting data, and so I think one of the amazing things is that we have a very important role to play, because we are in a position where we have data related to the way people work with each other, collaborate with each other, and this is a very unique data set, so it's usually pretty easy to attract people to Atlassian. >> You mentioned collaboration, and that's certainly an undertone here at WiDS. In its third year, you were here last year as an attendee, now you're here this year as a speaker. They've grown this event dramatically in a couple of years alone. The opportunity to reach, they're expecting, a hundred thousand, to engage. It's a hundred and seventy-seven regional events, Margot Gerritsen gave us that number about an hour ago, in fifty-three countries. What is it about WiDS that attracted you, not only back, this year, but to welcome the opportunity to be on this career panel? >> I'll actually tell you something, so, we talk about diversity, and I think people usually think of diversity as meeting some kind of racial bar, to have, equality between male and female, or specific minorities. I think people tend to forget that the real diversity is diversity of thought, and so I actually found out that the very data science job I actually got, I was actually the only person who had a background in applied math, and everybody else was coming from a background in computer science. I quickly realized that I'm the only person who is really trained to push for, let's validate our models really properly, etc., and so that made realize how important that is to have a lot of diversity. I think WiDS is definitely a place where you see lots of women interested in the same thing, but coming from different perspective, different horizons, at different levels, and this is really something unique in the industry. >> Diversity of thought, I love that. I've not heard that before, I'm going to use that, but I'll give you credit for it. That is one of the things that is so, the more people we speak to, not just at WiDS, but at events like this on theCUBE, you hear, there's still such a need, obviously, the scale of which that WiDS has grown, shows clear demand for, we need more awareness that this diversity is missing, but in the fact that data science is so horizontal, across every industry, and it sort of is blurring the boundaries between rigid job roles, doctor, lawyer, attorney, teacher, whatever. This is quite pervasive and it provides the opportunity for data scientists globally to be able to make massive impact, but also, it still, as Margot Gerritsen was sharing earlier, it still requires what you said is that diversity in thought because having a particular small set of perspectives evaluating data, you think about it from an enterprise perspective, the types of companies that Atlassian deals with, and they are looking to grow and expand and launch new business models, but if the thought diversity is narrow, there's probably a lot of opportunity that is never going to be discovered. One of the things also I found interesting in your background, was that you found yourself sort of at this interesting juxtaposition of being a mentor, and going, wait a minute, this now gives you a great opportunity, but it also comes with some overhead. You've got it from a management perspective. What is that sort of crossroads that you've found yourself reaching and what have you done with that? >> I think it's true of probably every single technical role, but maybe data science more than others, you have to be technical to be part of the story. I think people need to have a leader that they can relate to and I think it's very important that you're still part of this. It's particularly interesting for data science, because data science is a field that moves so quickly. Usually you have people moving on to data science manager positions after being in IC and so if you don't make a conscious effort to remain that technical point of contact person, that people trust and people go to, then, when I think back of the technologies that were trendy when I was still in IC compared to now, it's really important for the managers to be still aware of that, to do a good job as a mentor and as a leader. >> You also said something I think before we went live, that is an important element for the women that WiDS is aiming to inspire and educate, today. Those that are new to the field or thinking about it, as well as those who've been it for a while. There is not just getting there, and going yes I'm interested, this is my passion, I want to have a career in this, it's also having to learn how to be a female leader, and you mentioned from a management perspective, you got to learn, you have to know how to be assertive. Tell us a little bit about the trials and tribulations that you have encountered in that respect. >> That's a very interesting question, because I'm actually very happy to see that nowadays, it's becoming easier and easier for women to step into individual contributor positions, because I think that people realize now that a woman can do just as good a job as men for a defined position, but when you're actually in a leadership position, you have to step into like a thought leadership role. Basically, you sometimes have to be in a meeting where you only have all the male engineers or male data scientists over there and say, you know what, I disagree with you, right? This as a woman becomes a little bit challenging because following the processes that are already in place, I believe that people have realized that it's okay for a woman to do that, but then being the assertive person that goes against the flow and says you are not thinking about it the right way, might sometimes be a problem, because women are not being perceived as creatures that are naturally assertive. It's typical for people, like a Head of Data Science, female data scientists, to be in a situation where they are perceived as being maybe a little bit aggressive or a little bit pushy, and you sometimes fall into this old saying, "he's the boss, she's bossy," kind of thing, and that is a challenge. >> I had someone once tell me a couple years ago, and I'm in tech as well, that I was pushy, and I think this was a language barrier thing, I think he meant to say persistent, but on that front, tell me a little bit more about your team of data scientists and engineers, and the females on your team, how do you help coach them to embrace, it's okay to speak your mind? What's that been like for you? >> I would say I was actually pretty soft-spoken myself. At some point I realized that public speaking actually helped me out there. Somebody at some point told me like, you should go, you're a brilliant, technical like go speak at a conference, and then I realized people are listening to me. You always have a little bit of like imposter syndrome kind of problem as a woman, so it helped me overcome this. Now I'm kind of trained to stimulate the ladies on my group to do the same thing, because that has worked really well for me I think. You have to get outside your comfort zone, and try to, things that help you have the self-confidence for you to get to the level of assertiveness you need to become successful. >> Exactly right, we've had a number of women on the show, today alone, talk about getting outside of your comfort zone, and one of my mentors always says, get comfortably uncomfortable. That's not an easy thing to achieve, but I think you walk in the door at WiDS, and you instantly feel inspired, and empowered. I think a number of the women that we've had on today, already, have talked about having, sort of being charged as a mentor with the responsibility like you just said, of helping those that are following your footsteps, to maybe understand how to have that confidence, and then have that right balance, so that there's professionalism there, there's respect, but it's not just about getting them into the field. It's about teaching them how to, once you're there, how to navigate a career path that is successful. >> That's an interesting thought, because I actually believe that getting comfortable with the uncomfortable is definitely something that data science is about, because you have new technologies, you have new models, you have lateral moves, like I actually was in the advertising industry as a data scientist, before switching to e-commerce and then eventually to the software industry, so I think that people who are trained to be data scientists are like that, and they should also be comfortable with the uncomfortable in their daily lives. >> Yeah, so you were mentioning before we went on that some of the people that you work with are like, it's my hope and dream to be at WiDS next year. What are some of the things that you've heard as we're at the halfway mark of WiDS today, that you're going to go back and share with your team, as well as maybe your friends, other females that are working in STEM fields as well? >> I would say, last year I was here just listening to all the people and whatever. This year, I'm on the panel, so I mean, I'm just like, nothing is impossible, I think. We've proven that over and over again in data science, I mean, who would have thought that ten years ago, we would be at the level of understanding of artificial intelligence and the entire field, right? It's all about waiting and seeing what the future has to bring to you, and we have all these amazing women today, to actually show us that, it's possible to get there, and it's exciting to be here. >> It is possible, and it's exciting. Well, Jennifer, thanks so much for carving out some of your time today to speak with us. We wish you continued success at Atlassian and we look forward to seeing you back at WiDS next year. >> Thank you. >> We want to thank you for watching theCUBE, we're live at Stanford University at the third annual Women in Data Science Conference, hashtag WiDS2018, join the conversation. I'll be right back with my next guest after a short break. (upbeat music)
SUMMARY :
Brought to you by Stanford. of the panelists on the career panel this afternoon, at Atlassian, but you were telling me something interesting in charge of our own destiny and be able to deploy for data and data-driven est to the company. you grew your team of data scientists and engineers and a lot of people talk about this company, What is it about WiDS that attracted you, not only back, I think people tend to forget that the real diversity a lot of opportunity that is never going to be discovered. it's really important for the managers to be still Those that are new to the field or thinking about it, that goes against the flow and says you are not thinking and try to, things that help you have the but I think you walk in the door at WiDS, because you have new technologies, you have new models, that some of the people that you work with to all the people and whatever. and we look forward to seeing you back at WiDS next year. We want to thank you for watching theCUBE,
SENTIMENT ANALYSIS :
ENTITIES
Margot Gerritsen, Stanford University | WiDS 2018
>> Narrator: Alumni. (upbeat music) >> Announcer: Live from Stanford University in Palo Alto, California, it's theCUBE. Covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Welcome back to theCUBE, we are live at Stanford University for the third annual Women in Data Science Conference, WiDS. I'm Lisa Martin, very honored to be joined by one of the co-founders of this incredible WiDS movement and phenomenon, Dr. Margot Gerritsen. Welcome to theCUBE! >> It's great to be here, thanks so much for being at our conference. >> Oh, likewise. You were the senior associate dean and director of the Institute for Computational Mathematics and Engineering at Stanford. >> Gerritsen: That's right, yep. >> Wow, that's a mouthful and I'm glad I could actually pronounce that. So you have been, well, I would love to give our audience a sense of the history of WiDS, which is very short. You've been on this incredible growth and scale trajectory. But you've been in this field of computational science for what, 30, over 30 years? >> Yeah, probably since I was 16, so that was 35 years ago. >> Yeah, and you were used to being one of few, or if not the only woman >> That's right. >> In a meeting, in a room. You were okay with that but you realized, you know what? There are probably women who are not comfortable with this and it's probably going to be a barrier. Tell us about the conception of WiDS that you and your co-founders had. >> So, May, 2015, Esteban from Walmart Labs, now at Facebook, and Karen Matthys, who's still very active, you know, one of the organizers of the conference, and I were having coffee at a cafe in Stanford and we were lamenting the fact that at another data science conference that we had been to had only had male speakers. And so we connected with the organizers and asked them why? Did you notice? Because very often people are not even aware, it's just such the norm to only have male speakers, >> Right, right. >> That people don't even notice. And so we asked why is that? And they said, "Well, you know we really tried to find "speakers but we couldn't find any." And that really was, for me, the last straw. I've been in so many of these situations and I thought, you know, we're going to show them. So we joke sometimes, a little bit, we say it's sort of a revenge conference. (laughs) We said, let's show them we can get some really outstanding women, and in fact only women. And that's how it started. Now we were sitting at this coffee shop and I said, "Let's do a conference." And they said, "Well, that would be great, next year." And I said, "No, this year. "Let's just do it. "Let's do it in November." We had six months to put it together. It was just a local conference here. We got outstanding speakers, which were really great. Mostly from the area. And then we started live-streaming because we thought it would be fun to do. And to our big surprise, we had 6,000 people on the livestream just without really advertising. That made us realize, in November 2015, my goodness, we're onto something. And we had such amazing responses. We wanted to then scale up the conference and then you can hire a fantastic conference center in San Francisco and get 10,000 people in like they do, for example, at Grace Hopper. But we thought, why not use online technology and scale it up virtually and make this a global event using the livestream, that we will then provide to people, and asking for regional events, local events to be set up all around the world. And we created this ambassador program, that is now in its second year. the first year the responses were actually overwhelming to us already then. We got 75 ambassadors who set up 75 events around the world >> In about 40 countries. >> This was last year, 2017? >> Yeah, almost exactly 13 months ago, and then this year now we have over 200 ambassadors. We have 177 events in 155 cities in 53 countries. >> That's incredible. >> So we're on every continent apart from Antarctica but we're working on that one. >> Martin: I was going to say, that's probably next year. >> Yeah, that's right. >> The scale, though, that you've achieved in such a short time period, I think, not only speaks to the power, like you said, of using technology and using live-streaming, but also, there is a massive demand. >> Gerritsen: There is a great need, yeah. >> For not only supporting, like from the perspective of the conference, you want to support and inspire and educate data scientists worldwide and support females in the field, but it really, I think, underscores, there is still in 2018, a massive need to start raising more profiles and not just inspiring undergrad females, but also reinvigorating those of us that have been in the STEM field and technology for a while. >> Gerritsen: That's right. >> So, what are some of the things, so, this year, not only are you reaching, hopefully about 100,000 people, you mentioned some of the countries involved today, but you also have a new first this year with the WiDS Datathon. >> That's right. >> Tell us about the WiDS Datathon, what was the idea behind it? You announced some winners today? >> Yeah. Yeah, so with WiDS last year, we really felt that we hit a nerve. Now there is an incredible need for women to see other women perform so well in this field. And, you know, that's why we do it, to inspire. But it's a one-time event, it's once a year. And we started to think about, what are some of the ways that we can make this movement, because it's really become a movement, into something more than just an annual, once-a-year conference? And so, Datathon is a fantastic way to do that. You can engage people for several months before the conference, and you can announce the winner at the conference. It is something that can be done really easily worldwide if it is supported again by the ambassadors, so the local WiDS organizations. So we thought we'd just try. But again, it's one of those things we say, "Oh, let's do it." We, I think, thought about this about six months ago. Finding a good data set is always a challenge but we found a wonderful data set, and we had a great response with 1100, almost 1200 people in the world participating. >> That's incredible. >> Several hundred teams. Yeah, and what we said at the time was, well, let's have the teams be 50% female at least, so that was the requirement, we have a lot of mixed teams. And ultimately, of course, that's what we want. We want 50-50, men-women, have them both at the table, to participate in data science activities, to do data science research, and answer a lot of these data questions that are now driving so many decisions. Now we want everybody around the table. So with this Datathon, it was just a very small event in the sense, and I'm sure next year it will be bigger, but it was a great success now. >> Well, congratulations on that. One of the things I saw you on a Youtube video talking about over the weekend when I was doing some prep was that you wanted this Datathon to be fun, creative, and I think those are two incredibly important ways to describe careers, not just in STEM but in data science, that yes, this can be fun. >> Yep. >> Should be if you're spending so much time every day, right, doing something for a living. But I love the creativity descriptor. Tell us a little bit about the room for interpretation and creativity to start removing some of the bias that is clearly there in data interpretation? >> Oh. (laughs) You're hitting the biggest sore point in data science. And you could even turn it around, you say, because of creativity, we have a problem too. Because you can be very creative in how you interpret the data, and unfortunately, for most of us, whenever we look at news, whenever we look at data or other information given to us, we never see this through an objective lens. We always see this through our own filters. And that, of course, when you're doing data analysis is risky, and it's tricky. 'cause you're often not even aware that you're doing it. So that's one thing, you have this bias coming in just as a data scientist and engineer. Even though we always say we do objective work and we're building neutral software programs, we're not. We're not. Everything that we do in machine learning, data mining, we're looking for patterns that we think may be in the data because we have to program this data. And then even looking at some of the results, the way we visualize them, present them, can really introduce bias as well. And then we don't control the perception of people of this data. So we can present it the way we think is fair, but other people can interpret or use little bits of that data in other ways. So it's an incredibly difficult problem and the more we use data to address and answer critical challenges, the more data is influencing decisions made by politicians, made in industry, made by government, the more important it is that we are at least aware. One of the really interesting things this conference, is that many of the speakers are talking to that. We just had Latanya Sweeney give an outstanding keynote really about this, raising this awareness. We had Daniela Witten saying this, and various other speakers. And in the first year that we had this conference, you would not have heard this. >> Martin: Really? Only two years ago? >> Yeah. So even two years ago, some people were bringing it up, but now it is right at the forefront of almost everybody's thinking. Data ethics, the issue of reproducibility, confirmations bias, now at least people now are aware. And I'm always a great optimist, thinking if people are aware, and they see the need to really work on this, something will happen. But it is incredibly important for the new data scientists that come into the field to really have this awareness, and to have the skill sets to actually work with that. So as a data scientist, one of the reasons why I think it's so fun, you're not just a mathematician or statistician or computer scientist, you are somebody who needs to look at things taking into account ethics, and fairness. You need to understand human behavior. You need to understand the social sciences. And we're seeing that awareness now grow. The new generation of data scientists is picking that up now much more. Educational programs like ours too have embedded these sort of aspects into the education and I think there is a lot of hope for the future. But we're just starting. >> Right. But you hit the nail on the head. You've got to start with that awareness. And it sounds like, another thing that you just described is we often hear, the top skills that a data scientist needs to have is statistical analysis, data mining. But there's also now some of these other skills you just mentioned, maybe more on the softer side, that seem to be, from what we hear on theCUBE, as important, >> Gerritsen: That's right. >> As really that technical training. To be more well-rounded and to also, as you mentioned earlier, to have to the chance to influence every single sector, every single industry, in our world today. >> And it's a pity that they're called softer skills. (laughs) >> It is. >> Because they're very very hard skills to really master. >> A lot of them are probably you're born with it, right? It's innate, certain things that you can't necessarily teach? >> Well, I don't believe that you cannot do this without innate ability. Of course if you have this innate ability it helps a little, but there's a growth mindset of course, in this, and everybody can be taught. And that's what we try to do. Now, it may take a little bit of time, but you have to confront this and you have to give the people the skills and really integrate this in your education, integrate this at companies. Company culture plays a big role. >> Absolutely. >> This is one of the reasons why we want way more diversity in these companies, right. It's not just to have people in decision-making teams that are more diverse, but the whole culture of the company needs to change so that these sort of skills, communication, empathy, big one, communication skills, presentation skills, visualization skills, negotiation skills, that they really are developed everywhere, in the companies, at the universities. >> Absolutely. We speak with some companies, and some today, even, on theCUBE, where they really talk about how they're shifting, and SAP is one of them, their corporate culture to say we've got a goal by 2020 to have 30% of our workforce be female. You've got some great partners, you mentioned Walmart Labs, how challenging was it to go to some of these companies here in Silicon Valley and beyond and say, hey we have this idea for a conference, we want to do this in six months so strap on your seatbelts, what were those conversations like to get some of those partners onboard? >> We wouldn't have been able to do it in six months if the response had not been fantastic right from the get-go. I think we started the conference just at the right time. There was a lot of talk about diversity. Several of the companies were starting really big diversity initiatives. Intel is one of them, SAP is another one of them. We were connected with these companies. Walmart Labs, for example, one of the founders of the company was from Walmart Labs. And so when we said, look, we want to put this together, they said great. This is a fantastic venue for us also. You see this with some of these companies, they don't just come and give us money for this conference. They build their own WiDS events around the world. Like SAP built 30 WiDS events around the world. So they're very active everywhere. They see the need, of course, too. They do this because they really believe that a changed culture is for the best of everybody. But they also believe that because they need the women. There is a great shortage of really excellent data scientists right now, so why not look at 50% of your population? >> Martin: Exactly. >> You know, there's fantastic talent in that pool and they want to track that also. So I think that within the companies, there is more awareness, there is an economic need to do so, a real need, if they want to grow, they need those people. There is an awareness that for their future, the long term benefit of the company, they need this diversity in opinions, they need the diversity in the questions that are being asked, and the way that the companies look at the data. And so, I think we're at a golden age for that now. Now am I a little bit frustrated that it's 2018 and we're doing this? Yes. When I was a student 30 some years ago, I was one of the very few women, and I thought, by the time I'm old, and now I'm old, you know, as far as my 18-year-old self, right, I mean in your 50s, you're old. I thought everything would be better. And we certainly would be at critical mass, which is 30% or higher, and it's actually gone down since the 80s, in computer science and in data science and statistics, so it is really very frustrating in that sense that we're really starting again from quite a low level. >> Right. Right. >> But I see much more enthusiasm and now the difference is the economical need. So this is going to be driven by business sense as well as any other sense. >> Well I think you definitely, with WiDS, you are beyond onto something with what you've achieved in such a short time period. So I can only imagine, WiDS 2018 reaching up to 100,000 people over these events, what do you do next year? Where do you go from here? (laughs) >> Well, it's becoming a little bit of a challenge actually to organize and help and support all of these international events, so we're going to be thinking about how to organize ourselves, maybe on every continent. >> Getting to Antarctica in 2019? >> Yeah, but have a little bit more of a local or regional organization, so that's one thing. The main thing that we'd like to do is have even more events during the year. There are some specific needs that we cannot address right now. One need, for example, is for high school students. We have two high school students here today, which is wonderful, and quite a few of them are looking at the live-stream of the conference. But if you want to really reach out to high school students and tell them about this and the sort of skill sets that they should be thinking about developing when they are at university, you have to really do a special event. The same with undergraduate students, graduate students. So there are some markets there, some subgroups of people that we would really like to tailor to. The other thing is a lot of people are very very eager to self-educate, and so what we are going to be putting together, at least that's the plan now, we'll see, if we can make this, is educational tools, and really have a repository of educational tools that people can use to educate themselves and to learn more. We're going to start a podcast series of women, which will be very, very interesting. We'll start this next month, and so every week or every two weeks we'll have a new podcast out there. And then we'll keep the momentum going. But really the idea is to not provide just this one day of inspiration, but to provide throughout the year, >> Sustained inspiration. >> Sustained inspiration and resources. >> Wow, well, congratulations, Margot, to you and your co-founders. This is a movement, and we are very excited for the opportunity to have you on theCUBE as well as some of the speakers and the attendeees from the event today. And we look forward to seeing all the great things that I think are going to come for sure, the rest of this year and beyond. So thank you for giving us some of your time. >> Thank you so much, we're a big fan of theCUBE. >> Oh, we're lucky, thank you, thank you. We want to thank you for watching theCUBE. I'm Lisa Martin, we are live at the third annual Women in Data Science Conference coming to you from Stanford University, #WiDS2018, join the conversation. I'll be back with my next guest after a short break. (upbeat music)
SUMMARY :
(upbeat music) Brought to you by Stanford. Welcome back to theCUBE, we are live It's great to be here, thanks so much and director of the Institute for Computational a sense of the history of WiDS, which is very short. and it's probably going to be a barrier. And so we connected with the organizers and asked them why? And to our big surprise, we had 6,000 people now we have over 200 ambassadors. So we're on every continent apart from Antarctica not only speaks to the power, like you said, that have been in the STEM field and technology for a while. so, this year, not only are you reaching, before the conference, and you can announce so that was the requirement, we have a lot of mixed teams. One of the things I saw you on a Youtube video talking about and creativity to start removing some of the bias is that many of the speakers are talking to that. that come into the field to really have this awareness, that seem to be, from what we hear on theCUBE, as you mentioned earlier, to have to the chance to influence And it's a pity that they're called softer skills. and you have to give the people the skills that are more diverse, but the whole culture of the company You've got some great partners, you mentioned Walmart Labs, of the company was from Walmart Labs. by the time I'm old, and now I'm old, you know, Right. and now the difference is the economical need. what do you do next year? how to organize ourselves, maybe on every continent. But really the idea is to not provide for the opportunity to have you on theCUBE coming to you from Stanford University,
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Daniela Witten | PERSON | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Latanya Sweeney | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Esteban | PERSON | 0.99+ |
Martin | PERSON | 0.99+ |
Gerritsen | PERSON | 0.99+ |
2018 | DATE | 0.99+ |
November 2015 | DATE | 0.99+ |
Walmart Labs | ORGANIZATION | 0.99+ |
Karen Matthys | PERSON | 0.99+ |
30% | QUANTITY | 0.99+ |
May, 2015 | DATE | 0.99+ |
Institute for Computational Mathematics and Engineering | ORGANIZATION | 0.99+ |
75 ambassadors | QUANTITY | 0.99+ |
Silicon Valley | LOCATION | 0.99+ |
50% | QUANTITY | 0.99+ |
75 events | QUANTITY | 0.99+ |
San Francisco | LOCATION | 0.99+ |
six months | QUANTITY | 0.99+ |
Antarctica | LOCATION | 0.99+ |
November | DATE | 0.99+ |
155 cities | QUANTITY | 0.99+ |
1100 | QUANTITY | 0.99+ |
18-year | QUANTITY | 0.99+ |
SAP | ORGANIZATION | 0.99+ |
Margot | PERSON | 0.99+ |
last year | DATE | 0.99+ |
53 countries | QUANTITY | 0.99+ |
next year | DATE | 0.99+ |
2019 | DATE | 0.99+ |
Stanford | LOCATION | 0.99+ |
2020 | DATE | 0.99+ |
10,000 people | QUANTITY | 0.99+ |
two | QUANTITY | 0.99+ |
177 events | QUANTITY | 0.99+ |
30 | QUANTITY | 0.99+ |
Intel | ORGANIZATION | 0.99+ |
one | QUANTITY | 0.99+ |
one-time | QUANTITY | 0.99+ |
6,000 people | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
WiDS Datathon | EVENT | 0.99+ |
this year | DATE | 0.99+ |
over 200 ambassadors | QUANTITY | 0.99+ |
WiDS | EVENT | 0.99+ |
#WiDS2018 | EVENT | 0.99+ |
second year | QUANTITY | 0.99+ |
ORGANIZATION | 0.98+ | |
One | QUANTITY | 0.98+ |
Stanford University | ORGANIZATION | 0.98+ |
Stanford | ORGANIZATION | 0.98+ |
one day | QUANTITY | 0.98+ |
today | DATE | 0.98+ |
Youtube | ORGANIZATION | 0.98+ |
once a year | QUANTITY | 0.97+ |
next month | DATE | 0.97+ |
two years ago | DATE | 0.97+ |
50-50 | QUANTITY | 0.97+ |
13 months ago | DATE | 0.97+ |
50s | QUANTITY | 0.97+ |
16 | QUANTITY | 0.97+ |
both | QUANTITY | 0.97+ |
80s | DATE | 0.97+ |
WiDS 2018 | EVENT | 0.96+ |
Mala Anand, SAP | WiDS 2018
>> Narrator: Live from Stanford University in Palo Alto, California. It's theCUBE covering Women in Data Science Conference 2018. Brought to you by Stanford. >> Welcome back to theCUBE. Our continuing coverage live at the Women in Data Science Conference 2018, #WiDS2018. I'm Lisa Martin and I'm very excited to not only be at the event, but to now be joined by one of the speakers who spoke this morning. Mala Anand, the executive vice president at SAP and the president of SAP Leonardo Data Analytics, Mala Anand, Mala, welcome to theCUBE. >> Thank you Lisa, I'm delighted to be here. >> So this is your first WiDS and we were talking off camera about this is the third WiDS and 100,000 people they're expecting to reach today. As a speaker, how does that feel knowing that this is being live streamed and on their Facebook Live page and you have the chance to reach that many people? >> It's really exciting, Lisa and you know, it's inspiring to see that we've been able to attract so many participants. It's such an important topic for us. More and more I think two elements of the topic, one is the impact that data science is going to have in our industry as well as the impact that we want more women to participate with the right passion and being able to be successful in this field. >> I love that you said passion. I think that's so key and that's certainly one of the things, I think as my second year hosting theCUBE at WiDS, you feel it when you walk in the door. You feel it when you're reading the #WiDS2018 Twitter feed. It's the passion is here, the excitement is here. 150 plus regional WiDS events going on today in over 50 countries so the reach can be massive. What were maybe the top three takeaways from your talk this morning that the participants got to learn? >> Absolutely, and what's really exciting to see is that we see from a business perspective that customers are seeing the potential to drive higher productivity and faster growth in this whole new notion of digital technologies and the ability now for these new forms of systems of intelligence where we embed machine learning, big data, analytics, IoT, into the core of the business processes and it allows us to reap unprecedented value from data. It allows us to create new business models and it also allows us to reimagine experiences. But all of this is only possible now with the ability to apply data science across industries in a very deep and domain expertise way, and so that's really exciting and, moreover, to see diversity in the participants. Diversity in the people that can impact this is very exciting. >> I agree. You talked about digital business. Digital transformation opens up so many new business model opportunities for companies but the application of advanced analytics, for example, alone opens up so many more career opportunities because every sector is affected by big data. Whether we know it or not, right? And so the opportunity for those careers is exploding. But another thing that I think is also ripe for conversation is bringing in diverse perspectives to analyze and interpret that data. >> Absolutely. >> To remove some of the bias so that more of those business models and opportunities can really bubble up. >> Absolutely. >> Lisa: Tell me about your team at SAP Leonardo and from a diversity perspective, what's going on there? >> Yeah, absolutely. So I think your point is really valid which is, the importance of bringing in diversity and also the importance of diversity both from a gender perspective and a diversity in skills. And I think the key element of data and decision science is now it opens up different types of skills, right? It opens up the skills of course, the technology skills are fundamental. The ability to read data modeling is fundamental, but then we add in the deep domain expertise. The add in the business perspectives. The ability to story tell and that's where I see the ability to story tell with the right domain expertise opens up such a massive opportunity for different kinds of participants in this field and so within SAP itself, we are very driven by driving diversity. SAP had set a very aggressive goal for by 2017 to be at 25% of women in leadership positions and we achieved that. We've got an aggressive goal to be at 30% of women in leadership positions by 2020 and we're really excited to achieve that as well and very important as well both within Leonardo and data analytics as well, by diversity is fundamental to our growth and more importantly to the growth for the industry. I think that's going to be fundamental. >> I think that's a really important point, the growth of the industry. SAP does a lot with WiDS. We had Ann Rosenberg on last year. I saw her walking around. So from a cultural stand point, what you've described, there's really a dedicated focus there and I think it's a unique opportunity that SAP doesn't have. They're taking advantage of it to really show how a massive corporation, a huge enterprise, can really be very dedicated to bringing in this diversity. It helps the business, but it also, to your point, can make a big impact on industry. >> Absolutely, you know, culture is such a critical part of being succeeding in the business, and I think culture is an important lever that can help differentiate companies in the market. So of course it's technology, it's value creation for our customers, and I think culture is such an important part of it, and when you unpeel the lever of culture, within there comes diversity, and within there comes bringing a different diversity of skills base as well that is going to be really critical in the next generation of businesses that will get created. >> I like that. Especially sitting in Silicon Valley where there's new businesses being created every, probably 30 seconds. I'd love to understand, if we kind of take a walk back through your career and how you got to where you are now. What were some of the things that inspired you along the way, mentors? What were some of the things that you found really impactful and crucial to you being as successful as you are and a speaker at an event like WiDS? >> Oh, absolutely. It's really exciting to see that from my own personal journey, I think that one of the things that was really important is passion. And ensuring that you find those areas that you're passionate about. I was always very passionate about software and being able to look at data and analyze data. From doing my undergraduate in Computer Science, as well as my graduate work in Computer Science from Brown, and from there on out, always looking at any of the opportunities whether it was an individual contributor that I did. It's important to be passionate and I felt that that was really my guiding post to really being able to move up from a career perspective, and also looking to be in an environment, in an ecosystem, of people and environments that you're always learning from, right? And always never being afraid to reach a little bit further than your capabilities. I think ensuring that you always have confidence in the ability that you can reach, and even though the goals might feel a little bit far away at the moment. So I think also being around a really solid team of mentors and being able to constantly learn. So I would say a constant, continuous learning, and passion is really the key to success. >> I couldn't agree more. I think it's that we often, the word expert is thrown around so often and in so many things, and there certainly are people that have garnered a lot of expertise in certain areas, but I always think, "Are you really ever an expert?" There's so much to learn everyday, there's so many opportunities. But another thing that you mentioned that reminded me of, we had Maria Klawe on a little bit earlier today and one of the things that she said in her welcome address was, in terms of inspiration, "Don't worry if there's something "that you think you're not good at." >> Mala: Absolutely. >> It's sort of getting out of your comfort zone and one of my mentors likes to say, "getting comfortably uncomfortable." That's not an easy thing to achieve. So I think having people around, people like yourself, you're now a mentor to potentially 100,000 people today, alone. What are some of the steps that you recommend of, how does someone go, "I really like this, "but I don't know if I can do it." How would you help someone get comfortably uncomfortable? >> Yeah, I think first of all, building a small group I would say, of stakeholders that are behind you and your success is going to be really important. I think also being confident about your abilities. Confidence comes in failing a few times. It's okay to miss a few goals, it's okay to fail, but then you leap forward even faster. >> Failure is not a bad F word, right? >> Mala: Absolutely. >> It really can be, and I think, a lot of leaders, like yourself will say that it's actually part of the process. >> It's very much part of the process. And so I think, number one thing is passion. First you've got to be really clear that this is exactly what you're passionate about. Second is building a team around you that you can count on, you can rely on, that are invested in your success. And then thirdly is also just to ensure that you are confident. Being confident about asking for more. Being confident about being able to reach close to the impossible is okay. >> It is okay, and it should be encouraged, every day. No matter what gender, what ethnicity, that should just sort of be one of those level playing fields, I think. Unfortunately, it probably won't be but events like WiDS, and the reach that it's making today alone, certainly, I think, offer a great foundation to start helping break some of the molds that even as we sit in Silicon Valley, are still there. There's still massive discrepancies in pay grades. There's still a big percentage of females with engineering degrees that are not working in the field. And I think the more people like yourself, and some of your other colleagues that are here participating at WiDS alone today, have the opportunity to reach a broader audience, share their stories. Their failures, the successes, and all the things that have shaped that path, the bigger the opportunity we have and it's, I think, almost, sort of a responsibility for those of us who've been in STEM for a while, to help the next generation understand nobody got here with a silver spoon. Eh, some. >> Absolutely. >> But on a straight path. It's always that zig zaggy sort of path, and embrace it! >> Yeah, I think that's key, right? And the one point here is very relevant that you mentioned as well is, that it's very important for us to recognize that a love for an environment where you can embrace the change, right? In order to embrace change, it's not just people that are going through it, but people that are supporting it and sponsoring it because it's a big change. It's a change from what was an environment a few years ago to what is going to be an environment of the future, which is an environment full of diversity. So I think being able to be ambassadors of the change is really important. As well as to allow for confidence building in this environment, right? I think that's going to be really critical as well. And for us to support those environments and build awareness. Build awareness of what is possible. I think many times people will go through their careers without being aware of what is possible. Things that were certain thresholds, certain limits, certain guidelines, two years ago are dramatically different today. >> Oh yes. >> So having those ambassadors of change that can help us build awareness, with our growing community, I think is going to be really important. >> I think, some of the things too, that you're speaking to, there are boundaries that are evaporating. We're seeing them become perforated and sort of disappear, as well as maybe some of these structured careers. There's a career as this, as that. They used to be pretty demarcated. Doctor, lawyer, architect, accountant, whatnot. And now it's almost infinite. Especially having a foundation in technology with data science and the real world social implications alone, that a career in this field can deliver just kind of shows the sky's the limit. >> Yeah, absolutely. The sky's truly the limit, and I think that's where you're absolutely right. The lines are blurring between certain areas, and at the same time, I think, this opens up huge opportunity for diversity in skill set and diversity in domain. I think equally important is to ensure to be successful you want to start by driving focus, as well, right? So, how do you draw that balance? And for us to be able to mentor and guide the younger generation, to drive that focus. At the same time take leverage the opportunities open is going to be critical. >> So getting back to SAP Leondardo. What's next in this year, we're in March of 2018. What are some of the things that are exciting you that your team is going to be working on and delivering for SAP and your customers this year? >> SAP Leondardo is really exciting because it essentially allows for our customers to drive faster innovation with less risk. And it allows our customers to create these digital businesses where you have to change a business process and a business model that no single technology can deliver. So as a result we bring together machine learning, big data analytics, IoT, all running on a solid cloud platform with in-memory databases like Kana, at scale. So this year is going to be all about how we bring these capabilities together very specifically by industry and reimagine processes across different industries. >> I like that, reimagine. I think that's one of the things that you're helping to do for females in data science and computer sciences. Reimagine the possibilities. Not just the younger generation, but also those who've been in the field for a while that I think will probably be quite inspired and reinvigorated by some of the things that you're sharing. So, Mala, thank you so much for taking the time to stop by theCUBE and share your insights with us. We wish you continued success in your career and we look forward to seeing you WiDS next year. >> Thank you so much, Lisa. I'm delighted to be here. >> Excellent. >> Thank you. >> My pleasure. We want to thank you. You are watching theCUBE live from WiDS 2018, at Stanford University. I'm Lisa Martin. Stick around, my next guest will be joining me after this short break.
SUMMARY :
Brought to you by Stanford. be at the event, but to now be joined and 100,000 people they're expecting to reach today. and being able to be successful in this field. that the participants got to learn? and the ability now for these new forms And so the opportunity for those careers is exploding. To remove some of the bias so that more I think that's going to be fundamental. to your point, can make a big impact on industry. that can help differentiate companies in the market. to you being as successful as you are and passion is really the key to success. and one of the things that she said and one of my mentors likes to say, It's okay to miss a few goals, it's okay to fail, a lot of leaders, like yourself to ensure that you are confident. that have shaped that path, the bigger It's always that zig zaggy sort of path, and embrace it! I think that's going to be really critical as well. I think is going to be really important. can deliver just kind of shows the sky's the limit. the opportunities open is going to be critical. What are some of the things that are exciting you And it allows our customers to create and reinvigorated by some of the things that you're sharing. I'm delighted to be here. from WiDS 2018, at Stanford University.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Lisa Martin | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
March of 2018 | DATE | 0.99+ |
Mala Anand | PERSON | 0.99+ |
Silicon Valley | LOCATION | 0.99+ |
Ann Rosenberg | PERSON | 0.99+ |
2017 | DATE | 0.99+ |
Maria Klawe | PERSON | 0.99+ |
SAP | ORGANIZATION | 0.99+ |
30% | QUANTITY | 0.99+ |
2020 | DATE | 0.99+ |
Second | QUANTITY | 0.99+ |
30 seconds | QUANTITY | 0.99+ |
100,000 people | QUANTITY | 0.99+ |
last year | DATE | 0.99+ |
Mala | PERSON | 0.99+ |
next year | DATE | 0.99+ |
25% | QUANTITY | 0.99+ |
first | QUANTITY | 0.99+ |
two elements | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
#WiDS2018 | EVENT | 0.99+ |
second year | QUANTITY | 0.99+ |
First | QUANTITY | 0.99+ |
SAP Leonardo | ORGANIZATION | 0.99+ |
Women in Data Science Conference 2018 | EVENT | 0.98+ |
one | QUANTITY | 0.98+ |
both | QUANTITY | 0.98+ |
two years ago | DATE | 0.98+ |
over 50 countries | QUANTITY | 0.98+ |
third | QUANTITY | 0.98+ |
this year | DATE | 0.98+ |
one point | QUANTITY | 0.98+ |
Stanford | ORGANIZATION | 0.98+ |
SAP Leonardo Data Analytics | ORGANIZATION | 0.97+ |
Brown | ORGANIZATION | 0.97+ |
today | DATE | 0.97+ |
WiDS | EVENT | 0.97+ |
Women in Data Science Conference 2018 | EVENT | 0.97+ |
thirdly | QUANTITY | 0.96+ |
Stanford University | ORGANIZATION | 0.95+ |
single | QUANTITY | 0.94+ |
WiDS 2018 | EVENT | 0.93+ |
few years ago | DATE | 0.92+ |
WiDS | ORGANIZATION | 0.92+ |
executive vice president | PERSON | 0.9+ |
ORGANIZATION | 0.9+ | |
this morning | DATE | 0.89+ |
three takeaways | QUANTITY | 0.86+ |
theCUBE | ORGANIZATION | 0.84+ |
Leondardo | TITLE | 0.83+ |
one of the speakers | QUANTITY | 0.83+ |
Narrator | TITLE | 0.8+ |
TITLE | 0.79+ | |
president | PERSON | 0.76+ |
earlier | DATE | 0.73+ |
Bhavani Thurasingham, UT Dallas | WiDS 2018
>> Announcer: Live, from Stanford University in Palo Alto, California, it's theCUBE covering Women in Data Science Conference 2018, brought to you by Stanford. (light techno music) >> Welcome back to theCUBE's continuing coverage of the Women in Data Science event, WiDS 2018. We are live at Stanford University. You can hear some great buzz around us. A lot of these exciting ladies in data science are here around us. I'm pleased to be joined by my next guest, Bhavani Thuraisingham, who is one of the speakers this afternoon, as well as a distinguished professor of computer science and the executive director of Cyber Security Institute at the University of Texas at Dallas. Bhavani, thank you so much for joining us. >> Thank you very much for having me in your program. >> You have an incredible career, but before we get into that I'd love to understand your thoughts on WiDS. In it's third year alone, they're expecting to reach over 100,000 people today, both here at Stanford, as well as more than 150 regional events in over 50 countries. When you were early in your career you didn't have a mentor. What does an event like WiDS mean to you? What are some of the things that excite you about giving your time to this exciting event? >> This is such an amazing event and just in three years it has just grown and I'm just so motivated myself and it's just, words cannot express to see so many women working in data science or wanting to work in data science, and not just in U.S. and in Stanford, it's around the world. I was reading some information about WiDS and I'm finding that there are WiDS ambassadors in Africa, South America, Asia, Australia, Europe, of course U.S., Central America, all over the world. And data science is exploding so rapidly because data is everywhere, right? And so you really need to collect the data, stow the data, analyze the data, disseminate the data, and for that you need data scientists. And what I'm so encouraged is that when I started getting into this field back in 1985, and that was 32 plus years ago in the fall, I worked 50% in cyber security, what used to be called computer security, and 50% in data science, what used to be called data management at the time. And there were so few women and we did not have, as I said, women role models, and so I had to sort of work really hard, the commercial industry and then the MITRE Corporation and the U.S. Government, but slowly I started building a network and my strongest supporters have been women. And so that was sort of in the early 90's when I really got started to build this network and today I have a strong support group of women and we support each other and we also mentor so many of the junior women and so that, you know, they don't go through, have to learn the hard way like I have and so I'm very encouraged to see the enthusiasm, the motivation, both the part of the mentors as well as the mentees, so that's very encouraging but we really have to do so much more. >> We do, you're right. It's really kind of the tip of the iceberg, but I think this scale at which WiDS has grown so quickly shines a massive spotlight on there's clearly such a demand for it. I'd love to get a feel now for the female undergrads in the courses that you teach at UT Dallas. What are some of the things that you are seeing in terms of their beliefs in themselves, their interests in data science, computer science, cyber security. Tell me about that dynamic. >> Right, so I have been teaching for 13 plus years full-time now, after a career in industry and federal research lab and government and I find that we have women, but still not enough. But just over the last 13 years I'm seeing so much more women getting so involved and wanting to further their careers, coming and talking to me. When I first joined in 2004 fall, there weren't many women, but now with programs like WiDS and I also belong to another conference and actually I shared that in 2016, called WiCyS, Women in Cyber Security. So, through these programs, we've been able to recruit more women, but I would still have to say that most of the women, especially in our graduate programs are from South Asia and East Asia. We hardly find women from the U.S., right, U.S. born women pursuing careers in areas like cyber security and to some extent I would also say data science. And so we really need to do a lot more and events like WiDS and WiCys, and we've also started a Grace Lecture Series. >> Grace Hopper. >> We call it Grace Lecture at our university. Of course there's Grace Hopper, we go to Grace Hopper as well. So through these events I think that, you know women are getting more encouraged and taking leadership roles so that's very encouraging. But I still think that we are really behind, right, when you compare men and women. >> Yes and if you look at the statistics. So you have a speaking session this afternoon. Share with our audience some of the things that you're going to be sharing with the audience and some of the things that you think you'll be able to impart, in terms of wisdom, on the women here today. >> Okay, so, what I'm going to do is that, first start off with some general background, how I got here so I've already mentioned some of it to you, because it's not just going to be a U.S. event, you know, it's going to be in Forbes reports that around 100,000 people are going to watch this event from all over the world so I'm going to sort of speak to this global audience as to how I got here, to motivate these women from India, from Nigeria, from New Zealand, right? And then I'm going to talk about the work I've done. So over the last 32 years I've said about 50% of my time has been in cyber security, 50% in data science, roughly. Sometimes it's more in cyber, sometimes more in data. So my work has been integrating the two areas, okay? So my talk, first I'm going to wear my data science hat, and as a data scientist I'm developing data science techniques, which is integration of statistical reasoning, machine learning, and data management. So applying data science techniques for cyber security applications. What are these applications? Intrusion detection, insider threat detection, email spam filtering, website fingerprinting, malware analysis, so that's going to be my first part of the talk, a couple of charts. But then I'm going to wear my cyber security hat. What does that mean? These data science techniques could be hacked. That's happening now, there are some attacks that have been published where the data science, the models are being thwarted by the attackers. So you can do all the wonderful data science in the world but if your models are thwarted and they go and do something completely different, it's going to be of no use. So I'm going to wear my cyber security hat and I'm going to talk about how we are taking the attackers into consideration in designing our data science models. It's not easy, it's extremely challenging. We are getting some encouraging results but it doesn't mean that we have solved the problem. Maybe we will never solve the problem but we want to get close to it. So this area called Adversarial Machine Learning, it started probably around five years ago, in fact our team has been doing some really good work for the Army, Army research office, on Adversarial Machine Learning. And when we started, I believe it was in 2012, almost six years ago, there weren't many people doing this work, but now, there are more and more. So practically every cyber security conference has got tracks in data science machine learning. And so their point of view, I mean, their focus is not, sort of, designing machine learning techniques. That's the area of data scientists. Their focus is going to be coming up with appropriate models that are going to take the attackers into consideration. Because remember, attackers are always trying to thwart your learning process. >> Right, we were just at Fortinet Accelerate last week, theCUBE was, and cyber security and data science are such interesting and pervasive topics, right, cyber security things when Equifax happened, right, it suddenly translates to everyone, male, female, et cetera. And the same thing with data science in terms of the social impact. I'd love your thoughts on how cyber security and data science, how you can educate the next generation and maybe even reinvigorate the women that are currently in STEM fields to go look at how much more open and many more opportunities there are for women to make massive impact socially. >> There are, I would say at this time, unlimited opportunities in both areas. Now, in data science it's really exploding because every company wants to do data science because data gives them the edge. But what's the point in having raw data when you cannot analyze? That's why data science is just exploding. And in fact, most of our graduate students, especially international students, want to focus in data science. So that's one thing. Cyber security is also exploding because every technology that is being developed, anything that has a microprocessor could be hacked. So, we can do all the great data science in the world but an attacker can thwart everything, right? And so cyber security is really crucial because you have to try and stop the attacker, or at least detect what the attacker is doing. So every step that you move forward you're going to be attacked. That doesn't mean you want to give up technology. One could say, okay, let's just forget about Facebook, and Google, and Amazon, and the whole lot and let's just focus on cyber security but we cannot. I mean we have to make progress in technology. Whenever we make for progress in technology, driver-less cars or pacemakers, these technologies could be attacked. And with cyber security there is such a shortage with the U.S. Government. And so we have substantial funding from the National Science Foundation to educate U.S. citizen students in cyber security. And especially recruit more women in cyber security. So that's why we're also focusing, we are a permanent coach here for the women in cyber security event. >> What have some of the things along that front, and I love that, that you think are key to successfully recruiting U.S. females into cyber security? What do you think speaks to them? >> So, I think what speaks to them, and we have been successful in recent years, this program started in 2010 for us, so it's about eight years. The first phase we did not have women, so 2000 to 2014, because we were trying to get this education program going, giving out the scholarships, then we got our second round of funding, but our program director said, look, you guys have done a phenomenal job in having students, educating them, and placing them with U.S. Government, but you have not recruited female students. So what we did then is to get some of our senior lecturers, a superb lady called Dr. Janelle Stratch, she can really speak to these women, so we started the Grace Lecture. And so with those events, and we started the women in cyber security center as part of my cyber security institute. Through these events we were able to recruit more women. We are, women are still under-represented in our cyber security program but still, instead of zero women, I believe now we have about five women, and that's, five, by the time we will have finished a second phase we will have total graduated about 50 plus students, 52 to 55 students, out of which, I would say about eight would be female. So from zero to go to eight is a good thing, but it's not great. >> We want to keep going, keep growing that. >> We want out of 50 we should get at least 25. But at least it's a start for us. But data science we don't have as much of a problem because we have lots of international students, remember you don't need U.S. citizenship to get jobs at Facebook or, but you need U.S. citizenships to get jobs as NSA or CIA. So we get many international students and we have more women and I would say we have, I don't have the exact numbers, but in my classes I would say about 30%, maybe just under 30%, female, which is encouraging but still it's not good. >> 30% now, right, you're right, it's encouraging. What was that 13 years ago when you started? >> When I started, before data science and everything it was more men, very few women. I would say maybe about 10%. >> So even getting to 30% now is a pretty big accomplishment. >> Exactly, in data science, but we need to get our cyber security numbers up. >> So last question for you as we have about a minute left, what are some of the things that excite you about having the opportunity, to not just mentor your students, but to reach such a massive audience as you're going to be able to reach through WiDS? >> I, it's as I said, words cannot express my honor and how pleased and touched, these are the words, touched I am to be able to talk to so many women, and I want to say why, because I'm of, I'm a tamil of Sri Lanka origin and so I had to make a journey, I got married and I'm going to talk about, at 20, in 1975 and my husband was finishing, I was just finishing my undergraduate in mathematics and physics, my husband was finishing his Ph.D. at University of Cambridge, England, and so soon after marriage, at 20 I moved to England, did my master's and Ph.D., so I joined University of Bristol and then we came here in 1980, and my husband got a position at New Mexico Petroleum Recovery Center and so New Mexico Tech offered me a tenure-track position but my son was a baby and so I turned it down. Once you do that, it's sort of hard to, so I took visiting faculty positions for three years in New Mexico then in Minneapolis, then I was a senior software developer at Control Data Corporation it was one of the big companies. Then I had a lucky break in 1985. So I wanted to get back into research because I liked development but I wanted to get back into research. '85 I became, I was becoming in the fall, a U.S. citizen. Honeywell got a contract to design and develop a research contract from United States Air Force, one of the early secure database systems and Honeywell had to interview me and they had to like me, hire me. All three things came together. That was a lucky break and since then my career has been just so thankful, so grateful. >> And you've turned that lucky break by a lot of hard work into what you're doing now. We thank you so much for stopping. >> Thank you so much for having me, yes. >> And sharing your story and we're excited to hear some of the things you're going to speak about later on. So have a wonderful rest of the conference. >> Thank you very much. >> We wanted to thank you for watching theCUBE. Again, we are live at Stanford University at the third annual Women in Data Science Conference, #WiDs2018, I am Lisa Martin. After this short break I'll be back with my next guest. Stick around. (light techno music)
SUMMARY :
brought to you by Stanford. of computer science and the executive director What are some of the things that excite you so many of the junior women and so that, you know, What are some of the things that you are seeing and I find that we have women, but still not enough. So through these events I think that, you know and some of the things that you think you'll be able and I'm going to talk about how we and maybe even reinvigorate the women that are currently and let's just focus on cyber security but we cannot. and I love that, that you think are key to successfully and that's, five, by the time we will have finished to get jobs at Facebook or, but you need U.S. citizenships What was that 13 years ago when you started? it was more men, very few women. So even getting to 30% now Exactly, in data science, but we need and so I had to make a journey, I got married We thank you so much for stopping. some of the things you're going to speak about later on. We wanted to thank you for watching theCUBE.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Honeywell | ORGANIZATION | 0.99+ |
National Science Foundation | ORGANIZATION | 0.99+ |
1980 | DATE | 0.99+ |
Bhavani | PERSON | 0.99+ |
2010 | DATE | 0.99+ |
New Mexico | LOCATION | 0.99+ |
1975 | DATE | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Minneapolis | LOCATION | 0.99+ |
Control Data Corporation | ORGANIZATION | 0.99+ |
NSA | ORGANIZATION | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
2012 | DATE | 0.99+ |
Janelle Stratch | PERSON | 0.99+ |
1985 | DATE | 0.99+ |
England | LOCATION | 0.99+ |
Australia | LOCATION | 0.99+ |
MITRE Corporation | ORGANIZATION | 0.99+ |
New Zealand | LOCATION | 0.99+ |
Africa | LOCATION | 0.99+ |
ORGANIZATION | 0.99+ | |
United States Air Force | ORGANIZATION | 0.99+ |
2016 | DATE | 0.99+ |
ORGANIZATION | 0.99+ | |
Europe | LOCATION | 0.99+ |
Asia | LOCATION | 0.99+ |
52 | QUANTITY | 0.99+ |
five | QUANTITY | 0.99+ |
three years | QUANTITY | 0.99+ |
Nigeria | LOCATION | 0.99+ |
2014 | DATE | 0.99+ |
CIA | ORGANIZATION | 0.99+ |
U.S. | LOCATION | 0.99+ |
13 plus years | QUANTITY | 0.99+ |
India | LOCATION | 0.99+ |
second round | QUANTITY | 0.99+ |
Grace Hopper | PERSON | 0.99+ |
Central America | LOCATION | 0.99+ |
South Asia | LOCATION | 0.99+ |
30% | QUANTITY | 0.99+ |
50% | QUANTITY | 0.99+ |
Cyber Security Institute | ORGANIZATION | 0.99+ |
U.S. Government | ORGANIZATION | 0.99+ |
eight | QUANTITY | 0.99+ |
East Asia | LOCATION | 0.99+ |
first phase | QUANTITY | 0.99+ |
Bhavani Thuraisingham | PERSON | 0.99+ |
South America | LOCATION | 0.99+ |
Dallas | LOCATION | 0.99+ |
last week | DATE | 0.99+ |
University of Bristol | ORGANIZATION | 0.99+ |
third year | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
zero | QUANTITY | 0.99+ |
first part | QUANTITY | 0.99+ |
2004 fall | DATE | 0.99+ |
Stanford | LOCATION | 0.99+ |
New Mexico Tech | ORGANIZATION | 0.98+ |
WiDS | EVENT | 0.98+ |
over 100,000 people | QUANTITY | 0.98+ |
Equifax | ORGANIZATION | 0.98+ |
one | QUANTITY | 0.98+ |
more than 150 regional events | QUANTITY | 0.98+ |
second phase | QUANTITY | 0.98+ |
over 50 countries | QUANTITY | 0.98+ |
UT Dallas | ORGANIZATION | 0.98+ |
two areas | QUANTITY | 0.98+ |
2000 | DATE | 0.98+ |
one thing | QUANTITY | 0.98+ |
early 90's | DATE | 0.98+ |
both areas | QUANTITY | 0.98+ |
both | QUANTITY | 0.98+ |
Stanford University | ORGANIZATION | 0.98+ |
Women in Data Science | EVENT | 0.98+ |
55 students | QUANTITY | 0.98+ |
today | DATE | 0.98+ |
first | QUANTITY | 0.98+ |
WiDS 2018 | EVENT | 0.98+ |
'85 | DATE | 0.98+ |
theCUBE | ORGANIZATION | 0.98+ |
Ruth Marinshaw, Research Computing | WiDS 2018
>> Narrator: Live from Stanford University in Palo Alto, California, it's theCube, covering Women in Data Science conference 2018. Brought to you by Stanford. >> Welcome back to theCube. I'm Lisa Martin and we're live at Stanford University, the third annual Women in Data Science conference, WiDS. This is a great one day technical event with keynote speakers, with technical vision tracks, career panel and some very inspiring leaders. It's also expected to reach over 100,000 people today, which is incredible. So we're very fortunate to be joined by our next guest, Ruth Marinshaw, the CTO for Research Computing at Stanford University. Welcome to theCube, Ruth. >> Thank you. It's an honor to be here. >> It's great to have you here. You've been in this role as CTO for Research Computing at Stanford for nearly six years. >> That's correct. I came here after about 25 years at the University of North Carolina Chapel Hill. >> So tell us a little bit about what you do in terms of the services that you support to the Institute for Computational Mathematics and Engineering. >> So our team and we're about 17 now supports systems, file systems storage, databases, software across the university to support computational and data intensive science. So ICME, being really the home of computational science education at Stanford from a degree perspective, is a close partner with us. We help them with training opportunities. We try to do some collaborative planning, event promotion, sharing of ideas. We have joint office hours where we can provide system support. Margot's graduate students and data scientists can provide algorithmic support to some thousands of users across the campus, about 500 faculty. >> Wow. So this is the third year for WiDS, your third year here. >> Ruth: It is. >> When you spoke with Margot Gerritsen, who's going to be joining us later today, about the idea for WiDS, what were some of your thoughts about that? Did you expect it to make as big of >> Ruth: No. >> an impact? >> No, no people have been talking about this data tsunami and the rise of big data, literally for 10 years, but actually it arrived. This is the world we live in, data everywhere, that data deluge that had been foreseen or promised or feared was really there. And so when Margot had the idea to start WiDS, I actually thought what a nice campus event. There are women all over Stanford, across this disciplines who are engaged in data science and more who should. Stanford, if anything, is known for its interdisciplinary research and data science is one of those fields that really crosses the schools and the disciplines. So I thought, what a great way to bring women together at Stanford. I clearly did not expect that it would turn into this global phenomenon. >> That is exactly. I love that word, it is a phenomenon. It's a movement. They're expecting, there's, I said over a 100,000 participants today, at more than 150 regional events. I think that number will go up. >> Ruth: Yes. >> During the day. And more than 50 countries. >> Ruth: Yes. >> But it shows, even in three years, not only is there a need for this, there's a demand for it. That last year, I think it was upwards of 75,000 people. To make that massive of a jump in one year and global impact, is huge. But it also speaks to some of the things that Margot and her team have said. It may have been comfortable as one of or the only woman at a boardroom table, but maybe there are others that aren't comfortable and how do we help them >> Ruth: Exactly. >> and inspire them and inspire the next generation. >> Exactly. I think it's a really very powerful statement and demonstration of the importance of community and building technical teams in making, as you said, people comfortable and feeling like they're not alone. We see what 100,000 women maybe joining in internationally over this week for these events. That's such a small fraction compared to what the need probably is to what the hunger probably is. And as Margot said, we're a room full of women here today, but we're still such a minority in the industry, in the field. >> Yes. So you mentioned, you've been here at Stanford for over five years, but you were at Chapel Hill before. >> Ruth: Yes. >> Tell me a little bit about your career path in the STEM field. What was your inspiration all those years ago to study this? >> My background is actually computational social sciences. >> Lisa: Oh interesting. >> And so from an undergraduate and graduate perspective and this was the dawn of western civilization, long ago, not quite that long (Lisa laughs) but long ago and even then, I was drawn to programming and data analysis and data sort of discovery. I as a graduate student and then for a career worked at a demographic research center at UNC Chapel Hill, where firsthand you did data science, you did original data collection and data analysis, data manipulation, interpretation. And then parlayed that into more of a technical role, learning more programming languages, computer hardware, software systems and the like. And went on to find that this was really my love, was technology. And it's so exciting to be here at Stanford from that perspective because this is the birthplace of many technologies and again, referencing the interdisciplinary nature of work here, we have some of the best data scientists in the world. We have some of the best statisticians and algorithm developers and social scientists, humanists, who together can really make a difference in solving, using big data, data science, to solve some of the pressing problems. >> The social impact that data science and computer science alone can make with ideally a diverse set of eyes and perspectives looking at it, is infinite. >> Absolutely. And that's one reason I'm super excited today, this third WiDS for one of the keynote speakers, Latanya from Harvard. She's going to be talking, she's from government and sort of political science, but she's going to be talking about data science from the policy perspective and also the privacy perspective. >> Lisa: Oh yes. >> I think that this data science provides such great opportunity, not just to have the traditional STEM fields participating but really to leverage the ethicists and the humanists and the social sciences so we have that diversity of opinions shaping decision making. >> Exactly. And as much as big data and those technologies open up a lot of opportunities for new business models for corporations, I think so does it also in parallel open up new opportunities for career paths and for women in the field all over the world to make a big, big difference. >> Exactly. I think that's another value add for WiDS over it's three years is to expose young women to the range of career paths in which data science can have an impact. It's not just about coding, although that's an important part. As we heard this morning, investment banking, go figure. Right now SAP is talking about the impact on precision medicine and precision healthcare. Last year, we had the National Security Agency here, talking about use of data. We've had geographers. So I think it helps broaden the perspective about where you can take your skills in data science. And also expose you to the full range of skills that's needed to make a good data science team. >> Right. The hard skills, right, the data and statistical analyses, the computational skills, but also the softer skills. >> Ruth: Exactly. >> How do you see that in your career as those two sides, the hard skills, the soft skills coming together to formulate the things that you're doing today? >> Well we have to have a diverse team, so I think the soft skills come into play not just from having women on your team but a diversity of opinions. In all that we do in managing our systems and making decisions about what to do, we do look at data. They may not be data at scale that we see in healthcare or mobile devices or you know, our mobile health, our Fitbit data. But we try to base our decisions on an analysis of data. And purely running an algorithm or applying a formula to something will give you one perspective, but it's only part of the answer. So working as a team to evaluate other alternative methods. There never is just one right way to model something, right. And I think that, having the diversity across the team and pulling in external decision makers as well to help us evaluate the data. We look at the hard science and then we ask about, is this the right thing to do, is this really what the data are telling us. >> So with WiDS being aimed at inspiring and educating data scientists worldwide, we kind of talked a little bit already about inspiring the younger generation who are maybe as Maria Callaway said that the ideal time to inspire young females is first semester of college. But there's also sort of a flip side to that and I think that's reinvigorating. >> Yes. >> That the women who've been in the STEM field or in technology for awhile. What are some of the things that you have found invigorating in your own career about WiDS and the collaboration with other females in the industry? >> I think hearing inspirational speakers like Maria, last here and this year, Diane Greene from Google last year, talk about just the point you made that there's always opportunity, there's always time to learn new things, to start a new career. We don't have to be first year freshmen in college in order to start a career. We're all lifelong learners and to hear women present and to see and meet with people at the breakout sessions and the lunch, whose careers have been shaped by and some cases remade by the opportunity to learn new things and apply those skills in new areas. It's just exciting. Today for this conference, I brought along four or five of my colleagues from IT at Stanford, who are not data scientists. They would not call themselves data scientists, but there are data elements to all of their careers. And watching them in there this morning as they see what people are doing and hear about the possibilities, it's just exciting. It's exciting and it's empowering as well. Again back to that idea of community, you're not in it alone. >> Lisa: Right. >> And to be connected to all of these women across a generation is really, it's just invigorating. >> I love that. It's empowering, it is invigorating. Did you have mentors when you were in your undergraduate >> Ruth: I did. >> days? Were they males, females, both? >> I'd say in undergraduate and graduate school, actually they were more males from an academic perspective. But as a graduate student, I worked in a programming unit and my mentors there were all females and one in particular became then my boss. And she was a lifelong mentor to me. And I found that really important. She believed in women. She believed that programming was not a male field. She did not believe that technology was the domain only of men. And she really was supportive throughout. And I think it's important for young women as well as mid-career women to continue to have mentors to help bounce ideas off of and to help encourage inquiries. >> Definitely, definitely. I'm always surprised every now and then when I'm interviewing females in tech, they'll say I didn't have a mentor. >> Lisa: Oh. >> So I had to become one. But I think you know we think maybe think of mentors in an earlier stage of our careers, but at a later stage we talked about that reinvigoration. Are you finding WiDS as a source of maybe not only for you to have the opportunity to mentor more women but also are you finding more mentors of different generations >> Oh sure. >> as being part of WiDS? >> Absolutely, think of Karen Mathis, not just Margot but Karen, getting to know her. And we go for sort of walks around the campus and bounce ideas of each other. I think it is a community for yes, for all of us. It's not just for the young women and we want to remain engaged in this. The fact that it's global now, I think a new challenge is how do we leverage this international community now. So our opportunities for mentorship and partnership aren't limited to our local WiDS. They're an important group. But how do we connect across those different communities? >> Lisa: Exactly. >> They're international now. >> Exactly. I think I was on Twitter last night and there was the WiDS New Zealand about to go live. >> Yeah, yeah. >> And I just thought, wow it's this great community. But you make a good point that it's reached such scale so quickly. Now it's about how can we learn from women in different industries in other parts of the world. How can they learn from us? To really grow this foundation of collaboration and to a word you said earlier, community. >> It really is amazing though that in three years WiDS has become what it has because if you think about other organizations, special interest groups and the like, often they really are, they're not parochial. But they tend to be local and if they're national, they're not at this scale. >> Right. >> And so again back to it's the right time, it's the right set of organizers. I mean Margot, anything that she touches, she puts it herself completely into it and it's almost always successful. The right people, the right time. And finding ways to harness and encourage enthusiasm in really productive ways. I think it's just been fabulous. >> I agree. Last question for you. Looking back at your career, what advice would you have given young Ruth? >> Oh gosh. That's a really great question. I think to try to connect as much as you can outside your comfort zone. Back to that idea of mentorship. You think when you're an undergraduate, you explore curricula, you take crazy classes, Chinese or, not that that's crazy, but you know if you're a math major and you go take art or something. To really explore not just your academic breadth but also career opportunities and career understanding earlier on that really, oh I want to be a doctor, actually what do you know about being a doctor. I don't want to be a statistician, well why not? So I think to encourage more curiosity outside the classroom in terms of thinking about what is the world about and how can you make a difference. >> I love that, getting out of the comfort zone. One of my mentors says get comfortably uncomfortable and I love that. >> Ruth: That's great, yeah. >> I love that. Well Ruth, thank you so much for joining us on theCube today. It's our pleasure to have you here and we hope you have a great time at the event. We look forward to talking with you next time. >> We'll see you next year. >> Lisa: Excellent. >> Thank you. Buh-bye. >> I'm Lisa Martin. You're watching theCube live from Stanford University at the third annual Women in Data Science conference. #WiDS2018, join the conversation. After this short break, I'll be right back with my next guest. Stick around. (techno music)
SUMMARY :
Brought to you by Stanford. It's also expected to reach over 100,000 people today, It's an honor to be here. It's great to have you here. at the University of North Carolina Chapel Hill. in terms of the services that you support So ICME, being really the home So this is the third year for WiDS, and the rise of big data, literally for 10 years, I love that word, it is a phenomenon. During the day. But it also speaks to some of the things that Margot and inspire the next generation. and demonstration of the importance of community So you mentioned, you've been here at Stanford in the STEM field. And it's so exciting to be here at Stanford The social impact that data science and computer science and also the privacy perspective. and the social sciences so we have that diversity and for women in the field all over the world And also expose you to the full range of skills The hard skills, right, the data and statistical analyses, to something will give you one perspective, But there's also sort of a flip side to that and the collaboration with other females in the industry? and to hear women present and to see and meet with people And to be connected to all of these women Did you have mentors when you were in your undergraduate and to help encourage inquiries. I'm always surprised every now and then But I think you know we think maybe think of mentors It's not just for the young women and there was the WiDS New Zealand about to go live. and to a word you said earlier, community. But they tend to be local and if they're national, And so again back to it's the right time, what advice would you have given young Ruth? I think to try to connect as much as you can I love that, getting out of the comfort zone. We look forward to talking with you next time. Thank you. at the third annual Women in Data Science conference.
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Margot | PERSON | 0.99+ |
Karen | PERSON | 0.99+ |
Ruth | PERSON | 0.99+ |
Diane Greene | PERSON | 0.99+ |
Ruth Marinshaw | PERSON | 0.99+ |
Lisa Martin | PERSON | 0.99+ |
Karen Mathis | PERSON | 0.99+ |
Maria Callaway | PERSON | 0.99+ |
Lisa | PERSON | 0.99+ |
National Security Agency | ORGANIZATION | 0.99+ |
Margot Gerritsen | PERSON | 0.99+ |
Institute for Computational Mathematics and Engineering | ORGANIZATION | 0.99+ |
Last year | DATE | 0.99+ |
one year | QUANTITY | 0.99+ |
last year | DATE | 0.99+ |
five | QUANTITY | 0.99+ |
third year | QUANTITY | 0.99+ |
three years | QUANTITY | 0.99+ |
two sides | QUANTITY | 0.99+ |
Latanya | PERSON | 0.99+ |
100,000 women | QUANTITY | 0.99+ |
Today | DATE | 0.99+ |
four | QUANTITY | 0.99+ |
10 years | QUANTITY | 0.99+ |
Palo Alto, California | LOCATION | 0.99+ |
one day | QUANTITY | 0.99+ |
next year | DATE | 0.99+ |
both | QUANTITY | 0.99+ |
last night | DATE | 0.99+ |
One | QUANTITY | 0.99+ |
more than 50 countries | QUANTITY | 0.99+ |
Maria | PERSON | 0.99+ |
one | QUANTITY | 0.99+ |
over five years | QUANTITY | 0.99+ |
more than 150 regional events | QUANTITY | 0.98+ |
this year | DATE | 0.98+ |
over 100,000 people | QUANTITY | 0.98+ |
Fitbit | ORGANIZATION | 0.98+ |
Stanford University | ORGANIZATION | 0.98+ |
thousands | QUANTITY | 0.98+ |
first year | QUANTITY | 0.98+ |
about 500 faculty | QUANTITY | 0.98+ |
Stanford | ORGANIZATION | 0.98+ |
today | DATE | 0.98+ |
first semester | QUANTITY | 0.98+ |
75,000 people | QUANTITY | 0.98+ |
WiDS | EVENT | 0.98+ |
one reason | QUANTITY | 0.98+ |
University of North Carolina | ORGANIZATION | 0.97+ |
ORGANIZATION | 0.97+ | |
about 25 years | QUANTITY | 0.97+ |
Chapel Hill | LOCATION | 0.96+ |
SAP | ORGANIZATION | 0.96+ |
one perspective | QUANTITY | 0.95+ |
UNC Chapel Hill | ORGANIZATION | 0.94+ |
#WiDS2018 | EVENT | 0.94+ |
WiDS | ORGANIZATION | 0.94+ |
New Zealand | LOCATION | 0.94+ |
this week | DATE | 0.93+ |
third | QUANTITY | 0.92+ |
WiDS 2018 | EVENT | 0.92+ |