Image Title

Search Results for Farallon Institute:

Dr. Chelle Gentemann, Farallon Institute | AWS Public Sector Online


 

>> Narrator: From around the globe, it's theCUBE with digital coverage of AWS Public Sector Online. Brought to you by Amazon Web Services. >> Welcome back to the coverage of AWS Public Sector Summit virtual. I'm John for host of theCUBE. We're here in theCUBE studios, quarantine crew here talking to all the guests remotely as part of our virtual coverage of AWS Public Sector. So I've got a great guest here talking about data science, weather predictions, accurate climate modeling, really digging into how cloud is helping science. Dr. Chelle Gentemann, who is a senior scientist at Farallon Institute is my guest. Chelle, thank you for joining me. >> Thank you. >> So tell us a little about your research. It's fascinating how, I've always joked in a lot of my interviews, 10, 15, 20 years ago, you need super computers to do all these calculations. But now with cloud computing, it opens up so much more on the research side and the impact is significant. You're at an awesome Institute, the Farallon Institute, doing a lot of stuff in the sea and the ocean and a lot of your things. What's your focus? >> I study the ocean from space, and about 71% was covered by ocean. 40% of our population in the globe actually lives within 100 kilometers of the coast. The ocean influences our weather, it influences climate, but it also provides fisheries and recreational opportunities for people. So it's a really important part of the earth system. And I've been focused on using satellites. So from space, trying to understand how the ocean influences weather and climate >> And how new is this in terms of just state of the art? Fairly new, been around for a while? What's some of the progress for the state of the art we're involved in. >> I started working on satellite data in the 90s during school, and I liked the satellite data cause it's the interface of sort of applied math, computer science and physics. The state of the art is that we've really had remote sensing around for about 20, 30 years. But things are changing because right now we're having more sensors and different types of instruments up there and trying to combine that data is really challenging. To use it, our brain is really good in two and three dimensions, but once you get past that, it's really difficult for the human brain to try and interpret the data. And that's what scientists do. Is they try and take all these multidimensional data sets and try to build some understanding of the physics of what's going on. And what's really interesting is how cloud computing is impacting that. >> It sounds so exciting. The confluence of multiple disciplines kind of all right there, kind of geek out big time. So I've got to ask you, in the past you had the public data set program. Are you involved in that? Do you take advantage that research? How is some of the things that AWS is doing help you and is that public data set part of it? >> It's a big part of it now. I've helped to deploy some of the ocean temperature data sets on the cloud. And the way that AWS public data sets as sort of has potential to transform science is the way that we've been doing science, the way that I was trained in science was that you would go and download the data. And most of these big institutions that do research, you start to create these dark repositories where the institutions or someone in your group has downloaded data sets. And then you're trying to do science with these data, but you're not sure if it's the most recent version. It makes it really hard to do reproducible science, because if you want to share your code, somebody also has to access that data and download it. And these are really big data sets. So downloading it could take quite a long time. It's not very transparent, it's not very open. So when you move to a public data set program like AWS, you just take all of that download out of the equation. And instantly when I share my code now, people can run the code and just build on it and go right from there, or they can add to it or suggest changes. That's a really big advantage for trying to do open science. >> I had a dinner with Teresa Carlson who is awesome. She runs the Public Sector Summit for AWS. And I remember this was years ago and we were dreaming about a future where we would have national parks in the cloud or this concept of a Yosemite-like beautiful treasure. Physical place you could go there. And we were kind of dreaming that, wouldn't it be great to have like these data sets or supercomputer public commons. It sounds like that's kind of the vibe here where it's shareable and it's almost like a digital national park or something. Is that it's a shared resource. Is that kind of happening? First of all, what do you react to that? And what's your thoughts around that dream? And does this kind of tie to that? >> Yeah, I think it ties directly to that. When I think about how science is still being done and has been done for the past sort of 20 years, we had a real change about 20 years ago when a lot of the government agencies started requiring their data to be public. And that was a big change. So then we got, we actually had public data sets to work with. So more people started getting involved in science. Now I see it as sort of this fortress of data that in some ways have prevented scientists from really moving rapidly forward. But with moving onto the cloud and bringing your ideas and your compute to the data set, it opens up this entire Pandora's box, this beautiful world of how you can do science. You're no longer restricted to what you have downloaded or what you're able to do because you have this unlimited compute. You don't have to be at a big institution with massive supercomputers. I've been running hundreds of workers analyzing in my realm. Over two or 300 gigabytes of data on a $36 Raspberry Pi that I was playing around with my kids. That's transformative. That allows anyone to access data. >> And if you think about what it would have to do to do that in the old days, stack and spike servers. Call, first of all, you'll get the cash, buy servers, rack them and stack them, connect to a download of nightmare. So I got to ask you now with all this capability, first of all, you're talking to someone who loves the cloud. So I'm pretty biased. What are you doing now with the cloud that you couldn't do before? So certainly the old way from a provisioning standpoint, check, done. Innovation, bars raised. Now you're creative, you're looking at solutions, you're building enabling device like a Raspberry Pi, almost like a switch or an initiation point. How has the creativity changed? What can you do now? What are some of the things that are possible that you're doing? >> I think that you can point to within some of the data sets that have already gone on the cloud are being used in these really new, different ways. Again, it points to this, when you don't have access to the data, just simply because you have to download it. So that downloading the data and figuring out how to use it and figuring out how to store it is a big barrier for people. But when things like the HF Radar data set went online. Within a couple of months, there was a paper where people were using it to monitor bird migration in ways that they'd never been able to do before, because they simply hadn't been able to get the data. There's other research being done, where they've put whale recordings on the cloud and they're using AI to actually identify different whales. It's using one data set, but it's also the ability to combine all these different data sets and have access to them at the same time and not be limited by your computer anymore. Which for a lot of science, we've been limited by our access to compute. And that when you take away that, it opens all these new doors into doing different types of research with new types of data, >> You could probably correlate the whale sounds with the temperature and probably say, hey, it's cold. >> Chelle: Exactly. >> I'm making that up. But that's the kind of thing that wouldn't be possible before because you'd have to get the data set, do some math. I mean, this is cool stuff with the ocean. I mean, can you just take a minute to share some, give people an insight in some of the cool projects that are being either thought up or dreamed up or initiated or done or in process or in flight, because actually there's so much data in the ocean. So much things to do, it's very dynamic. There's a lot of data obviously. Share, for the folks that might not have a knowledge of what goes on. What are you guys thinking about? >> A lot of what we're thinking about is how to have societal impact. So as a scientist, you want your work to be relevant. And one of the things that we found is that the ocean really impacts weather at scales that we simply can't measure right now. So we're really trying to push forward with space instrumentation so that we can monitor the ocean in new ways at new resolutions. And the reason that we want to do that is because the ocean impacts longterm predictability in the weather forecast. So a lot of weather forecasts now, if you look out, you can go on to Weather Underground or whatever weather site you want. And you'll see the forecast goes out 10 days and that's because there's not a lot of accuracy after that. So a lot of research is going into how do we extend into seasonal forecast? I'm from Santa Rosa, California. We've been massively impacted by wildfires. And being able to understand how to prepare for the coming season is incredibly important. And surprisingly, I think to a lot of people, the ocean plays a big role in that. The ocean can impact how much storm systems, how they grow, how they evolve, how much water they actually got. Moisture they pick up from the ocean and then transport over land. So if you want to talk about, it's really interesting to talk about how the ocean impacts our weather and our seasonal weather. So that's an area where people are doing a lot of research. And again, you're talking about different data sets and being able to work together in a collaborative environment on the cloud is really what's starting to transform how people are working together, how they're communicating and how they're sharing their science. >> I just hope it opens up someone's possibilities. I want to get your vision of what you think the breakthroughs might be possible with cloud for research and computing. Because you have kind of old school and new school. Amazon CEO, Andy Jassy calls it old guard, new guard. The new guard is really more looking for self provisioning, auto-scaling, all that. Super computer on demand, all that stuff at your fingertips. Great, love that. But is there any opportunity for institutional change within the scientific community? What's your vision around the impact? It's not just scientific. It also can go to government for societal impact. So you start to see this modernization trend. What's your vision on the impact of the scientific community with cloud? >> I think that the way the scientific community has been organized for a long time is that scientists that are at an institute. And a lot of the research has been siloed. And it's siloed in part because of the way the funding mechanism works. But that inhibits creativity and inhibits collaboration. And it inhibits the advancement of science. Because if you hold onto data, you hold on to code. You're not allowing other people to work on it and to build on what you do. The traditional way that scientists have moved forward is you make a discovery, you write up a paper, you describe it in a journal article, and then you publish that. Then if someone wants to build on your research, they get your journal article, they read it. Then they try to understand what you did. They maybe recode all of your analysis. So they're redoing the work that you did, which is simply not efficient. Then they have to download the data sets that you access. This slows down all of science. And it also inhibits bringing in new data sets again because you don't have access to them. So one of the things I'm really excited about with cloud computing is that by bringing our scientific ideas and our compute to the data, it allows us to break out of these silos and collaborate with people outside of our institution, outside of our country, and bring new ideas and new voices and elevate everyone's ideas to another level. >> It brings the talent and the ideas together. And now you have digital and virtual worlds, cause we've been virtualized with COVID-19. You can create content as a community building capability or your work can create a network effect with other peers. And is a flash mobbing effect of potential collaboration. So work, work forces, workplaces, work loads, work flows, kind of are interesting or kind of being changed in real time. You were just talking about speed, agility. These are technical concepts being applied to kind of real world scenarios. I mean your thoughts on that. >> I now work with people like right now, I'm working with students in Denmark, Oman, India, France, and the US. That just wasn't possible 10 years ago. And we're able to bring all these different voices together, which it really frees up science and it frees up who can participate in science, which is really fun. I mean, I'm a scientist. I do it because it's really, really fun. And I love working with other people. So this new ability that I've gained in the last couple of years by moving onto the cloud has really accelerated all the different types of collaborations I'm involved with. And hopefully accelerating science as a whole. >> I love this topic. It's one of my passion areas where it's an issue I've been scratching for over a decade too. Is that content and your work is an enabler for community engagement because you don't need to publish it to a journal. It's like waterfall mentality. It's like you do it. But if you can publish something or create something and show it, demo it or illustrate it, that's better than a paper. If you're on video, you can talk about it. It's going to attract other people, like-minded peers can come together. That's going to create more collaboration data. That's going to create more solidarity around topics and accelerate the breakthroughs. >> For our last paper, we actually published all the software with it. We got a digital firewall for the software, published the software and then containerized it so that when you read our paper, at the bottom of the paper, you get a link. You go to that link, you click on a button and you're instantly in our compute environment, you can reproduce all of our results. Do the error propagation analysis that we did. And then if you don't like something, go ahead and change it or add onto it or ask us some questions. That's just magical. >> Yeah, it really is. And Amazon has been a real investor and I got to give props to Teresa Carlson and her team and Andy Jassy, the CEO, because they've been investing in credits and collaborating with groups like Jet Propulsion Lab, you guys, everyone else. Just space has been a big part of that. I see Bezos love space. So they've been investing in that and bringing that resource to the table. So you've got to give Amazon some props for that. But great work that you're doing. I'm fascinated. I think it's one of those examples where it's a moonshot, but it's doable. It's like you can get there. >> Yeah, and it's just so exciting. I'm the lead on a proposal for a new science mission to NASA. And we are going all in with the cloud computing. So we're going to do all the processing on the cloud. We want to do the entire science team on the cloud and create a science data platform where we're all working together. That's just never happened before. And I think that by doing this, we multiply the benefits of all of our analysis. We make it faster and we make it better and we make it more collaborative. So everyone wins. >> Sure, you're an inspiration to many. I'm so excited to do this interview with you. I love what you said earlier at the beginning about your focus of being in computer science, physics, space. That confluence is multiple disciplines. Not everyone can have that. Some people just get a computer science degree. Some people get, I'm premed, or I'm going to do biology. I'm going to do this. This notion of multiple disciplines coming together is really what society needs now. Is we're converging or virtualizing or becoming a global society. And that brings up my final question. Is something I know that you're passionate about creating a more inclusive scientific community because you don't have to be the, just the computer science major. Now, if you have all three, it's a multi-tool when you're a multiple skill player. But you don't have to be something to get into this new world. Because if you have certain disciplines, whether it's math, maybe you don't have computer science but it's quick to learn. There's frameworks out there, no code, low code. So cloud computing supports this. What's your vision and what's your opinion of how more inclusivity can come into the scientific community? >> I think that, when you're at an institution or at a commercial company or a nonprofit, if you're at some sort of organized institution, you have access to things that not everyone has access to. And in a lot of the world, there's trouble with internet connectivity. There is trouble downloading data. They simply don't have the ability to download large data sets. So I'm passionate about inclusivity because I think that, until we include global voices in science, we're not going to see these global results that we need to. We need to be more interdisciplinary. And that means working with different scientists in different fields. And if we can all work together on the same platform that really helps explode interdisciplinary science and what can be done. A lot of science has been quite siloed because you work at an institution. So you talked to the people one door down, or two doors down or on the same floor. But when you start working in this international community and people don't have to be online all the time, they can write code and then just jump on and upload it. You don't need to have these big, powerful resources or institutions behind you. And that gives a platform for all types of scientists, that all types of levels to start working with everyone. >> This is why I love the idea of the content and the community being horizontally scalable. Because if you're stuck around a physical institution or space, you kind of like have group think, or maybe you have the same kind of ideas being talked about. But here, when you pull back the remote work with COVID-19, as an example, it highlights it. The remote scientist could be anywhere. So that's going to increase access. What can we do to accept those voices? Is there a way or an idea or formula you see that people could, assuming there's access, which I would say, yes. What do we do? What do you do? >> I think you have to be open and you have to listen. Because, if I ask a question into the room where my colleagues work, we're going to come up with an answer. But we're going to come up with an answer that's informed by how we were trained in science and what fields we know. So when you open up this box and you allow other voices to participate in science, you're going to get new and different answers. And as a scientist, you need to be open to allowing those voices to be heard and to acting on them and including them in your research results and thinking about how they may change what you think and bring you to new conclusions. >> Machine learning has been a part. I know your work in the past, obviously cloud you're a big fan, obviously can tell. Proponent of it. Machine learning and AI can be a big part of this too, both on not only sourcing new voices and identifying what's contextually relevant at any given time, but also on the science-side machine learning. Because if we can take a minute to give your thoughts on the and relevance of machine learning and AI, because you still got the humans and you got machines augmenting each other, that relationship is going to be a constant conversation point going forward. Is there data about the data and what's the machines doing? What's your thoughts on all of these? Machine learning and AI as an impact. >> It's funny you say impact. So I work with this NASA IMPACT project, which is this interdisciplinary team that tries to advance science, and it's really into machine learning and AI. One of the difficulties when you start to do science is you have an idea like, okay, I want to study tropical storms. And then you have to go and wade through all these different types of data to identify when events happened and then gather all the data from those different events and start to try and do some analysis. They're working and they've been really successful in using AI to actually do this sort of event identification. So what's interesting and how can we use AI and machine learning to identify those interesting events and gathering everything together for scientists to then try and bring for analysis? So AI is being used in a lot of different ways in science. It's being used to look at these multi-dimensional problems that are just a little bit too big for our brains to try and understand. But if we can use AI and machine learning to gather insights into certain aspects of them, it starts to lead to new conclusions and it starts to allow us to see new connections. AI and machine learning has this potential to transform how we do science. Cloud computing is part of that because we have access to so much more data now. >> It's a real enabling technology. And when you have enabling technology, the power is in the hands of the creative minds. And it's really what you can think up and what you can dream up and that's going to come from people. Phenomenal. Final question for you, to kind of end on a light note. Dr. Chelle Gentemann here, senior scientist at the Farallon Institute. You're doing a lot of work on the ocean, space, ocean interaction. What's the coolest thing you're working on right now? Or you you've worked on that you think would be worth sharing. >> There's a couple of things. I have to think about what's the most fun. Right now, I'm working on doing some analysis with data. We had a big, huge international field campaign this winter off of Barbados, there were research festival, rustles and aircraft. There were sail drones involved, which are these autonomous robotic vehicles that go along the ocean surface and measure air-sea interactions. Right now we're working on analyzing that data. So we have all of this ground truth data. We're bringing in all the satellite observations to see how we can better understand the earth system in that region with a specific focus on air-sea interactions over the ocean where when it rains, you get the salinity stratification. When there's strong solar, you get diurnal stratification. So you have upper ocean stratification and heat and salinity. And how those impact the fluxes and how the ocean impacts the heat and moisture transport into the atmosphere, which then affects weather. So again, this is this multidimensional data set with all these different types of both ground truth data, satellite data that we're trying to bring together and it's really exciting. >> It could shape policy, it could shape society. Maybe have a real input into global warming. Our behaviors in the world, sounds awesome. Plus, I love the ground truth and the observational data. It sounds like our media business algorithm, we got to get the observation, get the truth, report it. Sounds like there's something in there that we could learn from. (both giggling) >> Yeah, it's very interesting cause you often find what you see from a distance is not quite true up close. >> I can tell you that we as in media as we do a lot of investigative journalism. So we appreciate that. Dr. Chelle Gentemann, senior scientist at the Farallon Institute, here as part of AWS Public Sector Summit. Thank you so much for time. What a great story. We'll keep in touch. Love the sails drone. Great innovation. And continue the good work, I'm looking forward to checking in later. Thanks for joining. >> Thanks so much. It was nice talking to you. >> I'm John Furrier with theCUBE. We're here in our studios covering the Amazon Web Services Public Sector Summit virtual. This is theCUBE virtual bringing you all the coverage with Amazon and theCUBE. Thanks for watching. (upbeat music)

Published Date : Jun 30 2020

SUMMARY :

Brought to you by Amazon Web Services. Chelle, thank you for joining me. and the ocean and a lot of your things. I study the ocean from space, for the state of the the human brain to try in the past you had the and download the data. First of all, what do you react to that? to what you have downloaded So I got to ask you now And that when you take away that, correlate the whale sounds So much things to do, it's very dynamic. And the reason that we want to do that of the scientific community with cloud? and to build on what you do. and the ideas together. and the US. and accelerate the breakthroughs. You go to that link, you click on a button and bringing that resource to the table. science team on the cloud But you don't have to be something And in a lot of the world, and the community being and you allow other voices and you got machines And then you have to go And it's really what you can think up and how the ocean impacts the heat and the observational data. cause you often find what And continue the good work, It was nice talking to you. the Amazon Web Services

SENTIMENT ANALYSIS :

ENTITIES

EntityCategoryConfidence
Andy JassyPERSON

0.99+

Chelle GentemannPERSON

0.99+

AmazonORGANIZATION

0.99+

Teresa CarlsonPERSON

0.99+

DenmarkLOCATION

0.99+

Farallon InstituteORGANIZATION

0.99+

Amazon Web ServicesORGANIZATION

0.99+

FranceLOCATION

0.99+

AWSORGANIZATION

0.99+

ChellePERSON

0.99+

Jet Propulsion LabORGANIZATION

0.99+

NASAORGANIZATION

0.99+

Teresa CarlsonPERSON

0.99+

IndiaLOCATION

0.99+

JohnPERSON

0.99+

John FurrierPERSON

0.99+

10 daysQUANTITY

0.99+

twoQUANTITY

0.99+

two doorsQUANTITY

0.99+

$36QUANTITY

0.99+

USLOCATION

0.99+

OmanLOCATION

0.99+

BarbadosLOCATION

0.99+

40%QUANTITY

0.99+

oneQUANTITY

0.99+

COVID-19OTHER

0.99+

300 gigabytesQUANTITY

0.99+

Santa Rosa, CaliforniaLOCATION

0.99+

bothQUANTITY

0.99+

three dimensionsQUANTITY

0.98+

10 years agoDATE

0.98+

PandoraORGANIZATION

0.98+

one doorQUANTITY

0.98+

90sDATE

0.97+

about 71%QUANTITY

0.97+

Public Sector SummitEVENT

0.97+

OneQUANTITY

0.97+

YosemiteLOCATION

0.96+

10DATE

0.96+

BezosPERSON

0.96+

over a decadeQUANTITY

0.95+

Amazon Web Services Public Sector SummitEVENT

0.94+

about 20, 30 yearsQUANTITY

0.94+

AWS Public Sector SummitEVENT

0.93+

15DATE

0.92+

threeQUANTITY

0.92+

about 20 years agoDATE

0.92+

100 kilometersQUANTITY

0.92+

one data setQUANTITY

0.92+

hundreds of workersQUANTITY

0.91+

AWS Public Sector OnlineORGANIZATION

0.89+

Over twoQUANTITY

0.88+

both groundQUANTITY

0.85+

FirstQUANTITY

0.83+

earthLOCATION

0.82+

this winterDATE

0.8+

20 years agoDATE

0.77+

years agoDATE

0.75+

last couple of yearsDATE

0.7+

HFORGANIZATION

0.69+

couple of monthsQUANTITY

0.69+

Dr.PERSON

0.68+

yearsDATE

0.66+

AWS Public SectorORGANIZATION

0.62+

Raspberry PiCOMMERCIAL_ITEM

0.62+

theCUBEORGANIZATION

0.61+

theCUBETITLE

0.6+

thoseQUANTITY

0.6+

Raspberry PiORGANIZATION

0.56+

RadarCOMMERCIAL_ITEM

0.56+

20QUANTITY

0.55+

thingsQUANTITY

0.51+

minuteQUANTITY

0.5+