Ahmad Haider, AGCO | AWS Summit New York 2019
>> Narrator: Live from New York, it's The Cube. Covering AWS Global Summit 2019. Brought to you by Amazon Web Services. >> Welcome back, I'm Stu Miniman and with my co-host Corey Quinn and we're here at AWS New York City Summit. Always happy when we have users on the program to tell their story, and joining us for the first time, Ahmad Haider who's the Lead Enterprise Data Science Architecter at AGCO, an agricultural company based down in Georgia. Ahmed thanks so much for joining us. >> Thank you for having me. >> All right so, agriculture obviously y'know we understand in general, y'know the joke I have for most people is well luckily, your industry isn't going through much change (laughter) and of course yeah, that's the response we get in most but y'know give us the thumbnail, AGCO, how long's the company been around? The focus and y'know right, some of those changes that you're seeing in the industry. >> Sure, so let me start just about AGCO, so AGCO is about a 9.4 million dollar agricultural equipment manufacturer, it's been around for 20 plus years and we are well known in the industry so some of our famous brands like Valtra, Fendt, Massey. Coming back to your other question, we are not going through a lot of change, I get that very often and you know what, it was an eyeopener when I joined AGCO. So the farming industry is actually going through a lot of change, you must have heard of Agrotech and so the farmers now, they want better efficient solutions that will help them manage their farms while they focus on the core work of farming and they are looking at companies which manufacture agricultural equipment to help provide that digital support, help provide solutions that help manage a farm better, help them to provide the maintenance better, help them optimize the equipment and so on and that's where we are trying to help them out. >> Yep, so it's always easy to look at any industry and they're like oh they have it easy and it's not changing that much. You've got data science in your title, talk a little bit about your role inside the company, y'know we know how important data is to most companies but of course with a data scientist, it's your job to help unlock that power. >> Yeah, definitely. Let me give you a little bit of background and that will help frame this much better so AGCO realized the part of data a while ago but very recently they started working on this so something called a digital experience, digital customer experience program. What that does is basically it creates you a set of connected solutions that manage the data of our customers, our dealers, our part and machine data in a fast, reliable and secure manner and all these digital solutions that we are creating, they are powered through analytics to leverage new market insights, to unlock new opportunities, to help understand our customers better. So given that particular space, I help design the AGCO's data science vision, that involves, first of all, setting up a data science platform that enables us to maximize the user data that we have. Secondly, working with our business to identify analytics use-cases which could be a part of the product roadmap and build them out and then execute this on the data science platform and thirdly, from the point of view of architecture, understanding what things go in the design, making sure everything's state of the art, help the design document and making sure that we are staying right at the top in terms of agriculture, in terms of data science and pushing at the boundaries in all their products. >> What are the, I guess, hidden secrets of data science across the board as the sheer amount of time and effort that has to be put into data normalization before you can start getting useful information out of it, was that a significant concern given what you do? Or given the fact that you more or less control the entire thing and you can reformulate the data as it's ingested? >> That was a very valid concern, I mean what most people don't talk about is the quality of data. They only talk about the data science, the fancy things, so we had the same challenges. Our data was distributed in different places, had different formats, had different levels of cleanliness so what I did was, during the building of the data science platform, I recognized this challenge proactively and made sure that we do cleanse the data, we normalize it to a format that's usable for our use-cases but we don't do it all at once, we go use-case by use-case, we identify our business priorities, we normalize the data, we cleanse it, we normalize it, bring it to a format that can be used going forward and we do it with every use-case. Over time, majority of it will be normalized but that will take an incremental, gradual of course. >> All right, Ahmad bring us into the role of cloud in your environment. >> Sure, so cloud is a very important component, so historically, we were more like an on premises organization and when we went on cloud data, it was a very important change, more so from the point of view, if you think about it, for a company to migrate or position itself, transform itself into a software organization in terms of data science, you need a lot of accelerators, you need data scientists, you need infrastructure, you need data engineers and you need people to manage all of this and all that hiring talent takes time but what cloud does is, there's the ability to procure services on demand and something which is fully managed, all services, that allows you to overcome a lot of those barriers quickly while you have time to actually build other solutions on top of the cloud. Over time when we understand our processes better, our demands better, then we can think about, okay where does it make sense to go hybrid but cloud is that great accelerator that allowed us to set up this data analytics platform which we did in roughly about fifteen weeks. Before that I was working in another organization where we did this on premises and I can tell you it took at least like three times if not more, so that I mean, I think that's the real value of cloud apart from all it's machine learning services and everything. It helped us to accelerate that process easily. How, I guess, in the workflow that you'd wind up going through how close is the data that you're generating to the cloud? Are you doing this at the edge, are you doing this in the field in some cases? I guess where is the data entering your pipeline? >> Yeah, so there are different forms of data that we have, we have a lot of data that is customer-related data that essentially is more or less slow-moving data that we have in the organization. That constitutes the major bulk of the data, apart from that, we have data that are coming from machines which are these smart machines operating in the field and data comes through the satellite and comes to our servers. We also have data that comes from the edge from some of these machinery that are operating in the factory and from there you will get data on the edge. Among all these different data sources that we have, I would say the predominant, or the initial focus, the pillar focus is to first start with the data that we have in abundance, so that's essentially the customer data, our dealer data to be able to understand that better, derive new market insights but our focus is to go forward, getting data from these machines combining that with the soil data with the farming data, with the agronomy data to deliver these very precise, things like precise planting schedules, things like predictive maintenance of machines as they operate out in the field and things like value driven care. So those are things that we are hoping to do with this as well. >> Right, you mentioned machine learning, y'know where are you along your journey kind of with the MLAI and the like? >> That's a really good question, so AGCO as a whole, I think we are at different stages at different parts of the organization so a lot of the organization is focused on generating value through descriptive analytics and explorative analytics whereby we are exploring the data and we are finding these insights and then making decisions on top of them. We are going into the area of predictive analytics fairly recently, about a year so and we essentially, that is our next step so we went into predictive analytics, we are creating machine learning models, we are creating combined stat models. We are using services like SageMaker on the cloud, we are using Spark libraries, we are using Cyclone, we are using Arc, all of that to create predictive analytics solutions. So in terms of the technology that we use right now, it's actually pretty much state of the art, we have created our own model management engines. We are using what Amazon provides and we supplement them with what we have. So we are pretty much at state of the art in terms of current what we are doing. We're hoping to take that state of the art and apply it to large parts of the organization. >> So as you look at, I guess some of the higher level differentiated services coming down in the world of machine learning, do you find that a lot of what you're doing today and in a few years is going to be something that's being handled automatically and then you're able to focus on the more interesting parts of the work? Or is there really no end in sight for I guess sort of some of the current block and tackle that a lot of data scientists are sort of struggling with today? >> I'm sorry I couldn't hear a part of your voice >> No, my apologies. Just a you see things continuing to evolve in this space, are you finding, are you predicting that there's going to be more I guess higher level services that solve some of this problem for you or is a lot of it I guess, block and tackle, not really having a relief point in sight? >> That's a very good question, I get that very often. So, I would like to say the answer, it depends but I'll describe that answer. So there are some parts of this machine learning AI that I think will be solved by newer services, by technology going forward. You can take an example, I'll give you a concrete example, SageMaker, which is fairly recent offering by Amazon about a year ago that we started using SageMaker, it didn't have a lot of competence that it currently has and we had to build a lot of the competence to get towards something called model management. Now, we built all of that but lo and behold after we went, they actually added a lot of these. So over technology, they will take care of a lot of these things which you currently do by smart automation. Now smart automation can take care of a lot of things, it helps you identify when you need to retrain a model, it helps you to deploy a model, it helps you to identify the trigger points but what analytics, I mean, where I think the challenge will come is how to actually apply it to the business because that needs a lot of context and for that you need to understand where are these perfect pinpoints, where do you actually apply it? Does it make sense to use it in a prioritization model? Does it make sense to use it as a explorative model? Does it make sense to use an attribution model? And to help define that use-case in the beginning to essentially say going from a business landscape to come to a specific problem that you want to solve, that is a part that I think will take some time and can't be readily addressed by these technologies but everything down the line, I fairly see that in a few years all of that will be available. >> All right, Ahmad are you speaking here at the conference? >> No, I actually spoke at the keynote in Atlanta. >> Okay >> And the summit >> Great, give us a little about y'know what you get out of coming to some of the regional summits here from Amazon. >> Yeah definitely, so I get a lot out of it. So, the biggest thing is I get to know what are the different things that are happening in the industry from the point of business, so not just about technologies right. Like lots of different technologies coming on but how are people using it? How does it make an impact in their business? Because for me the intersection of technology and business is the key point. So coming to a lot of these regional summits where they have these different business partners, they come in and they describe their work and connecting with them. That, for me, is the main draw, apart from that there's the other piece which is you get to know about the different things that are being done in this space. For example, if you go to AWS summit, you get to know everything that is coming to the cloud and you can try and experiment that and you can basically create like a nice ecosystem. If you go to an Azure summit, you get something similar. So that state of the art is also important but more important is the draw, that intersection. >> And I guess just one followup on that is y'know the data scientist community is y'know, what are some of your best sources of y'know learning and sharing today? >> That's a very good question, data science is one of those aspects because two parts to it. I don't know, I mean now there are machine learning engineers too, so but one part is the technical part of this, to be able to create these models with pinpoint accuracy and the second is applications. So in terms of the first part of learning about creating these models, the best sources in that case would be self-learning, I have, I went through, when I was doing this, I did my PhD, I learned a lot of stuff and then I go through a lot of articles when new things come out, you go through them, once you have the different sources, there are lots of them. The second part, right, applications, I have found the best source of learning there is actually interacting with people who use these technologies. Interacting with people, let's say who have no experience of data science, they have experience of business and then working with them to understand how can you take this insight that's created out of a model and impart into business, for that there's no other substitute than just talking to people, understanding the pinpoints and then solving those. >> All right, well Ahmad thank you so much for giving the update on AGCO and your role inside. >> Thank you >> All right, for Corey Quinn, I'm Stu Miniman, we'll be back with more coverage here from AWS' New York City Summit. Thanks as always for watching the cube. (upbeat electronic music)
SUMMARY :
Brought to you by Amazon Web Services. and with my co-host Corey Quinn and of course yeah, that's the response we get in most and so the farmers now, and it's not changing that much. and making sure that we are staying right at the top and made sure that we do cleanse the data, in your environment. more so from the point of view, if you think about it, in the factory and from there you will get data on the edge. So in terms of the technology that we use right now, Just a you see things continuing to evolve in this space, and for that you need to understand what you get out of coming to some of the regional summits and business is the key point. and the second is applications. All right, well Ahmad thank you so much I'm Stu Miniman, we'll be back with more coverage here
SENTIMENT ANALYSIS :
ENTITIES
Entity | Category | Confidence |
---|---|---|
Corey Quinn | PERSON | 0.99+ |
AGCO | ORGANIZATION | 0.99+ |
Amazon | ORGANIZATION | 0.99+ |
Georgia | LOCATION | 0.99+ |
Stu Miniman | PERSON | 0.99+ |
Ahmad | PERSON | 0.99+ |
Ahmad Haider | PERSON | 0.99+ |
Valtra | ORGANIZATION | 0.99+ |
Amazon Web Services | ORGANIZATION | 0.99+ |
Atlanta | LOCATION | 0.99+ |
Ahmed | PERSON | 0.99+ |
two parts | QUANTITY | 0.99+ |
Fendt | ORGANIZATION | 0.99+ |
one | QUANTITY | 0.99+ |
second part | QUANTITY | 0.99+ |
three times | QUANTITY | 0.99+ |
one part | QUANTITY | 0.99+ |
AWS' | ORGANIZATION | 0.99+ |
Agrotech | ORGANIZATION | 0.98+ |
second | QUANTITY | 0.98+ |
first time | QUANTITY | 0.98+ |
first | QUANTITY | 0.98+ |
Massey | ORGANIZATION | 0.98+ |
first part | QUANTITY | 0.97+ |
20 plus years | QUANTITY | 0.97+ |
about fifteen weeks | QUANTITY | 0.97+ |
today | DATE | 0.96+ |
Spark | TITLE | 0.96+ |
New York City Summit | EVENT | 0.96+ |
AWS Global Summit 2019 | EVENT | 0.96+ |
Secondly | QUANTITY | 0.95+ |
thirdly | QUANTITY | 0.95+ |
The Cube | TITLE | 0.91+ |
about a year ago | DATE | 0.9+ |
SageMaker | TITLE | 0.86+ |
AWS Summit New York 2019 | EVENT | 0.84+ |
AWS New York City Summit | EVENT | 0.82+ |
New York | LOCATION | 0.81+ |
Arc | TITLE | 0.8+ |
Narrator: Live from | TITLE | 0.78+ |
about a year | QUANTITY | 0.72+ |
about a 9.4 million dollar | QUANTITY | 0.72+ |
Cyclone | TITLE | 0.7+ |
Azure | EVENT | 0.62+ |
AWS | ORGANIZATION | 0.55+ |