This presentation was recorded at GOTO Chicago 2019. #GOTOcon #GOTOchgo http://gotochgo.com Phil Winder - Machine Learning whiz and Advocate for DataDevOps ABSTRACT The Internet is full of examples of how to train models. But the reality is that industrial projects spend the majority of the time working with data. The largest improvements in performance can often be found through improving the underlying data. Bad data is costing the US economy an estimated 3.1 trillion Dollars and approximately 27% of data is flawed in the world's top companies. Bad data also contributes to the failure of many Data Science projects. Who can forget Tay.ai, Microsoft's twitter-bot that learned to be genocidal when user's tweets were not cleaned. This presentation will discuss in what circumstances bad data can affect your project along with some high profile case studies. We will then spend as much time as we have to go through some of the techniques you will need to fix that bad data. This [...] Download slides and read the full abstract here: https://gotochgo.com/2019/sessions/732 RECOMMENDED BOOK Phil Winder • Reinforcement Learning • https://amzn.to/3t1S1VZ https://twitter.com/GOTOchgo https://www.linkedin.com/company/goto- https://www.facebook.com/GOTOConference #DataScience #MachineLearning #ML Looking for a unique learning experience? Attend the next GOTO Conference near you! Get your ticket at http://gotocon.com SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily. https://www.youtube.com/user/GotoConferences/?sub_confirmation=1
Get notified about new features and conference additions.