Episode 5: Data Lakes for Data Science

Data Science In Production

Release Date: 03/27/2019

Episode 6: The Global AI Bootcamp with Henk Boelman

This episode was recorded at the Intelligent Cloud Conference in Copenhagen. It features an interview with Henk Boelman, Microsoft AI MVP and Cloud AI Architect for Heroes in the Netherlands.

Episode 5: Data Lakes for Data Science

In this episode we explore what a data lake is and how to build one that enables data science teams to deliver models faster. We need somewhere to store and access data that is indexed, searchable and always available. A good data lake will do this and more. Listen to find out how you can build a data lake to accelerate your team's machine learning.

Episode 4: MLFlow with Matei Zaharia

I caught up with Matei Zaharia, creator of Apache Spark and Databricks founder, at this year's Big Data London. We discussed the release of MLFlow, Databricks and project Dawn.

Episode 3: Version control for Data Science

In this episode I talk about why all your data science projects should be in version control (source control). I discuss why it is important, how to get started, the gotchas, and how to version control data science projects.

Episode 2: Deploying Deep Learning models with TimTem

In this episode, I am joined by #TimTem, AKA Dr Tim Scarfe and Dr Tempest van Schaik. Tempest and Tim are Machine Learning Engineers at Microsoft. This podcast focuses on a project they delivered for Confused.com.

Episode 1: Setting the scene

This episode of Data Science in Production focuses on the problems in data science and the rise of the Machine Learning Engineer.

In this episode we explore what a data lake is and how to build one that enables data science teams to deliver models faster. We need somewhere to store and access data that is indexed, searchable and always available. A good data lake will do this and more. Listen to find out how you can build a data lake to accelerate your team's machine learning.