Skills for ML Engineering

Posted on

1. Getting ready

  1. Do a lot of leetcode problems to understand data structures and algorithms
  2. Get exposure to software design patterns (e.g. event driven etc.)
  3. Get solid CS fundamentals by reading text books (e.g. Operating Systems, Compilers, Distributed and cloud computing)
  4. Get solid ML engineering fundamentals or or—ud188.

2. Learn from Google

Google is an industry leader in ML and surrounding best practices. What they have to say about the topic is useful.

  1. Do the crash course -
  2. Read the People+AI guidebook -
  3. Read Rules of ML -
  4. Read the other guides -
  5. Try the TensorFlow quickstart in Google Colab -
  6. Read the TFX Guide and maybe try it out -

3. Write a pipeline

  1. If you have access to an AWS account, run a complete SageMaker pipeline and go around your AWS console seeing and inspecting all the resources it created
  2. If you have access to a Google Cloud account, run a Vertex AI pipeline
  3. If you have Azure, then
  4. If you don’t have access to any account already, poke around the internet if you can figure out if any of them offer things in free tier (it’s also a skill to see if a vendor provides something in free tier and use that I guess ;)). If not, run a Kubeflow pipeline, or build an Airflow pipeline

4. Research & explore the web

Just surf the internet, maybe start with and check out what others are up to, especially companies like Uber, AirBnB, Lyft, Shopify, Netflix, etc.

FBLearner flow, particularly, a post from Facebook all the way back from 2016 is still as awesome as ever and is a brilliant design pattern introduction for doing ML at a big company

5. Andrew Ng

Do the Andrew Ng course at Just do it.

6. Talks and conferences

Develop a method to search the Internet and find people speaking about the topic. A lot of people speak about ML these days in different forums, events and conferences.

  1. Stanford MLSys seminars
  2. DeepLearning AI channel
  3. AWS re:Invent 2021 keynote
  4. A podcast I found on InfoQ’s channel
  5. The awesome-mlops index again
  6. My own talk at AWS re:Invent from 2019

Discuss this post on HN