Intro to Kubeflow: Katib Training and Certification Recap – Aug 11, 2022

On Aug 14th we hosted the “Intro to Kubeflow: Katib Training and Certification” prep course. In this blog post we’ll recap some highlights from the class.

Congratulations to Ping-Chang Shih!

The first attendee to earn the Katib certificate at the conclusion of the course was Ping-Chang Shih. Well done!

Thanks for voting for your favorite charity!

With the unprecedented circumstances facing our global community, Arrikto is looking for even more ways to contribute. With this in mind, we thought that in lieu of swag we could give course attendees the opportunity to vote for their favorite charity and help guide our monthly donation to charitable causes. The charity that won this workshop’s voting was the National Pediatric Cancer Foundation. NPCF is a nonprofit organization dedicated to funding research to eliminate childhood cancer.. We are pleased to be making a donation of $100 to them on behalf of the Kubeflow community. Again, thanks to all of you who attended and voted!

What topics were covered in the course?

This initial course aimed to get data scientists and DevOps engineers with little or no experience familiar with the fundamentals of how Kubeflow works.

  • Kubeflow Fundamentals Review
  • Katib Basics
  • Getting Started with Katib
  • Navigating the Katib UI
  • Running Katib Experiments
  • Kale Basics
  • AutoML Example: Kaggle’s OpenVaccine Notebook
  • AutoML Example: Kaggle Bluebook for Bulldozers Notebook
  • Course Review

What did I miss?

Here’s a short teaser from the 90 minute training. In this video we show you how to use Katib and Kale inside of Kubeflow to perform hyperparameter tuning on Kaggle’s OpenVaccine notebook.

Missed the Aug 11 Kubeflow training?

If you were unable to join us last week, you can sign up for upcoming Fundamentals, Notebooks, Pipelines and Kale/Katib courses here.

NEW: Advanced Kubeflow, Notebooks and Pipelines Workshops

We are excited to announce a new series of FREE workshops focused on taking popular Kaggle and Udacity machine learning examples from “Notebook to Pipeline.” Registration is now open for the following workshops:

Aug 17:  The Kaggle Blue Book for Bulldozers Machine Learning Example
Aug 18: Introduction to Kubeflow: Kubeflow Fundamentals
Aug 24: Developing Kubeflow Pipelines: Kaggle’s Facial Keypoints Detection Competition
Aug 25: Introduction to Kubeflow: Jupyter Notebooks Fundamentals

Q&A from the training

Below is a summary of some of the questions that popped into the Q&A box during the course. [Edited for readability and brevity.]

Are there any connectors available to export experiments/validation results to Elastic Serach or Snowflake from Kubeflow?

There is  no “Export to” button in Katib. But, if you are adventuresome (we are talking about open source here!) you can get to the raw data in MySQL (where Katib metrics are stored) or whatever your specified in the metricsCollectorSpec (typically StdOut).

Are there any requirements for the Docker image used for Katib jobs?

Yes, you learn more under “Running an Experiment” in the official Kubeflow documentation.

Free Technical Workshop

Turbocharge your team’s Kubeflow and MLOps skills with a free workshop.