AI/ML in Kubernetes, with Maciej Szulik, Clayton Coleman, and Dawn Chen
Kubernetes Podcast from Google
Release Date: 06/25/2024
In this episode, we talk to three active leaders who have been around since the very beginning of Kubernetes. We explore how Kubernetes has changed since its inception, with a particular focus on current efforts in open source Kubernetes to support AI/ML-style workloads.
Maciej Szulik currently serves on the Kubernetes Steering Committee. He also leads the Special Interest Groups responsible for kubectl and for the workload and batch controllers. Maciej has been contributing to Kubernetes since the early days, jumping from one area to another wherever help was needed. He authored the first version of audit and helped shape its current one, and has touched multiple other places in apimachinery. He was also responsible for designing and implementing the Job and CronJob controllers. In kubectl he was responsible for the plugin mechanism and several major refactors to simplify the code. In May 2024 he joined the ranks of Production Readiness Review (PRR) approvers, helping ensure high production standards for future Kubernetes releases.
Clayton Coleman is a long-time Kubernetes contributor, having helped launch Kubernetes as open source, served on the bootstrap steering committee, and worked across a number of SIGs to make Kubernetes a reliable and powerful foundation for workloads. At Red Hat he led OpenShift’s pivot onto Kubernetes and its growth across on-premises, edge, and cloud. At Google he is now focused on enabling the next generation of key workloads, especially AI/ML in Kubernetes and on GKE.
Dawn Chen has been a Principal Software Engineer at Google Cloud since May 2007. Dawn has worked on Kubernetes since before the project was founded. She has been one of the tech leads in both Kubernetes and GKE, and founded SIG Node from scratch. She also led the Anthos platform team for the last 4 years, and mainly focuses on the core infrastructure. Prior to Kubernetes, she was one of the tech leads for Borg, Google's internal container infrastructure, for about 7 years. Outside of work, she is a wife, a mother of a 16-year-old boy, and a good friend. She enjoys reading, cooking, hiking, and traveling.
Do you have something cool to share? Some questions? Let us know:
- web: kubernetespodcast.com
- mail: [email protected]
- twitter: @kubernetespod
News of the week
Links from the interview
- KubeCon EU 2021: The Long, Winding and Bumpy Road to CronJob’s GA - Maciej Szulik, Red Hat & Alay Patel, Red Hat
- KubeCon EU 2018: Writing Kube Controllers for Everyone - Maciej Szulik, Red Hat (Beginner Skill Level)
- DockerCon 2014: The announcement of Kubernetes at DockerCon
- Kueue - “Kueue is a cloud-native job queueing system for batch, HPC, AI/ML, and similar applications in a Kubernetes cluster.”
- Whitepaper: Large-scale cluster management at Google with Borg
Links from the post-interview chat