We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode Deploy and fine-tune LLM models on Kubernetes using KAITO

Deploy and fine-tune LLM models on Kubernetes using KAITO

2024/8/7
logo of podcast Kubernetes Bytes

Kubernetes Bytes

AI Chapters
Chapters

Shownotes Transcript

In this episode of the Kubernetes Bytes podcast, Bhavin sits down with  Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.  

Check out our website at https://kubernetesbytes.com/  

Cloud Native News:  

Show links: 

Timestamps: 

  • 00:02:15 Cloud Native News 
  • 00:05:34 Interview with Sachi and Paul 
  • 00:42:08 Key takeaways