انتقل إلى وضع عدم الاتصال باستخدام تطبيق Player FM !
#116 Running AI on Kubernetes: From GPUs to CRO
Manage episode 516120390 series 3430187
In this episode of De Nederlandse Kubernetes Podcast, we talk with Carlos Santana, Principal Partner Solution Architect at AWS and long-time contributor to the Kubernetes and AI communities.
Carlos joins us to explore what it really takes to run AI workloads on Kubernetes, from GPU scheduling to scaling inference and training efficiently across clusters. We discuss how AI and machine learning are transforming the cloud-native ecosystem — and why orchestration is becoming just as important as the models themselves.
He shares insights into:
- 💡 The challenges of scheduling and sharing GPUs in multi-tenant Kubernetes clusters
- ⚙️ Why Kubernetes Resource Orchestrator (CRO) could be the next big abstraction layer
- 🚀 The balance between performance, cost efficiency, and developer experience
- 🧠 His hands-on experiments with Jetson devices, edge computing, and model optimization
- 🌐 How open source projects and cloud providers are shaping the future of AI infrastructure
A forward-looking conversation about where AI, Kubernetes, and cloud-native engineering are heading — from someone building that future at scale.
ACC ICT Specialist in IT-CONTINUÏTEIT
Bedrijfskritische applicaties én data veilig beschikbaar, onafhankelijk van derden, altijd en overal
Like and subscribe! It helps out a lot.
You can also find us on:
De Nederlandse Kubernetes Podcast - YouTube
Nederlandse Kubernetes Podcast (@k8spodcast.nl) | TikTok
De Nederlandse Kubernetes Podcast
Where can you meet us:
Events
This Podcast is powered by:
ACC ICT - IT-Continuïteit voor Bedrijfskritische Applicaties | ACC ICT
115 حلقات
Manage episode 516120390 series 3430187
In this episode of De Nederlandse Kubernetes Podcast, we talk with Carlos Santana, Principal Partner Solution Architect at AWS and long-time contributor to the Kubernetes and AI communities.
Carlos joins us to explore what it really takes to run AI workloads on Kubernetes, from GPU scheduling to scaling inference and training efficiently across clusters. We discuss how AI and machine learning are transforming the cloud-native ecosystem — and why orchestration is becoming just as important as the models themselves.
He shares insights into:
- 💡 The challenges of scheduling and sharing GPUs in multi-tenant Kubernetes clusters
- ⚙️ Why Kubernetes Resource Orchestrator (CRO) could be the next big abstraction layer
- 🚀 The balance between performance, cost efficiency, and developer experience
- 🧠 His hands-on experiments with Jetson devices, edge computing, and model optimization
- 🌐 How open source projects and cloud providers are shaping the future of AI infrastructure
A forward-looking conversation about where AI, Kubernetes, and cloud-native engineering are heading — from someone building that future at scale.
ACC ICT Specialist in IT-CONTINUÏTEIT
Bedrijfskritische applicaties én data veilig beschikbaar, onafhankelijk van derden, altijd en overal
Like and subscribe! It helps out a lot.
You can also find us on:
De Nederlandse Kubernetes Podcast - YouTube
Nederlandse Kubernetes Podcast (@k8spodcast.nl) | TikTok
De Nederlandse Kubernetes Podcast
Where can you meet us:
Events
This Podcast is powered by:
ACC ICT - IT-Continuïteit voor Bedrijfskritische Applicaties | ACC ICT
115 حلقات
ทุกตอน
×مرحبًا بك في مشغل أف ام!
يقوم برنامج مشغل أف أم بمسح الويب للحصول على بودكاست عالية الجودة لتستمتع بها الآن. إنه أفضل تطبيق بودكاست ويعمل على أجهزة اندرويد والأيفون والويب. قم بالتسجيل لمزامنة الاشتراكات عبر الأجهزة.