In-person
November 12-15

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon North America 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: Times in this schedule are shown in Mountain Standard Time (UTC -7). The schedule is subject to change, and session seating is available on a first-come, first-served basis.
Wednesday November 13, 2024 2:30pm - 3:05pm MST
In this talk, we will lead you down the rabbit hole of AI training in Kubernetes, where idealism and reality meet. We are not here to feed you tales of digital utopia. We are here to discuss what needs to change. Can Kubernetes be taught to handle the monstrous loads of large-scale AI training? Join a diverse group of engineers, AI researchers, and Kubernetes veterans to learn what problems need to be solved for optimal AI training within Kubernetes. We will discuss solutions and challenges in this space and compare HPC systems with Kubernetes. Topics will include fine-grained resource control, scheduling, networking, and storage. If you think you can handle the unvarnished truth about Cloud Native AI, come armed with questions, war stories, and a tolerance for the absurd. Just be prepared to leave with more questions than answers and maybe, if you are lucky, a sliver of insight.
Speakers
Ricardo Rocha

Lead Platforms Infrastructure, CERN
Ricardo leads the Platform Infrastructure team at CERN with a strong focus on cloud native deployments and machine learning. For several years he has led the internal effort to transition services and workloads to cloud native technologies, as well as dissemination and training...
Frederick Kautz

Cloud Native Security Unicorn, TestifySec
Frederick has an extensive background in Cloud Native and Security. Relevant highlights include previously co-chairing KubeCon, authoring NIST SP 800-204D, and co-chairing the CTA/ANSI working group on securing AI/ML pipelines. Frederick also has an extensive history in Cloud Native...
Marlow Warnicke (Weston)

Principal Cloud Engineer, SchedMD
Marlow is a Principal Cloud Engineer working on scheduling at SchedMD. She is also a chair of the CNCF Environmental Sustainability TAG. Marlow has expertise in resource management, the AI/ML Kubernetes cloud compute ecosystem, embedded systems, high performance compute system tools...
Alex Scammon

Head of Open Source Development, G-Research
I enjoy building motivated and effective engineering organizations. Whether at a large international company or a small local startup, I care about how we create products just as much as what those products actually are.
Salt Palace | Level 2 | 253 A
