#KServe

Excited to share that I am speaking at KubeCon Europe in London next month! Looking forward to catching up with friends and collaborators! You can find me at the following sessions 🧵 #KubeCon #CloudNativeCon #CloudNative #Kubernetes #DevOps #MLOps #AI #K8s @kubernetes.io #KServe #Kubeflow @cncf.io

Adam :redhat: :ansible: :bash: @maxamillion@fosstodon.org
2024-12-04

Achieve better large language model inference with fewer GPUs

"we achieved approximately 55-65% of the throughput on a server config that is approximately 15% of the cost"

redhat.com/en/blog/achieve-bet

#OpenShiftAI #RedHat #OpenShift #AI #Kubernetes #vllm #kubeflow #kserve
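
A serving stack like the one in that post typically exposes an OpenAI-compatible HTTP API. A minimal client sketch, assuming a vLLM-backed KServe InferenceService at a placeholder URL (the host and model id below are illustrative, not from the blog post):

```python
# Query a vLLM-backed KServe InferenceService via its OpenAI-compatible
# completions endpoint. BASE_URL and the model id are placeholders.
import requests

BASE_URL = "http://llama-vllm.example.com"  # hypothetical service URL

resp = requests.post(
    f"{BASE_URL}/v1/completions",
    json={
        "model": "meta-llama/Llama-2-7b-hf",  # placeholder model id
        "prompt": "Summarize KServe in one sentence.",
        "max_tokens": 64,
        "temperature": 0.2,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["text"])
```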

Get ready for KubeCon next week! Below are the three talks I'll be presenting! See you there and let's catch up! #KubeCon #CloudNativeCon #CloudNative #Kubernetes #DevOps #MLOps #AI #K8s @CloudNativeFdn @kubernetesio @kubeflow #KServe

🎄 Happy Holidays! KServe v0.12 release candidate is available! Try it out! https://github.com/kserve/kserve/releases/tag/v0.12.0-rc0 #KServe #kubernetes #MLOps #DevOps #CloudNative #Kubeflow #ModelServing #AI #MachineLearning @KnativeProject @LFAIDataFdn @CloudNativeFdn

Release v0.12.0-rc0 · kserve/k...
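
One way to try the release candidate is through the KServe Python SDK. A minimal sketch, assuming kubeconfig access to a cluster with the v0.12 RC installed (the name and storage URI follow the sklearn iris example from the KServe docs):

```python
# Deploy a simple InferenceService with the KServe Python SDK.
from kubernetes import client
from kserve import (
    KServeClient,
    V1beta1InferenceService,
    V1beta1InferenceServiceSpec,
    V1beta1PredictorSpec,
    V1beta1SKLearnSpec,
    constants,
)

isvc = V1beta1InferenceService(
    api_version=constants.KSERVE_V1BETA1,
    kind=constants.KSERVE_KIND,
    metadata=client.V1ObjectMeta(name="sklearn-iris", namespace="default"),
    spec=V1beta1InferenceServiceSpec(
        predictor=V1beta1PredictorSpec(
            sklearn=V1beta1SKLearnSpec(
                storage_uri="gs://kfserving-examples/models/sklearn/1.0/model"
            )
        )
    ),
)

KServeClient().create(isvc)  # submits the InferenceService to the cluster
```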

udo m. rader ☕ 🇪🇺 🇺🇦 🐧 @riaschissl@sigmoid.social
2024-11-14

An interesting talk here at #KubeCon by @terrytangyuan from #RedHat and Adam Tetelman from #NVIDIA on the many pitfalls of using LLMs in production.

#KServe and #Knative come to the rescue of many Day 2 problems, but there's still a lot to do.

And, as Adam Tetelman said so well, this year's KubeCon could easily be called RAGCon, given how many talks there are about #RAG 😀

#KubeCon24 #AI #LLM

Image description: Adam Tetelman, wearing an NVIDIA t-shirt, glasses, and a conference lanyard, stands in front of a projection screen during a presentation. The slide shown is titled 'Inference Optimizations' and displays two categories: 'Inference Platform Features' (including Response Caching, Context Caching, and Inflight Batching) and 'Model Features' (including Multi-LoRA Loading, Just-in-time Compilation, and Ahead-of-time Compilation). The slide also includes a diagram showing sequence batching workflows. Adam is gesturing with both hands while explaining the content.
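
The 'Response Caching' item on that slide is easy to picture: identical requests can be answered from a cache instead of re-running the model, which is only safe for deterministic sampling settings. A toy illustration of the idea (not code from the talk; expensive_model_call is a stand-in for real inference):

```python
# Toy response cache for LLM inference: repeated prompts with the same
# sampling settings reuse a prior result instead of re-running the model.
from functools import lru_cache

def expensive_model_call(prompt: str, temperature: float) -> str:
    # Stand-in for a real model invocation.
    return f"generated answer for: {prompt}"

@lru_cache(maxsize=1024)
def cached_generate(prompt: str, temperature: float = 0.0) -> str:
    # Only cache deterministic requests; sampled outputs should bypass this.
    return expensive_model_call(prompt, temperature)

print(cached_generate("What is KServe?"))  # computed once
print(cached_generate("What is KServe?"))  # served from cache
```
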
Yuan Tang :redhat: @terrytangyuan@fosstodon.org
2024-11-10

Get ready for KubeCon next week! Below are the three talks I'll be presenting! See you there! github.com/terrytangyuan/publi

- Cloud Native AI Day Keynote: Advancing Cloud Native AI Innovation Through Open Collaboration, sponsored by Red Hat

- Unlocking Potential of Large Models in Production with Adam Tetelman

- WG Serving: Accelerating AI/ML Inference Workloads on Kubernetes with Eduardo Arango

#KubeCon #CloudNativeCon #CloudNative #Kubernetes #DevOps #MLOps #AI #K8s #KServe #Kubeflow

2024-05-13

Say Serve to LLM on OpenShift AI - OpenShift's Multi-GPU Marvel with KServe | by faisal shah medium.com/@fassha08/say-serve
#OpenShift #aiml #llm #genai #kserve
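
In the same spirit as that article, a hedged sketch of what a multi-GPU LLM predictor can look like with the KServe Python SDK (the image, arguments, and names are placeholders, not taken from the article):

```python
# Sketch: an InferenceService whose predictor container requests two GPUs
# and shards the model across them with tensor parallelism.
from kubernetes import client
from kserve import (
    KServeClient,
    V1beta1InferenceService,
    V1beta1InferenceServiceSpec,
    V1beta1PredictorSpec,
    constants,
)

predictor = V1beta1PredictorSpec(
    containers=[
        client.V1Container(
            name="kserve-container",
            image="vllm/vllm-openai:latest",  # placeholder serving image
            args=["--model", "/mnt/models", "--tensor-parallel-size", "2"],
            resources=client.V1ResourceRequirements(
                limits={"nvidia.com/gpu": "2"},  # two GPUs for one replica
            ),
        )
    ]
)

isvc = V1beta1InferenceService(
    api_version=constants.KSERVE_V1BETA1,
    kind=constants.KSERVE_KIND,
    metadata=client.V1ObjectMeta(name="llm-multi-gpu", namespace="default"),
    spec=V1beta1InferenceServiceSpec(predictor=predictor),
)

KServeClient().create(isvc)
```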
