#ModelServing

AI Daily Postaidailypost
2026-02-06

New research shows a tuned recommendation engine can boost clickโ€‘through rates by 10% while cutting inference cost. The paper dives into modelโ€‘serving tricks, optimization for large language models, and deployment efficiency for production AI. Openโ€‘source practitioners will love the practical benchmarks.

๐Ÿ”— aidailypost.com/news/recommend

Q*Satoshi (@AiXsatoshi)

14GB/s๊ธ‰ SSD๋กœ ์—…๊ทธ๋ ˆ์ด๋“œํ–ˆ์ง€๋งŒ ๋งค๋ฒˆ ์ˆ˜๋ฐฑ GB์— ๋‹ฌํ•˜๋Š” ๋ชจ๋ธ ๋กœ๋“œ์— ์‹œ๊ฐ„์ด ๊ฑธ๋ ค ์„ฑ๋Šฅ ๋ณ‘๋ชฉ์ด ๋ฐœ์ƒ. ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด RAID0 ๊ฐ™์€ ์Šคํ† ๋ฆฌ์ง€ ๊ตฌ์„ฑ ๋ณ€๊ฒฝ์„ ๊ณ ๋ ค ์ค‘์ด๋ผ๋Š” ์‹ค๋ฌด์  ํ•˜๋“œ์›จ์–ดยท๋”ฅ๋Ÿฌ๋‹ ๊ฐœ๋ฐœ ํ™˜๊ฒฝ ๊ด€๋ จ ๊ณ ๋ฏผ์„ ๊ณต์œ ํ•œ ํŠธ์œ—์ด๋‹ค.

x.com/AiXsatoshi/status/201784

#ssd #storage #modelserving #hardware

AI Daily Postaidailypost
2025-11-25

Googleโ€™s new Ironwood TPU is purposeโ€‘built for inference, delivering ultraโ€‘low latency and highโ€‘volume model serving with a novel interโ€‘chip interconnect. As the industry pivots to edge AI, this hardware could reshape how we deploy models. Dive into the specs and why it matters for openโ€‘source AI projects.

๐Ÿ”— aidailypost.com/news/ironwood-

Yuan Tang :redhat:terrytangyuan@fosstodon.org
2025-09-29

๐Ÿ™Œ Huge thanks to everyone who contributed to this journey from writing code, reviewing docs, to supporting governance and community growth.

Stay tuned! Weโ€™ll be publishing a detailed announcement blog soon with more insights on what this means for users, contributors, and the future of model serving on Kubernetes.

For now: thank you to the community for making this possible. ๐Ÿ’™

#KServe #CNCF #OpenSource #ModelServing #AI #MLOps #CloudNative #Kubeflow #Kubernetes #k8s Kubeflow

This is a big step for the KServe community, and weโ€™re excited about the road ahead in making cloud-native model serving more accessible and production-ready for everyone. #KServe #CNCF #OpenSource #ModelServing #AI #MLOps #CloudNative @cncf.io @kubernetes.io @kubefloworg.bsky.social

Yuan Tang :redhat:terrytangyuan@fosstodon.org
2025-09-09

A huge thank you to Kevin Wang and Faseela K from the CNCF TOC for all the hard work. Itโ€™s been such a pleasure collaborating with you both on this milestone. Thank you to all the community members who have contributed!

This is a big step for the KServe community, and weโ€™re excited about the road ahead in making cloud-native model serving more accessible and production-ready for everyone.

#KServe #CNCF #OpenSource #ModelServing #AI #MLOps #CloudNative CNCF Kubernetes Kubeflow

Big thanks to everyone contributing code, reviews, and ideas โ€” this integration is shaping up to be a game-changer for ๐—ž๐˜‚๐—ฏ๐—ฒ๐—ฟ๐—ป๐—ฒ๐˜๐—ฒ๐˜€-๐—ป๐—ฎ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—Ÿ๐—Ÿ๐—  ๐˜€๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ป๐—ด. Stay tuned for next release! #KServe #llmd #GenerativeAI #MLOps #Kubernetes #ModelServing #AIInfrastructure

Yuan Tang :redhat:terrytangyuan@fosstodon.org
2025-08-11

Big thanks to everyone contributing code, reviews, and ideas โ€” this integration is shaping up to be a game-changer for ๐—ž๐˜‚๐—ฏ๐—ฒ๐—ฟ๐—ป๐—ฒ๐˜๐—ฒ๐˜€-๐—ป๐—ฎ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—Ÿ๐—Ÿ๐—  ๐˜€๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ป๐—ด. Stay tuned for next release!

#KServe #llmd #GenerativeAI #MLOps #Kubernetes #ModelServing #AIInfrastructure

๐ŸŽ„ Happy Holidays! KServe v0.12 release candidate is available! Try it out! https://github.com/kserve/kserve/releases/tag/v0.12.0-rc0 #KServe #kubernetes #MLOps #DevOps #CloudNative #Kubeflow #ModelServing #AI #MachineLearning @KnativeProject @LFAIDataFdn @CloudNativeFdn

Release v0.12.0-rc0 ยท kserve/k...

๐ŸŽ„ Happy Holidays! KServe v0.12 release candidate is available! Try it out! https://github.com/kserve/kserve/releases/tag/v0.12.0-rc0 #KServe #kubernetes #MLOps #DevOps #CloudNative #Kubeflow #ModelServing #AI #MachineLearning @KnativeProject @CloudNativeFdn

Release v0.12.0-rc0 ยท kserve/k...

๐Ÿ”” New chapters on model serving and workflow patterns of Distributed Machine Learning Patterns are now available! ๐Ÿ‘‰ http://bit.ly/2RKv8Zo #MachineLearning #Kubernetes #DistributedSystems #CloudComputing #DeepLearning #DataScience #DevOps #MLOps #CloudNative #ModelServing

Distributed Machine Learning P...

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst