Lmst

🚀 Why pay more for cloud AI when smarter AI fits in your watch?
Discover how Small Language Models are quietly outperforming LLMs —
• 8X faster
• 90% cheaper
• 100% offline 🤯

From Tesla to smart clinics, this is the AI story no one's telling — yet.
Read the full piece 👇
🔗 https://medium.com/@rogt.x1997/8x-faster-90-cheaper-how-tiny-ai-models-are-beating-gpt-4-at-the-edge-1ae79fe3eb64

#EdgeAI #SLMs #TinyML #FutureReady
https://medium.com/@rogt.x1997/8x-faster-90-cheaper-how-tiny-ai-models-are-beating-gpt-4-at-the-edge-1ae79fe3eb64

"Ai2 tested DataDecide across a wide range of datasets and model sizes, using 10 benchmarks to evaluate how well small models predict large-scale performance. The findings aren’t earth-shattering, but they present useful takeaways for AI developers and researchers.

For one, Ai2 found that small models (around 150 million parameters) can predict large-scale outcomes with surprising accuracy. Some benchmarks reached over 80% decision accuracy using just 0.01% of the compute compared to billion-parameter models.

Since small-model experiments use less compute than other methods, developers don’t need to run full-scale tests just to predict outcomes. “The promise of this work is lower compute costs during training,” said Pijanowski.

Ai2 found that scaling laws didn’t outperform the simpler method of ranking datasets by small-model results. Scaling laws, a more sophisticated and more costly testing method, aim to predict how accuracy improves with model size. For now, “just stick with ablating things at one scale,” advised Magnusson.

The findings should give LLM devs pause for thought, Hunt said: “There are scaling laws that have been derived from empirical studies between data volume, compute resources and performance. Ai2’s research points out that we may want to revisit some of those assumptions.”"

https://thenewstack.io/new-tools-help-llm-developers-choose-better-pre-training-data/

#AI #GenerativeAI #LLMs #AITraining #SLMs

Classifying aviation-related posts on Hacker News with SLMs

https://www.skysight.inc/blog/hacker-news-aviation

#HackerNews #Classifying #aviation-related #posts #on #Hacker #News #with #SLMs #aviation #machinelearning #hackernews #SLMs #technology

Small Language Models Are the New Rage, Researchers Say

Larger models can pull off a wider variety of feats, but the reduced footprint of smaller models makes them attractive tools.

https://www.wired.com/story/why-researchers-are-turning-to-small-language-models/

#SLMs #LLMs #AI

Are you passionate about the latest in #AI? Here's your chance to shine!

✍️ Join the #InfoQ Annual Article Writing Competition!

🏆 Win a #FreeTicket to #QCon or #InfoQDevSummit!

🔗 Submit by March 30, 2025: https://bit.ly/417KPtk

Which AI topic are you most excited to explore?

Explore topics like #LLMs, #SLMs, #vLLMs, #GenAI, #VectorDatabases, #ExplainableAI, #RAG, and more!

Are you passionate about the latest in #AI? Here's your chance to shine!

✍️ Join the #InfoQ Annual Article Writing Competition!

🏆 Win a #FreeTicket to #QCon or #InfoQDevSummit!

🔗 Submit by March 30, 2025: bit.ly/4gKC51N

Which AI topic are you most excited to explore?

Explore topics like #LLMs, #SLMs, #vLLMs, #GenAI, #VectorDatabases, #ExplainableAI, #RAG, and more!

Mistral Launches Small 3.1 Language Models, Taking On Gemma 3, GPT-4o Mini, and Claude 3.5 Haiku

#AI #Mistral #AIModels #LLMs #MistralAI #GPT4oMini #SLMs #GenAI

https://winbuzzer.com/2025/03/18/mistral-launches-small-3-1-language-models-taking-on-gemma-3-gpt-4o-mini-and-gemma-3-models-xcxwbn/

The Big Power of Small #AI in 2025

An IBM VP explains how small language models are a boon for companies of all sizes, enabling them to overcome resource and budget constraints to tap into the business value of AI.

https://www.techrepublic.com/article/ibm-small-ai/

#SLMs

MIT names 10 breakthrough technologies to watch in 2025

https://bgr.com/tech/mit-names-10-breakthrough-technologies-to-watch-in-2025/

#AI #SLMs #Innovation

This #InfoQ #eMag brings together our most popular InfoQ Trends Reports from 2024, offering a deep dive into:
💡 Cell-based architectures
💡 Socio-technical systems
💡 Large and small language models (LLMs & SLMs)
💡 State-of-the-art innovations in the Java ecosystem

Whether you're a developer, architect, technology leader, or simply a tech enthusiast, these reports provide actionable insights and valuable perspectives to help you:
🚀 Plan your future roadmaps
🚀 Explore emerging technologies & practices

🔗 Download it for free: https://bit.ly/3PEiyoG

#TrendsReport #SoftwareTrends #FreeDownload

#SoftwareArchitecture #SoftwareDevelopment #LLMs #SLMs #Java

"To prevent AI models from memorizing their input, we know exactly one robust method: differential privacy (DP). But crucially, DP requires you to precisely define what you want to protect. For example, to protect individual people, you must know which piece of data comes from which person in your dataset. If you have a dataset with identifiers, that's easy. If you want to use a humongous pile of data crawled from the open Web, that's not just hard: that's fundamentally impossible.

In practice, this means that for massive AI models, you can't really protect the massive pile of training data. This probably doesn't matter to you: chances are, you can't afford to train one from scratch anyway. But you may want to use sensitive data to fine-tune them, so they can perform better on some task. There, you may be able to use DP to mitigate the memorization risks on your sensitive data.

This still requires you to be OK with the inherent risk of the off-the-shelf LLMs, whose privacy and compliance story boils down to "everyone else is doing it, so it's probably fine?".

To avoid this last problem, and get robust protection, and probably get better results… Why not train a reasonably-sized model entirely on data that you fully understand instead?"

https://desfontain.es/blog/privacy-in-ai.html

#AI #GenerativeAI #LLMs #SLMs #Privacy #DifferentialPrivacy #Memorization

In this edition of #tech on ice, I talk about #github models and how great it is to kick the tires on different #llms
#LLMs #SLMs #ai #developer #programming #coldplunge #technology

https://www.tiktok.com/@isaacrlevin/video/7460196289192398111

In this edition of #tech on ice, I discuss the difference between #LLMs vs #SLMs in the #ai space.
#developer #programming #coldplunge #gpt #LLM

https://www.tiktok.com/@isaacrlevin/video/7459455066496453919

Microsoft's new rStar-Math framework sets new standards in mathematical reasoning benchmarks using Small Language Models #AI #Math #SLMs #AIResearch #Microsoft #rStarMath #AIBenchmarks #AIFrameworks #COT #CHainOfThought

https://winbuzzer.com/2025/01/10/microsofts-rstar-math-framework-lets-small-ai-models-outperform-openais-o1-series-xcxwbn/

Small language models: 10 Breakthrough Technologies 2025

Large language models unleashed the power of AI. Now it’s time for more efficient AIs to take over.

https://www.technologyreview.com/2025/01/03/1108800/small-language-models-ai-breakthrough-technologies-2025/

#LLMs #SLMs #AI

"Rather than building massive, complex large language models, many organizations choose smaller language models that focus on niche applications such as supply chain management or inventory control. This is the “lean AI” concept, and it entails purpose-built models able to deliver value without the high costs and complexity associated with larger systems, according to Linthicum.

“We have Agentic AI and certainly using things like small language models where we’re leveraging generative AI and AI in general for more tactical implementation,” he said. “It’s dealing with supply chain integration, dealing with inventory control. We’re not building LLMs, and I don’t think the businesses out there are going to get the value from building huge LLMs that they think they’re going to get.”"

https://siliconangle.com/2024/12/31/gen-ai-hype-analyzing-tactical-cio-approach-cubeconversations/

#AI #GenerativeAI #LLMs #SLMs #LeanAI #CIOs #AgenticAI

🔍 מחפשים לשדרג את העסק שלכם עם בינה מלאכותית חסכונית ויעילה? מודלים קטנים (SLMs) הם הפתרון המושלם! הם חוסכים בעלויות, משפרים ביצועים ושומרים על הדאטה שלכם – כל זה עם דרישות מחשוב צנועות.
קראו את המאמר המלא:
#בינה_מלאכותית #SLMs #AI #חדשנות

https://edulabs.co.il/he/blog/post-20241218

Dives into the world of #GenerativeAI & #SmallLanguageModels - #SLMs!

"So latest trend in the language model evolution are the small language models or SLMs that offer many of the same benefits as LLMs, but they're smaller in size, they're trained using smaller data sets and they don't require a lot of computing resources." - Namee Oberst, Co-Founder of LLMWare

🎧 Listen now: https://bit.ly/4f9M5Sz

#AI #RAG #LLMs #EdgeComputing

From #LLMs to #SLMs to #SAMs, how agents are redefining #AI

https://siliconangle.com/2024/09/28/llms-slms-sams-agents-redefining-ai/

#DigitalTransformation

#LLMs #SLMs #AI #GenerativeAI #Chatbots: "With the growing attention and investment in recent AI approaches such as large language models, the narrative that the larger the AI system the more valuable, powerful and interesting it is is increasingly seen as common sense. But what is this assumption based on, and how are we measuring value, power, and performance? And what are the collateral consequences of this race to ever-increasing scale? Here, we scrutinize the current scaling trends and trade-offs across multiple axes and refute two common assumptions underlying the ‘bigger-is-better’ AI paradigm: 1) that improved performance is a product of increased scale, and 2) that all interesting problems addressed by AI require large-scale models. Rather, we argue that this approach is not only fragile scientifically, but comes with undesirable consequences. First, it is not sustainable, as its compute demands increase faster than model performance, leading to unreasonable economic requirements and a disproportionate environmental footprint. Second, it implies focusing on certain problems at the expense of others, leaving aside important applications, e.g. health, education, or the climate. Finally, it exacerbates a concentration of power, which centralizes decision-making in the hands of a few actors while threatening to disempower others in the context of shaping both AI research and its applications throughout society."

https://arxiv.org/html/2409.14160v1

#SLMs

Client Info