Alibaba's New ZeroSearch Framework Slashes Training Costs For Search-Enabled AI by 88%
#AI #GenAI #ZeroSearch #AlibabaAI #AITraining #LLMs #ReinforcementLearning #AICostReduction #MachineLearning #OpenSourceAI
#Promptengineering is crucial for developing #LLM-based apps, but it's often manual & inefficient. PRewrite is an automated method using an LLM trained with #reinforcementlearning to optimize prompts https://arxiv.org/pdf/2401.08189 #RL #AI
Sutton and Barto Book Implementation
https://github.com/ivanbelenky/RL
#HackerNews #SuttonBarto #Book #Implementation #RL #GitHub #ReinforcementLearning
🎨🤖ART: Now you can train your "LLM agents" with Open-source Reinforcement Learning, because training them closed-source would just be too mainstream. Because what the world really needed was more #GitHub #buzzwords and fewer actual results. 🚀💻
https://github.com/OpenPipe/ART #ART #LLM #agents #OpenSource #ReinforcementLearning #HackerNews #ngated
ART – a new open-source RL framework for training agents
https://github.com/OpenPipe/ART
#HackerNews #OpenSource #RLFramework #TrainingAgents #ReinforcementLearning #ART
Training AI to Persuade? - Jeremie & Edouard Harris on JRE
#reinforcementlearning #ai #aipersuasion #aimodels #openai #aiagents
Building Agents That Orchestrate Production + Inventory Flow
https://raiswarms.com/building-agents-that-orchestrate-production-inventory-flow/
#DevRAI #IndustryRAI #AI #AutonomousFactories #ConstraintSatisfaction #Industry50 #InventoryOptimization #MultiagentSystems #ProductionScheduling #ReinforcementLearning
[AGI discussion, DeepMind] Welcome to the Era of Experience
https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf
https://old.reddit.com/r/MachineLearning/comments/1k4zr1i/r_deepmind_welcome_to_the_era_of_experience
* threshold of new era in AI that promises unprecedented level of ability
* new generation of agents will acquire superhuman capabilities, learning predominantly from experience
* paradigm shift, accompanied by algorithmic advancements in RL, will unlock new supra-human capabilities
#Google #DeepMind #AI #AGI #RL #ReinforcementLearning #ML #LLM #AIdiscussion
📣 #DLRL Summer School 2025 – Apply now! From 21 July to 1 August in Edmonton, #Canada.
🔬 Focus: #MachineLearning, #DeepLearning & #ReinforcementLearning
🇨🇭 Highlight: Thanks to the SNSF-#CIFAR partnership, costs will be covered for participants from Switzerland!
Apply here 👉 https://dlrl.ca/apply/
⁉️#DLRL Summer School 2025 – Apply now! From 21 July to 1 August in Edmonton, #Canada.
🔬 Focus on: #MachineLearning, #DeepLearning and #ReinforcementLearning
🇨🇭 Benefit: costs are covered (CIFAR partnership) for participants from Switzerland!
Apply via 👉 https://dlrl.ca/fr/postulez/
🚨#DLRL Summer School 2025 – Apply now! From 21 July to 1 August in Edmonton, Canada.
🔬 Focus: #MachineLearning, #DeepLearning & #ReinforcementLearning
🇨🇭 Benefit: costs are covered for participants from Switzerland (CIFAR partnership)!
👉 Apply now! ➡️ https://dlrl.ca/apply/
📄 Our latest article, "MELGYM: A dynamic control interface for MELCOR simulations", has been published in the journal SoftwareX.
🔗 https://www.sciencedirect.com/science/article/pii/S2352711025001153
We present MELGYM, a Python interface that enables interactive control of simulations run with MELCOR, a code widely used for safety analysis in nuclear facilities such as IFMIF-DONES.
#reinforcementlearning #ai #nuclear #control #opensource #gymnasium #paper
Oh, look! 🎉 Another groundbreaking study in which #academia leans on #buzzwords like "reinforcement learning" to suggest that someday, maybe, #AI will conquer more than just calculus and compiling code. 🤖 It's like a toddler boasting about mastering finger painting and claiming they’ll soon create the next Mona Lisa. 🖼️
https://arxiv.org/abs/2503.23829 #ReinforcementLearning #GroundbreakingStudy #TechTrends #HackerNews #ngated
Can reinforcement learning for LLMs scale beyond math and coding tasks? Probably
https://arxiv.org/abs/2503.23829
#HackerNews #reinforcementlearning #LLMs #scaling #math #codingtasks #AIresearch
@lianna Well, for most #AIs and #robots in fiction, I think their inputs are mostly or fully sensory-based, and they learn in real time through #ReinforcementLearning-esque techniques. AIs like LLMs are frozen in place (they never update and are just replaced over time), and they have no meaningful interaction with the real world, nor anything like reflection.
I'd think that robots like #Sophia from a few years ago would be closer to the former than the latter, but #AIBros love conflating the two.
Happy birthday to Cognitive Design for Artificial Minds (https://lnkd.in/gZtzwDn3) that was released 4 years ago!
Since then, its ideas have been presented and discussed widely in the research fields of AI, Cognitive Science, and Robotics, and nowadays both the possibilities and the limitations of #LLMs, #GenerativeAI and #ReinforcementLearning (already envisioned and discussed in the book) have become a common research topic in the AI community and beyond.
Similarly, the evaluation of current AI systems in human-like and human-level terms has become a critical theme, related to the problem of anthropomorphic interpretation of AI output (see e.g. https://lnkd.in/dVi9Qf_k ).
Book reviews have been published on ACM Computing Reviews (2021) https://lnkd.in/dWQpJdkV and on Argumenta (2023): https://lnkd.in/derH3VKN
I have been invited to present the content of the book at over 20 official scientific events, including international conferences and Ph.D. schools, in the US, China, Japan, Finland, Germany, Sweden, France, Brazil, Poland, Austria and, of course, Italy.
Some news I am happy to share: Routledge/Taylor & Francis contacted me a few weeks ago about a second edition! Stay tuned!
The #book is available in many webstores:
- Routledge: https://lnkd.in/dPrC26p
- Taylor & Francis: https://lnkd.in/dprVF2w
- Amazon: https://lnkd.in/dC8rEzPi
@academicchatter @cognition
#AI #minimalcognitivegrid #CognitiveAI #cognitivescience #cognitivesystems
Implemented 18 RL Algorithms in a Simpler Way
https://github.com/FareedKhan-dev/all-rl-algorithms
Discussions: https://discu.eu/q/https://github.com/FareedKhan-dev/all-rl-algorithms
Anthropic Unveils Interpretability Framework To Make Claude’s AI Reasoning More Transparent
#AI #Anthropic #ClaudeAI #AIInterpretability #ResponsibleAI #AITransparency #MachineLearning #AIResearch #AIAlignment #AIEthics #ReinforcementLearning #AISafety
🧠✨ Dive into the abyss of reinforcement learning, where reading time feels like it needs its own reinforcement! 🙄 Thirty-one minutes to navigate a maze of jargon-laden concepts that could make even a robot short-circuit. 🤖💥 Enjoy the mental gymnastics while pondering if Markov Decision Processes are just an elaborate prank on readers. 🌀📚
https://lilianweng.github.io/posts/2018-02-19-rl-overview/ #reinforcementlearning #MarkovDecisionProcesses #technology #mentalgymnastics #AI #HackerNews #ngated
A (Long) Peek into Reinforcement Learning
https://lilianweng.github.io/posts/2018-02-19-rl-overview/
#HackerNews #ReinforcementLearning #AI #DeepLearning #MachineLearning #TechTrends