#deepreinforcementlearning

Hacker Newsh4ckernews
2025-02-27

RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning (2023) — kzakka.com/robopianist/#demo
#2023

Victor Paléologuepalaio@fediscience.org
2024-09-19

Autonomy Talks - Georgia Chalvatzaki: Shaping #Robotic Assistance through Structured #Robot #Learning: youtube.com/watch?v=e0aQC3C8P7 #robotics #machinelearning

Around 12:30 they present the training of a model-free #MDP #deepreinforcementlearning using a model-based #ai #planner #aiplanner. Indeed it drastically boosts the training.

The general idea is to guide an implicit model using a model-based approximation, and it works also for assembly tasks, computer vision, pick and place…

Annual Computer Security Applications ConferenceACSAC_Conf@infosec.exchange
2024-06-20

Last but not least, came Tekgul & Asokan's "FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks" which is robust to model modification attacks. (acsac.org/2023/program/final/s) 4/4
#MachineLearningSecurity #DeepReinforcementLearning #SecurityInAI

Tekgul & Asokan's "FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks"
Scripter :verified_flashing:scripter@social.tchncs.de
2024-02-10

Mini-Quadkopter lernt Fliegen in Sekunden | heise online
heise.de/-9623443 #DeepReinforcementLearning #ReinforcementLearning #RL

Victor Paléologuepalaio@fediscience.org
2023-09-05

Will the next generation of #LLM come from #DeepMind?
wired.com/story/google-deepmin
They may have a shot at it given their expertise in #DeepReinforcementLearning. If their #AI can plan tasks with solid logical grounds, can't they also produce solid explanations?

Tero Keski-Valkamatero@rukii.net
2023-06-07

Faster sorting #algorithms discovered using #DeepReinforcementLearning | #Nature

"Here we show how #ArtificialIntelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep #ReinforcementLearning agent, #AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the #LLVM standard C++ sort library3. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."

nature.com/articles/s41586-023

Zahra RezazadehZahra@fediscience.org
2022-11-23

#introducton 
Here to talk #neuroscience mostly about human #decisionmaking and #learning in #ReinforcementLearning environments. 
I'm doing my #PhD at LMU #Munich, working with #EyeTracking and #EEG data, using #Python and #MATLAB, and am interested in #DeepReinforcementLearning, #senseofagency and #socialagency ! 
On the side, I play #chess and hike #Alpine !
Hello #Fediverse :)

heise online (inoffiziell)heiseonline@squeet.me
2022-11-07
Für Roboter ist das Bewegen in Menschenmengen eine Herausforderung. Aber er kann das Verhalten, der Menschen analysieren, um kollisionsfrei durchzukommen.
Navigieren in überfüllten Räumen: Roboter nutzen Menschen als "Sensoren"

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst