#DeepReinforcementLearning

2026-01-13

IA2 uses Deep Reinforcement Learning to slash database runtimes by 40%, while new Hyperbolic SVM techniques utilize semidefinite relaxation. hackernoon.com/ai-driven-datab #deepreinforcementlearning

2026-01-10

IA2 revolutionizes index selection with rapid training, reducing SQL runtime by 61% via adaptive action pruning and workload modeling hackernoon.com/adaptive-action #deepreinforcementlearning

2026-01-06

IA2 uses a two-phase framework to generate states and action pools from workloads, enabling RL agents to make sequential index selection decisions. hackernoon.com/unseen-workload #deepreinforcementlearning

2025-12-24

The TD3-TD-SWAR model advances database optimization by framing index selection as a DRL problem with adaptive action masking for faster training. hackernoon.com/adaptive-action #deepreinforcementlearning

2025-08-26

This research validates a weekly re-trained DRL agent, showing it outperforms static models & Black-Scholes for practical American option hedging. hackernoon.com/validating-hype #deepreinforcementlearning

2025-08-26

This methodology details how to train and test DRL agents for American option hedging, introducing a novel weekly re-training strategy using Chebyshev pricing. hackernoon.com/dont-just-train #deepreinforcementlearning

2025-08-26

This review of DRL hedging literature highlights the need for hyperparameter analysis, especially for real-world American option applications. hackernoon.com/avoiding-the-pi #deepreinforcementlearning

2025-08-26

This paper makes Deep Reinforcement Learning practical for hedging American options by optimizing hyperparameters and using a weekly re-training strategy. hackernoon.com/how-weekly-ai-t #deepreinforcementlearning

Hacker Newsh4ckernews
2025-02-27

RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning (2023) — kzakka.com/robopianist/#demo
#2023

Victor Paléologuepalaio@fediscience.org
2024-09-19

Autonomy Talks - Georgia Chalvatzaki: Shaping #Robotic Assistance through Structured #Robot #Learning: youtube.com/watch?v=e0aQC3C8P7 #robotics #machinelearning

Around 12:30 they present the training of a model-free #MDP #deepreinforcementlearning using a model-based #ai #planner #aiplanner. Indeed it drastically boosts the training.

The general idea is to guide an implicit model using a model-based approximation, and it works also for assembly tasks, computer vision, pick and place…

Annual Computer Security Applications ConferenceACSAC_Conf@infosec.exchange
2024-06-20

Last but not least, came Tekgul & Asokan's "FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks" which is robust to model modification attacks. (acsac.org/2023/program/final/s) 4/4
#MachineLearningSecurity #DeepReinforcementLearning #SecurityInAI

Tekgul & Asokan's "FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks"
Scripter :verified_flashing:scripter@social.tchncs.de
2024-02-10

Mini-Quadkopter lernt Fliegen in Sekunden | heise online
heise.de/-9623443 #DeepReinforcementLearning #ReinforcementLearning #RL

Victor Paléologuepalaio@fediscience.org
2023-09-05

Will the next generation of #LLM come from #DeepMind?
wired.com/story/google-deepmin
They may have a shot at it given their expertise in #DeepReinforcementLearning. If their #AI can plan tasks with solid logical grounds, can't they also produce solid explanations?

Tero Keski-Valkamatero@rukii.net
2023-06-07

Faster sorting #algorithms discovered using #DeepReinforcementLearning | #Nature

"Here we show how #ArtificialIntelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep #ReinforcementLearning agent, #AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the #LLVM standard C++ sort library3. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."

nature.com/articles/s41586-023

Zahra RezazadehZahra@fediscience.org
2022-11-23

#introducton 
Here to talk #neuroscience mostly about human #decisionmaking and #learning in #ReinforcementLearning environments. 
I'm doing my #PhD at LMU #Munich, working with #EyeTracking and #EEG data, using #Python and #MATLAB, and am interested in #DeepReinforcementLearning, #senseofagency and #socialagency ! 
On the side, I play #chess and hike #Alpine !
Hello #Fediverse :)

heise online (inoffiziell)heiseonline@squeet.me
2022-11-07
Für Roboter ist das Bewegen in Menschenmengen eine Herausforderung. Aber er kann das Verhalten, der Menschen analysieren, um kollisionsfrei durchzukommen.
Navigieren in überfüllten Räumen: Roboter nutzen Menschen als "Sensoren"

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst