#Reinforcement

N-gated Hacker News (ngate)
2025-05-06

👨‍💻🤖 Oh joy, another boasting about implementing Sutton and Barto's RL methods! Because who doesn't want to navigate through yet another maze of and autopilot writing? 🚀🎉
github.com/ivanbelenky/RL

Asheville Charlie (avlcharlie)
2025-05-02

Plinked out a new song on the last night.
Kinda Pink Floyd-ish.. needs a little more work and then to the mixer so maybe on my next days off I'll have another to post.

(this post to make me do it)


2025-04-21

'Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play', by Zelai Xu, Chao Yu, Yancheng Liang, Yi Wu, Yu Wang.

jmlr.org/papers/v26/24-1503.ht

#reinforcement #games #play

2025-04-14

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of #Damasio's #theory of consciousness 1:1 into concrete #schematics for #deeplearning. To this end, strategies such as #feedforward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised #learning are used to simulate the #biological #processes of the #neuronal #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

photo of Patrick Krauss
2025-04-14

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of #Damasio's #theory of consciousness 1:1 into concrete #schematics for #DeepLearning. To this end, strategies such as #feedforward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised learning are applied to simulate the #biological #processes of the #neural #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

Photo of Patrick Krauß
2025-03-30

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of #Damasio's #theory of consciousness 1:1 into concrete #schematics for #deeplearning. To this end, strategies such as #feedforward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised #learning are used to simulate the #biological #processes of the #neuronal #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

photo of Patrick Krauss
2025-03-30

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of #Damasio's #theory of consciousness 1:1 into concrete #schematics for #DeepLearning. To this end, strategies such as #feedforward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised learning are applied to simulate the #biological #processes of the #neural #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

Photo of Patrick Krauß
2025-03-20

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of Damasio's theory of consciousness 1:1 into concrete #schematics for #deep #learning. To this end, strategies such as #feed-forward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised learning are used to simulate the #biological #processes of the #neuronal #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

photo of Patrick Krauss
2025-03-20

#Zoomposium with Dr. #Patrick #Krauß: Building instructions for #artificial #consciousness

Transferring the various stages of Damasio's theory of consciousness 1:1 into concrete #schematics for #Deep #Learning. To this end, strategies such as #feed-forward connections, #recurrent #connections in the form of #reinforcement learning and #unsupervised learning are applied to simulate the #biological #processes of the #neural #networks.

More at: philosophies.de/index.php/2023

or: youtu.be/rXamzyoggCo

Photo of Patrick Krauß
GripNews (GripNews)
2025-03-10

🌘 GitHub - MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
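➤ A guide to the mathematical foundations of reinforcement learning
github.com/MathFoundationRL/Bo
This is the homepage of a new book, "Mathematical Foundations of Reinforcement Learning", which aims to give a mathematical yet accessible introduction to the basic concepts, problems, and classic algorithms of reinforcement learning.
+ This book looks very helpful; it is a good resource for anyone who wants to understand reinforcement learning in depth!
+ I like the way this book explains the math; it lets me grasp complex concepts more clearly.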
Learning

2025-02-13

'Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents', by Marco Pleines, Matthias Pallasch, Frank Zimmer, Mike Preuss.

jmlr.org/papers/v26/24-0043.ht

#memory #reinforcement #recurrent

2025-02-06

'A New, Physics-Informed Continuous-Time Reinforcement Learning Algorithm with Performance Guarantees', by Brent A. Wallace, Jennie Si.

jmlr.org/papers/v25/24-0017.ht

#control #reinforcement #exploration

2025-02-05

'Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach', by Shuang Qiu, Boxiang Lyu, Qinglin Meng, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan.

jmlr.org/papers/v25/23-0159.ht

#reinforcement #reward #dynamic

2025-01-11

'Learning Regularized Graphon Mean-Field Games with Unknown Graphons', by Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang.

jmlr.org/papers/v25/23-1409.ht

#graphon #graphons #reinforcement

Knowledge Zone (kzoneind@mstdn.social)
2024-12-20

#ITByte: Deep Q-learning is a #Reinforcement #Learning technique that combines Q-Learning and deep neural networks. It aims to help agents learn optimal actions in complex environments.

Here is a brief overview of Q-Learning and Deep Q-Learning.

knowledgezone.co.in/posts/Deep
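
To make the Q-learning half of that description concrete, here is a minimal sketch of the tabular update rule. The environment is an assumption for illustration only: a hypothetical 5-state chain where action 1 moves right, action 0 moves left, and reaching the last state pays reward 1. The function name and all parameter values are likewise made up for this example. Deep Q-learning keeps the same target, r + gamma * max Q(s', a'), but replaces the table with a neural network trained toward it; that part is not shown here.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    Action 1 moves right, action 0 moves left. Reaching the last state
    gives reward 1 and ends the episode; every other step gives reward 0.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]

    for _ in range(episodes):
        s = 0
        for _ in range(100):  # step cap so an episode always terminates
            # Epsilon-greedy action selection; ties break toward "right".
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic chain dynamics, clipped at both ends.
            s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
            done = s2 == n_states - 1
            r = 1.0 if done else 0.0

            # Q-learning target: bootstrap from the best next action
            # unless the episode has ended.
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])

            s = s2
            if done:
                break
    return q


if __name__ == "__main__":
    for state, (left, right) in enumerate(q_learning_chain()):
        print(state, round(left, 3), round(right, 3))
```

In this toy setting the learned values for "right" should end up higher than those for "left" in every non-terminal state, which is the greedy policy one would expect on a chain with the reward at the right end.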

2024-12-11

'Sample Complexity of Variance-Reduced Distributionally Robust Q-Learning', by Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou.

jmlr.org/papers/v25/23-0526.ht

#robust #efficiently #reinforcement
