#MDPs

Tero Keski-Valkama (tero@rukii.net)
2025-02-12

Teaching ChatGPT-4o is a great way to learn.

It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
chatgpt.com/share/67ac8053-bcf

#LLM #mathematics #MarkovChains #MDPs

2024-02-21

'Model-Free Representation Learning and Exploration in Low-Rank MDPs', by Aditya Modi, Jinglin Chen, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal.

jmlr.org/papers/v25/22-0687.ht

#reinforcement #exploration #mdps

2023-08-30

'Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity', by Ali Kara, Naci Saldi, Serdar Yüksel.

jmlr.org/papers/v24/21-1457.ht

#quantization #quantized #mdps

2023-04-13

'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

jmlr.org/papers/v24/21-0117.ht

#mdps #markov #pcmdp
