Cong

ML PhD Student at the University of Oxford, (Offline) RL X Generative Modelling

2023-03-14

RL agents πŸ€– need a lot of data, which they usually need to gather themselves. But does that data need to be real? Enter *Synthetic Experience Replay*, leveraging recent advances in #GenerativeAI in order to vastly upsample ⬆️ an agent’s training data!

Paper: arxiv.org/abs/2303.06614

2023-01-23

πŸ’₯ ML Research Opportunity for all under-represented undergrads at the University of Oxford! πŸ’₯

Would appreciate help sharing this widely! UNIQ+ is an awesome way to spend two months getting stuck into ML in great research groups.

See proposed projects here: ox.ac.uk/admissions/graduate/a

Cong boosted:
Tim G. J. Rudnertimrudner@sigmoid.social
2022-12-05

πŸ“£ You can now find *V-D4RL*, a benchmarking suite for offline RL from pixels, on #huggingface:
huggingface.co/datasets/conglu πŸš€

Highlights:
πŸ’₯ New D4RL-style visual datasets!
πŸ’₯ Competitive baselines based on Dreamer and DrQ!
πŸ’₯ A set of exciting open problems!

This is joint work with @conglu, Phil Ball, @jparkerholder, @maosbot, and @yeewhye !

2022-12-05

Now time for a first research post...

No better time to start on offline RL from pixels! V-D4RL is now on #huggingface at huggingface.co/datasets/conglu

πŸ’₯ New D4RL-style visual datasets!
πŸ’₯ Competitive baselines based on Dreamer and DrQ!
πŸ’₯ A set of exciting open problems!

2022-12-05

#introduction

Time for a very late introduction, I'm a 4th year PhD student at the University of Oxford interested in deep reinforcement learning, generative modelling, and Bayesian methods!

Most lately, been thinking about effective ways to automate reinforcement learning (PBT, HPO) and how to extend use cases for offline reinforcement learning (learning from pixels, generalizing to unseen environments)!

Always v. v. happy to chat :)

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst