Jubayer Ibn Hamid

I am an incoming PhD student at Stanford University. My research is currently advised by Chelsea Finn and Dorsa Sadigh. I studied Mathematical Physics (B.S) and Computer Science (M.S), both at Stanford University.

I work in artificial intelligence with a focus on the intersection of reinforcement learning, generative models and representation learning. I am also interested in pure mathematics such as abstract algebra, category theory and algebraic geometry.

CV  /  Scholar  /  Email  /  Twitter    

profile photo
Research

I am currently interested in deep exploration methods for online reinforcement learning fine-tuning, spanning both language models and embodied agents. My research also focuses on test-time decoding from complex, multimodal policies and training robotic policies with long context. Additionally, I am interested in understanding what kinds of data to scale for improved generalization in robot learning.

Publications:

(*) denotes co-first authorship

Notes

Here are some introductory notes on various topics that have fascinated me. These are not meant to be in-depth. Rather, they are meant to cover some of the basic constructions that show up periodically and are also interesting in and of themselves.

Talks

Bidirectional Decoding. OpenAI. 25th February, 2025.

Teaching

CS 224R - Deep Reinforcement Learning : Head CA. Spring, 2025.

CS 229 - Machine Learning : CA. Winter, 2025.


Template