Jubayer Ibn Hamid

I am an incoming Ph.D. student, currently doing research at Stanford where I am advised by Chelsea Finn and Dorsa Sadigh. I studied Mathematical Physics (B.S) and Computer Science (M.S), both at Stanford University.

I work in artificial intelligence with a focus on the intersection of reinforcement learning, generative models and representation learning. I am also interested in pure mathematics such as abstract algebra, category theory and algebraic geometry.

CV  /  Scholar  /  Email  /  Twitter    

profile photo
Research

My research interests in building embodied agents are in online reinforcement learning fine-tuning, test-time decoding methods from complex multimodal policies and long-context policies. I am also interested in understanding what kinds of data we should scale for better generalization and why. Recently, I have become interested in sample-efficient methods for preference fine-tuning generative models.

(*) denotes co-first authorship

Notes

Here are some introductory notes on various topics that have fascinated me. These are not meant to be in-depth. Rather, they are meant to cover some of the basic constructions that show up periodically and are also interesting in and of themselves.

Talks

Bidirectional Decoding. OpenAI. 25th February, 2025.

Teaching

CS 224R - Deep Reinforcement Learning : Head CA. Spring, 2025.

CS 229 - Machine Learning : CA. Winter, 2025.


Template