news

Jan, 2026 My MSR internship project Representation-Based Exploration for Language Models: From Test-Time to Post-Training was accepted to ICLR 2026!
Jan, 2026 Our new LLM post-training algorithm called RepExp was open-sourced into the verl-recipe repo.
Oct, 2025 Completed internship at Microsoft Research, NYC, where I worked with Dylan Foster, Akshay Krishnamurthy, and Jordan Ash! Check out our recent preprint on “Representation-Based Exploration for Language Models: From Test-Time to Post-Training”.
Oct, 2025 Our work “Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning” was accepted as Oral at ICLR 2025!
Oct, 2024 Our work “Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning” was accepted to the IMOL workshop at NeurIPS.