news
| Jan, 2026 | My MSR internship project Representation-Based Exploration for Language Models: From Test-Time to Post-Training was accepted to ICLR 2026! |
|---|---|
| Jan, 2026 | Our new LLM post-training algorithm called RepExp was open-sourced into the verl-recipe repo. |
| Oct, 2025 | Completed internship at Microsoft Research, NYC, where I worked with Dylan Foster, Akshay Krishnamurthy, and Jordan Ash! Check out our recent preprint on “Representation-Based Exploration for Language Models: From Test-Time to Post-Training”. |
| Oct, 2025 | Our work “Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning” was accepted as Oral at ICLR 2025! |
| Oct, 2024 | Our work “Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning” was accepted to the IMOL workshop at NeurIPS. |