Bidipta Sarkar


Publications

Evolution Strategies at the Hyperscale

Bidipta Sarkar*, Mattie Fellows*, Juan Agustin Duque*, Alistair Letcher\(^\dagger\), Antonio León Villares\(^\dagger\), Anya Sims\(^\dagger\), Dylan Cope\(^\dagger\), Jarek Liesen\(^\dagger\), Lukas Seier\(^\dagger\), Theo Wolf\(^\dagger\), Uljad Berdica\(^\dagger\), Alexander David Goldie, Aaron Courville, Karin Sevegnani, Shimon Whiteson*, Jakob Nicolaus Foerster*

arXiv preprint, November 2025

Paper / Website / Code / Nano-EGG Code

diagram.png

Discrete Flow Matching is a Surprisingly Effective Post-training Method to Address Compound Error in Autoregressive Models

Kang Li*, Bidipta Sarkar*, Zheng Xiong, Sascha Frey, Zilin Wang, Frensi Zejnullahu, Alfred Backhouse, Stefan Zohren, Anisoara Calinescu, Mihai Cucuringu\(^\dagger\), Jakob Foerster\(^\dagger\)

ACM International Conference on AI in Finance (Oral), November 2025

Paper

LOB-Bench: Benchmarking Generative AI for Finance – an Application to Limit Order Book Data

Peer Nagy*, Sascha Frey*, Kang Li, Bidipta Sarkar, Svitlana Vyetrenko, Stefan Zohren, Ani Calinescu, Jakob Foerster

International Conference on Machine Learning (ICML), July 2025

Paper / Website / Code

lobbench_img.png

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Bidipta Sarkar, Warren Xia, C. Karen Liu, Dorsa Sadigh

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (Oral), May 2025
Stanford Senior Honors Thesis, May 2024

Paper / Stanford Digital Repository / Website / Code / Models / Poster

AmongUsDiagrams.png

Physically Grounded Vision-Language Models for Robotic Manipulation

Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

International Conference on Robotics and Automation (ICRA), May 2024

Paper / Website / Video

vlm_image.png

Diverse Conventions for Human-AI Collaboration

Bidipta Sarkar, Andy Shih, Dorsa Sadigh

Conference on Neural Information Processing Systems (NeurIPS), December 2023

Paper / Website / Code / Video / Poster

XPHandshake.png

An Extensible, Data-Oriented Architecture for High-Performance, Many-World Simulation

Brennan Shacklett, Luc Guy Rosenzweig, Zhiqiang Xie, Bidipta Sarkar, Andrew Szot, Erik Wijmans, Vladlen Koltun, Dhruv Batra, Kayvon Fatahalian

Transactions on Graphics 2023

Paper / Website / RL Environments / Blog / Colab

madrona.png

PantheonRL: A MARL Library for Dynamic Training Interactions

Bidipta Sarkar*, Aditi Talati*, Andy Shih*, Dorsa Sadigh

Proceedings of the 36th AAAI Conference on Artificial Intelligence (Demo Track), February 2022

Paper / Code / Video / DOI / Poster

round_robin.png

Workshop Papers

An Interactive Agent Foundation Model

Zane Durante*, Bidipta Sarkar*, Ran Gong*, Rohan Taori, Yusuke Noda, Paul Tang, Ehsan Adeli, Shrinidhi Kowshika Lakshmikanth, Kevin Schulman, Arnold Milstein, Demetri Terzopoulos, Ade Famoti, Noboru Kuno, Ashley Llorens, Hoi Vo, Katsu Ikeuchi, Li Fei-Fei, Jianfeng Gao, Naoki Wake*, Qiuyuan Huang*

Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshops, June 2025

Paper / Website

Agentframework.png