Bidipta Sarkar

I am a first-year DPhil student in Engineering Science at the University of Oxford in the FLAIR and WhiRL labs, co-supervised by Professor Jakob Foerster and Professor Shimon Whiteson. I am funded by the Clarendon Fund Scholarship in partnership with a Department of Engineering Science Studentship.

I received my BS in Computer Science with Honors and Distinction at Stanford University (2020-2024), where I was also a member of Professor Dorsa Sadigh's ILIAD lab since my sophomore year.

I am interested in creating AI agents that can interact with their environment and safely work alongside humans and other autonomous agents, with a growing focus on integrating natural language into AI coordination. My research broadly spans three subfields of computer science:

• Multi-Agent Reinforcement Learning: Enabling independently trained agents to cooperate on a common task and form conventions.

• Vision: Capturing meaningful information about an agent's environment from sensors.

• Graphics: Simulating environments while balancing speed and realism.


Posts

Publications

Physically Grounded Vision-Language Models for Robotic Manipulation

Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

International Conference on Robotics and Automation (ICRA), May 2024

Paper / Website / Video

vlm_image.png

Diverse Conventions for Human-AI Collaboration

Bidipta Sarkar, Andy Shih, Dorsa Sadigh

Conference on Neural Information Processing Systems (NeurIPS), December 2023

Paper / Website / Code / Video

XPHandshake.png

An Extensible, Data-Oriented Architecture for High-Performance, Many-World Simulation

Brennan Shacklett, Luc Guy Rosenzweig, Zhiqiang Xie, Bidipta Sarkar, Andrew Szot, Erik Wijmans, Vladlen Koltun, Dhruv Batra, Kayvon Fatahalian

Transactions on Graphics 2023

Paper / Website / RL Environments / Blog / Colab

madrona.png

PantheonRL: A MARL Library for Dynamic Training Interactions

Bidipta Sarkar*, Aditi Talati*, Andy Shih*, Dorsa Sadigh

Proceedings of the 36th AAAI Conference on Artificial Intelligence (Demo Track), February 2022

Paper / Code / Video / DOI

round_robin.png

Bachelor's Honors Thesis

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Bidipta Sarkar, Warren Xia, C. Karen Liu, Dorsa Sadigh

Stanford Digital Repository, May 2024

Paper / Website / Code / DOI

AmongUsDiagrams.png

Preprints

An Interactive Agent Foundation Model

Zane Durante*, Bidipta Sarkar*, Ran Gong*, Rohan Taori, Yusuke Noda, Paul Tang, Ehsan Adeli, Shrinidhi Kowshika Lakshmikanth, Kevin Schulman, Arnold Milstein, Demetri Terzopoulos, Ade Famoti, Noboru Kuno, Ashley Llorens, Hoi Vo, Katsu Ikeuchi, Li Fei-Fei, Jianfeng Gao, Naoki Wake*, Qiuyuan Huang*

arXiv, February 2024

Paper / Website

Agentframework.png


Other Projects

Temporally and Spatially Novel Video Frame Synthesis using 4D Video Autoencoder

Bidipta Sarkar, Xinyi Wang, Kathy Yu

CS231n Final Project, Spring 2022

(Best Project Poster Award)

Report / Poster / CS231n Tweet / Code

4dencoder.png

Simulating Food Interactions with Material Point Methods in Houdini

CS348C Final Project, Winter 2022

Report / Houdini File / Video

348c_img.png

Real-time Cel Shading

CS248 Final Project, Winter 2022

Report / Video

cel_shading.png

Virtual Hand Interactions with ARKit

CS231a Final Project, Winter 2022

Report / Code / Demo Video

ar_hands.png

The Guardian

CS148 Final Project, Fall 2022

Report / Final Image / View 2 / No Material / Blend File

bidiptas.png

Multi-Agent Self-Learning Tank Game

Bidipta Sarkar, Henry Ang

CS221 Final Project, Spring 2021

Report / Code / Video

221.jpeg


Teaching and Outreach

Stanford Center for Teaching and Learning CS Tutor

CTL.png

Tau Beta Pi Mentor

TBP.png

Section Leader (CS 106A and 106B)

198.png