Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

portfolio

publications

OmniCode: A Benchmark for Evaluating Software Development Agents

Published in Submitted to ICLR 2026 (Under Review), 2026

We propose a benchmark containing a broader and more diverse set of tasks for code-generated AI agents.

Recommended citation: Sonwane, Atharv*, Eng-Shen Tu*, Wei-Chung Lu*, Claas Beger*, Carter Larsen, Debjit Dhar, Rachel Chen et al. "OmniCode: A Benchmark for Evaluating Software Engineering Agents." arXiv preprint arXiv:2602.02262 (2026).
Download Paper

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.