publications | Xingyao Wang

2025

Preprint

Coding Agents with Multimodal Browsing are Generalist Problem Solvers

Aditya Bharat Soni, Boxuan Li, Xingyao Wang, Valerie Chen, and Graham Neubig

Arxiv preprint, Jun 2025

Paper Code
ACL 2025

LocAgent: Graph-Guided LLM Agents for Code Localization

Zhaoling Chen, Xiangru Tang, Gangda Deng, Fang Wu, Jialong Wu, Zhiwei Jiang, Viktor Prasanna, Arman Cohan, and Xingyao Wang

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Mar 2025

Paper Code
ICML 2025

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Xuehang Guo, Xingyao Wang, Yangyi Chen, Sha Li, Chi Han, Manling Li, and Heng Ji

ICML 2025, Feb 2025

Paper Website Code
Preprint

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Alejandro Cuadron, Dacheng Li, Wenjie Ma, Xingyao Wang, Yichuan Wang, Siyuan Zhuang, Shu Liu, Luis Gaspar Schroeder, Tian Xia, Huanzhi Mao, Nicholas Thumiger, Aditya Desai, Ion Stoica, Ana Klimovic, Graham Neubig, and Joseph E. Gonzalez

Arxiv preprint, Feb 2025

Paper Code

2024

ICML 2025 DL4C
@ ICLR 2025

Training Software Engineering Agents and Verifiers with SWE-Gym

Jiayi Pan*, Xingyao Wang*, Graham Neubig, Navdeep Jaitly, Heng Ji, Alane Suhr*, and Yizhe Zhang*

ICML 2025; DL4C Workshop @ ICLR 2025, Dec 2024

Paper Code X/Twitter Thread
ICLR 2025

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, Yueqi Song, Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff, Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao Peng, Heng Ji, and Graham Neubig

ICLR 2025, Jul 2024

Paper Code
ICML 2024 LLMAgents
@ ICLR 2024 Oral

Executable Code Actions Elicit Better LLM Agents

Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, and Heng Ji

ICML 2024; LLMAgents Workshop @ ICLR 2024 (Oral), Feb 2024

Paper Code Slide Blog X/Twitter Thread Poster Demo
TMLR 2024

A Single Transformer for Scalable Vision-Language Modeling

Yangyi Chen*, Xingyao Wang*, Hao Peng, and Heng Ji

Transactions on Machine Learning Research, 2024, Jul 2024

Paper Code X/Twitter Thread
ICLR 2024

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng, and Heng Ji

In Proceedings of the International Conference on Learning Representations, May 2024

Paper Website Code Blog X/Twitter Thread Poster
ICLR 2024

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

Lifan Yuan*, Yangyi Chen*, Xingyao Wang, Yi R. Fung, Hao Peng, and Heng Ji

In Proceedings of the International Conference on Learning Representations, May 2024

Paper Code
NAACL 2024 Outstanding
Paper

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

Hanning Zhang*, Shizhe Diao*, Yong Lin*, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, and Tong Zhang

In Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Outstanding Paper, Jun 2024

Paper Code
Preprint

Advancing LLM Reasoning Generalists with Preference Trees

Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, and Maosong Sun

Arxiv preprint, Apr 2024

Paper Code X/Twitter Thread
LLMAgents
@ ICLR 2024

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi Ren Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, Heng Ji, and Chengxiang Zhai

LLMAgents Workshop @ ICLR 2024, Jan 2024

Paper
EMNLP 2024

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, and Jing Gao

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Jan 2024

Paper Code
Preprint

Text-Based Reasoning About Vector Graphics

Zhenhailong Wang, Joy Hsu, Xingyao Wang, Kuan-Hao Huang, Manling Li, Jiajun Wu, and Heng Ji

Arxiv preprint, Apr 2024

Paper Website Code
Preprint

Examining LLMs’ Uncertainty Expression Towards Questions Outside Parametric Knowledge

Genglin Liu, Xingyao Wang, Lifan Yuan, Yangyi Chen, and Hao Peng

Arxiv preprint, Feb 2024

Paper Code
NAACL 2024
(Findings) DaSH @
NAACL 2024 Oral

LETI: Learning to Generate from Textual Interactions

Xingyao Wang, Hao Peng, Reyhaneh Jabbarvand, and Heng Ji

In Findings of the Association for Computational Linguistics: (NAACL); DaSH Workshop @ NAACL 2024 (Oral), Jun 2024

Paper Code Slide X/Twitter Thread Poster

2023

ACL 2023

Code4Struct: Code Generation for Few-Shot Event Structure Prediction

Xingyao Wang, Sha Li, and Heng Ji

In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023

Paper Code X/Twitter Thread Poster
EMNLP 2023

ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation

Yangyi Chen, Xingyao Wang, Manling Li, Derek Hoiem, and Heng Ji

In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023

Paper
EMNLP 2023
(Findings)

Defining a New NLP Playground

Sha Li, Chi Han, Pengfei Yu, Carl Edwards, Manling Li, Xingyao Wang, Yi Fung, Charles Yu, Joel Tetreault, Eduard Hovy, and Heng Ji

In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023

Paper

2022

EMNLP 2022

POTATO: The Portable Text Annotation Tool

Jiaxin Pei, Aparna Ananthasubramaniam, Xingyao Wang, Naitian Zhou, Apostolos Dedeloudis, Jackson Sargent, and David Jurgens

In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Dec 2022

Paper Code

2021

MLSys 2021

Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters

Shaohuai Shi*, Xianhao Zhou*, Shutao Song*, Xingyao Wang, Zilin Zhu, Xue Huang, Xinan Jiang, Feihu Zhou, Zhenyu Guo, Liqiang Xie, Rui Lan, Xianbin Ouyang, Yan Zhang, Jieqian Wei, Jing Gong, Weiliang Lin, Ping Gao, Peng Meng, Xiaomin Xu, Chenyang Guo, Bo Yang, Zhibo Chen, Yongjian Wu, and Xiaowen Chu

In Proceedings of Machine Learning and Systems, Dec 2021

Paper
EMNLP 2021
(Findings)

An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog

Xingyao Wang, and David Jurgens

In Findings of the Association for Computational Linguistics: EMNLP 2021, Nov 2021

Paper Code X/Twitter Thread Poster Presentation Video Imgur Post