publications

* denotes equal contribution.

2024

  1. OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
    Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, Yueqi Song, Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff, Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao PengHeng Ji, and Graham Neubig
    Arxiv preprint, Jul 2024
  2. Executable Code Actions Elicit Better LLM Agents
    ICML 2024; LLMAgents Workshop @ ICLR 2024 (Oral), Feb 2024
  3. A Single Transformer for Scalable Vision-Language Modeling
    Yangyi Chen*Xingyao Wang*Hao Peng, and Heng Ji
    Arxiv preprint, Jul 2024
  4. MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
    In Proceedings of the International Conference on Learning Representations, May 2024
  5. CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
    Lifan Yuan*Yangyi Chen*Xingyao WangYi R. FungHao Peng, and Heng Ji
    In Proceedings of the International Conference on Learning Representations, May 2024
  6. R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
    Hanning Zhang*, Shizhe Diao*, Yong Lin*, Yi R. Fung, Qing Lian, Xingyao WangYangyi ChenHeng Ji, and Tong Zhang
    In Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Outstanding Paper, Jun 2024
  7. Advancing LLM Reasoning Generalists with Preference Trees
    Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, and Maosong Sun
    Arxiv preprint, Apr 2024
  8. If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
    Ke YangJiateng Liu, John Wu, Chaoqi Yang, Yi Ren FungSha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, Heng Ji, and Chengxiang Zhai
    LLMAgents Workshop @ ICLR 2024, Jan 2024
  9. SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
    Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao WangYangyi Chen, and Jing Gao
    Arxiv preprint, Jan 2024
  10. Text-Based Reasoning About Vector Graphics
    Arxiv preprint, Apr 2024
  11. Examining LLMs’ Uncertainty Expression Towards Questions Outside Parametric Knowledge
    Genglin LiuXingyao WangLifan YuanYangyi Chen, and Hao Peng
    Arxiv preprint, Feb 2024
  12. LETI: Learning to Generate from Textual Interactions
    Xingyao WangHao PengReyhaneh Jabbarvand, and Heng Ji
    In Findings of the Association for Computational Linguistics: (NAACL); DaSH Workshop @ NAACL 2024 (Oral), Jun 2024

2023

  1. Code4Struct: Code Generation for Few-Shot Event Structure Prediction
    Xingyao WangSha Li, and Heng Ji
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
  2. ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
    Yangyi ChenXingyao WangManling Li, Derek Hoiem, and Heng Ji
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  3. Defining a New NLP Playground
    Sha LiChi HanPengfei YuCarl EdwardsManling LiXingyao WangYi Fung, Charles Yu, Joel Tetreault, Eduard Hovy, and Heng Ji
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023

2022

  1. POTATO: The Portable Text Annotation Tool
    Jiaxin PeiAparna AnanthasubramaniamXingyao WangNaitian Zhou, Apostolos Dedeloudis, Jackson Sargent, and David Jurgens
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Dec 2022

2021

  1. MLSys 2021
    Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters
    Shaohuai Shi*, Xianhao Zhou*, Shutao Song*, Xingyao Wang, Zilin Zhu, Xue Huang, Xinan Jiang, Feihu Zhou, Zhenyu Guo, Liqiang Xie, Rui Lan, Xianbin Ouyang, Yan Zhang, Jieqian Wei, Jing Gong, Weiliang Lin, Ping Gao, Peng Meng, Xiaomin Xu, Chenyang Guo, Bo Yang, Zhibo Chen, Yongjian Wu, and Xiaowen Chu
    In Proceedings of Machine Learning and Systems, Dec 2021
  2. An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog
    Xingyao Wang, and David Jurgens
    In Findings of the Association for Computational Linguistics: EMNLP 2021, Nov 2021