Xingyao Wang
  • about
  • publications
  • blog (current)
  • life outside work

42

the answer to life, the universe, and everything

  • LLM
  • •

  • agent
  • •

  • evaluation
  • •

  • analysis
  • •

  • dataset
  • •

  • research
  • Introducing OpenDevin CodeAct 1.0, a new State-of-the-art in Coding Agents

    By Xingyao Wang, Bowen Li, and Graham Neubig.

    6 min read   ·   May 07, 2024

    2024   ·   LLM   agent   swe-bench   code     ·   research  

  • How should LLM agents best interact with our world?

    Blog for paper "Executable Code Actions Elicit Better LLM Agents"

    5 min read   ·   February 06, 2024

    2024   ·   LLM   agent   dataset   analysis     ·   research  

  • A new benchmark tailored for LLMs' multi-turn interactions

    Blog for ICLR 2024 paper "MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback"

    7 min read   ·   September 23, 2023

    2023   ·   LLM   agent   evaluation     ·   research  

© Copyright 2025 Xingyao Wang. Powered by Jekyll with al-folio theme. Last updated: May 01, 2025.