publications
* denotes equal contribution.
2024
-
- A Single Transformer for Scalable Vision-Language ModelingTransactions on Machine Learning Research, 2024, Jul 2024
-
- Advancing LLM Reasoning Generalists with Preference TreesArxiv preprint, Apr 2024
- If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent AgentsLLMAgents Workshop @ ICLR 2024, Jan 2024
-
- LETI: Learning to Generate from Textual InteractionsIn Findings of the Association for Computational Linguistics: (NAACL); DaSH Workshop @ NAACL 2024 (Oral), Jun 2024
2023
- Code4Struct: Code Generation for Few-Shot Event Structure PredictionIn Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
- ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision RepresentationIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
- Defining a New NLP PlaygroundIn Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
2022
2021
- MLSys 2021Towards Scalable Distributed Training of Deep Learning on Public Cloud ClustersIn Proceedings of Machine Learning and Systems, Dec 2021
- An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal DialogIn Findings of the Association for Computational Linguistics: EMNLP 2021, Nov 2021