I'm a final-year PhD Candidate at University of Notre Dame, advised by Prof. Xiangliang Zhang.
I study Agents and Systems, and how they co-evolve on the path to superintelligence, especially Long-Horizon Agent Training, Automated Recursive Self-Improvement/Systems (Code Intelligence, AutoResearch), Agent-System Co-Design, and Language Model Post-Training. I work as a research engineer who is driven to build novel & fundamental systems from scratch. My work (all first-authored) includes:
| (1) Long-Horizon Agent Training: |
AutoLLMResearch 🆕 (In Submission)
MTSQL-R1 🆕 (ACL'26 Main)
|
| (2) Automated Recursive Self-Improvement / Systems: |
AutoLLMResearch (In Submission)
MTSQL-R1 (ACL'26 Main)
NeurIPS AutoML Competition (NeurIPS 2nd Place 🥈)
|
| (3) Agent-System Co-Design: |
ASAP: Efficient Agent-System Co-Design (In Submission)
LLM MultiAgents (IJCAI'24, 1800+ citations)
KGLA (18+ citations)
|
|
| (4) Language Model Post-Training: |
NewsRecTransfer (WWW'23 Oral)
ReactionTeam (IEEE Conference on BigData'25 Oral)
NLP Classification Competition (Kaggle 1st & 5th Place 🏅)
KDD Cup RecSys Competition (KDD Cup 2nd Place 🥈)
|
| (5) LLM Reasoning: |
LLMChemBench (NeurIPS'23, 400+ citations)
|
|
Selected Publications
Full Paper List | Citations: 2868 (80%+ from first-author papers)
-
-
|
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training
Taicheng Guo, Hai Wang, ChaoChun Liu, Mohsen Golalikhani, Xin Chen, Xiangliang Zhang, Chandan K. Reddy.
In ACL 2026 Main (Acceptance Rate 19%)
Problem: Multi-turn Text-to-SQL is one-shot translation without execution feedback, giving non-executable queries.
Contribution: Agentic training as an MDP: propose, execute, verify, refine SQL.
Key Insight: Execution-based verification + dialogue memory enable self-correction over one-shot generation.
[Code]
|
-
|
Large Language Model based Multi-Agents: A Survey of Progress and Challenges.
Taicheng Guo, Xiuying Chen, Yaqi Wang, Ruidi Chang, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang.
In IJCAI 2024 (Acceptance Rate 20%)
Problem: LLM-based multi-agent systems lack a unified view of architectures, communication, and limits.
Contribution: A survey organizing them by agent profiling, communication, capability enhancement, and benchmarks.
Key Insight: LLM planning and reasoning enable multi-agent collaboration beyond single agents.
[Code]
|
-
|
ReactionTeam: Teaming Model Experts for Divergent Thinking Beyond Typical Reaction Patterns.
Taicheng Guo, Changsheng Ma, Xiuying Chen, Bozhao Nan, Kehan Guo, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang.
In IEEE Conference on BigData 2025 Oral (Acceptance Rate 18%)
Problem: Likelihood-maximizing models predict only the top outcome, ignoring reactions' stochastic nature.
Contribution: An ensemble of pattern-specialized experts plus a ranking expert for diverse predictions.
Key Insight: Emulating chemists' divergent thinking captures plausible rare outcomes single models miss.
[Code]
|
-
|
What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks.
Taicheng Guo, Kehan Guo, Bozhao Nan, Zhenwen Liang, Zhichun Guo, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang.
In NeurIPS 2023 (Acceptance Rate 32%)
Problem: LLMs' chemistry capabilities are poorly understood and unevaluated.
Contribution: A benchmark of eight chemistry tasks across five LLMs (zero-/few-shot).
Key Insight: LLM competence varies sharply by task; in-context learning is decisive.
[Code]
|
-
|
Few-shot News Recommendation via Cross-lingual Transfer.
Taicheng Guo, Lu Yu, Basem Shihada, Xiangliang Zhang.
In the ACM Web Conference(WWW 2023 Oral) (Acceptance Rate 19%)
Problem: Early-stage news platforms face cold-start with only a few interaction records.
Contribution: A cross-lingual transfer framework from data-rich to few-shot platforms.
Key Insight: News shares topics across languages, so cross-lingual transfer beats data scarcity.
[Code]
[Code]
|
-
|
Knowledge Graph Enhanced Language Agents for Recommendation
Taicheng Guo, Chaochun Liu, Hai Wang, Varun Mannam, Fang Wang, Xin Chen, Xiangliang Zhang, Chandan K. Reddy.
In Submission
Problem: Recommendation language agents miss user–item relationships, giving weak profiles.
Contribution: KGLA verbalizes knowledge-graph paths into the agent's user–item simulation.
Key Insight: KG paths expose the reasons behind user preferences.
|
-
|
Data Interpreter: An LLM Agent For Data Science.
Sirui Hong, Yizhang Lin, ... Taicheng Guo
In ACL 2025 Findings
Problem: LLM agents struggle with long-horizon, dynamic data-science workflows.
Contribution: Hierarchical graph task decomposition + programmable, self-refining code nodes.
Key Insight: Subgraph modeling adapts to evolving dependencies while verification boosts reliability.
[Code]
|
Competition Award
-
KDD Cup Large-Scale RecSys Competition.
Ranked 2nd place. 🥈 ($1000)
-
NeurIPS Black-Box AutoML Optimization Challenge.
Ranked 2nd place in warm-start friendly leaderboard. 🥈
-
Kaggle Arabic Sentiment Analaysis.
Ranked 5th place.
-
Kaggle Expert
-
IEEE Computer Society Global student challenge.
Ranked 1st place. 🏅 ($1000)
Experience
Applied Scientist Intern. Amazon. Jan. 2025 - Jul. 2025
Applied Scientist Intern. Amazon. Jun. 2024 - Sept. 2024
Research Intern. MBZUAI. Aug. 2022 - Oct. 2022
Visiting Student. KAUST. Dec. 2020 - Aug. 2021
Senior Machine Learning Engineer. AI Platform, Tuyoo Games (Lead a 4-person team to build Game AI Intelligence: Monte Carlo tree search (MCTS), MinMax for Zero-sum Games (Multiplayer poker)) Aug. 2018 - Oct. 2020
MISC
Avid runner 🏃🏻, swimmer 🏊🏻
Favorite books: The Unbearable Lightness of Being, 1Q84.
Last update: Apr, 2026. Webpage template borrows from
Stephen Bach.
|