Agent Programming Frameworks & Infrastructure
Programming frameworks, runtime, and infrastructure for building LLM agent applications, including agent memory and multi-agent coordination (e.g., AgentScope).
Senior Director and Research Scientist
Data Analytics and Intelligence Lab (DAIL)
Tongyi Lab, Alibaba Group
My work is generally about making systems intelligent and efficient with machine learning and optimization techniques. My current research interests include programming frameworks for building LLM agents, systems and algorithms for tuning agent models, efficient data pipelines for training LLMs and agent models, and building novel agent applications (e.g., for data analytics and social science).
Previously, I also worked on database systems, data privacy, data pricing, and federated learning.
I lead several projects including AgentScope (agent programming framework, runtime & infra of agent applications, agent memory, and agent tuning) and Data-Juicer (multimodal data processing for LLMs and agents), which have been widely adopted in production. Some earlier research initiatives include PilotScope (AI4DB middleware) and FederatedScope (federated learning platform).
I'm always interested in discussing research collaborations and speaking opportunities. We are hiring research scientists, engineers, and research interns for our lab!
Making systems intelligent and efficient with machine learning and optimization techniques.
Programming frameworks, runtime, and infrastructure for building LLM agent applications, including agent memory and multi-agent coordination (e.g., AgentScope).
Systems and algorithms for tuning agent models and large language models, including preference optimization and scaling laws for test-time compute.
Efficient multimodal data processing pipelines for training LLMs and agent models at scale (e.g., Data-Juicer).
Novel agent applications, particularly for data analytics and social science, including Text-to-SQL, AI4DB, and large-scale multi-agent simulation.
Database systems, data privacy, data pricing, and federated learning (e.g., PilotScope, FederatedScope).
Major research initiatives and open-source platforms.
A flexible yet robust multi-agent platform that enables agent-oriented programming for building diverse LLM applications, including very large-scale multi-agent simulations.
arXiv 2024
A one-stop multimodal data processing system for large language models, providing 50+ core operators, data-model co-development, and cloud-scale adaptive processing.
SIGMOD 2024, NeurIPS 2025 (Spotlight), ICML 2025 (Spotlight)
Tools for translating natural language to executable analytics actions, including Text-to-SQL benchmarks and unified data manipulation frameworks empowered by large language models.
VLDB 2024, MLSys 2024
Interdisciplinary research combining economics, social science, and machine learning through agent-based simulation, auction design, and information design.
ICML 2024/2025, SODA 2023, NAACL 2025
An AI4DB middleware system that steers databases with machine learning drivers, enabling easy deployment of learned database components in real database systems.
VLDB 2024
A comprehensive federated learning platform with packages for privacy-preserving learning, GNN federation, and LLM fine-tuning in federated settings.
VLDB 2023, KDD 2022 (Best Paper in ADS), KDD 2024
A selection of recent research contributions.
SIGMOD 2026 (industrial track)
SIGMOD 2026 (demo)
NeurIPS 2025 (Spotlight)
ICML 2025 (Spotlight)
ACL Findings 2025
EMNLP Findings 2025
Transactions on Machine Learning Research 2025
IEEE Data Engineering Bulletin 2025
IEEE Data Engineering Bulletin 2025
SIGMOD 2024 (Industrial Track)
SIGMOD 2024 (Tutorial)
Foundations and Trends in Databases 2024
VLDB 2023 (Demo)
VLDB 2023
ICLR 2023
Transactions on Information Systems 2023
NeurIPS 2022 (Datasets and Benchmarks)
KDD 2022 (Best Paper in ADS)
WWW 2022
WWW 2022
KDD 2022 (Tutorial)
EDBT 2022 (Tutorial)
NeurIPS Workshop 2018 (PPML)
NeurIPS Workshop 2018 (PPML)
Data Mining and Knowledge Discovery 2012
SIGMOD 2012
VLDB Journal 2009
DASFAA 2007
ICDE 2007 (Best Student Paper)
If you are interested in my research and projects, and would like to join our lab as Research Scientists/Engineers or Research Interns, please drop me a line.
I have worked with some amazing interns in Microsoft Research (2013-2017) and Alibaba (2018-now):