Ph.D. Candidate · Future Network Laboratory · USTC
Tao Zhang
Efficient LLM Serving · AI Infrastructure · Multi-Agent Systems
I study how modern AI systems can serve large language models faster and more efficiently across heterogeneous GPU clusters, disaggregated inference pipelines, networked workloads, and collaborative agents.
LLM Serving AI Infrastructure Disaggregated Systems KV Cache Reuse Multi-Agent Communication Multimodal Efficiency
8 First/co-first papers
1 Oral paper
2 SCI Q1 journal papers
2026 CVPR · ACL · EMNLP · MM
Research Focus
Serving systems for modern AI workloads
My research centers on efficient inference serving and AI infrastructure: resource scheduling for disaggregated LLM serving, KV-cache optimization for RAG, MoE training systems, multimodal token pruning, and communication-efficient multi-agent collaboration.
- DisHelisDeployment and resource allocation for heterogeneous disaggregated LLM serving.
- SpecCacheSpeculative KV cache reuse for efficient RAG serving.
- LatComLatent compression for efficient multi-agent collaboration.
Selected Publications
First-author and co-first-author work
01 DisHelis IEEE TCCN · 2026 · First author · SCI 一区 02 SpecCache ACL 2026 · 2026 · Co-first author · Oral 03 HAWK CVPR 2026 · 2026 · Co-first author · Poster 04 SAVP EMNLP 2026 · 2026 · Co-first author · Poster 05 GSTEP ACM Multimedia 2026 · 2026 · Co-first author · Poster 06 LatCom EMNLP 2026 · 2026 · Co-first author · Poster 07 Multi-Timescale Joint Optimization for Disaggregated LLM Serving IEEE TCCN · 2026 · First author · SCI 二区 08 FAESR IEEE TCCN · 2025 · First author · SCI 一区
Education
- Ph.D. CandidateUniversity of Science and Technology of China, Institute of Advanced Technology and Future Network Laboratory, 2023.09 - Present
- B.Eng.Chongqing University of Posts and Telecommunications, School of Communication and Information Engineering, 2019.09 - 2023.06
Recent Honors
- National ScholarshipUniversity of Science and Technology of China, 2025
- Graduate Academic First-Class ScholarshipUniversity of Science and Technology of China, 2023 and 2024
- Outstanding GraduateChongqing Municipality, 2023