CV
Summary
Ph.D. candidate at the University of Science and Technology of China working on efficient LLM inference serving, AI infrastructure, and multi-agent systems.
Education
- Institute of Advanced Technology; Future Network Laboratory, University of Science and Technology of China (Present)
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications (2023-06)
Skills
Research
- LLM inference serving
- AI infrastructure
- multi-agent systems
- distributed scheduling
- multimodal model efficiency
Software Development
- Backend development
- Frontend development
- Android development
- Distributed databases
- Traffic data processing
Publications
- FAESR: Fine-Grained Rate Adaptation for Energy-Aware Super Resolution in Mobile Panoramic Video Streaming (2025)
- DisHelis: Optimizing Deployment of Disaggregated LLMs Inference Serving over Heterogeneous Environments via Hierarchical Max-Flow (2026)
- Multi-Timescale Joint Optimization of Task Scheduling, Instance Switching, and Resource Scaling for Disaggregated LLM Serving (2026)
- HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models (2026)
- SpecCache: Speculative KV Cache Reuse for Efficient RAG Serving (2026)
- SAVP: Scene-Aware Vision Token Pruning for Efficient Video Large Language Models (2026)
- GSTEP: Global Spatio-Temporal Density-Driven Visual Token Pruning for Efficient Video Large Language Models (2026)
- LatCom: Latent Compression for Efficient Multi-Agent Collaboration (2026)
Languages
- English: CET-4 and CET-6; professional literature reading and academic writing
Interests
- Research Interests: Efficient LLM inference serving, AI infrastructure, disaggregated serving, multi-agent collaboration, visual token pruning