Multi-Timescale Joint Optimization of Task Scheduling, Instance Switching, and Resource Scaling for Disaggregated LLM Serving
Published in IEEE Transactions on Cognitive Communications and Networking, 2026