Multi-Timescale Joint Optimization of Task Scheduling, Instance Switching, and Resource Scaling for Disaggregated LLM Serving

Published in IEEE Transactions on Cognitive Communications and Networking, 2026