DisHelis: Optimizing Deployment of Disaggregated LLMs Inference Serving over Heterogeneous Environments via Hierarchical Max-Flow
Published in IEEE Transactions on Cognitive Communications and Networking, 2026
Published in IEEE Transactions on Cognitive Communications and Networking, 2026