Bio
We haven't found any bio for you yet.
Researcher Links
Loading links...
Publications by Type
Loading publications…
The last 5 uploaded publications
View all
Towards Efficient and Practical GPU Multitasking in the Era of LLM
Jiarong Xing, Yifan Qiao, Simon Mo, Xiao-Bing Cui, Gur-Eyal Sela, Yang Zhou, Joseph E. Gonzalez, Ion Stoica (2025). Towards Efficient and Practical GPU Multitasking in the Era of LLM. , DOI: https://doi.org/10.48550/arxiv.2508.08448.
Preprint25 days agoPrism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving
Shan Yu, Jiarong Xing, Yifan Qiao, Mingyuan Ma, Yangmin Li, Yang Wang, Shuo Yang, Zhiqiang Xie, Shiyi Cao, Ke Bao, Ion Stoica, Harry Xu, Ying Sheng (2025). Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving. , DOI: https://doi.org/10.48550/arxiv.2505.04021.
Preprint25 days ago