Bio
We haven't found any bio for you yet.
Researcher Links
Loading links...
Publications by Type
Loading publications…
The last 5 uploaded publications
Revisiting Cache Freshness for Emerging Real-Time Applications
Ziming Mao, Rishabh Iyer, Scott Shenker, Ion Stoica (2024). Revisiting Cache Freshness for Emerging Real-Time Applications. , DOI: https://doi.org/10.1145/3696348.3696858.
Preprint122 days agoSkyServe: Serving AI Models across Regions and Clouds with Spot Instances
Ziming Mao, Tian Xia, Zhanghao Wu, Wei-Lin Chiang, Tyler Griggs, Romil Bhardwaj, Zongheng Yang, Scott Shenker, Ion Stoica (2024). SkyServe: Serving AI Models across Regions and Clouds with Spot Instances. , DOI: https://doi.org/10.48550/arxiv.2411.01438.
Preprint141 days agoLEANN: A Low-Storage Vector Index
Yichuan Wang, Zhifei Li, Shu Liu, Yongji Wu, Ziming Mao, Yilong Zhao, Yan Xiao, Zhiying Xu, Yang Zhou, Ion Stoica, Sewon Min, Matei Zaharia, Joseph E. Gonzalez (2025). LEANN: A Low-Storage Vector Index. , DOI: https://doi.org/10.48550/arxiv.2506.08276.
Preprint130 days agoLocality-aware Fair Scheduling in LLM Serving
Shiyi Cao, Yichuan Wang, Ziming Mao, P.-h.J. Hsu, Liangsheng Yin, Tian Xia, Dacheng Li, Shu Liu, Yuanhang Zhang, Yang Zhou, Ying Sheng, Joseph E. Gonzalez, Ion Stoica (2025). Locality-aware Fair Scheduling in LLM Serving. , DOI: https://doi.org/10.48550/arxiv.2501.14312.
Preprint130 days agoThe Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution
Frank Sifei Luan, Ziming Mao, R. Wang, Chi‐Wei Lin, Amog Kamsetty, Hao Chen, Cheng Su, Balaji Veeramani, Scott Lee, SangBin Cho, Clark Zinzow, Eric Liang, Ion Stoica, Stephanie Wang (2025). The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution. , DOI: https://doi.org/10.48550/arxiv.2501.12407.
Preprint130 days ago