The last 5 uploaded publications
EIE
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark Horowitz, William J. Dally (2016). EIE. ACM SIGARCH Computer Architecture News, 44(3), pp. 243-254, DOI: 10.1145/3007787.3001163.
Article, 275 days ago
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark Horowitz, William J. Dally (2016). EIE: Efficient Inference Engine on Compressed Deep Neural Network. , pp. 243-254, DOI: 10.1109/isca.2016.30.
Preprint, 275 days ago
Deep compression and EIE: Efficient inference engine on compressed deep neural network
Song Han, Xingyu Liu, Huizi Mao, Pu Jing, Ardavan Pedram, Mark Horowitz, Bill Dally (2016). Deep compression and EIE: Efficient inference engine on compressed deep neural network. , DOI: 10.1109/hotchips.2016.7936226.
Article, 275 days ago
Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding
Yilong Zhao, Jiaming Tang, Kan Zhu, Zihao Ye, Chi-Chih Chang, Chih‐Jen Lin, Jongseok Park, Guangxuan Xiao, Mohamed S. Abdelfattah, Mingyu Gao, Baris Kasikci, Song Han, Ion Stoica (2025). Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding. , DOI: 10.48550/arxiv.2512.01278.
Preprint, 20 days ago