📄 Academic Homepage

📖 Short Biography
❤️ Research Area

Computing systems for large models, high-performance computing, computer architecture

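📢 Recruitment Statement

【Welcome to Our Research Group】

Contact email: mlsys_research@163.com

****Master's students (3 openings) and PhD students (direct-entry from a bachelor's degree and regular PhD track, 1-2 openings) seeking exam-exempt recommended admission for 2027 are welcome to contact me in advance with a CV, transcripts, and related materials****

(Note: master's and PhD applicants admitted by exam-exempt recommendation (for direct-entry PhD from a bachelor's degree, in September) must register in the Ministry of Education's recommendation-admission system after the summer break of their third undergraduate year. You are welcome to get in touch and talk with me before registering. Outstanding undergraduates are also very welcome to join the group to do research.)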

🏅 Recent Awards
👥 Research Projects
📖 Selected Publications
【TPDS】 Xiaqing Li, Qi Guo, Guangyan Zhang, Siwei Ye, Guanhua He, Yiheng Yao, Yifan Hao, Zidong Du, Weimin Zheng. "FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning with Partitioning and Parallelism of Search Space," IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume: 35, Issue: 7, Pages: 1174-1188, July 2024. (Computer Systems/High-Performance Computing, CCF A)
【TPDS】 Xiaqing Li, Guangyan Zhang, Weimin Zheng. "SmartTuning: Selecting HyperParameters of a ConvNet System for Fast Training and Small Working Memory," IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume: 32, Issue: 7, Pages: 1690-1701, July 2021. (Computer Systems/High-Performance Computing, CCF A)
【TPDS】 Xiaqing Li, Guangyan Zhang, Zhufan Wang, Weimin Zheng. "HyConv: Accelerating Multi-phase CNN Computation by Fine-grained Policy Selection," IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume: 30, Issue: 2, Pages: 388-399, Feb. 2019. (Computer Systems/High-Performance Computing, CCF A)
【ICPP】 Xiaqing Li, Guangyan Zhang, H. Howie Huang, Zhufan Wang, Weimin Zheng. "Performance Analysis of GPU-based Convolutional Neural Networks," in 45th International Conference on Parallel Processing (ICPP-2016), Philadelphia, PA, USA, August 2016. (Computer Systems/High-Performance Computing, CCF B)
【Book】 Xiaqing Li, Guangyan Zhang, Keqin Li, Weimin Zheng. "Deep learning and its parallel acceleration techniques," in Big Data: Principles and Paradigms, Morgan Kaufmann/Elsevier, 2015. (Computer Systems/High-Performance Computing)
【TODAES】 Zhengyang Lyu, Xiaqing Li, Zidong Du, Rui Zhang, Qi Guo. "LE-Timing: Layout-Aware and Explainable Timing Prediction," in ACM Transactions on Design Automation of Electronic Systems, Just Accepted, May 2026. (Computer Systems/High-Performance Computing, CCF B)
【SCIS】 Wenkai He, Xiaqing Li, Xinkai Song, Yifan Hao, Rui Zhang, Zidong Du, Yunji Chen. "Chip design with machine learning: a survey from algorithm perspective," Science China Information Sciences (SCIS), Oct 19, 2023. (Computer Systems/High-Performance Computing, CCF A)
【ICCAD】 Zhengyang Lv, Xiaqing Li, Zidong Du, Qi Guo. "Explainable and Layout-Aware Timing Prediction," 2024 ACM/IEEE International Conference on Computer-Aided Design (ICCAD-2024), Volume: 35, July 2024. (Computer Systems/High-Performance Computing, CCF B)
【ICLR】 Jun Bi, Xiaqing Li, Qi Guo, Rui Zhang, Yuanbo Wen, Xing Hu, Zidong Du, Xinkai Song, Yifan Hao, Yunji Chen. "BALTO: Efficient Tensor Program Optimization with Diversity-Based Active Learning," International Conference on Learning Representations (ICLR 2023). (Computer Systems/High-Performance Computing, TH-CPL Class A)
【CoRR】 Feiyu Yao, Zhixiong Niu, Xiaqing Li, Yongqiang Xiong, Juan Fang, Qian Wang. "An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference," in CoRR, abs/2605.07719, 2026. (Computer Systems/High-Performance Computing)
【TMC】 Jianhang Xie, Chuntao Ding, Xiaqing Li, Shenyuan Ren, Yidong Li, Zhichao Lu. "NestQuant: Post-Training Integer-Nesting Quantization for On-Device DNN," IEEE Transactions on Mobile Computing (TMC), 2025. (Computer Systems/High-Performance Computing, CCF A)
【JCST】 Zhexing Zhang, Yuanbo Wen, Hanqi Lyu, Chang Liu, Rui Zhang, Xiaqing Li, Chao Wang, Zidong Du, Qi Guo, Ling Li, Xuehai Zhou, Yunji Chen. "AI Computing Systems for Large Language Models Training," in Journal of Computer Science and Technology, vol.40, no.1, pp.6-41, Jan. 2025. (Computer Systems/High-Performance Computing, CCF B)
【ASPLOS】 Jun Bi, Qi Guo, Xiaqing Li, Yongwei Zhao, Yuanbo Wen, Yuxuan Guo, Enshuai Zhou, Xing Hu, Zidong Du, Ling Li, Huaping Chen, Tianshi Chen. "Heron: Automatically Constrained High-performance Library Generation for Deep Learning Accelerators," Proceedings of the 28th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'23), in Canada, Volume 3: 314-328. (Computer Systems/High-Performance Computing, CCF A)
【TCAD】 Hongrui Guo, Yongwei Zhao, Zhangmai Li, Yifan Hao, Tianrui Ma, Mo Zou, Chang Liu, Xinkai Song, Xiaqing Li, Zidong Du, Rui Zhang, Qi Guo, Zhiwei Xu, Tianshi Chen. "A Systolic Random Increment Memory Architecture for Unary Computing," in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, pp.1-1, 2026. (Computer Systems/High-Performance Computing, CCF A)
【TCAD】 Chongxiao Li, Di Huang, Pengwei Jin, Tianyun Ma, Husheng Han, Shuyao Cheng, Yifan Hao, Yongwei Zhao, Guanglin Xu, Zidong Du, Rui Zhang, Xiaqing Li, Yuanbo Wen, Xing Hu, Qi Guo. "AGON: Automated Design Framework for Customizing Processors From ISA Documents," in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.45, no.5, pp.2362-2375, 2026. (Computer Systems/High-Performance Computing, CCF A)
【TC】 Jun Bi, Yuanbo Wen, Xiaqing Li, Yongwei Zhao, Yuxuan Guo, Enshuai Zhou, Xing Hu, Zidong Du, Ling Li, Huaping Chen, Tianshi Chen, Qi Guo. "Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators," IEEE Transactions on Computers, 74(1):155-169, 2025. (Computer Systems/High-Performance Computing, CCF A)
【TCAD】 Pengwei Jin, Zhe Fan, Yongwei Zhao, Zidong Du, Hongrui Guo, Ziyuan Nan, Yifan Hao, Chongxiao Li, Tianyun Ma, Zhenxing Zhang, Xiaqing Li, Wei Li, Xing Hu, Qi Guo, Zhiwei Xu, Tianshi Chen. "SaaP: Rearchitect SoC-as-a-Processor to Orchestrate Hardware Heterogeneity," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 44(10):3962-3975, 2025. (Computer Systems/High-Performance Computing, CCF A)
【TCAD】 Ximing Liu, Yongwei Zhao, Mo Zou, Yang Liu, Yifan Hao, Xiaqing Li, Rui Zhang, Yuanbo Wen, Xing Hu, Zidong Du, Qi Guo, Tianshi Chen. "VariPar: Variation-Aware Workload Partitioning in Chiplet-Based DNN Accelerators," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 44(12):4643-4656, 2025. (Computer Systems/High-Performance Computing, CCF A)
【NeurIPS】 Haochen Li, Rui Zhang, Hantao Yao, Xin Zhang, Yifan Hao, Xinkai Song, Xiaqing Li, Yongwei Zhao, Yunji Chen, Ling Li. "DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection," Advances in Neural Information Processing Systems 37 (NeurIPS), 2024. (Computer Systems/High-Performance Computing, CCF A)
【MICRO】 Yi Chen, Yongwei Zhao, Yifan Hao, Yuntao Dai, Yang Liu, Rui Zhang, Mo Zou, Yuanbo Wen, Xinkai Song, Xiaqing Li, Xing Hu, Zidong Du, Huaping Chen, Qi Guo, Tianshi Chen. "EMP: Efficient 4-bit Matrix Unit via Primitivization," The 57th ACM/IEEE International Symposium on Microarchitecture (MICRO 2024). (Computer Systems/High-Performance Computing, CCF A)
【ASPLOS】 Husheng Han, Xinyao Zheng, Yifan Hao, Ling Liang, Erhu Feng, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Yuanbo Wen, Xinkai Song, Zidong Du, Qi Guo, Xing Hu. "TensorTEE: Unified Granularity Heterogeneous TEE for Efficient Secure Collaborative Computing," Proceedings of the 29th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'24). (Computer Systems/High-Performance Computing, CCF A)
【DAC】 Shuyao Cheng, Chongxiao Li, Zidong Du, Rui Zhang, Xing Hu, Xiaqing Li, Guanglin Xu, Yuanbo Wen, Qi Guo. "Revisiting Automatic Pipelining: Gate-level Forwarding and Speculation," Design Automation Conference 2024 (DAC'24), July 2024. (Computer Systems/High-Performance Computing, CCF A)
【ISCA】 Weihao Kong, Yifan Hao, Yongwei Zhao, Xinkai Song, Xiaqing Li, Mo Zou, Rui Zhang, Xing Hu, Wei Li, Zidong Du, Qi Guo, Zhiwei Xu, Tianshi Chen. "DiffBoost: Full-Network Differential Acceleration for Diffusion Models," The 51st Annual International Symposium on Computer Architecture 2024 (ISCA '24). (Computer Systems/High-Performance Computing, CCF A)
【MICRO】 Hongrui Guo, Yongwei Zhao, Zhangmai Li, Yifan Hao, Chang Liu, Xinkai Song, Xiaqing Li, Zidong Du, Rui Zhang, Qi Guo, Tianshi Chen, Zhiwei Xu. "Cambricon-U: A Systolic Random Increment Memory Architecture for Unary Computing," The 56th ACM/IEEE International Symposium on Microarchitecture (MICRO 2023). (Computer Systems/High-Performance Computing, CCF A)
【MICRO】 Yifan Hao, Yongwei Zhao, Chenxiao Liu, Zidong Du, Shuyao Cheng, Xiaqing Li, Xing Hu, Qi Guo, Zhiwei Xu, Tianshi Chen. "Cambricon-P: A Bitflow Architecture for Arbitrary Precision Computing," The 55th ACM/IEEE International Symposium on Microarchitecture (MICRO 2022). (Computer Systems/High-Performance Computing, CCF A)
【ICML】 Yuanbo Wen, Qi Guo, Qiang Fu, Xiaqing Li, Jianxing Xu, Yanlin Tang, Yongwei Zhao, Xing Hu, Zidong Du, Ling Li, Chao Wang, Xuehai Zhou, Yunji Chen. "BabelTower: Learning to Auto-parallelized Program Translation," International Conference on Machine Learning (ICML), July 2022. (Computer Systems/High-Performance Computing, CCF A)
【MCSoC】 Zhenshan Bao, Junnan Guo, Xiaqing Li, Wenbo Zhang. "MSCU: Accelerating CNN Inference with Multiple Sizes of Compute Unit on FPGAs," IEEE MCSoC, 2021. (Computer Systems/High-Performance Computing)
【Micromachines】 Juan Fang, Di Zhang, Xiaqing Li. "ParRouting: An Efficient Area Partition-Based Congestion-Aware Routing Algorithm for NoCs," Micromachines, 2020, 11(12). (Computer Systems/High-Performance Computing, SCI Zone 3)
👥 Professional Services
China Computer Federation (CCF):
Session Chair:
Member of the Research Posters Program:
TPC Member:
Reviewer for journals:
External reviewer for conferences: