Publications
Accelerating Transformer Pre-training with 2:4 Sparsity [arXiv] [OpenReview] [PDF] [Project page]
Yuezhou Hu, Kang Zhao, Weiyu Huang, Jianfei Chen, Jun Zhu
International Conference on Machine Learning (ICML), 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training [arXiv] [OpenReview] [Project page]
Yuezhou Hu, Jun Zhu, Jianfei Chen
Conference on Neural Information Processing Systems (NeurIPS), 2024
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training [arXiv] [Project page]
Weiyu Huang, Yuezhou Hu, Guohao Jian, Jun Zhu, Jianfei Chen
AAAI Conference on Artificial Intelligence (AAAI), 2025
Working Papers
Identifying Sensitive Weights via Post-quantization Integral [arXiv]
Yuezhou Hu, Weiyu Huang, Zichen Liang, Chang Chen, Jintao Zhang, Jun Zhu, Jianfei Chen