Publications

Google Scholar

\* denotes equal contribution.

2025

  1. NeurIPS Oral
    Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning
    Jiayi Yuan, Hao Li, Xinheng Ding, and 7 more authors
    2025
  2. NeurIPS Workshop
    Who Routes the Router: Rethinking the Evaluation of LLM Routing Systems
    Jiayi Yuan, Yifan Lu, Rixin Liu, and 7 more authors
    2025
  3. NeurIPS Workshop
    Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
    Xuansheng Wu, Jiayi Yuan, Wenlin Yao, and 2 more authors
    2025
  4. EMNLP Findings
    LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem
    Hongyi Liu, Shaochen Zhong, Xintong Sun, and 12 more authors
    2025
  5. TMLR
    Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
    Yang Sui, Yu-Neng Chuang, Guanchu Wang, and 8 more authors
    Transactions on Machine Learning Research, 2025
  6. ACL Findings
    ReasonerRank: Redefining Language Model Evaluation with Ground-Truth-Free Ranking Frameworks
    Jiamu Zhang, Jiayi Yuan, Andrew Wen, and 5 more authors
    The 63rd Annual Meeting of the Association for Computational Linguistics, 2025
  7. NAACL Findings
    DHP Benchmark: Are LLMs Good NLG Evaluators?
    Yicheng Wang*, Jiayi Yuan*, Yu-Neng Chuang, and 7 more authors
    The 2025 Conference of the North American Chapter of the Association for Computational Linguistics, 2025

2024

  1. EMNLP Findings
    KV cache compression, but what must we give in return? A comprehensive benchmark of long context capable approaches
    Jiayi Yuan*, Hongyi Liu*, Shaochen Zhong*, and 9 more authors
    The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  2. EMNLP
    Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
    Guanchu Wang*, Yu-Neng Chuang*, Ruixiang Tang, and 8 more authors
    The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  3. ICML
    KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
    Zirui Liu*, Jiayi Yuan*, Hongye Jin, and 5 more authors
    2024
  4. ICML
    GNNs Also Deserve Editing, and They Need It More Than Once
    Shaochen Zhong, Duy Le, Zirui Liu, and 8 more authors
    2024

2023

  1. NeurIPS
    Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots
    Ruixiang Tang*, Jiayi Yuan*, Yiming Li, and 3 more authors
    Advances in Neural Information Processing Systems, 2023
  2. AMIA
    Large language models for healthcare data augmentation: An example on patient-trial matching
    Jiayi Yuan, Ruixiang Tang, Xiaoqian Jiang, and 1 more author
    In AMIA Annual Symposium Proceedings, 2023
  3. AMIA
    Towards fair patient-trial matching via patient-criterion level fairness constraint
    Chia-Yuan Chang, Jiayi Yuan, Sirui Ding, and 5 more authors
    In AMIA Annual Symposium Proceedings, 2023
  4. ACM-BCB
    Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke
    Qizhang Feng, Jiayi Yuan, Forhan Bin Emdad, and 3 more authors
    In Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, 2023
  5. DAC
    Robust tickets can transfer better: Drawing more transferable subnetworks in transfer learning
    Yonggan Fu, Ye Yuan, Shang Wu, and 2 more authors
    In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023
  6. DAC
    NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants
    Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, and 2 more authors
    In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023
  7. ISCA
    Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design
    Yonggan Fu, Zhifan Ye, Jiayi Yuan, and 4 more authors
    In Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
  8. ICASSP
    ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement
    Chaojian Li*, Wenwan Chen*, Jiayi Yuan*, and 2 more authors
    In 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

2022

  1. ICML
    DepthShrinker: A new compression paradigm towards boosting real-hardware efficiency of compact neural networks
    Yonggan Fu, Haichuan Yang, Jiayi Yuan, and 5 more authors
    In International Conference on Machine Learning, 2022
  2. ISCA
    EyeCoD: Eye tracking system acceleration via FlatCam-based algorithm & accelerator co-design
    Haoran You*, Cheng Wan*, Yang Zhao*, and 8 more authors
    In Proceedings of the 49th Annual International Symposium on Computer Architecture, 2022