Publications | Jiayi Yuan

Google Scholar

\(*\) equal contribution.

2025

NeurIPS Oral

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Jiayi Yuan, Hao Li, Xinheng Ding, and 7 more authors

2025

PDF
NeurIPS Workshop

Who Routes the Router: Rethinking the Evaluation of LLM Routing Systems

Jiayi Yuan, Yifan Lu, Rixin Liu, and 7 more authors

2025
NeurIPS Workshop

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencode

Xuansheng Wu, Jiayi Yuan, Wenlin Yao, and 2 more authors

2025
EMNLP Findings

LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem

Hongyi Liu, Shaochen Zhong, Xintong Sun, and 12 more authors

2025

PDF
TMLR

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Yang Sui, Yu-Neng Chuang, Guanchu Wang, and 8 more authors

Transactions on Machine Learning Research, 2025

PDF
ACL Findings

ReasonerRank: Redefining Language Model Evaluation with Ground-Truth-Free Ranking Frameworks

Jiamu Zhang, Jiayi Yuan, Andrew Wen, and 5 more authors

The 63rd Annual Meeting of the Association for Computational Linguistics, 2025

PDF
NAACL Findings

DHP Benchmark: Are LLMs Good NLG Evaluators?

Yicheng Wang*, Jiayi Yuan*, Yu-Neng Chuang, and 7 more authors

the 2025 Conference of the North American Chapter of the Association for Computational Linguistics, 2025

PDF

2024

EMNLP Findings

Kv cache compression, but what must we give in return? a comprehensive benchmark of long context capable approaches

Jiayi Yuan*, Hongyi Liu*, Shaochen Zhong*, and 9 more authors

the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PDF
EMNLP

Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion

Guanchu Wang*, Yu-Neng Chuang*, Ruixiang Tang, and 8 more authors

the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PDF
ICML

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Zirui Liu*, Jiayi Yuan*, Hongye Jin, and 5 more authors

2024

PDF
ICML

GNNs Also Deserve Editing, and They Need It More Than Once

Shaochen Zhong, Duy Le, Zirui Liu, and 8 more authors

2024

PDF

2023

NeurIPS

Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots

Ruixiang Tang*, Jiayi Yuan*, Yiming Li, and 3 more authors

Advances in Neural Information Processing Systems, 2023

PDF
AMIA

Large language models for healthcare data augmentation: An example on patient-trial matching

Jiayi Yuan, Ruixiang Tang, Xiaoqian Jiang, and 1 more author

In AMIA Annual Symposium Proceedings, 2023

PDF
AMIA

Towards fair patient-trial matching via patient-criterion level fairness constraint

Chia-Yuan Chang, Jiayi Yuan, Sirui Ding, and 5 more authors

In AMIA Annual Symposium Proceedings, 2023

PDF
ACM-BCB

Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke

Qizhang Feng, Jiayi Yuan, Forhan Bin Emdad, and 3 more authors

In Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, 2023

PDF
DAC

Robust tickets can transfer better: Drawing more transferable subnetworks in transfer learning

Yonggan Fu, Ye Yuan, Shang Wu, and 2 more authors

In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023

PDF
DAC

NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants

Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, and 2 more authors

In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023

PDF
ISCA

Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design

Yonggan Fu, Zhifan Ye, Jiayi Yuan, and 4 more authors

In Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

PDF
ICASSP

ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement

Chaojian Li*, Wenwan Chen*, Jiayi Yuan*, and 2 more authors

In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

PDF

2022

ICML

DepthShrinker: a new compression paradigm towards boosting real-hardware efficiency of compact neural networks

Yonggan Fu, Haichuan Yang, Jiayi Yuan, and 5 more authors

In International Conference on Machine Learning, 2022

PDF
ISCA

EyeCoD: eye tracking system acceleration via flatcam-based algorithm & accelerator co-design

Haoran You*, Cheng Wan*, Yang Zhao*, and 8 more authors

In Proceedings of the 49th Annual International Symposium on Computer Architecture, 2022

PDF