Publications

Google Scholar

* equal contribution.

2024

  1. EMNLP
    KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
    Jiayi Yuan*, Hongyi Liu*, Shaochen Zhong*, and 9 more authors
    In the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  2. EMNLP
    Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
    Guanchu Wang*, Yu-Neng Chuang*, Ruixiang Tang, and 8 more authors
    In the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  3. ICML
    KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
    Zirui Liu*, Jiayi Yuan*, Hongye Jin, and 5 more authors
    In International Conference on Machine Learning, 2024
  4. ICML
    GNNs Also Deserve Editing, and They Need It More Than Once
    Shaochen Zhong, Duy Le, Zirui Liu, and 8 more authors
    In International Conference on Machine Learning, 2024

2023

  1. NeurIPS
    Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots
    Ruixiang Tang*, Jiayi Yuan*, Yiming Li, and 3 more authors
    Advances in Neural Information Processing Systems, 2023
  2. AMIA
    Large language models for healthcare data augmentation: An example on patient-trial matching
    Jiayi Yuan, Ruixiang Tang, Xiaoqian Jiang, and 1 more author
    In AMIA Annual Symposium Proceedings, 2023
  3. AMIA
    Towards fair patient-trial matching via patient-criterion level fairness constraint
    Chia-Yuan Chang, Jiayi Yuan, Sirui Ding, and 5 more authors
    In AMIA Annual Symposium Proceedings, 2023
  4. ACM-BCB
    Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke
    Qizhang Feng, Jiayi Yuan, Forhan Bin Emdad, and 3 more authors
    In Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, 2023
  5. DAC
    Robust tickets can transfer better: Drawing more transferable subnetworks in transfer learning
    Yonggan Fu, Ye Yuan, Shang Wu, and 2 more authors
    In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023
  6. DAC
    NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants
    Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, and 2 more authors
    In 2023 60th ACM/IEEE Design Automation Conference (DAC), 2023
  7. IEEE Micro
    EyeCoD: Eye Tracking System Acceleration via FlatCam-based Algorithm & Hardware Co-Design
    Haoran You*, Yang Zhao*, Cheng Wan*, and 8 more authors
    IEEE Micro, 2023
  8. ISCA
    Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design
    Yonggan Fu, Zhifan Ye, Jiayi Yuan, and 4 more authors
    In Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
  9. ICASSP
    ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement
    Jiayi Yuan*, Chaojian Li*, Wenwan Chen*, and 2 more authors
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

2022

  1. ICML
    DepthShrinker: a new compression paradigm towards boosting real-hardware efficiency of compact neural networks
    Yonggan Fu, Haichuan Yang, Jiayi Yuan, and 5 more authors
    In International Conference on Machine Learning, 2022
  2. ISCA
    EyeCoD: eye tracking system acceleration via FlatCam-based algorithm & accelerator co-design
    Haoran You*, Cheng Wan*, Yang Zhao*, and 8 more authors
    In Proceedings of the 49th Annual International Symposium on Computer Architecture, 2022