Qizhi Pei (裴启智 in Chinese) is currently a fourth-year Ph.D. student in the ALOHA group at the Gaoling School of Artificial Intelligence (GSAI), Renmin University of China (RUC), supervised by Prof. Rui Yan. He received his B.S. degree from the School of Computer Science and Technology, University of Science and Technology of China (USTC) in 2022. He is currently an intern at OpenDataLab, Shanghai Artificial Intelligence Laboratory, mentored by Dr. Lijun Wu. His research focuses on

🔥 News

📝 AI4Science

  1. EMNLP 2023: BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations, Qizhi Pei, Wei Zhang, Jinhua Zhu, Kehan Wu, Kaiyuan Gao, Lijun Wu, Yingce Xia, Rui Yan | Hugging Face (>120K model downloads)

  2. ACL 2024 (Findings): BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning, Qizhi Pei, Lijun Wu, Kaiyuan Gao, Xiaozhuan Liang, Yin Fang, Jinhua Zhu, Shufang Xie, Tao Qin, Rui Yan | Hugging Face

  3. ICLR 2025: 3D-MolT5: Leveraging Discrete Structural Information for Molecule-Text Modeling, Qizhi Pei, Lijun Wu, Kaiyuan Gao, Jinhua Zhu, Rui Yan | Hugging Face

  4. NeurIPS 2023: FABind: Fast and Accurate Protein-Ligand Binding, Qizhi Pei (co-first author), Kaiyuan Gao, Lijun Wu, Jinhua Zhu, Yingce Xia, Shufang Xie, Tao Qin, Kun He, Tie-Yan Liu, Rui Yan | Project Page | Hugging Face

  5. Language + Molecules @ ACL 2024 Workshop (Oral): Enhanced BioT5+ for Molecule-Text Translation: A Three-Stage Approach with Data Distillation, Diverse Training, and Voting Ensemble, Qizhi Pei, Lijun Wu, Kaiyuan Gao, Jinhua Zhu, Rui Yan
    1. 🥇 1st Place in the Text-based Molecule Generation Track.
    2. 🥈 2nd Place in the Molecular Captioning Track.
  6. CIKM 2024: Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors, Qizhi Pei (co-first author), Lijun Wu, Zhenyu He, Jinhua Zhu, Yingce Xia, Shufang Xie, Rui Yan

  7. Briefings in Bioinformatics 2023: SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction, Qizhi Pei, Lijun Wu, Jinhua Zhu, Yingce Xia, Shufang Xie, Tao Qin, Haiguang Liu, Tie-Yan Liu, Rui Yan

  8. KDD 2025: FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation, Kaiyuan Gao, Qizhi Pei, Jinhua Zhu, Tao Qin, Kun He, Lijun Wu

  9. Nature Communications 2024: TamGen: drug design with target-aware molecule generation through a chemical language model, Kehan Wu, Yingce Xia, Pan Deng, Renhe Liu, Yuan Zhang, Han Guo, Yumeng Cui, Qizhi Pei, … , Tao Qin, Tie-Yan Liu

  10. Preprint: Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey, Qizhi Pei, Lijun Wu, Kaiyuan Gao, Jinhua Zhu, Yue Wang, Zun Wang, Tao Qin, Rui Yan

  11. Preprint: Nature Language Model: Deciphering the Language of Nature for Scientific Discovery, Yingce Xia, Peiran Jin, Shufang Xie, Liang He, Chuan Cao, … , Qizhi Pei, … , Tie-Yan Liu, Haiguang Liu, Tao Qin | Project | Hugging Face

  12. Preprint: Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates, Kaiyuan Gao, Yusong Wang, Haoxiang Guan, Zun Wang, Qizhi Pei, John E. Hopcroft, Kun He, Lijun Wu

📝 LLMs

  1. ACL 2025: MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion, Qizhi Pei, Lijun Wu, Zhuoshi Pan, Yu Li, Honglin Lin, Chenlin Ming, Xin Gao, Conghui He, Rui Yan | Hugging Face

  2. ACL 2025: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis, Xin Gao, Qizhi Pei, Zinan Tang, Yu Li, Honglin Lin, Jiang Wu, Lijun Wu, Conghui He

  3. ACL 2025 (Findings): CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges, Yu Li, Qizhi Pei, Mengyuan Sun, Honglin Lin, Chenlin Ming, Xin Gao, Jiang Wu, Conghui He, Lijun Wu

  4. ACL 2025 (Findings): LEMMA: Learning from Errors for MatheMatical Advancement in LLMs, Zhuoshi Pan, Yu Li, Honglin Lin, Qizhi Pei, Zinan Tang, Wei Wu, Chenlin Ming, H. Vicky Zhao, Conghui He, Lijun Wu | Hugging Face

  5. EMNLP 2025: Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning, Zinan Tang, Xin Gao, Qizhi Pei, Zhuoshi Pan, Mengzhang Cai, Jiang Wu, Conghui He, Lijun Wu

  6. EMNLP 2025 (Findings): MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer, Honglin Lin, Zhuoshi Pan, Yu Li, Qizhi Pei, Xin Gao, Mengzhang Cai, Conghui He, Lijun Wu | Project

  7. Preprint: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once, Zhuoshi Pan, Qizhi Pei (co-first author), Yu Li, Qiyao Sun, Zinan Tang, H. Vicky Zhao, Conghui He, Lijun Wu | Project

  8. Preprint: IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment, Chenlin Ming, Chendi Qu, Mengzhang Cai, Qizhi Pei, Zhuoshi Pan, Yu Li, Xiaoming Duan, Lijun Wu, Conghui He

  9. Preprint: Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models, Kaiyuan Gao, Sunan He, Zhenyu He, Jiacheng Lin, Qizhi Pei, Jie Shao, Wei Zhang

🎖 Honors and Awards

  • 2023, Doctoral Scholarship for Elite Innovative Talents of Renmin University of China (中国人民大学拔尖创新人才).
  • 2022, Excellent Graduation Thesis, USTC.
  • 2022, Outstanding Undergraduate Awards, USTC.
  • 2018~2021, Outstanding Student Scholarship, USTC.

💬 Academic Service

  • Reviewer: NeurIPS, ACL, EMNLP, KDD, ICLR

📖 Education

  • 2022.09 - Now, Ph.D. student in the Gaoling School of Artificial Intelligence, Renmin University of China.
  • 2018.09 - 2022.06, undergraduate student in the School of Computer Science and Technology, University of Science and Technology of China.

💻 Internships