Zhijun Chen
Ph.D. graduate from Beihang University (Beijing, China)
Currently, I am a Visiting Scholar in the Lab of Prof. Yikun Ban at Beihang University. I earned my PhD from the School of Computer Science at Beihang University, advised by Prof. Hailong Sun. I have also had the privilege of working closely with and learning from Prof. Jie Yang from Delft University of Technology.
Research. Recently, focus on 1) Ensemble Learning for LLMs (arXiv 2025), including Ensemble Inference/RL/SFT for LLMs, and other related topics (e.g., Best-of-N Test-Time Scaling, Multi-Prompt Learning); 2) Reinforcement Learning for LLMs. Looking ahead, I will stay open-minded and aim to embark on research that is more broadly significant.
Ongoing. 1) Ensemble RL for LLMs (MARL); 2) LLM Ensemble Inference + Test-Time Scaling; 3) An extended Journal version of our survey (arXiv 2025) on LLM Ensemble;
Collaboration. If you are interested in my research, please feel free to reach out; for early-stage researchers, I am also happy to provide guidance and discuss ideas.
- [2026-05] I will start my postdoctoral research soon.
- [2026-05] Our survey "A Survey on LLM Ensemble" (100+ citations and 200+ GitHub stars) accepted by IJCAI 2026. Project, Github, Paper.
- [2026-04/05] Three papers accepted by ICML 2026, ICDE 2026, and ACL Findings 2026.
- [2025-12] New arXiv preprint "Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process", introduces an unsupervised method to ensemble multiple LLM outputs. Project, Github, Paper.
- [2025-12] Our 2025 survey "A Survey on LLM Ensemble" has gained 50+ citations and 180+ GitHub stars as of late Dec. 2025. Project, Github, Paper.