Zhijun Chen
Postdoc at The Hong Kong Polytechnic University; Ph.D. from Beihang University.
I am a Postdoctoral Researcher with Prof. Xiao Huang at the Department of Computing, The Hong Kong Polytechnic University. Previously, I was a Visiting Scholar in the Lab of Prof. Yikun Ban at Beihang University. I earned my Ph.D. from the School of Computer Science at Beihang University, advised by Prof. Hailong Sun. I have also had the privilege of working closely with Prof. Jie Yang from Delft University of Technology.
Research. Recently, I have focused on 1) Ensemble Learning for LLMs (arXiv 2025), including Ensemble Inference/RL/SFT for LLMs and related topics (e.g., Best-of-N Test-Time Scaling, Multi-Prompt Learning); and 2) Reinforcement Learning for LLMs. Looking ahead, I will stay open-minded and aim to pursue research of broader significance.
Ongoing. 1) Ensemble RL for LLMs (MARL); 2) LLM Ensemble Inference + Test-Time Scaling; 3) An extended journal version of our survey on LLM Ensemble (arXiv 2025).
Collaboration. If you are interested in my research, please feel free to reach out; for early-stage researchers, I am also happy to provide guidance and discuss ideas.
- [2026-05] Our survey "A Survey on LLM Ensemble" (100+ citations and 200+ GitHub stars) was accepted by IJCAI 2026. Project, GitHub, Paper.
- [2026-04/05] Three papers accepted by ICML 2026, ICDE 2026, and ACL Findings 2026.
- [2025-12] New arXiv preprint "Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process", which introduces an unsupervised method to ensemble multiple LLM outputs. Project, GitHub, Paper.
- [2025-12] Our 2025 survey "A Survey on LLM Ensemble" has gained 50+ citations and 180+ GitHub stars as of late Dec. 2025. Project, GitHub, Paper.