About
I will join Nanjing University as a PhD student in Fall 2026, advised by Prof. Guanghui Zhu. Prior to this, I obtained an M.Sc. in Software Engineering (under the supervision of Prof. Yafei Li) and a B.Sc. in Mechanical Engineering from Zhengzhou University.
My current research focuses on the training and fine-tuning of large language models, with an emphasis on preference alignment and reinforcement learning from human feedback.
Selected Publications
Credit Assignment and Fine-Tuning Enhanced Reinforcement Learning for Collaborative Spatial Crowdsourcing
Wei Chen, Yafei Li, Baolong Mei, Guanglei Zhu, Jiaqi Wu, Mingliang Xu
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
We propose CAFE, a multi-agent RL framework for spatial crowdsourcing that addresses delayed rewards and non-stationary distributions through credit assignment mechanisms and adaptive fine-tuning, achieving superior task completion and equitable reward distribution.
Gradient-Guided Credit Assignment and Joint Optimization for Dependency-Aware Spatial Crowdsourcing
Yafei Li, Wei Chen, Jinxing Yan, Huiling Li, Lei Gao, Mingliang Xu
Proceedings of the AAAI Conference on Artificial Intelligence
We propose RMO, a two-stage framework for dependency-aware spatial crowdsourcing that uses multi-agent RL for subtask recommendation and utility-based matching, employing meta-gradients and gradient synchronization to address credit assignment and joint optimization challenges.
News
My paper Credit Assignment and Fine-Tuning Enhanced Reinforcement Learning for Collaborative Spatial Crowdsourcing has been accepted to IJCAI 2025.
My paper Gradient-Guided Credit Assignment and Joint Optimization for Dependency-Aware Spatial Crowdsourcing has been accepted to AAAI 2025 as an oral presentation.

