Wei Chen

Incoming Ph.D. Student

Nanjing University

Research Interests

Artificial Intelligence

Large Language Models

Reinforcement Learning

About

I will join Nanjing University as an incoming PhD student in Fall 2026, advised by Prof. Guanghui Zhu. Prior to this, I obtained an M.Sc. in Software Engineering (under the supervision of Prof. Yafei Li and a B.Sc. in Mechanical Engineering from Zhengzhou University.

My current research focuses on the training and fine-tuning of large language models, with an emphasis on preference alignment and reinforcement learning from human feedback.

Selected Publications

View All →

Credit Assignment and Fine-Tuning Enhanced Reinforcement Learning for Collaborative Spatial Crowdsourcing

Wei Chen, Yafei Li, Baolong Mei, Guanglei Zhu, Jiaqi Wu, Mingliang Xu

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

We propose CAFE, a multi-agent RL framework for spatial crowdsourcing that addresses delayed rewards and non-stationary distributions through credit assignment mechanisms and adaptive fine-tuning, achieving superior task completion and equitable reward distribution.

Gradient-Guided Credit Assignment and Joint Optimization for Dependency-Aware Spatial Crowdsourcing

Yafei Li, Wei Chen, Jinxing Yan, Huiling Li, Lei Gao, Mingliang Xu

Proceedings of the AAAI Conference on Artificial Intelligence

We propose RMO, a two-stage framework for dependency-aware spatial crowdsourcing that uses multi-agent RL for subtask recommendation and utility-based matching, employing meta-gradients and gradient synchronization to address credit assignment and joint optimization challenges.

News

2025-04

My paper Credit Assignment and Fine-Tuning Enhanced Reinforcement Learning for Collaborative Spatial Crowdsourcing has been accepted to IJCAI 2025 🎉.

2024-12

My paper Gradient-Guided Credit Assignment and Joint Optimization for Dependency-Aware Spatial Crowdsourcing has been accepted to AAAI 2025 Oral 🎉 .