About Me

I received my Ph.D. from Peking University in 2026, advised by Prof. Zongqing Lu. I received my Bachelor's degree from the Turing Class at Peking University in 2021.

My research interests lie in Embodied AI and Reinforcement Learning (RL). My current research focuses on foundation models for robotic manipulation.

I am open to collaborations and discussions.

Selected Papers

qwenrobotmanip

Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models

Haoqi Yuan*, Zhixuan Liang*, Anzhe Chen*, Ye Wang*, Haoyang Li*, Pei Lin*, Yiyang Huang*, Zixing Lei*, Tong Zhang*, Jiazhao Zhang, Jie Zhang, Jingyang Fan, Gengze Zhou, Qihang Peng, Chenxu Lv, Xiaoyue Chen, An Yang, Fei Huang, Junyang Lin, Dayiheng Liu, Jingren Zhou, Chenfei Wu§, Xiong-Hui Chen§
2026

A scalable Vision-Language-Action foundation model that aligns heterogeneous robots and substantially outperforms prior models in generalization.

DemoGrasp

DemoGrasp: Universal Dexterous Grasping from a Single Demonstration

Haoqi Yuan*, Ziye Huang*, Ye Wang, Chuan Mao, Chaoyi Xu, Zongqing Lu
ICLR 2026

A simple and effective RL approach for universal grasping using any dexterous hand.

CrossDex

Cross-Embodiment Dexterous Grasping with Reinforcement Learning

Haoqi Yuan, Bohan Zhou, Yuhui Fu, Zongqing Lu
ICLR 2025

Train a human-hand policy that can generalize to different dexterous hands.

RLGPT

RL-GPT: Integrating Reinforcement Learning and Code-as-Policy

Shaoteng Liu, Haoqi Yuan, Minda Hu, Yanwei Li, Yukang Chen, Shu Liu, Zongqing Lu, Jiaya Jia
NeurIPS 2024 oral

An LLM agent equipped with RL for continual learning in open-world environments.

MGPO

Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation

Haoqi Yuan, Yuhui Fu, Feiyang Xie, Zongqing Lu
NeurIPS 2024

Propose a multi-goal-conditioned Transformer policy for fast adaptation.

PTGM

Pre-Training Goal-Based Models for Sample-Efficient Reinforcement Learning

Haoqi Yuan, Zhancun Mu, Feiyang Xie, Zongqing Lu
ICLR 2024 oral (top 1.2%)

Leverage pre-training to bootstrap RL for open-world long-horizon tasks.

CORRO

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning

Haoqi Yuan, Zongqing Lu
ICML 2022

A contrastive learning method for robust task representations in offline meta-RL.

Services

Conference Reviewer

ICML'22,24,25; NeurIPS'22,23,24,25; ICLR'24,25,26; AAAI'23,24,25,26; CVPR'24,26; CORL'25; IROS'25.

Teaching Assistant

Deep Reinforcement Learning, Zongqing Lu, 2023 Spring
Computational Thinking in Social Science, Xiaoming Li, 2020 Autumn
Deep Generative Models, Hao Dong, 2020 Spring