Yuqiao Tan

I am a first year master student at Institute of Automation, advised by Prof. Shizhu He. Before that, I received my B.E. from UESTC.

My recent research focuses on LLM reasoning [DyPRAG/Zero-Step], LLM interpretability [LaTen/BuPO], and reinforcement learning [BuPO].

Email  /  Github  /  Google Scholar

Yuqiao Tan
Publications

2025


Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
Yuqiao Tan*, Minzheng Wang*, Shizhu He, et al.
Preprint
paper  /  code
The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
NeurIPS 2025 Efficient Reasoning Workshop
paper  /  code
Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in LLMs
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
ACL 2025
paper  /  code
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
Yuqiao Tan, Shizhu He, Huanxuan Liao, Jun Zhao, Kang Liu
Preprint
paper  /  code
RobustPT: Dynamic Disentanglement Prompt Tuning in Vision-Language Models with Missing Modalities
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2025
paper  /  code

2024


MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Ruiting Dai*, Yuqiao Tan*, et al.
Preprint
paper
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2024
paper
Education

M.E., Pattern Recognition and Intelligent System, Institute of Automation, UCAS, 2025 - present

B.E., Software Engineering, University of Electronic Science and Technology of China, 2021 - 2025

Internship

Tsinghua University, SIG, Research Intern, 2023.07 - 2024.05

ByteDance, DCar-AI-Y, Research Intern, 2024.01 - 2024.07

Award

Outstanding Graduate of Sichuan Province, 2024

First Prize, Baidu Business AI Technology Innovation Competition (80000 RMB), 2024

Soong Ching Ling Scholarship, UESTC, 2023

National Scholarship, Ministry of Education, 2022

Invited Talk

NICE - Internal Policy of LLMs and Reinforcement Learning, 2026.01 [Video]

Reviewer

ICMR 2025, NeurIPS ER 2025