Yuqiao Tan (谭宇乔)

Yuqiao Tan

I am a first year master student at Institute of Automation, advised by Prof. Shizhu He. Before that, I received my B.E. from UESTC.

My recent research focuses on LLM reasoning [DyPRAG/Zero-Step], LLM interpretability [LaTen/BuPO], and reinforcement learning [BuPO/PreRL].

Email / Github / Google Scholar

Publications

2026

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
Yuqiao Tan*, Minzheng Wang*, Bo Liu, Zichen Liu, Tian Liang, Shizhu He, Jun Zhao, Kang Liu
Preprint
paper / code

2025

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
Yuqiao Tan*, Minzheng Wang*, Shizhu He, Huanxuan Liao, Chengfeng Zhao, Qiunan Lu, Tian Liang, Jun Zhao, Kang Liu
Preprint
paper / code

The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
NeurIPS 2025 Efficient Reasoning Workshop
paper / code

Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in LLMs
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
ACL 2025
paper / code

Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
Yuqiao Tan, Shizhu He, Huanxuan Liao, Jun Zhao, Kang Liu
Preprint
paper / code

RobustPT: Dynamic Disentanglement Prompt Tuning in Vision-Language Models with Missing Modalities
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2025
paper / code

2024

MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Ruiting Dai*, Yuqiao Tan*, et al.
Preprint
paper

G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2024
paper

Education

M.E., Pattern Recognition and Intelligent System, Institute of Automation, UCAS, 2025 - present

B.E., Software Engineering, University of Electronic Science and Technology of China, 2021 - 2025

Internship

Tsinghua University, SIG, Research Intern, 2023.07 - 2024.05, Focus on GNN, IoT

ByteDance, DCar-AI-Y, Research Intern, 2024.01 - 2024.07, Focus on RAG, GenIR

Award

Outstanding Graduate of Sichuan Province, 2024

First Prize, Baidu Business AI Technology Innovation Competition (80000 RMB), 2024

Soong Ching Ling Scholarship, UESTC, 2023

National Scholarship, Ministry of Education, 2022

Invited Talk

NICE - Internal Policy of LLMs and Reinforcement Learning, 2026.01 [Video]

Reviewer

COLM 2026, ICMR 2025, NeurIPS ER 2025