|
Publications
|
|
2025
|
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
Yuqiao Tan*, Minzheng Wang*, Shizhu He, et al.
Preprint
paper  / 
code
|
The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
NeurIPS 2025 Efficient Reasoning Workshop
paper  / 
code
|
Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in LLMs
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
ACL 2025
paper  / 
code
|
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
Yuqiao Tan, Shizhu He, Huanxuan Liao, Jun Zhao, Kang Liu
Preprint
paper  / 
code
|
RobustPT: Dynamic Disentanglement Prompt Tuning in Vision-Language Models with Missing Modalities
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2025
paper  / 
code
|
|
2024
|
MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Ruiting Dai*, Yuqiao Tan*, et al.
Preprint
paper
|
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2024
paper
|
|
Education
|
|
M.E., Pattern Recognition and Intelligent System, Institute of Automation, UCAS, 2025 - present
B.E., Software Engineering, University of Electronic Science and Technology of China, 2021 - 2025
|
|
Award
|
|
Outstanding Graduate of Sichuan Province, 2024
First Prize, Baidu Business AI Technology Innovation Competition (80000 RMB), 2024
Soong Ching Ling Scholarship, UESTC, 2023
National Scholarship, Ministry of Education, 2022
|
|
Invited Talk
|
|
NICE - Internal Policy of LLMs and Reinforcement Learning, 2026.01
[Video]
|
|
Reviewer
|
|
ICMR 2025, NeurIPS ER 2025
|
|