Yuqiao Tan (谭宇乔)
First-year master's student at CASIA, supervised by Shizhu He. My research focuses on LLM Reasoning, LLM Interpretability, Reinforcement Learning, and Personalized Agents. I'm happy to discuss any of these topics, and I'm open to internship opportunities and collaborations. If you find my work interesting, feel free to contact me!
News
- [2026.01] Invited Talk at NICE: Internal Policy of Large Language Models and Reinforcement Learning — [Video]
- [2025.12] Paper "Bottom-up Policy Optimization" released — Ranked #3 on Huggingface Daily Papers!
- [2025.10] One paper accepted by NeurIPS 2025 (Efficient Reasoning Workshop).
- [2025.05] One paper accepted by ACL 2025.
- [2025.03] One paper accepted by ICMR 2025.
Publications
— 2025 —
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies [paper] [code]
Yuqiao Tan*, Minzheng Wang*, Shizhu He, et al.
Preprint
The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models [paper] [code]
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
NeurIPS 2025 Efficient Reasoning Workshop
Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in LLMs [paper] [code]
Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao
ACL 2025
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement [paper] [code]
Yuqiao Tan, Shizhu He, Huanxuan Liao, Jun Zhao, Kang Liu
Preprint
MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality [paper] [code]
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2025
— 2024 —
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge [paper]
Ruiting Dai*, Yuqiao Tan*, et al.
ICMR 2024
Projects
- Baidu-AI Differential Search Index Competition — Differential search index for LLM-based advertisement search. Ranked 1st (1/3600).
Education
- M.S. — Pattern Recognition and Intelligent System, CASIA · 2025 – Now
- B.S. — Software Engineering, UESTC · 2021 – 2025
Internships
Research Intern — Smart Internet Group (SIG), Tsinghua University · 2023.07 – 2024.05
Research Intern — DCar-AI-Y, ByteDance · 2024.01 – 2024.07
Honors
- Outstanding Graduate of Sichuan Province · 2024
- First Prize, Baidu Business AI Technology Innovation Competition (CTI) · 2024
- Soong Ching Ling Scholarship, UESTC · 2023
- National Scholarship, Ministry of Education · 2022
Invited Talks
- NICE (2026.01) — Internal Policy of Large Language Models and Reinforcement Learning — [Video]
Services
- Reviewer: ICMR 2025, NeurIPS ER 2025