Yulei Qin

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Dec 31, 2025
Via arXiv

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Dec 26, 2025
Via arXiv

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Sep 26, 2025
Via arXiv

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Dec 19, 2024
Via arXiv

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Aug 28, 2024
Via arXiv

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Aug 07, 2024
Via arXiv

RESTORE: Towards Feature Shift for Vision-Language Prompt Learning

Mar 10, 2024
Via arXiv

Sinkhorn Distance Minimization for Knowledge Distillation

Feb 27, 2024
Via arXiv

Towards Robust Text Retrieval with Progressive Learning

Nov 20, 2023
Via arXiv

CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes

Oct 15, 2023
Via arXiv