Qi Wang

Lattice

TopKD: Top-scaled Knowledge Distillation

Aug 06, 2025

RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning

Jul 10, 2025

TGRPO: Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization

Jun 11, 2025

DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding

Jun 04, 2025

What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning

May 28, 2025

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

May 26, 2025

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

May 26, 2025

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

May 25, 2025

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

May 24, 2025

Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing

May 24, 2025