Picture for Qi Wang

Qi Wang

Lattice

AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning

Add code
Aug 09, 2025
Viaarxiv icon

LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction Tuning

Add code
Aug 08, 2025
Viaarxiv icon

TopKD: Top-scaled Knowledge Distillation

Add code
Aug 06, 2025
Viaarxiv icon

RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning

Add code
Jul 10, 2025
Viaarxiv icon

TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization

Add code
Jun 11, 2025
Viaarxiv icon

DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding

Add code
Jun 04, 2025
Viaarxiv icon

What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning

Add code
May 28, 2025
Viaarxiv icon

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Add code
May 26, 2025
Viaarxiv icon

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

Add code
May 26, 2025
Viaarxiv icon

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

Add code
May 25, 2025
Viaarxiv icon