Qi Wang

Lattice

DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding

Jun 04, 2025
Via arXiv

What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning

May 28, 2025
Via arXiv

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

May 26, 2025
Via arXiv

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

May 26, 2025
Via arXiv

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

May 25, 2025
Via arXiv

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

May 24, 2025
Via arXiv

Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing

May 24, 2025
Via arXiv

DualComp: End-to-End Learning of a Unified Dual-Modality Lossless Compressor

May 22, 2025
Via arXiv

Clapper: Compact Learning and Video Representation in VLMs

May 21, 2025
Via arXiv

Fourier-Invertible Neural Encoder (FINE) for Homogeneous Flows

May 21, 2025
Via arXiv