Wei Li

Tsinghua University, Beijing, China

V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Mar 14, 2025

Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation

Mar 11, 2025

FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

Mar 09, 2025

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Mar 07, 2025

Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification

Mar 07, 2025

Feature Point Extraction for Extra-Affine Image

Mar 05, 2025

LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant

Mar 05, 2025

EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Mar 03, 2025

Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting

Mar 03, 2025

MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Feb 28, 2025