Picture for Lemeng Wu

Lemeng Wu

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Add code
Apr 09, 2026
Viaarxiv icon

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale

Add code
Mar 16, 2026
Viaarxiv icon

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Add code
Jan 08, 2026
Viaarxiv icon

Steepest Descent Density Control for Compact 3D Gaussian Splatting

Add code
May 08, 2025
Viaarxiv icon

EdgeTAM: On-Device Track Anything Model

Add code
Jan 13, 2025
Viaarxiv icon

Efficient Track Anything

Add code
Nov 28, 2024
Viaarxiv icon

Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations

Add code
Nov 18, 2024
Viaarxiv icon

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Add code
Oct 22, 2024
Figure 1 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 2 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 3 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 4 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Viaarxiv icon

InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models

Add code
Sep 18, 2024
Viaarxiv icon