Jiuxiang Gu

Refer to Anything with Vision-Language Prompts

Jun 05, 2025

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

May 30, 2025

Towards Visual Text Grounding of Multimodal Large Language Model

Apr 07, 2025

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

Mar 20, 2025

Robust Latent Matters: Boosting Image Generation with Sampling Error

Mar 11, 2025

METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling

Feb 24, 2025

From Selection to Generation: A Survey of LLM-based Active Learning

Feb 17, 2025

Efficient Reasoning with Hidden Thinking

Jan 31, 2025

MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data

Dec 18, 2024

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024