Picture for Weilong Dai

Weilong Dai

DynFrame: Adaptive Reasoning-Driven Multimodal Framework with Dynamic Frame Augmentation for Complex Video Understanding

Add code
May 26, 2026
Viaarxiv icon

Think When Needed: Adaptive Reasoning-Driven Multimodal Embeddings with a Dual-LoRA Architecture

Add code
May 14, 2026
Viaarxiv icon

DSI-Bench: A Benchmark for Dynamic Spatial Intelligence

Add code
Oct 21, 2025
Viaarxiv icon

Fast-Slow Thinking for Large Vision-Language Model Reasoning

Add code
Apr 25, 2025
Viaarxiv icon

MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation

Add code
Mar 03, 2025
Viaarxiv icon