Picture for Juan Carlos Niebles

Juan Carlos Niebles

Exploring Diffusion Transformer Designs via Grafting

Add code
Jun 06, 2025
Viaarxiv icon

Understanding Complexity in VideoQA via Visual Program Generation

Add code
May 19, 2025
Viaarxiv icon

VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making

Add code
May 06, 2025
Viaarxiv icon

AdaVid: Adaptive Video-Language Pretraining

Add code
Apr 16, 2025
Viaarxiv icon

Artificial Intelligence Index Report 2025

Add code
Apr 08, 2025
Viaarxiv icon

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Add code
Apr 08, 2025
Viaarxiv icon

Re-thinking Temporal Search for Long-Form Video Understanding

Add code
Apr 03, 2025
Viaarxiv icon

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Add code
Mar 31, 2025
Viaarxiv icon

SocialGen: Modeling Multi-Human Social Interaction with Language Models

Add code
Mar 28, 2025
Viaarxiv icon

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Add code
Mar 04, 2025
Viaarxiv icon