Picture for Jan Kautz

Jan Kautz

NVIDIA

FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding

Add code
Apr 24, 2025
Viaarxiv icon

LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement

Add code
Apr 22, 2025
Viaarxiv icon

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Add code
Apr 21, 2025
Viaarxiv icon

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Add code
Apr 17, 2025
Viaarxiv icon

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Add code
Apr 15, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

One-Minute Video Generation with Test-Time Training

Add code
Apr 07, 2025
Viaarxiv icon

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

Add code
Apr 06, 2025
Viaarxiv icon

Scaling Vision Pre-Training to 4K Resolution

Add code
Mar 25, 2025
Viaarxiv icon

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Add code
Mar 18, 2025
Viaarxiv icon