Picture for Jan Kautz

Jan Kautz

NVIDIA

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Add code
Apr 27, 2026
Viaarxiv icon

SpaCeFormer: Fast Proposal-Free Open-Vocabulary 3D Instance Segmentation

Add code
Apr 22, 2026
Viaarxiv icon

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Apr 14, 2026
Viaarxiv icon

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Add code
Mar 19, 2026
Viaarxiv icon

SOMA: Unifying Parametric Human Body Models

Add code
Mar 17, 2026
Viaarxiv icon

Kimodo: Scaling Controllable Human Motion Generation

Add code
Mar 16, 2026
Viaarxiv icon

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Add code
Mar 12, 2026
Viaarxiv icon

Stateful Token Reduction for Long-Video Hybrid VLMs

Add code
Feb 27, 2026
Viaarxiv icon

World Action Models are Zero-shot Policies

Add code
Feb 17, 2026
Viaarxiv icon

PhyCritic: Multimodal Critic Models for Physical AI

Add code
Feb 11, 2026
Viaarxiv icon