Picture for Jan Kautz

Jan Kautz

NVIDIA

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Add code
Jan 14, 2026
Viaarxiv icon

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Add code
Jan 04, 2026
Viaarxiv icon

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Add code
Dec 16, 2025
Figure 1 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 2 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 3 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 4 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Viaarxiv icon

TiDAR: Think in Diffusion, Talk in Autoregression

Add code
Nov 12, 2025
Figure 1 for TiDAR: Think in Diffusion, Talk in Autoregression
Figure 2 for TiDAR: Think in Diffusion, Talk in Autoregression
Figure 3 for TiDAR: Think in Diffusion, Talk in Autoregression
Figure 4 for TiDAR: Think in Diffusion, Talk in Autoregression
Viaarxiv icon

SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control

Add code
Nov 11, 2025
Viaarxiv icon

NVIDIA Nemotron Nano V2 VL

Add code
Nov 07, 2025
Viaarxiv icon

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Add code
Oct 21, 2025
Figure 1 for ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Figure 2 for ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Figure 3 for ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Figure 4 for ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Viaarxiv icon

3D Aware Region Prompted Vision Language Model

Add code
Sep 16, 2025
Figure 1 for 3D Aware Region Prompted Vision Language Model
Figure 2 for 3D Aware Region Prompted Vision Language Model
Figure 3 for 3D Aware Region Prompted Vision Language Model
Figure 4 for 3D Aware Region Prompted Vision Language Model
Viaarxiv icon