Picture for Jan Kautz

Jan Kautz

NVIDIA

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

Add code
Feb 06, 2025
Figure 1 for Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction
Figure 2 for Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction
Figure 3 for Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction
Figure 4 for Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction
Viaarxiv icon

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Add code
Feb 04, 2025
Figure 1 for Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Figure 2 for Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Figure 3 for Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Figure 4 for Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Viaarxiv icon

Parallel Sequence Modeling via Generalized Spatial Propagation Network

Add code
Jan 21, 2025
Figure 1 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 2 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 3 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 4 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Viaarxiv icon

SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

Add code
Dec 12, 2024
Viaarxiv icon

StreamChat: Chatting with Streaming Video

Add code
Dec 11, 2024
Figure 1 for StreamChat: Chatting with Streaming Video
Figure 2 for StreamChat: Chatting with Streaming Video
Figure 3 for StreamChat: Chatting with Streaming Video
Figure 4 for StreamChat: Chatting with Streaming Video
Viaarxiv icon

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

Add code
Dec 10, 2024
Figure 1 for RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Figure 2 for RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Figure 3 for RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Figure 4 for RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Viaarxiv icon

Gated Delta Networks: Improving Mamba2 with Delta Rule

Add code
Dec 09, 2024
Viaarxiv icon

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

Add code
Dec 05, 2024
Figure 1 for NaVILA: Legged Robot Vision-Language-Action Model for Navigation
Figure 2 for NaVILA: Legged Robot Vision-Language-Action Model for Navigation
Figure 3 for NaVILA: Legged Robot Vision-Language-Action Model for Navigation
Figure 4 for NaVILA: Legged Robot Vision-Language-Action Model for Navigation
Viaarxiv icon

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

Hymba: A Hybrid-head Architecture for Small Language Models

Add code
Nov 20, 2024
Figure 1 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 2 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 3 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 4 for Hymba: A Hybrid-head Architecture for Small Language Models
Viaarxiv icon