Picture for Ali Taghibakhshi

Ali Taghibakhshi

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Figure 1 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 2 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 3 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 4 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Viaarxiv icon

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Add code
Apr 15, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Figure 1 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 2 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 3 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 4 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Viaarxiv icon

Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models

Add code
Sep 09, 2024
Viaarxiv icon

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Add code
May 02, 2024
Figure 1 for NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Figure 2 for NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Figure 3 for NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Figure 4 for NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Viaarxiv icon

Hierarchical Graph Neural Network with Cross-Attention for Cross-Device User Matching

Add code
Apr 06, 2023
Viaarxiv icon

MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods

Add code
Jan 26, 2023
Viaarxiv icon

Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation

Add code
Dec 10, 2022
Figure 1 for Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation
Figure 2 for Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation
Figure 3 for Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation
Figure 4 for Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation
Viaarxiv icon