Picture for Sean Narenthiran

Sean Narenthiran

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models

Add code
Oct 16, 2025
Figure 1 for Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models
Figure 2 for Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models
Figure 3 for Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models
Figure 4 for Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models
Viaarxiv icon

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Figure 1 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 2 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 3 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 4 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Viaarxiv icon

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Figure 1 for Llama-Nemotron: Efficient Reasoning Models
Figure 2 for Llama-Nemotron: Efficient Reasoning Models
Figure 3 for Llama-Nemotron: Efficient Reasoning Models
Figure 4 for Llama-Nemotron: Efficient Reasoning Models
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Figure 1 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 2 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 3 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 4 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Viaarxiv icon

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Add code
Apr 02, 2025
Viaarxiv icon

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Add code
Jul 29, 2024
Figure 1 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 2 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 3 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 4 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Add code
Feb 15, 2024
Figure 1 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 2 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 3 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 4 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Viaarxiv icon