Sadegh Mahdavi

NVIDIA Nemotron 3: Efficient and Open Intelligence

Dec 24, 2025 (via arXiv)

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Dec 23, 2025 (via arXiv)

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Dec 17, 2025 (via arXiv)

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection

Nov 17, 2025 (via arXiv)

Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients

Oct 27, 2025 (via arXiv)

Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation

Jan 24, 2025 (via arXiv)

From Graph Diffusion to Graph Classification

Nov 26, 2024 (via arXiv)

Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models

Jul 17, 2024 (via arXiv)

Memorization Capacity of Multi-Head Attention in Transformers

Jun 03, 2023 (via arXiv)

Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks

Nov 01, 2022 (via arXiv)