Picture for Mike Chrzanowski

Mike Chrzanowski

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

Methods of improving LLM training stability

Add code
Oct 22, 2024
Figure 1 for Methods of improving LLM training stability
Figure 2 for Methods of improving LLM training stability
Figure 3 for Methods of improving LLM training stability
Figure 4 for Methods of improving LLM training stability
Viaarxiv icon

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Add code
Jun 02, 2022
Figure 1 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 2 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 3 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 4 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Viaarxiv icon

Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling

Add code
Oct 08, 2020
Figure 1 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 2 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 3 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 4 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Viaarxiv icon

Towards Robust Image Classification Using Sequential Attention Models

Add code
Dec 04, 2019
Figure 1 for Towards Robust Image Classification Using Sequential Attention Models
Figure 2 for Towards Robust Image Classification Using Sequential Attention Models
Figure 3 for Towards Robust Image Classification Using Sequential Attention Models
Figure 4 for Towards Robust Image Classification Using Sequential Attention Models
Viaarxiv icon

Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

Add code
Jun 06, 2019
Figure 1 for Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Figure 2 for Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Figure 3 for Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Figure 4 for Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Viaarxiv icon

Learning and Evaluating General Linguistic Intelligence

Add code
Jan 31, 2019
Figure 1 for Learning and Evaluating General Linguistic Intelligence
Figure 2 for Learning and Evaluating General Linguistic Intelligence
Figure 3 for Learning and Evaluating General Linguistic Intelligence
Figure 4 for Learning and Evaluating General Linguistic Intelligence
Viaarxiv icon

Relational recurrent neural networks

Add code
Jun 28, 2018
Figure 1 for Relational recurrent neural networks
Figure 2 for Relational recurrent neural networks
Figure 3 for Relational recurrent neural networks
Figure 4 for Relational recurrent neural networks
Viaarxiv icon

Deep Voice: Real-time Neural Text-to-Speech

Add code
Mar 07, 2017
Figure 1 for Deep Voice: Real-time Neural Text-to-Speech
Figure 2 for Deep Voice: Real-time Neural Text-to-Speech
Figure 3 for Deep Voice: Real-time Neural Text-to-Speech
Figure 4 for Deep Voice: Real-time Neural Text-to-Speech
Viaarxiv icon