Sandeep Subramanian

Magistral

Jun 12, 2025

Pixtral 12B

Oct 09, 2024

Nemotron-4 340B Technical Report

Jun 17, 2024

Nemotron-4 15B Technical Report

Feb 27, 2024

Mixtral of Experts

Jan 08, 2024

Retrieval meets Long Context Large Language Models

Oct 04, 2023

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Jun 02, 2022

NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21

Nov 16, 2021

Multi-scale Transformer Language Models

May 01, 2020

On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

Sep 07, 2019