Picture for Vedanuj Goswami

Vedanuj Goswami

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

Add code
Feb 24, 2025
Figure 1 for Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Figure 2 for Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Figure 3 for Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Figure 4 for Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Jul 19, 2023
Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

Multilingual Speech-to-Speech Translation into Multiple Target Languages

Add code
Jul 17, 2023
Figure 1 for Multilingual Speech-to-Speech Translation into Multiple Target Languages
Figure 2 for Multilingual Speech-to-Speech Translation into Multiple Target Languages
Figure 3 for Multilingual Speech-to-Speech Translation into Multiple Target Languages
Figure 4 for Multilingual Speech-to-Speech Translation into Multiple Target Languages
Viaarxiv icon

Revisiting Machine Translation for Cross-lingual Classification

Add code
May 23, 2023
Figure 1 for Revisiting Machine Translation for Cross-lingual Classification
Figure 2 for Revisiting Machine Translation for Cross-lingual Classification
Figure 3 for Revisiting Machine Translation for Cross-lingual Classification
Figure 4 for Revisiting Machine Translation for Cross-lingual Classification
Viaarxiv icon

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity

Add code
May 03, 2023
Figure 1 for Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
Figure 2 for Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
Figure 3 for Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
Figure 4 for Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
Viaarxiv icon

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Add code
Mar 07, 2023
Viaarxiv icon

Language-Aware Multilingual Machine Translation with Self-Supervised Learning

Add code
Feb 10, 2023
Figure 1 for Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Figure 2 for Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Figure 3 for Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Figure 4 for Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Viaarxiv icon

Causes and Cures for Interference in Multilingual Translation

Add code
Dec 14, 2022
Viaarxiv icon