Frank Zhang

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

Optimizing Pretraining Data Mixtures with LLM-Estimated Utility

Jan 20, 2025

The Llama 3 Herd of Models

Jul 31, 2024

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Mar 14, 2024

Pushing the performances of ASR models on English and Spanish accents

Dec 22, 2022

Scaling ASR Improves Zero and Few Shot Learning

Nov 29, 2021

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings

Oct 08, 2021

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Aug 04, 2021

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Jul 09, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks

Nov 09, 2020