Picture for Jacob Kahn

Jacob Kahn

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Add code
Mar 12, 2024
Figure 1 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 2 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 3 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Add code
Oct 08, 2023
Figure 1 for RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Figure 2 for RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Figure 3 for RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Figure 4 for RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Viaarxiv icon

The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment

Add code
Feb 13, 2023
Figure 1 for The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment
Figure 2 for The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment
Figure 3 for The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment
Figure 4 for The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment
Viaarxiv icon

OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks

Add code
Nov 02, 2022
Figure 1 for OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks
Figure 2 for OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks
Figure 3 for OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks
Figure 4 for OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks
Viaarxiv icon

Reasoning over Public and Private Data in Retrieval-Based Systems

Add code
Mar 14, 2022
Figure 1 for Reasoning over Public and Private Data in Retrieval-Based Systems
Figure 2 for Reasoning over Public and Private Data in Retrieval-Based Systems
Figure 3 for Reasoning over Public and Private Data in Retrieval-Based Systems
Figure 4 for Reasoning over Public and Private Data in Retrieval-Based Systems
Viaarxiv icon

Flashlight: Enabling Innovation in Tools for Machine Learning

Add code
Jan 29, 2022
Figure 1 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 2 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 3 for Flashlight: Enabling Innovation in Tools for Machine Learning
Figure 4 for Flashlight: Enabling Innovation in Tools for Machine Learning
Viaarxiv icon

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training

Add code
Apr 02, 2021
Figure 1 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 2 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 3 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 4 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Viaarxiv icon

Rethinking Evaluation in ASR: Are Our Models Robust Enough?

Add code
Oct 22, 2020
Figure 1 for Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Figure 2 for Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Figure 3 for Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Figure 4 for Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Viaarxiv icon

slimIPL: Language-Model-Free Iterative Pseudo-Labeling

Add code
Oct 22, 2020
Figure 1 for slimIPL: Language-Model-Free Iterative Pseudo-Labeling
Figure 2 for slimIPL: Language-Model-Free Iterative Pseudo-Labeling
Figure 3 for slimIPL: Language-Model-Free Iterative Pseudo-Labeling
Figure 4 for slimIPL: Language-Model-Free Iterative Pseudo-Labeling
Viaarxiv icon