Egor Lakomkin

Jack

The Llama 3 Herd of Models

Jul 31, 2024 (via arXiv)

Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data

Nov 12, 2023 (via arXiv)

End-to-End Speech Recognition Contextualization with Large Language Models

Sep 19, 2023 (via arXiv)

Prompting Large Language Models with Speech Recognition Abilities

Jul 21, 2023 (via arXiv)

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Apr 03, 2023 (via arXiv)

Egocentric Audio-Visual Noise Suppression

Nov 07, 2022 (via arXiv)

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos

Mar 01, 2019 (via arXiv)

Incorporating End-to-End Speech Recognition Models for Sentiment Analysis

Feb 28, 2019 (via arXiv)

On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

Apr 06, 2018 (via arXiv)

EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning

Apr 03, 2018 (via arXiv)