Alert button
Picture for Egor Lakomkin

Egor Lakomkin

Alert button

Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data

Add code
Bookmark button
Alert button
Nov 12, 2023
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

Viaarxiv icon

End-to-End Speech Recognition Contextualization with Large Language Models

Add code
Bookmark button
Alert button
Sep 19, 2023
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen

Figure 1 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 2 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 3 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 4 for End-to-End Speech Recognition Contextualization with Large Language Models
Viaarxiv icon

Prompting Large Language Models with Speech Recognition Abilities

Add code
Bookmark button
Alert button
Jul 21, 2023
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

Figure 1 for Prompting Large Language Models with Speech Recognition Abilities
Figure 2 for Prompting Large Language Models with Speech Recognition Abilities
Figure 3 for Prompting Large Language Models with Speech Recognition Abilities
Figure 4 for Prompting Large Language Models with Speech Recognition Abilities
Viaarxiv icon

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Add code
Bookmark button
Alert button
Apr 03, 2023
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolář, Stavros Petridis, Maja Pantic, Christian Fuegen

Figure 1 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 2 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 3 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 4 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Viaarxiv icon

Egocentric Audio-Visual Noise Suppression

Add code
Bookmark button
Alert button
Nov 07, 2022
Roshan Sharma, Weipeng He, Ju Lin, Egor Lakomkin, Yang Liu, Kaustubh Kalgaonkar

Figure 1 for Egocentric Audio-Visual Noise Suppression
Figure 2 for Egocentric Audio-Visual Noise Suppression
Figure 3 for Egocentric Audio-Visual Noise Suppression
Figure 4 for Egocentric Audio-Visual Noise Suppression
Viaarxiv icon

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos

Add code
Bookmark button
Alert button
Mar 01, 2019
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter

Figure 1 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 2 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 3 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 4 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Viaarxiv icon

Incorporating End-to-End Speech Recognition Models for Sentiment Analysis

Add code
Bookmark button
Alert button
Feb 28, 2019
Egor Lakomkin, Mohammad Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter

Figure 1 for Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
Figure 2 for Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
Figure 3 for Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
Figure 4 for Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
Viaarxiv icon

On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

Add code
Bookmark button
Alert button
Apr 06, 2018
Egor Lakomkin, Mohammad Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter

Figure 1 for On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Figure 2 for On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Figure 3 for On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Figure 4 for On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Viaarxiv icon

EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 03, 2018
Egor Lakomkin, Mohammad Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter

Figure 1 for EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning
Figure 2 for EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning
Figure 3 for EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning
Figure 4 for EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning
Viaarxiv icon