Tatiana Likhomanenko

Path-Constrained Mixture-of-Experts

Mar 18, 2026
Via arXiv

SpeakStream: Streaming Text-to-Speech with Interleaved Data

May 25, 2025
Via arXiv

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis

Nov 26, 2024
Via arXiv

Towards Automatic Assessment of Self-Supervised Speech Models using Rank

Sep 16, 2024
Via arXiv

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Sep 16, 2024
Via arXiv

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Sep 16, 2024
Via arXiv

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

Sep 06, 2024
Via arXiv

Generating Gender Alternatives in Machine Translation

Jul 29, 2024
Via arXiv

dMel: Speech Tokenization made Simple

Jul 22, 2024
Via arXiv

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

May 24, 2024
Via arXiv