Sakriani Sakti

Training-Free Intelligibility-Guided Observation Addition for Noisy ASR

Feb 24, 2026

SimulSense: Sense-Driven Interpreting for Efficient Simultaneous Speech Translation

Sep 26, 2025

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Jun 15, 2025

Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID

Dec 26, 2024

Continual Learning in Machine Speech Chain Using Gradient Episodic Memory

Nov 27, 2024

A Transformer Framework for Simultaneous Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Nov 06, 2024

A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Oct 30, 2024

Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities

Oct 11, 2024

Contrastive Feedback Mechanism for Simultaneous Speech Translation

Jul 31, 2024

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition

Jul 31, 2024