Picture for Shuichiro Shimizu

Shuichiro Shimizu

CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset

Add code
Sep 17, 2025
Viaarxiv icon

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems

Add code
Mar 11, 2025
Figure 1 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 2 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 3 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 4 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Viaarxiv icon

When Large Language Models Meet Speech: A Survey on Integration Approaches

Add code
Feb 26, 2025
Figure 1 for When Large Language Models Meet Speech: A Survey on Integration Approaches
Figure 2 for When Large Language Models Meet Speech: A Survey on Integration Approaches
Figure 3 for When Large Language Models Meet Speech: A Survey on Integration Approaches
Figure 4 for When Large Language Models Meet Speech: A Survey on Integration Approaches
Viaarxiv icon

MELD-ST: An Emotion-aware Speech Translation Dataset

Add code
May 21, 2024
Viaarxiv icon

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

Add code
Jan 18, 2024
Figure 1 for SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition
Figure 2 for SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition
Figure 3 for SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition
Figure 4 for SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition
Viaarxiv icon

Video-Helpful Multimodal Machine Translation

Add code
Oct 31, 2023
Viaarxiv icon

Towards Speech Dialogue Translation Mediating Speakers of Different Languages

Add code
May 22, 2023
Figure 1 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 2 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 3 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 4 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Viaarxiv icon

VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation

Add code
Jan 21, 2022
Figure 1 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 2 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 3 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 4 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Viaarxiv icon