Picture for Satoshi Nakamura

Satoshi Nakamura

ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions

Add code
Jun 15, 2021
Figure 1 for ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions
Figure 2 for ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions
Figure 3 for ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions
Figure 4 for ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions
Viaarxiv icon

Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS

Add code
Nov 11, 2020
Figure 1 for Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS
Figure 2 for Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS
Viaarxiv icon

Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis

Add code
Nov 04, 2020
Figure 1 for Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis
Figure 2 for Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis
Figure 3 for Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis
Figure 4 for Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis
Viaarxiv icon

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition

Add code
Nov 04, 2020
Figure 1 for Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Figure 2 for Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Figure 3 for Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Figure 4 for Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
Viaarxiv icon

Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time

Add code
Nov 04, 2020
Figure 1 for Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time
Figure 2 for Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time
Figure 3 for Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time
Viaarxiv icon

Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework

Add code
Nov 04, 2020
Figure 1 for Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework
Figure 2 for Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework
Figure 3 for Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework
Figure 4 for Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework
Viaarxiv icon

Image Captioning with Visual Object Representations Grounded in the Textual Modality

Add code
Oct 20, 2020
Figure 1 for Image Captioning with Visual Object Representations Grounded in the Textual Modality
Figure 2 for Image Captioning with Visual Object Representations Grounded in the Textual Modality
Figure 3 for Image Captioning with Visual Object Representations Grounded in the Textual Modality
Figure 4 for Image Captioning with Visual Object Representations Grounded in the Textual Modality
Viaarxiv icon

ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation

Add code
Jul 08, 2020
Figure 1 for ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
Figure 2 for ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
Figure 3 for ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
Figure 4 for ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
Viaarxiv icon

Reflection-based Word Attribute Transfer

Add code
Jul 07, 2020
Viaarxiv icon

Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge

Add code
May 24, 2020
Figure 1 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 2 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 3 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 4 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Viaarxiv icon