Satoshi Nakamura

Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation


Jul 29, 2021
Yui Oka, Katsuhito Sudoh, Satoshi Nakamura

* 5 pages, 1 figure. Will be presented at ACL SRW 2021

ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions


Jun 15, 2021
Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

* Accepted by The 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL2021) 

Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS


Nov 11, 2020
Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura

* 6 pages 

Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis


Nov 04, 2020
Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Accepted in SLTU-CCURL 2020 

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition


Nov 04, 2020
Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Accepted in INTERSPEECH 2019 

Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time


Nov 04, 2020
Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura

* Accepted in INTERSPEECH 2020 

Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework


Nov 04, 2020
Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Accepted at INTERSPEECH 2020 

Image Captioning with Visual Object Representations Grounded in the Textual Modality


Oct 20, 2020
Dušan Variš, Katsuhito Sudoh, Satoshi Nakamura


ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation


Jul 08, 2020
Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu

* 4 pages 

Reflection-based Word Attribute Transfer


Jul 07, 2020
Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura

* Accepted at ACL 2020 Student Research Workshop (SRW) 

Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge


May 24, 2020
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Submitted to INTERSPEECH 2020 

Caption Generation of Robot Behaviors based on Unsupervised Learning of Action Segments


Mar 23, 2020
Koichiro Yoshino, Kohei Wakimoto, Yuta Nishimura, Satoshi Nakamura

* Will appear in IWSDS2020 

Using panoramic videos for multi-person localization and tracking in a 3D panoramic coordinate


Dec 05, 2019
Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura

* 5 pages 

Simultaneous Neural Machine Translation using Connectionist Temporal Classification


Nov 27, 2019
Katsuki Chousa, Katsuhito Sudoh, Satoshi Nakamura


Deja-vu: Double Feature Presentation in Deep Transformer Networks


Oct 23, 2019
Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig


Speech-to-speech Translation between Untranscribed Unknown Languages


Oct 05, 2019
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Accepted in IEEE ASRU 2019. Web-page for more samples & details: https://sp2code-translation-v1.netlify.com/ 

Make Skeleton-based Action Recognition Model Smaller, Faster and Better


Jul 29, 2019
Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura

* 6 pages, 5 figures 

Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding


Jun 24, 2019
Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

* Accepted by 1st Workshop NLP for Conversational AI, ACL 2019 Workshop (ConvAI) 

From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning


Jun 03, 2019
Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

* Submitted to INTERSPEECH 2019 

VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019


May 29, 2019
Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura

* Submitted to Interspeech 2019 

An Incremental Turn-Taking Model For Task-Oriented Dialog Systems


May 28, 2019
Andrei C. Coman, Koichiro Yoshino, Yukitoshi Murase, Satoshi Nakamura, Giuseppe Riccardi

* Submitted to INTERSPEECH 2019

Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System


Nov 26, 2018
Hisao Katsumi, Takuya Hiraoka, Koichiro Yoshino, Kazeto Yamamoto, Shota Motoura, Kunihiko Sadamasa, Satoshi Nakamura

* Accepted by AAAI2019 DEEP-DIAL 2019 workshop 

Another Diversity-Promoting Objective Function for Neural Dialogue Generation


Nov 21, 2018
Ryo Nakamura, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura

* AAAI 2019 Workshop on Reasoning and Learning for Human-Machine Dialogues (DEEP-DIAL 2019) 

End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator


Oct 31, 2018
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

