Alert button
Picture for Chiori Hori

Chiori Hori

Alert button

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization

Add code
Bookmark button
Alert button
Feb 27, 2024
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks

Add code
Bookmark button
Alert button
Dec 11, 2023
Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres

Figure 1 for Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks
Figure 2 for Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks
Figure 3 for Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks
Figure 4 for Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks
Viaarxiv icon

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Add code
Bookmark button
Alert button
Oct 30, 2023
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, Francois G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Generation or Replication: Auscultating Audio Latent Diffusion Models

Add code
Bookmark button
Alert button
Oct 16, 2023
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

Add code
Bookmark button
Alert button
Jun 27, 2023
Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh Jha, Diego Romeres, Jonathan Le Roux

Figure 1 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 2 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 3 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 4 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Viaarxiv icon

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering

Add code
Bookmark button
Alert button
Feb 18, 2022
Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux

Figure 1 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 2 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 3 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 4 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Viaarxiv icon

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

Add code
Bookmark button
Alert button
Oct 13, 2021
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori

Figure 1 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 2 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 3 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 4 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Viaarxiv icon

Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers

Add code
Bookmark button
Alert button
Aug 04, 2021
Chiori Hori, Takaaki Hori, Jonathan Le Roux

Figure 1 for Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers
Figure 2 for Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers
Figure 3 for Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers
Figure 4 for Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers
Viaarxiv icon

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers

Add code
Bookmark button
Alert button
Apr 19, 2021
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux

Figure 1 for Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers
Figure 2 for Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers
Figure 3 for Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers
Viaarxiv icon

Multi-Pass Transformer for Machine Translation

Add code
Bookmark button
Alert button
Sep 23, 2020
Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux

Figure 1 for Multi-Pass Transformer for Machine Translation
Figure 2 for Multi-Pass Transformer for Machine Translation
Figure 3 for Multi-Pass Transformer for Machine Translation
Figure 4 for Multi-Pass Transformer for Machine Translation
Viaarxiv icon