Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Florian Metze

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models


Mar 18, 2021
Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alexander Hauptmann

* accepted by NAACL 2021 

  Access Paper or Ask Questions

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning


Mar 18, 2021
Mandela Patrick, Yuki M. Asano, Bernie Huang, Ishan Misra, Florian Metze, Joao Henriques, Andrea Vedaldi


  Access Paper or Ask Questions

NoiseQA: Challenge Set Evaluation for User-Centric Question Answering


Feb 16, 2021
Abhilasha Ravichander, Siddharth Dalmia, Maria Ryskina, Florian Metze, Eduard Hovy, Alan W Black

* EACL 2021 

  Access Paper or Ask Questions

Audio-Visual Event Recognition through the lens of Adversary


Nov 15, 2020
Juncheng B Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze

* 4 pages 

  Access Paper or Ask Questions

Multimodal Speech Recognition with Unstructured Audio Masking


Oct 16, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

* Accepted to NLP Beyond Text workshop, EMNLP 2020 

  Access Paper or Ask Questions

On Long-Tailed Phenomena in Neural Machine Translation


Oct 10, 2020
Vikas Raunak, Siddharth Dalmia, Vivek Gupta, Florian Metze

* Accepted to Findings of EMNLP 2020 

  Access Paper or Ask Questions

Support-set bottlenecks for video-text representation learning


Oct 06, 2020
Mandela Patrick, Po-Yao Huang, Yuki Asano, Florian Metze, Alexander Hauptmann, João Henriques, Andrea Vedaldi


  Access Paper or Ask Questions

Fine-Grained Grounding for Multimodal Speech Recognition


Oct 05, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

* Accepted to Findings of EMNLP 2020 

  Access Paper or Ask Questions

Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations


Sep 12, 2020
Ze Cheng, Juncheng Li, Chenxu Wang, Jixuan Gu, Hao Xu, Xinjian Li, Florian Metze


  Access Paper or Ask Questions

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language


Aug 18, 2020
Amanda Duarte, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto

* Presented as an extended abstract at the Sign Language Recognition, Translation & Production workshop (SLRTP) at European Conference on Computer Vision 2020 

  Access Paper or Ask Questions

AlloVera: A Multilingual Allophone Database


Apr 17, 2020
David R. Mortensen, Xinjian Li, Patrick Littell, Alexis Michaud, Shruti Rijhwani, Antonios Anastasopoulos, Alan W. Black, Florian Metze, Graham Neubig

* 8 pages, LREC 2020 

  Access Paper or Ask Questions

ASR Error Correction and Domain Adaptation Using Machine Translation


Mar 13, 2020
Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, Florian Metze

* Accepted for Oral Presentation at ICASSP 2020 

  Access Paper or Ask Questions

Universal Phone Recognition with a Multilingual Allophone System


Feb 26, 2020
Xinjian Li, Siddharth Dalmia, Juncheng Li, Matthew Lee, Patrick Littell, Jiali Yao, Antonios Anastasopoulos, David R. Mortensen, Graham Neubig, Alan W Black, Florian Metze

* ICASSP 2020 

  Access Paper or Ask Questions

Towards Zero-shot Learning for Automatic Phonemic Transcription


Feb 26, 2020
Xinjian Li, Siddharth Dalmia, David R. Mortensen, Juncheng Li, Alan W Black, Florian Metze

* AAAI 2020 

  Access Paper or Ask Questions

Looking Enhances Listening: Recovering Missing Speech Using Images


Feb 13, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze

* Accepted to ICASSP 2020 

  Access Paper or Ask Questions

Gun Source and Muzzle Head Detection


Jan 29, 2020
Zhong Zhou, Isak Czeresnia Etinger, Florian Metze, Alexander Hauptmann, Alexander Waibel

* EI 2020 

  Access Paper or Ask Questions

On Compositionality in Neural Machine Translation


Dec 14, 2019
Vikas Raunak, Vaibhav Kumar, Florian Metze

* Accepted at Context and Compositionality Workshop, NeurIPS 2019 

  Access Paper or Ask Questions

Adversarial Music: Real World Audio Adversary Against Wake-word Detection System


Dec 06, 2019
Juncheng B. Li, Shuhui Qu, Xinjian Li, Joseph Szurley, J. Zico Kolter, Florian Metze

* NIPS2019_9362, pages = {11908--11918}, year = {2019}, publisher = {Curran Associates, Inc.}, url = {http://papers.nips.cc/paper/9362-adversarial-music-real-world-audio-adversary-against-wake-word-detection-system.pdf} } 
* 9 pages, In Proceedings of NeurIPS 2019 Conference 

  Access Paper or Ask Questions

Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models


Nov 09, 2019
Siddharth Dalmia, Abdelrahman Mohamed, Mike Lewis, Florian Metze, Luke Zettlemoyer


  Access Paper or Ask Questions

Multitask Learning For Different Subword Segmentations In Neural Machine Translation


Oct 27, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze

* Accepted to 16th International Workshop on Spoken Language Translation (IWSLT) 2019 

  Access Paper or Ask Questions

On Leveraging the Visual Modality for Neural Machine Translation


Oct 07, 2019
Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze

* Accepted to INLG 2019 

  Access Paper or Ask Questions

On Dimensional Linguistic Properties of the Word Embedding Space


Oct 05, 2019
Vikas Raunak, Vaibhav Kumar, Vivek Gupta, Florian Metze

* Accepted at ACL SRW 2019 

  Access Paper or Ask Questions

SANTLR: Speech Annotation Toolkit for Low Resource Languages


Aug 02, 2019
Xinjian Li, Zhong Zhou, Siddharth Dalmia, Alan W. Black, Florian Metze

* Interspeech 2019 (Show and Tell) 

  Access Paper or Ask Questions

Multilingual Speech Recognition with Corpus Relatedness Sampling


Aug 02, 2019
Xinjian Li, Siddharth Dalmia, Alan W. Black, Florian Metze

* Interspeech 2019 

  Access Paper or Ask Questions

Cross-Attention End-to-End ASR for Two-Party Conversations


Jul 24, 2019
Suyoun Kim, Siddharth Dalmia, Florian Metze

* Interspeech 2019 

  Access Paper or Ask Questions

Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions


Jun 30, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze


  Access Paper or Ask Questions

Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion


Jun 27, 2019
Suyoun Kim, Siddharth Dalmia, Florian Metze

* ACL 2019 

  Access Paper or Ask Questions