Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks

Dec 17, 2020
Siyuan Feng, Odette Scharenborg

* 18 pages (including 1 page for supplementary material), 13 figures. Submitted to IEEE Open Journal of Signal Processing (OJ-SP) 

  Access Paper or Ask Questions

Show and Speak: Directly Synthesize Spoken Description of Images

Oct 23, 2020
Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg

  Access Paper or Ask Questions

How Phonotactics Affect Multilingual and Zero-shot ASR Performance

Oct 22, 2020
Siyuan Feng, Piotr Żelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak

* Submitted to ICASSP 2021. The first 2 authors contributed equally to this work 

  Access Paper or Ask Questions

Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling

Aug 06, 2020
Siyuan Feng, Odette Scharenborg

* 5 pages, 3 figures. Accepted for publication in INTERSPEECH 2020, Shanghai, China 

  Access Paper or Ask Questions

Evaluating Automatically Generated Phoneme Captions for Images

Jul 31, 2020
Justin van der Hout, Zoltán D'Haese, Mark Hasegawa-Johnson, Odette Scharenborg

* Accepted at Interspeech2020 

  Access Paper or Ask Questions

Detecting and analysing spontaneous oral cancer speech in the wild

Jul 28, 2020
Bence Mark Halpern, Rob van Son, Michiel van den Brekel, Odette Scharenborg

* Accepted to Interspeech 2020 

  Access Paper or Ask Questions

Learning to Recognise Words using Visually Grounded Speech

May 31, 2020
Sebastiaan Scholten, Danny Merkx, Odette Scharenborg

  Access Paper or Ask Questions

That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

May 16, 2020
Piotr Żelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak

* Submitted to Interspeech 2020. For some reason, the ArXiv Latex engine rendered it in more than 4 pages 

  Access Paper or Ask Questions

S2IGAN: Speech-to-Image Generation via Adversarial Learning

May 14, 2020
Xinsheng Wang, Tingting Qiao, Jihua Zhu, Alan Hanjalic, Odette Scharenborg

  Access Paper or Ask Questions

Investigating the Effect of Music and Lyrics on Spoken-Word Recognition

Mar 13, 2018
Odette Scharenborg, Martha Larson

* Preliminary study 

  Access Paper or Ask Questions

Bayesian Models for Unit Discovery on a Very Low Resource Language

Feb 20, 2018
Lucas Ondel, Pierre Godard, Laurent Besacier, Elin Larsen, Mark Hasegawa-Johnson, Odette Scharenborg, Emmanuel Dupoux, Lukas Burget, François Yvon, Sanjeev Khudanpur

* Accepted to ICASSP 2018 

  Access Paper or Ask Questions

Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

Feb 14, 2018
Odette Scharenborg, Laurent Besacier, Alan Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stueker, Pierre Godard, Markus Mueller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux

* Accepted to ICASSP 2018 

  Access Paper or Ask Questions