Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Adam Lopez

What do character-level models learn about morphology? The case of dependency parsing

Aug 28, 2018

Clara Vania, Andreas Grivas, Adam Lopez

Figure 1 for What do character-level models learn about morphology? The case of dependency parsing

Figure 2 for What do character-level models learn about morphology? The case of dependency parsing

Figure 3 for What do character-level models learn about morphology? The case of dependency parsing

Figure 4 for What do character-level models learn about morphology? The case of dependency parsing

Abstract:When parsing morphologically-rich languages with neural models, it is beneficial to model input at the character level, and it has been claimed that this is because character-level models learn morphology. We test these claims by comparing character-level models to an oracle with access to explicit morphological analysis on twelve languages with varying morphological typologies. Our results highlight many strengths of character-level models, but also show that they are poor at disambiguating some words, particularly in the face of case syncretism. We then demonstrate that explicitly modeling morphological case improves our best model, showing that character-level models can benefit from targeted forms of explicit morphological modeling.

* EMNLP 2018

Via

Access Paper or Ask Questions

Low-Resource Speech-to-Text Translation

Jun 18, 2018

Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater

Figure 1 for Low-Resource Speech-to-Text Translation

Figure 2 for Low-Resource Speech-to-Text Translation

Figure 3 for Low-Resource Speech-to-Text Translation

Figure 4 for Low-Resource Speech-to-Text Translation

Abstract:Speech-to-text translation has many potential applications for low-resource languages, but the typical approach of cascading speech recognition with machine translation is often impossible, since the transcripts needed to train a speech recognizer are usually not available for low-resource languages. Recent work has found that neural encoder-decoder models can learn to directly translate foreign speech in high-resource scenarios, without the need for intermediate transcription. We investigate whether this approach also works in settings where both data and computation are limited. To make the approach efficient, we make several architectural changes, including a change from character-level to word-level decoding. We find that this choice yields crucial speed improvements that allow us to train with fewer computational resources, yet still performs well on frequent words. We explore models trained on between 20 and 160 hours of data, and find that although models trained on less data have considerably lower BLEU scores, they can still predict words with relatively high precision and recall---around 50% for a model trained on 50 hours of data, versus around 60% for the full 160 hour model. Thus, they may still be useful for some low-resource scenarios.

* Added references; results remain unchanged. Accepted to Interspeech 2018

Via

Access Paper or Ask Questions

A Generative Parser with a Discriminative Recognition Algorithm

Aug 17, 2017

Jianpeng Cheng, Adam Lopez, Mirella Lapata

Figure 1 for A Generative Parser with a Discriminative Recognition Algorithm

Figure 2 for A Generative Parser with a Discriminative Recognition Algorithm

Figure 3 for A Generative Parser with a Discriminative Recognition Algorithm

Abstract:Generative models defining joint distributions over parse trees and sentences are useful for parsing and language modeling, but impose restrictions on the scope of features and are often outperformed by discriminative models. We propose a framework for parsing and language modeling which marries a generative model with a discriminative recognition model in an encoder-decoder setting. We provide interpretations of the framework based on expectation maximization and variational inference, and show that it enables parsing and language modeling within a single implementation. On the English Penn Treen-bank, our framework obtains competitive performance on constituency parsing while matching the state-of-the-art single-model language modeling score.

* ACL 2017

Via

Access Paper or Ask Questions

From Characters to Words to in Between: Do We Capture Morphology?

Apr 26, 2017

Clara Vania, Adam Lopez

Figure 1 for From Characters to Words to in Between: Do We Capture Morphology?

Figure 2 for From Characters to Words to in Between: Do We Capture Morphology?

Figure 3 for From Characters to Words to in Between: Do We Capture Morphology?

Figure 4 for From Characters to Words to in Between: Do We Capture Morphology?

Abstract:Words can be represented by composing the representations of subword units such as word segments, characters, and/or character n-grams. While such representations are effective and may capture the morphological regularities of words, they have not been systematically compared, and it is not understood how they interact with different morphological typologies. On a language modeling task, we present experiments that systematically vary (1) the basic unit of representation, (2) the composition of these representations, and (3) the morphological typology of the language modeled. Our results extend previous findings that character representations are effective across typologies, and we find that a previously unstudied combination of character trigram representations composed with bi-LSTMs outperforms most others. But we also find room for improvement: none of the character-level models match the predictive accuracy of a model with access to true morphological analyses, even when learned from an order of magnitude more data.

* Accepted at ACL 2017

Via

Access Paper or Ask Questions

Towards speech-to-text translation without speech recognition

Feb 13, 2017

Sameer Bansal, Herman Kamper, Adam Lopez, Sharon Goldwater

Figure 1 for Towards speech-to-text translation without speech recognition

Figure 2 for Towards speech-to-text translation without speech recognition

Figure 3 for Towards speech-to-text translation without speech recognition

Figure 4 for Towards speech-to-text translation without speech recognition

Abstract:We explore the problem of translating speech to text in low-resource scenarios where neither automatic speech recognition (ASR) nor machine translation (MT) are available, but we have training data in the form of audio paired with text translations. We present the first system for this problem applied to a realistic multi-speaker dataset, the CALLHOME Spanish-English speech translation corpus. Our approach uses unsupervised term discovery (UTD) to cluster repeated patterns in the audio, creating a pseudotext, which we pair with translations to create a parallel text and train a simple bag-of-words MT model. We identify the challenges faced by the system, finding that the difficulty of cross-speaker UTD results in low recall, but that our system is still able to correctly translate some content words in test data.

* To appear in EACL 2017 (short papers)

Via

Access Paper or Ask Questions

Weakly supervised spoken term discovery using cross-lingual side information

Sep 21, 2016

Sameer Bansal, Herman Kamper, Sharon Goldwater, Adam Lopez

Figure 1 for Weakly supervised spoken term discovery using cross-lingual side information

Figure 2 for Weakly supervised spoken term discovery using cross-lingual side information

Figure 3 for Weakly supervised spoken term discovery using cross-lingual side information

Figure 4 for Weakly supervised spoken term discovery using cross-lingual side information

Abstract:Recent work on unsupervised term discovery (UTD) aims to identify and cluster repeated word-like units from audio alone. These systems are promising for some very low-resource languages where transcribed audio is unavailable, or where no written form of the language exists. However, in some cases it may still be feasible (e.g., through crowdsourcing) to obtain (possibly noisy) text translations of the audio. If so, this information could be used as a source of side information to improve UTD. Here, we present a simple method for rescoring the output of a UTD system using text translations, and test it on a corpus of Spanish audio with English translations. We show that it greatly improves the average precision of the results over a wide range of system configurations and data preprocessing methods.

* 5 pages, 4 figures, submitted for ICASSP 2017

Via

Access Paper or Ask Questions

Evaluating Informal-Domain Word Representations With UrbanDictionary

Jun 27, 2016

Naomi Saphra, Adam Lopez

Abstract:Existing corpora for intrinsic evaluation are not targeted towards tasks in informal domains such as Twitter or news comment forums. We want to test whether a representation of informal words fulfills the promise of eliding explicit text normalization as a preprocessing step. One possible evaluation metric for such domains is the proximity of spelling variants. We propose how such a metric might be computed and how a spelling variant dataset can be collected using UrbanDictionary.

Via

Access Paper or Ask Questions