Benjamin Lecouteux

Improving the Coverage and the Generalization Ability of Neural Word Sense Disambiguation through Hypernymy and Hyponymy Relationships

Nov 02, 2018

Loïc Vial, Benjamin Lecouteux, Didier Schwab

Abstract: In Word Sense Disambiguation (WSD), the predominant approach generally involves a supervised system trained on sense-annotated corpora. The limited quantity of such corpora, however, restricts the coverage and the performance of these systems. In this article, we propose a new method that addresses these issues by taking advantage of the knowledge present in WordNet, and especially the hypernymy and hyponymy relationships between synsets, in order to reduce the number of different sense tags that are necessary to disambiguate all words of the lexical database. Our method leads to state-of-the-art results on most WSD evaluation tasks, while improving the coverage of supervised systems and reducing the training time and the size of the models, without additional training data. In addition, we exhibit results that significantly outperform the state of the art when our method is combined with an ensembling technique and the addition of the WordNet Gloss Tagged as training corpus.
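The idea of shrinking the sense-tag vocabulary through hypernyms can be illustrated with a small sketch. This is not the authors' algorithm: the hierarchy below is an invented toy (not WordNet data), the names `HYPERNYM`, `path_to_root` and `compress` are hypothetical, and the rule used (climb each sense's hypernym chain until reaching a node shared with another sense of the same word) is a simplifying assumption that only considers one word at a time.

```python
from typing import Dict, List, Optional

# Toy hypernym links (child -> parent). Invented for illustration; not WordNet data.
HYPERNYM: Dict[str, Optional[str]] = {
    "mouse.animal": "rodent",
    "rodent": "mammal",
    "mammal": "animal",
    "animal": "entity",
    "mouse.device": "hardware",
    "hardware": "artifact",
    "artifact": "entity",
    "entity": None,
}


def path_to_root(sense: str) -> List[str]:
    """Return the sense followed by its successive hypernyms up to the root."""
    path = [sense]
    while HYPERNYM.get(path[-1]) is not None:
        path.append(HYPERNYM[path[-1]])
    return path


def compress(senses: List[str]) -> Dict[str, str]:
    """Map each sense of one word to its most general ancestor that is
    not on the hypernym path of any other sense of that word."""
    paths = {s: path_to_root(s) for s in senses}
    mapping: Dict[str, str] = {}
    for sense, path in paths.items():
        # Every node that appears on the path of a competing sense.
        others = set()
        for other, other_path in paths.items():
            if other != sense:
                others.update(other_path)
        tag = sense
        for node in path:
            if node in others:
                break  # going higher would merge two senses of the word
            tag = node
        mapping[sense] = tag
    return mapping


mapping = compress(["mouse.animal", "mouse.device"])
print(mapping)
```

In this toy case the two senses of "mouse" are mapped to the coarser tags `animal` and `artifact`, which still keep them apart. Coarser tags of this kind can be shared by many words, which is how a smaller tag set could extend coverage to words that never appear in the annotated corpus.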

Analyzing Learned Representations of a Deep ASR Performance Prediction Model

Aug 28, 2018

Zied Elloumi, Laurent Besacier, Olivier Galibert, Benjamin Lecouteux

Abstract: This paper addresses a relatively new task: prediction of ASR performance on unseen broadcast programs. In a previous paper, we presented an ASR performance prediction system using CNNs that encode both text (ASR transcript) and speech, in order to predict word error rate. This work is dedicated to the analysis of speech signal embeddings and text embeddings learnt by the CNN while training our prediction model. We try to better understand which information is captured by the deep model and its relation with different conditioning factors. It is shown that hidden layers convey a clear signal about speech style, accent and broadcast type. We then try to leverage these three types of information at training time through multi-task learning. Our experiments show that this allows us to train slightly more efficient ASR performance prediction systems that, in addition, simultaneously tag the analyzed utterances according to their speech style, accent and broadcast program origin.

* EMNLP 2018 Workshop

ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Apr 23, 2018

Zied Elloumi, Laurent Besacier, Olivier Galibert, Juliette Kahn, Benjamin Lecouteux

Abstract: In this paper, we address a relatively new task: prediction of ASR performance on unseen broadcast programs. We first propose a heterogeneous French corpus dedicated to this task. Two prediction approaches are compared: a state-of-the-art performance prediction based on regression (engineered features) and a new strategy based on convolutional neural networks (learnt features). We particularly focus on the combination of both textual (ASR transcription) and signal inputs. While the joint use of textual and signal features did not work for the regression baseline, the combination of inputs for CNNs leads to the best WER prediction performance. We also show that our CNN approach predicts the WER distribution on a collection of speech recordings remarkably well.

* IEEE ICASSP 2018
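For readers unfamiliar with the quantity being predicted in the abstract above, word error rate (WER) is the word-level edit distance between a reference transcript and the ASR output, divided by the number of reference words. The sketch below computes it when a reference is available; the function name `wer` and the example sentences are illustrative assumptions, and the paper's system predicts this value without access to a reference.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
score = wer("the cat sat on the mat", "the cat sit on mat")
print(round(score, 3))
```

Because insertions are counted, WER can exceed 1.0 when the hypothesis is much longer than the reference.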

Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features

Sep 20, 2016

Ngoc-Tien Le, Benjamin Lecouteux, Laurent Besacier

Abstract: This paper addresses automatic quality assessment of spoken language translation (SLT). This relatively new task is defined and formalized as a sequence labeling problem where each word in the SLT hypothesis is tagged as good or bad according to a large feature set. We propose several word confidence estimators (WCE) based on our automatic evaluation of transcription (ASR) quality, translation (MT) quality, or both (combined ASR+MT). This research work is possible because we built a specific corpus of 6.7k utterances, for each of which a quintuplet is built, containing the ASR output, verbatim transcript, text translation, speech translation and post-edition of the translation. The conclusion of our multiple experiments using joint ASR and MT features for WCE is that MT features remain the most influential, while ASR features can bring interesting complementary information. Our robust quality estimators for SLT can be used for re-scoring speech translation graphs or for providing feedback to the user in interactive speech translation or computer-assisted speech-to-text scenarios.

* submitted to MT Journal (special issue on spoken language translation)
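To make the good/bad sequence-labeling formulation in the abstract above concrete, here is a minimal sketch of how word-level labels could be derived for one utterance. It assumes that a hypothesis word counts as good when it aligns with an identical word in the post-edited translation, and it uses Python's `difflib.SequenceMatcher` for the alignment. Both choices are assumptions for illustration; the abstract does not specify the labeling procedure, and `label_words` and the example sentences are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Tuple


def label_words(hypothesis: str, post_edit: str) -> List[Tuple[str, str]]:
    """Tag each hypothesis word 'G' (good) if it aligns with an identical
    word in the post-edited translation, otherwise 'B' (bad)."""
    hyp = hypothesis.split()
    ref = post_edit.split()
    labels = ["B"] * len(hyp)

    # Align the two word sequences and mark every matched hypothesis word.
    matcher = SequenceMatcher(a=hyp, b=ref, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.a, block.a + block.size):
            labels[k] = "G"
    return list(zip(hyp, labels))


tagged = label_words("he go to the school", "he goes to school")
print(tagged)
```

A word confidence estimator would then be trained to predict these tags from ASR and MT features, without seeing the post-edition at test time.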
