Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ciprian Chelba

Music

Richer Syntactic Dependencies for Structured Language Modeling

Oct 03, 2001

Ciprian Chelba, Peng Xu

Figure 1 for Richer Syntactic Dependencies for Structured Language Modeling

Figure 2 for Richer Syntactic Dependencies for Structured Language Modeling

Figure 3 for Richer Syntactic Dependencies for Structured Language Modeling

Figure 4 for Richer Syntactic Dependencies for Structured Language Modeling

Abstract:The paper investigates the use of richer syntactic dependencies in the structured language model (SLM). We present two simple methods of enriching the dependencies in the syntactic parse trees used for intializing the SLM. We evaluate the impact of both methods on the perplexity (PPL) and word-error-rate(WER, N-best rescoring) performance of the SLM. We show that the new model achieves an improvement in PPL and WER over the baseline results reported using the SLM on the UPenn Treebank and Wall Street Journal (WSJ) corpora, respectively.

* Proceedings of ASRU 2001, 4 pages

Via

Access Paper or Ask Questions

Information Extraction Using the Structured Language Model

Aug 29, 2001

Ciprian Chelba, Milind Mahajan

Figure 1 for Information Extraction Using the Structured Language Model

Figure 2 for Information Extraction Using the Structured Language Model

Figure 3 for Information Extraction Using the Structured Language Model

Figure 4 for Information Extraction Using the Structured Language Model

Abstract:The paper presents a data-driven approach to information extraction (viewed as template filling) using the structured language model (SLM) as a statistical parser. The task of template filling is cast as constrained parsing using the SLM. The model is automatically trained from a set of sentences annotated with frame/slot labels and spans. Training proceeds in stages: first a constrained syntactic parser is trained such that the parses on training data meet the specified semantic spans, then the non-terminal labels are enriched to contain semantic information and finally a constrained syntactic+semantic parser is trained on the parse trees resulting from the previous stage. Despite the small amount of training data used, the model is shown to outperform the slot level accuracy of a simple semantic grammar authored manually for the MiPad --- personal information management --- task.

* EMNLP/NAACL 2001 Conference Proceedings
* EMNLP'01, Pittsburgh; 8 pages

Via

Access Paper or Ask Questions

Portability of Syntactic Structure for Language Modeling

Aug 29, 2001

Ciprian Chelba

Figure 1 for Portability of Syntactic Structure for Language Modeling

Figure 2 for Portability of Syntactic Structure for Language Modeling

Figure 3 for Portability of Syntactic Structure for Language Modeling

Figure 4 for Portability of Syntactic Structure for Language Modeling

Abstract:The paper presents a study on the portability of statistical syntactic knowledge in the framework of the structured language model (SLM). We investigate the impact of porting SLM statistics from the Wall Street Journal (WSJ) to the Air Travel Information System (ATIS) domain. We compare this approach to applying the Microsoft rule-based parser (NLPwin) for the ATIS data and to using a small amount of data manually parsed at UPenn for gathering the intial SLM statistics. Surprisingly, despite the fact that it performs modestly in perplexity (PPL), the model initialized on WSJ parses outperforms the other initialization methods based on in-domain annotated data, achieving a significant 0.4% absolute and 7% relative reduction in word error rate (WER) over a baseline system whose word error rate is 5.8%; the improvement measured relative to the minimum WER achievable on the N-best lists we worked with is 12%.

* ICASSP 2001 Proceedings
* ICASSP 2001, Salt Lake City; 4 pages

Via

Access Paper or Ask Questions

Structured Language Modeling for Speech Recognition

Jan 25, 2000

Ciprian Chelba, Frederick Jelinek

Figure 1 for Structured Language Modeling for Speech Recognition

Figure 2 for Structured Language Modeling for Speech Recognition

Figure 3 for Structured Language Modeling for Speech Recognition

Figure 4 for Structured Language Modeling for Speech Recognition

Abstract:A new language model for speech recognition is presented. The model develops hidden hierarchical syntactic-like structure incrementally and uses it to extract meaningful information from the word history, thus complementing the locality of currently used trigram models. The structured language model (SLM) and its performance in a two-pass speech recognizer --- lattice decoding --- are presented. Experiments on the WSJ corpus show an improvement in both perplexity (PPL) and word error rate (WER) over conventional trigram models.

* Proceedings of NLDB'99, Klagenfurt, Austria
* 4 pages + 2 pages of ERRATA

Via

Access Paper or Ask Questions

A Structured Language Model

Jan 25, 2000

Ciprian Chelba

Figure 1 for A Structured Language Model

Figure 2 for A Structured Language Model

Figure 3 for A Structured Language Model

Figure 4 for A Structured Language Model

Abstract:The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words - binary-parse-structure with headword annotation. The model, its probabilistic parametrization, and a set of experiments meant to evaluate its predictive power are presented.

* changed ACM-class membership, Proceedings of ACL-EACL'97, Student Section, Madrid, Spain

Via

Access Paper or Ask Questions

Expoiting Syntactic Structure for Language Modeling

Jan 25, 2000

Ciprian Chelba, Frederick Jelinek

Figure 1 for Expoiting Syntactic Structure for Language Modeling

Figure 2 for Expoiting Syntactic Structure for Language Modeling

Figure 3 for Expoiting Syntactic Structure for Language Modeling

Figure 4 for Expoiting Syntactic Structure for Language Modeling

Abstract:The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words--binary-parse-structure with headword annotation and operates in a left-to-right manner --- therefore usable for automatic speech recognition. The model, its probabilistic parameterization, and a set of experiments meant to evaluate its predictive power are presented; an improvement over standard trigram modeling is achieved.

* Proceedings of ACL'98, Montreal, Canada
* changed ACM-class membership and buggy author names

Via

Access Paper or Ask Questions

Recognition Performance of a Structured Language Model

Jan 24, 2000

Ciprian Chelba, Frederick Jelinek

Figure 1 for Recognition Performance of a Structured Language Model

Figure 2 for Recognition Performance of a Structured Language Model

Figure 3 for Recognition Performance of a Structured Language Model

Figure 4 for Recognition Performance of a Structured Language Model

Abstract:A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history - thus enabling the use of extended distance dependencies - in an attempt to complement the locality of currently used trigram models. The structured language model, its probabilistic parameterization and performance in a two-pass speech recognizer are presented. Experiments on the SWITCHBOARD corpus show an improvement in both perplexity and word error rate over conventional trigram models.

* Proceedings of Eurospeech, 1999, pp. 1567-1570, Budapest, Hungary
* 4 pages

Via

Access Paper or Ask Questions

Refinement of a Structured Language Model

Jan 24, 2000

Ciprian Chelba, Frederick Jelinek

Figure 1 for Refinement of a Structured Language Model

Figure 2 for Refinement of a Structured Language Model

Figure 3 for Refinement of a Structured Language Model

Figure 4 for Refinement of a Structured Language Model

Abstract:A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history - thus enabling the use of extended distance dependencies - in an attempt to complement the locality of currently used n-gram Markov models. The model, its probabilistic parametrization, a reestimation algorithm for the model parameters and a set of experiments meant to evaluate its potential for speech recognition are presented.

* Proceedings of the International Conference on Advances in Pattern Recognition, 1998, pp. 275-284, Plymouth, UK
* 10 pages

Via

Access Paper or Ask Questions

Exploiting Syntactic Structure for Natural Language Modeling

Jan 24, 2000

Ciprian Chelba

Figure 1 for Exploiting Syntactic Structure for Natural Language Modeling

Figure 2 for Exploiting Syntactic Structure for Natural Language Modeling

Figure 3 for Exploiting Syntactic Structure for Natural Language Modeling

Figure 4 for Exploiting Syntactic Structure for Natural Language Modeling

Abstract:The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parser. A maximum likelihood reestimation procedure belonging to the class of expectation-maximization algorithms is employed for training the model. Experiments on the Wall Street Journal, Switchboard and Broadcast News corpora show improvement in both perplexity and word error rate - word lattice rescoring - over the standard 3-gram language model. The significance of the thesis lies in presenting an original approach to language modeling that uses the hierarchical - syntactic - structure in natural language to improve on current 3-gram modeling techniques for large vocabulary speech recognition.

* Advisor: Frederick Jelinek, Ph.D. Thesis, 122 pages; removed unused .eps file

Via

Access Paper or Ask Questions