Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

"Text": models, code, and papers

Deep Neural Networks for Relation Extraction

Apr 05, 2021
Tapas Nayak

Relation extraction from text is an important task for automatic knowledge base population. In this thesis, we first propose a syntax-focused multi-factor attention network model for finding the relation between two entities. Next, we propose two joint entity and relation extraction frameworks based on encoder-decoder architecture. Finally, we propose a hierarchical entity graph convolutional network for relation extraction across documents.

* PhD Thesis, National University of Singapore (2020) 

  Access Paper or Ask Questions

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)

Aug 03, 2020
Samuel Albanie, Yang Liu, Arsha Nagrani, Antoine Miech, Ernesto Coto, Ivan Laptev, Rahul Sukthankar, Bernard Ghanem, Andrew Zisserman, Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid, Shizhe Chen, Yida Zhao, Qin Jin, Kaixu Cui, Hui Liu, Chen Wang, Yudong Jiang, Xiaoshuai Hao

We present a new video understanding pentathlon challenge, an open competition held in conjunction with the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020. The objective of the challenge was to explore and evaluate new methods for text-to-video retrieval-the task of searching for content within a corpus of videos using natural language queries. This report summarizes the results of the first edition of the challenge together with the findings of the participants.

* Individual reports, dataset information, rules, and released source code can be found at the competition webpage (

  Access Paper or Ask Questions

To the problem of "The Instrumental complex for ontological engineering purpose" software system design

Feb 21, 2018
A. V. Palagin, N. G. Petrenko, V. Yu. Velychko, K. S. Malakhov, Yu. L. Tikhonov

The given work describes methodological principles of design instrumental complex of ontological purpose. Instrumental complex intends for the implementation of the integrated information technologies automated build of domain ontologies. Results focus on enhancing the effectiveness of the automatic analysis and understanding of natural-language texts, building of knowledge description of subject areas (primarily in the area of science and technology) and for interdisciplinary research in conjunction with the solution of complex problems.

* Problems in programming, 2012, (2-3), pp.289-298. PROBLEMS IN PROGRAMMING SCIENTIFIC JOURNAL , (2-3), pp.289-298 
* in Russian; updated "Bibliography" section for correct identification of references by the Google Scholar parser software; updated figures; 10 pages; 8 figures 

  Access Paper or Ask Questions

Improving Semantic Composition with Offset Inference

Apr 21, 2017
Thomas Kober, Julie Weeds, Jeremy Reffin, David Weir

Count-based distributional semantic models suffer from sparsity due to unobserved but plausible co-occurrences in any text collection. This problem is amplified for models like Anchored Packed Trees (APTs), that take the grammatical type of a co-occurrence into account. We therefore introduce a novel form of distributional inference that exploits the rich type structure in APTs and infers missing data by the same mechanism that is used for semantic composition.

* to appear at ACL 2017 (short papers) 

  Access Paper or Ask Questions

Generating Politically-Relevant Event Data

Sep 20, 2016
John Beieler

Automatically generated political event data is an important part of the social science data ecosystem. The approaches for generating this data, though, have remained largely the same for two decades. During this time, the field of computational linguistics has progressed tremendously. This paper presents an overview of political event data, including methods and ontologies, and a set of experiments to determine the applicability of deep neural networks to the extraction of political events from news text.

  Access Paper or Ask Questions

Evaluating Informal-Domain Word Representations With UrbanDictionary

Jun 27, 2016
Naomi Saphra, Adam Lopez

Existing corpora for intrinsic evaluation are not targeted towards tasks in informal domains such as Twitter or news comment forums. We want to test whether a representation of informal words fulfills the promise of eliding explicit text normalization as a preprocessing step. One possible evaluation metric for such domains is the proximity of spelling variants. We propose how such a metric might be computed and how a spelling variant dataset can be collected using UrbanDictionary.

  Access Paper or Ask Questions

A Sparse and Adaptive Prior for Time-Dependent Model Parameters

Nov 07, 2015
Dani Yogatama, Bryan R. Routledge, Noah A. Smith

We consider the scenario where the parameters of a probabilistic model are expected to vary over time. We construct a novel prior distribution that promotes sparsity and adapts the strength of correlation between parameters at successive timesteps, based on the data. We derive approximate variational inference procedures for learning and prediction with this prior. We test the approach on two tasks: forecasting financial quantities from relevant text, and modeling language contingent on time-varying financial measurements.

  Access Paper or Ask Questions

Ivan Franko's novel Dlja domashnjoho ohnyshcha (For the Hearth) in the light of the frequency dictionary

Jun 01, 2010
Solomiya Buk

In the article, the methodology and the principles of the compilation of the Frequency dictionary for Ivan Franko's novel Dlja domashnjoho ohnyshcha (For the Hearth) are described. The following statistical parameters of the novel vocabulary are obtained: variety, exclusiveness, concentration indexes, correlation between word rank and text coverage, etc. The main quantitative characteristics of Franko's novels Perekhresni stezhky (The Cross-Paths) and Dlja domashnjoho ohnyshcha are compared on the basis of their frequency dictionaries.

* 11 pages, in Ukrainian 

  Access Paper or Ask Questions

Moving Other Way: Exploring Word Mover Distance Extensions

Feb 08, 2022
Ilya Smirnov, Ivan P. Yamshchikov

The word mover's distance (WMD) is a popular semantic similarity metric for two texts. This position paper studies several possible extensions of WMD. We experiment with the frequency of words in the corpus as a weighting factor and the geometry of the word vector space. We validate possible extensions of WMD on six document classification datasets. Some proposed extensions show better results in terms of the k-nearest neighbor classification error than WMD.

  Access Paper or Ask Questions

Representing Inferences and their Lexicalization

Dec 14, 2021
David McDonald, James Pustejovsky

We have recently begun a project to develop a more effective and efficient way to marshal inferences from background knowledge to facilitate deep natural language understanding. The meaning of a word is taken to be the entities, predications, presuppositions, and potential inferences that it adds to an ongoing situation. As words compose, the minimal model in the situation evolves to limit and direct inference. At this point we have developed our computational architecture and implemented it on real text. Our focus has been on proving the feasibility of our design.

* Advances in Cognitive Systems 3 (2014) 143-162 
* 20 pages, 1 figure 

  Access Paper or Ask Questions