"Topic": models, code, and papers

ParseTalk about Textual Ellipsis

Sep 28, 1995
Michael Strube, Udo Hahn

A hybrid methodology for the resolution of text-level ellipsis is presented in this paper. It incorporates conceptual proximity criteria applied to ontologically well-engineered domain knowledge bases and an approach to centering based on functional topic/comment patterns. We state text grammatical predicates for ellipsis and then turn to the procedural aspects of their evaluation within the framework of an actor-based implementation of a lexically distributed parser.

* RANLP 95: Proc. of the Intl. Conf. on Recent Advances in Natural Language Processing. Tzigov Chark, Bulgaria, Sep. 14-16 1995, pp.62-72. 
* 11 pages, uuencoded compressed PS file (see also Technical Report at:

Notes on Deep Learning Theory

Dec 10, 2020
Eugene A. Golikov

These are the notes for the lectures that I gave in Fall 2020 at the Moscow Institute of Physics and Technology (MIPT) and at the Yandex School of Data Analysis (YSDA). The notes cover some aspects of initialization, the loss landscape, generalization, and neural tangent kernel theory. While many other topics (e.g. expressivity, mean-field theory, the double descent phenomenon) are missing in the current version, we plan to add them in future revisions.

* 68 pages 

VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning

Oct 13, 2019
Ziqi Zhang, Yaya Shi, Jiutong Wei, Chunfeng Yuan, Bing Li, Weiming Hu

Multi-modal information is essential to describe what has happened in a video. In this work, we represent videos using appearance, motion, and audio information guided by the video topic. With a multi-stage training strategy, our experiments show steady and significant improvement on the VATEX benchmark. This report presents an overview and comparative analysis of our system, designed for both the Chinese and English tracks of the VATEX Captioning Challenge 2019.

Exploring Properties of Icosoku by Constraint Satisfaction Approach

Aug 16, 2019
Ke Liu, Sven Löffler, Petra Hofstedt

Icosoku is a challenging and interesting puzzle with a highly symmetrical and combinatorial nature. In this paper, we pose questions derived from the puzzle, but with greater difficulty and generality. We also present a constraint programming model for the proposed questions, which can provide the answers to our first two questions. The purpose of this paper is to share our preliminary results and problems to encourage researchers in both the group theory and constraint communities to consider this topic further.

The State and Future of Genetic Improvement

Jun 27, 2019
William B. Langdon, Westley Weimer, Christopher Timperley, Oliver Krauss, Zhen Yu Ding, Yiwei Lyu, Nicolas Chausseau, Eric Schulte, Shin Hwei Tan, Kevin Leach, Yu Huang, Gabin An

We report on the discussion session at the sixth international Genetic Improvement workshop, GI-2019 @ ICSE, which was held as part of the 41st ACM/IEEE International Conference on Software Engineering on Tuesday 28th May 2019. Topics included GI representations, the maintainability of evolved code, automated software testing, future areas of GI research such as co-evolution, and existing GI tools and benchmarks.

* University College London, Computer Science 

Energy saving for building heating via a simple and efficient model-free control design: First steps with computer simulations

Sep 06, 2017
Hassane Abouaïssa, Ola Alhaj Hasan, Cédric Join, Michel Fliess, Didier Defer

The model-based control of building heating systems for energy saving encounters severe physical, mathematical and calibration difficulties in the numerous attempts that have been published until now. This topic is addressed here via a new model-free control setting, where the need for any mathematical description disappears. Several convincing computer simulations are presented. Comparisons with classic PI controllers and flatness-based predictive control are provided.

* 21st International Conference on System Theory, Control and Computing, October 2017, Sinaia, Romania 

Anomaly detection in video with Bayesian nonparametrics

Jun 27, 2016
Olga Isupova, Danil Kuzin, Lyudmila Mihaylova

A novel dynamic Bayesian nonparametric topic model for anomaly detection in video is proposed in this paper. Batch and online Gibbs samplers are developed for inference. The paper introduces a new abnormality measure for decision making. The proposed method is evaluated on both synthetic and real data. The comparison with a non-dynamic model shows the superiority of the proposed dynamic one in terms of the classification performance for anomaly detection.

* 5 pages 

Clustering of scientific citations in Wikipedia

Jun 12, 2008
Finn Aarup Nielsen

The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template, which is primarily used for citations to articles in scientific journals. These citations can be extracted and analyzed: non-negative matrix factorization is performed on an (article x journal) matrix, resulting in a soft clustering of Wikipedia articles and scientific journals, with each cluster more or less representing a scientific topic.

* 7 pages; 2 figures, Wikimania 2008; Corrected typos 
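
The factorization step described in the abstract above can be illustrated with a minimal sketch. It assumes a tiny made-up (article x journal) count matrix and uses the standard Lee-Seung multiplicative updates for the squared-error objective; the paper's actual data, NMF variant, and number of clusters are not stated in the abstract, so the names `nmf`, `counts`, and `labels` and all values below are hypothetical.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def nmf(v, k, iterations=500, seed=0, eps=1e-9):
    """Factor a non-negative matrix v (n x m) into w (n x k) and h (k x m).

    Uses multiplicative updates, which keep every entry non-negative.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    # Strictly positive random start (hypothetical initialization choice).
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iterations):
        # h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        # w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return w, h


# Made-up citation counts: rows are articles, columns are journals.
# Articles 0-1 cite journals 0-1; articles 2-3 cite journals 2-3.
counts = [
    [4, 2, 0, 0],
    [2, 1, 0, 0],
    [0, 0, 3, 6],
    [0, 0, 1, 2],
]

w, h = nmf(counts, k=2)

# Rows of w give each article's soft membership in the k clusters;
# the largest entry can serve as a hard label when one is needed.
labels = [max(range(2), key=lambda c: row[c]) for row in w]
print(labels)
```

Rows of `w` describe articles and rows of `h` describe journals over the same clusters, which is what lets a single factorization group both at once.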

Recent Progress in Conversational AI

Apr 08, 2022
Zijun Xue, Ruirui Li, Mingda Li

Conversational artificial intelligence (AI) is becoming an increasingly popular topic in industry and academia. With the fast development of neural network-based models, many neural conversational AI systems have been developed. We provide a brief review of recent progress in conversational AI, including commonly adopted techniques, notable works, well-known competitions from academia and industry, and widely used datasets.

* 6 pages 

Bhāṣācitra: Visualising the dialect geography of South Asia

Jun 08, 2021
Aryaman Arora, Adam Farris, Gopalakrishnan R, Samopriya Basu

We present Bhāṣācitra, a dialect mapping system for South Asia built on a database of linguistic studies of languages of the region annotated for topic and location data. We analyse language coverage and look towards applications to typology by visualising example datasets. The application is not only meant to be useful for feature mapping, but also serves as a new kind of interactive bibliography for linguists of South Asian languages.

* 5 pages, 4 figures. To appear at LChange'21 workshop located at ACL 2021 

