Alert button
Picture for Anton Alekseev

Anton Alekseev

Alert button

Benchmarking Multilabel Topic Classification in the Kyrgyz Language

Aug 30, 2023
Anton Alekseev, Sergey I. Nikolenko, Gulnara Kabaeva

Kyrgyz is a very underrepresented language in terms of modern natural language processing resources. In this work, we present a new public benchmark for topic classification in Kyrgyz, introducing a dataset based on collected and annotated data from the news site 24.KG and presenting several baseline models for news classification in the multilabel setting. We train and evaluate both classical statistical and neural models, reporting the scores, discussing the results, and proposing directions for future work.

* Accepted to AIST 2023 
Viaarxiv icon

Machine Learning for SAT: Restricted Heuristics and New Graph Representations

Jul 18, 2023
Mikhail Shirokikh, Ilya Shenbin, Anton Alekseev, Sergey Nikolenko

Boolean satisfiability (SAT) is a fundamental NP-complete problem with many applications, including automated planning and scheduling. To solve large instances, SAT solvers have to rely on heuristics, e.g., choosing a branching variable in DPLL and CDCL solvers. Such heuristics can be improved with machine learning (ML) models; they can reduce the number of steps but usually hinder the running time because useful models are relatively large and slow. We suggest the strategy of making a few initial steps with a trained ML model and then releasing control to classical heuristics; this simplifies cold start for SAT solving and can decrease both the number of steps and overall runtime, but requires a separate decision of when to release control to the solver. Moreover, we introduce a modification of Graph-Q-SAT tailored to SAT problems converted from other domains, e.g., open shop scheduling problems. We validate the feasibility of our approach with random and industrial SAT problems.

Viaarxiv icon

DetIE: Multilingual Open Information Extraction Inspired by Object Detection

Jun 24, 2022
Michael Vasilkovsky, Anton Alekseev, Valentin Malykh, Ilya Shenbin, Elena Tutubalina, Dmitriy Salikhov, Mikhail Stepnov, Andrey Chertok, Sergey Nikolenko

Figure 1 for DetIE: Multilingual Open Information Extraction Inspired by Object Detection
Figure 2 for DetIE: Multilingual Open Information Extraction Inspired by Object Detection
Figure 3 for DetIE: Multilingual Open Information Extraction Inspired by Object Detection
Figure 4 for DetIE: Multilingual Open Information Extraction Inspired by Object Detection

State of the art neural methods for open information extraction (OpenIE) usually extract triplets (or tuples) iteratively in an autoregressive or predicate-based manner in order not to produce duplicates. In this work, we propose a different approach to the problem that can be equally or more successful. Namely, we present a novel single-pass method for OpenIE inspired by object detection algorithms from computer vision. We use an order-agnostic loss based on bipartite matching that forces unique predictions and a Transformer-based encoder-only architecture for sequence labeling. The proposed approach is faster and shows superior or similar performance in comparison with state of the art models on standard benchmarks in terms of both quality metrics and inference time. Our model sets the new state of the art performance of 67.7% F1 on CaRB evaluated as OIE2016 while being 3.35x faster at inference than previous state of the art. We also evaluate the multilingual version of our model in the zero-shot setting for two languages and introduce a strategy for generating synthetic multilingual data to fine-tune the model for each specific language. In this setting, we show performance improvement 15% on multilingual Re-OIE2016, reaching 75% F1 for both Portuguese and Spanish languages. Code and models are available at https://github.com/sberbank-ai/DetIE.

* Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22) 
Viaarxiv icon

Near-Zero-Shot Suggestion Mining with a Little Help from WordNet

Nov 25, 2021
Anton Alekseev, Elena Tutubalina, Sejeong Kwon, Sergey Nikolenko

Figure 1 for Near-Zero-Shot Suggestion Mining with a Little Help from WordNet
Figure 2 for Near-Zero-Shot Suggestion Mining with a Little Help from WordNet
Figure 3 for Near-Zero-Shot Suggestion Mining with a Little Help from WordNet
Figure 4 for Near-Zero-Shot Suggestion Mining with a Little Help from WordNet

In this work, we explore the constructive side of online reviews: advice, tips, requests, and suggestions that users provide about goods, venues, services, and other items of interest. To reduce training costs and annotation efforts needed to build a classifier for a specific label set, we present and evaluate several entailment-based zero-shot approaches to suggestion classification in a label-fully-unseen fashion. In particular, we introduce the strategy of assigning target class labels to sentences in English language with user intentions, which significantly improves prediction quality. The proposed strategies are evaluated with a comprehensive experimental study that validated our results both quantitatively and qualitatively.

* Accepted to the 10th International Conference on Analysis of Images, Social Networks and Texts (AIST 2021) 
Viaarxiv icon

Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

Jun 17, 2020
Anton Alekseev, Elena Tutubalina, Valentin Malykh, Sergey Nikolenko

Figure 1 for Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification
Figure 2 for Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification
Figure 3 for Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification
Figure 4 for Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

Deep learning architectures based on self-attention have recently achieved and surpassed state of the art results in the task of unsupervised aspect extraction and topic modeling. While models such as neural attention-based aspect extraction (ABAE) have been successfully applied to user-generated texts, they are less coherent when applied to traditional data sources such as news articles and newsgroup documents. In this work, we introduce a simple approach based on sentence filtering in order to improve topical aspects learned from newsgroups-based content without modifying the basic mechanism of ABAE. We train a probabilistic classifier to distinguish between out-of-domain texts (outer dataset) and in-domain texts (target dataset). Then, during data preparation we filter out sentences that have a low probability of being in-domain and train the neural model on the remaining sentences. The positive effect of sentence filtering on topic coherence is demonstrated in comparison to aspect extraction models trained on unfiltered texts.

* Journal of Intelligent & Fuzzy Systems, pre-press, https://content.iospress.com/articles/journal-of-intelligent-and-fuzzy-systems/ifs179908 
Viaarxiv icon

RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback

Dec 24, 2019
Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, Sergey I. Nikolenko

Figure 1 for RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback
Figure 2 for RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback
Figure 3 for RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback
Figure 4 for RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback

Recent research has shown the advantages of using autoencoders based on deep neural networks for collaborative filtering. In particular, the recently proposed Mult-VAE model, which used the multinomial likelihood variational autoencoders, has shown excellent results for top-N recommendations. In this work, we propose the Recommender VAE (RecVAE) model that originates from our research on regularization techniques for variational autoencoders. RecVAE introduces several novel ideas to improve Mult-VAE, including a novel composite prior distribution for the latent codes, a new approach to setting the $\beta$ hyperparameter for the $\beta$-VAE framework, and a new approach to training based on alternating updates. In experimental evaluation, we show that RecVAE significantly outperforms previously proposed autoencoder-based models, including Mult-VAE and RaCT, across classical collaborative filtering datasets, and present a detailed ablation study to assess our new developments. Code and models are available at https://github.com/ilya-shenbin/RecVAE.

* In The Thirteenth ACM International Conference on Web Search and Data Mining (WSDM '20), February 3-7, 2020, Houston, TX, USA. ACM, New York, NY, USA, 9 pages 
Viaarxiv icon

AspeRa: Aspect-based Rating Prediction Model

Jan 23, 2019
Sergey I. Nikolenko, Elena Tutubalina, Valentin Malykh, Ilya Shenbin, Anton Alekseev

Figure 1 for AspeRa: Aspect-based Rating Prediction Model
Figure 2 for AspeRa: Aspect-based Rating Prediction Model

We propose a novel end-to-end Aspect-based Rating Prediction model (AspeRa) that estimates user rating based on review texts for the items and at the same time discovers coherent aspects of reviews that can be used to explain predictions or profile users. The AspeRa model uses max-margin losses for joint item and user embedding learning and a dual-headed architecture; it significantly outperforms recently proposed state-of-the-art models such as DeepCoNN, HFT, NARRE, and TransRev on two real world data sets of user reviews. With qualitative examination of the aspects and quantitative evaluation of rating prediction models based on these aspects, we show how aspect embeddings can be used in a recommender system.

* accepted to ECIR 2019 
Viaarxiv icon