Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

"Topic": models, code, and papers

UCB Algorithm for Exponential Distributions

Apr 07, 2012
Wassim Jouini, Christophe Moy

We introduce in this paper a new algorithm for Multi-Armed Bandit (MAB) problems. A machine learning paradigm popular within Cognitive Network related topics (e.g., Spectrum Sensing and Allocation). We focus on the case where the rewards are exponentially distributed, which is common when dealing with Rayleigh fading channels. This strategy, named Multiplicative Upper Confidence Bound (MUCB), associates a utility index to every available arm, and then selects the arm with the highest index. For every arm, the associated index is equal to the product of a multiplicative factor by the sample mean of the rewards collected by this arm. We show that the MUCB policy has a low complexity and is order optimal.

* 10 pages. Introduces Multiplicative Upper Confidence Bound (MUCB) algorithms for Multi-Armed Bandit problems 

  Access Paper or Ask Questions

Text Classification: A Sequential Reading Approach

Aug 29, 2011
Gabriel Dulac-Arnold, Ludovic Denoyer, Patrick Gallinari

We propose to model the text classification process as a sequential decision process. In this process, an agent learns to classify documents into topics while reading the document sentences sequentially and learns to stop as soon as enough information was read for deciding. The proposed algorithm is based on a modelisation of Text Classification as a Markov Decision Process and learns by using Reinforcement Learning. Experiments on four different classical mono-label corpora show that the proposed approach performs comparably to classical SVM approaches for large training sets, and better for small training sets. In addition, the model automatically adapts its reading process to the quantity of training information provided.

* Lecture Notes in Computer Science, 2011, Volume 6611/2011, 411-423 
* ECIR2011 

  Access Paper or Ask Questions

System Dynamics Modelling of the Processes Involving the Maintenance of the Naive T Cell Repertoire

Apr 29, 2010
Grazziela P. Figueredo, Uwe Aickelin, Amanda Whitbrook

The study of immune system aging, i.e. immunosenescence, is a relatively new research topic. It deals with understanding the processes of immunodegradation that indicate signs of functionality loss possibly leading to death. Even though it is not possible to prevent immunosenescence, there is great benefit in comprehending its causes, which may help to reverse some of the damage done and thus improve life expectancy. One of the main factors influencing the process of immunosenescence is the number and phenotypical variety of naive T cells in an individual. This work presents a review of immunosenescence, proposes system dynamics modelling of the processes involving the maintenance of the naive T cell repertoire and presents some preliminary results.

* Proceedings of the 9th Annual Workshop on Computational Intelligence (UKCI 2009), Nottingham, UK, p13-18, 
* 6 pages, 2 figures, 1 table, 9th Annual Workshop on Computational Intelligence (UKCI 2009), Nottingham, UK 

  Access Paper or Ask Questions

Kannada Character Recognition System A Review

Jan 29, 2010
K. Indira, S. Sethu Selvi

Intensive research has been done on optical character recognition ocr and a large number of articles have been published on this topic during the last few decades. Many commercial OCR systems are now available in the market, but most of these systems work for Roman, Chinese, Japanese and Arabic characters. There are no sufficient number of works on Indian language character recognition especially Kannada script among 12 major scripts in India. This paper presents a review of existing work on printed Kannada script and their results. The characteristics of Kannada script and Kannada Character Recognition System kcr are discussed in detail. Finally fusion at the classifier level is proposed to increase the recognition accuracy.

* 12 pages, 8 figures 

  Access Paper or Ask Questions

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

Apr 07, 2022
Zhihan Zhang, Wenhao Yu, Mengxia Yu, Zhichun Guo, Meng Jiang

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences. Nevertheless, it is still not understood very well how multi-task learning can be implemented based on the relatedness of training tasks. In this survey, we review recent advances of multi-task learning methods in NLP, with the aim of summarizing them into two general multi-task training methods based on their task relatedness: (i) joint training and (ii) multi-step training. We present examples in various NLP downstream applications, summarize the task relationships and discuss future directions of this promising topic.

  Access Paper or Ask Questions

Neural Forecasting of the Italian Sovereign Bond Market with Economic News

Mar 11, 2022
Sergio Consoli, Luca Tiozzo Pezzoli, Elisa Tosetti

In this paper we employ economic news within a neural network framework to forecast the Italian 10-year interest rate spread. We use a big, open-source, database known as Global Database of Events, Language and Tone to extract topical and emotional news content linked to bond markets dynamics. We deploy such information within a probabilistic forecasting framework with autoregressive recurrent networks (DeepAR). Our findings suggest that a deep learning network based on Long-Short Term Memory cells outperforms classical machine learning techniques and provides a forecasting performance that is over and above that obtained by using conventional determinants of interest rates alone.

* Journal of the Royal Statistical Society - Series A (Statistics in Society), 2022 
* 24 pages, 8 figures, in press 

  Access Paper or Ask Questions

Corpus for Automatic Structuring of Legal Documents

Jan 31, 2022
Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi

In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated with a label coming from a list of pre-defined Rhetorical Roles. We develop baseline models for automatically predicting rhetorical roles in a legal document based on the annotated corpus. Further, we show the application of rhetorical roles to improve performance on the tasks of summarization and legal judgment prediction. We release the corpus and baseline model code along with the paper.

* 10 Pages (8 page main paper + 2 page references) 

  Access Paper or Ask Questions

A Brief History of Updates of Answer-Set Programs

Dec 27, 2021
João Leite, Martin Slota

Over the last couple of decades, there has been a considerable effort devoted to the problem of updating logic programs under the stable model semantics (a.k.a. answer-set programs) or, in other words, the problem of characterising the result of bringing up-to-date a logic program when the world it describes changes. Whereas the state-of-the-art approaches are guided by the same basic intuitions and aspirations as belief updates in the context of classical logic, they build upon fundamentally different principles and methods, which have prevented a unifying framework that could embrace both belief and rule updates. In this paper, we will overview some of the main approaches and results related to answer-set programming updates, while pointing out some of the main challenges that research in this topic has faced.

* Under consideration in Theory and Practice of Logic Programming (TPLP) 

  Access Paper or Ask Questions

A Bibliometric Analysis of the BPM Conference Using Computational Data Analytics

Nov 18, 2021
Fabian Muff, Felix Härer, Hans-Georg Fill

The BPM conference has a long tradition as the premier venue for publishing research on business process management. For exploring the evolution of research topics, we present the findings from a computational bibliometric analysis of the BPM conference proceedings from the past 15 years. We used the publicly available DBLP dataset as a basis for the analysis, which we enriched with data from websites and databases of the relevant publishers. In addition to a detailed analysis of the publication results, we performed a content-based analysis of over 1,200 papers from the BPM conference and its workshops using Latent Dirichlet Allocation. This offers insights into historical developments in Business Process Management research and provides the community with potential future prospects.

  Access Paper or Ask Questions

Improving Large-scale Language Models and Resources for Filipino

Nov 11, 2021
Jan Christian Blaise Cruz, Charibeth Cheng

In this paper, we improve on existing language resources for the low-resource Filipino language in two ways. First, we outline the construction of the TLUnified dataset, a large-scale pretraining corpus that serves as an improvement over smaller existing pretraining datasets for the language in terms of scale and topic variety. Second, we pretrain new Transformer language models following the RoBERTa pretraining technique to supplant existing models trained with small corpora. Our new RoBERTa models show significant improvements over existing Filipino models in three benchmark datasets with an average gain of 4.47% test accuracy across the three classification tasks of varying difficulty.

* Resources are available at 

  Access Paper or Ask Questions