Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Imed Keraghel

Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Dec 19, 2024

Imed Keraghel, Mohamed Nadif

Figure 1 for Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Figure 2 for Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Figure 3 for Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Figure 4 for Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Abstract:Recent advances in machine learning, particularly Large Language Models (LLMs) such as BERT and GPT, provide rich contextual embeddings that improve text representation. However, current document clustering approaches often ignore the deeper relationships between named entities (NEs) and the potential of LLM embeddings. This paper proposes a novel approach that integrates Named Entity Recognition (NER) and LLM embeddings within a graph-based framework for document clustering. The method builds a graph with nodes representing documents and edges weighted by named entity similarity, optimized using a graph-convolutional network (GCN). This ensures a more effective grouping of semantically related documents. Experimental results indicate that our approach outperforms conventional co-occurrence-based methods in clustering, notably for documents rich in named entities.

* 11 pages, 4 figures

Via

Access Paper or Ask Questions

A survey on recent advances in named entity recognition

Jan 19, 2024

Imed Keraghel, Stanislas Morbieu, Mohamed Nadif

Abstract:Named Entity Recognition seeks to extract substrings within a text that name real-world objects and to determine their type (for example, whether they refer to persons or organizations). In this survey, we first present an overview of recent popular approaches, but we also look at graph- and transformer- based methods including Large Language Models (LLMs) that have not had much coverage in other surveys. Second, we focus on methods designed for datasets with scarce annotations. Third, we evaluate the performance of the main NER implementations on a variety of datasets with differing characteristics (as regards their domain, their size, and their number of classes). We thus provide a deep comparison of algorithms that are never considered together. Our experiments shed some light on how the characteristics of datasets affect the behavior of the methods that we compare.

* 30 pages

Via

Access Paper or Ask Questions