Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A novel text representation which enables image classifiers to perform text classification

Sep 27, 2019

Stephen M. Petrie, T'Mir D. Julius

Figure 1 for A novel text representation which enables image classifiers to perform text classification

Figure 2 for A novel text representation which enables image classifiers to perform text classification

Figure 3 for A novel text representation which enables image classifiers to perform text classification

Figure 4 for A novel text representation which enables image classifiers to perform text classification

Share this with someone who'll enjoy it:

Abstract:We introduce a novel method for converting text data into abstract image representations, which allows image-based processing techniques (e.g. image classification networks) to be applied to text-based comparison problems. We apply the technique to entity disambiguation of inventor names in US patents. The method involves converting text from each pairwise comparison between two inventor name records into a 2D RGB (stacked) image representation. We then train an image classification neural network to discriminate between such pairwise comparison images, and use the trained network to label each pair of records as either matched (same inventor) or non-matched (different inventors), obtaining highly accurate results (F1: 99.09%, precision: 99.41%, recall: 98.76%). Our new text-to-image representation method could potentially be used more broadly for other NLP comparison problems, such as disambiguation of academic publications, or for problems that require simultaneous classification of both text and images.

* Minor changes, with a shorter abstract and title, and with a figure, table, and some text moved to Appendices to make the main body shorter

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:A novel text representation which enables image classifiers to perform text classification

Paper and Code