Text Categorization

41 papers with code • 0 benchmarks • 6 datasets

Text Categorization is the task of automatically assigning pre-defined categories to documents written in natural languages. Several types of Text Categorization have been studied, each of which deals with different types of documents and categories, such as topic categorization to detect discussed topics (e.g., sports, politics), spam detection, and sentiment classification to determine the sentiment typically in product or movie reviews.

Source: Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

Libraries

Use these libraries to find Text Categorization models and implementations

Improving Document Classification with Multi-Sense Embeddings

vgupta123/SCDV-MS 18 Nov 2019

Through extensive experiments on multiple real-world datasets, we show that SCDV-MS embeddings outperform previous state-of-the-art embeddings on multi-class and multi-label text categorization tasks.

14
18 Nov 2019

t-SS3: a text classifier with dynamic n-grams for early risk detection over text streams

sergioburdisso/pyss3 11 Nov 2019

SS3 was created to deal with ERD problems naturally since: it supports incremental training and classification over text streams, and it can visually explain its rationale.

331
11 Nov 2019

Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks

iesl/leopard COLING 2020

LEOPARD is trained with the state-of-the-art transformer architecture and shows better generalization to tasks not seen at all during training, with as few as 4 examples per label.

23
10 Nov 2019

Ensemble Quantile Classifier

CliffordLai/eqc 28 Oct 2019

It is also shown that the ensemble quantile classifier is Bayes optimal under suitable assumptions with asymmetric Laplace distribution inputs.

0
28 Oct 2019

Text Categorization by Learning Predominant Sense of Words as Auxiliary Task

ShimShim46/TRF_Multitask ACL 2019

Distributions of the senses of words are often highly skewed and give a strong influence of the domain of a document.

5
01 Jul 2019

Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

ht1221/leap-lstm 28 May 2019

Compared to previous models which can also skip words, our model achieves better trade-offs between performance and efficiency.

6
28 May 2019

Rep the Set: Neural Networks for Learning Set Representations

giannisnik/repset 3 Apr 2019

In several domains, data objects can be decomposed into sets of simpler objects.

26
03 Apr 2019

Learning Graph Pooling and Hybrid Convolutional Operations for Text Representations

HongyangGao/hConv-gPool-Net 21 Jan 2019

Another limitation of GCN when used on graph-based text representation tasks is that, GCNs do not consider the order information of nodes in graph.

27
21 Jan 2019

Structure-Aware Convolutional Neural Networks

vector-1127/SACNNs NeurIPS 2018

Convolutional neural networks (CNNs) are inherently subject to invariable filters that can only aggregate local inputs with the same topological structures.

25
01 Dec 2018