Text Categorization

41 papers with code • 0 benchmarks • 6 datasets

Text Categorization is the task of automatically assigning pre-defined categories to documents written in natural languages. Several types of Text Categorization have been studied, each of which deals with different types of documents and categories, such as topic categorization to detect discussed topics (e.g., sports, politics), spam detection, and sentiment classification to determine the sentiment typically in product or movie reviews.

Source: Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

Libraries

Use these libraries to find Text Categorization models and implementations

Most implemented papers

Revisiting neural relation classification in clinical notes with external information

SimonSuster/seg_cnn WS 2018

Recently, segment convolutional neural networks have been proposed for end-to-end relation extraction in the clinical domain, achieving results comparable to or outperforming the approaches with heavy manual feature engineering.

Structure-Aware Convolutional Neural Networks

vector-1127/SACNNs NeurIPS 2018

Convolutional neural networks (CNNs) are inherently subject to invariable filters that can only aggregate local inputs with the same topological structures.

Learning Graph Pooling and Hybrid Convolutional Operations for Text Representations

HongyangGao/hConv-gPool-Net 21 Jan 2019

Another limitation of GCN when used on graph-based text representation tasks is that, GCNs do not consider the order information of nodes in graph.

Rep the Set: Neural Networks for Learning Set Representations

giannisnik/repset 3 Apr 2019

In several domains, data objects can be decomposed into sets of simpler objects.

Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

ht1221/leap-lstm 28 May 2019

Compared to previous models which can also skip words, our model achieves better trade-offs between performance and efficiency.

Text Categorization by Learning Predominant Sense of Words as Auxiliary Task

ShimShim46/TRF_Multitask ACL 2019

Distributions of the senses of words are often highly skewed and give a strong influence of the domain of a document.

Ensemble Quantile Classifier

CliffordLai/eqc 28 Oct 2019

It is also shown that the ensemble quantile classifier is Bayes optimal under suitable assumptions with asymmetric Laplace distribution inputs.

t-SS3: a text classifier with dynamic n-grams for early risk detection over text streams

sergioburdisso/pyss3 11 Nov 2019

SS3 was created to deal with ERD problems naturally since: it supports incremental training and classification over text streams, and it can visually explain its rationale.

Improving Document Classification with Multi-Sense Embeddings

vgupta123/SCDV-MS 18 Nov 2019

Through extensive experiments on multiple real-world datasets, we show that SCDV-MS embeddings outperform previous state-of-the-art embeddings on multi-class and multi-label text categorization tasks.