Document Classification
206 papers with code • 19 benchmarks • 15 datasets
Document Classification is a procedure of assigning one or more labels to a document from a predetermined set of labels.
Libraries
Use these libraries to find Document Classification models and implementationsDatasets
Most implemented papers
Geometric deep learning on graphs and manifolds using mixture model CNNs
Recently, there has been an increasing interest in geometric deep learning, attempting to generalize deep learning methods to non-Euclidean structured data such as graphs and manifolds, with a variety of applications from the domains of network analysis, computational social science, or computer graphics.
Learning to Skim Text
Recurrent Neural Networks are showing much promise in many sub-areas of natural language processing, ranging from document classification to machine translation to automatic question answering.
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Pretrained language models are promising particularly for low-resource languages as they only require unlabelled data.
Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles
In this paper, we model the problem of finding the relationship between two documents as a pairwise document classification task.
HDLTex: Hierarchical Deep Learning for Text Classification
This is because along with this growth in the number of documents has come an increase in the number of categories.
Combining Similarity Features and Deep Representation Learning for Stance Detection in the Context of Checking Fake News
Specifically, we use bi-directional Recurrent Neural Networks, together with max-pooling over the temporal/sequential dimension and neural attention, for representing (i) the headline, (ii) the first two sentences of the news article, and (iii) the entire news article.
DocBERT: BERT for Document Classification
We present, to our knowledge, the first application of BERT to document classification.
Multimodal deep networks for text and image-based document classification
Classification of document images is a critical step for archival of old manuscripts, online subscription and administrative procedures.
Hierarchical Transformers for Long Document Classification
BERT, which stands for Bidirectional Encoder Representations from Transformers, is a recently introduced language representation model based upon the transfer learning paradigm.