Multi-Modal Document Classification
4 papers with code • 5 benchmarks • 6 datasets
Most implemented papers
Are These Birds Similar: Learning Branched Networks for Fine-grained Representations
In recent years, natural language descriptions are used to obtain information on discriminative parts of the object.
Message Passing Attention Networks for Document Understanding
In this paper, we represent documents as word co-occurrence networks and propose an application of the message passing framework to NLP, the Message Passing Attention network for Document understanding (MPAD).
Improving accuracy and speeding up Document Image Classification through parallel systems
This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions.
Image and Text fusion for UPMC Food-101 \\using BERT and CNNs
The modern digital world is becoming more and more multimodal.