Form Understanding in Noisy Scanned Documents (FUNSD) comprises 199 real, fully annotated, scanned forms. The documents are noisy and vary widely in appearance, making form understanding (FoUn) a challenging task. The proposed dataset can be used for various tasks, including text detection, optical character recognition, spatial layout analysis, and entity labeling/linking.
142 PAPERS • 3 BENCHMARKS
LabPics Chemistry Dataset
5 PAPERS • NO BENCHMARKS YET
Green family of datasets for emergent communications on relations.
1 PAPER • NO BENCHMARKS YET