The CVL Database is a public database for writer retrieval, writer identification and word spotting. The database consists of 7 different handwritten texts (1 German and 6 Englisch Texts). In total 310 writers participated in the dataset. 27 of which wrote 7 texts and 283 writers had to write 5 texts. For each text a rgb color image (300 dpi) comprising the handwritten text and the printed text sample is available as well as a cropped version (only handwritten). An unique id identifies the writer, whereas the Bounding Boxes for each single word are stored in an XML file.

The CVL-database consists of images with cursively handwritten german and english texts which has been choosen from literary works. All pages have a unique writer id and the text number (separated by a dash) at the upper right corner, followed by the printed sample text. The text is placed between two horizontal separatores. Beneath the printed text individuals have been asked to write the text using a ruled undersheet to prevent curled text lines. The layout follows the style of the IAM database. The database was updated on 12/09/2013 since one writer ID (265/266) was wrong. The version number was changed to 1.1.

Samples of the following texts have been used:

Edwin A. Abbot – Flatland: A Romance of Many Dimension (92 words).
William Shakespeare – Mac Beth (49 words).
Wikipedia – Mailüfterl (73 words, under CC Attribution-ShareALike License).
Charles Darwin – Origin of Species (52 words).
Johann Wolfgang von Goethe – Faust. Eine Tragödie (50 words).
Oscar Wilde – The Picture of Dorian Gray (66 words).
Edgar Allan Poe – The Fall of the House of Usher (78 words).

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages