Search Results for author: Jorge Calvo-Zaragoza

Found 18 papers, 6 papers with code

Align, Minimize and Diversify: A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition

no code implementations28 Apr 2024 María Alfaro-Contreras, Jorge Calvo-Zaragoza

This paper serves to introduce the Align, Minimize and Diversify (AMD) method, a Source-Free Unsupervised Domain Adaptation approach for Handwritten Text Recognition (HTR).

Handwritten Text Recognition HTR +1

Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition

no code implementations17 Apr 2024 Carlos Penarrubia, Carlos Garrido-Munoz, Jose J. Valero-Mas, Jorge Calvo-Zaragoza

Handwritten Text Recognition (HTR) is a relevant problem in computer vision, and implies unique challenges owing to its inherent variability and the rich contextualization required for its interpretation.

Handwritten Text Recognition HTR +1

Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription

1 code implementation12 Feb 2024 Antonio Ríos-Vila, Jorge Calvo-Zaragoza, Thierry Paquet

State-of-the-art end-to-end Optical Music Recognition (OMR) has, to date, primarily been carried out using monophonic transcription techniques to handle complex score layouts, such as polyphony, often by resorting to simplifications or specific adaptations.

Proceedings of the 5th International Workshop on Reading Music Systems

1 code implementation7 Nov 2023 Jorge Calvo-Zaragoza, Alexander Pacha, Elona Shatri

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists.

Information Retrieval Music Information Retrieval +1

Image Transformation Sequence Retrieval with General Reinforcement Learning

no code implementations13 Jul 2023 Enrique Mas-Candela, Antonio Ríos-Vila, Jorge Calvo-Zaragoza

In this work, the novel Image Transformation Sequence Retrieval (ITSR) task is presented, in which a model must retrieve the sequence of transformations between two given images that act as source and target, respectively.

Model-based Reinforcement Learning reinforcement-learning +1

End-to-End Page-Level Assessment of Handwritten Text Recognition

1 code implementation14 Jan 2023 Enrique Vidal, Alejandro H. Toselli, Antonio Ríos-Vila, Jorge Calvo-Zaragoza

The evaluation of Handwritten Text Recognition (HTR) systems has traditionally used metrics based on the edit distance between HTR and ground truth (GT) transcripts, at both the character and word levels.

Handwritten Text Recognition HTR

Proceedings of the 1st International Workshop on Reading Music Systems

no code implementations1 Dec 2022 Jorge Calvo-Zaragoza, Jan Hajič jr., Alexander Pacha

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists.

Information Retrieval Music Information Retrieval +1

Proceedings of the 3rd International Workshop on Reading Music Systems

no code implementations1 Dec 2022 Jorge Calvo-Zaragoza, Alexander Pacha

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists.

Information Retrieval Music Information Retrieval +1

Proceedings of the 2nd International Workshop on Reading Music Systems

no code implementations1 Dec 2022 Jorge Calvo-Zaragoza, Alexander Pacha

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists.

Information Retrieval Music Information Retrieval +1

Proceedings of the 4th International Workshop on Reading Music Systems

no code implementations23 Nov 2022 Jorge Calvo-Zaragoza, Alexander Pacha, Elona Shatri

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists.

Information Retrieval Music Information Retrieval +1

Late multimodal fusion for image and audio music transcription

no code implementations6 Apr 2022 María Alfaro-Contreras, Jose J. Valero-Mas, José M. Iñesta, Jorge Calvo-Zaragoza

When addressing this challenge in computational terms, the MIR community follows two lines of research: music documents, which is the case of Optical Music Recognition (OMR), or audio recordings, which is the case of Automatic Music Transcription (AMT).

Information Retrieval Music Information Retrieval +2

Region-based Layout Analysis of Music Score Images

no code implementations11 Jan 2022 Francisco J. Castellanos, Carlos Garrido-Munoz, Antonio Ríos-Vila, Jorge Calvo-Zaragoza

The Layout Analysis (LA) stage is of vital importance to the correct performance of an Optical Music Recognition (OMR) system.

Synthetic Data Generation

Unsupervised Neural Domain Adaptation for Document Image Binarization

no code implementations2 Dec 2020 Francisco J. Castellanos, Antonio-Javier Gallego, Jorge Calvo-Zaragoza

These techniques take advantage of the knowledge learned in one domain, for which labeled data are available, to apply it to other domains for which there are no labeled data.

Binarization Domain Adaptation

Incremental Unsupervised Domain-Adversarial Training of Neural Networks

1 code implementation13 Jan 2020 Antonio-Javier Gallego, Jorge Calvo-Zaragoza, Robert B. Fisher

In the context of supervised statistical learning, it is typically assumed that the training set comes from the same distribution that draws the test samples.

Unsupervised Domain Adaptation

A holistic approach to polyphonic music transcription with neural networks

1 code implementation26 Oct 2019 Miguel A. Román, Antonio Pertusa, Jorge Calvo-Zaragoza

We present a framework based on neural networks to extract music scores directly from polyphonic audio in an end-to-end fashion.

Music Transcription Quantization

Understanding Optical Music Recognition

no code implementations7 Aug 2019 Jorge Calvo-Zaragoza, Jan Hajič jr., Alexander Pacha

For over 50 years, researchers have been trying to teach computers to read music notation, referred to as Optical Music Recognition (OMR).

Cannot find the paper you are looking for? You can Submit a new open access paper.