2 code implementations • 4 Apr 2024 • Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Fabrizio Falchi
Modern applications increasingly demand flexible computer vision models that adapt to novel concepts not encountered during training.
1 code implementation • 29 Nov 2023 • Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi
Recent advancements in large vision-language models enabled visual object detection in open-vocabulary scenarios, where object classes are defined in free-text formats during inference.
1 code implementation • 25 May 2023 • Nicola Messina, Jan Sedmidubsky, Fabrizio Falchi, Tomáš Rebok
Due to recent advances in pose-estimation methods, human motion can be extracted from a common video in the form of 3D skeleton sequences.
no code implementations • 26 Apr 2023 • Paweł Foszner, Agnieszka Szczęsna, Luca Ciampi, Nicola Messina, Adam Cygan, Bartosz Bizoń, Michał Cogiel, Dominik Golba, Elżbieta Macioszek, Michał Staniszewski
Generally, crowd datasets can be collected or generated from real or synthetic sources.
no code implementations • 11 Apr 2023 • Paweł Foszner, Agnieszka Szczęsna, Luca Ciampi, Nicola Messina, Adam Cygan, Bartosz Bizoń, Michał Cogiel, Dominik Golba, Elżbieta Macioszek, Michał Staniszewski
Data scarcity has become one of the main obstacles to developing supervised models based on Artificial Intelligence in Computer Vision.
no code implementations • 4 Nov 2022 • Fabio Carrara, Fabrizio Falchi, Maria Girardi, Nicola Messina, Cristina Padovani, Daniele Pellegrini
Thanks to recent advancements in numerical methods, computer power, and monitoring technology, seismic ambient noise provides precious information about the structural behavior of old buildings.
no code implementations • 24 Aug 2022 • Marco Avvenuti, Marco Bongiovanni, Luca Ciampi, Fabrizio Falchi, Claudio Gennaro, Nicola Messina
Automatic people counting from images has recently drawn attention for urban monitoring in modern Smart Cities due to the ubiquity of surveillance camera networks.
1 code implementation • 29 Jul 2022 • Nicola Messina, Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, Fabrizio Falchi, Giuseppe Amato, Rita Cucchiara
In literature, this task is often used as a pre-training objective to forge architectures able to jointly deal with images and texts.
Ranked #22 on Cross-Modal Retrieval on COCO 2014
2 code implementations • 21 Jun 2022 • Nicola Messina, Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi
With the increased accessibility of web and online encyclopedias, the amount of data to manage is constantly increasing.
no code implementations • 29 Nov 2021 • Nicola Messina, Giuseppe Amato, Fabio Carrara, Claudio Gennaro, Fabrizio Falchi
In the end, this study can lay the basis for a deeper understanding of the role of attention and recurrent connections for solving visual abstract reasoning tasks.
1 code implementation • 22 Nov 2021 • Davide Coccomini, Nicola Messina, Claudio Gennaro, Fabrizio Falchi
Space exploration has always been a source of inspiration for humankind, and thanks to modern telescopes, it is now possible to observe celestial bodies far away from us.
1 code implementation • SEMEVAL 2021 • Nicola Messina, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
This paper describes the system used by the AIMH Team to approach the SemEval Task 6.
3 code implementations • 6 Jul 2021 • Davide Coccomini, Nicola Messina, Claudio Gennaro, Fabrizio Falchi
Traditionally, Convolutional Neural Networks (CNNs) have been used to perform video deepfake detection, with the best results obtained using methods based on EfficientNet B7.
Ranked #1 on DeepFake Detection on DFDC (using extra training data)
no code implementations • 1 Jun 2021 • Nicola Messina, Giuseppe Amato, Fabrizio Falchi, Claudio Gennaro, Stéphane Marchand-Maillet
It is designed for producing fixed-size 1024-d vectors describing whole images and sentences, as well as variable-length sets of 1024-d vectors describing the various building components of the two modalities (image regions and sentence words respectively).
no code implementations • 22 Jan 2021 • Nicola Messina, Giuseppe Amato, Fabio Carrara, Claudio Gennaro, Fabrizio Falchi
With the experiments carried out in this work, we demonstrate that residual connections, and more generally the skip connections, seem to have only a marginal impact on the learning of the proposed problems.
1 code implementation • 12 Aug 2020 • Nicola Messina, Giuseppe Amato, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Stéphane Marchand-Maillet
In this work, we tackle the task of cross-modal retrieval through image-sentence matching based on word-region alignments, using supervision only at the global image-sentence level.
Ranked #6 on Image Retrieval on Flickr30K 1K test
1 code implementation • 20 Apr 2020 • Nicola Messina, Fabrizio Falchi, Andrea Esuli, Giuseppe Amato
State-of-the-art results in image-text matching are achieved by inter-playing image and text features from the two different processing pipelines, usually using mutual attention mechanisms.
no code implementations • 9 Jan 2020 • Luca Ciampi, Nicola Messina, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
Furthermore, we demonstrate that with our Domain Adaptation techniques, we can reduce the Synthetic2Real Domain Shift, making closer the two domains and obtaining a performance improvement when testing the network over the real-world images.