no code implementations • 11 Apr 2024 • Xavier Alameda-Pineda, Angus Addlesee, Daniel Hernández García, Chris Reinke, Soraya Arias, Federica Arrigoni, Alex Auternaud, Lauriane Blavette, Cigdem Beyan, Luis Gomez Camara, Ohad Cohen, Alessandro Conti, Sébastien Dacunha, Christian Dondrup, Yoav Ellinson, Francesco Ferro, Sharon Gannot, Florian Gras, Nancie Gunson, Radu Horaud, Moreno D'Incà, Imad Kimouche, Séverin Lemaignan, Oliver Lemon, Cyril Liotard, Luca Marchionni, Mordehay Moradi, Tomas Pajdla, Maribel Pino, Michal Polic, Matthieu Py, Ariel Rado, Bin Ren, Elisa Ricci, Anne-Sophie Rigaud, Paolo Rota, Marta Romeo, Nicu Sebe, Weronika Sieińska, Pinchas Tandeitnik, Francesco Tonini, Nicolas Turro, Timothée Wintz, Yanchao Yu
Despite the many recent achievements in developing and deploying social robotics, there are still many underexplored environments and applications for which systematic evaluation of such systems by end-users is necessary.
no code implementations • 16 Aug 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
Compared to existing video modeling architectures for action anticipation, NAOGAT captures the relationship between objects and the global scene context in order to predict detections for the next active object and anticipate relevant future actions given these detections, leveraging the objects' dynamics to improve accuracy.
2 code implementations • ICCV 2023 • Francesco Tonini, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci
Gaze target detection aims to predict the image location where the person is looking and the probability that a gaze is out of the scene.
1 code implementation • 4 Jul 2023 • Anil Osman Tur, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci
This paper aims to address the unsupervised video anomaly detection (VAD) problem, which involves classifying each frame in a video as normal or abnormal, without any access to labels.
1 code implementation • 25 May 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge.
Ranked #1 on Short-term Object Interaction Anticipation on Ego4D
1 code implementation • 22 May 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
To this end, we propose a novel approach that applies a guided attention mechanism between the objects, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocentric videos.
no code implementations • 12 Apr 2023 • Anil Osman Tur, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci
This paper investigates the performance of diffusion models for video anomaly detection (VAD) within the most challenging but also the most operational scenario in which the data annotations are not used.
no code implementations • 13 Feb 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
This paper addresses the problem of anticipating the next-active-object location in the future, for a given egocentric video clip where the contact might happen, before any action takes place.
1 code implementation • 23 Aug 2022 • Francesco Tonini, Cigdem Beyan, Elisa Ricci
This paper addresses the gaze target detection problem in single images captured from the third-person perspective.
1 code implementation • 23 Jul 2022 • Riccardo Franceschini, Enrico Fini, Cigdem Beyan, Alessandro Conti, Federica Arrigoni, Elisa Ricci
Our method, as being based on contrastive loss between pairwise modalities, is the first attempt in MER literature.
Cultural Vocal Bursts Intensity Prediction Multimodal Emotion Recognition
no code implementations • 20 Jul 2022 • Cigdem Beyan, Alessandro Vinciarelli, Alessio Del Bue
Automated co-located human-human interaction analysis has been addressed by the use of nonverbal communication as measurable evidence of social and psychological phenomena.
1 code implementation • 21 Apr 2022 • Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue
This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition.
no code implementations • 6 May 2021 • Ömer Sümer, Cigdem Beyan, Fabian Ruth, Olaf Kramer, Ulrich Trautwein, Enkelejda Kasneci
One approach that can promote efficient development of presentation competence is the automated analysis of human behavior during a speech based on visual and audio features and machine learning.
1 code implementation • 21 Jun 2020 • Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue
This paper tackles the problem of human action recognition, defined as classifying which action is displayed in a trimmed sequence, from skeletal data.
Ranked #1 on Skeleton Based Action Recognition on MSR ActionPairs