Search Results for author: Toby Perrett

Found 16 papers, 10 papers with code

Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind

no code implementations • 7 Apr 2024 • Chiara Plizzari, Shubham Goel, Toby Perrett, Jacob Chalk, Angjoo Kanazawa, Dima Damen

As humans move around, performing their daily tasks, they are able to recall where they have positioned objects in their environment, even if these objects are currently out of sight.

Centre Stage: Centricity-based Audio-Visual Temporal Action Detection

1 code implementation • 28 Nov 2023 • Hanyuan Wang, Majid Mirmehdi, Dima Damen, Toby Perrett

Previous one-stage action detection approaches have modelled temporal dependencies using only the visual modality.

Action Detection

What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations

no code implementations • ICCV 2023 • Chiara Plizzari, Toby Perrett, Barbara Caputo, Dima Damen

We propose and address a new generalisation problem: can a model trained for action recognition successfully classify actions when they are performed within a previously unseen scenario and in a previously unseen location?

Action Recognition

Use Your Head: Improving Long-Tail Video Recognition

1 code implementation • CVPR 2023 • Toby Perrett, Saptarshi Sinha, Tilo Burghardt, Majid Mirmehdi, Dima Damen

We demonstrate that, unlike naturally-collected video datasets and existing long-tail image benchmarks, current video benchmarks fall short on multiple long-tailed properties.

Video Recognition

Refining Action Boundaries for One-stage Detection

1 code implementation • 25 Oct 2022 • Hanyuan Wang, Majid Mirmehdi, Dima Damen, Toby Perrett

We obtain state-of-the-art performance on the challenging EPIC-KITCHENS-100 action detection as well as the standard THUMOS14 action detection benchmarks, and achieve improvement on the ActivityNet-1.3 benchmark.

Action Detection

Inertial Hallucinations -- When Wearable Inertial Devices Start Seeing Things

no code implementations • 14 Jul 2022 • Alessandro Masullo, Toby Perrett, Tilo Burghardt, Ian Craddock, Dima Damen, Majid Mirmehdi

We propose a novel approach to multimodal sensor fusion for Ambient Assisted Living (AAL) which takes advantage of learning using privileged information (LUPI).

Hallucination, Sensor Fusion

An Evaluation of OCR on Egocentric Data

1 code implementation • 11 Jun 2022 • Valentin Popescu, Dima Damen, Toby Perrett

In this paper, we evaluate state-of-the-art OCR methods on Egocentric data.

Optical Character Recognition (OCR)

TVNet: Temporal Voting Network for Action Localization

1 code implementation • 2 Jan 2022 • Hanyuan Wang, Dima Damen, Majid Mirmehdi, Toby Perrett

This incorporates a novel Voting Evidence Module to locate temporal boundaries more accurately, where temporal contextual evidence is accumulated to predict frame-level probabilities of start and end action boundaries.

Action Localization

Temporal-Relational CrossTransformers for Few-Shot Action Recognition

2 code implementations • CVPR 2021 • Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen

We propose a novel approach to few-shot action recognition, finding temporally-corresponding frame tuples between the query and videos in the support set.

Ranked #3 on Few Shot Action Recognition on Something-Something-100

Few-Shot Action Recognition

Meta-Learning with Context-Agnostic Initialisations

1 code implementation • 29 Jul 2020 • Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen

This produces an initialisation for fine-tuning to target which is both context-agnostic and task-generalised.

Meta-Learning

Rescaling Egocentric Vision

7 code implementations • 23 Jun 2020 • Dima Damen, Hazel Doughty, Giovanni Maria Farinella, Antonino Furnari, Evangelos Kazakos, Jian Ma, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, Michael Wray

This paper introduces the pipeline to extend the largest dataset in egocentric vision, EPIC-KITCHENS.

Ranked #6 on Action Anticipation on EPIC-KITCHENS-100

Action Anticipation, Action Detection +4

The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines

2 code implementations • 29 Apr 2020 • Dima Damen, Hazel Doughty, Giovanni Maria Farinella, Sanja Fidler, Antonino Furnari, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, Michael Wray

Our dataset features 55 hours of video consisting of 11.5M frames, which we densely labelled for a total of 39.6K action segments and 454.2K object bounding boxes.

Object

Sit-to-Stand Analysis in the Wild using Silhouettes for Longitudinal Health Monitoring

no code implementations • 3 Oct 2019 • Alessandro Masullo, Tilo Burghardt, Toby Perrett, Dima Damen, Majid Mirmehdi

We present the first fully automated Sit-to-Stand or Stand-to-Sit (StS) analysis framework for long-term monitoring of patients in free-living environments using video silhouettes.

STS

DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition

no code implementations • CVPR 2019 • Toby Perrett, Dima Damen

Domain alignment in convolutional networks aims to learn the degree of layer-specific feature alignment beneficial to the joint learning of source and target datasets.

Action Recognition, Temporal Action Localization