Activity Detection

63 papers with code • 1 benchmarks • 12 datasets

Detecting activities in extended videos.

Libraries

Use these libraries to find Activity Detection models and implementations

Online speaker diarization of meetings guided by speech separation

egruttadauria98/sspavaldo 30 Jan 2024

The results show that our system improves the state-of-the-art on the AMI headset mix, using no oracle information and under full evaluation (no collar and including overlapped speech).

16
30 Jan 2024

Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression

dystopians/cfoscraft 13 Dec 2023

This research contributes to the development of more efficient and automated image segmentation methods, advancing the understanding of neural function in neuroscience research.

0
13 Dec 2023

Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations

w-wu/steer 14 Aug 2023

Two metrics are proposed to evaluate AER performance with automatic segmentation based on time-weighted emotion and speaker classification errors.

7
14 Aug 2023

ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development

yairl/ivrit.ai 17 Jul 2023

We introduce "ivrit. ai", a comprehensive Hebrew speech dataset, addressing the distinct lack of extensive, high-quality resources for advancing Automated Speech Recognition (ASR) technology in Hebrew.

14
17 Jul 2023

Long-term Conversation Analysis: Exploring Utility and Privacy

ol-mega/ppca 28 Jun 2023

The analysis of conversations recorded in everyday life requires privacy protection.

2
28 Jun 2023

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

alibaba-damo-academy/FunASR 18 May 2023

FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications.

3,256
18 May 2023

Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking Listeners

chathukiket/evaluation-of-noise-reduction-methods 31 Mar 2023

Noise reduction is a crucial aspect of hearing aids, which researchers have been striving to address over the years.

0
31 Mar 2023

Token Turing Machines

google-research/scenic CVPR 2023

The model's memory module ensures that a new observation will only be processed with the contents of the memory (and not the entire history), meaning that it can efficiently process long sequences with a bounded computational cost at each step.

2,990
16 Nov 2022

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization

butspeechfit/eend 12 Nov 2022

End-to-end diarization presents an attractive alternative to standard cascaded diarization systems because a single system can handle all aspects of the task at once.

64
12 Nov 2022

SG-VAD: Stochastic Gates Based Speech Activity Detection

jsvir/vad 28 Oct 2022

Our key idea is to model VAD as a denoising task, and construct a network that is designed to identify nuisance features for a speech classification task.

14
28 Oct 2022