Activity Detection

63 papers with code • 1 benchmarks • 12 datasets

Detecting activities in extended videos.

Benchmarks

Add a Result

These leaderboards are used to track progress in Activity Detection

Trend	Dataset	Best Model	Paper	Code	Compare
	AVA-Speech	CNN-BiLSTM_best			See all

Libraries

Use these libraries to find Activity Detection models and implementations

alibaba-damo-academy/FunASR

3 papers

3,256

Datasets

Latest papers

Most implemented Social Latest No code

Online speaker diarization of meetings guided by speech separation

egruttadauria98/sspavaldo • • 30 Jan 2024

The results show that our system improves the state-of-the-art on the AMI headset mix, using no oracle information and under full evaluation (no collar and including overlapped speech).

30 Jan 2024

Paper
Code

Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression

dystopians/cfoscraft • • 13 Dec 2023

This research contributes to the development of more efficient and automated image segmentation methods, advancing the understanding of neural function in neuroscience research.

13 Dec 2023

Paper
Code

Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations

w-wu/steer • • 14 Aug 2023

Two metrics are proposed to evaluate AER performance with automatic segmentation based on time-weighted emotion and speaker classification errors.

14 Aug 2023

Paper
Code

ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development

yairl/ivrit.ai • • 17 Jul 2023

We introduce "ivrit. ai", a comprehensive Hebrew speech dataset, addressing the distinct lack of extensive, high-quality resources for advancing Automated Speech Recognition (ASR) technology in Hebrew.

17 Jul 2023

Paper
Code

Long-term Conversation Analysis: Exploring Utility and Privacy

ol-mega/ppca • • 28 Jun 2023

The analysis of conversations recorded in everyday life requires privacy protection.

28 Jun 2023

Paper
Code

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

alibaba-damo-academy/FunASR • • 18 May 2023

FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications.

3,256

18 May 2023

Paper
Code

Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking Listeners

chathukiket/evaluation-of-noise-reduction-methods • 31 Mar 2023

Noise reduction is a crucial aspect of hearing aids, which researchers have been striving to address over the years.

31 Mar 2023

Paper
Code

Token Turing Machines

google-research/scenic • • CVPR 2023

The model's memory module ensures that a new observation will only be processed with the contents of the memory (and not the entire history), meaning that it can efficiently process long sequences with a bounded computational cost at each step.

2,990

16 Nov 2022

Paper
Code

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization

butspeechfit/eend • • 12 Nov 2022

End-to-end diarization presents an attractive alternative to standard cascaded diarization systems because a single system can handle all aspects of the task at once.

12 Nov 2022

Paper
Code

SG-VAD: Stochastic Gates Based Speech Activity Detection

jsvir/vad • • 28 Oct 2022

Our key idea is to model VAD as a denoising task, and construct a network that is designed to identify nuisance features for a speech classification task.

28 Oct 2022

Paper
Code

Activity Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result