FSC-P2 (Fearless Steps Challenge Phase2)

Introduced by Aditya et al. in FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data

The Fearless Steps Initiative by UTDallas-CRSS led to the digitization, recovery, and diarization of 19,000 hours of original analog audio data, as well as the development of algorithms to extract meaningful information from this multichannel naturalistic data resource. As an initial step to motivate a stream-lined and collaborative effort from the speech and language community, UTDallas-CRSS is hosting a series of progressively complex tasks to promote advanced research on naturalistic “Big Data” corpora. This began with ISCA INTERSPEECH-2019: "The FEARLESS STEPS Challenge: Massive Naturalistic Audio (FS-#1)". This first edition of this challenge encouraged the development of core unsupervised/semi-supervised speech and language systems for single-channel data with low resource availability, serving as the “First Step” towards extracting high-level information from such massive unlabeled corpora. As a natural progression following the successful Inaugural Challenge FS#1, the FEARLESS STEPS Challenge Phase-#2 focuses on the development of single-channel supervised learning strategies. This FS#2 provides 80 hours of ground-truth data through Training and Development sets, with an additional 20 hours of blind-set Evaluation data. Based on feedback from the Fearless Steps participants, additional Tracks for streamlined speech recognition and speaker diarization have been included in the FS#2. The results for this Challenge will be presented at the ISCA INTERSPEECH-2020 Special Session. We encourage participants to explore any and all research tasks of interest with the Fearless Steps Corpus – with suggested Task Domains listed below. Research participants can, however, also utilize the FS#2 corpus to explore additional problems dealing with naturalistic data, which we welcome as part of the special session.

--- This (FS-02) edition of the FEARLESS STEPS Challenge includes the following 6 tasks ---

TASK 1: Speech Activity Detection (SAD)

TASK 2: Speaker Identification (using Speaker Segments) (SID)

TASK 3: Speaker Diarization

├── (3.a.) Track 1: Diarization using system SAD (SD_track1)

└── (3.b.) Track 2: Diarization using reference SAD (SD_track2)

TASK 4: Automatic Speech Recognition

├── (4.a.) Track 1: ASR using system Diarization/SAD (ASR_track1)

└── (4.b.) Track 2: ASR using Diarized Segments (ASR_track2)

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Modalities


Languages