Texts

FSC-P2 (Fearless Steps Challenge Phase2)

Introduced by Aditya et al. in FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data

The Fearless Steps Initiative by UTDallas-CRSS led to the digitization, recovery, and diarization of 19,000 hours of original analog audio data, as well as the development of algorithms to extract meaningful information from this multichannel naturalistic data resource. As an initial step to motivate a stream-lined and collaborative effort from the speech and language community, UTDallas-CRSS is hosting a series of progressively complex tasks to promote advanced research on naturalistic “Big Data” corpora. This began with ISCA INTERSPEECH-2019: "The FEARLESS STEPS Challenge: Massive Naturalistic Audio (FS-#1)". This first edition of this challenge encouraged the development of core unsupervised/semi-supervised speech and language systems for single-channel data with low resource availability, serving as the “First Step” towards extracting high-level information from such massive unlabeled corpora. As a natural progression following the successful Inaugural Challenge FS#1, the FEARLESS STEPS Challenge Phase-#2 focuses on the development of single-channel supervised learning strategies. This FS#2 provides 80 hours of ground-truth data through Training and Development sets, with an additional 20 hours of blind-set Evaluation data. Based on feedback from the Fearless Steps participants, additional Tracks for streamlined speech recognition and speaker diarization have been included in the FS#2. The results for this Challenge will be presented at the ISCA INTERSPEECH-2020 Special Session. We encourage participants to explore any and all research tasks of interest with the Fearless Steps Corpus – with suggested Task Domains listed below. Research participants can, however, also utilize the FS#2 corpus to explore additional problems dealing with naturalistic data, which we welcome as part of the special session.

--- This (FS-02) edition of the FEARLESS STEPS Challenge includes the following 6 tasks ---

TASK 1: Speech Activity Detection (SAD)

TASK 2: Speaker Identification (using Speaker Segments) (SID)

TASK 3: Speaker Diarization

├── (3.a.) Track 1: Diarization using system SAD (SD_track1)

└── (3.b.) Track 2: Diarization using reference SAD (SD_track2)

TASK 4: Automatic Speech Recognition

├── (4.a.) Track 1: ASR using system Diarization/SAD (ASR_track1)

└── (4.b.) Track 2: ASR using Diarized Segments (ASR_track2)

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Usage

FSC-P2 (Fearless Steps Challenge Phase2)

TASK 1: Speech Activity Detection (SAD)

TASK 2: Speaker Identification (using Speaker Segments) (SID)

TASK 3: Speaker Diarization

├── (3.a.) Track 1: Diarization using system SAD (SD_track1)

└── (3.b.) Track 2: Diarization using reference SAD (SD_track2)

TASK 4: Automatic Speech Recognition

├── (4.a.) Track 1: ASR using system Diarization/SAD (ASR_track1)

└── (4.b.) Track 2: ASR using Diarized Segments (ASR_track2)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages