no code implementations • EMNLP (NLP+CSS) 2020 • Sarang Gupta, Kumari Nishu
Mapping local news coverage from textual content is a challenging problem that requires extracting precise location mentions from news articles.
no code implementations • 31 Aug 2023 • Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik
Using a vision-inspired keyword spotting framework, we propose an architecture with input-dependent dynamic depth capable of processing streaming audio.
no code implementations • 12 Aug 2023 • Kumari Nishu, Minsik Cho, Paul Dixon, Devang Naik
Spotting user-defined/flexible keywords represented in text frequently uses an expensive text encoder for joint analysis with an audio encoder in an embedding space, which can suffer from heterogeneous modality representation (i. e., large mismatch) and increased complexity.
no code implementations • 8 Jun 2023 • Kumari Nishu, Minsik Cho, Devang Naik
Using audio and text embeddings jointly for Keyword Spotting (KWS) has shown high-quality results, but the key challenge of how to semantically align two embeddings for multi-word keywords of different sequence lengths remains largely unsolved.