1 code implementation • 29 Jul 2021 • Eric Slyman, Chris Daw, Morgan Skrabut, Ana Usenko, Brian Hutchinson
We obtain strong results on the new fine-grained task and state-of-the-art on the 4-way task: our best model obtains frame-level error rates of 6. 2%, 7. 7% and 28. 0% when generalizing to unseen instructors for the 4-way, 5-way, and 9-way classification tasks, respectively (relative reductions of 35. 4%, 48. 3% and 21. 6% over a strong baseline).
no code implementations • 28 Jul 2021 • Piper Wolters, Logan Sizemore, Chris Daw, Brian Hutchinson, Lauren Phillips
Many applications involve detecting and localizing specific sound events within long, untrimmed documents, including keyword spotting, medical observation, and bioacoustic monitoring for conservation.