Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the …

UncommonVoice: A Crowdsourced Dataset of Dysphonic Speech

UncommonVoice is a freely-available dataset of crowd-sourced voice disorder speech from 57 speakers. Spasmodic Dysphonia (SD) is the primary voice disorder represented in this dataset, however, the collection was not limited to only SD voices.

Representation, Exploration, and Recommendation of Music Playlists

With an aim towards playlist discovery and recommendation, we leverage sequence-to-sequence modeling to learn a fixed-length representation of playlists in an unsupervised manner.