This paper introduces Timers and Such, a new open-source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the …
UncommonVoice is a freely available dataset of crowd-sourced voice disorder speech from 57 speakers. Spasmodic Dysphonia (SD) is the primary voice disorder represented in this dataset; however, the collection was not limited to SD voices.
With the aim of supporting playlist discovery and recommendation, we use sequence-to-sequence modeling to learn fixed-length representations of playlists in an unsupervised manner.