Publication view

Recent Improvements of an Auditory Model Based Front-End for the Transcription of Vocal Queries (2004)

Abstract
In this paper, recent improvements to an existing acoustic front-end for the transcription of vocal (hummed, sung) musical queries are presented. Thanks to the addition of a second pitch extractor and the introduction of a novel multi-stage segmentation algorithm, the application domain of the front-end could be extended to whistled queries, and the performance on the other two query types could be improved as well. Experiments have shown that the new system can transcribe vocal queries with an accuracy ranging from 76 % (whistling) to 85 % (humming), and that it clearly outperforms other state-of-the-art systems on all three query types.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.6827
Source http://www.ipem.ugent.be/MAMI/Public/Papers/DeMulderICASSP2004.pdf
Publisher MIT Press
Contributor CiteSeerX
Archive CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Links 10.1.1.108.8515, 10.1.1.10.5174, 10.1.1.8.3936, 10.1.1.101.8244, 10.1.1.109.4489, 10.1.1.73.6190