Xiaoquan Li
A music cognition-guided framework for multi-pitch estimation.
Li, Xiaoquan; Yan, Yijun; Soraghan, John; Wang, Zheng; Ren, Jinchang
Authors
Dr Yijun Yan y.yan2@rgu.ac.uk
Research Fellow
John Soraghan
Zheng Wang
Professor Jinchang Ren j.ren@rgu.ac.uk
Professor of Computing Science
Abstract
As one of the most important subtasks of automatic music transcription (AMT), multi-pitch estimation (MPE) has been studied extensively for predicting the fundamental frequencies in the frames of audio recordings during the past decade. However, how to use music perception and cognition for MPE has not yet been thoroughly investigated. Motivated by this, this demonstrates how to effectively detect the fundamental frequency and the harmonic structure of polyphonic music using a cognitive framework. Inspired by cognitive neuroscience, an integration of the constant Q transform and a state-of-the-art matrix factorization method called shift-invariant probabilistic latent component analysis (SI-PLCA) are proposed to resolve the polyphonic short-time magnitude log-spectra for multiple pitch estimation and source-specific feature extraction. The cognitions of rhythm, harmonic periodicity and instrument timbre are used to guide the analysis of characterizing contiguous notes and the relationship between fundamental frequency and harmonic frequencies for detecting the pitches from the outcomes of SI-PLCA. In the experiment, we compare the performance of proposed MPE system to a number of existing state-of-the-art approaches (seven weak learning methods and four deep learning methods) on three widely used datasets (i.e. MAPS, BACH10 and TRIOS) in terms of F-measure (F1) values. The experimental results show that the proposed MPE method provides the best overall performance against other existing methods.
Citation
LI, X., YAN, Y., SORAGHAN, J., WANG, Z. and REN, J. 2023. A music cognition-guided framework for multi-pitch estimation. Cognitive computation [online], 15(1), pages 23-35. Available from: https://doi.org/10.1007/s12559-022-10031-5
Journal Article Type | Article |
---|---|
Acceptance Date | May 26, 2022 |
Online Publication Date | Jun 14, 2022 |
Publication Date | Jan 31, 2023 |
Deposit Date | Jun 16, 2022 |
Publicly Available Date | Jun 16, 2022 |
Journal | Cognitive computation |
Print ISSN | 1866-9956 |
Electronic ISSN | 1866-9964 |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Volume | 15 |
Issue | 1 |
Pages | 23-35 |
DOI | https://doi.org/10.1007/s12559-022-10031-5 |
Keywords | Pitch estimation; Cognition; Artificial intelligence; Machine learning; Music cognition; Automatic music transcription; Multi-pitch estimation; Harmonic structure detection (HSD); Polyphonic music detection |
Public URL | https://rgu-repository.worktribe.com/output/1687960 |
Files
LI 2023 A music cognition-guided
(3.1 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
Copyright Statement
© The Author(s) 2022.
Version
Final VOR updated 23.03.2023
You might also like
Hyperspectral imaging based corrosion detection in nuclear packages.
(2023)
Journal Article
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search