Tamer NC, Özer Y, Müller M, Serra X (2023)
Publication Type: Conference contribution
Publication year: 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
Conference Proceedings Title: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
DOI: 10.1109/ICASSP49357.2023.10096762
Pitch estimation of a target musical source within a multi-source polyphonic signal is of great interest for music performance analysis. One possible approach for extracting the pitch of a target source is to first perform source separation and then estimate the pitch of the separated track. However, as we will show, this typically leads to poor results. As an alternative to this approach, we introduce a timbre-aware pitch estimator (TAPE), which estimates the pitch of a target source in an end-to-end manner without the need for an explicit source separation step. Opposed to existing approaches that assume the predominance of a lead voice, our approach builds upon other cues that only rely on the timbral characteristics. Our results on real violin-piano duets show that, without any pre-processing step, TAPE trained on synthetic mixes outperforms the sequential procedure of source separation and pitch estimation under many settings, even if the target source is not predominant.
APA:
Tamer, N.C., Özer, Y., Müller, M., & Serra, X. (2023). TAPE: An End-to-End Timbre-Aware Pitch Estimator. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island, GR: Institute of Electrical and Electronics Engineers Inc..
MLA:
Tamer, Nazif Can, et al. "TAPE: An End-to-End Timbre-Aware Pitch Estimator." Proceedings of the 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023, Rhodes Island Institute of Electrical and Electronics Engineers Inc., 2023.
BibTeX: Download