Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks

Moussa D, Hirsch G, Rieß C (2022)


Publication Language: English

Publication Type: Conference contribution, Conference Contribution

Publication year: 2022

Publisher: Springer

Pages Range: 264-280

Conference Proceedings Title: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges.

Event location: Montréal,Québec CA

ISBN: 978-3-031-37741-9

URI: https://faui1-files.cs.fau.de/public/publications/mmsec/2022-Moussa-MMFORWILD.pdf

DOI: 10.1007/978-3-031-37742-6

Open Access Link: https://faui1-files.cs.fau.de/public/publications/mmsec/2022-Moussa-MMFORWILD.pdf

Abstract

Freely available and easy-to-use audio editing tools make it straightforward to perform audio splicing. Convincing forgeries can be created by combining various speech samples from the same person. Detection of such splices is important both in the public sector when considering misinformation, and in a legal context to verify the integrity of evidence. Unfortunately, most existing detection algorithms for audio splicing use handcrafted features and make specific assumptions. However, criminal investigators are often faced with audio samples from unconstrained sources with unknown characteristics, which raises the need for more generally applicable methods.

 With this work, we aim to take a first step towards unconstrained audio splicing detection to address this need. We simulate various attack scenarios in the form of post-processing operations that may disguise splicing. We propose a Transformer sequence-to-sequence (seq2seq) network for splicing detection and localization. Our extensive evaluation shows that the proposed method outperforms existing dedicated approaches for splicing detection [3, 10] as well as the general-purpose networks EfficientNet [28] and RegNet [25].

Authors with CRIS profile

How to cite

APA:

Moussa, D., Hirsch, G., & Rieß, C. (2022). Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks. In Springer Cham (Eds.), Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. (pp. 264-280). Montréal,Québec, CA: Springer.

MLA:

Moussa, Denise, Germans Hirsch, and Christian Rieß. "Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks." Proceedings of the The 2nd Workshop on MultiMedia FORensics in the WILD, Montréal,Québec Ed. Springer Cham, Springer, 2022. 264-280.

BibTeX: Download