Joint Segmentation and Sub-pixel Localization in Structured Light Laryngoscopy

Henningson JO, Semmler M, Döllinger M, Stamminger M (2023)


Publication Type: Conference contribution

Publication year: 2023

Journal

Publisher: Springer Science and Business Media Deutschland GmbH

Book Volume: 14225 LNCS

Pages Range: 34-43

Conference Proceedings Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Event location: Vancouver, BC CA

ISBN: 9783031439865

DOI: 10.1007/978-3-031-43987-2_4

Abstract

In recent years, phoniatric diagnostics has seen a surge of interest in structured light-based high-speed video endoscopy, as it enables the observation of oscillating human vocal folds in vertical direction. However, structured light laryngoscopy suffers from practical problems: specular reflections interfere with the projected pattern, mucosal tissue dilates the pattern, and lastly the algorithms need to deal with huge amounts of data generated by a high-speed video camera. To address these issues, we propose a neural approach for the joint semantic segmentation and keypoint detection in structured light high-speed video endoscopy that improves the robustness, accuracy, and performance of current human vocal fold reconstruction pipelines. Major contributions are the reformulation of one channel of a semantic segmentation approach as a single-channel heatmap regression problem, and the prediction of sub-pixel accurate 2D point locations through weighted least squares in a fully-differentiable manner with negligible computational cost. Lastly, we expand the publicly available Human Laser Endoscopic dataset to also include segmentations of the human vocal folds itself. The source code and dataset are available at: github.com/Henningson/SSSLsquared

Authors with CRIS profile

How to cite

APA:

Henningson, J.-O., Semmler, M., Döllinger, M., & Stamminger, M. (2023). Joint Segmentation and Sub-pixel Localization in Structured Light Laryngoscopy. In Hayit Greenspan, Hayit Greenspan, Anant Madabhushi, Parvin Mousavi, Septimiu Salcudean, James Duncan, Tanveer Syeda-Mahmood, Russell Taylor (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 34-43). Vancouver, BC, CA: Springer Science and Business Media Deutschland GmbH.

MLA:

Henningson, Jann-Ole, et al. "Joint Segmentation and Sub-pixel Localization in Structured Light Laryngoscopy." Proceedings of the 26th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2023, Vancouver, BC Ed. Hayit Greenspan, Hayit Greenspan, Anant Madabhushi, Parvin Mousavi, Septimiu Salcudean, James Duncan, Tanveer Syeda-Mahmood, Russell Taylor, Springer Science and Business Media Deutschland GmbH, 2023. 34-43.

BibTeX: Download