Marginal Space Deep Learning: Efficient Architecture for Detection in Volumetric Image Data

Ghesu FC, Georgescu B, Zheng Y, Hornegger J, Comaniciu D (2015)

Publication Status: Published

Publication Type: Conference contribution

Publication year: 2015

Journal

Lecture Notes in Computer Science Springer Verlag

Publisher: Springer-verlag

Book Volume: 9349

Pages Range: 710-718

Conference Proceedings Title: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part I

DOI: 10.1007/978-3-319-24553-9_87

Abstract

Current state-of-the-art techniques for fast and robust parsing of volumetric medical image data exploit large annotated image databases and are typically based on machine learning methods. Two main challenges to be solved are the low efficiency in scanning large volumetric input images and the need for manual engineering of image features. This work proposes Marginal Space Deep Learning (MSDL) as an effective solution, that combines the strengths of efficient object parametrization in hierarchical marginal spaces with the automated feature design of Deep Learning (DL) network architectures. Representation learning through DL automatically identifies, disentangles and learns explanatory factors directly from low-level image data. However, the direct application of DL to volumetric data results in a very high complexity, due to the increased number of transformation parameters. For example, the number of parameters defining a similarity transformation increases to 9 in 3D (3 for location, 3 for orientation and 3 for scale). The mechanism of marginal space learning provides excellent run-time performance by learning classifiers in high probability regions in spaces of gradually increasing dimensionality, for example starting from location only (3D) to location and orientation (6D) and full parameter space (9D). In addition, for parametrized feature computation, we propose to simplify the network by replacing the standard, pre-determined feature sampling pattern with a sparse, adaptive, self-learned pattern. The MSDL framework is evaluated on detecting the aortic heart valve in 3D ultrasound data. The dataset contains 3795 volumes from 150 patients. Our method outperforms the state-of-the-art with an improvement of 36%, running in less than one second. To our knowledge this is the first successful demonstration of the DL potential to detection in full 3D data with parametrized representations.

Authors with CRIS profile

Joachim Hornegger Lehrstuhl für Informatik 14 (Bild- und Sprachverarbeitung)

Involved external institutions

Siemens AG, Sector Corporate Technology

Germany (DE)

How to cite

APA:

Ghesu, F.C., Georgescu, B., Zheng, Y., Hornegger, J., & Comaniciu, D. (2015). Marginal Space Deep Learning: Efficient Architecture for Detection in Volumetric Image Data. In 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part I (pp. 710-718). Springer-verlag.

MLA:

Ghesu, Florin C., et al. "Marginal Space Deep Learning: Efficient Architecture for Detection in Volumetric Image Data." Proceedings of the 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part I Springer-verlag, 2015. 710-718.

BibTeX: Download