Ultrasound Video Transformers for Cardiac Ejection Fraction Estimation

Reynaud H, Vlontzos A, Hou B, Beqiri A, Leeson P, Kainz B (2021)


Publication Language: English

Publication Status: Published

Publication Type: Authored book, Volume of book series

Publication year: 2021

Publisher: Springer Science and Business Media Deutschland GmbH

Pages Range: 495-505

ISBN: 9783030872304

DOI: 10.1007/978-3-030-87231-1_48

Open Access Link: https://arxiv.org/abs/2107.00977

Abstract

Cardiac ultrasound imaging is used to diagnose various heart diseases. Common analysis pipelines involve manual processing of the video frames by expert clinicians. This suffers from intra- and inter-observer variability. We propose a novel approach to ultrasound video analysis using a transformer architecture based on a Residual Auto-Encoder Network and a BERT model adapted for token classification. This enables videos of any length to be processed. We apply our model to the task of End-Systolic (ES) and End-Diastolic (ED) frame detection and the automated computation of the left ventricular ejection fraction. We achieve an average frame distance of 3.36 frames for the ES and 7.17 frames for the ED on videos of arbitrary length. Our end-to-end learnable approach can estimate the ejection fraction with a MAE of 5.95 and R2 of 0.52 in 0.15 s per video, showing that segmentation is not the only way to predict ejection fraction.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Reynaud, H., Vlontzos, A., Hou, B., Beqiri, A., Leeson, P., & Kainz, B. (2021). Ultrasound Video Transformers for Cardiac Ejection Fraction Estimation. Springer Science and Business Media Deutschland GmbH.

MLA:

Reynaud, Hadrien, et al. Ultrasound Video Transformers for Cardiac Ejection Fraction Estimation. Springer Science and Business Media Deutschland GmbH, 2021.

BibTeX: Download