PED-DATA: A Privacy-Preserving Framework for Data-Driven, Pediatric Multi-Center Studies

Yilmaz G, Mang JM, Metzler M, Prokosch HU, Rauh M, Zierk J (2025)


Publication Type: Book chapter / Article in edited volumes

Publication year: 2025

Publisher: IOS Press

Edited Volumes: German Medical Data Sciences 2025: GMDS Illuminates Health

Series: Studies in Health Technology and Informatics

Book Volume: 331

Pages Range: 307-317

DOI: 10.3233/SHTI251409

Abstract

INTRODUCTION: Data-driven analysis of clinical databases is an efficient method for clinical knowledge generation, which is especially suitable when exceptional ethical and practical restrictions apply, such as in pediatrics. In the multi-center PEDREF 2.0 study, we are analyzing children's laboratory test results, diagnoses, and procedures from more than 20 German tertiary care centers to establish pediatric reference intervals. The PEDREF 2.0 study uses the framework of the German Medical Informatics Initiative, but the specific study needs require the development of a customized module for distributed pediatric analyses. METHODS: We developed the Pediatric Distributed Analysis, Anonymization, and Aggregation Module (PED-DATA), which is a containerized application that we deployed to all participating centers. PED-DATA transforms the input datasets to a harmonized internal representation and enables their decentralized analysis in compliance with data protection rules, resulting in an anonymous output dataset that is transferred for central analysis. RESULTS: In a preliminary analysis of data from 15 centers, we analyzed 52,807,236 laboratory test results from 753,774 different patients (323,943 to 4,338,317 test results per laboratory test), enabling us to establish pediatric reference intervals with previously unmatched precision. CONCLUSION: PED-DATA facilitates the implementation of pediatric data-driven multicenter studies in a decentralized and privacy-respecting manner, and its use throughout German University Hospitals in the PEDREF 2.0 study demonstrates its usefulness in a real-world use case.

Authors with CRIS profile

How to cite

APA:

Yilmaz, G., Mang, J.M., Metzler, M., Prokosch, H.-U., Rauh, M., & Zierk, J. (2025). PED-DATA: A Privacy-Preserving Framework for Data-Driven, Pediatric Multi-Center Studies. In Rainer Röhrig, Thomas Ganslandt, Klaus Jung, Ann-Kristin Kock-Schoppenhauer, Jochem König, Ulrich Sax, Martin Sedlmayr, Cord Spreckelsen, Antonia Zapf (Eds.), German Medical Data Sciences 2025: GMDS Illuminates Health. (pp. 307-317). IOS Press.

MLA:

Yilmaz, Görkem, et al. "PED-DATA: A Privacy-Preserving Framework for Data-Driven, Pediatric Multi-Center Studies." German Medical Data Sciences 2025: GMDS Illuminates Health. Ed. Rainer Röhrig, Thomas Ganslandt, Klaus Jung, Ann-Kristin Kock-Schoppenhauer, Jochem König, Ulrich Sax, Martin Sedlmayr, Cord Spreckelsen, Antonia Zapf, IOS Press, 2025. 307-317.

BibTeX: Download