Quantifying the Trade-Offs Between Dimensions of Trustworthy AI - An Empirical Study on Fairness, Explainability, Privacy, and Robustness

Kemmerzell N, Schreiner A (2024)


Publication Type: Conference contribution

Publication year: 2024

Journal

Publisher: Springer

Series: Lecture Notes in Computer Science

City/Town: Cham

Pages Range: 128-146

Conference Proceedings Title: KI 2024: Advances in Artificial Intelligence

Event location: Würzburg DE

ISBN: 9783031708923

DOI: 10.1007/978-3-031-70893-0_10

Abstract

Trustworthy AI encompasses various requirements for AI systems, including explainability, fairness, privacy, and robustness. Addressing these dimensions concurrently is challenging due to inherent tensions and trade-offs between them. Current research highlights these trade-offs, focusing on specific interactions, but comprehensive and systematic evaluations remain insufficient. This study aims to enhance the understanding of trade-offs among explainability, fairness, privacy, and robustness in AI. By conducting extensive experiments in the domain of image classification, it quantitatively assesses how methods to improve one requirement impact others. More specifically, it explores different training adaptations to enhance each requirement and measures their effects on the others on four datasets for gender classification. The experiments revealed that the Local Gradient Alignment method improved explainability and robustness but introduced trade-offs in fairness, privacy, and accuracy. Fairness-focused training adaptations only enhanced fairness for the most biased models. For all other cases, fairness, explainability and robustness are reduced. Differential Privacy improved privacy but compromised explainability, fairness, and accuracy, with varied impacts on robustness. Data augmentation techniques enhanced robustness, explainability and accuracy with minor trade-offs in privacy and fairness.

Authors with CRIS profile

How to cite

APA:

Kemmerzell, N., & Schreiner, A. (2024). Quantifying the Trade-Offs Between Dimensions of Trustworthy AI - An Empirical Study on Fairness, Explainability, Privacy, and Robustness. In KI 2024: Advances in Artificial Intelligence (pp. 128-146). Würzburg, DE: Cham: Springer.

MLA:

Kemmerzell, Nils, and Annika Schreiner. "Quantifying the Trade-Offs Between Dimensions of Trustworthy AI - An Empirical Study on Fairness, Explainability, Privacy, and Robustness." Proceedings of the KI 2024: Advances in Artificial Intelligence, Würzburg Cham: Springer, 2024. 128-146.

BibTeX: Download