SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology

Kulyabin M, Sokolov G, Galaida A, Maier A, Arias Vergara T (2024)


Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2024

Conference Proceedings Title: Proceedings of the 27th International Conference on Pattern Recognition 2024

Event location: Kolkata, India IN

URI: https://arxiv.org/abs/2405.16115

Abstract

The extraction and analysis of insights from medical data, primarily stored in free-text formats by healthcare workers, presents significant challenges due to its unstructured nature. Medical coding, a crucial process in healthcare, remains minimally automated due to the complexity of medical ontologies and restricted access to medical texts for training Natural Language Processing models. In this paper, we proposed a method, "SNOBERT," of linking text spans in clinical notes to specific concepts in the SNOMED CT using BERT-based models. The method consists of two stages: candidate selection and candidate matching. The models were trained on one of the largest publicly available datasets of labelled clinical notes. SNOBERT outperforms other classical methods based on deep learning, as confirmed by the results of a challenge in which it was applied.

Authors with CRIS profile

How to cite

APA:

Kulyabin, M., Sokolov, G., Galaida, A., Maier, A., & Arias Vergara, T. (2024). SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology. In Proceedings of the 27th International Conference on Pattern Recognition 2024. Kolkata, India, IN.

MLA:

Kulyabin, Mikhail, et al. "SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology." Proceedings of the 27th International Conference on Pattern Recognition, Kolkata, India 2024.

BibTeX: Download