Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/214819
Title: The DERMACLEAR study: Verification results of a natural language processing system in dermatology
Author: Ortiz de Frutos, Francisco Javier
Giménez Arnau, Ana M.
Puig, Lluís
Silvestre, Juan F.
Serra, Esther
Salgado Boquete, Laura
García-Patos Briones, Vicente
Estebaranz, Jose L. L.
Notario, Jaime
Martín Santiago, Ana
Pontevia, Gabriel M.
Martín, Víctor
Guinea, Guillermo
Terradas, Pau
Daudén, Esteban
Keywords: Aprenentatge automàtic
Dermatologia
Machine learning
Dermatology
Issue Date: 11-Aug-2023
Publisher: Wiley
Abstract: Background: Accurately determining the epidemiology of dermatological diseases such as hidradenitis suppurativa (HS), psoriasis (PsO), chronic urticaria (CU) and/or atopic dermatitis (AD) is challenging due to variations in prevalence and disease severity in the reported literature. Objectives: The DERMACLEAR study aims to use natural language processing (NLP) to assess the proportions of patients with HS, PsO, CU and/or AD, and obtain information on patient profiles, patient journeys, and disease and healthcare burden in Spain. Here, the study design and objectives of the DERMACLEAR study are described and the precision of the NLP system used is assessed. Methods: This study will retrospectively collect patient information from electronic health records (EHRs) at dermatology departments from seven tertiary hospitals in Spain. The NLP system was developed by IOMED Medical Solutions and was verified internally (IOMED scientific team) and externally (principal investigators of each hospital) to determine its precision in identifying patients with HS, PsO, CU and/or AD. Furthermore, internal verification was performed on other medical variables relevant to the study. Results: To date, the DERMACLEAR study has retrospectively collected data from 54,458 patients with HS, PsO, CU and/or AD (HS: 5045; PsO: 32,559; CU: 8397; AD: 12,492). The average precision of the NLP system to identify patients diagnosed with HS, PsO, CU, and/or AD across all hospitals exceeded 95% via external and internal verification. Conclusions: Results from the DERMACLEAR study will increase the real-world evidence of clinical practice, obtaining a large amount of information on patients with the studied diseases. The NLP system used is precise in identifying patients diagnosed with HS, PsO, CU and/or AD, and other medical variables from EHRs, highlighting that it is a valid system to use in the DERMACLEAR study.
Note: Reproducció del document publicat a: https://doi.org/10.1002/jvc2.217
It is part of: JEADV Clinical Practice, 2023, vol. 2, num. 4, p. 775-785
URI: https://hdl.handle.net/2445/214819
Related resource: https://doi.org/10.1002/jvc2.217
ISSN: 2768-6566
Appears in Collections:Articles publicats en revistes (Institut d'lnvestigació Biomèdica de Bellvitge (IDIBELL))



This item is licensed under a Creative Commons License Creative Commons