| Title: |
A Systematic Approach to Prioritise Diagnostically Useful Findings for Inclusion in Electronic Health Records as Discrete Data to Improve Clinical Artificial Intelligence Tools and Genomic Research |
| Authors: |
Guillod, P; Savvas, A; Robinson, PN; Nai, D; Naresh, KN; Ott, G; Schuh, A; Sewell, WA ; https://orcid.org/0000-0002-3586-8761; Anderson, M; Matentzoglu, N; Durgavarjhula, D; Xu, ML; Druzdzel, MJ; Astle, JM |
| Source: |
urn:ISSN:0936-6555; urn:ISSN:1433-2981; Clinical Oncology, 39, 103737 |
| Publisher Information: |
Elsevier 2025-03-01 |
| Document Type: |
Electronic Resource |
| Abstract: |
Aims: The recent widespread use of electronic health records (EHRs) has opened the possibility for innumerable artificial intelligence (AI) tools to aid in genomics, phenomics, and other research, as well as disease prevention, diagnosis, and therapy. Unfortunately, much of the data contained in EHRs are not optimally structured for even the most sophisticated AI approaches. There are very few published efforts investigating methods for recording discrete data in EHRs that would not slow current clinical workflows or ways to prioritise patient characteristics worth recording. Here, we propose an approach to identify and prioritise findings (phenotypes) useful for differentiating diseases, with an initial focus on relatively common small B-cell lymphomas. Materials and methods: A website enabling crowd-sourced recording of diseases and phenotypes was developed. An expert committee in the field of B-cell lymphomas standardised phenotype terminology for use in digital resources, and select terms were included in the Human Phenotype Ontology (HPO). A total of 100 patient lymph node biopsy samples were evaluated, and phenotypes were recorded as discrete data. Bayesian networks (BNs) were developed based on these data, and their diagnostic accuracy and ability to prioritise these phenotypes for inclusion in EHRs were assessed. Results: Out of 146 phenotypes identified from the website as potentially useful for differentiating four different lymphomas from each other and from benign lymph nodes, 70–75 were included in BNs. The diagnostic accuracy of different naïve BNs was 96.3% for non–marginal zone lymphoma cases and 50% for marginal zone lymphoma cases when all of the included phenotypes were used and 93.8% for non–marginal zone lymphoma cases and 27.5% for marginal zone lymphoma cases when only 15 phenotypes were included in the BNs. Conclusion: This pilot provides a starting point for systematic improvement and a dataset for comparing related approaches. |
| Index Terms: |
32 Biomedical and Clinical Sciences; 3211 Oncology and Carcinogenesis; Lymphoma; Genetics; Machine Learning and Artificial Intelligence; Rare Diseases; Hematology; Cancer; Networking and Information Technology R&D (NITRD); Lymphatic Research; Patient Safety; Bioengineering; Human Genome; Generic health relevance; 3 Good Health and Well Being; Humans; Artificial Intelligence; Electronic Health Records; Genomics; Lymphoma, B-Cell; Phenotype; Bayes Theorem; bayesian network; diagnostic decision support; genomic; phenomic; anzsrc-for: 32 Biomedical and Clinical Sciences; anzsrc-for: 3211 Oncology and Carcinogenesis; anzsrc-for: 1112 Oncology and Carcinogenesis; journal article; http://purl.org/coar/resource_type/c_6501 |
| URL: |
http://hdl.handle.net/1959.4/105322; https://unsworks.unsw.edu.au/bitstreams/b2bfcf56-2d08-4c4a-923e-c8a173f5bd27/download; https://doi.org/10.1016/j.clon.2024.103737 |
| Availability: |
Open access content. Open access content; open access; https://purl.org/coar/access_right/c_abf2; CC-BY-NC-ND; https://creativecommons.org/licenses/by-nc-nd/4.0; CC BY-NC-ND; https://creativecommons.org/licenses/by-nc-nd/4.0; free_to_read |
| Note: |
application/pdf |
| Other Numbers: |
LJ1 oai:unsworks.library.unsw.edu.au:1959.4/105322; 1534798232 |
| Contributing Source: |
UNIV OF NEW S WALES; From OAIster®, provided by the OCLC Cooperative. |
| Accession Number: |
edsoai.on1534798232 |
| Database: |
OAIster |