High-Throughput Phenotyping of Clinical Text Using Large Language Models

Abstract

High-throughput phenotyping automates the mapping of patient signs to standardized concepts, such as those in Human Phenotype Ontology (HPO), a process critical to precision medicine. We evaluated the automated phenotyping of clinical summaries from the Online Mendelian Inheritance in Man (OMIM) database using a large language model. Various APIs were used to automate text retrieval, sign identification, categorization, and normalization. GPT-4 outperformed GPT-3.5Turbo in identifying, categorizing, and normalizing signs, achieving concordance with manual annotators comparable to concordance between manual annotators. While GPT-4 demonstrates high accuracy in sign identification and categorization, limitations remain in sign normalization, particularly in retrieving the correct HPO ID for a normalized term. Methods such as retrieval-augmented generation, changes in pre-training, and additional fine-tuning may help address these limitations. The combination of APIs with large language models presents a promising approach for high-throughput phenotyping of free text.

Department(s)

Cooperative Engineering Program

Document Type

Conference Proceeding

DOI

10.1109/BHI62660.2024.10913712

Keywords

GPT-4, high-throughput, HPO, large language model, natural language processing, neurology, OMIM, phenotype

Publication Date

1-1-2024

Journal Title

Bhi 2024 IEEE EMBS International Conference on Biomedical and Health Informatics Proceedings

Share

COinS