A formal explanation space for the simultaneous clustering of neurologic diseases based on their signs and symptoms
Abstract
Objective: Clustering is widely used to identify meaningful subgroups in biomedical data, but interpretation remains challenging, especially in the absence of ground-truth labels. Moreover, clustering often produces multiple plausible solutions without a single correct answer. Using dementia phenotypes as a case study, we introduce a Formal Explanation Space (FES) to improve interpretability and facilitate comparison across competing cluster solutions. Methods: We used spectral coclustering and spectral biclustering to cluster a dataset of dementia patients based on clinical phenotypes (signs and symptoms). To enhance interpretability, we constructed an FES to explain algorithm behavior, assess cluster quality, identify influential features, and characterize cluster composition. Although simultaneous clustering is unsupervised, interpretation was aided by diagnostic labels, which we used for external validation of cluster composition. Results: Spectral coclustering and spectral biclustering each identified five biologically plausible dementia subgroups, though subgroup composition differed by method. The FES provided a structured framework for comparing these divergent outputs. Conclusions: Clustering complex biomedical data often produces multiple biologically plausible solutions. Retaining and comparing such solutions within a formal explanation space enhances interpretability and supports the discovery of complementary insights across methods.
Department(s)
Cooperative Engineering Program
Document Type
Article
DOI
10.1186/s12911-025-03297-w
Keywords
Biclusters, Biological plausibility, Cluster composition, Cluster interpretation, Explainable AI, Explanation space, Heat maps, Phenotype, Simultaneous clustering, Word clouds
Publication Date
12-1-2026
Recommended Citation
Obafemi-Ajayi, Tayo; Yelugam, Raghu; Hier, Daniel B.; Carrithers, Michael D.; and Wunsch II, Donald C., "A formal explanation space for the simultaneous clustering of neurologic diseases based on their signs and symptoms" (2026). Faculty Scholarship. 3.
https://bearworks.missouristate.edu/articles00/3
Journal Title
BMC Medical Informatics and Decision Making