Ensemble validation paradigm for intelligent data analysis in autism spectrum disorders
Abstract
Cluster analysis is an important exploratory tool for a broad range of applications, including the analysis of biomedical datasets to uncover meaningful subgroups, such as in autism spectrum disorder (ASD). For a given clustering algorithm, multiple results can be obtained on the same dataset by varying the algorithm parameters. In biomedical applications, the goal is to discover meaningful subgroups, not just the optimal number of clusters. It is therefore imperative to develop quality measures capable of identifying optimal partitions for a given dataset. In this paper, we apply varied clustering methods to subgroup an ASD simplex sample based on relevant phenotype features that may uncover meaningful subtypes. We present a detailed cluster validation analysis using an ensemble validation paradigm and visualization techniques, followed by a rigorous clinical/behavioral analysis of the top-ranked results. The evaluation demonstrated that both configurations yielded similar clinically significant results: a 2-subgroup configuration with distinct clinical profiles.
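To illustrate the general idea of scoring several candidate partitions with more than one validity index and combining the rankings, the following is a minimal sketch only. It is not the authors' pipeline: the choice of k-means, the two indices (silhouette and Davies-Bouldin), the rank-sum aggregation, the synthetic two-blob data, and all function names are assumptions made for illustration.

```python
import math
import random

def mean(points):
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))

def kmeans(points, k, iters=50):
    # Deterministic farthest-point initialisation (an illustrative choice).
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # keep the old center if a cluster empties out
                centers[j] = mean(members)
    return labels

def groups(points, labels):
    g = {}
    for p, l in zip(points, labels):
        g.setdefault(l, []).append(p)
    return list(g.values())

def silhouette(points, labels):
    # Mean silhouette width; higher is better. Singletons contribute 0.
    clusters = groups(points, labels)
    total = 0.0
    for ci, cluster in enumerate(clusters):
        if len(cluster) == 1:
            continue
        for p in cluster:
            a = sum(math.dist(p, q) for q in cluster) / (len(cluster) - 1)
            b = min(sum(math.dist(p, q) for q in other) / len(other)
                    for cj, other in enumerate(clusters) if cj != ci)
            total += (b - a) / max(a, b)
    return total / len(points)

def davies_bouldin(points, labels):
    # Davies-Bouldin index; lower is better.
    clusters = groups(points, labels)
    centers = [mean(c) for c in clusters]
    scatter = [sum(math.dist(p, m) for p in c) / len(c) for c, m in zip(clusters, centers)]
    worst = [max((scatter[i] + scatter[j]) / math.dist(centers[i], centers[j])
                 for j in range(len(clusters)) if j != i)
             for i in range(len(clusters))]
    return sum(worst) / len(clusters)

def rank(scores, higher_is_better):
    # Rank 1 is the best configuration under this index.
    order = sorted(scores, key=scores.get, reverse=higher_is_better)
    return {k: r for r, k in enumerate(order, start=1)}

# Synthetic data (hypothetical): two well-separated 2-D blobs of 30 points each.
rng = random.Random(0)
points = [(rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
          for cx, cy in [(0, 0), (10, 10)] for _ in range(30)]

# Vary the algorithm parameter (here, k) to obtain several candidate partitions.
results = {k: kmeans(points, k) for k in range(2, 6)}
sil = {k: silhouette(points, labels) for k, labels in results.items()}
dbi = {k: davies_bouldin(points, labels) for k, labels in results.items()}

# Ensemble step: sum the per-index ranks and pick the smallest total.
sil_rank, dbi_rank = rank(sil, True), rank(dbi, False)
combined = {k: sil_rank[k] + dbi_rank[k] for k in results}
best_k = min(combined, key=combined.get)
print("rank sums:", combined, "selected k:", best_k)
```

Rank-sum aggregation is used here because it avoids putting indices with different scales and directions on a common axis; other aggregation schemes are equally plausible. On real phenotype data, the top-ranked partitions would still need the clinical review the abstract describes.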
Department(s)
Engineering Program
Document Type
Conference Proceeding
DOI
https://doi.org/10.1109/CIBCB.2018.8404960
Publication Date
7-5-2018
Recommended Citation
Nguyen, Thy, Kerri Nowell, Kimberly E. Bodner, and Tayo Obafemi-Ajayi. "Ensemble validation paradigm for intelligent data analysis in autism spectrum disorders." In 2018 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), pp. 1-8. IEEE, 2018.
Journal Title
2018 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2018