Evaluation of Archetypal Analysis and Manifold Learning for Phenotyping of Acute Kidney Injury

Rodriquez, Dylan M

doi:10.25394/PGS.14479881.v1

Evaluation_of_Archetypal_Analysis_and_Manifold_Learning_for_phenotyping_of_AKI.pdf (1.61 MB)

Evaluation of Archetypal Analysis and Manifold Learning for Phenotyping of Acute Kidney Injury

thesis

posted on 2021-05-07, 11:28 authored by Dylan M RodriquezDylan M Rodriquez

Disease subtyping has been a critical aim of precision and personalized medicine. With the potential to improve patient outcomes, unsupervised and semi-supervised methods for determining phenotypes of subtypes have emerged with a recent focus on matrix and tensor factorization. However, interpretability of proposed models is debatable. Principal component analysis (PCA), a traditional method of dimensionality reduction, does not impose non-negativity constraints. Thus coefficients of the principal components are, in cases, difficult to translate to real physical units. Non-negative matrix factorization (NMF) constrains the factorization to positive numbers such that representative types resulting from the factorization are additive. Archetypal analysis (AA) extends this idea and seeks to identify pure types, archetypes, at the extremes of the data from which all other data can be expressed as a convex combination, or by proportion, of the archetypes. Using AA, this study sought to evaluate the sufficiency of AKI staging criteria through unsupervised subtyping. Archetype analysis failed to find a direct 1:1 mapping of archetypes to physician staging and also did not provide additional insight into patient outcomes. Several factors of the analysis such as quality of the data source and the difficulty in selecting features contributed to the outcome. Additionally, after performing feature selection with lasso across data subsets, it was determined that current staging criteria is sufficient to determine patient phenotype with serum creatinine at time of diagnosis to be a necessary factor.

History

Degree Type

Master of Science

Department

Computer Science

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Ananth Grama

Additional Committee Member 2

Mohammad Adibuzzaman

Additional Committee Member 3

Petros Drineas

Usage metrics

Keywords

Archetypal Analysis Manifold Learning Dimensionality reduction Uniform manifold approximation and projection Clinical Informatics Acute Kidney Injury Outcomes electronic health records (EHR)Cerner Health Facts database UMAP Disease subtype discovery Applied Computer Science

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Evaluation of Archetypal Analysis and Manifold Learning for Phenotyping of Acute Kidney Injury

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Usage metrics

Categories

Keywords

Licence

Exports