Purdue University Graduate School
Browse

PHYSICS INFORMED CONSTRAINED LEARNING OF DYNAMICS FROM STATICDATA

Reason: This work is under review.

1

year(s)

6

month(s)

29

day(s)

until file(s) become available

DEVELOPMENT OF DATA-DRIVEN AND AI-POWERED SYSTEMS BIOLOGY METHODS FOR UNDERSTANDING HUMAN DISEASE

thesis
posted on 2024-09-03, 12:31 authored by Pengtao DangPengtao Dang

Systems biology dynamic models, which are based on differential equations, offer a flexible and accurate framework to explain physiological properties emerging from complex biochem- ical or biological systems. These models enable explicit quantification and interpretation, allowing for simulation and perturbation analysis to study biological features and their inter- actions, as well as understanding system progression and convergence under various initial conditions. However, their application in human disease systems is limited due to unknown kinetics parameters under disease conditions and a reductionist paradigm that fails to cap- ture the complexity of diseases. Meanwhile, the advent of omics technologies provides high- resolution molecular measurements from single cells and spatially resolved samples, as well as comprehensive disease-specific molecular signatures from large patient cohorts. This wealth of data holds the promise for characterizing complex biological systems, necessitating ad- vanced systems biology models and computational tools that can harness multi-omics data to reliably depict biological processes. However, this endeavor faces the challenge of nonlinear relationships between omics data and the system’s dynamic properties, such as the global or local low-rank gene expression patterns across cell types and the nonlinear complexities within transcriptional regulatory networks revealed by single-cell RNA sequencing.

The overall goal of this report is to develop new computational frameworks, AI-empowered methods, and related mathematical theories to explicitly represent and approximate the dy- namics of complex biological systems by using biological omics data. Our aim is to unravel the intricacies of context-specific dynamic systems using multi-Omics data. Specifically, we solved two different but related computational tasks and enabled the first-of-its-kind methods to (1) identify local low-rank matrices from large omics data, and (2) a robust optimization strategy to approximate metabolic flux. Subsequently, we delve into the realm of data-driven and AI-powered systems biology, harnessing the power of statistical learning and artificial intelligence to approximate differential equations or their representations. This research en- deavor not only contributes to the advancement of subspace modeling but also offers insights into a wide array of complex phenomena across diverse domains, with profound implications for computational biology and beyond.

History

Degree Type

  • Doctor of Philosophy

Department

  • Electrical and Computer Engineering

Campus location

  • Indianapolis

Advisor/Supervisor/Committee Chair

Paul Salama

Advisor/Supervisor/Committee co-chair

Chi Zhang

Additional Committee Member 2

Sha Cao

Additional Committee Member 3

Brian King

Additional Committee Member 4

Zina Ben-Miled

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC