Purdue University Graduate School
Browse

Performance and Cost Optimization for Distributed Cloud-native Systems

Download (32.38 MB)
thesis
posted on 2022-07-28, 16:32 authored by Ashraf Y MahgoubAshraf Y Mahgoub

 First, NoSQL data-stores provide a set of features that is demanded by high perfor?mance computing (HPC) applications such as scalability, availability and schema flexibility. High performance computing (HPC) applications, such as metagenomics and other big data systems, need to store and analyze huge volumes of semi-structured data. Such applica?tions often rely on NoSQL-based datastores, and optimizing these databases is a challenging endeavor, with over 50 configuration parameters in Cassandra alone. As the application executes, database workloads can change rapidly over time (e.g. from read-heavy to write-heavy), and a system tuned for one phase of the workload becomes suboptimal when the workload changes. 

History

Degree Type

  • Doctor of Philosophy

Department

  • Computer Science

Campus location

  • West Lafayette

Advisor/Supervisor/Committee Chair

Saurabh Bagchi

Advisor/Supervisor/Committee co-chair

Ananth Grama

Additional Committee Member 2

Ninghui Li

Additional Committee Member 3

Changhee Jung

Additional Committee Member 4

Somali Chaterji

Additional Committee Member 5

Sonia Fahmy

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC