UTILIZING MICROSERVICE REQUEST TRACES TO ENHANCE WORKLOAD PREDICTION
Container orchestration systems such as Kubernetes often rely on manual resource allocation, which can be inefficient and inflexible because it frequently leads to over-provisioning or under-provisioning. The Kubernetes horizontal pod autoscaler (HPA), vertical pod autoscaler (VPA), and Google Kubernetes Engine (GKE) Autopilot are primarily threshold-based, making them reactive rather than proactive: they adjust resources only after utilization thresholds are exceeded, causing temporary degradation in quality of service~(QoS). While some solutions use calls-per-minute (CPM) counts of requests to microservices to estimate resource consumption dynamically, they do not fully exploit distributed traces or the interdependencies among microservices. This thesis hypothesizes that deeper insight into future workload patterns can be gained by exploiting microservice interactions, specifically the CPM counts for each pair of communicating microservices. It proposes a comprehensive machine learning workflow to assess whether accounting for these interdependencies improves workload prediction. The findings indicate that a long short-term memory (LSTM) model attains average mean absolute error (MAE) and root mean square error (RMSE) values of 7.02 and 10.54, respectively, while the highest \(R^2\) score observed was 0.07. This suggests that although distributed traces and inter-microservice CPM counts provide valuable signals, the models do not capture the full complexity of workload dynamics. These results highlight the potential for enhancing workload prediction accuracy and underscore the need to refine these methods further to achieve more proactive and efficient resource allocation in container orchestration systems.
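To make the described workflow concrete, the following is a minimal sketch of an LSTM that forecasts next-interval CPM counts for each pair of communicating microservices from a sliding window of past per-pair counts. The window size, number of pairs, hyperparameters, and synthetic data below are illustrative assumptions and are not taken from the thesis itself.

```python
# Hypothetical sketch: per-pair CPM forecasting with an LSTM (PyTorch).
# Feature layout and hyperparameters are assumptions for illustration only.
import torch
import torch.nn as nn

WINDOW = 12   # assumed look-back window, e.g., 12 one-minute intervals
N_PAIRS = 8   # assumed number of caller->callee microservice pairs

class CPMForecaster(nn.Module):
    def __init__(self, n_pairs: int, hidden: int = 64):
        super().__init__()
        # One input feature per communicating pair; the LSTM consumes the
        # full per-pair CPM vector at every time step.
        self.lstm = nn.LSTM(input_size=n_pairs, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_pairs)  # next-interval CPM per pair

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, WINDOW, n_pairs) -> prediction: (batch, n_pairs)
        out, _ = self.lstm(x)
        return self.head(out[:, -1, :])

# Toy training loop on synthetic data, showing the end-to-end shape of the
# workflow: windowed per-pair CPM history in, next-interval CPM out.
model = CPMForecaster(N_PAIRS)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.L1Loss()  # MAE, matching one of the reported evaluation metrics

x = torch.rand(32, WINDOW, N_PAIRS) * 100  # synthetic per-pair CPM windows
y = torch.rand(32, N_PAIRS) * 100          # synthetic next-interval targets

for _ in range(5):
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
print(f"MAE on synthetic batch: {loss.item():.2f}")
```

In a real deployment, the input windows would be built by aggregating distributed-trace spans into per-pair CPM time series, and evaluation would report MAE, RMSE, and \(R^2\) on held-out intervals, as the abstract describes.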
Degree Type
- Master of Science
Department
- Computer and Information Technology
Campus location
- West Lafayette