Probabilistic Diagnostic Model for Handling Classifier Degradation in Machine Learning

Valencia-Zapata, Gustavo A.

doi:10.25394/PGS.11312474.v1

PROBABILISTIC DIAGNOSTIC MODEL FOR HANDLING CLASSIFIER DEGRADATION IN MACHINE LEARNING.pdf (2.89 MB)

Probabilistic Diagnostic Model for Handling Classifier Degradation in Machine Learning

thesis

posted on 2019-12-04, 20:50 authored by Gustavo A. Valencia-Zapata

Several studies point out different causes of performance degradation in supervised machine learning. Problems such as class imbalance, overlapping, small-disjuncts, noisy labels, and sparseness limit accuracy in classification algorithms. Even though a number of approaches either in the form of a methodology or an algorithm try to minimize performance degradation, they have been isolated efforts with limited scope. This research consists of three main parts: In the first part, a novel probabilistic diagnostic model based on identifying signs and symptoms of each problem is presented. Secondly, the behavior and performance of several supervised algorithms are studied when training sets have such problems. Therefore, prediction of success for treatments can be estimated across classifiers. Finally, a probabilistic sampling technique based on training set diagnosis for avoiding classifier degradation is proposed

History

Degree Type

Doctor of Philosophy

Department

Electrical and Computer Engineering

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Gerhard Klimeck

Additional Committee Member 2

Kadri O. Ersoy

Additional Committee Member 3

Vinayak A. Rao

Additional Committee Member 4

Michael G. Zentner

Usage metrics

Keywords

Class imbalance Overlapping Small-disjuncts Noisy labels Sparseness Gaussian Mixture Models Separation index Classifier degradation Bayesian Information Criterion (BIC)Pattern Recognition and Data Mining Statistics Artificial Intelligence and Image Processing

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Probabilistic Diagnostic Model for Handling Classifier Degradation in Machine Learning

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Additional Committee Member 4

Usage metrics

Categories

Keywords

Licence

Exports