File(s) under embargo: 4 month(s) 12 day(s) until file(s) become available
A SYSTEMATIC STUDY OF SPARSE DEEP LEARNING WITH DIFFERENT PENALTIES
Deep learning has been the driving force behind many successful data science achievements. However, the deep neural network (DNN) that forms the basis of deep learning is
often over-parameterized, leading to training, prediction, and interpretation challenges. To
address this issue, it is common practice to apply an appropriate penalty to each connection
weight, limiting its magnitude. This approach is equivalent to imposing a prior distribution
on each connection weight from a Bayesian perspective. This project offers a systematic investigation into the selection of the penalty function or prior distribution. Specifically, under
the general theoretical framework of posterior consistency, we prove that consistent sparse
deep learning can be achieved with a variety of penalty functions or prior distributions.
Examples include amenable regularization penalties (such as MCP and SCAD), spike-and-slab priors (such as the mixture Gaussian distribution and the mixture Laplace distribution), and
polynomially decaying priors (such as the Student-t distribution). Our theory is supported by
numerical results.
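
For readers unfamiliar with the penalties and priors named above, the following minimal NumPy sketch (not taken from the dissertation; parameter values such as lam, gamma, a, p, sigma0, sigma1, and nu are illustrative assumptions) writes each of them as an elementwise penalty that could be added to a DNN's training loss over its connection weights.

import numpy as np

def mcp(w, lam=0.1, gamma=3.0):
    # Minimax concave penalty (MCP), applied elementwise to weights w.
    a = np.abs(w)
    return np.where(a <= gamma * lam,
                    lam * a - a**2 / (2.0 * gamma),
                    0.5 * gamma * lam**2)

def scad(w, lam=0.1, a=3.7):
    # Smoothly clipped absolute deviation (SCAD) penalty, elementwise.
    x = np.abs(w)
    inner = lam * x
    middle = (2.0 * a * lam * x - x**2 - lam**2) / (2.0 * (a - 1.0))
    outer = 0.5 * (a + 1.0) * lam**2
    return np.where(x <= lam, inner, np.where(x <= a * lam, middle, outer))

def neg_log_mixture_gaussian(w, p=0.05, sigma0=0.01, sigma1=1.0):
    # Negative log-density of a spike-and-slab mixture Gaussian prior
    # (narrow "spike" component plus wide "slab" component).
    spike = (1.0 - p) * np.exp(-0.5 * (w / sigma0)**2) / (sigma0 * np.sqrt(2 * np.pi))
    slab = p * np.exp(-0.5 * (w / sigma1)**2) / (sigma1 * np.sqrt(2 * np.pi))
    return -np.log(spike + slab)

def neg_log_student_t(w, nu=1.0):
    # Negative log-density (up to an additive constant) of a Student-t prior,
    # an example of a polynomially decaying prior.
    return 0.5 * (nu + 1.0) * np.log1p(w**2 / nu)

# Each function acts as an additive regularizer on the training loss:
# total_loss = data_loss + sum of the penalty over all connection weights.
weights = np.random.randn(1000) * 0.5  # stand-in for a DNN's connection weights
for name, pen in [("MCP", mcp), ("SCAD", scad),
                  ("mixture Gaussian", neg_log_mixture_gaussian),
                  ("Student-t", neg_log_student_t)]:
    print(f"{name:18s} total penalty: {pen(weights).sum():.3f}")

The priors are written as negative log-densities because, as the abstract notes, penalizing a connection weight is equivalent (from a Bayesian perspective) to placing a prior on it: minimizing data loss plus the negative log-prior is the same as maximizing the posterior.
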
Degree Type
- Doctor of Philosophy
Department
- Statistics
Campus location
- West Lafayette