Interpretable natural language processing models with deep hierarchical structures and effective statistical training

Luo, Zhaoxin

doi:10.25394/PGS.24492517.v1

PurdueThesis_zhaoxin (2).pdf (2.35 MB)

Interpretable natural language processing models with deep hierarchical structures and effective statistical training

thesis

posted on 2023-11-03, 19:18 authored by Zhaoxin LuoZhaoxin Luo

The research focuses on improving natural language processing (NLP) models by integrating the hierarchical structure of language, which is essential for understanding and generating human language. The main contributions of the study are:

Hierarchical RNN Model: Development of a deep Recurrent Neural Network model that captures both explicit and implicit hierarchical structures in language.
Hierarchical Attention Mechanism: Use of a multi-level attention mechanism to help the model prioritize relevant information at different levels of the hierarchy.
Latent Indicators and Efficient Training: Integration of latent indicators using the Expectation-Maximization algorithm and reduction of computational complexity with Bootstrap sampling and layered training strategies.
Sequence-to-Sequence Model for Translation: Extension of the model to translation tasks, including a novel pre-training technique and a hierarchical decoding strategy to stabilize latent indicators during generation.

The study claims enhanced performance in various NLP tasks with results comparable to larger models, with the added benefit of increased interpretability.

History

Degree Type

Doctor of Philosophy

Department

Statistics

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Michael Zhu

Additional Committee Member 2

Faming Liang

Additional Committee Member 3

Xiao Wang

Additional Committee Member 4

Vinayak Rao

Usage metrics

Keywords

Hierarchical Mechanics Recurrent Neural Network Model (RNN)language processes

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Interpretable natural language processing models with deep hierarchical structures and effective statistical training

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Additional Committee Member 4

Usage metrics

Categories

Keywords

Licence

Exports