Exploring Lexical Sensitivities in Word Prediction Models: A case study on BERT

Misra, Kanishka

doi:10.25394/PGS.13308830.v1

Exploring_Lexical_Sensitivities_in_Word_Prediction_Models__A_case_study_on_BERT (2).pdf (1.74 MB)

Exploring Lexical Sensitivities in Word Prediction Models: A case study on BERT

thesis

posted on 2020-12-01, 15:04 authored by Kanishka MisraKanishka Misra

Estimating word probabilities in context is the most fundamental mechanism underlying the training of neural network-based language processing models.

Models pre-trained using this mechanism tend to learn task independent representations that exhibit a variety of semantic regularities that are desirable for language processing.

While prediction based tasks have become an important component for these models, much is unknown about what kinds of information the models draw from context to inform word probabilities.

The present work aims to advance the understanding of word prediction models by integrating perspectives from the psycholinguistic phenomenon of semantic priming, and presents a case study analyzing the lexical properties of the pretrained BERT model.

Using stimuli that cause priming in humans, this thesis relates BERT's sensitivity towards lexical cues with predictive contextual constraints and finer-grained lexical relations.

To augment the empirical methodology utilized to behaviorally analyze BERT, this thesis draws on the knowledge-rich paradigm of Ontological Semantics and fuzzy-inferences supported by its practical realization, the Ontological Semantics Technology, to qualitatively relate BERT's predictive mechanisms to meaning interpretation in context.

The findings establish the importance of considering predictive constraint effects of context in studies that behaviorally analyze language processing models, and highlight possible parallels with human processing.

History

Degree Type

Master of Science

Department

Computer and Information Technology

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Julia Taylor Rayz

Additional Committee Member 2

John Springer

Additional Committee Member 3

Victor Raskin

Usage metrics

Keywords

Natural language processing Semantic Priming Interpretability Cognitive science Natural Language Processing Cognitive Science not elsewhere classified Knowledge Representation and Machine Learning

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Exploring Lexical Sensitivities in Word Prediction Models: A case study on BERT

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Usage metrics

Categories

Keywords

Licence

Exports