NETWORK AND TOPOLOGICAL ANALYSIS OF SCHOLARLY METADATA: A PLATFORM TO MODEL AND PREDICT COLLABORATION

Novak, Lance C

doi:10.25394/PGS.9108482.v1

Lance Novak Thesis Deposit 25 July 2019.pdf (2.84 MB)

NETWORK AND TOPOLOGICAL ANALYSIS OF SCHOLARLY METADATA: A PLATFORM TO MODEL AND PREDICT COLLABORATION

thesis

posted on 2019-08-15, 14:54 authored by Lance C NovakLance C Novak

The scale of the scholarly community complicates searches within scholarly databases, necessitating keywords to index the topics of any given work. As a result, an author’s choice in keywords affects the visibility of each publication; making the sum of these choices a key representation of the author’s academic profile. As such the underlying network of investigators are often viewed through the lens of their keyword networks. Current keyword networks connect publications only if they use the exact same keyword, meaning uncontrolled keyword choice prevents connections despite semantic similarity. Computational understanding of semantic similarity has already been achieved through the process of word embedding, which transforms words to numerical vectors with context-correlated values. The resulting vectors preserve semantic relations and can be analyzed mathematically. Here we develop a model that uses embedded keywords to construct a network which circumvents the limitations caused by uncontrolled vocabulary. The model pipeline begins with a set of faculty, the publications and keywords of which are retrieved by SCOPUS API. These keywords are processed and then embedded. This work develops a novel method of network construction that leverages the interdisciplinarity of each publication, resulting in a unique network construction for any given set of publications. Postconstruction the network is visualized and analyzed with topological data analysis (TDA). TDA is used to calculate the connectivity and the holes within the network, referred to as the zero and first homology. These homologies inform how each author connects and where publication data is sparse. This platform has successfully modelled collaborations within the biomedical department at Purdue University and provides insight into potential future collaborations.

History

Degree Type

Master of Science in Biomedical Engineering

Department

Biomedical Engineering

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Tamara Kinzer-Ursem, Ph.D.

Advisor/Supervisor/Committee co-chair

Pete Pascuzzi, Ph.D.

Additional Committee Member 2

Jacqueline Linnes, Ph.D.

Usage metrics

Keywords

Topological Data Analysis Keyword network Meta-data Scholarly communications Biomechanical Engineering Library and Information Studies Topology

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

NETWORK AND TOPOLOGICAL ANALYSIS OF SCHOLARLY METADATA: A PLATFORM TO MODEL AND PREDICT COLLABORATION

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Advisor/Supervisor/Committee co-chair

Additional Committee Member 2

Usage metrics

Categories

Keywords

Licence

Exports