Purdue University Graduate School

Machine Learning-Based Multimedia Analytics

posted on 2020-07-07, 15:01 authored by Daniel Mas Montserrat
Machine learning is widely used to extract meaningful information from video, images, audio, text, and other multimedia data. Modern neural networks, with their hierarchical structure and trained with backpropagation, learn to extract information from large amounts of data and to perform specific tasks such as classification or regression. In this thesis, we explore various approaches to multimedia analytics with neural networks. We present several image synthesis and rendering techniques to generate new images for training neural networks. Furthermore, we present multiple neural network architectures and systems for commercial logo detection, 3D pose estimation and tracking, deepfake detection, and manipulation detection in satellite images.


Degree Type

  • Doctor of Philosophy


Department

  • Electrical and Computer Engineering

Campus location

  • West Lafayette

Advisor/Supervisor/Committee Chair

Edward J. Delp

Additional Committee Member 2

Jan P. Allebach

Additional Committee Member 3

Fengqing M. Zhu

Additional Committee Member 4

Qian Lin