RCNX: RESIDUAL CAPSULE NEXT

Anilkumar, Arjun Narukkanchira

doi:10.25394/PGS.14489091.v1

RCNX.pdf (2.35 MB)

thesis

posted on 2021-05-10, 15:38, authored by Arjun Narukkanchira Anilkumar

New machine learning models appear every day. Most computer-vision-oriented machine learning models derive from the basic structure of the Convolutional Neural Network (CNN). Machine learning developers use CNNs extensively in image classification, object recognition, and image segmentation. Although CNNs produce highly compatible models with superior accuracy, they have disadvantages. Estimating pose and transformation for computer vision applications is a difficult task for a CNN, because its functions are capable of learning only shift-invariant features of an image. These limitations motivate machine learning developers to create more complex algorithms.

The search for new machine learning models led to Capsule Networks. A Capsule Network can estimate the pose of objects in an image and recognize transformations applied to those objects. Handwritten digit classification was the task that capsule networks were initially designed to solve. Capsule Networks outperform all models on the MNIST handwritten-digit dataset, but using Capsule Networks for image classification is not a straightforward multiplication of parameters. By replacing the Capsule Network's initial layer, a simple convolutional layer, with complex architectures from CNNs, the authors of the Residual Capsule Network achieved a tremendous change in capsule network applications without a high number of parameters.

This thesis focuses on improving the recent Residual Capsule Network (RCN) to the point where accuracy and model size are optimal for the image classification task, using the CIFAR-10 dataset as the benchmark. Our search for an exemplary capsule network led to the invention of RCN2: Residual Capsule Network 2 and RCNX: Residual Capsule NeXt. RCNX is the next generation of RCN. Both outperform existing capsule network architectures for image classification, such as 3-level RCN, DCNet, DCNet++, and Capsule Network, and even outperform compact CNNs like MobileNet V3. RCN2 achieved an accuracy of 85.12% with 1.95 million parameters, and RCNX achieved 89.31% accuracy with 1.58 million parameters on the CIFAR-10 benchmark.
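The trade-off between the two reported models can be made concrete with a little arithmetic. The sketch below is illustrative only: it uses the accuracy and parameter figures quoted above, while the dictionary layout and the `compare` helper are hypothetical names introduced here, not part of the thesis.

```python
# Figures quoted in the abstract (CIFAR-10 benchmark).
# The data layout and helper name are assumptions made for illustration.
results = {
    "RCN2": {"accuracy_pct": 85.12, "params_millions": 1.95},
    "RCNX": {"accuracy_pct": 89.31, "params_millions": 1.58},
}


def compare(baseline, candidate):
    """Return (accuracy gain in percentage points, parameter reduction in %)."""
    gain = candidate["accuracy_pct"] - baseline["accuracy_pct"]
    reduction = (
        100.0
        * (baseline["params_millions"] - candidate["params_millions"])
        / baseline["params_millions"]
    )
    return round(gain, 2), round(reduction, 1)


gain_pp, param_reduction_pct = compare(results["RCN2"], results["RCNX"])
print(f"RCNX vs RCN2: +{gain_pp} accuracy points, "
      f"{param_reduction_pct}% fewer parameters")
```

On these figures, RCNX gains roughly four percentage points of accuracy over RCN2 while using about a fifth fewer parameters.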

History

Degree Type

Master of Science in Electrical and Computer Engineering

Department

Electrical and Computer Engineering

Campus location

Indianapolis

Advisor/Supervisor/Committee Chair

Mohamed El-Sharkawy

Additional Committee Member 2

Brian King

Additional Committee Member 3

Maher Rizkalla

Keywords

RCNX, Residual Capsule Next, RCN2, Residual Capsule Network V2, Cifar10, Computer Engineering

Licence

CC BY 4.0
