Encoding IP Address as a Feature for Network Intrusion Detection

Shao, Enchun

doi:10.25394/PGS.11307287.v1

Encoding IP Address as a Feature for Network Intrusion Detection.pdf (675.78 kB)

Encoding IP Address as a Feature for Network Intrusion Detection

thesis

posted on 2019-12-03, 13:49 authored by Enchun ShaoEnchun Shao

As machine learning algorithms take on more important roles in various areas of big data analysis, more accurate research is needed. Although machine learning techniques have been applied in network intrusion detection, encoding IP addresses as a feature of network intrusion detection has not been discussed. Since the IP address is strongly relevant to network intrusion detection, it cannot be ignored when predicting a network attack. Therefore, three machine learning algorithms - random forest, support vector machine (SVM), and decision tree have been applied for the present study to examine three IP address encoding methods for NetFlow data: converting into four individual numbers, converting into binary integers, and one hot encoding. The pivot of the study was to analyze the F-1, precision, recall, and accuracy scores of the machine learning algorithms and determine the best method of encoding IP addresses for network intrusion detection. In addition, 21 features of the data set related to the destination port, packets, destination IP, source port, flow duration, source IP, flags, and labels were also considered, as well as speed. The study shows that the best method of encoding an IP address was splitting the IP address into four numbers and the decision tree, random forest, and SVM accuracy scores for this method were 0.9562, 0.9631, and 0.9296, respectively.

History

Degree Type

Master of Science

Department

Computer and Information Technology

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Baijian Yang

Additional Committee Member 2

Tonglin Zhang

Additional Committee Member 3

Wenhai Sun

Usage metrics

Keywords

machine learning-based Network intrusion detection Applied Computer Science

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Encoding IP Address as a Feature for Network Intrusion Detection

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Usage metrics

Categories

Keywords

Licence

Exports