(1) Understanding the Dataset: UNSW-NB15
The raw network packets of the UNSW-NB15 dataset were created by the IXIA PerfectStorm
tool in the Cyber Range Lab of the Australian Centre for Cyber Security (ACCS) to generate
a hybrid of real modern normal activities and synthetic contemporary attack behaviours.
The tcpdump tool was used to capture 100 GB of the raw traffic (e.g., pcap files). This dataset has
nine types of attacks, namely Fuzzers, Analysis, Backdoors, DoS, Exploits, Generic,
Reconnaissance, Shellcode and Worms. The Argus and Bro-IDS tools were used, and twelve
algorithms were developed, to generate a total of 49 features with the class label.
a) The features are described here.
b) The number of attack records and their sub-categories is described here.
c) In this coursework, we use a total of 10 million records stored in the CSV file (download).
The total size is about 600 MB, which is big enough to employ big data methodologies
for analytics. As a big data specialist, we would first like to read and understand the
features, then apply modelling techniques. If you want to see a few records of this
dataset, you can import it into Hadoop HDFS, then make a Hive query that prints the
first 5-10 records for your understanding (see the sketch after this list).
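A minimal sketch of this preview step is given below. It assumes the CSV has already been copied into HDFS (for example with hdfs dfs -put UNSW-NB15.csv /user/student/unsw_nb15/) and shows only a handful of illustrative columns from the UNSW-NB15 feature list (srcip, sport, dstip, dsport, proto, state, dur, attack_cat, label); the HDFS path, table name and full 49-column schema are assumptions you will need to adapt:

    -- Assumed HDFS location; upload the CSV first, e.g. hdfs dfs -put UNSW-NB15.csv /user/student/unsw_nb15/
    CREATE EXTERNAL TABLE IF NOT EXISTS unsw_nb15_raw (
      srcip   STRING,
      sport   STRING,   -- kept as STRING to be safe, since some recorded port values are not plain integers
      dstip   STRING,
      dsport  STRING,
      proto   STRING,
      state   STRING,
      dur     DOUBLE
      -- add the remaining feature columns here so the schema covers all 49 features,
      -- ending with attack_cat STRING and label INT
    )
    ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
    STORED AS TEXTFILE
    LOCATION '/user/student/unsw_nb15/';

    -- Print the first 10 records for a quick look at the data
    SELECT * FROM unsw_nb15_raw LIMIT 10;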
(2) Big Data Query & Analysis by Apache Hive [30 marks]
This task is about using Apache Hive to convert big raw data into useful information for the
end users. To do so, first understand the dataset carefully. Then, make at least 4 Hive
queries (refer to the marking scheme). Apply appropriate visualization tools to present
your findings numerically and graphically, and interpret your findings briefly.
Finally, include screenshots of your outcomes (e.g., tables and plots) together with the
scripts/queries in the report.
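As an illustration only (not one of the required queries), a simple HiveQL sketch that counts records per attack category is shown below; it assumes the unsw_nb15_raw table sketched in section (1), including its attack_cat column:

    -- Number of records per attack category (normal traffic typically has an empty attack_cat)
    SELECT attack_cat, COUNT(*) AS record_count
    FROM unsw_nb15_raw
    GROUP BY attack_cat
    ORDER BY record_count DESC;

The query output can then be exported (for example by redirecting the output of hive -f to a file) and plotted with a visualization tool of your choice.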