
Web Browsing

This folder contains the artifacts for Section 6 (Web Browsing): the dataset, the analysis scripts, and the plotting scripts that generate Figures 19 to 22 and Table 6.

Folder structure

Foldername/Filename	Description
raw_dataset/[date]/har-file/[timeStampID]	The collected HAR dataset.
raw_dataset/[date]/tcpdump-file/[timeStampID]	The collected TCPDUMP dataset.
processed_dataset	The processed data.
scripts	Scripts for generating plots and results.
preprocessing.py	The preprocessing script.
plot-section6.sh	A single command that runs all the scripts.
plots	Results and plots generated by the Bash scripts are saved in this folder.
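To get a quick overview of what a copy of raw_dataset contains, the layout in the table above can be walked with a few lines of Python. This is a minimal sketch: the function name count_captures is hypothetical, and it only assumes the raw_dataset/[date]/har-file and raw_dataset/[date]/tcpdump-file layout described above, counting whatever entries sit under each.

```python
from pathlib import Path

# Sub-folder names taken from the folder structure table.
CAPTURE_KINDS = ("har-file", "tcpdump-file")


def count_captures(root="raw_dataset"):
    """Map each date folder to the number of entries under har-file and tcpdump-file."""
    counts = {}
    for date_dir in sorted(Path(root).iterdir()):
        if not date_dir.is_dir():
            continue  # ignore stray files next to the date folders
        per_kind = {}
        for kind in CAPTURE_KINDS:
            kind_dir = date_dir / kind
            # A missing sub-folder is reported as zero entries.
            per_kind[kind] = sum(1 for _ in kind_dir.iterdir()) if kind_dir.is_dir() else 0
        counts[date_dir.name] = per_kind
    return counts


if __name__ == "__main__":
    if Path("raw_dataset").is_dir():
        for date, per_kind in count_captures().items():
            print(date, per_kind)
```

A date whose HAR and TCPDUMP counts differ may be worth checking before preprocessing is started.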

Requirements

Python (>=3.6)
Scapy (>=2.4.4)
NumPy (>=1.19.5)
haralyzer (>=1.8.0)
seaborn (>=0.10.0)
scikit-learn (>=0.24.1)
Matplotlib (>=3.1.3)
graphviz (>=0.16)

In addition to the Python packages, install the Graphviz system package. On Ubuntu, use the following command:

sudo apt-get install graphviz
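Before running the scripts, the installed package versions can be compared against the minimums listed above. The following is a minimal sketch, not part of the repository: it assumes Python 3.8 or newer (for importlib.metadata), the distribution names in MINIMUM_VERSIONS are assumed to match the usual PyPI names, and the helper names are hypothetical.

```python
import re
from importlib import metadata  # standard library from Python 3.8 onwards

# Minimum versions copied from the Requirements list.
# The keys are assumed PyPI distribution names.
MINIMUM_VERSIONS = {
    "scapy": "2.4.4",
    "numpy": "1.19.5",
    "haralyzer": "1.8.0",
    "seaborn": "0.10.0",
    "scikit-learn": "0.24.1",
    "matplotlib": "3.1.3",
    "graphviz": "0.16",
}


def version_tuple(version, width=3):
    """Turn '1.19.5' or '2.5.0rc1' into a tuple of ints, padded with zeros to `width`."""
    numbers = []
    for part in version.split(".")[:width]:
        match = re.match(r"\d+", part)  # keep only the leading digits of each part
        numbers.append(int(match.group()) if match else 0)
    while len(numbers) < width:
        numbers.append(0)
    return tuple(numbers)


def meets_minimum(installed, minimum):
    """True if the installed version is at least the minimum version."""
    return version_tuple(installed) >= version_tuple(minimum)


def check_requirements(minimums=MINIMUM_VERSIONS):
    """Return (name, installed_or_None, minimum) for every unmet requirement."""
    problems = []
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append((name, None, minimum))
            continue
        if not meets_minimum(installed, minimum):
            problems.append((name, installed, minimum))
    return problems


if __name__ == "__main__":
    for name, installed, minimum in check_requirements():
        print(f"{name}: found {installed or 'nothing'}, need >= {minimum}")
```

The comparison only looks at the leading numeric parts of each version, so pre-release suffixes such as rc1 are ignored. The check covers the Python packages only, not the Graphviz system package installed with apt-get.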

Generating Plots

Step 1 (Data Preprocessing)

The dataset is large, so preprocessing can take several hours on a typical laptop. To save time, we provide a fully processed dataset.

Follow these instructions to skip the preprocessing step:

Clone the repository to your computer and move to the Web-Browsing subfolder.
Replace the processed_dataset folder with the one in Google Drive.
Proceed to Step 2.

Follow these instructions to preprocess the complete dataset:

Clone the repository to your computer and move to the Web-Browsing subfolder.
Replace the raw_dataset folder with the one in Google Drive.
Run python preprocessing.py
After preprocessing is complete, two pickle files, WebSet.pickle and fileStatistics.pickle, will be saved in the processed_dataset folder. Details of the pickle files can be found here.

By default, raw_dataset and processed_dataset contain sample files that can be used to test the environment or to experiment with the scripts.
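Before moving on to Step 2, it can help to confirm that both pickle files exist and can be loaded. The sketch below is an illustration only: the helper names are hypothetical, and nothing is assumed about the structure of the pickled objects beyond their file names. Unpickling may also require that any classes used by preprocessing.py are importable, and pickle files should only be loaded from a trusted source.

```python
import pickle
from pathlib import Path

# File names taken from the preprocessing instructions.
EXPECTED_PICKLES = ("WebSet.pickle", "fileStatistics.pickle")


def missing_pickles(folder="processed_dataset"):
    """Return the expected pickle file names that are not present in `folder`."""
    folder = Path(folder)
    return [name for name in EXPECTED_PICKLES if not (folder / name).is_file()]


def describe_pickle(path):
    """Load a pickle and return (type name, length), with length None for unsized objects."""
    with open(path, "rb") as handle:
        obj = pickle.load(handle)  # only load files you trust
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size


if __name__ == "__main__":
    missing = missing_pickles()
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        for name in EXPECTED_PICKLES:
            print(name, describe_pickle(Path("processed_dataset") / name))
```

An empty list from missing_pickles means the folder is ready for the plotting step.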

Step 2 (Generate the plots)

Once both pickle files are present in the processed_dataset folder, use the following bash command to generate the results and plots:

bash plot-section6.sh

The generated results will be saved in the plots folder.
