Skip to content

Sprizgola/web-traffic-log-analyzer

Repository files navigation

web-traffic-log-analyzer

This repo aim to analyze the web traffic data exploiting the Apache access log using an unsupervised approach. The approach used here is inspired by this paper.

Processing data

To process the data and extract the features run:

python preprocess_data.py --input_path PATH_TO_RAW_LOG --output_path PATH_TO_SAVE_DATA

There is a logfile sample in the raw_data directory.

Analyze data

To start the log analyzer run:

python main.py --data_path PATH_TO_PROCESSED_LOG --plot_output_path PATH_TO_SAVE_PLOT.

plot_output_path is an optional argument and can be ignored if you don't want to save the plot.

About

Web traffic analysis through ML techniques

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages