Skip to content

moalk101/PageRank_Wikilinks

Repository files navigation

PageRank_Wikilinks

Usage:

This code is made to calculate the PagRank of the Wikipidia links using MapeReduce from MRJob. To run the code first you need to run all the cells in the notebook datacleaning.ipynb. This will generate a csv file for the actual calculation of the PageRank. Run the runjobs.py file. This will calculate the pageranks and output them as a key-value pairs.

Dataset:

The data used here is from WikiLinkGraphs specifically the code was tested on wikilink_graph.2005-03-01. The whole dataset can be found here: https://consonni.dev/datasets/wikilinkgraphs/

Note

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published