Skip to content

Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory.

Notifications You must be signed in to change notification settings

gbotev1/WikiExtractor

This branch is 12 commits ahead of, 7 commits behind apertium/WikiExtractor:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
Mar 24, 2021

About

Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%