Releases: georgew79/CommonCrawler
Basic Text Processing
The repository now supports basic text processing. Planned improvements have been added to the README. Significant work is still needed on the documentation and on the cleaning parameters.
More complex filtering will follow, with formal tests to come later.