Skip to content

Latest commit

 

History

History
3 lines (2 loc) · 206 Bytes

README.md

File metadata and controls

3 lines (2 loc) · 206 Bytes

OSCAR-CommonCrawl-Collab

This repository contains notes, documents and reports about the collaboration between CommonCrawl and OSCAR regarding a new format of extracted text from CommonCrawl WARC files.