Releases: nlnwa/nlnwa-notebooks
Releases · nlnwa/nlnwa-notebooks
v.0.4.1-alpha
added requirements.txt
for the Web News Collection notebook
v0.4-alpha
Delete notebooks/corpus/metadata/file
v0.3-alpha
Added experimental notebook to work with the Web News Corpus, published via DHlab's API.
The notebook is meant to serve as an example of usage for early adopters and others interested in testing to work with the Web News Corpus using dhlab
for python.
Features:
- build corpus from
doctype:"nettavis"
- visualise items in corpus for insight purposes
- get concordances (snippets of text around keyword)
- get collocated words
- calculate relative frequency of collocated words
- export corpus, concordances and colocates as Excel and JSONL
v0.2.0-alpha
feat:Update README.md Updated with new notebook (warc2any) and rewrote a link to documentation