This repository contains several Jupyter Notebooks to explore the Patristic Text Archive beyond the tools provided in the web frontend.
pta_nlp_cld.ipynb
using CLDpta_nlp_spacy.ipynb
using grc_proiel_sm (cf. greCy. Ancient Greek models for spaCy)keywords-in-context.ipynb
collocations.ipynb
analyse_corpus.ipynb
(using TF-IDF)biblical_quotations.ipynb
(usingseverian_quotes.json
and pta_metadata repository)convert_pta_totext.ipynb
Helper notebook to convert PTA-XML to a csv filelemmatize_all.ipynb
Helper notebook to lemmatize all text generated by convert_pta_totext.ipynb
DejaVuSans.ttf
(to be able to use Greek Extended in wordclouds)severian_plaintext.csv
as generated byconvert_pta_totext.ipynb
severian_plaintext_lemmatized.csv
as generated bylemmatize_all.ipynb
severian_quotes.json
as generated byconvert_pta_totext.ipynb
- SpaCy model
grc_proiel_sm-3.7-py3-none-any.whl
(downloaded from https://huggingface.co/Jacobo/grc_proiel_sm and filename adjusted to adhere to format convention of pip > 24.1)