Document Analysis

In this folder you will find the training data, and the models generated from that data, for the individual manuscripts processed at DDMAL. Some models were generated with an older method (see this archived page on document analysis). The current method for document analysis is described on the Document Analysis by Iterative Training with Paco Classifier page. The default settings of the Paco Trainer work well, but here are a few notes on when changing them could be useful (as was the case for MS 73).

The MS_73 manuscript illustrates the current way of training, using Paco's method for iterative training (see documentation). In this method, we train with ZIP Pixel files, each containing the image and layers corresponding to one page. The three JSON files in this folder provide the workflows used to label the data of a page as belonging to the different layers, train the models, and classify a new page.
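Before training, it can help to check that each per-page ZIP file contains what you expect. The following is a minimal sketch using only the Python standard library. It assumes the image and layers are stored as ordinary image entries inside each ZIP; the set of file extensions and the function names `list_page_images` and `summarize_folder` are illustrative assumptions, not part of the Paco tools.

```python
import zipfile
from pathlib import Path

# Assumption: the page image and its layers are stored in common image formats.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def list_page_images(zip_path):
    """Return the sorted names of the image entries in one per-page ZIP file."""
    with zipfile.ZipFile(zip_path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if Path(name).suffix.lower() in IMAGE_SUFFIXES
        )


def summarize_folder(folder):
    """Map each ZIP file name in `folder` to the image entries it contains."""
    return {
        zip_file.name: list_page_images(zip_file)
        for zip_file in sorted(Path(folder).glob("*.zip"))
    }
```

Calling `summarize_folder` on a folder of training ZIP files (the path is whatever you use locally) gives a quick overview of which pages have an unexpected number of entries.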