kraken-voulgaris-aeneid

https://ryanfb.github.io/kraken-voulgaris-aeneid/groundtruth/

This project generates an edition-specific Kraken OCR training file for Evgenios Voulgaris’ Greek translation of the Aeneid.

Data

Google Books volumes were used as source data. Copies of these are available as renamed PDFs in the pdfs directory.

Training

Run make to train with the defaults, or override them with environment variables, e.g.

USE_DOCKER=false CUDA_DEVICE=cuda:0 make
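
The override above relies on two general behaviours: a VAR=value prefix puts the variable in the environment of that one command only, and make turns environment variables into make variables. The sketch below illustrates just the scoping part in plain shell. It uses a hypothetical stand-in command rather than the project's Makefile, and the fallback values true and cpu are assumptions for illustration, not the project's actual defaults.

```shell
# Stand-in for the real build: prints the settings a child process sees.
# The fallbacks "true" and "cpu" are illustrative assumptions, not the
# project's actual defaults.
show_settings='echo "USE_DOCKER=${USE_DOCKER:-true} CUDA_DEVICE=${CUDA_DEVICE:-cpu}"'

# Make sure nothing is inherited from the calling shell.
unset USE_DOCKER CUDA_DEVICE

# No overrides: the child process falls back to its own defaults.
default_run=$(sh -c "$show_settings")

# Prefix assignments are exported to this one command only.
override_run=$(USE_DOCKER=false CUDA_DEVICE=cuda:0 sh -c "$show_settings")

# The calling shell is unchanged afterwards, so defaults apply again.
after_run=$(sh -c "$show_settings")

echo "$default_run"
echo "$override_run"
echo "$after_run"
```

Because the prefix form does not persist, each invocation of make has to repeat the overrides, or the variables have to be exported in the shell first.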