cremma-16-17-print
kraken
cremma-16-17-print | kraken | |
---|---|---|
1 | 2 | |
0 | 643 | |
- | - | |
0.0 | 9.1 | |
over 1 year ago | 7 days ago | |
XSLT | Python | |
Creative Commons Zero v1.0 Universal | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cremma-16-17-print
-
What would you love to see in a modern edition of a 16th-century work?
I would also recommend to look into OCR and the progress in this area, at least to help you kickstart the trancription. Simon Gabay at Geneva is leading a project where they host eScriptorium, and their results are quite good. I have produced myself some data for Latin ( https://github.com/HTR-United/cremma-16-17-print ) and, except for some rare characters, the results are more than promising (much more than what Abbyy can do).
kraken
- Kraken: Turn-key OCR system optimised for historical and non-Latin script
-
Where should I learn first if I want to code my own bots?
just realise there are lots of pre-made open source packages for the exchanges. This one looks very robust: https://github.com/ccxt/ccxt So all the hard work has been done. I will warn you the learning curve is very steep and making a trading bot is in itself difficult and dangerous if there is a bug. Here's kraken's python script which is a stand alone program used to access their API, looks greek to me: https://github.com/mittagessen/kraken/blob/master/kraken/kraken.py
What are some alternatives?
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
handprint - Apply different text recognition services to images of handwritten documents.
benchmarks - Public dataset benchmarks used for measuring the performance of MindsDB.
flow-forecast - Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
igel - a delightful machine learning tool that allows you to train, test, and use models without writing code
PdfPig - Read and extract text and other content from PDFs in C# (port of PDFBox)
NCRFpp - NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
ccxt - A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
robin - RObust document image BINarization
Easter2 - Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION