(1) For large-scale processing and tokenizing of your data, I would consider using something like NLTK or spaCy. That assumes your books are already in text form. If they are scans, you'll need to run some OCR software first.
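As a rough illustration of the tokenizing step, here is a minimal sketch that assumes the books are already plain text and are read line by line. It uses NLTK's `wordpunct_tokenize`, which is regex-based and needs no corpus download. The helper names `iter_tokens` and `count_tokens` and the sample lines are hypothetical, and the fallback only mirrors the same regex so the sketch still runs where NLTK is not installed.

```python
import re
from collections import Counter
from typing import Iterable, Iterator

try:
    # Regex-based tokenizer from NLTK; requires no extra data downloads.
    from nltk.tokenize import wordpunct_tokenize
except ImportError:
    # Fallback (assumption): the same pattern wordpunct_tokenize is built on.
    def wordpunct_tokenize(text: str) -> list:
        return re.findall(r"\w+|[^\w\s]+", text)


def iter_tokens(lines: Iterable[str]) -> Iterator[str]:
    """Yield lowercased alphabetic tokens one line at a time,
    so a whole book never has to sit in memory at once."""
    for line in lines:
        for token in wordpunct_tokenize(line):
            if token.isalpha():  # drop punctuation and numbers
                yield token.lower()


def count_tokens(lines: Iterable[str]) -> Counter:
    """Count token frequencies across an iterable of text lines."""
    return Counter(iter_tokens(lines))


# Hypothetical sample standing in for the lines of a book file.
sample_book = ["The whale surfaced.", "The crew watched the whale."]
counts = count_tokens(sample_book)
print(counts.most_common(2))
```

For a real book, an open file handle can be passed in place of `sample_book`, since a file object yields its lines lazily. spaCy would be the more natural choice if you also need part-of-speech tags or named entities rather than plain tokens.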
NOTE:
The number of mentions on this list counts mentions in common posts plus user-suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Tell HN: Selling My SaaS
- Is it home bias or is data wrangling for machine learning in python much less intuitive and much more burdensome than in R?
- Named Entity Recognition with Spacy
- Selling the Hype: Coding Sentiment Analysis for Stock Market News in 4 STEPS
- Topic modelling with Gensim and SpaCy on startup news