How to index PDF files with AppSearch

This page summarizes the projects mentioned and recommended in the original post on /r/elasticsearch

  • fscrawler

    Elasticsearch File System Crawler (FS Crawler)

  • Have you given FSCrawler a shot? It is also a file system crawler that automatically ingests files into an Elasticsearch index, and it is built on Tika, so it behaves much the same (multiple file formats, multiple languages, ...). It can also perform OCR while indexing (using Tesseract) by toggling it on in its config. I did not try the OCR much, both for lack of need and because of the ridiculous amount of time it added to indexing on my small environment.

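As a rough illustration of the FSCrawler suggestion above, the sketch below builds a job settings file with OCR toggled on. It is a minimal sketch, not taken from the thread: the job name `pdf_docs`, the directory `/tmp/pdfs`, the Elasticsearch URL, and the helper names `build_settings` and `write_settings` are all hypothetical. The key names (`fs.url`, `fs.ocr.enabled`, `fs.ocr.language`, `elasticsearch.nodes`) follow the FSCrawler 2.x settings layout as I understand it and should be checked against the documentation for the installed version.

```python
import json
from pathlib import Path


def build_settings(job_name: str, docs_dir: str, ocr: bool = False,
                   es_url: str = "http://127.0.0.1:9200") -> dict:
    """Return a dict shaped like an FSCrawler job settings file.

    Key names are assumed from the FSCrawler 2.x settings layout;
    verify them against the docs for your version.
    """
    return {
        "name": job_name,
        "fs": {
            # Directory that FSCrawler should scan for documents.
            "url": docs_dir,
            # OCR via Tesseract; leaving it off keeps indexing fast.
            "ocr": {"enabled": ocr, "language": "eng"},
        },
        # Hypothetical local single-node cluster.
        "elasticsearch": {"nodes": [{"url": es_url}]},
    }


def write_settings(settings: dict, config_root: Path) -> Path:
    """Write the settings to <config_root>/<job name>/_settings.json."""
    job_dir = config_root / settings["name"]
    job_dir.mkdir(parents=True, exist_ok=True)
    path = job_dir / "_settings.json"
    path.write_text(json.dumps(settings, indent=2))
    return path
```

Calling `write_settings(build_settings("pdf_docs", "/tmp/pdfs", ocr=True), Path.home() / ".fscrawler")` would place the file where FSCrawler is assumed to look for job configs by default, after which the job would be started by passing its name to the `fscrawler` launcher. Given the commenter's experience with indexing time, it may be worth starting with `ocr=False` and enabling it only for scanned PDFs.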

