Official Elasticsearch Python library no longer works with open-source forks

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • elasticsearch-py

    Official Python client for Elasticsearch

  • duckduckgo-locales

    Translation files for <a href="https://duckduckgo.com"> </a>

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • pg_similarity

    set of functions and operators for executing similarity queries

  • While I don't doubt that you know your usecase and weighed/tried the option.

    > Postgres search is essentially an easier to use regex engine.

    I'm not sure exactly what you meant to convey here, but if you're searching with LIKE or `~` you're not doing Postgres's proper Full Text Search. You should be dealing with tsvectors[0]

    > As soon as you need multiple languages

    Postgres FTS supports multiple languages and you can create your own configurations[1]

    > advanced autocomplete

    I'm not sure what "advanced" autocomplete is but you can get pretty fast trigram searches going[2] (back to LIKE/ILIKE here but obviously this is an isolated usecase). In the end I'd expect auto complete results to actually not hit your DB most of the time (maybe I'm naive but that feels like a caching > cache invalidation > cache pushdown problem to me)

    > misspelling detection

    pg_similarity_extension[3] might be of some help here, but it may require some wrangling.

    > large documents, large datasets,

    PG has TOAST[4], and obviously can scale (maybe not necessarily great at it) -- see pg_partman/Timescale/Citus/etc.

    > custom scoring

    Postgres only has basic ranking features[5], but you can write your own functions and extend it of course.

    Solr/ES are definitely the right tools for the job (tm) when the job is search, but you can get surprisingly far with Postgres. I'd argue that many usecases actually don't want/need a perfect full text search solution -- it's often minor features that turn into overkill fests and ops people learning/figuring out how to properly manage and scale an ES cluster and falling into pitfalls along the way.

    [0]: https://www.postgresql.org/docs/current/textsearch-intro.htm...

    [1]: https://www.postgresql.org/docs/current/textsearch-intro.htm...

    [2]: https://about.gitlab.com/blog/2016/03/18/fast-search-using-p...

    [3]: https://github.com/eulerto/pg_similarity

    [4]: https://www.postgresql.org/docs/current/storage-toast.html

    [5]: https://www.postgresql.org/docs/9.5/textsearch-controls.html...

  • git-imerge

    Incremental merge for git

  • I wonder what criteria you apply to determine the trustworthiness of a project. For me, signing a CLA or otherwise not using inbound=outbound licensing is a major one, as well as any project backed by a company with VC funding. Any project backed by a single organisation instead of a group of people from lots of different organisations is a red flag, with some exceptions for long-term known-trustworthy non-profits.

    None of that helps with a situation like Audacity/MuseCore though, if developers are willing to sell out their project copyrights, that isn't something that you can really protect against, except maybe discussing people's opinions on that openly.

    The other issue with withholding changes from upstream is the potentially infinite cost of updating your changes as the project evolves, things like git-imerge, mergify or git-mergify-rebase can reduce that burden by letting you do incremental rebases/merges though. Normally I don't contribute to projects with a CLA assigning extra rights to corporations over the license, but I've been considering signing one just to drop the maintenance burden.

    https://github.com/mhagger/git-imerge

  • mergify

    Merge git changes on commit at a time.

  • git-mergify-rebase

    Merge git changes one commit at a time.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Bountysource Stole at Least $17,000 from Open Source Developers

    2 projects | news.ycombinator.com | 3 May 2024
  • I Parsed Git Statistics

    1 project | news.ycombinator.com | 4 Apr 2024
  • How to setup Black and pre-commit in python for auto text-formatting on commit

    3 projects | dev.to | 29 Mar 2024
  • The half-life of code and the ship of Theseus

    1 project | news.ycombinator.com | 14 Mar 2024
  • Build Initramfs Rootless

    8 projects | news.ycombinator.com | 8 Mar 2024