elasticsearch-py
image-match
Our great sponsors
elasticsearch-py | image-match | |
---|---|---|
21 | 5 | |
4,121 | 2,911 | |
0.8% | 0.4% | |
8.7 | 0.0 | |
6 days ago | over 1 year ago | |
Python | Python | |
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
elasticsearch-py
- An alternative to Elasticsearch that runs on a few MBs of RAM
- Elastic Open Sources Their Endpoint Security Protection YARA Ruleset
-
OpenSearch – open-source search and analytics based on Apache 2.0 Elasticsearch
And my bet is it's the one most are going to be using from now on. I used to think this was a fairly black and white issue, but now two things have coloured it for me.
Firstly, dick moves like this: https://github.com/elastic/elasticsearch-py/pull/1623
Secondly, I don't buy the argument from Elastic any more. Yes, the ethical thing to do when you're making money from someone's work is at least contribute back. At the same time though, they're making money from packaging it up and selling it _as a service_. That "as a service" part is where they're making the bucks.
A bonus thirdly; OpenSearch really is Open Source, and ElasticSearch no longer is.
FD: I have a friend who works at Elastic, though he doesn't really colour my opinions of things.
> Firstly, dick moves like this: https://github.com/elastic/elasticsearch-py/pull/1623
I understand that this is unpopular, but you can make a very strong argument that it's to prevent weird errors in the future. I'm also guilty of littering my code with Asserts to ensure the universe is working fine.
The alternative is to allow it to work and then you end up with weird issues like when you connect mysql client to mariadb server (and vice-versa): https://stackoverflow.com/questions/50169576/mysql-8-0-11-er...
> Secondly, I don't buy the argument from Elastic any more. Yes, the ethical thing to do when you're making money from someone's work is at least contribute back. At the same time though, they're making money from packaging it up and selling it _as a service_. That "as a service" part is where they're making the bucks.
That's just an opinion, yes they have a service, and yes it competes with Amazon. Is it cool for Amazon to take a body of work and sell it without supporting it? Are amazon actually supporting it? Is it the same as Elastic using Lucene? (not really because Elastic submits a the majority of fixes to Lucene, but, you get it).
it's kinda gray, I'm sure Amazon thinks they're the good guy, but it's hard for me to look at Elastic as the bad guy in all this.
-
I Don't Think Elasticsearch Is a Good Logging System
Oh man, https://github.com/elastic/elasticsearch-py/issues/1734 is a disappointing read. I know ES wants to save their business, but alienating users isn't exactly the path to success.
- Official Elasticsearch Python library no longer works with open-source forks
- Elasticsearch adding code to reject connections to OpenSearch clusters or to clusters running open source distributions of ES7
image-match
-
Find visual similar photos
Not a full solution but imagematch is a widely adopted algorithm for this purpose. There’s even a very nice dockerized version
-
Non-Machine Learning Image Matching with a Vector DB
I also developed a service based on the Goldberg paper in the mid 2010s. We flattened and discretized the signature to make it searchable which gave us pretty good results: https://github.com/ProvenanceLabs/image-match
Sorry I haven’t maintained the project in years so it’s unlikely to work out of the box. But who knows, maybe you’ll find something useful here for your project!
-
Here is a demonstration of the new features for the tool I'm working on
There are libraries available online like this one that will compare between images and return the closest match.
-
How to find photo duplicates (not hash duplicates) and clean them?
https://github.com/ProvenanceLabs/image-match
What are some alternatives?
searxng - SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
quickwit - Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
match - :crystal_ball: Scalable reverse image search built on Kubernetes and Elasticsearch
helm-charts
evtx2es - A library for fast parse & import of Windows Eventlogs into Elasticsearch.
qryn - qryn is a polyglot, high-performance observability framework for ClickHouse. Ingest, store and analyze logs, metrics and telemetry traces from any agent supporting Loki, Prometheus, OTLP, Tempo, Elastic, InfluxDB and many more formats and query transparently using Grafana or any other compatible client.
zeek-clickhouse
git-imerge - Incremental merge for git
orama - 🌌 Fast, dependency-free, full-text and vector search engine with typo tolerance, filters, facets, stemming, and more. Works with any JavaScript runtime, browser, server, service!
mergify - Merge git changes on commit at a time.
GARI - GARI (Genetic Algorithm for Reproducing Images) reproduces a single image using Genetic Algorithm (GA) by evolving pixel values.
pg_similarity - set of functions and operators for executing similarity queries