Presto
Elasticsearch
Our great sponsors
Presto | Elasticsearch | |
---|---|---|
14 | 91 | |
15,591 | 67,632 | |
0.9% | 1.2% | |
9.9 | 10.0 | |
5 days ago | 3 days ago | |
Java | Java | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Presto
-
Multi-Database Support in DuckDB
We have some of this functionality in Presto (https://github.com/prestodb/presto), but it takes fair bit of work to implement it for all the different backends.
-
Rust std:fs slower than Python
Note that glibc has a similar problem in multithreaded contexts. It strands unused memory in thread-local pools, which grows your memory usage over time like a memory leak. We got lower memory usage that didn't grow over time by switching to jemalloc.
Example of this: https://github.com/prestodb/presto/issues/8993
- Ask HN: What are some SQL transpilers?
-
Cheat sheet for quotes usage?
I look at the grammar. Here is preto's grammar which is mostly similar to other sql engines: https://github.com/prestodb/presto/blob/master/presto-parser/src/main/antlr4/com/facebook/presto/sql/parser/SqlBase.g4
-
After a few recent events, opening a Linux terminal in public places is a big no-no
export MVNW_VERBOSE=true git clone https://github.com/prestodb/presto.git cd presto bash ./mvnw clean install
- presto: The official home of the Presto distributed SQL query engine for big data
- Compile the Minecraft Server (Java Edition) to Native with GraalVM Native Image
-
What are y'all learning right now?
more specifically, recently started learning about Presto [paper], and have been diving deeper into [source] code.
-
DeWitt Clause, or Can You Benchmark %DATABASE% and Get Away With It
Presto
- Let's write a compiler, part 5: A code generator
Elasticsearch
-
Elasticsearch Version 9
You could check out their GitHub and see what is going on https://github.com/elastic/elasticsearch/issues
- One .gitignore to rule them all
-
Who's hiring developer advocates? (October 2023)
Link to GitHub -->
-
Do we think about vector dbs wrong?
I believe the 1024 limit has been upped in recent versions of Elasticsearch
https://github.com/elastic/elasticsearch/issues/92458
-
Elasticsearch VS openobserve - a user suggested alternative
2 projects | 30 Aug 2023
- A dedicated Elasticsearch query language (ES|QL)
- Fleet datastreams: custom index templates
-
Integrating Elasticsearch with Node.js Applications
Elasticsearch is written in Java and its source code is available on Github.
-
Murmur3 hash plugin for nested objects?
I don't think the murmur3 hash implementation has changed since it was added as the default in version 2.0 (see the [changes](https://github.com/elastic/elasticsearch/commits/main/server/src/main/java/org/elasticsearch/cluster/routing/Murmur3HashFunction.java)). The plugin itself has seen [more changes](https://github.com/elastic/elasticsearch/commits/main/plugins/mapper-murmur3) but that's IMO because of internals and not visible changes in the calculations.
-
Mongo or Mysql for 10tb of JSON documents, I'm questioning my previous choice.
Mysql is not as open source as postgres (long story). And you can see how open elasticsearch is by just having access to the bugs database https://github.com/elastic/elasticsearch/issue
What are some alternatives?
Trino - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
OpenSearch - 🔎 Open source distributed and RESTful search engine.
Apache Phoenix - Apache Phoenix
Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]
Apache Calcite - Apache Calcite
bleve - A modern text/numeric/geo-spatial/vector indexing library for go
HikariCP - 光 HikariCP・A solid, high-performance, JDBC connection pool at last.
pgvector - Open-source vector similarity search for Postgres
jOOQ - jOOQ is the best way to write SQL in Java
Whoosh
Spring Data JPA - Simplifies the development of creating a JPA-based data access layer.
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow