duckdf vs Typesense

duckdf

🦆 SQL for R dataframes, with ducks (by phillc73)

Typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences (by typesense)



Project	duckdf	Typesense
Mentions	3	129
Stars	41	17,876
Growth	-	4.4%
Activity	0.0	9.8
Latest Commit	4 months ago	9 days ago
Language	R	C++
License	GNU General Public License v3.0 only	GNU General Public License v3.0 only

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
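
The two derived metrics above can be illustrated with a little arithmetic. This is only a sketch: the function names and the linear score-to-percentile mapping are assumptions made for illustration, since the page does not state the exact formulas behind Growth or Activity.

```python
def month_over_month_growth(stars_now: int, stars_last_month: int) -> float:
    """Return star growth over one month, as a percentage."""
    if stars_last_month <= 0:
        raise ValueError("stars_last_month must be positive")
    return (stars_now - stars_last_month) * 100 / stars_last_month


def top_percent_from_activity(activity: float) -> float:
    """Map an activity score to a 'top X%' bracket.

    Assumption: scores run from 0 to 10 and map linearly to percentiles,
    which matches the stated example (9.0 -> top 10%) but is not confirmed.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity must be between 0 and 10")
    return (10.0 - activity) * 10.0
```

Under that assumed mapping, an activity of 9.8 would place a project in roughly the top 2% of tracked projects.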

duckdf

Posts with mentions or reviews of duckdf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-10.

DuckDB – in-process SQL OLAP database management system
4 projects | news.ycombinator.com | 10 Feb 2023

Quite a while ago, when duckdb was just a duckling, I wrote an R package that supported direct manipulation of R dataframes using SQL.[1] duckdb was the engine for this.
The approach was never as fast as data.table but did approach the speed of dplyr for more complex queries.
Life had other things in store for me and I haven’t touched this library for a while now.
At the time there was no Julia connector for duckdb, but now that there is, I’d like to try this approach in that language.
[1] https://github.com/phillc73/duckdf

ClickHouse as an alternative to Elasticsearch for log storage and analysis
13 projects | news.ycombinator.com | 2 Mar 2021

Yeah, I agree sqldf is quite slow. Fair point.
As you've seen, duckdb registers an "R data frame as a virtual table." I'm not sure what they mean by "yet" either.
Of course it is possible to write an R dataframe to an on-disk duckdb table, if that's what you want to do.
There are some simple benchmarks on the bottom of the duckdf README[1]. Essentially I found for basic SQL SELECT queries, dplyr is quicker, but for much more complex queries, the duckdf/duckdb combination performs better.
If you really want speed of course, just use data.table.
[1] https://github.com/phillc73/duckdf
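
The idea discussed in this post, running SQL directly against an in-memory data frame, can be sketched in Python. This is not duckdf's code: it swaps in the standard library's sqlite3 for DuckDB (much as sqldf uses SQLite by default) and a list of dicts for the data frame, and the helper name `query_rows` is hypothetical. Unlike DuckDB's virtual-table registration, this version copies the rows into the database before querying.

```python
import sqlite3


def query_rows(rows, sql, table="df"):
    """Copy a list of dict rows into an in-memory SQLite table and run `sql`."""
    if not rows:
        raise ValueError("rows must not be empty")
    columns = list(rows[0].keys())
    conn = sqlite3.connect(":memory:")
    conn.row_factory = sqlite3.Row  # lets each result row convert to a dict
    try:
        col_defs = ", ".join(f'"{c}"' for c in columns)
        conn.execute(f'CREATE TABLE "{table}" ({col_defs})')
        placeholders = ", ".join("?" for _ in columns)
        conn.executemany(
            f'INSERT INTO "{table}" VALUES ({placeholders})',
            [tuple(row[c] for c in columns) for row in rows],
        )
        return [dict(r) for r in conn.execute(sql).fetchall()]
    finally:
        conn.close()


# Made-up sample data standing in for a data frame.
cars = [
    {"model": "a", "cyl": 4, "mpg": 30.0},
    {"model": "b", "cyl": 4, "mpg": 26.0},
    {"model": "c", "cyl": 8, "mpg": 15.0},
]
result = query_rows(
    cars, "SELECT cyl, AVG(mpg) AS avg_mpg FROM df GROUP BY cyl ORDER BY cyl"
)
print(result)
```

The copy step is the main cost of this approach, which is one reason registering the data frame as a virtual table, as the post describes for duckdb, can be attractive.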

Julia 1.6: what has changed since Julia 1.0?
9 projects | news.ycombinator.com | 14 Feb 2021

That's a really good point that I'd not really thought about. I'd never really considered the difference between calling just functions versus macros.
Thinking about Query.jl and DataFramesMeta.jl, and I am for sure not an expert in either, I can't specifically speak to your `head` example, but other base functions can be combined with macros. For example, see the LINQ examples from DataFramesMeta.jl[1] where `mean` is being used. Or again the LINQ style examples in Query.jl[2], where `descending` is used in the first example, or `length` later in the Grouping examples.
Is that the kind of thing you meant?
For whatever reason, with the way my brain is wired, the LINQ style of query just works for me. I have never directly used LINQ, but do have some SQL experience. In fact, I wrote some dinky little wrapper functions[3] around duckdb[4] so I could directly query R dataframes and datatables with SQL using that backend, rather than sqldf[5].
[1] https://juliadata.github.io/DataFramesMeta.jl/stable/#@linq-...
[2] https://www.queryverse.org/Query.jl/stable/linqquerycommands...
[3] https://github.com/phillc73/duckdf
[4] https://duckdb.org/
[5] https://cran.r-project.org/web/packages/sqldf/index.html

Typesense

Posts with mentions or reviews of Typesense. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-26.

Website Search Hurts My Feelings
2 projects | news.ycombinator.com | 26 Dec 2023

There are actually plenty of non-ES products that are way easier to integrate and tune (and get better results with less effort).
- Typesense (https://github.com/typesense/typesense)
- Algolia
- Google Programmable Search Engine (https://programmablesearchengine.google.com/about/)

Remote Machine Learning and Searching on a Raspberry Pi 5
2 projects | /r/immich | 11 Dec 2023

Open Source alternatives to tools you Pay for
21 projects | dev.to | 8 Dec 2023

Typesense - Open Source Alternative to Algolia

DNS record "hn.algolia.com" is gone
3 projects | news.ycombinator.com | 9 Oct 2023

If you like your pennies, take a look at Typesense https://typesense.org/ - nothing to complain about here, especially not about pricing.

Vector databases: analyzing the trade-offs
5 projects | news.ycombinator.com | 20 Aug 2023

I work on Typesense [1] (historically considered an open source alternative to Algolia).
We launched vector search in Jan 2023, and just last week we launched the ability to generate embeddings from within Typesense.
You just need to send JSON data, and Typesense can generate embeddings for it using OpenAI, the PaLM API, or built-in models such as S-BERT and E-5 (running on a GPU if you prefer). [2]
You can then do a hybrid (keyword + semantic) search by just sending the search keywords to Typesense. Typesense will automatically generate embeddings for you internally and return a ranked list of keyword results interleaved with semantic results (using Rank Fusion).
You can also combine filtering, faceting, typo tolerance, etc - the things Typesense already had.
[1] https://github.com/typesense/typesense
[2] https://typesense.org/docs/0.25.0/api/vector-search.html
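
To make the "Rank Fusion" step in this post more concrete, here is a minimal sketch of reciprocal rank fusion (RRF), one common way to merge a keyword ranking with a semantic ranking. It is an assumption that this resembles what Typesense does internally; the post only says "Rank Fusion". The function name, the document ids, and the constant k=60 (the value conventionally used for RRF) are illustrative.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document ids into one ranked list.

    Each document scores 1 / (k + rank) for every list it appears in,
    so documents ranked highly by several lists rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score; break ties by id for a stable result.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword search and a vector (semantic) search.
keyword_hits = ["doc1", "doc2", "doc3"]
semantic_hits = ["doc3", "doc1", "doc4"]
fused = reciprocal_rank_fusion([keyword_hits, semantic_hits])
print(fused)
```

Because RRF only looks at ranks, it needs no calibration between keyword relevance scores and vector distances, which is part of why it is a popular choice for hybrid search.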

Creating an advanced search engine with PostgreSQL
9 projects | news.ycombinator.com | 12 Jul 2023

For something small with a minimal footprint, I'd recommend Typesense. https://github.com/typesense/typesense

Obsidian Publish full text search
1 project | /r/ObsidianMD | 28 Jun 2023

I haven’t used Publish, but I’d assume you could use something like https://typesense.org/ to index and search the vault.

DynamoDB search options
1 project | /r/aws | 18 May 2023

A cheaper option would be to use https://typesense.org. You can use DynamoDB Streams to automatically load records. It has worked well for me.
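
As a rough sketch of the loading step this post mentions, the function below turns a DynamoDB Streams event into a list of index actions. It assumes the stream is configured to include new item images and that the table's key attribute is called `id`; both function names are hypothetical, and the actual write to the search index (for example through a Typesense client) is deliberately left out.

```python
def from_attribute_value(av):
    """Convert one DynamoDB-typed attribute value, e.g. {"S": "x"}, to plain Python."""
    ((type_tag, value),) = av.items()
    if type_tag == "S":
        return value
    if type_tag == "N":  # numbers arrive as strings
        try:
            return int(value)
        except ValueError:
            return float(value)
    if type_tag == "BOOL":
        return value
    if type_tag == "NULL":
        return None
    if type_tag == "L":
        return [from_attribute_value(v) for v in value]
    if type_tag == "M":
        return {k: from_attribute_value(v) for k, v in value.items()}
    raise ValueError(f"unsupported attribute type: {type_tag}")


def stream_records_to_actions(event, key_name="id"):
    """Map stream records to ("upsert", document) or ("delete", id) actions."""
    actions = []
    for record in event.get("Records", []):
        data = record["dynamodb"]
        if record["eventName"] in ("INSERT", "MODIFY"):
            doc = {k: from_attribute_value(v) for k, v in data["NewImage"].items()}
            doc["id"] = str(doc[key_name])  # assumed: the index wants string ids
            actions.append(("upsert", doc))
        elif record["eventName"] == "REMOVE":
            key = from_attribute_value(data["Keys"][key_name])
            actions.append(("delete", str(key)))
    return actions
```

A Lambda function subscribed to the stream could call `stream_records_to_actions(event)` and then apply each action to the search index with whatever client it uses.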

[Guide] A Tour Through the Python Framework Galaxy: Discovering the Stars
14 projects | /r/coder_corner | 29 Apr 2023

Try tigris | typesense for faster search

Is it worth using Postgres' builtin full-text search or should I go straight to Elastic?
2 projects | /r/PostgreSQL | 25 Apr 2023

I’m also checking out Typesense as a possibility for replacing Elastic: https://typesense.org/

What are some alternatives?

When comparing duckdf and Typesense you can also consider the following projects:

tidyquery - Query R data frames with SQL

MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

julia - The Julia Programming Language

Elasticsearch - Free and Open, Distributed, RESTful Search Engine

loki - Like Prometheus, but for logs.

Apache Solr - Apache Lucene and Solr open-source search software

Makie.jl - Interactive data visualizations and plotting in Julia

meilisearch-laravel-scout - MeiliSearch integration for Laravel Scout

meilisearch-js-plugins - The search client to use Meilisearch with InstantSearch.

sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
