Top 21 Vectorization Open-Source Projects
-
anything-llm
A multi-user ChatGPT for any LLM and vector database. Unlimited documents, messages, and storage in one privacy-focused app. Now available as a desktop application with a built-in LLM!
Project mention: Ask HN: How do I train a custom LLM/ChatGPT on my own documents in Dec 2023? | news.ycombinator.com | 2023-12-24
anything-llm looks pretty interesting and easy to use https://github.com/Mintplex-Labs/anything-llm
-
I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
-
xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
https://github.com/xtensor-stack/xsimd
GH topics > HashMap:
-
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
-
doc-chatbot
Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.
Project mention: Are there any good free GPT-powered AI summarizer for very long text? | /r/ChatGPTPro | 2023-06-20
-
I'm the main author of Highway, so I have some opinions :D Number of operations/platforms supported are important criteria.
A hopefully unbiased commentary:
Simde allows you to take existing nonportable intrinsics and get them to run on another platform. This is useful when you have a bunch of existing code and tight deadlines. The downside is less than optimal performance - a portable abstraction can be more efficient than forcing one platform to exactly match the semantics of another. Although a ton of effort has gone into Simde, sometimes it also resorts to autovectorization which may or may not work.
Eigen and SLEEF are mostly math-focused projects that also have a portability layer. SLEEF is designed for C and thus has type suffixes which are rather verbose, see https://github.com/shibatch/sleef/blob/master/src/libm/sleef... But it offers a complete (more so than Highway's) libm.
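To make the Simde trade-off described above concrete, here is a minimal sketch of the emulation idea in plain C++. It is not SIMDe's actual implementation: the names `f32x4`, `emulated_add_ps` and `emulated_movemask_ps` are hypothetical, and the only assumption borrowed from x86 is the documented behaviour of `_mm_add_ps` (lane-wise add) and `_mm_movemask_ps` (collect the four sign bits into an integer). The add is a plain loop that relies on the compiler's autovectorizer, and the movemask has to be reproduced bit for bit even on targets that have no single instruction for it.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical 4-lane float vector standing in for a 128-bit register.
struct f32x4 {
    float lane[4];
};

// Emulates the lane-wise add of x86 `_mm_add_ps` with a scalar loop.
// Whether this compiles to one SIMD instruction is up to the autovectorizer.
inline f32x4 emulated_add_ps(const f32x4& a, const f32x4& b) {
    f32x4 r;
    for (int i = 0; i < 4; ++i) {
        r.lane[i] = a.lane[i] + b.lane[i];
    }
    return r;
}

// Emulates `_mm_movemask_ps`: packs the sign bit of lane i into bit i.
// Matching these exact semantics can take several instructions on a
// platform without a native equivalent.
inline int emulated_movemask_ps(const f32x4& a) {
    int mask = 0;
    for (int i = 0; i < 4; ++i) {
        std::uint32_t bits;
        std::memcpy(&bits, &a.lane[i], sizeof bits);  // safe type punning
        mask |= static_cast<int>(bits >> 31) << i;
    }
    return mask;
}
```

A portable abstraction is free to expose a higher-level operation instead (for example "is any lane negative?"), which each platform can implement in whatever way is cheapest for it.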
-
embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
-
In VecCore (a small C++ SIMD abstraction library on top of Vc and std::simd), I created some simple examples to show how to use the library to optimize code using SIMD in a somewhat generic way. You can find it on GitHub at https://github.com/root-project/veccore
I have examples for Julia sets and the Mandelbrot set, including an implementation with AVX2 intrinsics.
These days with std::simd more widely available there's less of a reason to use VecCore, but the examples may still be educational enough.
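As a rough illustration of what a SIMD-style Mandelbrot kernel looks like, the sketch below processes a fixed number of points in lockstep and uses a per-lane mask instead of an early exit. It does not use VecCore, Vc or std::simd, and it is not taken from the VecCore examples; `kLanes` and `mandelbrot_lanes` are hypothetical names, and the lane width of 8 is an assumption (8 floats fit in one AVX2 register).

```cpp
#include <array>
#include <cstddef>

constexpr std::size_t kLanes = 8;  // assumed lane count, e.g. 8 floats per AVX2 register

// Returns, for each lane i, how many iterations of z = z*z + c stayed within
// |z|^2 <= 4, where c = cr[i] + ci[i]*i. All lanes advance together.
inline std::array<int, kLanes> mandelbrot_lanes(const std::array<float, kLanes>& cr,
                                                const std::array<float, kLanes>& ci,
                                                int max_iter) {
    std::array<float, kLanes> zr{}, zi{};
    std::array<int, kLanes> count{};
    for (int it = 0; it < max_iter; ++it) {
        bool any_active = false;
        for (std::size_t i = 0; i < kLanes; ++i) {
            const float r2 = zr[i] * zr[i];
            const float i2 = zi[i] * zi[i];
            const bool active = (r2 + i2) <= 4.0f;  // per-lane mask
            const float new_zr = r2 - i2 + cr[i];
            const float new_zi = 2.0f * zr[i] * zi[i] + ci[i];
            // Masked update: escaped lanes keep their old values.
            zr[i] = active ? new_zr : zr[i];
            zi[i] = active ? new_zi : zi[i];
            count[i] += active ? 1 : 0;
            any_active |= active;
        }
        if (!any_active) break;  // stop once every lane has escaped
    }
    return count;
}
```

The inner loop has a fixed trip count and no data-dependent branches, which is the shape that both autovectorizers and explicit SIMD types handle well; with a SIMD library the inner loop would typically be replaced by vector and mask types.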
-
Image2CAD
An application to translate a raster image of a CAD drawing sheet to a user-editable DXF format.
-
VectorizedKernel
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
Vectorization related posts
- Show HN: Retrieve Stock Data with KlongPy
- Show HN: KlongPy now supports ticker plants
- How to reduce costs on embeddings up to 70%
- Open-source AI /LLM Embedding Preprocessing Editor
- Open-source AI /LLM Embedding Pre-processing Editor.
- AI Embedding Pre-processing Editor.
- Show HN: Open-Source AI Embedding Pre-Processing Editor
Index
What are some of the best open-source Vectorization projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | anything-llm | 9,669 |
2 | simde | 2,127 |
3 | xsimd | 2,008 |
4 | Vc | 1,405 |
5 | neanderthal | 1,042 |
6 | incubator-gluten | 949 |
7 | doc-chatbot | 760 |
8 | sleef | 577 |
9 | autotrace | 462 |
10 | gerbolyze | 335 |
11 | KuiBaDB | 311 |
12 | Hybridizer | 228 |
13 | embedditor | 202 |
14 | veccore | 79 |
15 | Image2CAD | 76 |
16 | frovedis | 64 |
17 | HLML | 24 |
18 | ThatsALotOfDMG | 22 |
19 | std_find_simd | 18 |
20 | qFib | 10 |
21 | VectorizedKernel | 7 |