Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 21 Vectorization Open-Source Projects
-
anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
-
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
-
doc-chatbot
Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
-
Image2CAD
An application to translate raster image of CAD drawing sheet to a user editable DXF format.
-
VectorizedKernel
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: AnythingLLM: Chat with your documents using any LLM | news.ycombinator.com | 2024-04-19
I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef
https://github.com/xtensor-stack/xsimd
GH topics > HashMap:
Project mention: Are there any good free GPT-powered AI summarizer for very long text? | /r/ChatGPTPro | 2023-06-20
I'm the main author of Highway, so I have some opinions :D Number of operations/platforms supported are important criteria.
A hopefully unbiased commentary:
Simde allows you to take existing nonportable intrinsics and get them to run on another platform. This is useful when you have a bunch of existing code and tight deadlines. The downside is less than optimal performance - a portable abstraction can be more efficient than forcing one platform to exactly match the semantics of another. Although a ton of effort has gone into Simde, sometimes it also resorts to autovectorization which may or may not work.
Eigen and SLEEF are mostly math-focused projects that also have a portability layer. SLEEF is designed for C and thus has type suffixes which are rather verbose, see https://github.com/shibatch/sleef/blob/master/src/libm/sleef... But it offers a complete (more so than Highway's) libm.
In VecCore (a small C++ SIMD abstraction library on top of Vc and std::simd), I created some simple examples to show how to use the library to optimize code using SIMD in a somewhat generic way. You can find it on GitHub at https://github.com/root-project/veccore
I have examples for Julia sets and the Mandelbrot set, including an implementation with AVX2 intrinsics.
These days with std::simd more widely available there's less of a reason to use VecCore, but the examples may still be educational enough.
Vectorization related posts
- Show HN: Retrieve Stock Data with KlongPy
- Show HN: KlongPy now supports ticker plants
- How to reduce costs on embeddings up to 70%
- Open-source AI /LLM Embedding Preprocessing Editor
- Open-source AI /LLM Embedding Pre-processing Editor.
- AI Embedding Pre-processing Editor.
- Show HN: Open-Source AI Embedding Pre-Processing Editor
-
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Apr 2024
Index
What are some of the best open-source Vectorization projects? This list will help you:
Project | Stars | |
---|---|---|
1 | anything-llm | 11,955 |
2 | simde | 2,171 |
3 | xsimd | 2,036 |
4 | Vc | 1,418 |
5 | neanderthal | 1,043 |
6 | incubator-gluten | 984 |
7 | doc-chatbot | 772 |
8 | sleef | 589 |
9 | autotrace | 478 |
10 | gerbolyze | 340 |
11 | KuiBaDB | 311 |
12 | Hybridizer | 230 |
13 | embedditor | 206 |
14 | veccore | 79 |
15 | Image2CAD | 76 |
16 | frovedis | 64 |
17 | HLML | 24 |
18 | ThatsALotOfDMG | 22 |
19 | std_find_simd | 18 |
20 | qFib | 10 |
21 | VectorizedKernel | 7 |
Sponsored