| | proton | haystack |
|---|---|---|
| Mentions | 10 | 55 |
| Stars | 1,293 | 13,711 |
| Growth | 3.8% | 3.1% |
| Activity | 9.7 | 9.9 |
| Last commit | 6 days ago | 6 days ago |
| Language | C++ | Python |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
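As a rough illustration of the derived metrics described above, the sketch below computes month-over-month star growth, a recency-weighted commit score, and a 0-10 score derived from a percentile rank. The tracker's actual formulas are not given here: the exponential decay, the 30-day half-life, the rank-to-score mapping, and all function names are assumptions chosen only to make the idea concrete.

```python
from datetime import date


def month_over_month_growth(stars_last_month, stars_now):
    """Return star growth over one month as a percentage."""
    if stars_last_month <= 0:
        raise ValueError("stars_last_month must be positive")
    return (stars_now - stars_last_month) / stars_last_month * 100


def recency_weighted_commits(commit_dates, today, half_life_days=30.0):
    """Sum commit weights, halving a commit's weight every half_life_days.

    Assumption: exponential decay is just one simple way to make recent
    commits count more than older ones; it is not a published formula.
    """
    score = 0.0
    for commit_day in commit_dates:
        age_days = (today - commit_day).days
        if age_days < 0:
            continue  # ignore commits dated in the future
        score += 0.5 ** (age_days / half_life_days)
    return score


def activity_from_rank(score, all_scores):
    """Map a raw score to 0-10 by its percentile rank among tracked projects.

    Assumption: a value of 9.0 then means 90% of projects score at or
    below this one, i.e. the project sits in the top 10%.
    """
    at_or_below = sum(1 for s in all_scores if s <= score)
    return 10 * at_or_below / len(all_scores)
```

With these hypothetical definitions, a project that grew from 1,000 to 1,038 stars in a month shows roughly 3.8% growth, and a commit made today counts twice as much as one made 30 days ago.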
proton
- FLaNK-AIM Weekly 06 May 2024
- Loading a trillion rows of weather data into TimescaleDB
  What's the process for adding support for other databases to your tool qStudio?
  I'm thinking perhaps you could add support for Timeplus [1]? Timeplus is a streaming-first database built on ClickHouse. The core DB engine, Timeplus Proton, is open source [2].
  It seems that qStudio is open source [3] and written in Java, so it will need a JDBC driver to add support for a new RDBMS? If so, Timeplus Proton has an open source JDBC driver [4] based on ClickHouse's driver, with modifications added for streaming use cases.
  1: https://www.timeplus.com/
  2: https://github.com/timeplus-io/proton
  3: https://github.com/timeseries/qstudio
  4: https://github.com/timeplus-io/proton-java-driver
- Comparing Timeplus Proton and ksqlDB for stream processing
  * Proton is more developer-friendly
  To explore Proton yourself, visit the [Proton GitHub repo](https://github.com/timeplus-io/proton) or create your own workspace on [Timeplus Cloud](https://timeplus.com).
- FLaNK Stack Weekly 19 Feb 2024
- Proton, a fast and lightweight alternative to Apache Flink
- Proton, extending the historical data, storage, and computing of ClickHouse
- Proton, a unified database for streaming and historical data in a single binary
- First 15 Open Source Advent projects
  5. Proton by Timeplus | GitHub | tutorial
- Timeplus has open-sourced its core streaming processing engine Proton
haystack
- Haystack DB – 10x faster than FAISS with binary embeddings by default
  I was confused for a bit, but there is no relation to https://haystack.deepset.ai/
- Release Radar • March 2024 Edition
- First 15 Open Source Advent projects
  4. Haystack by Deepset | GitHub | tutorial
- Generative AI Frameworks and Tools Every Developer Should Know!
  Haystack can be classified as an end-to-end framework for building applications powered by various NLP technologies, including but not limited to generative AI. While it doesn't directly focus on building generative models from scratch, it provides a robust platform for:
- Best way to programmatically extract data from a set of .pdf files?
  But if you want an API that you can use to develop your own flow, Haystack from Deepset could be worth a look.
- Which LLM framework(s) do you use in production and why?
  Haystack for production. We cannot afford breaking changes in our production apps. It's stable, the documentation is excellent, and did I mention it's STABLE!?
- Overview: AI Assembly Architectures
- Llama2 and Haystack on Colab
  I recently conducted some experiments with Llama2 and Haystack (https://github.com/deepset-ai/haystack), the NLP/LLM framework.
  The notebook can be helpful for those trying to load Llama2 on Colab.
  1) Installed Transformers from the main branch (and other libraries)
- Build with LLMs for production with Haystack – has 10k stars on GitHub
- Show HN: Haystack – Production-Ready LLM Framework
What are some alternatives?
ytsaurus - YTsaurus is a scalable and fault-tolerant open-source big data platform.
langchain - 🦜🔗 Build context-aware reasoning applications
proton-python-driver - Python driver for Proton, which supports the Proton native wire protocol
langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]
OSQuery - SQL powered operating system instrumentation, monitoring, and analytics.
gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
duckdb - DuckDB is an in-process SQL OLAP Database Management System
BentoML - The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
POCO - The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems.
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
ClickHouse - ClickHouse® is a free analytics DBMS for big data
jina - ☁️ Build multimodal AI applications with cloud-native stack