seatunnel
Milvus
 | seatunnel | Milvus
---|---|---
Mentions | 31 | 109
Stars | 7,524 | 27,747
Growth | 3.8% | 2.8%
Activity | 9.8 | 10.0
Last commit | about 16 hours ago | about 17 hours ago
Language | Java | Go
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
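The exact formulas behind these numbers are not given here, so the following is only a rough sketch of how such metrics could be computed. The exponential decay, the 30-day half-life, and the function names are all assumptions made for illustration; only the "recent commits weigh more" idea, the "9.0 means top 10%" reading, and the month-over-month definition of growth come from the text above.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights; a commit loses half its weight every half_life_days.

    The decay shape and half-life are assumptions, not the site's formula.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity(score, all_scores):
    """Map a raw score to 0..10 by the share of tracked projects scoring lower."""
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # number of projects strictly below
    return round(10.0 * below / len(ranked), 1)


def monthly_growth(stars_now, stars_month_ago):
    """Month-over-month growth in stars, as a percentage."""
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago
```

Under this reading, a project whose raw score beats 90 out of 100 tracked projects gets an activity of 9.0, which matches the "top 10%" interpretation.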
seatunnel
- SeaTunnel – super high-performance, distributed data integration tool
- Apache SeaTunnel: Next-generation high-performance, distributed integration tool
- FLaNK Weekly 31 December 2023
- Five Apache projects you probably didn't know about
  Apache SeaTunnel is a data integration platform that offers the three pillars of data pipelines: sources, transforms, and sinks. It offers an abstract API over three possible engines: the Zeta engine from SeaTunnel or a wrapper around Apache Spark or Apache Flink. Be careful, as each engine comes with its own set of features.
- SymmetricDS: Open-Source, cross platform database replication software
  Looks that way. There is another project that does similar things, Apache SeaTunnel: https://seatunnel.apache.org/
- Breakthrough in the book search field! Use Apache SeaTunnel to improve the efficiency of book title similarity search
- Questions Regarding design DW
  https://seatunnel.apache.org/ might be overkill, though...
- SeaTunnel Zeta engine, the first choice for massive data synchronization, is officially released!
  See the specific change log: https://github.com/apache/incubator-seatunnel/releases/tag/2.3.0
- The Ultimate Beginner’s Guide to Open Source Contribution
  Apache SeaTunnel (Incubating): SeaTunnel is a very easy-to-use, ultra-high-performance distributed data integration platform that supports real-time synchronization of massive data. It can stably and efficiently synchronize tens of billions of data records every day, and has been used in production at nearly 100 companies. Official website: https://seatunnel.apache.org/ GitHub project: https://github.com/apache/incubator-seatunnel
- Major Release! SeaTunnel 2.3.0-beta supports the self-innovate SeaTunnel Engine and more connectors!
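One of the excerpts above describes SeaTunnel pipelines as sources, transforms, and sinks. As a minimal sketch of that pattern, and not of SeaTunnel's actual API or configuration format, the three stages can be modelled as chained Python generators. All function names here are hypothetical.

```python
from typing import Callable, Iterable, Iterator

Row = dict


def source(rows: Iterable[Row]) -> Iterator[Row]:
    """Source: yield rows from an upstream system (here, any in-memory iterable)."""
    yield from rows


def transform(rows: Iterable[Row], fn: Callable[[Row], Row]) -> Iterator[Row]:
    """Transform: apply fn to each row as it streams through."""
    for row in rows:
        yield fn(row)


def sink(rows: Iterable[Row], target: list) -> int:
    """Sink: write rows to a destination (here, a list) and return the row count."""
    count = 0
    for row in rows:
        target.append(row)
        count += 1
    return count


def run_pipeline(rows: Iterable[Row], fn: Callable[[Row], Row], target: list) -> int:
    """Wire source -> transform -> sink; rows are processed lazily, one at a time."""
    return sink(transform(source(rows), fn), target)
```

For example, `run_pipeline([{"title": " Dune "}], lambda r: {**r, "title": r["title"].strip()}, out)` appends the cleaned row to `out`. A real engine adds what this sketch leaves out, such as parallelism, checkpointing, and connectors.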
Milvus
- Recapping the AI, Machine Learning and Data Science Meetup - May 30, 2024
  Milvus open source vector database
- Using Milvus-Lite Now
  If you saw my recent newsletter, you can see I joined Zilliz to work on the Open Source AI Database, Milvus.
- AIM Weekly 27 May 2024
  🎥 Playlist: Unstructured Data Meetup https://www.meetup.com/unstructured-data-bay-area/events/ 🖥️ Website: https://www.youtube.com/@MilvusVectorDatabase/videos X (Twitter): https://x.com/milvusio 🔗 LinkedIn: https://www.linkedin.com/company/zilliz/ 😺 GitHub: https://github.com/milvus-io/milvus 🦾 Discord invitation: https://discord.com/invite/FjCMmaJng6
- FLaNK-AIM: 20 May 2024 Weekly
- Computer Vision Meetup: Develop a Legal Search Application from Scratch using Milvus and DSPy!
  Legal practitioners often need to find specific cases and clauses across thousands of dense documents. While traditional keyword-based search techniques are useful, they fail to fully capture the semantic content of queries and case files. Vector search engines and large language models provide an intriguing alternative. In this talk, I will show you how to build a legal search application using the DSPy framework and the Milvus vector search engine.
- Ask HN: Who is hiring? (April 2024)
  Zilliz (zilliz.com) | Hybrid/ONSITE (SF, NYC) | Full-time
  I am part of the hiring team for DevRel
  NYC - https://boards.greenhouse.io/zilliz/jobs/4307910005
  SF - https://boards.greenhouse.io/zilliz/jobs/4317590005
  Zilliz is the company behind Milvus (https://github.com/milvus-io/milvus), the most starred vector database on GitHub. Milvus is a distributed vector database that shines in 1B+ vector use cases. Examples include autonomous driving, e-commerce, and drug discovery. (and, of course, RAG)
  We are also hiring for other roles where I am not personally involved in the hiring process, such as product managers, software engineers, and recruiters.
- Unlock Advanced Search Capabilities with Milvus and Read about RAG
  Get started with Milvus on GitHub.
- Milvus VS pgvecto.rs - a user suggested alternative
  2 projects | 13 Mar 2024
- How to choose the right type of database
  Milvus: An open-source vector database designed for AI and ML applications. It excels in handling large-scale vector similarity searches, making it suitable for recommendation systems, image and video retrieval, and natural language processing tasks.
- Simplifying the Milvus Selection Process
  Selecting the right version of open-source Milvus is important to the success of any project leveraging vector search technology. With Milvus offering different versions of its vector database tailored to varying requirements, understanding the significance of selecting the correct version is key for achieving desired outcomes.
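Several of the excerpts above refer to vector similarity search. To make the idea concrete, here is a minimal brute-force sketch in plain Python. It is not Milvus code and does not use the Milvus client; the function names and the choice of cosine similarity are assumptions for illustration. A vector database would typically replace the linear scan below with an index so that searches stay fast at very large scale.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=3):
    """Return the keys of the k stored vectors most similar to query.

    vectors maps a key (e.g. a document id) to its embedding.
    This is an exhaustive scan: every stored vector is compared to the query.
    """
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [key for _, key in scored[:k]]
```

With stored vectors `{"a": [1, 0], "b": [0, 1], "c": [1, 1]}` and the query `[1, 0.1]`, `top_k(..., k=2)` ranks `"a"` first and `"c"` second, because the query points almost exactly along `"a"`.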
What are some alternatives?
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
pgvector - Open-source vector similarity search for Postgres
kestra - Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
faiss - A library for efficient similarity search and clustering of dense vectors.
Leetcode - Solutions to LeetCode problems; updated daily. Subscribe to my YouTube channel for more.
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
hudi - Upserts, Deletes And Incremental Processing on Big Data.
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
com.openai.unity - A Non-Official OpenAI Rest Client for Unity (UPM)
Elasticsearch - Free and Open, Distributed, RESTful Search Engine
Apache Hive - Apache Hive
Face Recognition - The world's simplest facial recognition api for Python and the command line