hnsqlite
Coral
hnsqlite | Coral | |
---|---|---|
6 | 10 | |
143 | 1,863 | |
1.4% | 0.1% | |
5.5 | 9.9 | |
10 months ago | 7 days ago | |
Python | TypeScript | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hnsqlite
-
LangChain: The Missing Manual
For anyone thinking about applications of langchain and pinecone but who are looking for something more turn-key check out https://jiggy.ai
The core is actually open source as well, allowing you to take your data back out via sqlite and hnswlib (https://github.com/jiggy-ai/hnsqlite)
-
I built an open source website that lets you upload large files, such as in-depth novels or academic papers, and ask ChatGPT questions based on your specific knowledge base. So far, I've tested it with long books like the Odyssey and random research papers that I like, and it works shockingly well.
We are built on open core https://github.com/jiggy-ai. Our open source hnsqlite is light weight, easy to use. And best of all, we make it easy for you to get your data out of JiggyBase. You can download a sqlite file that contains your document text data, metadata, embedding vectors, and embedding index. This can be used directly in the open source hnsqlite package.
-
What Is a Vector Database
After working through several projects that utilized local hnswlib and different databases for text and vector persistence, I integrated open source hnswlib with sqlite to create an embedded vector search engine that can easily scale up to millions of embeddings. For self-hosted situations of under 10M embeddings and less than insane throughput I think this combo is hard to beat.
https://github.com/jiggy-ai/hnsqlite
- Show HN: Hnsqlite: hnswlib and SQLite integrated for text embedding search
-
Faiss: A library for efficient similarity search
Thanks Leobg!
For anyone else: you pass it directly in metadata see https://github.com/jiggy-ai/hnsqlite/blob/main/test/test_col...
Coral
-
What Is a Vector Database
The Coral Project [0] (commenting platform used on Washington Post, New York Times, The Verge) uses an Apache 2.0 license [1]. Which doesn't seem to have prevented it from raking in big SaaS customers.
A lot of people worry about copy-cat services, but it's kind of rare that someone will be able to compete with you as the original in hosting your own service as well as you can. Especially when you consider support and maintenance requirements of a new product you aren't personally developing.
I could see copy-cat services being more of an issue in the late stage of a product though? When everyone knows lots about how to stand it up and use it?
[0] https://coralproject.net/
-
What's the result of Knight-Mozilla Initiative: Challenge 2 – Beyond Comment Threads
The Coral Project was created inline with this initiative. They have lots of guides that provide some of the research that was conducted: https://coralproject.net/
-
Commento - A Self Hosted Comment System for Websites
For comment system, I choose Coral Project Talk because it could use Akismet and Google Perspective API for reducing spam and harassment. I also need to think about the remove comments when user delete their account (GDPR stuff). Coral Talk has the above functions in the UI.
-
Everything you need to know about Opensource Jamstack
Another great API that could be self-hosted is Coral. It’s a commenting platform where users can leave online comments. It’s received contributions from over 40 people on Github. It has a good-first-issue tag and also offers a contribution guide.
-
Node.js 16 Available Now
Yup! We do a Typescript/Node.js/GraphQL back-end with React/Relay/Typescript on the front end.
https://github.com/coralproject/talk
It's pretty nice having the whole code base share types, syntax, structure, etc.
-
Show HN: I'm working on a open-source, self-host alternative to Disqus
Coral is poorly advertised outside it's ecosystem, but should be considered. https://github.com/coralproject/talk
See https://docs.coralproject.net/coral/v5/integrating/cms/ to get an idea of it's use.
-
I made a student publication @ university & discovered a deep hate for WordPress — so I made my dream publishing platform
Our highest tier comment system is quite powerful, and is based off Coral Talk by Vox. For beginners like yourself, if we allowed users to integrate Disqus on all tiers, would that alleviate your concerns with using Storipress?
-
Caching data on Apollo server
If you need some inspiration, we added support for server caching of responses on Coral: https://github.com/coralproject/talk/blob/develop/src/core/server/app/middleware/graphql/apolloServer.ts#L85-L88
-
Disqus, the Dark Commenting System
I've seen some examples in which people embed Discourse discussions.
There's also Coral (https://github.com/coralproject/talk) which used to be Mozilla + Vox project before Mozilla handed it over to Vox completely, but I have no experience with it.
What are some alternatives?
langchainrb - Build LLM-powered applications in Ruby
Discourse - A platform for community discussion. Free, open, simple.
NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
phpBB - phpBB Development: phpBB is a popular open-source bulletin board written in PHP. This repository also contains the history of version 2.
guidance - A guidance language for controlling large language models. [Moved to: https://github.com/guidance-ai/guidance]
GNU social - GNU social is social communication software for both public and private communications.
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Mastodon - Your self-hosted, globally interconnected microblogging community
GPT4Memory
remark42 - comment engine
raft - RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
commento - A fast, bloat-free comments platform (Github mirror)