semantic-search-through-wikipedia-with-weaviate
sample-apps
Our great sponsors
semantic-search-through-wikipedia-with-weaviate | sample-apps | |
---|---|---|
9 | 3 | |
223 | 281 | |
- | 3.9% | |
3.2 | 9.6 | |
11 months ago | 8 days ago | |
Python | Jupyter Notebook | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
semantic-search-through-wikipedia-with-weaviate
-
Named entity recognition extraction from website
Although the Wikipedia demo dataset does not have NER enabled, you can play around with the interface. You can create a custom setup for NER using this configurator. Good luck!
-
Find anything fast with Google's vector search technology
* Wikipedia demo dataset: https://github.com/semi-technologies/semantic-search-through...
- Semantic search through Wikipedia with the Weaviate vector search engine
-
[D] Are you seeing any compelling use cases of semantic search being leveraged at scale?
Semantic search through Wikipedia with the Weaviate vector search engine
- [P] Semantic search through a vectorized Wikipedia
-
Semantic search through complete EN-language Wikipedia with the Weaviate vector search engine
The source code to run the dataset yourself is completely open on Github
-
Semantic search using GraphQL through the complete EN-Wikipedia
Github
-
[P] Semantic search through Wikipedia with Weaviate and Sentence-BERT transformers
Github: https://github.com/semi-technologies/semantic-search-through-Wikipedia-with-Weaviate
sample-apps
-
[P] I'm building a Neural Search Plugin for Elastic/Opensearch
See this blog post https://blog.vespa.ai/pretrained-transformer-language-models-for-search-part-1/ and the open source sample app it describes: https://github.com/vespa-engine/sample-apps/tree/master/msmarco-ranking
-
Find anything fast with Google's vector search technology
>
Vespa.ai supports combining dense vector search with keyword search and ranking, see https://docs.google.com/presentation/d/1vWKhSvFH-4MFcs4aNa9C...
There is also a Vespa sample application (open source, Apache 2) demonstrating multiple different retrieval and ranking strategies over at https://github.com/vespa-engine/sample-apps/blob/master/msma...
-
What Are Some Open Source NLP Framework Pipelines For QA Task
Look up Vespa.ai. https://github.com/vespa-engine/sample-apps/tree/master/dense-passage-retrieval-with-ann
What are some alternatives?
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
awesome-vector-search - Collections of vector search related libraries, service and research papers
hnswlib - Header-only C++/python library for fast approximate nearest neighbors
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
biggraph-wikidata-search-with-weaviate - Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine
pgvector - Open-source vector similarity search for Postgres
google-research - Google Research