- semantic-search-through-wikipedia-with-weaviate
  Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine. Status: discontinued.
- Weaviate
  Weaviate is an open-source vector database that stores both objects and vectors, so you can combine vector search with structured filtering while keeping the fault tolerance and scalability of a cloud-native database.
Although the Wikipedia demo dataset does not have NER enabled, you can play around with the interface. You can create a custom setup for NER using this configurator. Good luck!
We see some users using the Weaviate vector search engine for this. You can store the crawled web pages in Weaviate and use the sentence embedding and NER modules to distill the information you need from the individual pages.
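As a rough illustration of the idea above, here is a minimal sketch of a GraphQL query that combines a semantic `nearText` search with named-entity extraction on stored pages. The class name `WebPage`, its `url` and `content` properties, and the helper `build_page_query` are hypothetical names made up for this example. It also assumes a Weaviate instance with a text vectorizer module and the NER module enabled, and that the NER module exposes its results under `_additional { tokens(...) }`; check the module documentation for your version before relying on this shape.

```python
import json


def build_page_query(concept: str, limit: int = 3) -> str:
    """Build a GraphQL query: semantic search plus NER tokens per page.

    `WebPage`, `url`, and `content` are hypothetical schema names.
    """
    # json.dumps produces a double-quoted, escaped literal, which also
    # works as a GraphQL string for ordinary text.
    concept_literal = json.dumps(concept)
    return f"""
{{
  Get {{
    WebPage(nearText: {{concepts: [{concept_literal}]}}, limit: {limit}) {{
      url
      content
      _additional {{
        tokens(properties: ["content"], limit: 10, certainty: 0.7) {{
          entity
          word
          property
          certainty
          startPosition
          endPosition
        }}
      }}
    }}
  }}
}}
""".strip()


if __name__ == "__main__":
    # The JSON body you would POST to the instance's GraphQL endpoint.
    payload = json.dumps({"query": build_page_query("restaurant opening hours")})
    print(payload)
```

The sketch only builds the request body and does not contact a server, so it runs without a Weaviate installation. Sending it is a matter of POSTing the payload to the `/v1/graphql` endpoint of your own instance.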
Related posts
- GOMEMLIMIT (Go 1.19) is a game-changer for high-memory applications
- [D] Mining restaurant reviews for vibe
- Best engine for semantic search?
- What are some available tools for multilingual emotion analysis (also question about LIWC)?
- [P] Effects of Metadata filtering with HNSW on Recall and Query time