Named entity recognition extraction from website

This page summarizes the projects mentioned and recommended in the original post on /r/LanguageTechnology

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • semantic-search-through-wikipedia-with-weaviate

    Discontinued Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine

  • Although the Wikipedia demo dataset does not have NER enabled, you can play around with the interface. You can create a custom setup for NER using this configurator. Good luck!

  • Weaviate

    Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

  • We see some users using the Weaviate vector search engine for this. You can store the crawled web pages in Weaviate and use the sentence embedding and NER modules to distill the information you need from the individual pages.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • GOMEMLIMIT (Go 1.19) is a game-changer for high-memory applications

    2 projects | /r/golang | 17 Aug 2022
  • [D] Mining restaurant reviews for vibe

    1 project | /r/MachineLearning | 20 May 2022
  • Best engine for semantic search?

    1 project | /r/OpenAI | 19 Dec 2021
  • What are some available tools for multilingual emotion analysis (also question about LIWC)?

    2 projects | /r/LanguageTechnology | 16 Dec 2021
  • [P] Effects of Metadata filtering with HNSW on Recall and Query time

    1 project | /r/MachineLearning | 21 Oct 2021