Named entity recognition extraction from website

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/LanguageTechnology

  • semantic-search-through-wikipedia-with-weaviate

    Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine

    Although the Wikipedia demo dataset does not have NER enabled, you can play around with the interface. You can create a custom setup for NER using this configurator. Good luck!

  • Weaviate

    Weaviate is a cloud-native, modular, real-time vector search engine

    Some users rely on the Weaviate vector search engine for this. You can store the crawled web pages in Weaviate and use its sentence embedding and NER modules to distill the information you need from the individual pages.

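As a rough illustration of the Weaviate suggestion above, the sketch below builds a GraphQL query that asks the NER module for entity tokens on stored pages, then groups the returned tokens by entity label. It makes several assumptions that are not in the original post: the class name `Page` and property `content` are hypothetical, the `tokens` field under `_additional` follows the shape documented for Weaviate's `ner-transformers` module, and the response layout in `group_entities` is assumed to mirror a standard `Get` result. Nothing here talks to a live server; sending the query is left to whichever client you use.

```python
import json
from collections import defaultdict


def build_ner_query(class_name: str, prop: str, limit: int = 10,
                    certainty: float = 0.7) -> str:
    """Build a GraphQL query string requesting NER tokens for one property.

    Assumes the `_additional { tokens(...) }` shape of the ner-transformers
    module; `class_name` and `prop` are whatever your own schema defines.
    """
    props = json.dumps([prop])  # renders as ["content"], valid in GraphQL
    return (
        "{ Get { %s { %s _additional { "
        "tokens(properties: %s, limit: %d, certainty: %s) "
        "{ entity property word certainty startPosition endPosition } "
        "} } } }" % (class_name, prop, props, limit, certainty)
    )


def group_entities(response: dict, class_name: str) -> dict:
    """Group recognized words by entity label across all returned objects.

    Assumes the response looks like
    {"data": {"Get": {class_name: [{"_additional": {"tokens": [...]}}]}}}.
    """
    grouped = defaultdict(list)
    for obj in response["data"]["Get"][class_name]:
        for token in obj["_additional"]["tokens"]:
            grouped[token["entity"]].append(token["word"])
    return dict(grouped)


if __name__ == "__main__":
    # "Page" and "content" are hypothetical names for the crawled pages.
    print(build_ner_query("Page", "content"))
```

Grouping by label keeps the post-processing independent of how the pages were crawled or stored, so the same helper can be reused if the schema changes.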

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number means a more popular project.
