information-extraction

Top 23 information-extraction Open-Source Projects

  • PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

  • DeepKE

    [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

  • MITIE

    MITIE: library and tools for information extraction

  • InvoiceNet

    Deep neural network to extract intelligent information from invoice documents.

  • kor

    LLM(😽)

  • Project mention: Pydentic in prompt engineering | /r/LangChain | 2023-11-29

    Check out kor

  • awesome-document-understanding

    A curated list of resources for Document Understanding (DU) topic

  • 007-TheBond

    This Script will help you to gather information about your victim or friend.

  • ontogpt

    LLM-based ontological extraction tools, including SPIRES

  • Project mention: GPT-based ontological extraction tools, including SPIRES | news.ycombinator.com | 2023-07-24
  • ail-framework

    AIL framework - Analysis Information Leak framework

  • Project mention: Ask HN: Show me your half baked project | news.ycombinator.com | 2023-10-12

    First time coming across this, looks very cool! Definitely some ideas there that I'd like to implement for osintbuddy. Another project I'm going to be taking some ideas from is: https://github.com/ail-project/ail-framework - a modular framework to analyse potential information leaks

  • RomBuster

    RomBuster is a router exploitation tool that can disclose a network router's admin password.

  • medaCy

    :hospital: Medical Text Mining and Information Extraction with spaCy

  • MedCAT

    Medical Concept Annotation Tool

  • HugNLP

    CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 (by HugAILab)

  • awesome-bioie

    🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)

  • Project mention: Snomed CT Entity Linking Challenge | news.ycombinator.com | 2023-12-22

    > The objective of this competition is to link spans of text in clinical notes with specific topics in the SNOMED CT clinical terminology. Participants will train models based on real-world doctor's notes which have been de-identified and annotated with SNOMED CT concepts by medically trained professionals. This is the largest publicly available dataset of labelled clinical notes, and you can be one of the first to use it!

    NER: Named Entity Recognition: https://en.wikipedia.org/wiki/Named-entity_recognition

    awesome-medical-coding-nlp: https://github.com/acadTags/Awesome-medical-coding-NLP

    awesome-ehr-deep-learning: https://github.com/hurcy/awesome-ehr-deeplearning

    awesome-ner: https://github.com/smiyawaki0820/awesome-ner

    awesome-bioie > Research groups: https://github.com/caufieldjh/awesome-bioie#groups-active-in...

    SNOMED-CT as RDF: https://sphn-semantic-framework.readthedocs.io/en/latest/ext...

  • GoLLIE

    Guideline following Large Language Model for Information Extraction

  • Project mention: A LLM trained to follow annotation guidelines, for information extraction tasks | news.ycombinator.com | 2023-10-30
  • awesome-hungarian-nlp

    A curated list of NLP resources for Hungarian

  • huspacy

    HuSpaCy: industrial-strength Hungarian natural language processing

  • ChatGPT_for_IE

    Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

  • htmldate

    Fast and robust date extraction from web pages, with Python or on the command-line

  • minie

    An open information extraction system that provides compact extractions

  • targetedSummarization

    TextReducer - A Tool for Summarization and Information Extraction

  • stargather

    A fast GitHub stargazers information gathering tool

  • odinson

    Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

information-extraction related posts

  • Pydentic in prompt engineering

    1 project | /r/LangChain | 29 Nov 2023
  • 27-Jun-2023

    1 project | /r/dailyainews | 29 Jun 2023
  • Guidance on creating a very lightweight model that does one task very well

    2 projects | /r/LocalLLaMA | 26 Jun 2023
  • Kor: Extract structured data using LLMs

    1 project | /r/hypeurls | 26 Jun 2023
  • Kor: Extract structured data using LLMs

    1 project | news.ycombinator.com | 26 Jun 2023
  • Google Local Results AI Parser

    1 project | news.ycombinator.com | 24 Jun 2023
  • Ruby gem to parse structured data from Google Local Search Results

    1 project | news.ycombinator.com | 22 Jun 2023

Index

What are some of the best open-source information-extraction projects? This list will help you:

Rank  Project                         Stars
1     PaddleNLP                       11,622
2     DeepKE                          3,079
3     MITIE                           2,906
4     InvoiceNet                      2,407
5     kor                             1,540
6     awesome-document-understanding  1,156
7     007-TheBond                     1,075
8     ontogpt                         539
9     ail-framework                   514
10    RomBuster                       441
11    medaCy                          421
12    MedCAT                          414
13    HugNLP                          370
14    awesome-bioie                   300
15    GoLLIE                          239
16    awesome-hungarian-nlp           212
17    huspacy                         149
18    ChatGPT_for_IE                  133
19    htmldate                        114
20    minie                           88
21    targetedSummarization           87
22    stargather                      68
23    odinson                         65
