Top 23 nlp-library Open-Source Projects
-
Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
-
FARM
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
-
tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
-
contextualized-topic-models
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
-
OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track).
-
lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
-
Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
-
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
-
bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser). See http://pypi.python.org/pypi/bllipparser/ for the Python module.
-
lingo
package lingo provides the data structures and algorithms required for natural language processing (by chewxy)
Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23
Is t5x an encoder/decoder architecture?
Some more general options.
The Flax ecosystem
https://github.com/google/flax?tab=readme-ov-file
or dm-haiku
https://github.com/google-deepmind/dm-haiku
were some of the best developed communities in the Jax AI field
Perhaps the “trax” repo? https://github.com/google/trax
Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...
Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py
Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23
Hi Community, in this article I will demonstrate the steps below to create your own chatbot using spaCy (an open-source software library for advanced natural language processing, written in Python and Cython):
Otherwise it depends on your use case. There are NLP libraries like this one that can do the job.
Project mention: A transformer-based method for zero and few-shot biomedical NER | news.ycombinator.com | 2023-05-12
nlp-library related posts
- DataDreamer
- [P] MultiEL: Multilingual Entity Linking model by BELA model
- YouTube content creation assistant
- Seeking your insights on "Loquax": A tool for phonological analysis
- I used GPT-4 to create code that automates absolutely everything in creating YouTube Shorts, from voiceover to editing, even down to choosing the illustration images.
- [Arabic>latin transliteration] any apps for this?
- Guidance needed: Extracting diseases and symptoms from medical text
Index
What are some of the best open-source nlp-library projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | transformers | 124,557 |
2 | spaCy | 28,704 |
3 | Awesome-pytorch-list | 14,932 |
4 | OpenPrompt | 4,146 |
5 | FARM | 1,723 |
6 | tika-python | 1,411 |
7 | contextualized-topic-models | 1,157 |
8 | pythainlp | 927 |
9 | skweak | 909 |
10 | janome | 828 |
11 | kagome | 789 |
12 | Sudachi | 740 |
13 | OCTIS | 681 |
14 | lingua | 657 |
15 | DataDreamer | 632 |
16 | Giveme5W1H | 500 |
17 | medspacy | 474 |
18 | camel_tools | 376 |
19 | zshot | 316 |
20 | bllip-parser | 225 |
21 | mutate | 149 |
22 | lingo | 146 |
23 | turkish-deasciifier | 143 |