Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 7 sentence-boundary-detection Open-Source Projects
-
pySBD
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
wtpsplit
Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
-
vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Project mention: Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch | news.ycombinator.com | 2024-04-10This is awesome, thanks. I've been messing with wink's NLP library (https://winkjs.org/wink-nlp/) to transform user queries and format responses so I can make a proper chat bot - will see what I can learn from these!
GoSBD is a library for segmenting text into sentences for Go. It is rule-based and works out-of-the-box.
This library builds upon the excellent foundations laid by pySBD and pragmatic_segmenter.
The roadmap includes language support expansion, text cleaning features, and improved testing. Contributions are greatly appreciated.
Repository: https://github.com/gosbd/gosbd
sentence-boundary-detection related posts
- Show HN: WinkNLP introduces key sentence extraction
- WinkNLP's recent feature — key sentence extraction delivers a performance of over 450,000 tokens/second or 1500 sentences/second on Apple M1/16GB
- WinkNLP's recent feature — key sentence extraction delivers a performance of over 450,000 tokens/second or 1500 sentences/second on Apple M1/16GB
- WinkNLP's recent feature — key sentence extraction delivers a performance of over 450,000 tokens/second or 1500 sentences/second on Apple M1/16GB
- WinkNLP's recent feature — key sentence extraction delivers a performance of over 450,000 tokens/second or 1500 sentences/second on Apple M1/16GB
- How to visualize timeline of a Wiki article?
- WinkNLP delivers 600k tokens/second speed on browsers (MBP M1)
-
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Apr 2024
Index
What are some of the best open-source sentence-boundary-detection projects? This list will help you:
Project | Stars | |
---|---|---|
1 | wink-nlp | 1,143 |
2 | pySBD | 731 |
3 | wtpsplit | 495 |
4 | razdel | 243 |
5 | vid2cleantxt | 156 |
6 | wink-eng-lite-model | 10 |
7 | gosbd | 6 |
Sponsored