SaaSHub helps you find the best software and product alternatives Learn more →
Top 4 Python unstructured-data Projects
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
-
Project mention: Show HN: Generate JSON mock data for testing/initial app development | news.ycombinator.com | 2023-10-03
A friend of mine built a tool called Trex that you might find helpful, check it out here: https://github.com/automorphic-ai/trex
It's very consistent at generating templated data.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
-
etl-texts
ETL-Texts aims to be a simple and efficient pipeline designed for extracting, translating, cleaning, and transforming text files.
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
The latest post mention was on 2024-01-14.
Python unstructured-data related posts
- Intelligently transform unstructured to structured output (JSON, Regex, CFG)
- Vector Database in a Jupyter Notebook
- Beginner-ish resources for choosing a vector database?
- Semantic Similarity Search
- What Is DocArray?
- Join Hacktoberfest 2021 with Milvus!
- Sunday Daily Thread: What's everyone working on this week?
-
A note from our sponsor - SaaSHub
www.saashub.com | 28 Mar 2024
Index
What are some of the best open-source unstructured-data projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | towhee | 2,941 |
2 | trex | 237 |
3 | relevanceai | 89 |
4 | etl-texts | 5 |
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com