Top 16 data-exploration Open-Source Projects
-
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
dataprep
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
-
Optimus
Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)
-
odd-platform
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
-
desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also lets you run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
-
dwata
AI studio for you and your business. Create assistants, connect databases, APIs (like Stripe) or CSV/Excel files. Use AI to create insights, workflows, action items.
-
pandas-paddles
Access the parent Pandas data frame in loc[], iloc[], assign(), and other Pandas helpers
Project mention: Show HN: Use an "eraser" to clean data on flight without breaking your workflow | news.ycombinator.com | 2024-03-15
Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
Project mention: Show HN: PipeRider – open-source Data Impact Analysis for dbt changes | news.ycombinator.com | 2023-09-06
Project mention: I'm writing a new vector search SQLite Extension | news.ycombinator.com | 2024-05-02
I have been looking at your extension for the last couple of weeks and I think I will end up using it. I am also looking at running qdrant locally. I am creating an open source AI studio (desktop app) (1). It is in very early stages but it is fun to get to know this landscape. I would be happy to contribute to your project in any way I can. And thank you for this project.
1. https://github.com/brainless/dwata
data-exploration related posts
-
Show HN: Desbordante 1.0.0 Released
-
Show HN: PipeRider – open-source Data Impact Analysis for dbt changes
-
Kangas: Pandas for Multimedia Datasets
-
Reimagining Santa Claus with Stable Diffusion
-
[D] Is accurately estimating image quality even possible?
-
Business Intelligence, The Key To Company Success
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 May 2024
Index
What are some of the best open-source data-exploration projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | ydata-profiling | 12,085 |
2 | pygwalker | 9,864 |
3 | Rath | 3,987 |
4 | sweetviz | 2,841 |
5 | dataprep | 1,927 |
6 | Optimus | 1,446 |
7 | odd-platform | 1,120 |
8 | kangas | 1,029 |
9 | cleanvision | 925 |
10 | piperider | 469 |
11 | desbordante-core | 354 |
12 | dwata | 104 |
13 | pivot-chart | 94 |
14 | tonic | 70 |
15 | thesis_undergrad | 6 |
16 | pandas-paddles | 5 |