data-exploration

Top 16 data-exploration Open-Source Projects

  • ydata-profiling

    One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.

  • Project mention: FLaNK 25 December 2023 | dev.to | 2023-12-26
  • pygwalker

    PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

  • Project mention: Show HN: Use an "eraser" to clean data on flight without breaking your workflow | news.ycombinator.com | 2024-03-15
  • Rath

    Next generation of automated data exploratory analysis and visualization platform.

  • Project mention: FLaNK Stack for 15 May 2023 | dev.to | 2023-05-15
  • sweetviz

    Visualize and compare datasets, target values and associations, with one line of code.

  • dataprep

    Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

  • Optimus

    Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)

  • odd-platform

    First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

  • Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
  • kangas

    🦘 Explore multimedia datasets at scale

  • cleanvision

    Automatically find issues in image datasets and practice data-centric computer vision.

  • piperider

    Code review for data in dbt

  • Project mention: Show HN: PipeRider – open-source Data Impact Analysis for dbt changes | news.ycombinator.com | 2023-09-06
  • desbordante-core

    Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Project mention: Show HN: Desbordante 1.0.0 Released | news.ycombinator.com | 2023-12-11
  • dwata

    AI studio for you and your business. Create assistants, connect databases, APIs (like Stripe) or CSV/Excel files. Use AI to create insights, workflows, action items.

  • Project mention: I'm writing a new vector search SQLite Extension | news.ycombinator.com | 2024-05-02

    I have been looking at your extension for the last couple weeks and I think I will end up using it. I am also looking at running qdrant locally. I am creating an open source AI studio (desktop app) (1). It is in very early stages but it is fun to get to know this landscape. I would be happy to contribute to your project in any way I can. And thank you for this project.

    1. https://github.com/brainless/dwata

  • pivot-chart

    A light and fast implementation of web pivot table and pivot chart components.

  • tonic

    🍸 Digital Collections Framework (by Subgin)

  • thesis_undergrad

    Documentation: Methodology and Exploratory Data Analysis

  • pandas-paddles

    Access the parent Pandas data frame in loc[], iloc[], assign(), and other Pandas helpers

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source data-exploration projects? This list will help you:

#    Project             Stars
1    ydata-profiling     12,085
2    pygwalker            9,864
3    Rath                 3,987
4    sweetviz             2,841
5    dataprep             1,927
6    Optimus              1,446
7    odd-platform         1,120
8    kangas               1,029
9    cleanvision            925
10   piperider              469
11   desbordante-core       354
12   dwata                  104
13   pivot-chart             94
14   tonic                   70
15   thesis_undergrad         6
16   pandas-paddles           5
