Python exploratory-data-analysis

Open-source Python projects categorized as exploratory-data-analysis

Top 11 Python exploratory-data-analysis Projects

  1. ydata-profiling

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12

    WhatTheDuck does SQL with duckdb-wasm IIRC

    Pygwalker does open-source descriptive statistics and charts from pandas dataframes: https://github.com/Kanaries/pygwalker

    ydata-profiling does Exploratory Data Analysis (EDA) with Pandas and Spark DataFrames and integrates with various apps: https://github.com/ydataai/ydata-profiling

  2. great_expectations

    Always know what to expect from your data.

  3. cleanlab

    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

    Project mention: Ask HN: Not a webdev, why are these sites so good? | news.ycombinator.com | 2024-06-18

    https://cleanlab.ai/

  4. lux

    Automatically visualize your pandas dataframe via a single print! 📊 💡 (by lux-org)

  5. sweetviz

    Visualize and compare datasets, target values and associations, with one line of code.

  6. scattertext

    Beautiful visualizations of how language differs among document types.

  7. dataprep

    Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

  8. cleanvision

    Automatically find issues in image datasets and practice data-centric computer vision.

  9. piperider

    Code review for data in dbt

  10. sliceguard

    A library for detecting problematic data segments in structured and unstructured data with few lines of code.

  11. wordview

    A Python package for Exploratory Data Analysis (EDA) for text-based data.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python exploratory-data-analysis related posts

  • Name of library that creates multille charts quickly

    1 project | /r/learnpython | 4 Feb 2023
  • Do you see SQL being under threat in any way as a way of querying databases? I know it's possibly a dumb question but wondering.

    3 projects | /r/BusinessIntelligence | 27 Sep 2021
  • Lux - A Python API for Intelligent Visual Discovery

    1 project | /r/Python | 27 May 2021
  • Python API for Intelligent Visual Data Discovery

    1 project | news.ycombinator.com | 29 Mar 2021

Index

What are some of the best open-source exploratory-data-analysis projects in Python? This list will help you:

#    Project              Stars
1    ydata-profiling      12,792
2    great_expectations   10,253
3    cleanlab             10,227
4    lux                   5,262
5    sweetviz              2,994
6    scattertext           2,286
7    dataprep              2,137
8    cleanvision           1,056
9    piperider               484
10   sliceguard               63
11   wordview                 11
