Ydata-profiling Alternatives
Similar projects and alternatives to ydata-profiling
-
InfluxDB
InfluxDB - Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
-
perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
-
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
-
jupyterlab-lsp
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
-
dataprep
Open-source, low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
-
evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
-
compressio
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same data.
ydata-profiling discussion
ydata-profiling reviews and mentions
-
The DuckDB Local UI
WhatTheDuck does SQL with duckdb-wasm IIRC
Pygwalker does open-source descriptive statistics and charts from pandas dataframes: https://github.com/Kanaries/pygwalker
ydata-profiling does Exploratory Data Analysis (EDA) with Pandas and Spark DataFrames and integrates with various apps: https://github.com/ydataai/ydata-profiling
- FLaNK 25 December 2023
-
First 15 Open Source Advent projects
6. Ydata-synthetic and Ydata-profiling by YData | GitHub | tutorial
-
Coding Wonderland: Contribute to YData Profiling and YData Synthetic in this Advent of Code
Send us your North ⭐️: "On the first day of Christmas, my true contributor gave to me..." a star in my GitHub tree! 🎵 If you love these projects too, star ydata-profiling or ydata-synthetic and let your friends know why you love them so much!
- Data exploration is not dead
- Explore your data in a single line of code
-
Which preprocessing steps to improve the performance of a naive bayes classifier
My suggestion: start with the EDA. There are a lot of packages that already automate that for you. My usual go-to: https://github.com/ydataai/ydata-profiling.
-
Simulating sales data
If you're not sure about the behaviour of your data (i.e., if the original data has properties like seasonality), you can use ydata-profiling to profile your data first.
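To make "properties like seasonality" a bit more concrete, here is a rough sketch of one way to check for a repeating cycle by hand. It uses only the Python standard library and is not ydata-profiling's API; the `autocorrelation` helper and the synthetic `sales` series are made up for illustration, assuming daily data with a weekly pattern.

```python
import math
from statistics import mean


def autocorrelation(values, lag):
    """Sample autocorrelation of `values` at a positive lag (hypothetical helper)."""
    if not 0 < lag < len(values):
        raise ValueError("lag must be between 1 and len(values) - 1")
    mu = mean(values)
    denom = sum((v - mu) ** 2 for v in values)
    if denom == 0:
        return 0.0  # constant series: nothing to correlate
    num = sum(
        (values[i] - mu) * (values[i + lag] - mu)
        for i in range(len(values) - lag)
    )
    return num / denom


# Synthetic daily sales with a 7-day cycle (invented data, 8 weeks long).
weeks = 8
sales = [100 + 20 * math.sin(2 * math.pi * day / 7) for day in range(7 * weeks)]

weekly = autocorrelation(sales, 7)      # lag matches the cycle length
half_week = autocorrelation(sales, 3)   # lag falls roughly mid-cycle

print(f"lag 7: {weekly:.3f}, lag 3: {half_week:.3f}")
```

A strongly positive value at the lag that matches the suspected period, next to a negative value at roughly half that lag, suggests the original data has a cycle that a simulation should reproduce.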
-
I recorded a Data Science Project using Python and uploaded it on Youtube
Super cool! For EDA, you could give ydata-profiling a spin sometime and speed up the process!
-
Ydata-Profiling and Dask
Hey guys,
We were recently at the Dask Demo Day and we're hoping to launch a new feature in ydata-profiling: support for Dask DataFrames!
We're looking for Dask Wizards to start collaborating on this feature, so if you're interested, please join us to define the roadmap of the project and start making it real.
Current GitHub branch is here: https://github.com/ydataai/ydata-profiling/tree/feat/dask
Dedicated dask channel here: https://discord.gg/EHDBuSSDuy
-
A note from our sponsor - InfluxDB
www.influxdata.com | 17 May 2025
Stats
ydataai/ydata-profiling is an open source project licensed under the MIT License, which is an OSI-approved license.
The primary programming language of ydata-profiling is Python.
Popular Comparisons
- ydata-profiling VS DataProfiler
- ydata-profiling VS dtale
- ydata-profiling VS dataframe-go
- ydata-profiling VS dataprep
- ydata-profiling VS evidently
- ydata-profiling VS awesome-python
- ydata-profiling VS sweetviz
- ydata-profiling VS lux
- ydata-profiling VS get-started-with-JAX
- ydata-profiling VS jupyterlab-lsp