Ydata-profiling Alternatives
Similar projects and alternatives to ydata-profiling
-
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
-
dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
-
get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
-
lux
Automatically visualize your pandas dataframe via a single print! 📊 💡 (by lux-org)
-
evidently
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
-
compressio
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same data.
-
sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
-
jupyterlab-lsp
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
-
dataprep
Open-source low-code data preparation library in Python. Collect, clean and visualize your data in Python with a few lines of code.
-
best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
-
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
-
OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
snorkel
A system for quickly generating training data with weak supervision
-
jupyter-vim-binding
Jupyter meets Vim. Vimmer will fall in love.
ydata-profiling reviews and mentions
-
pandas-profiling VS Rath - a user suggested alternative
2 projects | 12 Jan 2023
-
The Data-Centric AI Community is on Discord 👾
Alternatively, if you found DCAI through pandas-profiling or ydata-synthetic you can find support for your troubleshooting and provide feedback on interesting features!
-
[Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning"
-
Data profiling as part of a data reliability strategy?
-
GitHub repository with helpful Python programs to quickly run through datasets and give a brief summary of its statistics.
As a learning project, this is nice, but for standard use, what would be the advantage of this over just loading the data into pandas and calling df.describe()? And if you need more complete details on a dataset, why not use the pandas-profiling package?
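For readers unfamiliar with what such a "brief summary" contains, here is a minimal plain-Python sketch of a per-column summary. It is only an illustration: the `summarize` function and the sample rows are hypothetical, it assumes every row has the same keys and that `None` marks a missing value, and it is not how pandas or pandas-profiling compute their output.

```python
import statistics


def summarize(rows):
    """Summarize a list of dict rows column by column (None = missing)."""
    # Pivot the rows into one list of values per column name.
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)

    summary = {}
    for name, values in columns.items():
        present = [v for v in values if v is not None]
        info = {"count": len(present), "missing": len(values) - len(present)}
        is_numeric = bool(present) and all(
            isinstance(v, (int, float)) and not isinstance(v, bool)
            for v in present
        )
        if is_numeric:
            # Numeric columns get basic location statistics.
            info["mean"] = statistics.fmean(present)
            info["min"] = min(present)
            info["max"] = max(present)
        else:
            # Everything else is treated as categorical.
            info["distinct"] = len(set(present))
        summary[name] = info
    return summary


# Hypothetical sample data for illustration only.
rows = [
    {"age": 31, "city": "Lisbon"},
    {"age": 45, "city": "Porto"},
    {"age": None, "city": "Lisbon"},
]
report = summarize(rows)
```

The commenter's point stands: with pandas installed, `df.describe()` already gives this kind of overview in one call, so a hand-rolled version is mainly useful as a learning exercise.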
-
[P] You Only Plot Once (YOPO) -> Simple low code visualization library
Nice try making it clickable to generate different charts based on loaded data, but I can't help but notice that YOPO's functionality overlaps with another quite big tool called pandas-profiling. It automatically creates a report in HTML or JSON format for exploring a dataset, and it has been used quite successfully in many production solutions.
-
Visions – User defined data type systems
Visions is a python library for working with user defined data type systems. Out of the box, it provides type inference and automated data cleaning of sequence data with backend specific implementations for pandas, spark, python, and numpy. We often use it as a first pass cleaning step when working with tabular data and to simplify the backend logic of both pandas-profiling and our tabular data compression library compressio.
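To give a rough idea of what "type inference and automated data cleaning of sequence data" can mean, here is a small plain-Python sketch. It does not use the visions API; `infer_type` and `clean` are hypothetical helpers, and the sketch assumes the input is a sequence of strings where `None` or an empty string marks a missing value.

```python
def infer_type(values):
    """Return the narrowest of 'integer', 'float', 'string' that fits all values."""
    present = [v for v in values if v not in (None, "")]
    if not present:
        return "unknown"
    # Try the narrowest type first and widen only when a cast fails.
    for name, caster in (("integer", int), ("float", float)):
        try:
            for v in present:
                caster(v)
        except (TypeError, ValueError):
            continue
        return name
    return "string"


def clean(values):
    """Cast every value to the inferred type, mapping missing markers to None."""
    caster = {"integer": int, "float": float}.get(infer_type(values), str)
    return [None if v in (None, "") else caster(v) for v in values]


# Hypothetical raw column, as it might arrive from a CSV file.
raw = ["1", "", "3"]
cleaned = clean(raw)
```

A real type system, as the description above suggests, would let users define their own types and relations between them rather than hard-coding a fixed chain like this one.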
-
Show HN: Visions – User defined data type systems
If you're interested in learning more about the project, the original paper is available on JOSS[3]. You can also check out our Numpy Global 2020 talk[4].
Stats
ydataai/ydata-profiling is an open source project licensed under the MIT License, which is an OSI-approved license.