Similar projects and alternatives to ydata-profiling
What's in your data? Extract schema, statistics and entities from datasets
Visualizer for pandas data structures
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
Automatically visualize your pandas dataframe via a single print! 📊 💡 (by lux-org)
Type System for Data Analysis in Python
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same data.
Visualize and compare datasets, target values and associations, with one line of code.
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
👾 Fast and simple video download library and CLI tool written in Go
A curated list of awesome Python frameworks, libraries, software and resources
OpenRefine is a free, open source power tool for working with messy data and improving it
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A system for quickly generating training data with weak supervision
Issue tracker for Codewars
Automated data exploratory analysis and visualization tools.
Jupyter meets Vim. Vimmer will fall in love.
ydata-profiling reviews and mentions
pandas-profiling VS Rath - a user suggested alternative
2 projects | 12 Jan 2023
The Data-Centric AI Community is on Discord 👾
2 projects | dev.to | 20 Dec 2022
Alternatively, if you found DCAI through pandas-profiling or ydata-synthetic you can find support for your troubleshooting and provide feedback on interesting features!
[Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning"
4 projects | reddit.com/r/MachineLearning | 3 Dec 2022
Data profiling as part of a data reliability strategy?
2 projects | reddit.com/r/dataengineering | 15 Sep 2022
GitHub repository with helpful Python programs to quickly run through datasets and give a brief summary of its statistics.
2 projects | reddit.com/r/datasets | 26 Mar 2022
As a learning project, this is nice, but for standard use, what would be the advantage of this over just loading a program into Pandas and calling df.describe()? And if you need more complete details on a data set, using the pandas-profiling package?
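For readers unfamiliar with the two options this comment compares, here is a minimal sketch. The sample data, column names, report title, and output path are hypothetical, and the fuller report is wrapped in a helper that imports the package lazily, so the `describe()` part runs even without ydata-profiling installed.

```python
import pandas as pd

# Hypothetical sample data, used only for illustration.
df = pd.DataFrame(
    {
        "price": [1.0, 2.0, 3.0, 4.0],
        "quantity": [10, 20, 30, 40],
        "city": ["Lisbon", "Porto", "Lisbon", "Faro"],
    }
)

# Quick summary: count, mean, std, min, quartiles, and max.
# By default describe() only covers the numeric columns.
summary = df.describe()
print(summary)


def write_profile_report(frame: pd.DataFrame, path: str = "report.html") -> None:
    """Write a fuller profiling report (requires the ydata-profiling package).

    The title and default path are assumptions for this sketch; the output
    format follows the file extension (.html or .json).
    """
    # Imported lazily so the describe() example above has no extra dependency.
    from ydata_profiling import ProfileReport

    ProfileReport(frame, title="Dataset profile").to_file(path)
```

Calling `write_profile_report(df)` would produce the more complete report the commenter refers to, while `df.describe()` remains the lighter first look.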
[P] You Only Plot Once (YOPO) -> Simple low code visualization library
2 projects | reddit.com/r/MachineLearning | 27 Feb 2022
Nice try making it clickable to generate different charts based on loaded data, but I can't help but notice that YOPO's functionality overlaps with another quite big tool called pandas-profiling. It automatically creates a report in HTML or JSON format for exploring a dataset and has been used quite successfully in many production solutions.
Visions – User defined data type systems
3 projects | reddit.com/r/Python | 4 Feb 2022
Visions is a Python library for working with user-defined data type systems. Out of the box, it provides type inference and automated data cleaning of sequence data, with backend-specific implementations for pandas, Spark, Python, and NumPy. We often use it as a first-pass cleaning step when working with tabular data and to simplify the backend logic of both pandas-profiling and our tabular data compression library, compressio.
Show HN: Visions – User defined data type systems
3 projects | news.ycombinator.com | 1 Feb 2022
If you're interested in learning more about the project, the original paper is available on JOSS; you can also check out our Numpy Global 2020 talk.
ydataai/ydata-profiling is an open source project licensed under MIT License which is an OSI approved license.