ydata-profiling
dtale
Our great sponsors
ydata-profiling | dtale | |
---|---|---|
43 | 46 | |
12,022 | 4,539 | |
1.5% | 1.9% | |
8.5 | 8.5 | |
5 days ago | 6 days ago | |
Python | TypeScript | |
MIT License | GNU Lesser General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ydata-profiling
- FLaNK 25 December 2023
-
First 15 Open Source Advent projects
6. Ydata-synthetic and Ydata-profiling by YData | Github | tutorial
-
Coding Wonderland: Contribute to YData Profiling and YData Synthetic in this Advent of Code
Send us your North βοΈ: "On the first day of Christmas, my true contributor gave to me..." a star in my GitHub tree! π΅ If you love these projects too, star ydata-profiling or ydata-synthetic and let your friends know why you love it so much!
- Data exploration is not dead
- Explore your data in a single line of code
-
Which preprocessing steps to improve the performance of a naive bayes classifier
My suggestion start with the EDA - there are a lot of packages that automate that for you already. My usual go-to: https://github.com/ydataai/ydata-profiling.
-
Simulating sales data
If you're not sure about the behaviour of your data (i.e., if the original data has properties like seasonality), you can use ydata-profiling to profile your data first.
-
I recorded a Data Science Project using Python and uploaded it on Youtube
Super cool! For EDA, you could give ydata-profiling a spin sometime and speed up the process!
-
Ydata-Profiling and Dask
Hey guys,
We've been recently at the Dask Demo Day and we're hoping to launch a new feature on ydata-profiling, with the support for Dask dataframes!
We're looking for Dask Wizards to start collaborating on this feature, so if you're interested, please join us to define the roadmap of the project and start making it real
Current GitHub branch is here: https://github.com/ydataai/ydata-profiling/tree/feat/dask
Dedicated dask channel here: https://discord.gg/EHDBuSSDuy
-
π§ ydata-profiling + Dask!
We're looking for Dask Wizards π§π»ββοΈ to start collaborating on this branch, so if you're interested, please join us to define the roadmap of the project and start making it real π
dtale
-
The free pandas visualizer, D-Tale, has now been integrated with ArcticDB which will allow users to load huge datasets and easily navigate their databases
[D-Tale](https://github.com/man-group/dtale) has recently released version 3.2.0 on pypi & conda-forge: ``` pip install -U dtale conda install dtale -c conda-forge ``` But if you want to take it one step further you can now integrate it with [ArcticDB](https://github.com/man-group/ArcticDB): ``` pip install -U dtale[arcticdb] ``` This allows you the ability to navigate your libraries of datasets saved to your ArcticDB database! But the best part is that all the reads are occuring directly against ArcticDB so some of the memory constraints you may have been hit with before are now a thing of the past. Here's a full write up how to use this functionality along with a quick demo: https://github.com/man-group/dtale/blob/master/docs/arcticdb/ARCTICDB\_INTEGRATION.md Hope this helps & please support open-source by throwing your star on the [repo](https://github.com/man-group/dtale). Thanks! π
-
Data Scientists using neovim: how do you explore dataframes?
I've looked into external tooling, libs such as dtale, which feel overly complicated for my use case (but I'm open to alternatives). What I would like to have instead is something akin to Spyder's variable viewer, which allows sorting by column. VSCode goes a step further and also provides the ability to filter the dataframe.
-
I need help lol
D-Tale: A Python library that provides an interactive web-based interface for data exploration and analysis.
-
Something better than pandas? with interactive graphical UI?
Try this: https://github.com/man-group/dtale
-
Mito β Excel-like interface for Pandas dataframes in Jupyter notebook
https://github.com/man-group/dtale
I find that I'm actually a lot faster using basic Pandas methods to get the data I want in exactly the form I want it.
If I really want to show everything, I just use:
'''
- Memray is a memory profiler for Python by Bloomberg
- Show HN: D-Tale, easy to use pandas GUI
-
Added visualizations of statsmodels time series analysis functions to the free pandas visualizer, D-Tale
Just added "Time Series Analysis" in v1.60.1 of D-Tale on pypi & conda-forge: pip install -U dtale conda install dtale -c conda-forge This feature provides a quick and easy way to visualize the usage of the following time series analysis function in statsmodels:
- Show HN: Open-source pandas dataframe visualizer
-
For all the python/pandas users out there I just released a bunch of UI updates to the free visualizer, D-Tale
Your data is stored in memory so the size of your dataframe is limited to the memory of your machine. That being said weβve allowed users to swap out the machanism which stores the data so you can use something like Redis or Shelve to allieviate memory. Hereβs some documentation: https://github.com/man-group/dtale/blob/master/docs/GLOBAL_STATE.md
What are some alternatives?
DataProfiler - What's in your data? Extract schema, statistics and entities from datasets
PandasGUI - A GUI for Pandas DataFrames
dataframe-go - DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
jupyterlab-autoplot - Magical Plotting in JupyterLab
lux - Automatically visualize your pandas dataframe via a single print! π π‘
pandastable - Table analysis in Tkinter using pandas DataFrames.
get-started-with-JAX - The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
sqliteviz - Instant offline SQL-powered data visualisation in your browser
evidently - Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
best-of-ml-python - π A ranked list of awesome machine learning Python libraries. Updated weekly.
dataprep - Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
dtale-desktop - Build a data visualization dashboard with simple snippets of python code