TidyverseSkeptic
seaborn
TidyverseSkeptic | seaborn | |
---|---|---|
13 | 77 | |
508 | 11,969 | |
- | - | |
3.3 | 8.4 | |
4 months ago | 9 days ago | |
TeX | Python | |
- | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
TidyverseSkeptic
-
Why Pandas feels clunky when coming from R
I just don't get these to be honest -- besides the fact that author missed simple things like `df.groupby('var',as_index=False)`, isn't this obviously arbitrary "this is easier my way" complaints? (I did R before all the chaining stuff was popular, and I wouldn't stuff everything into a single command like that even now. It isn't like you get lazy evaluation or any special data processing magic.)
So I get people love chaining and tidyverse, good for you, I don't. But at least I can acknowledge that my way (or this way) people have different preferences and one is not intrinsically easier.
Norm Matloff has a blog where he essentially just argues the opposite of all the tidyverse stuff, https://github.com/matloff/TidyverseSkeptic, but it is the same idea in reverse to me (one is not obviously easier to learn than the other IMO).
-
Where to learn R?
On the other hand, there is also a more traditional universe outside of the of the newer tidyverse approach. See the criticism of the tidyverse ecosystem by Prof Norm Matloff (of UC Davis). He provides a freely available introductory course in base R.
-
I will take that odds
Whenever I hear tidyverse, I just feel the need to leave this: TidyverseSceptic
-
Base-R Is Alive and Well
Yeah, I had never heard of him before, but I followed the link in the article above to his GitHub page and think he made some really great points about conciseness and clarity in base R code, and, I admittedly had no idea tapply() was so useful and easy to use, because I almost never see it used in any examples online. Although I agree with others here that he's misrepresenting why package developers use base R (which is to avoid dependences in their packages, which is very important), I also find myself agreeing with him that future R programmers not being taught base R is worrisome (I'm thinking of dependencies in future package development).
-
Your thoughts on base R? I never considered it and, after reading seemingly know little about it.
I was in an R group meeting. One of the members mentioned Prof. Norm Matloff and said he has comments about tidyverse. I searched and found Matloff's explanation here. What are your thoughts on tidyverse and Matloff's comments about it? As I read it, I found myself agreeing with certain points. I do not have a computer science background; I'm someone trying to learn coding because I see uses for it in my work. I started my learning, about a year ago, with tidyverse tutorials. My patchwork jumping around, maybe in addition to some of the gaps Matloff indicates, show me that I know very little about base R.
-
In charge of making the transition from Excel to R at the office
There are good arguments against tidyverse, especially for beginners. It doesn't lead to a growth in understanding the language fundamentals and requires to learn many functions, paradigms, and syntaxes not shared by base R, which can easily be overwhelming and lead to a learn-by-heart approach more than to a learn-by-understanding. There are many good articles on the topic, such as this one or a more in-depth one, suggesting to consider tidyverse a more advanced application for specific use cases, if you like the dialect. I don't, so I might be biased.
- Teaching R in a Kinder, Gentler, More Effective Manner
- An opinionated view of the Tidyverse “dialect” of the R language
-
Thoughts on book?
I would discourage you to get into the tidyverse, at least in the first stages of your R training. It's like trying to learn english AND scottish together as a foreigner. You can read some better worded discussions here https://github.com/matloff/TidyverseSkeptic and here https://towardsdatascience.com/a-thousand-gadgets-my-thoughts-on-the-r-tidyverse-2441d8504433?gi=1b0a3648b6e6
-
Ho everyone I am R beginer. I need to to change the data type of these two columns, I tried as many ways I could find on the internet but it just won't work for me. This is really frustrating especially when you are a beginer, can you pleae provide a solution ? Thanks a lot in advance !
My opinions are largely in agreement with Norm Matloff on the subject actually.
seaborn
- "No" is not an actionable error message
-
Apache Superset
If you are doing data analysis I don't think any of the 3 pieces of software you mentioned are going to be that helpful.
I see these products as tools for data visualization and reporting i.e. presenting prepared datasets to users in a visually appealing way. They aren't as well suited for serious analytics.
I can't comment on Superset or Tableau but I am familiar with Power BI (it has been rolled out across my org), the type of statistics you can do with it are fairly rudimentary. If you need to do any thing beyond summarizing (counts, averages, min, max etc). It is not particularly easy.
For data analysis I use SAS or R. This software allows you do things like multivariate regression, timeseries forecasting, PCA, Cluster analysis etc. There is also plotting capability.
Both these products are kind of old school, I've been using them since early 2000's, the "new school" seems to be Python. Pretty much all the recent data science people in my organization use Python. Particularly Pandas and libraries like Seaborn (https://seaborn.pydata.org/).
The "power" users of Power BI in my organization tend to be finance/HR people for use cases like drill down into cost figures or Interactively presenting KPI's and other headline figures to management things like that.
-
Seaborn bug responsible for finding of declining disruptiveness in science
It's referring to the seaborn library (https://seaborn.pydata.org/), a Python library for data visualization (built on top of matplotlib).
-
Why Pandas feels clunky when coming from R
While it’s not perfect and it’s not ggplot2, Seaborn is definitely a big improvement over bare matplotlib. You can still use matplotlib to modify the plots it spits out if you want to but the defaults are pretty good most of the time.
https://seaborn.pydata.org/
-
Releasing The Force Of Machine Learning: A Novice’s Guide 😃
Seaborn: A statistical data visualization library based on Matplotlib, enhancing the aesthetics and visual appeal of statistical graphics.
-
Seven Python Projects to Elevate Your Coding Skills
Matplotlib Seaborn Example data sets
-
Mastering Matplotlib: A Step-by-Step Tutorial for Beginners
Seaborn - Statistical data visualization using Matplotlib.
-
Top 10 growing data visualization libraries in Python in 2023
Github: https://github.com/mwaskom/seaborn
-
Best Portfolio Projects for Data Science
Seaborn Documentation
-
[OC] Nationwide Public Transit Ridership is down 30% from pre-lockdown levels; San Francisco's BART ridership is down almost 70%
You've done a great job presenting this. Maybe you already know, but seaborne is an extension of matplotlib that makes it pretty easy to "beautify" matplotlib charts
What are some alternatives?
Chain.jl - A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
bokeh - Interactive Data Visualization in the browser, from Python
RCall.jl - Call R from Julia
Altair - Declarative statistical visualization library for Python
VegaLite.jl - Julia bindings to Vega-Lite
plotly - The interactive graphing library for Python :sparkles: This project now includes Plotly Express!
swirl - :cyclone: Learn R, in R.
ggplot - ggplot port for python
Transformers.jl - Julia Implementation of Transformer models
plotnine - A Grammar of Graphics for Python
magrittr - Improve the readability of R code with the pipe
matplotlib - matplotlib: plotting with Python