tidytuesday vs data

tidytuesday

Official repo for the #tidytuesday project (by rfordatascience)

Suggest topics

Source Code

Suggest alternative

Edit details

data

Data and code behind the articles and graphics at FiveThirtyEight (by fivethirtyeight)

Data

Source Code

data.fivethirtyeight.com

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

tidytuesday		data
	Project
79	Mentions	116
6,362	Stars	16,617
1.4%	Growth	0.3%
8.4	Activity	8.5
10 days ago	Latest Commit	about 1 month ago
HTML	Language	Jupyter Notebook
Creative Commons Zero v1.0 Universal	License	Creative Commons Attribution 4.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

tidytuesday

Posts with mentions or reviews of tidytuesday. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-24.

Recommendation for interesting datasets to work with?
1 project | /r/datasets | 5 Dec 2023

TidyTuesday is a weekly data cleaning project where a new, interesting data source is linked to each week: https://github.com/rfordatascience/tidytuesday
Rfordatascience/tidytuesday: Official repo for the tidytuesday project
1 project | news.ycombinator.com | 7 Nov 2023
[OC] Tornados in the U.S. are becoming more frequent in off-peak months
1 project | /r/dataisbeautiful | 9 Jul 2023
Too old to continue my education? I'm lost.
3 projects | /r/malaysiauni | 24 Jun 2023

For R, I don't have specific resources, but I remember I started out with doing tidytuesdays challenge (https://github.com/rfordatascience/tidytuesday).
First Project
1 project | /r/Rlanguage | 11 May 2023

Tidy Tuesday has data and links to more data. The nice thing about those data sets is that you can search for what other people did with the data on social media (e.g. Twitter).
[OC] Popularity of Horror Movie Poster Color Schemes from 1970
2 projects | /r/dataisbeautiful | 29 Apr 2023

Dataset: https://github.com/rfordatascience/tidytuesday/tree/master/data/2022/2022-11-01
Tips on getting experience in R on GitHub
1 project | /r/Rlanguage | 20 Apr 2023

What you're describing is contributing to open source. Some things I'd suggest doing: - learn some git first - create GitHub account and create at least a practice repo - look at learning community-related repos, like Tidy Tuesday - follow R "power" users, people associated with RStudio, and similar folks on social media. Those folks will sometimes mention projects aimed at beginners.
[OC] 2021-22 EPL Home/Away Goal Differential
1 project | /r/dataisbeautiful | 6 Apr 2023

Data: TidyTuesday April 4
Publicly available datasets?
1 project | /r/datascience | 25 Mar 2023

The Tidy Tuesday git repo has a lot of example datasets to work with.
[OC] Kyle Feldt and his Chevalier Sheriffs: An Infographic of Feldt's NRL Tries
1 project | /r/nrl | 4 Mar 2023

I mostly use ggplot2 in R for visualisations which means that The R Graph Gallery is my starting point for inspiration. The best thing to do is start with a simple idea that tells a story, and one of the best guys out there that does this is Cedric Scherer. He is involved a bit with the TidyTuesday project which I wish I had more time to play around with, and is a great starting point for developing a library of vis techniques.

data

Posts with mentions or reviews of data. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-02.

[USMNT] It only took 20 caps for Jesus Ferreira to get double-digit goals. The fastest in #USMNT history.
1 project | /r/MLS | 29 Jun 2023

You of course already know this answer, but just to put it into more perspective. Here are the SPI ranking equivalents to what he did with these 11 goals in Scotland and Switzerland.
[Effortpost] Advanced stats on which players are contributing the most to the Heat's playoff run.
1 project | /r/heat | 24 May 2023

To answer these questions I decided to look at 538’s RAPTOR ratings. RAPTOR uses player tracking data to estimate how much each player contributes on the offensive and defensive ends. The total RAPTOR score should be something like the “number of points a player contributes to his team’s offense and defense per 100 possessions, relative to a league-average player.” Higher is better, best during the regular season has been Nikola Jokic at +14. You can read more about it here or play with an interactive tool on their website here. I don’t really care about the details of why it’s a good statistic, but it seems pretty helpful and most importantly for my purposes you can download the data here for free.
Consanguineous marriage percentage per country
1 project | /r/dataisbeautiful | 23 May 2023

EDIT: I came to this data from this repository which has a nice csv collection for machine training.
USMNT is a European club. How did they do this season?
1 project | /r/ussoccer | 22 May 2023

Looks like we may actually be collectively underrating our guys now. That's an interesting change. Based on SPI (rating = 72.4) we would be:
Derrick White's WAR over the past season has been ~6.7 according to a composite of various metrics. Derrick White's WAR in the playoffs has been ~0.1 according to RAPTOR. The worst among the main Boston roster
1 project | /r/nba | 19 May 2023
Nate Silver: Some personal news
2 projects | news.ycombinator.com | 2 May 2023

Before Disney/ABC get any -ideas-, might be a good chance to get our hands on at least their data[0]!
[0]: https://data.fivethirtyeight.com/
In honor of Sexual Assault Awareness Month, make sure neither you nor friends harbor any misconceptions about consent
1 project | /r/MensLib | 30 Apr 2023

Most young women expect words to be involved when their partner seeks their consent. 43% of young men actually ask for verbal confirmation of consent. Overall, verbal indicators of consent or nonconsent are more common than nonverbal indicators. More open communication also increases the likelihood of orgasm for women.
CMV: When selecting a movie to watch, the audience's rating is the only thing that matters and the critic's rating is entirely irrelevant.
1 project | /r/changemyview | 29 Apr 2023
Slight majority of people in WA want to leave state, poll finds
1 project | /r/SeattleWA | 28 Apr 2023

DHM does not use an equity sample. Of all polling operations they rank 250 out of 517. Id like to see another pollster https://github.com/fivethirtyeight/data/blob/master/pollster-ratings/pollster-ratings.csv
Optimism is bad for your health. So lets just do some maths! How can Liverpool FC get top 4? part 2
1 project | /r/LiverpoolFC | 23 Apr 2023

LOL My github’s pretty sparse but I’m pulling data from this API; 538 also provides the data they use for their club predictions here if that interests you

What are some alternatives?

When comparing tidytuesday and data you can also consider the following projects:

gganimate - A Grammar of Animated Graphics

uawardata - The data behind uawardata.com

cheatsheets - Posit Cheat Sheets - Can also be found at https://posit.co/resources/cheatsheets/.

ydata-quality - Data Quality assessment with one line of code

r4ds - R for data science: a book

CodeSearchNet - Datasets, tools, and benchmarks for representation learning of code.

awesome-public-datasets - A topic-centric list of HQ open datasets.

quilt - Quilt is a data mesh for connecting people with actionable data

ggsunburst

Video-Swin-Transformer - This is an official implementation for "Video Swin Transformers".

big-mac-data - Data and methodology for the Big Mac index

datagen - Generates customer, sales reps, sales mgrs, products, manufacturer, and transaction data and creates and populates MySQL database with it. Also, can generate single tables of random data.

tidytuesday vs gganimate data vs uawardata tidytuesday vs cheatsheets data vs ydata-quality tidytuesday vs r4ds data vs CodeSearchNet tidytuesday vs awesome-public-datasets data vs quilt tidytuesday vs ggsunburst data vs Video-Swin-Transformer tidytuesday vs big-mac-data data vs datagen

Compare tidytuesday vs data and see what are their differences.

tidytuesday

data

tidytuesday

data

What are some alternatives?