reddit-top-2.5-million
This is a dataset of the all-time top 1,000 posts, from the top 2,500 subreddits by subscribers, pulled from reddit between August 15–20, 2013. (by umbrae)
tidytuesday
Official repo for the #tidytuesday project (by rfordatascience)
Metric | reddit-top-2.5-million | tidytuesday |
---|---|---|
Mentions | 3 | 79 |
Stars | 608 | 6,400 |
Growth | - | 1.0% |
Activity | 0.0 | 8.4 |
Last commit | about 4 years ago | 9 days ago |
Language | HTML | |
License | GNU General Public License v3.0 or later | Creative Commons Zero v1.0 Universal |
The number of mentions is the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
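The metrics described above can be sketched in a few lines. This is only an illustration under stated assumptions: the page does not publish its formulas, so the exponential half-life weighting, the percentile mapping, and every function and parameter name below (`star_growth`, `weighted_commit_score`, `activity_rank`, `half_life_days`) are hypothetical choices, not the site's actual method.

```python
from datetime import date
from typing import Iterable, Optional, Sequence


def star_growth(stars_prev_month: int, stars_now: int) -> Optional[float]:
    """Month-over-month growth in stars, as a percentage.

    Returns None when there is no baseline to compare against.
    """
    if stars_prev_month <= 0:
        return None
    return (stars_now - stars_prev_month) / stars_prev_month * 100.0


def weighted_commit_score(
    commit_dates: Iterable[date], today: date, half_life_days: float = 90.0
) -> float:
    """Sum of per-commit weights, so recent commits count more than old ones.

    ASSUMPTION: a commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_rank(score: float, all_scores: Sequence[float]) -> float:
    """Map a raw score to a 0-10 scale by percentile among tracked projects.

    ASSUMPTION: the rank is ten times the fraction of projects scoring lower,
    so 9.0 means the project is in the top 10%.
    """
    below = sum(1 for s in all_scores if s < score)
    return 10.0 * below / len(all_scores)


if __name__ == "__main__":
    today = date(2023, 7, 1)
    commits = [date(2023, 6, 30), date(2023, 4, 2), date(2020, 1, 1)]
    print(weighted_commit_score(commits, today))
    print(activity_rank(9, list(range(10))))
```

Under these assumptions, a project with no commits in four years scores close to zero however many commits it once had, which is consistent with the 0.0 activity shown for reddit-top-2.5-million.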
reddit-top-2.5-million
Posts with mentions or reviews of reddit-top-2.5-million.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-06-07.
- Since Everyone liked the Brave New World load screens, I pulled all the relevant artwork I could find out of the game files and put them in one album including the G&K load screens, wonder screens, and victory screens. (plus /u/IAMA_Ghost_Boo 's touchups of the BNW load screens. Thanks!) Enjoy!
  google couldn't even find it, but searching for various parts of the title eventually pointed me to a github csv file with the top 1000 posts as of 2013: https://github.com/umbrae/reddit-top-2.5-million/blob/master/data/civ.csv
- reddit-top-2.5-million/madisonwi.csv at master · umbrae/reddit-top-2.5-million
- Looking for packages full of datasets
  Reddit top dataset
tidytuesday
Posts with mentions or reviews of tidytuesday.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-06-24.
- Recommendation for interesting datasets to work with?
  TidyTuesday is a weekly data cleaning project where a new, interesting data source is linked to each week: https://github.com/rfordatascience/tidytuesday
- Rfordatascience/tidytuesday: Official repo for the tidytuesday project
- [OC] Tornados in the U.S. are becoming more frequent in off-peak months
- Too old to continue my education? I'm lost.
  For R, I don't have specific resources, but I remember I started out with doing tidytuesdays challenge (https://github.com/rfordatascience/tidytuesday).
- First Project
  Tidy Tuesday has data and links to more data. The nice thing about those data sets is that you can search for what other people did with the data on social media (e.g. Twitter).
- [OC] Popularity of Horror Movie Poster Color Schemes from 1970
  Dataset: https://github.com/rfordatascience/tidytuesday/tree/master/data/2022/2022-11-01
- Tips on getting experience in R on GitHub
  What you're describing is contributing to open source. Some things I'd suggest doing:
  - learn some git first
  - create GitHub account and create at least a practice repo
  - look at learning community-related repos, like Tidy Tuesday
  - follow R "power" users, people associated with RStudio, and similar folks on social media. Those folks will sometimes mention projects aimed at beginners.
- [OC] 2021-22 EPL Home/Away Goal Differential
  Data: TidyTuesday April 4
- Publicly available datasets?
  The Tidy Tuesday git repo has a lot of example datasets to work with.
- [OC] Kyle Feldt and his Chevalier Sheriffs: An Infographic of Feldt's NRL Tries
  I mostly use ggplot2 in R for visualisations which means that The R Graph Gallery is my starting point for inspiration. The best thing to do is start with a simple idea that tells a story, and one of the best guys out there that does this is Cedric Scherer. He is involved a bit with the TidyTuesday project which I wish I had more time to play around with, and is a great starting point for developing a library of vis techniques.
What are some alternatives?
When comparing reddit-top-2.5-million and tidytuesday you can also consider the following projects:
awesome-public-datasets - A topic-centric list of HQ open datasets.
data - Data and code behind the articles and graphics at FiveThirtyEight
dataRetrieval - This R package is designed to obtain USGS or EPA water quality sample data, streamflow data, and metadata directly from web services.
gganimate - A Grammar of Animated Graphics
cheatsheets - Posit Cheat Sheets - Can also be found at https://posit.co/resources/cheatsheets/.
rnoaa - R interface to many NOAA data APIs
r4ds - R for data science: a book
big-mac-data - Data and methodology for the Big Mac index
ggsunburst
EconomicTracker - Download data from the Opportunity Insights Economic Tracker — https://tracktherecovery.org/