opentripplanner
disk.frame
opentripplanner | disk.frame | |
---|---|---|
1 | 5 | |
77 | 593 | |
- | 0.2% | |
4.1 | 0.0 | |
about 2 months ago | 3 months ago | |
R | R | |
GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
opentripplanner
-
R packages for transit planning?
Transportation planner / data scientist here: The R opentripplanner package (https://github.com/ropensci/opentripplanner) (also a Robin Lovelace-related package!) is a particular favorite, just wanted to call that out! Also, the Open Transit Data Toolkit (https://transitdatatoolkit.com/) might give some ideas on topics to cover. I think the methods are a bit dated at this point (i.e. not a lot of tidyverse, sf) but in general it's a great resource.
disk.frame
-
Do you code from memory? Or do you reference things?
Say hello to disk.frame.
- How can I read in only two columns from a massive 10+ GB tab file?
-
Data cleaning/ analysis 100-200 million rows of data. Is this doable in R, or is there another program I should try instead?
It depends on your hardware, but it should not be a problem. You might look into disk frame (https://diskframe.com) or similar packages.
-
is it possible to have my enviroment objects and work with them on my local drive instead of RAM?
If that doesn't work, the disk.frame package might help. It is new-ish and not common, but does seem to work with data on disk rather than in memory
-
We Test PCIe 4.0 Storage: The AnandTech 2021 SSD Benchmark Suite
> The speeds were just stunning to say the least at 15GB/s.
That is amazing. That is around DDR4-1866 speeds, and not far from DDR4-2666 (~21 GB/s). At those speeds I would happily work with dataframes sitting on the disk rather than in memory [1, 2]. Did you benchmark RAID 0 with less than four disks?
[1] R: https://github.com/xiaodaigh/disk.frame
What are some alternatives?
blogdown - Create Blogs and Websites with R Markdown
db-benchmark - reproducible benchmark of database-like ops
police-settlements - A FiveThirtyEight/The Marshall Project effort to collect comprehensive data on police misconduct settlements from 2010-19.
drake - An R-focused pipeline toolkit for reproducibility and high-performance computing
owidR - An R Package for Importing Data from Our World in Data
babynames - An R package containing US baby names from the SSA
r4ds - R for data science: a book
awesome-R - A curated list of awesome R packages, frameworks and software.