cheatsheets
nbdev
Our great sponsors
cheatsheets | nbdev | |
---|---|---|
60 | 45 | |
5,596 | 4,740 | |
1.5% | 0.9% | |
7.6 | 6.5 | |
5 days ago | about 1 month ago | |
TeX | Jupyter Notebook | |
Creative Commons Attribution 4.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cheatsheets
-
Tools a Data Scientist should know:
If you're an R user, stringr + its cheatsheet gets you very close to remembering what to do without needing to look further!
-
JSON to PDF Magic: Harnessing LaTeX and JSON for Effortless Customization and Dynamic PDF Generation
For more information on how to use ggplot2 and create charts consult the ggplot2 official page or the ggplot2 cheat graphic.
-
Best packages to learn?
I'd suggest you have a look at cheatsheets (or download them from GitHub) if you want to get to know your way around a package or set if functions, it saves you a lot of time.
-
How do I make these shapes (pictured below) in ggplot?
You could use geom_hline and geom_vline, geom_abline, or geom_segment for this. (The ggplot cheat sheet is very useful for answering these kinds of questions, BTW.)
-
Why does my scatter plot look like this?
I can't say for sure because I don't know what your ultimate aim is for your visualization. Check out the cheat sheet for ggplot2 here.
-
Import from Excel
Finally just do your analysis. You should also should give a try and see the cheat sheet for data importing on the tidyverse package.
-
[Request] How to best visualize percentages with R?
That said, when I’m trying to come up with an interesting way to visualize data, I find the ggplot cheat sheet very helpful: https://github.com/rstudio/cheatsheets/raw/main/data-visualization-2.1.pdf
-
Need help with variables
Here's a cheat sheet: https://github.com/rstudio/cheatsheets/blob/main/strings.pdf
-
Data manipulation in R
The cheat sheet of the stringr package should give you good overview of string manipulation/ regex in R.
-
I'm trying to recreate this plot but I keep failing
I would very highly recommend that rather than trying to get started by translating an existing graph, you check out some documentation about ggplot first. If nothing else, the ggplot cheat sheet from RStudio should help explain what the component parts of the code are, and that might help you figure out what you actually want to do.
nbdev
- The Jupyter+Git problem is now solved
-
What is literate programming used for?
One example I've seen is ML/DL folks using jupyter notebooks to develop DL libraries in jupyter notebooks, see https://github.com/fastai/nbdev
-
GitHub Accelerator: our first cohort and what's next
- https://github.com/fastai/nbdev: Increase developer productivity by 10x with a new exploratory programming workflow.
-
Startups are in first batch of GitHub OS Accelerator
9. Nbdev: Boost developer productivity with an exploratory programming workflow - https://nbdev.fast.ai/
-
Start learning python for a Statistician with SAS experience and little R experience
See if you like nbdev way of working with data through python and jupyter. nbdev is an optional part that will create python packages from jupyter notebooks. Also even the simple tutorials are opinionated and will guide you to unit test your code and write CICD pipelines.
- FastKafka - free open source python lib for building Kafka-based services
-
isn't this just too much for a take home assignment?
You probably don’t have time for this for the purposes of your task, but I will also throw in the recommendation of nbdev especially if you’re a Python person. I haven’t had a project to use it on yet, but I’ve gone through the docs and the walkthrough and it seems like a great framework for starting potential projects with all the infrastructure needed for if/when they eventually get big and need all the packaging and stuff
-
Any experience dealing with a non-technical manager?
nbdev: jupyter notebooks -> python package
-
Resources to bridge the gap between jupyter notebooks and regular python development
Take a look at https://github.com/fastai/nbdev - haven't used it but supposedly the whole if fast.ai library was written that way. It sounds like a natural direction in your scenario - allowing your to keep working in a familiar environment and still producing production ready code (will, at least in paper 😅)
- Rant: Jupyter notebooks are trash.
What are some alternatives?
tidytuesday - Official repo for the #tidytuesday project
papermill - 📚 Parameterize, execute, and analyze notebooks
forcats - 🐈🐈🐈🐈: tools for working with categorical variables (factors)
ploomber - The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
mostly-adequate-guide - Mostly adequate guide to FP (in javascript)
dbt - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications. [Moved to: https://github.com/dbt-labs/dbt-core]
ggplot2-book - ggplot2: elegant graphics for data analysis
jupytext - Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
mech - 🦾 Main repository for the Mech programming language. Start here!
rr - Record and Replay Framework
ggplot2 - An implementation of the Grammar of Graphics in R
Jupyter-PowerShell - Jupyter Kernel for PowerShell