spark-sbt.g8
emr-job-templates
spark-sbt.g8 | emr-job-templates | |
---|---|---|
2 | 1 | |
73 | 2 | |
- | - | |
1.8 | 10.0 | |
about 3 years ago | over 1 year ago | |
Scala | Python | |
- | Creative Commons Zero v1.0 Universal |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spark-sbt.g8
-
Open source PySpark project idea
I built this for Spark Scala projects with SBT and it has been used extensively: https://github.com/MrPowers/spark-sbt.g8
-
Ask HN: What are some tools / libraries you built yourself?
I built daria (https://github.com/MrPowers/spark-daria) to make it easier to write Spark and spark-fast-tests (https://github.com/MrPowers/spark-fast-tests) to provide a good testing workflow.
quinn (https://github.com/MrPowers/quinn) and chispa (https://github.com/MrPowers/chispa) are the PySpark equivalents.
Built bebe (https://github.com/MrPowers/bebe) to expose the Spark Catalyst expressions that aren't exposed to the Scala / Python APIs.
Also build spark-sbt.g8 to create a Spark project with a single command: https://github.com/MrPowers/spark-sbt.g8
emr-job-templates
-
Open source PySpark project idea
I've played with the structure a bit in this repo ( https://github.com/dacort/emr-job-templates ), but am still learning poetry. Happy to collaborate and maybe build a a good shared repo?
What are some alternatives?
Pion WebRTC - Pure Go implementation of the WebRTC API
Nullboard - Nullboard is a minimalist kanban board, focused on compactness and readability.
tera - A template engine for Rust based on Jinja2/Django
GoJS, a JavaScript Library for HTML Diagrams - JavaScript diagramming library for interactive flowcharts, org charts, design tools, planning tools, visual languages.
yadm - Yet Another Dotfiles Manager
Shynet - Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.
sqldb-logger - A logger for Go SQL database driver without modifying existing *sql.DB stdlib usage.
Tabula - Extract tables from PDF files
vaku - vaku extends the vault api & cli
leapp - Leapp is the DevTool to access your cloud
kondo - Cleans dependencies and build artifacts from your projects.