bebe
null
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bebe
-
Ask HN: What are some tools / libraries you built yourself?
I built daria (https://github.com/MrPowers/spark-daria) to make it easier to write Spark and spark-fast-tests (https://github.com/MrPowers/spark-fast-tests) to provide a good testing workflow.
quinn (https://github.com/MrPowers/quinn) and chispa (https://github.com/MrPowers/chispa) are the PySpark equivalents.
Built bebe (https://github.com/MrPowers/bebe) to expose the Spark Catalyst expressions that aren't exposed to the Scala / Python APIs.
Also build spark-sbt.g8 to create a Spark project with a single command: https://github.com/MrPowers/spark-sbt.g8
-
Finished porting all the Spark SQL functions that aren't exposed via the Scala API to the bebe project
The bebe project fills all these gaps in the Scala API. See the project README for examples on how each function works.
-
Making the Spark DataFrame composition type safe(r)
See here for a more detailed discussion and let me know your thoughts!!
null
-
Ask HN: What are some tools / libraries you built yourself?
I build a logging library for Go, because I couldn't find one that logs to stdout AND stderr. If you used a logging lib on GCP for example, all log output went into the same pile of junk and it was hard to find "real" errors: https://github.com/emvi/logbuch
Then there is "null", also because I couldn't find one that got both, marshalling to JSON and be able to store null values in db: https://github.com/emvi/null
And finally, our "flagship" open-source project Pirsch, an embedded library for web analytics: https://github.com/pirsch-analytics/pirsch
What are some alternatives?
frameless - Expressive types for Spark.
go-edlib - 📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
kondo - Cleans dependencies and build artifacts from your projects.
nan - Zero allocation Nullable structures in one library with handy conversion functions, marshallers and unmarshallers
sqldb-logger - A logger for Go SQL database driver without modifying existing *sql.DB stdlib usage.
gocache - ☔️ A complete Go cache library that brings you multiple ways of managing your caches
gutenberg - A fast static site generator in a single binary with everything built-in. https://www.getzola.org
algorithms - CLRS study. Codes are written with golang.
yadm - Yet Another Dotfiles Manager
gota - Gota: DataFrames and data wrangling in Go (Golang)
Shynet - Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.
bitmap - Simple dense bitmap index in Go with binary operators