pyspark methods to enhance developer productivity 📣 👯 🎉 (by MrPowers)
quinn is a library with PySpark helper functions. I need to work through all the open issues / PRs and bump all versions. I should do another release. This library gets around 600,000 monthly downloads.
PySpark test helper methods with beautiful error messages
chispa is a library of PySpark testing functions.
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
Delta Acceptance Testing (by delta-incubator)
Delta Acceptance Testing (dat) is a library that creates Delta Lake reference tables. This project is being done with the core Delta Lake devs. We need to build out all the reference tables and write tests to make sure PySpark can fully implement the Delta Lake protocol.
Pyspark now provides a native Pandas API
3 projects | reddit.com/r/Python | 2 Jan 2022
Why Databricks Is Winning
5 projects | news.ycombinator.com | 14 Feb 2021
Spark open source community is awesome
5 projects | reddit.com/r/apachespark | 29 Dec 2022
installing pyspark on my m1 mac, getting an env error
2 projects | reddit.com/r/apachespark | 4 Jun 2022
Spark: local dev environment
2 projects | reddit.com/r/dataengineering | 7 Feb 2022