Apache Spark - A unified analytics engine for large-scale data processing
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
What do I need to know about distributed algorithms and systems?
1 project | reddit.com/r/AskProgramming | 22 May 2022
AWS Glue: what is it and how does it work?
1 project | dev.to | 5 May 2022
Top Responsibilities of a Data Engineering Manager
1 project | reddit.com/r/dataengineering | 2 May 2022
Cannot find col function in pyspark
1 project | reddit.com/r/codehunter | 22 Apr 2022
1 project | reddit.com/r/196 | 24 Mar 2022