Projects of the Udacity Data Engineering Nanodegree Program.
Coding examples here.
Apache Spark - A unified analytics engine for large-scale data processing
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
Hadoop is an ecosystem of tools for big data storage and data analysis. It is older than Spark and writes intermediate results to disk whereas Spark tires to keep data in memory whenever possible, so this is faster in many use cases.
Big Data Processing, EMR with Spark and Hadoop | Python, PySpark
2 projects | dev.to | 27 Mar 2022
Spark is lit once again
6 projects | dev.to | 29 Oct 2021
5 Best Big Data Frameworks You Can Learn in 2021
3 projects | dev.to | 18 Jun 2021
How would a professional structure this project idea?
1 project | reddit.com/r/datascience | 19 May 2022
(re: notebooks in production) Notebooks and MLOps. Choose one.
1 project | reddit.com/r/mlops | 15 May 2022