-
uber-expenses-tracking
The goal of this project is to track Uber Rides and Uber Eats expenses through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
-
pyspark-on-aws-emr
The goal of this project is to offer a ready-to-use AWS EMR template with Spot Fleet and On-Demand Instances, so you can focus on writing PySpark code.
-
wbz
A parallel implementation of the bzip2 data compressor in Python. This data compression pipeline uses algorithms such as the Burrows–Wheeler transform (BWT) and move-to-front (MTF) to improve Huffman compression. For now, the tool focuses only on compressing .csv files and other files in tabular format.
-
data-engineering-challenge-th
Dockerizing a Python script for web scraping and consuming the scraped data using FastAPI (www.metroscubicos.com)
-
distance-metrics
Distance metrics are among the most important parts of many machine learning algorithms, both supervised and unsupervised. They help us calculate and measure similarities between numerical values expressed as data points.
-
text-analysis-speeches-amlo
Text analysis of the speeches, conferences and interviews of the current president of Mexico
-
Dropout-Students-Prediction
The goal of this project is to identify students at risk of dropping out of school
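The wbz entry above says that BWT and MTF are used to improve Huffman compression. As a rough illustration of why, here is a minimal, unoptimized sketch of the two transforms. It is not taken from the wbz code base: the function names, the end-of-string marker `\x03`, and the naive sort-all-rotations approach are assumptions made for this example. BWT groups similar characters together, and MTF then turns those runs into many small indices, which a Huffman coder can encode with short codes.

```python
EOS = "\x03"  # assumed end-of-string marker; must not occur in the input


def bwt(text: str) -> str:
    """Naive Burrows-Wheeler transform: last column of the sorted rotations."""
    if EOS in text:
        raise ValueError("input must not contain the end-of-string marker")
    s = text + EOS
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(last_column: str) -> str:
    """Rebuild the original text by repeatedly prepending and sorting."""
    n = len(last_column)
    table = [""] * n
    for _ in range(n):
        table = sorted(last_column[i] + table[i] for i in range(n))
    original = next(row for row in table if row.endswith(EOS))
    return original[:-1]


def mtf_encode(data: str) -> tuple[list[int], list[str]]:
    """Move-to-front: emit each symbol's position, then move it to the front."""
    alphabet = sorted(set(data))
    symbols = list(alphabet)
    indices = []
    for ch in data:
        idx = symbols.index(ch)
        indices.append(idx)
        symbols.insert(0, symbols.pop(idx))
    return indices, alphabet


def mtf_decode(indices: list[int], alphabet: list[str]) -> str:
    """Invert mtf_encode given the same starting alphabet."""
    symbols = list(alphabet)
    out = []
    for idx in indices:
        ch = symbols.pop(idx)
        out.append(ch)
        symbols.insert(0, ch)
    return "".join(out)
```

A round trip would look like `mtf_decode(*mtf_encode(bwt(text)))` followed by `inverse_bwt`. A real compressor would use a suffix-array-based BWT and operate on bytes in blocks rather than sorting full rotations of a string.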
Tracking your Uber Rides and Uber Eats expenses through a data engineering process
Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag
Building Big Data Pipelines in the Cloud with AWS EMR
Building a Lossless Data Compression and Data Decompression Pipeline
Learn how to dockerize an Apache Spark Standalone Cluster
Dockerizing and Consuming an Apache Livy environment
Design, Development and Deployment of a simple Data Pipeline
Dockerizing a Python Script for Faster Web Scraping
Understanding Similarity Measures for Text Analysis
Learn how to build a content-based Movie Recommender System
A Text Analysis of Speeches
Dropout Students Prediction
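The distance-metrics entry describes measuring similarity between numerical values expressed as data points. As a small illustration, the sketch below implements three commonly used measures in plain Python. The choice of metrics and the function names are assumptions for this example and are not taken from the distance-metrics repository.

```python
import math
from typing import Sequence


def _check_same_length(a: Sequence[float], b: Sequence[float]) -> None:
    if len(a) != len(b):
        raise ValueError("points must have the same number of dimensions")


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Straight-line distance between two points."""
    _check_same_length(a, b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute coordinate differences."""
    _check_same_length(a, b)
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """One minus the cosine of the angle between two non-zero vectors."""
    _check_same_length(a, b)
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine distance is undefined for zero vectors")
    dot = sum(x * y for x, y in zip(a, b))
    return 1.0 - dot / (norm_a * norm_b)
```

Euclidean and Manhattan distance depend on magnitude, while cosine distance depends only on direction, which is one reason cosine-based measures are often preferred for comparing text vectors of different lengths.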