A peek into Location Data Science at Ola

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

Dask

32 12,022 9.6 Python

Parallel computing with task scheduling

Data scientists work on phenomenally large datasets, and Dask is a handy tool for exploration within the confines of a single cloud VM or their local PCs. Location data visualization is an essential part of deciding further algorithm development and roadmap for projects. This lays the foundation for data engineering and science to work at scale, with petabytes of data.

Apache Spark

101 38,414 10.0 Scala

Apache Spark - A unified analytics engine for large-scale data processing

This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Pandas

396 41,983 10.0 Python

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

ApacheKafka

104 28 0.0

A curated re-sources list for awesome Apache Kafka

This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

Apache Hadoop

26 14,342 9.9 Java

Apache Hadoop

This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

flink-statefun

18 495 5.1 Java

Apache Flink Stateful Functions

This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Build A Covid-19 EDA & Viz App Using Streamlit

3 projects | dev.to | 5 Dec 2022
Machine Learning Pipelines with Spark: Introductory Guide (Part 1)

5 projects | dev.to | 23 Oct 2022
AutoCodeRover resolves 22% of real-world GitHub in SWE-bench lite

8 projects | news.ycombinator.com | 9 Apr 2024
Introducing Flama for Robust Machine Learning APIs

11 projects | dev.to | 18 Dec 2023
Apache Spark VS quix-streams - a user suggested alternative

2 projects | 7 Dec 2023

A peek into Location Data Science at Ola

This page summarizes the projects mentioned and recommended in the original post on dev.to
Python Science and Data analysis Pandas MapReduce Machine Learning
Post date: 26 Sep 2022

Dask

Apache Spark

InfluxDB

Pandas

ApacheKafka

Apache Hadoop

flink-statefun

Related posts

Build A Covid-19 EDA & Viz App Using Streamlit

Machine Learning Pipelines with Spark: Introductory Guide (Part 1)

AutoCodeRover resolves 22% of real-world GitHub in SWE-bench lite

Introducing Flama for Robust Machine Learning APIs

Apache Spark VS quix-streams - a user suggested alternative

A peek into Location Data Science at Ola

This page summarizes the projects mentioned and recommended in the original post on dev.to Python Science and Data analysis Pandas MapReduce Machine Learning Post date: 26 Sep 2022

Dask

Apache Spark

InfluxDB

Pandas

ApacheKafka

Apache Hadoop

flink-statefun

Related posts

Build A Covid-19 EDA & Viz App Using Streamlit

Machine Learning Pipelines with Spark: Introductory Guide (Part 1)

AutoCodeRover resolves 22% of real-world GitHub in SWE-bench lite

Introducing Flama for Robust Machine Learning APIs

Apache Spark VS quix-streams - a user suggested alternative

This page summarizes the projects mentioned and recommended in the original post on dev.to
Python Science and Data analysis Pandas MapReduce Machine Learning
Post date: 26 Sep 2022