A peek into Location Data Science at Ola

This page summarizes the projects mentioned and recommended in the original post on dev.to

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • Dask

    Parallel computing with task scheduling

  • Data scientists work on phenomenally large datasets, and Dask is a handy tool for exploration within the confines of a single cloud VM or their local PCs. Location data visualization is an essential part of deciding further algorithm development and roadmap for projects. This lays the foundation for data engineering and science to work at scale, with petabytes of data.

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

  • ApacheKafka

    A curated re-sources list for awesome Apache Kafka

  • This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

  • Apache Hadoop

    Apache Hadoop

  • This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

    This requires the use of distributed computation tools such as Spark and Hadoop, Flink and Kafka are used. But for occasional experimentation, Pandas, Geopandas and Dask are some of the commonly used tools.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Build A Covid-19 EDA & Viz App Using Streamlit

    3 projects | dev.to | 5 Dec 2022
  • Machine Learning Pipelines with Spark: Introductory Guide (Part 1)

    5 projects | dev.to | 23 Oct 2022
  • AutoCodeRover resolves 22% of real-world GitHub in SWE-bench lite

    8 projects | news.ycombinator.com | 9 Apr 2024
  • Introducing Flama for Robust Machine Learning APIs

    11 projects | dev.to | 18 Dec 2023
  • Apache Spark VS quix-streams - a user suggested alternative

    2 projects | 7 Dec 2023