How to be Test Driven with Spark: Chapter 5: Leverage spark in a container

This page summarizes the projects mentioned and recommended in the original post on dev.to

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. docker

    FreeBSD port of docker, take a look at PORTING-FREEBSD.md in freebsd-compat branch (by kvasdopil)

    If you are unfamiliar with the concept of containers and docker images, I suggest you have a look at docker. It will be leveraged here to start the Spark server for the tests; it's important to mention there are other opensource alternatives like podman or nerdctl to allow containerization.

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. nerdctl

    contaiNERD CTL - Docker-compatible CLI for containerd, with support for Compose, Rootless, eStargz, OCIcrypt, IPFS, ...

    If you are unfamiliar with the concept of containers and docker images, I suggest you have a look at docker. It will be leveraged here to start the Spark server for the tests; it's important to mention there are other opensource alternatives like podman or nerdctl to allow containerization.

  4. podman

    Podman: A tool for managing OCI containers and pods.

    If you are unfamiliar with the concept of containers and docker images, I suggest you have a look at docker. It will be leveraged here to start the Spark server for the tests; it's important to mention there are other opensource alternatives like podman or nerdctl to allow containerization.

  5. spark_tdd

    The same error should appear, indeed the spark_session needs to be adapted to connect to the server you have just created. In test/conftest.py:

  6. testcontainers-python

    Testcontainers is a Python library that providing a friendly API to run Docker container. It is designed to create runtime environment to use during your automatic tests.

    To improve this, you will leverage testcontainers which empower you with easy docker creation and removal at the test level.

  7. ffizer

    ffizer is a files and folders initializer / generator. Create any kind (or part) of project from template.

    Templatize the repository for easier reusage with the help of ffizer

  8. spec

    Development Containers: Use a container as a full-featured development environment. (by devcontainers)

    Leverage devcontainer to improve ci and local development

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Container and image vocabulary

    4 projects | dev.to | 3 Apr 2023
  • [Networking][Podman]:Need help setting up my Jellyfin server using Podman, which is accessible ONLY to LAN

    1 project | /r/jellyfin | 10 Mar 2023
  • Is it recommended to chose `pasta` over `slirp4netns` if native IPs are required?

    5 projects | /r/podman | 6 Mar 2023
  • backing up unprivileged Podman volume mounts with restic and `podman unshare`?

    1 project | /r/restic | 5 Mar 2023
  • netavark via homebrew?

    2 projects | /r/podman | 3 Mar 2023

Did you know that Go is
the 4th most popular programming language
based on number of references?