-
import jinja2

template = jinja2.Template("""
# Awesome Big Data

A curated list of awesome big data frameworks, libraries, software and resources.

Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
""")

template.render()
-
-
zfs-localpv
Dynamically provision stateful, persistent, node-local volumes and filesystems for Kubernetes, integrated with a backend ZFS data storage stack.
-
-
ListItem(name='Apache Superset', website='https://superset.apache.org/', category='Visualization Frameworks', short_description='Apache Superset is an open-source, cloud-native application for data exploration and data visualization, able to handle data at petabyte scale.'),
-
ListItem(name='Apache Spark', website='https://spark.apache.org/', category='Batch Processing', short_description='Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.'),
-
ListItem(name='Apache Hive', website='https://hive.apache.org/', category='Interactive Query', short_description='Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop.'),
-
CouchDB
Seamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability
ListItem(name='CouchDB', website='https://couchdb.apache.org/', category='NoSQL :: Document Databases', short_description='Apache CouchDB is an open-source document-oriented NoSQL database, implemented in Erlang. CouchDB uses multiple formats and protocols to store, transfer, and process its data. It uses JSON to store data, JavaScript as its query language using MapReduce, and HTTP for an API.'),
-
ListItem(name='Apache Beam', website='https://beam.apache.org/', category='Batch Processing', short_description='Apache Beam is an open-source unified programming model to define and execute data processing pipelines, including ETL, batch, and stream processing.'),
-
aws-lambda-java-libs
Official mirror for interface definitions and helper classes for Java code running on the AWS Lambda platform.
ListItem(name='AWS Lambda', website='https://aws.amazon.com/lambda/', category='Serverless Functions', short_description='AWS Lambda is an event-driven, serverless computing platform provided by Amazon as a part of Amazon Web Services. It is a computing service that runs code in response to events and automatically manages the computing resources required by that code.'),
-
ListItem(name='Apache Airflow', website='https://airflow.apache.org/', category='Workflow Engine', short_description='Apache Airflow is an open-source workflow management platform for data engineering pipelines.'),
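
The `ListItem(...)` entries above imply a small record type with four fields, grouped by `category` when the list is rendered. The original definition of `ListItem` is not shown, so the following is only a minimal sketch under assumptions: the dataclass, the `render_markdown` helper, and the shortened descriptions are hypothetical, and plain string formatting stands in for the jinja2 template so the example has no third-party dependency.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class ListItem:
    """Hypothetical record type; field names are taken from the entries above."""
    name: str
    website: str
    category: str
    short_description: str


def render_markdown(items):
    """Group items by category and emit one Markdown section per category."""
    by_category = defaultdict(list)
    for item in items:
        by_category[item.category].append(item)

    lines = []
    for category in sorted(by_category):
        lines.append(f"## {category}")
        lines.append("")
        # Sort entries by name so the output is stable across runs.
        for item in sorted(by_category[category], key=lambda i: i.name):
            lines.append(f"- [{item.name}]({item.website}) - {item.short_description}")
        lines.append("")
    return "\n".join(lines)


# Descriptions are shortened here for readability (an assumption, not the source text).
items = [
    ListItem(name='Apache Spark', website='https://spark.apache.org/',
             category='Batch Processing',
             short_description='Unified analytics engine for large-scale data processing.'),
    ListItem(name='Apache Beam', website='https://beam.apache.org/',
             category='Batch Processing',
             short_description='Unified programming model for batch and stream pipelines.'),
    ListItem(name='Apache Airflow', website='https://airflow.apache.org/',
             category='Workflow Engine',
             short_description='Workflow management platform for data engineering pipelines.'),
]

markdown = render_markdown(items)
print(markdown)
```

Keeping the entries as data and generating the Markdown from them means a new project only needs one more `ListItem(...)` line; the same grouped structure could equally be passed to a jinja2 template instead of being formatted by hand.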