zef
v6d
zef | v6d | |
---|---|---|
- | 5 | |
107 | 804 | |
0.9% | 0.6% | |
2.8 | 9.5 | |
about 2 months ago | 3 days ago | |
Python | C++ | |
Apache 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
zef
We haven't tracked posts mentioning zef yet.
Tracking mentions began in Dec 2020.
v6d
-
Has anyone here had experience using Vineyard?
Brief Overview for any interested: Vineyard (v6d) is an in-memory immutable data manager that provides out-of-the-box high-level abstraction and zero-copy in-memory sharing for distributed data in big data tasks, such as graph analytics (e.g., GraphScope), numerical computing (e.g., Mars), and machine learning.
-
GitHub “allows” unauthorized users “merging” PRs, bypass write permission check
- https://github.com/v6d-io/v6d/pull/948
-
[P] Bridging Dask and Tensorflow for distributed machine learniing with Vineyard
We propose vineyard, https://github.com/v6d-io/v6d to address such challenges, which, provides efficient zero-copy data sharing between different compute engines, without extra cost of copying and serialization, compared other similar solutions.
- Vineyard 0.2.7: Airflow, Dask, and better ML experience
- Vineyard v0.2.0: big-data applications optimization on Kubernetes
What are some alternatives?
Optimus - :truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
cpp-ipc - C++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.
ga-extractor - Tool for extracting Google Analytics data suitable for migrating to other platforms/databases
shadesmar - Fast C++ IPC using shared memory
AWS Data Wrangler - pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
iceoryx - Eclipse iceoryx™ - true zero-copy inter-process-communication
NebulaGraph Database - A distributed, fast open-source graph database featuring horizontal scalability and high availability
GraphScope - 🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统