xvc
Daft
Metric | xvc | Daft
---|---|---
Mentions | 3 | 7
Stars | 22 | 1,684
Growth | - | 38.2%
Activity | 7.7 | 9.8
Last commit | about 1 month ago | 3 days ago
Language | Rust | Rust
License | GNU General Public License v3.0 only | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
xvc
- Ask HN: Freelancer? Seeking freelancer? (January 2023)
SEEKING WORK | REMOTE | Istanbul | UTC Business Hours
ML / MLOps / Data Engineering
Looking for projects that I can contribute to in machine learning, data management, data pipelines, and similar technologies.
These days, I'm building an MLOps tool to manage data and pipelines on top of Git, in Rust.
https://github.com/iesahin/xvc (https://docs.xvc.dev)
I'm experienced in libraries like TensorFlow and PyTorch, languages (Rust, Python, Go, Dart, C, ...), and Docker, AWS, GCP. I used Vue.js & Flask for web and Flutter for mobile in the past, but these are not my current focus.
I'm also a technical writer, and studying copywriting and marketing these days.
I can also architect your software.
https://emresahin.net/cv/ (I'm in the middle of updating the site, but it gives you an idea.)
- Ask HN: What Are You Working on This Year?
Working on a new MLOps tool to manage files, data, pipelines, experiments and models.
I'm writing it in Rust. Using ECS instead of OOP. Going well so far.
It's open source (and in alpha). https://github.com/iesahin/xvc
Documentation: https://docs.xvc.dev
- Ask HN: Who wants to be hired? (December 2022)
Daft
- Daft: Distributed DataFrame for Python
There are benchmarks here - https://github.com/Eventual-Inc/Daft?tab=readme-ov-file#benc.... Seems to outperform Dask by a fair bit.
- Daft: A High-Performance Distributed Dataframe Library for Multimodal Data
Hi (one of the maintainers here), that is a good suggestion! I wasn't aware of that project. I went ahead and made an issue to add `export DO_NOT_TRACK=1` as one of the variables we track! https://github.com/Eventual-Inc/Daft/issues/1015
- Daft: The Distributed Python Dataframe
We are looking at supporting other distributed backends as well - please drop by our discussion forums (https://github.com/Eventual-Inc/Daft/discussions) and drop us a message if you have any suggestions! We’d love to hear from you :)
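The `DO_NOT_TRACK=1` opt-out mentioned in the maintainer's reply above can be illustrated with a minimal sketch. This is not Daft's actual implementation: the function name `telemetry_enabled` is hypothetical, and treating only the exact value `1` as an opt-out is an assumption made for the example.

```python
import os
from typing import Mapping, Optional


def telemetry_enabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return False when the user has opted out via DO_NOT_TRACK=1.

    Hypothetical helper for illustration; a real library may accept
    other values or consult additional settings.
    """
    # Fall back to the process environment when no mapping is passed in.
    env = os.environ if env is None else env
    # Assumption: only the exact value "1" (ignoring surrounding spaces) opts out.
    return env.get("DO_NOT_TRACK", "").strip() != "1"


if __name__ == "__main__":
    # Run with `export DO_NOT_TRACK=1` in the shell to see the opt-out path.
    print("telemetry enabled:", telemetry_enabled())
```

Passing the environment in as a mapping keeps the check easy to test without modifying the real process environment.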
What are some alternatives?
Home Assistant - 🏡 Open source home automation that puts local control and privacy first.
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Resume - Add latex resume here in case online latex generators blow up
hamilton - A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
roqr - QR codes that will rock your world
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
linux-surface - Linux Kernel for Surface Devices
quokka - Making data lake work for time series
zfsbootmenu - ZFS Bootloader for root-on-ZFS systems with support for snapshots and native full disk encryption
lightflus - A Lightweight, Cloud-Native Stateful Distributed Dataflow Engine
git-bug - Distributed, offline-first bug tracker embedded in git, with bridges
hamilton - Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.