warc-parquet
lance
warc-parquet | lance | |
---|---|---|
4 | 10 | |
99 | 3,275 | |
- | 2.7% | |
6.8 | 9.8 | |
2 days ago | about 17 hours ago | |
Rust | Rust | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
warc-parquet
lance
- The Nimble File Format by Meta
-
Supabase Storage: now supports the S3 protocol
you should look at lance(https://lancedb.github.io/lance/)
-
Understanding Parquet, Iceberg and Data Lakehouses
Parquet has been the lakehouse file format of choice for nearly half a decade. But we are starting to see other contenders that are optimized more for lower latency like lance https://github.com/lancedb/lance
- FLaNK Stack Weekly for 12 June 2023
- FLaNK Stack 5-June-2023
- [Show HN] Lance is a Rust-based alternative to Parquet for ML data
-
Show HN: Lance is a Rust-based alternative to Parquet for ML data
getting bunch of 404s on the docs. for example https://eto-ai.github.io/lance/format.html (But this works: https://lancedb.github.io/lance/*)
Did you guys just pivot from eto-ai to lancedb?
-
Any job processing framework like Spark but in Rust?
For Feature Stores check out: https://github.com/eto-ai/lance
- Show HN: Lance – Deep Learning with DuckDB and Arrow
What are some alternatives?
sqlite-parquet-vtable - A SQLite vtable extension to read Parquet files
roop - one-click face swap
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Lixur - Lixur is an open-sourced project that seeks to build a scalable, feeless, decentralized, quantum-secure, and easy-to-use blockchain with smart, and intelligent (A.I.) contract functionality.
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Rio - A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.
chatdocs - Chat with your documents offline using AI.
scratch-pdf-bot - Prototyping a question and answer bot over PDFs
datafusion-ballista - Apache Arrow Ballista Distributed Query Engine
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
documenso - The Open Source DocuSign Alternative.
dinov2 - PyTorch code and models for the DINOv2 self-supervised learning method.