| | SDV | kuasar |
|---|---|---|
| Mentions | 59 | 4 |
| Stars | 2,141 | 1,177 |
| Growth | 2.4% | 2.0% |
| Activity | 9.4 | 8.5 |
| Last commit | 7 days ago | 24 days ago |
| Language | Python | Rust |
| License | GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SDV
- Synthetic data generation for tabular data
Can someone help me understand the licensing of this?
https://github.com/sdv-dev/SDV/blob/main/LICENSE
It was MIT licensed up until 2022, when it was changed to what it is now. They say that it will become MIT again 4 years after release... but is that counted from when the license was changed, or from the first release of the software on GitHub?
- SDV: NEW Data - star count:1441.0
- FLaNK Stack Weekly for 30 April 2023
- SDV: NEW Data - star count:1196.0
kuasar
- The advantage of WASM compared with container runtimes
Right now most early examples, alas, boot a container with a wasm runtime for each wasm instance, which is a sad waste. The whole advantage of wasm should be very lightweight, low-overhead runtime instances atop a common wasm process. Having a process or container for each instance loses a ton of the benefit and makes it not much better than a regular container.
Thankfully there is work like the Containerd Sandbox API which enables new architectures like this. https://github.com/containerd/containerd/issues/4131
It's still being used to spawn a wasm process per instance for now, but the container runtime project Kuasar is already using the Sandbox API to save significant resources, and has already chimed in in comments on HN to express a desire for shared-process, multi-wasm-instance runtimes, which could indeed allow sub-ms spawning and enable instance-per-request architectures. https://github.com/kuasar-io/kuasar
- FLaNK Stack Weekly for 30 April 2023
- Kuasar - A Container Runtime in Rust
- Kuasar: An efficient multi-sandbox container runtime
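The comment above contrasts two architectures: one OS process (or container) per wasm instance, versus many lightweight instances inside a shared runtime process. The following is only a toy model of that contrast in plain Python. It does not use wasm, containerd, or Kuasar, and the names `SharedRuntime` and `spawn_process_per_instance` are hypothetical, introduced here for illustration.

```python
import os
import subprocess
import sys


class SharedRuntime:
    """Toy stand-in for one runtime process hosting many isolated instances."""

    def __init__(self):
        self._instances = {}  # instance id -> private state
        self._next_id = 0

    def spawn(self):
        """Create a lightweight instance in this process (no fork/exec)."""
        instance_id = self._next_id
        self._next_id += 1
        self._instances[instance_id] = {
            "memory": bytearray(16),  # each instance gets its own memory
            "pid": os.getpid(),       # but all share the host process
        }
        return instance_id

    def write(self, instance_id, offset, value):
        self._instances[instance_id]["memory"][offset] = value

    def read(self, instance_id, offset):
        return self._instances[instance_id]["memory"][offset]

    def host_pid(self, instance_id):
        return self._instances[instance_id]["pid"]


def spawn_process_per_instance():
    """Per-instance model: every instance pays for a whole new OS process."""
    result = subprocess.run(
        [sys.executable, "-c", "import os; print(os.getpid())"],
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout.strip())


if __name__ == "__main__":
    runtime = SharedRuntime()
    a, b = runtime.spawn(), runtime.spawn()
    runtime.write(a, 0, 7)
    print("instance a sees", runtime.read(a, 0), "instance b sees", runtime.read(b, 0))
    print("shared host pid:", runtime.host_pid(a) == runtime.host_pid(b))
    print("separate process pid differs:", spawn_process_per_instance() != os.getpid())
```

In the shared model, creating an instance is just a dictionary insert and a small allocation, while the per-instance model has to start a full process each time. That gap is the overhead the comment argues a shared-process runtime could avoid.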
What are some alternatives?
CTGAN - Conditional GAN for generating synthetic tabular data.
kata-containers - Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workload isolation and security advantages of VMs. https://katacontainers.io/
gretel-python-client - The Gretel Python Client allows you to interact with the Gretel REST API.
keras-ocr - A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
machine-learning-for-trading - Code for Machine Learning for Algorithmic Trading, 2nd edition.
pandas-ai - Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
tsfresh - Automatic extraction of relevant features from time series.
HealthGPT - Query your Apple Health data with natural language 💬 🩺
Copulas - A library to model multivariate data using copulas.
orbstack - Fast, light, simple Docker containers & Linux machines for macOS
TimeSynth - A Multipurpose Library for Synthetic Time Series Generation in Python
agorakube - Agorakube is a Certified Kubernetes Distribution built on top of CNCF ecosystem that provides an enterprise grade solution following best practices to manage a conformant Kubernetes cluster for on-premise and public cloud providers.