zilla-examples
PowerInfer
zilla-examples | PowerInfer | |
---|---|---|
2 | 4 | |
19 | 7,008 | |
- | 3.6% | |
8.4 | 9.8 | |
13 days ago | 3 days ago | |
Shell | C++ | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
zilla-examples
- FLaNK 25 December 2023
-
A Primer on Server-Sent Events (SSE) — The “what”, “how” and “why” of one of the best ways to push data (and Kafka) across the web.
If you want to see how Zilla brings together Kafka and SSE, you can run a quick demo example here: https://github.com/aklivity/zilla-examples/tree/main/sse.kafka.fanout
PowerInfer
- FLaNK 25 December 2023
- High-Speed Large Language Model Serving on PCs with Consumer-Grade GPUs
-
PowerInfer: Fast Large Language Model Serving with a Consumer-Grade GPU [pdf]
> PowerInfer’s source code is publicly available at https://github.com/SJTU-IPADS/PowerInfer
- PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs
What are some alternatives?
splatter-image - Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024
Cgml - GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
MonitoFi - MonitoFi: Health & Performance Monitor for your Apache NiFi
llama.cpp - LLM inference in C/C++
huh - Build terminal forms and prompts 🤷🏻♀️
GenerativeAIExamples - Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
jetson_easy - 🔩 Automatically script to setup and configure your NVIDIA Jetson [Nano, Xavier, TX2i, TX2, TX1, TK1] . This script run different modules to update, fix and patch the kernel, install ROS and other...
FLaNK-SaoPauloBrazil - FLaNK-SaoPauloBrazil
yq - yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor
k3s - Lightweight Kubernetes
tbmq - Open-source, scalable, and fault-tolerant MQTT broker able to handle 4M+ concurrent client connections, supporting at least 3M messages per second throughput per single cluster node with low latency delivery. The cluster mode supports more than 100M concurrently connected clients.
llama2-high-level-cpp - Inference Llama2 with High-Level C++.