ChatGLM-6B
datagen
ChatGLM-6B | datagen | |
---|---|---|
17 | 7 | |
39,341 | 135 | |
1.6% | 3.0% | |
8.4 | 6.1 | |
2 months ago | about 2 months ago | |
Python | TypeScript | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ChatGLM-6B
-
What are the current fastest multi-gpu inference frameworks?
ChatGLM seems to be pretty popular but I've never used this before.
-
A CEO is spending more than $2,000 a month on ChatGPT Plus accounts for all of his employees, and he says it's saving 'hours' of time
There are also locally hosted options that approach the effectiveness of ChatGPT. This GLM for example was specifically trained to be able to be processed on a single consumer grade GPU
- Open Source Chinese LLMs
- ChatGLM-6B: run locally on consumer graphics card (6GB of GPU memory required)
- Ask HN: Open source LLM for commercial use?
-
Coding LLaMa Modell?
A link to for y'all. Definitely gonna try to mess around with this!
- 关于GPT,AI和未来的一些社会经济问题,向诸位请教
- FLiPN-FLaNK Stack Weekly for 20 March 2023
- ChatGLM-6B - an open source 6.2 billion parameter English/Chinese bilingual LLM trained on 1T tokens, supplemented by supervised fine-tuning, feedback bootstrap, and Reinforcement Learning from Human Feedback. Runs on consumer grade GPUs
- ChatGLM: Open bilingual language model based on General Language Model framework
datagen
-
What are your favorite tools or components in the Kafka ecosystem?
For fake data, shameless plug for https://github.com/MaterializeInc/datagen/tree/main
- What are some good publicly available real-time data sources?
-
Simulating Streaming Data for Fraud Detection with Datagen CLI
Building and testing a real-time fraud detection application requires a continuous stream of realistic data. But generating that data can be a challenge. That's why we recently created the Datagen CLI, a simple tool that helps you create believable fake data using the FakerJS API.
-
How train my SQL skills with real world data engineering problems ?
Generate fake data with a normalized schema of your choosing with this tool from Materialize, then denormalize it and build a warehouse model.
- FLiPN-FLaNK Stack Weekly for 20 March 2023
- Datagen CLI: Stream Fake Relational Data
What are some alternatives?
llama.cpp - LLM inference in C/C++
CloudDemo2023 - 2023 Demos
alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
halp - A CLI tool to get help with CLI tools 🐙
stanford_alpaca - Code and documentation to train Stanford's Alpaca models, and generate the data.
awesome-public-real-time-datasets - A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
RedfinScraper - Scrapes Redfin data.
basaran - Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
cf-url-shortener - URL Shortener Cloudflare function that uses Upstash Redis and Kafka along with https://materialize.com
accelerate - 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
debezium-ui - A web UI for Debezium; Please log issues at https://issues.redhat.com/browse/DBZ.