StableLM
safetensors
StableLM | safetensors | |
---|---|---|
43 | 31 | |
15,853 | 2,488 | |
0.2% | 5.4% | |
5.0 | 8.2 | |
about 1 month ago | 10 days ago | |
Jupyter Notebook | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
StableLM
-
The Era of 1-bit LLMs: ternary parameters for cost-effective computing
https://github.com/Stability-AI/StableLM?tab=readme-ov-file#...
-
Stable LM 3B: Bringing Sustainable, High-Performance LMs to Smart Devices
https://mistral.ai/news/announcing-mistral-7b/
looking at the 3b results (here https://github.com/Stability-AI/StableLM#stablelm-alpha-v2 ?), it looks like Mistral (which outperforms Llama-2 13b) is far more powerful
-
FreeWilly 1 and 2, two new open-access LLMs
Does this mean Stability gave up on StableLM?
I notice that the repo hasn’t been updated since April, and a question asking for an update has been ignored for at least a month: https://github.com/Stability-AI/StableLM/issues/83
-
In five years, there will be no programmers left, believes Stability AI CEO
I'm not "ignoring" StableLM, if anything it's the impetus for my post. The alpha models were so bad and unusable that it seems they may have simply abandoned the project. It's clear they basically didn't know what they were doing, which is silly for a company of their size and specialization.
-
Losing the plot
1) StableLM released a checkpoint at 800B for their 3B and 7B at 800B tokens with 4096 context size, but perform very poorly on different benchmarks and finetuning is discouraged with such a weak base model
-
UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization
It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard.
- Consulta API GPT
- Google "We Have No Moat, And Neither Does OpenAI"
-
New to StableLM--is it possible to use this locally to fine-tune on a small subset of documents yet?
Someone shared this link on another recent post
-
[N] Stability AI releases StableVicuna: the world's first open source chatbot trained via RLHF
Github: https://github.com/Stability-AI/StableLM
safetensors
-
Llamafile lets you distribute and run LLMs with a single file
The ML field is doing work in that area: https://github.com/huggingface/safetensors
-
Hugging Face raises $235M from investors including Salesforce and Nvidia
FYI the file format, safetensors, was proposed, developed and maintained by HF, and involved people from groups such as Eleuther and Stability for external security audits.
https://github.com/huggingface/safetensors https://huggingface.co/blog/safetensors-security-audit
-
I Made Stable Diffusion XL Smarter by Finetuning It on Bad AI-Generated Images
Thank you for note on this. I had not heard there were already trojan horse malware being slipped into tensor files as python scripts. Apparently torch pickle uses eval on the tensor file with no filter.
Heard surprisingly little commentary on this topic. The full explanation of how Safetensors are "Safe" can be found from the developer at: https://github.com/huggingface/safetensors/discussions/111
- Pickle safety in Python
-
What makes .safetensors files safe?
Here the developer goes into some detail about what kinds of protections .safetensor files have : https://github.com/huggingface/safetensors/discussions/111
-
Security PSA: huggingface models are code. not just data.
Use the safetensors format, which allows safe persistence and loading of models for common libraries - TensorFlow, PyTorch, JAX, etc. We went through external audits in the last few months (blog post). The current direction will be to have this as the default format.
- What's your favorite model. Right now I'm really enjoying dreamshaper.
- Lora, ggml, safetensors, hf, etc. Is there a glossary and guide on which model to choose?
-
Stability AI Launches the First of Its StableLM Suite of Language Models
I've been diving in lately and while it's not efficient, the only way to do manage is to create a new conda/mamba environment, or a custom Docker image for all the conflicting packages.
For safety and speed, you should prefer the safetensor format: https://huggingface.co/docs/safetensors/speed
If you know what you are doing you can do your own conversions: https://github.com/huggingface/safetensors or for safety, https://huggingface.co/spaces/diffusers/convert
-
CKPT to Safetensors
GitHub - huggingface/safetensors: Simple, safe way to store and distribute tensors
What are some alternatives?
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
stable-diffusion-webui - Stable Diffusion web UI
lm-evaluation-harness - A framework for few-shot evaluation of language models.
llama.cpp - LLM inference in C/C++
Safe-and-Stable-Ckpt2Safetensors-Conversion-Tool-GUI - Convert your Stable Diffusion checkpoints quickly and easily.
ggml - Tensor library for machine learning
InvokeAI - InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Stable-Diffusion-Pickle-Scanner-GUI - Pickle Scanner GUI
alpaca_lora_4bit
stable-diffusion-webui-model-toolkit - A Multipurpose toolkit for managing, editing and creating models.