llm-vscode-inference-server
An endpoint server for efficiently serving quantized open-source LLMs for code. (by wangcx18)
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型 (by THUDM)
Metric | llm-vscode-inference-server | ChatGLM2-6B
---|---|---
Mentions | 1 | 4
Stars | 44 | 15,549
Growth | - | 0.3%
Activity | 5.3 | 6.6
Last commit | 8 months ago | about 2 months ago
Language | Python | Python
License | Apache License 2.0 | GNU General Public License v3.0 or later
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llm-vscode-inference-server
Posts with mentions or reviews of llm-vscode-inference-server.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-10-11.
- Replit's new AI Model now available on Hugging Face
"Requests for code generation are made via an HTTP request.
You can use the Hugging Face Inference API or your own HTTP endpoint, provided it adheres to the API specified here[1] or here[2]."
It's fairly easy to use your own model locally with the plugin. You can just use one of the community-developed inference servers, which are listed at the bottom of the page, but here are the links[3] to both[4].
[1]: https://huggingface.co/docs/api-inference/detailed_parameter...
[2]: https://huggingface.github.io/text-generation-inference/#/Te...
[3]: https://github.com/wangcx18/llm-vscode-inference-server
[4]: https://github.com/wangcx18/llm-vscode-inference-server
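The quoted post says completions are requested over plain HTTP. As a rough illustration, here is a minimal Python sketch of such a request. It assumes the server accepts a text-generation-inference style JSON body (`inputs` plus `parameters`) and replies with a `generated_text` field, either as an object or as a one-element list. The host, port, path, and helper names are hypothetical and are not taken from either project.

```python
import json
import urllib.request

# Assumption: a local inference server is listening here.
# The host, port, and path are hypothetical placeholders.
ENDPOINT = "http://localhost:8000/generate"


def build_request(prompt: str, max_new_tokens: int = 60) -> urllib.request.Request:
    """Build a POST request carrying a TGI-style JSON body."""
    body = {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_text(payload) -> str:
    """Accept {"generated_text": ...} or [{"generated_text": ...}]."""
    if isinstance(payload, list):
        payload = payload[0]
    return payload["generated_text"]


def complete(prompt: str) -> str:
    """Send the prompt to the endpoint and return the generated text."""
    with urllib.request.urlopen(build_request(prompt), timeout=30) as resp:
        return extract_text(json.loads(resp.read().decode("utf-8")))
```

With a compatible server running, `complete("def fibonacci(n):")` would return the model's continuation. The request and response shapes should be checked against whichever API specification the chosen server actually implements.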
ChatGLM2-6B
Posts with mentions or reviews of ChatGLM2-6B.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-06-25.
- Are We Overlooking China's Progress in AI?
- A new open-source language model claims to have surpassed GPT-4 right now. This needs to be fact-checked
If its benchmark results, e.g. on MMLU few-shot, hold and are indicative of its actual performance (which, mind you, isn't a given, for 6B or for 130B), this 6B should be competitive with decent 30Bs. Plus natively long context and MQA. This is genuinely interesting, unlike boomer noises about CCP poisoning our checkpoints or whatever.
What are some alternatives?
When comparing llm-vscode-inference-server and ChatGLM2-6B you can also consider the following projects:
llmflows - LLMFlows - Simple, Explicit and Transparent LLM Apps
PentestGPT - A GPT-empowered penetration testing tool