DeepSeek-Coder
setfit
DeepSeek-Coder | setfit | |
---|---|---|
8 | 13 | |
5,567 | 2,014 | |
8.9% | 5.5% | |
8.6 | 9.2 | |
about 1 month ago | 19 days ago | |
Python | Jupyter Notebook | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DeepSeek-Coder
-
Meta Llama 3
deepseek-coder-instruct 6.7B still looks like is better than llama 3 8B on HumanEval [0], and deepseek-coder-instruct 33B still within reach to run on 32 GB Macbook M2 Max - Lamma 3 70B on the other hand will be hard to run locally unless you really have 128GB ram or more. But we will see in the following days how it performs in real life.
[0] https://github.com/deepseek-ai/deepseek-coder?tab=readme-ov-...
-
Mistral Remove "Committing to open models" from their website
Deepseek (https://github.com/deepseek-ai/DeepSeek-Coder?tab=readme-ov-...) code is MIT and the model license is available too.
- FLaNK Stack 05 Feb 2024
-
Stable Code 3B: Coding on the Edge
https://github.com/deepseek-ai/deepseek-coder
33B Instruct doesn’t beat 6.7B Instruct by much but maybe those % improvements mean more for your usage.
I run 6.7B since I have 16GB RAM.
-
What the heck is so great about this model?
Deepseek Coder: https://github.com/deepseek-ai/DeepSeek-Coder (Best open source coding model right now)
- Deepseek Coder instruct – 6.7B model beats gpt3.5-turbo in coding
- FLaNK Stack Weekly for 13 November 2023
- DeepSeek-Coder: Has anyone tried this one?
setfit
- FLaNK Stack 05 Feb 2024
- Smarter Summaries with Finetuning GPT-3.5 and Chain of Density
-
[Discussion] Convince me that this training set contamination is fine (or not)
It did, sorry for the hasty edits! I removed that part b/c I realized that there isn't a compelling-enough reason for me to believe that text similarity is clearly inappropriate. In fact, you can train the Pr(condition | chat) classifier I suggested above using similarity training! Use SetFit for that. In the end you'll get a classifier and a similarity model.
-
Ask HN: What's the best framework for text classification (few-shot learning)?
[3] https://github.com/huggingface/setfit
-
Is it worth using LLMs like GPT-3 for text classification?
There's also kinda related approaches like SetFit which calculate embeddings from pretrained transformer models then then fit a classifier on top of the embeddings. I've yet to try it but it supposedly works well with very few labelled examples.
- LLMs for Text Classification (7B parameters)
- GPT-3 vs GPT-Neo / GPT-J for startup classification
-
Ideas on how to improve classification and scoring using Mean Pooled Sentence Embeddings
You could have a look at setfit.
-
SetFit (Sentence Transformer Fine-tuning) - Fewshot Learning without prompts [D]
Found relevant code at https://github.com/huggingface/setfit + all code implementations here
-
Most Popular AI Research Sept 2022 - Ranked Based On Total GitHub Stars
Efficient Few-Shot Learning Without Prompts https://github.com/huggingface/setfit https://arxiv.org/abs/2209.11055v1
What are some alternatives?
draw-a-ui - Draw a mockup and generate html for it
iris - Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
FT-Merge-Quantize-Infer-CML
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
cucim - cuCIM - RAPIDS GPU-accelerated image processing library
VToonify - [SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
linen.dev - Lightweight Google-searchable Slack alternative for Communities
motion-diffusion-model - The official PyTorch implementation of the paper "Human Motion Diffusion Model"
wubloader
git-re-basin - Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
clipea - 📎🟢 Like Clippy but for the CLI. A blazing fast AI helper for your command line
storydalle