setfit
TornadoVM
Metric | setfit | TornadoVM |
---|---|---|
Mentions | 13 | 22 |
Stars | 1,990 | 1,108 |
Growth | 3.7% | 1.4% |
Activity | 9.2 | 9.9 |
Last commit | 2 days ago | 3 days ago |
Language | Jupyter Notebook | Java |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
setfit
- FLaNK Stack 05 Feb 2024
- Smarter Summaries with Finetuning GPT-3.5 and Chain of Density
- [Discussion] Convince me that this training set contamination is fine (or not)
  It did, sorry for the hasty edits! I removed that part b/c I realized that there isn't a compelling-enough reason for me to believe that text similarity is clearly inappropriate. In fact, you can train the Pr(condition | chat) classifier I suggested above using similarity training! Use SetFit for that. In the end you'll get a classifier and a similarity model.
- Ask HN: What's the best framework for text classification (few-shot learning)?
  [3] https://github.com/huggingface/setfit
- Is it worth using LLMs like GPT-3 for text classification?
  There's also kinda related approaches like SetFit which calculate embeddings from pretrained transformer models then fit a classifier on top of the embeddings. I've yet to try it but it supposedly works well with very few labelled examples.
- LLMs for Text Classification (7B parameters)
- GPT-3 vs GPT-Neo / GPT-J for startup classification
- Ideas on how to improve classification and scoring using Mean Pooled Sentence Embeddings
  You could have a look at setfit.
- SetFit (Sentence Transformer Fine-tuning) - Fewshot Learning without prompts [D]
  Found relevant code at https://github.com/huggingface/setfit + all code implementations here
- Most Popular AI Research Sept 2022 - Ranked Based On Total GitHub Stars
  Efficient Few-Shot Learning Without Prompts https://github.com/huggingface/setfit https://arxiv.org/abs/2209.11055v1
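
One of the comments above describes SetFit-style approaches as computing embeddings and then fitting a classifier on top of them. The sketch below illustrates only that two-step pattern in plain Python. It does not use the setfit library: `embed` is a toy bag-of-words stand-in for a pretrained sentence encoder, the nearest-centroid classifier is a swapped-in simple head, and all function names and training sentences are hypothetical.

```python
import math
from collections import Counter, defaultdict


def embed(text):
    """Toy stand-in for a pretrained sentence encoder (assumption):
    an L2-normalised bag-of-words vector stored as a sparse dict."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {tok: c / norm for tok, c in counts.items()} if norm else {}


def fit_centroids(texts, labels):
    """Step 2: fit a nearest-centroid classifier on top of the embeddings."""
    sums = defaultdict(Counter)
    sizes = Counter(labels)
    for text, label in zip(texts, labels):
        sums[label].update(embed(text))  # Counter.update adds the weights
    return {
        label: {tok: v / sizes[label] for tok, v in vec.items()}
        for label, vec in sums.items()
    }


def predict(centroids, text):
    """Return the label whose centroid has the highest cosine similarity."""
    query = embed(text)

    def score(centroid):
        dot = sum(w * centroid.get(tok, 0.0) for tok, w in query.items())
        norm = math.sqrt(sum(v * v for v in centroid.values()))
        return dot / norm if norm else 0.0

    return max(centroids, key=lambda label: score(centroids[label]))


# A handful of labelled examples, in the spirit of few-shot classification.
train_texts = [
    "loved this great film",
    "great acting and a great story",
    "terrible boring film",
    "boring plot and terrible acting",
]
train_labels = ["pos", "pos", "neg", "neg"]

centroids = fit_centroids(train_texts, train_labels)
print(predict(centroids, "a great story"))
print(predict(centroids, "boring and terrible"))
```

With a real sentence encoder in place of `embed`, the classifier head would see dense vectors that capture meaning rather than exact word overlap, which is what makes the approach plausible with very few labelled examples.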
TornadoVM
- Intel Gaudi 3 AI Accelerator
  You don't need to use C++ to interface with CUDA or even write it.
  A while ago NVIDIA and the GraalVM team demoed grCUDA which makes it easy to share memory with CUDA kernels and invoke them from any managed language that runs on GraalVM (which includes JIT compiled Python). Because it's integrated with the compiler the invocation overhead is low:
  https://developer.nvidia.com/blog/grcuda-a-polyglot-language...
  And TornadoVM lets you write kernels in JVM langs that are compiled through to CUDA:
  https://www.tornadovm.org
  There are similar technologies for other languages/runtimes too. So I don't think that will cause NVIDIA to lose ground.
- Java VectorAPI compatibility with TornadoVM GPU programming framework
- Java GPU pre/post processing with ONNX RT and TornadoVM
- FLaNK Stack 05 Feb 2024
- FLaNK 25 December 2023
- GPU Acceleration for Python, JavaScript, Ruby from Java with Truffle
- TornadoVM v1.0 Released
- TornadoVM 1.0
- From CPU to GPU and FPGAs: Supercharging Java Applications with TornadoVM [video]
  Presented by Juan Fumero, PhD & Research Fellow (The University of Manchester, UK) during the JVM Language Summit 2023 (Santa Clara CA).
  More information on TornadoVM can be found at https://www.tornadovm.org/
  Tags: #Java #JVMLS #GPU #FPGA #OpenJDK #GraalVM #AI
What are some alternatives?
iris - Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Aparapi - The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
openapi4j - OpenAPI 3 parser, JSON schema and request validator.
VToonify - [SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
GraalVMREPL - REPL (read–eval–print loop) shell built on top of JavaFX and GraalVM stack, incorporating GraalJS, GraalPython, TruffleRuby and FastR
motion-diffusion-model - The official PyTorch implementation of the paper "Human Motion Diffusion Model"
kattlo-cli - Kattlo CLI Project
git-re-basin - Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
junodb - JunoDB is PayPal's home-grown secure, consistent and highly available key-value store providing low, single digit millisecond, latency at any scale.
storydalle
jr - JR: streaming quality random data from the command line