finetuner vs CLIP
| | finetuner | CLIP |
|---|---|---|
| Mentions | 36 | 96 |
| Stars | 1,192 | 17,792 |
| Growth | 4.9% | 5.9% |
| Activity | 0.0 | 4.6 |
| Last commit | 2 months ago | about 1 month ago |
| Language | Python | Jupyter Notebook |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
finetuner
-
How can I create a dataset to refine Whisper AI from old videos with subtitles?
You can try creating your own dataset: get the audio data you want, preprocess it, and then build a custom dataset you can use to fine-tune. You could use finetuners like these as well.
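As a minimal sketch of that dataset step, assuming you have already extracted audio clips and their subtitle text, you could build a Hugging Face dataset like this (file names and transcripts are hypothetical placeholders):

```python
# A minimal sketch (not Whisper-specific tooling): pair extracted audio clips
# with their subtitle text in a Hugging Face dataset. Paths and transcripts
# below are placeholders.
from datasets import Audio, Dataset

samples = {
    "audio": ["clips/clip_000.wav", "clips/clip_001.wav"],  # placeholder paths
    "sentence": ["first subtitle line", "second subtitle line"],
}

ds = Dataset.from_dict(samples)
# Decode and resample each file to 16 kHz on access, the rate Whisper expects.
ds = ds.cast_column("audio", Audio(sampling_rate=16_000))
```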
-
A Guide to Using OpenTelemetry in Jina for Monitoring and Tracing Applications
We derived the dataset by pre-processing the DeepFashion dataset using Finetuner. The image labels generated by Finetuner are extracted and formatted to produce the text attribute of each product.
-
[D] Looking for an open source Downloadable model to run on my local device.
You can either use Hugging Face Transformers, as they have a lot of pre-trained models that you can customize, or a finetuner like this one, which is a toolkit for fine-tuning multiple models.
-
Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models
Very recently, a few non-English and multilingual CLIP models have appeared, using various sources of training data. In this article, we’ll evaluate a multilingual CLIP model’s performance in a language other than English, and show how you can improve it even further using Jina AI’s Finetuner.
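As a rough sketch of the kind of setup being evaluated, here is cross-lingual text-to-image matching with a multilingual CLIP text encoder from sentence-transformers; this stands in for the article's Finetuner workflow, and the image file and German query are placeholders:

```python
# Cross-lingual retrieval sketch: a multilingual text tower aligned with the
# CLIP ViT-B/32 image space. Model names are real sentence-transformers models;
# "shoe.jpg" is a hypothetical file.
from PIL import Image
from sentence_transformers import SentenceTransformer, util

img_model = SentenceTransformer("clip-ViT-B-32")                  # image encoder
txt_model = SentenceTransformer("clip-ViT-B-32-multilingual-v1")  # multilingual text encoder

img_emb = img_model.encode(Image.open("shoe.jpg"))
query_emb = txt_model.encode("rote Sportschuhe")  # German: "red sneakers"

print(util.cos_sim(query_emb, img_emb))  # similarity of query to image
```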
-
Classification using prompt or fine tuning?
you can try prompt-based classification or fine-tuning with a finetuner. Prompts work well for simple tasks, but fine-tuning may give better results for complex ones, although it's going to need more resources. Try both and see what works best for you.
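For the prompt route, a minimal sketch of zero-shot classification with Hugging Face Transformers; the example text and candidate labels are placeholders:

```python
# Zero-shot classification: no fine-tuning, just an off-the-shelf NLI model
# scoring the text against candidate labels.
from transformers import pipeline

clf = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
result = clf(
    "The battery drains within two hours of normal use.",
    candidate_labels=["battery", "display", "shipping"],
)
print(result["labels"][0])  # highest-scoring label
```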
-
Asking questions about lengthy texts
If you've got a set of Q&A pairs for your 60-page lease or medical paper, you could use finetuners to help answer questions about the text. But if you don't have those pairs, fine-tuning might not be a good fit. Try summarizing the doc or extracting the info instead. And if you're hitting the token limit, try using a bigger model or breaking the text up into smaller pieces.
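For the last point, a minimal sketch of breaking a long document into overlapping chunks so each piece fits within a model's token limit; the word counts and file name are arbitrary placeholders:

```python
# Split a long text into word-based chunks with some overlap so context
# isn't lost at chunk boundaries.
def chunk_text(text: str, max_words: int = 400, overlap: int = 50):
    words = text.split()
    step = max_words - overlap
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), step)]

chunks = chunk_text(open("lease.txt").read())  # hypothetical file
print(len(chunks), "chunks")
```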
-
What are the best Python libraries to learn for beginners?
Further along in actually applying ML, Finetuner is pretty handy for getting the last mile done; I found it useful.
-
Fine-tuning open source models to emulate ChatGPT for code explanation.
One option I’m considering is using fine-tuners like the ones from Hugging Face or Jina AI to fine-tune open-source models like GPT-J or OPT for specific use cases like code explanation. With the funding that we have, I wouldn’t want to cheap out on fine-tuning and expect something good.
-
Efficient way to tune a network by changing hyperparameters?
Off the top of my head, you can either use Grid Search to test hyperparameter combinations, Random Search to sample hyperparameters at random, or neural search, which uses ML to optimize hyperparameter tuning. You can use finetuners for this as well.
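A minimal sketch of the first two options using scikit-learn; the model and search space here are placeholders:

```python
# Exhaustive grid search vs. random sampling over the same hyperparameter space.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

X, y = load_iris(return_X_y=True)
space = {"n_estimators": [50, 100, 200], "max_depth": [3, 5, None]}

grid = GridSearchCV(RandomForestClassifier(), space, cv=3).fit(X, y)
rand = RandomizedSearchCV(RandomForestClassifier(), space, n_iter=5, cv=3).fit(X, y)

print(grid.best_params_, rand.best_params_)
```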
-
Seeking advice on improving NLP search results
Back then, I came across some info about a self-supervised sentence-embedding system that surpasses Sentence Transformers' NLI models, but I forgot where it was. You could use Jina's Finetuner: it lets you boost your pre-trained models' performance, making them ready for production without spending a lot of time labeling or buying expensive hardware.
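As a starting point before any fine-tuning, a minimal sketch of semantic search over sentence embeddings with sentence-transformers; the model choice, corpus, and query are placeholders:

```python
# Semantic search: embed a corpus once, then retrieve the nearest entry
# for a query by cosine similarity.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")
corpus = ["How do I reset my password?", "Shipping takes 3-5 business days."]
corpus_emb = model.encode(corpus, convert_to_tensor=True)

query_emb = model.encode("delivery time", convert_to_tensor=True)
hits = util.semantic_search(query_emb, corpus_emb, top_k=1)
print(corpus[hits[0][0]["corpus_id"]])  # best-matching corpus entry
```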
CLIP
-
COMFYUI SDXL WORKFLOW INBOUND! Q&A NOW OPEN! (WIP EARLY ACCESS WORKFLOW INCLUDED!)
in the model card it says: pretrained text encoders (OpenCLIP-ViT/G and CLIP-ViT/L).
-
Stability Matrix v1.1.0 - Portable mode, Automatic updates, Revamped console, and more
Command: "C:\StabilityMatrix\Packages\stable-diffusion-webui\venv\Scripts\python.exe" -m pip install https://github.com/openai/CLIP/archive/d50d76daa670286dd6cacf3bcd80b5e4823fc8e1.zip --prefer-binary
-
[D] LLM or model that does image -> prompt?
CLIP might work for your needs.
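A minimal example of image-to-text matching, following the usage pattern in the openai/CLIP README; "photo.jpg" and the candidate captions are placeholders:

```python
# Score an image against candidate captions with CLIP; the highest-probability
# caption is the best textual match for the image.
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("photo.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a diagram", "a dog", "a cat"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1)
print(probs)  # probability of each caption matching the image
```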
-
Where can this be used? I have seen some tutorials to run deepfloyd on Google colab. Any way it can be done on local?
pip install deepfloyd_if==1.0.2rc0
pip install xformers==0.0.16
pip install git+https://github.com/openai/CLIP.git --no-deps
pip install huggingface_hub --upgrade
-
Can anybody advise open-sourced neural net model to tag/recognize photos on a harddrive?
I recommend https://laion.ai/blog/large-openclip/ or https://github.com/openai/CLIP .
-
Expressive Text-to-Image Generation with Rich Text
pip install git+https://github.com/openai/CLIP.git
-
[D] Data Annotation Done by Machine Learning/AI?
-
OpenAI Tokenizer
It needs to be sanitized against very long words (like 10k character long words :) ).
In previous tokenizers like CLIP's (https://github.com/openai/CLIP/blob/main/clip/simple_tokeniz... ), they used additional preprocessing steps like HTML unescaping and various cleanup via Python libraries (ftfy, html, and regex), which made porting the code exactly to other languages a real pain.
Sadly this library doesn't solve that :'-(
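For reference, a minimal sketch of the cleanup CLIP's simple_tokenizer applies (ftfy repair, double HTML unescape, whitespace collapsing), extended with the long-word guard the comment asks for; the 64-character cap is an arbitrary choice of mine:

```python
# Mirrors CLIP's basic_clean/whitespace_clean, plus a guard against
# pathological 10k-character "words" before tokenization.
import html
import re

import ftfy

def clean(text: str, max_word_len: int = 64) -> str:
    text = ftfy.fix_text(text)                  # repair mojibake
    text = html.unescape(html.unescape(text))   # CLIP unescapes twice
    text = re.sub(r"\s+", " ", text).strip()    # collapse whitespace
    # Truncate any absurdly long "word" (arbitrary cap, not from CLIP).
    return " ".join(w[:max_word_len] for w in text.split())
```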
-
Why is there speculation that midjourney is based on stable diffusion if MJ is released earlier than SD?
The people who made these colabs better and better are the same people who are at Midjourney now. But the "mother" of it all was Katherine Crowson. She made a fine-tuned model that uses a 512x512 unconditional ImageNet diffusion model, fine-tuned from OpenAI's 512x512 class-conditional ImageNet diffusion model (https://github.com/openai/guided-diffusion), together with CLIP (https://github.com/openai/CLIP) to connect text prompts with images. It uses a smaller secondary diffusion model, trained by Katherine Crowson, to remove noise from intermediate timesteps and prepare them for CLIP.
-
Stable Diffusion v2-1-unCLIP model released
So there's CLIP (Contrastive Language-Image Pretraining), which I thought this was referring to. And then there's CLIP Guided Stable Diffusion, which "can help to generate more realistic images by guiding stable diffusion at every denoising step with an additional CLIP model", which is just using that same CLIP model.
What are some alternatives?
open_clip - An open source implementation of CLIP.
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models
DALLE2-pytorch - Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
disco-diffusion
gpt_index - LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data. [Moved to: https://github.com/jerryjliu/llama_index]
txtai - 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Jina AI examples - Jina examples and demos to help you get started
segment-anything - The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
fastbook - The fastai book, published as Jupyter Notebooks
dalle-2-preview
stylegan3 - Official PyTorch implementation of StyleGAN3