|finetuner||Jina AI examples|
|2 months ago||almost 2 years ago|
|Apache License 2.0||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
How can I create a dataset to refine Whisper AI from old videos with subtitles?
4 projects | /r/OpenAI | 17 Feb 2023
You can try creating your own dataset. Get some audio data that you want, preprocess it, and then create a custom dataset you can use to fine tune. You could use finetuners like these if you want as well.
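For the "create your own dataset" step, one possible starting point is to turn each subtitle cue into a timed text segment that can later be paired with the matching slice of audio. The sketch below assumes the subtitles are in SRT format. `parse_srt` and the sample text are hypothetical names used for illustration and are not part of Whisper or Finetuner.

```python
import re

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:03,500".
TIMING = re.compile(
    r"(\d+):(\d+):(\d+)[,.](\d+)\s*-->\s*(\d+):(\d+):(\d+)[,.](\d+)"
)


def parse_srt(srt_text):
    """Return a list of (start_seconds, end_seconds, text) tuples.

    Hypothetical helper: assumes well-formed SRT with millisecond timestamps.
    """
    segments = []
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = [line.strip() for line in block.splitlines() if line.strip()]
        for i, line in enumerate(lines):
            match = TIMING.match(line)
            if match:
                h1, m1, s1, ms1, h2, m2, s2, ms2 = (int(g) for g in match.groups())
                start = h1 * 3600 + m1 * 60 + s1 + ms1 / 1000
                end = h2 * 3600 + m2 * 60 + s2 + ms2 / 1000
                # Everything after the timing line is the spoken text.
                text = " ".join(lines[i + 1:])
                if text:
                    segments.append((start, end, text))
                break
    return segments


sample = """1
00:00:01,000 --> 00:00:03,500
Hello there.

2
00:00:04,000 --> 00:00:06,250
This is the second
subtitle line.
"""

segments = parse_srt(sample)
```

Each tuple gives the time window to cut from the audio track and the transcript to use as its label. Cutting the audio itself is left out here.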
A Guide to Using OpenTelemetry in Jina for Monitoring and Tracing Applications
6 projects | dev.to | 16 Feb 2023
We derived the dataset by pre-processing the deepfashion dataset using Finetuner. The image label generated by Finetuner is extracted and formatted to produce the text attribute of each product.
[D] Looking for an open source Downloadable model to run on my local device.
2 projects | /r/MachineLearning | 12 Feb 2023
You can either use Hugging Face Transformers as they have a lot of pre-trained models that you can customize. Or Finetuners like this one: which is a toolkit for fine-tuning multiple models.
Improving Search Quality for Non-English Queries with Fine-tuned Multilingual CLIP Models
2 projects | dev.to | 10 Feb 2023
Very recently, a few non-English and multilingual CLIP models have appeared, using various sources of training data. In this article, we’ll evaluate a multilingual CLIP model’s performance in a language other than English, and show how you can improve it even further using Jina AI’s Finetuner.
Classification using prompt or fine tuning?
2 projects | /r/GPT3 | 6 Feb 2023
You can try prompt-based classification or fine-tuning with a Finetuner. Prompts work well for simple tasks, but fine-tuning may give better results for complex ones. Although it's going to need more resources, try both and see what works best for you.
Asking questions about lengthy texts
2 projects | /r/GPT3 | 4 Feb 2023
If you've got a set of Q&A pairs for your 60-page lease or medical paper, you could use finetuners to help answer questions about the text. But if you don't have those pairs, fine-tuning might not be a good fit. Try summarizing the doc or extracting the info. And if you're hitting the token limit, try using a bigger model or breaking up the text into smaller pieces.
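Breaking a long text into smaller pieces could look roughly like this. It is a minimal sketch that counts words as a stand-in for tokens, since real token counts depend on the model's tokenizer. `chunk_words`, `max_words` and `overlap` are hypothetical names chosen for illustration.

```python
def chunk_words(text, max_words=200, overlap=20):
    """Split text into chunks of at most max_words words.

    Neighboring chunks share `overlap` words so that a sentence cut at a
    boundary still appears whole in one of them.
    """
    if max_words <= 0 or not 0 <= overlap < max_words:
        raise ValueError("need max_words > 0 and 0 <= overlap < max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once the current chunk already reaches the end of the text.
        if start + max_words >= len(words):
            break
    return chunks


pieces = chunk_words(" ".join(str(i) for i in range(10)), max_words=4, overlap=1)
```

Each piece can then be summarized or queried on its own and the partial answers combined afterwards.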
What are the best Python libraries to learn for beginners?
7 projects | /r/learnpython | 30 Jan 2023
Actually, further along in applying ML, I found Finetuner pretty handy for getting the last mile done.
Fine-tuning open source models to emulate ChatGPT for code explanation.
2 projects | /r/learnmachinelearning | 27 Jan 2023
One option I’m considering is using fine tuners like the one from HuggingFace or Jina AI to fine-tune open source models like GPT-J or OPT to improve specific use-cases like code explanation. With the funding that we have, I wouldn’t want to cheap out on fine-tuning and expect something good.
Efficient way to tune a network by changing hyperparameters?
2 projects | /r/deeplearning | 25 Jan 2023
Off the top of my head, you can use Grid Search to test hyperparam combinations, Random Search to randomize hyperparams, or Neural search, which uses ML to optimize hyperparameter tuning. You can use finetuners for this as well.
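To make the difference between the first two options concrete, here is a minimal sketch of grid search and random search over a small hyperparameter space. The scoring function is a toy stand-in for "train the network and return a validation score". All names here (`grid_search`, `random_search`, `toy_score`, `space`) are hypothetical.

```python
import itertools
import random


def grid_search(space, score):
    """Try every combination in `space`; return (best_params, best_score)."""
    names = list(space)
    best = None
    for values in itertools.product(*(space[n] for n in names)):
        params = dict(zip(names, values))
        s = score(params)
        if best is None or s > best[1]:
            best = (params, s)
    return best


def random_search(space, score, trials, seed=0):
    """Sample `trials` random combinations; return (best_params, best_score)."""
    rng = random.Random(seed)
    best = None
    for _ in range(trials):
        params = {n: rng.choice(vals) for n, vals in space.items()}
        s = score(params)
        if best is None or s > best[1]:
            best = (params, s)
    return best


def toy_score(params):
    # Toy objective (assumption): peaks at lr=0.01, batch_size=32.
    return -(params["lr"] - 0.01) ** 2 - (params["batch_size"] - 32) ** 2 * 1e-6


space = {"lr": [0.1, 0.01, 0.001], "batch_size": [16, 32, 64]}
best_grid = grid_search(space, toy_score)
best_random = random_search(space, toy_score, trials=5)
```

Grid search costs one training run per combination, so it grows quickly with the number of hyperparameters. Random search lets you cap the number of runs at the price of possibly missing the best combination.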
Seeking advice on improving NLP search results
4 projects | /r/LanguageTechnology | 22 Jan 2023
Back then, I came across some info about a self-supervised sentence embedding system that surpasses Sentence Transformers NLI models, but forgot where it was. You could use Jina’s Finetuner. It lets you boost your pre-trained models' performance, making them ready for production without having to spend a lot of time labeling or buying expensive hardware.
Jina AI examples
Show HN: Search PDFs with Transformers and Python Notebook
4 projects | news.ycombinator.com | 25 Jul 2022
- Modern PDFs - if you wanna extract text and images, then the PDFSegmenter used in my example will work. If tables too, might need some additional jiggery-pokery, but definitely doable. I know other ppl using the same framework (Jina) who've accomplished it.
- Exact word search - pretty simple. I've focused on more advanced stuff because color vs colour is same same but different. Also just because it's pretty easy since I'm just using pre-defined building blocks, not manually integrating stuff
- Cross platform frontend - I've seen a lyrics search frontend and I've built stuff in Streamlit before. Jina offers RESTful/gRPC/WebSockets gateways so it can't be too tough
- Lightweight? I mean how lightweight do you want it? C? Bash? Assembly? I've found Python good for text parsing
- Long-term: The notebook I wrote has a few (each of which have their own), but compared to others they're relatively lightweight.
- Gluing code: I've been using pre-existing building blocks, and writing new Executors (i.e. building blocks) is relatively straightforward, and then scaling them up with shards, replicas, etc is just a parameter away.
I'm more into the search side than the PDF stuff. The PDF side I've had experience with through bitter suffering and torment. Not a fun format to work with (unless you're into sado-masochism)
Getting started with Jina AI
5 projects | dev.to | 19 Feb 2022
Semantic Wikipedia Search
Build your own Google Image search powered by deep-learning, open-source
2 projects | /r/privacy | 16 Aug 2021
[P] Open-source Neural Search framework to implement semantic search & multimedia search. Just released 2.0, seeking your feedback.
6 projects | /r/MachineLearning | 3 Jul 2021
There are already some examples on music search, PDF search and video search that show a POC of its capabilities around those use cases. You can discuss your specific use case in detail with the Jina community on Slack.
I was wrong! A big thank you to r/python members 🙏
2 projects | /r/Python | 13 Jun 2021
Thank you so much for the appreciation and for sharing your use cases. Check out the examples for chatbot and financial analysis - https://github.com/jina-ai/examples
PDF search - Another project I built using Jina(AI Search framework)
3 projects | /r/datascience | 16 May 2021
git clone --depth 1 --filter=blob:none --sparse https://github.com/jina-ai/examples
cd examples
git sparse-checkout set multimodal-search-pdf
Source Code on GitHub
[Project] Framework to build AI powered search with just 7 lines of code. Supports semantic, text, image, audio & video search
Check out these production-ready examples
This can be done with Jina. For video data, we transform the videos into frames (images), and then we can search the indexed frames with the images we want. Check out the tumblr gif search example and extend it to meet your goals. Do you feel confident to do it? What change would make it easier for you to implement your use case? If you need any help, you can always ask in the Jina Slack community. There are many helpful community members who'd assist you.
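The frame-based idea can be sketched without any framework. The example below is not Jina's API. It assumes each frame has already been turned into a small feature vector by some image model, and every name in it (`index_videos`, `search`, the toy vectors) is hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def index_videos(videos):
    """Flatten {video_id: [frame_vector, ...]} into (video_id, frame_no, vector) entries."""
    return [
        (video_id, frame_no, vector)
        for video_id, frames in videos.items()
        for frame_no, vector in enumerate(frames)
    ]


def search(index, query_vector, top_k=1):
    """Return the (video_id, frame_no) pairs whose frames best match the query image."""
    scored = sorted(index, key=lambda entry: cosine(entry[2], query_vector), reverse=True)
    return [(video_id, frame_no) for video_id, frame_no, _ in scored[:top_k]]


# Toy frame embeddings (assumption): real ones would come from an image encoder.
videos = {
    "cat.gif": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "dog.gif": [[0.1, 0.9, 0.0], [0.0, 0.8, 0.3]],
}
index = index_videos(videos)
hits = search(index, [0.05, 0.95, 0.05])
```

Because every frame is indexed separately, a query image matches a video as soon as it resembles any one of its frames, and the frame number shows where in the video the match occurs.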
You can do that right now. Here's an example of gif search (video-to-video) and an example of object search (image-to-image object search). Note: Jina currently provides pre-trained models which can be extended further for better support in this domain.
What are some alternatives?
gpt_index - LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. [Moved to: https://github.com/jerryjliu/llama_index]
jina - ☁️ Build multimodal AI applications with cloud-native stack
RWKV-LM - RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
jina-hub - An open-registry for hosting Jina executors via container images
Promptify - Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
jina-app-store-example - App store search example, using Jina as backend and Streamlit as frontend [Moved to: https://github.com/jina-ai/example-app-store]
jina-meme-search-example - Meme search engine built with Jina neural search framework. Search with captions or image files to find matching memes. [Moved to: https://github.com/jina-ai/example-meme-search]
DearPyGui - Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
pdfminer.six - Community maintained fork of pdfminer - we fathom PDF
pysot - SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.