uform
LinkBERT
| | uform | LinkBERT |
|---|---|---|
| Mentions | 8 | 2 |
| Stars | 894 | 389 |
| Growth | 9.3% | - |
| Activity | 9.2 | 1.8 |
| Last commit | 10 days ago | about 2 years ago |
| Language | Python | Python |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
uform
- CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data
question: any good on-device size image embedding models?
tried https://github.com/unum-cloud/uform which i do like, especially they also support languages other than English. Any recommendations on other alternatives?
- Multimodal Embeddings for JavaScript, Swift, and Python
- Show HN: UForm v2 Featuring Multimodal Matryoshka, Multimodal DPO, and ONNX
- UForm v1: Multimodal Chat in 1.5B Parameters
- Show HN: I scraped 25M Shopify products to build a search engine
As you scale, you may benefit from these two projects I maintain, and the Big Tech uses :)
https://github.com/unum-cloud/usearch - for faster search
https://github.com/unum-cloud/uform - for cheaper multi-lingual multi-modal embeddings
- Show HN: U)Search Images demo in 200 lines of Python
[2]: https://github.com/unum-cloud/uform
- Show HN: UForm v2 - tiny CLIP-like embeddings in 21 languages and Graphcore API
- Unum: Vector Search engine in a single file
Ouch! That's fat! Which model is that?
We have built a few video-search systems by now, using USearch and UForm for embeddings. The embeddings are only 256 dims, and you can concatenate a few from different parts of the video. Any chance it would help?
https://github.com/unum-cloud/uform
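The concatenation idea in the comment above can be sketched roughly as follows. This is a minimal illustration, not UForm's or USearch's API: `embed_frame_stub` is a hypothetical stand-in that returns deterministic random unit vectors instead of calling a real image encoder, and the choice to re-normalize the concatenated vector is an assumption made so that a dot product behaves like cosine similarity.

```python
import numpy as np

EMBEDDING_DIMS = 256  # per-frame embedding size quoted in the comment above


def embed_frame_stub(frame_id: int) -> np.ndarray:
    """Hypothetical stand-in for a real image encoder.

    Returns a deterministic unit-length vector seeded by the frame id.
    """
    rng = np.random.default_rng(frame_id)
    vector = rng.standard_normal(EMBEDDING_DIMS).astype(np.float32)
    return vector / np.linalg.norm(vector)


def embed_video(frame_ids: list[int]) -> np.ndarray:
    """Concatenate per-frame embeddings into one video-level vector."""
    parts = [embed_frame_stub(frame_id) for frame_id in frame_ids]
    video = np.concatenate(parts)
    # Re-normalize (an assumption) so dot products act as cosine similarity.
    return video / np.linalg.norm(video)


# Two "videos" that share their first two sampled frames.
video_a = embed_video([0, 1, 2])
video_b = embed_video([0, 1, 3])

print(video_a.shape)             # (768,): three 256-dim parts
print(float(video_a @ video_b))  # similarity driven mostly by the shared frames
```

Note that concatenation is order-sensitive: the same frames sampled in a different order produce a different video vector, so the sampling positions would need to be consistent across videos.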
LinkBERT
- [D] Fine tuning LLM - VM requirements.
I am quite new to this - at the stage of experimenting. I would like to fine tune some large language model like LinkBert for instance: https://github.com/michiyasunaga/LinkBERT
- Stanford AI Researchers Propose "LinkBERT": A New Pretraining Method That Improves Language Model Training with Document Links
Continue reading | Check out the paper, github and blog post
What are some alternatives?
CogVLM - a state-of-the-art-level open visual language model | multimodal pretrained model
PaddleNLP - Easy-to-use and powerful NLP and LLM library with Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis etc.
usearch - Fast Open-Source Search & Clustering engine × for Vectors & Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram
kuzu - Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
neural-file-sorter - A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the OpenAI CLIP model.
qagnn - [NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs
ucall - Remote Procedure Calls - 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring and SIMDJSON
LMFlow - An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
semantic-search-app-template - Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI
emoji_search - Semantically Search Emojis From the Command Line!