ColBERT
TornadoVM
ColBERT | TornadoVM | |
---|---|---|
4 | 22 | |
2,524 | 1,127 | |
7.0% | 3.1% | |
8.4 | 9.9 | |
about 1 month ago | 1 day ago | |
Python | Java | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ColBERT
-
Why Vector Compression Matters
I’ll conclude by explaining how vector compression relates to ColBERT, a higher-level technique that Astra DB customers are starting to use successfully.
-
How ColBERT Helps Developers Overcome the Limits of Retrieval-Augmented Generation
ColBERT is a new way of scoring passage relevance using a BERT language model that substantially solves the problems with DPR. This diagram from the first ColBERT paper shows why it’s so exciting:
- FLaNK Stack 05 Feb 2024
-
New free tool that uses fine-tuned BERT model to surface answers from research papers
ColBERT and successors for retrieval.
TornadoVM
-
Intel Gaudi 3 AI Accelerator
You don't need to use C++ to interface with CUDA or even write it.
A while ago NVIDIA and the GraalVM team demoed grCUDA which makes it easy to share memory with CUDA kernels and invoke them from any managed language that runs on GraalVM (which includes JIT compiled Python). Because it's integrated with the compiler the invocation overhead is low:
https://developer.nvidia.com/blog/grcuda-a-polyglot-language...
And TornadoVM lets you write kernels in JVM langs that are compiled through to CUDA:
https://www.tornadovm.org
There are similar technologies for other languages/runtimes too. So I don't think that will cause NVIDIA to lose ground.
- Java VectorAPI compatiblity with TornadoVM GPU programming framework
- Java GPU pre/post processing with ONNX RT and TornadoVM
- FLaNK Stack 05 Feb 2024
- FLaNK 25 December 2023
- GPU Acceleration for Python, JavaScript, Ruby from Java with Truffle
- TornadoVM v1.0 Released
- TornadoVM 1.0
-
From CPU to GPU and FPGAs: Supercharging Java Applications with TornadoVM [video]
Presented by Juan Fumero, PhD & Research Fellow (The University of Manchester, UK) during the JVM Language Summit 2023 (Santa Clara CA).
More information on TornadoVM can be found at https://www.tornadovm.org/
Tags: #Java #JVMLS #GPU #FPGA #OpenJDK #GraalVM #AI
What are some alternatives?
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Aparapi - The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
similarity - TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
openapi4j - OpenAPI 3 parser, JSON schema and request validator.
elasticsearch-learning-to-rank - Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
GraalVMREPL - REPL (read–eval–print loop) shell built on top of JavaFX and GraalVM stack, incorporating GraalJS, GraalPython, TruffleRuby and FastR
Milvus - A cloud-native vector database, storage for next generation AI applications
kattlo-cli - Kattlo CLI Project
haystack - :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
junodb - JunoDB is PayPal's home-grown secure, consistent and highly available key-value store providing low, single digit millisecond, latency at any scale.
awesome-semantic-search - A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
jr - JR: streaming quality random data from the command line