keras-nlp
ivy
keras-nlp | ivy | |
---|---|---|
2 | 17 | |
701 | 14,015 | |
3.1% | 0.1% | |
9.5 | 10.0 | |
5 days ago | 10 days ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
keras-nlp
-
Keras 3.0
Yes, Keras can be used to build LLMs. In fact this is one of the main use cases.
There are some tutorials about how to do it "from scratch", like this: https://keras.io/examples/nlp/neural_machine_translation_wit...
Otherwise, if you want to reuse an existing LLM (or just see how a large one would be implemented in practice) you can check out the models from KerasNLP. For instance, this is BERT, basically just a stack of TransformerEncoders. https://github.com/keras-team/keras-nlp/blob/master/keras_nl...
-
Keras Core: Keras for TensorFlow, Jax, and PyTorch
Yes, you can check out KerasCV and KerasNLP which host pretrained models like ResNet, BERT, and many more. They run on all backends as of the latest releases (today), and converting them to be backend-agnostic was pretty smooth! It took a couple of weeks to convert the whole packages.
https://github.com/keras-team/keras-nlp/tree/master/keras_nl...
ivy
-
Keras 3.0
See also https://github.com/unifyai/ivy which I have not tried but seems along the lines of what you are describing, working with all the major frameworks
-
Show HN: Carton β Run any ML model from any programming language
is this ancillary to what [these guys](https://github.com/unifyai/ivy) are trying to do?
- Ivy: All in one machine learning framework
- Ivy ML Transpiler and Framework
-
[D] Keras 3.0 Announcement: Keras for TensorFlow, JAX, and PyTorch
https://unify.ai/ They are trying to do what Ivy is doing already.
-
Ask for help: what is the best way to have code both support torch and numpy?
Check Ivy.
-
CoreML Stable Diffusion
ROCm's great for data centers, but good luck finding anything about desktop GPUs on their site apart from this lone blog post: https://community.amd.com/t5/instinct-accelerators/exploring...
There's a good explanation of AMD's ROCm targets here: https://news.ycombinator.com/item?id=28200477
It's currently a PITA to get common Python libs like Numba to even talk to AMD cards (admittedly Numba won't talk to older Nvidia cards either and they deprecate ruthlessly; I had to downgrade 8 versions to get it working with a 5yo mobile workstation). YC-backed Ivy claims to be working on unifying ML frameworks in a hardware-agnostic way but I don't have enough experience to assess how well they're succeeding yet: https://lets-unify.ai
I was happy to see DiffusionBee does talk the GPU in my late-model intel Mac, though for some reason it only uses 50% of its power right now. I'm sure the situation will improve as Metal 3.0 and Vulkan get more established.
-
DL Frameworks in a nutshell
Won't it all come together with https://lets-unify.ai/ ?
- Unified Machine Learning
-
[Discussion] Opinions on unify AI
What do you think about unify AI https://lets-unify.ai.
What are some alternatives?
keras-core - A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.
PaddleNLP - π Easy-to-use and powerful NLP and LLM library with π€ Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including πText Classification, π Neural Search, β Question Answering, βΉοΈ Information Extraction, π Document Intelligence, π Sentiment Analysis etc.
MAGIST-Algorithm - Multi-Agent Generally Intelligent Simultaneous Training Algorithm for Project Zeta
ColossalAI - Making large AI models cheaper, faster and more accessible
i6_experiments
DeepFaceLive - Real-time face swap for PC streaming or video calls
Spectrum - Spectrum is an AI that uses machine learning to generate Rap song lyrics
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
returnn - The RWTH extensible training framework for universal recurrent neural networks
lisp - Toy Lisp 1.5 interpreter
keras-cv - Industry-strength Computer Vision workflows with Keras
Kornia - Geometric Computer Vision Library for Spatial AI