LASER
Arraymancer
LASER | Arraymancer | |
---|---|---|
5 | 21 | |
3,520 | 1,307 | |
0.3% | - | |
5.7 | 8.2 | |
7 days ago | 4 days ago | |
Jupyter Notebook | Nim | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
LASER
-
SentenceTransformers: Python framework for sentence, text and image embeddings
I'm curious how people are handling multi-lingual embeddings.
I've found LASER[1] which originally had the idea to embed all languages in the same vector space, though it's a bit harder to use than models available through SentenceTransformers. LASER2 stuck with this approach, but LASER3 switched to language-specific models. However, I haven't found benchmarks for these models, and they were released about 2 years ago.
Another alternative would be to translate everything before embedding, which would introduce some amount of error, though maybe it wouldn't be significant.
1. https://github.com/facebookresearch/LASER
-
[D] Hey Reddit! We're a bunch of research scientists and software engineers and we just open sourced a new state-of-the-art AI model that can translate between 200 different languages. We're excited to hear your thoughts so we're hosting an AMA on 07/21/2022 @ 9:00AM PT. Ask Us Anything!
You can check out some of our materials and open sourced artifacts here: - Our latest blog post: https://ai.facebook.com/blog/nllb-200-high-quality-machine-translation - Project Overview: https://ai.facebook.com/research/no-language-left-behind/ - Product demo: https://nllb.metademolab.com/ - Research paper: https://research.facebook.com/publications/no-language-left-behind - NLLB-200: https://github.com/facebookresearch/fairseq/tree/nllb - FLORES-200: https://github.com/facebookresearch/flores - LASER3: https://github.com/facebookresearch/LASER Joining us today for the AMA are: - Angela Fan (AF), Research Scientist - Jean Maillard (JM), Research Scientist - Maha Elbayad (ME), Research Scientist - Philipp Koehn (PK), Research Scientist - Shruti Bhosale (SB), Software Engineer We’ll be here from 07/21/2022 @09:00AM PT - 10:00AM PT Thanks and we’re looking forward to answering your questions!
-
School project : sentiments analysis with my country Arabic Dialect
This may be helpful: https://github.com/facebookresearch/LASER
-
[P] Bilingual text alignment tools for NMT - help needed
Check FB's LASER: https://github.com/facebookresearch/LASER/tree/master/tasks/CCMatrix Also , Sentence-Transformers has a pretty neat model for crosslingual sentence similarity: https://huggingface.co/sentence-transformers/stsb-xlm-r-multilingual
-
Help with aligned word embeddings
You want LASER its a superbig model trained on tons of languages you can use it with sentence_transformers in python to compute embedings. Then you can use faiss or datasketch to find matches at K
Arraymancer
-
Arraymancer – Deep Learning Nim Library
It is a small DSL written using macros at https://github.com/mratsim/Arraymancer/blob/master/src/array....
Nim has pretty great meta-programming capabilities and arraymancer employs some cool features like emitting cuda-kernels on the fly using standard templates depending on backend !
-
Go, Python, Rust, and production AI applications
Nim has also a powerful deep learning library called Arraymancer. It's selling point is that you don't have to rewrite your code from research to production. It's used in various machine learning projects, but one recent one that caught my eye was https://github.com/amkrajewski/nimCSO "Composition Space Optimization"
https://github.com/mratsim/Arraymancer
-
D Programming Language
- https://github.com/mratsim/Arraymancer/blob/master/src/array...
It's worth noting that nim async/await transformation is fully implemented as a library in macros.
- Prospects of utilising Nim in scientific computation?
-
How to write performant Nim?
https://github.com/mratsim/Arraymancer 11. « Premature optimisation is the root of all evil », Donald Knuth, The art of computer Programming It would be quite useful that someone writes one with examples for all these recommendations and more ...
-
Deeplearning in Nim?
In particular for deep learning as bobsyourunkl already mentioned there is arraymancer on the one hand and also flambeau on the other. The latter is a Nim wrapper around libtorch (i.e. the PyTorch C++ backend). It is missing things (to be wrapped by adding a few lines) and has some rough edges, but if one needs to get stuff done, it's possible.
-
Mastering Nim – now available on Amazon
how are u compiling (optimization, custom compilation flags etc.?) In my case https://github.com/mratsim/Arraymancer big project compile under your 4.2s so or you have like 10k+ lines of codes with macros or you just pass some debug flags to compiler :D
- Nim Version 1.6.6 Released
- The counter-intuitive rise of Python in scientific computing (2020)
-
Computer Programming with Nim
We have both raw wrappers for BLAS:
https://github.com/andreaferretti/nimblas
as well as LAPACK:
https://github.com/andreaferretti/nimlapack
For an example, consider calling the least squares routine `dgelsd` in arraymancer:
https://github.com/mratsim/Arraymancer/blob/master/src/array...
wrapped up in a nicer user facing API.
Feel free to hop onto matrix, if you have more questions!
What are some alternatives?
MUSE - A library for Multilingual Unsupervised or Supervised word Embeddings
nimtorch - PyTorch - Python + Nim
electra - ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
Nim - Nim is a statically typed compiled systems programming language. It combines successful concepts from mature languages like Python, Ada and Modula. Its design focuses on efficiency, expressiveness, and elegance (in that order of priority).
fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
nimble - Package manager for the Nim programming language.
flores - Facebook Low Resource (FLoRes) MT Benchmark
awesome-tensor-compilers - A list of awesome compiler projects and papers for tensor computation and deep learning.
nvim-treesitter - Nvim Treesitter configurations and abstraction layer
prologue - Powerful and flexible web framework written in Nim
application - Buckets Desktop Application
nvim-treesitter-textobjects