Laserembeddings Alternatives
Similar projects and alternatives to laserembeddings
-
user.js
Firefox privacy, security and anti-tracking: a comprehensive user.js template for configuration and hardening
-
firefox-translations
Discontinued. Firefox Translations is a web extension that enables client-side translations for web browsers.
-
bergamot-translator
Cross-platform C++ library focusing on optimized machine translation on consumer-grade devices.
-
duckling
Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.
-
syntaxdot
Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
-
translatelocally-web-ext
TranslateLocally for the Browser is a web extension that enables client-side, in-page translations for web browsers.
-
berga-translator
A browser extension that provides client-side translation via the Bergamot Project.
laserembeddings reviews and mentions
-
Firefox Translations doesn't use the cloud
You're pretty much right on the money. For ParaCrawl[1] (which I worked on) we used fast machine translation systems that were "good enough" to translate one side of each pair to the language of the other, see whether they'd match sufficiently, and then deal with all the false positives through various filtering methods. Other datasets I know of use multilingual sentence embeddings, like LASER[2], to compute the distance between two sentences.
Both of these methods have a bootstrapping problem, but at this point in MT, for many languages, we have enough data to get started. Previous iterations of ParaCrawl used things like document structure and overlap of named entities among sentences to identify matching pairs, but this is much less robust. I don't know how they solve this problem today for low-resource languages.
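The embedding-based approach mentioned in the comment above can be sketched roughly as follows. This is only an illustration under stated assumptions, not how ParaCrawl or any particular dataset is built: `embed_src` and `embed_tgt` are hypothetical callables that return one vector per sentence (with laserembeddings, each could be a thin wrapper around `Laser().embed_sentences(sentences, lang=...)`), and the `0.8` threshold is an arbitrary placeholder that would need tuning.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
# Hypothetical embedder type: takes sentences, returns one vector per sentence.
Embedder = Callable[[List[str]], List[Vector]]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_parallel_pairs(
    pairs: List[Tuple[str, str]],
    embed_src: Embedder,
    embed_tgt: Embedder,
    threshold: float = 0.8,  # assumed cutoff, not taken from the source
) -> List[Tuple[str, str, float]]:
    """Keep candidate pairs whose embeddings are at least `threshold` similar."""
    if not pairs:
        return []
    src_vecs = embed_src([src for src, _ in pairs])
    tgt_vecs = embed_tgt([tgt for _, tgt in pairs])
    kept = []
    for (src, tgt), u, v in zip(pairs, src_vecs, tgt_vecs):
        score = cosine_similarity(u, v)
        if score >= threshold:
            kept.append((src, tgt, score))
    return kept
```

Because the embedders are passed in, the filtering logic can be tried with toy vectors before plugging in a real multilingual model. Note that laserembeddings needs its model files downloaded first (`python -m laserembeddings download-models`).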
-
SpaCy v3.0 Released (Python Natural Language Processing)
I've been using LASER from Facebook Research via https://github.com/yannvgn/laserembeddings to accept multilingual input in front of the domain-specific models for recommendations and stuff (that are trained on English annotated examples).
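A rough sketch of that kind of setup: a model is fitted on embeddings of English examples and then applied to input in other languages, relying on the shared embedding space. Everything here is an assumption for illustration. `embed` is a hypothetical callable returning one vector per text (for instance a wrapper around `Laser().embed_sentences`), and the nearest-centroid classifier is a simple stand-in, not the commenter's actual domain-specific models.

```python
import math
from collections import defaultdict


def _cosine(a, b):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def _mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


class NearestCentroidClassifier:
    """Stand-in downstream model operating on sentence embeddings."""

    def __init__(self, embed):
        self.embed = embed  # hypothetical: list of texts -> list of vectors
        self.centroids = {}

    def fit(self, texts, labels):
        # Group training embeddings by label and average them.
        grouped = defaultdict(list)
        for vec, label in zip(self.embed(texts), labels):
            grouped[label].append(list(vec))
        self.centroids = {label: _mean(vecs) for label, vecs in grouped.items()}
        return self

    def predict(self, texts):
        if not self.centroids:
            raise ValueError("call fit() before predict()")
        # Assign each text to the label with the most similar centroid.
        return [
            max(self.centroids, key=lambda label: _cosine(vec, self.centroids[label]))
            for vec in self.embed(texts)
        ]
```

The classifier never sees the input language; whether this works in practice depends entirely on how well the embedder aligns languages for the domain in question.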
Stats
yannvgn/laserembeddings is an open-source project licensed under the BSD 3-Clause "New" or "Revised" License, which is an OSI-approved license.
The primary programming language of laserembeddings is Python.