|7 days ago||10 days ago|
|GNU Lesser General Public License v2.1 only||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Gensim – a Python library for topic modelling, document indexing
1 project | news.ycombinator.com | 25 Nov 2021
How to build a search engine with word embeddings
2 projects | dev.to | 22 Nov 2021
We will be using gensim to load our Google News pre-trained word vectors.
The unthinking application of this regex-efficiency check wasted our attention
1 project | news.ycombinator.com | 30 Sep 2021
The Levenshtein Distance in Production
4 projects | news.ycombinator.com | 6 Jun 2021
> Problem statement: the Levenshtein distance is a string metric for measuring the difference between two sequences
Another variant is "I have a bunch of words (a dictionary) and one query word, and want to find all words from the dictionary that are close to the query word".
This leads to an interesting class of problems, because you can do clever things where you precompute search structures (Levenshtein automata) from the dictionary. The similarity queries then run much faster, and in production, performance matters.
We recently merged a PR like that into Gensim.
This gave a ~1,500x speed-up compared to naively comparing all pairwise strings with Levenshtein distance: the difference between the training step running for years (unusable) and running for minutes.
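To make the "dictionary plus one query word" variant concrete, here is a minimal sketch in pure Python. It is not the Gensim implementation and it does not build a Levenshtein automaton; it swaps in a BK-tree, a simpler precomputed structure that prunes the search with the triangle inequality. The names `levenshtein`, `naive_search` and `BKTree` are made up for this illustration.

```python
from typing import Dict, Iterable, List, Optional, Tuple

# A node is (word, children), where children maps an edit distance to a subtree.
Node = Tuple[str, Dict[int, "Node"]]


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance, O(len(a) * len(b))."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # drop a character of a
                current[j - 1] + 1,            # insert a character of b
                previous[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        previous = current
    return previous[-1]


def naive_search(words: Iterable[str], query: str, max_dist: int) -> List[str]:
    """Baseline: compare the query against every dictionary word."""
    return sorted(w for w in words if levenshtein(w, query) <= max_dist)


class BKTree:
    """Precomputed index: each child edge is labelled with the edit
    distance between the child word and its parent word."""

    def __init__(self, words: Iterable[str]) -> None:
        self.root: Optional[Node] = None
        for word in words:
            self.add(word)

    def add(self, word: str) -> None:
        if self.root is None:
            self.root = (word, {})
            return
        node = self.root
        while True:
            dist = levenshtein(word, node[0])
            if dist == 0:
                return  # word is already in the tree
            child = node[1].get(dist)
            if child is None:
                node[1][dist] = (word, {})
                return
            node = child

    def search(self, query: str, max_dist: int) -> List[str]:
        results: List[str] = []
        stack: List[Node] = [self.root] if self.root is not None else []
        while stack:
            word, children = stack.pop()
            dist = levenshtein(query, word)
            if dist <= max_dist:
                results.append(word)
            # Triangle inequality: a match can only sit under edges whose
            # label is within max_dist of the distance just computed.
            for label, child in children.items():
                if abs(label - dist) <= max_dist:
                    stack.append(child)
        return sorted(results)


dictionary = ["book", "books", "cake", "boo", "cape", "cart", "boon", "cook"]
tree = BKTree(dictionary)
matches = tree.search("bok", max_dist=1)
```

Both approaches return the same matches; the tree simply skips subtrees that cannot contain a word within `max_dist`, which is where the savings on a large dictionary would come from. How large the savings are depends on the dictionary and the distance threshold, so this sketch makes no claim about reproducing the speed-up quoted above.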
Superior tools to Gensim's similarity
1 project | reddit.com/r/LanguageTechnology | 20 Mar 2021
So Gensim's Similarity module seems like a good fit for this problem, especially soft cosine similarity checking. But I can't quite get comfortable with it, because transformers have become very popular lately.
Koan: A word2vec negative sampling implementation with correct CBOW update
2 projects | news.ycombinator.com | 2 Jan 2021
Apparently it did: https://github.com/RaRe-Technologies/gensim/issues/1873
Ask HN: What would a reality show for Software Engineers look like?
1 project | news.ycombinator.com | 29 Nov 2021
Very well put. You got me thinking about what I do as a software developer that might be considered entertaining. Maybe computer security events like capture the flag (https://ctftime.org/), or coding an AI agent in a simulated environment to achieve a goal (https://www.codingame.com, https://gym.openai.com/), or simply competing with others to solve an algorithmic problem under either a time constraint or a code-length constraint.
Without appealing visuals none of it will be interesting to people who don’t have a development background. Maybe generative art is the answer but IMO it is more about art than programming.
[N] OpenAI Gym maintainer plans to deprecate and replace MuJoCo and Box2D environments with Brax-based environments.
1 project | reddit.com/r/MachineLearning | 24 Oct 2021
What are some fun ais that you can play around with now
1 project | reddit.com/r/artificial | 22 Oct 2021
DeepMind buys & open-sources MuJoCo
1 project | reddit.com/r/reinforcementlearning | 21 Oct 2021
OA Gym plans: https://github.com/openai/gym/issues/2456 Basically, move everything possible to Brax.
Update on Plans for the MuJoCo, Robotics and Box2d Environments and the Status of Brax and Hardware Accelerated Environments in Gym
1 project | reddit.com/r/reinforcementlearning | 21 Oct 2021
Ideas for physics based modelling of solids using machine learning
1 project | reddit.com/r/learnmachinelearning | 15 Oct 2021
8+ Reinforcement Learning Project Ideas
8 projects | dev.to | 30 Sep 2021
OpenAI Gym has become the de facto standard for reinforcement learning frameworks among researchers and practitioners. Solving toy problems from the gym library will help familiarize you with this popular framework. Good starting points include Cartpole, Lunar Lander and Taxi.
The third party environment list is now fixed up and maintained; please submit PRs for any missing environments you're aware of
1 project | reddit.com/r/reinforcementlearning | 23 Sep 2021
OpenAI Gym: How to assign values to a state variable while remaining its format in a custom environment
1 project | reddit.com/r/reinforcementlearning | 16 Sep 2021
Don't use spaces for state variables. Use regular Python variables for state variables. Box and Discrete exist to tell a program using the environment the size of the action tuples expected by .step() and the size of the state tuples returned by .step() and .reset(). The only variables that should be set to Box or Discrete are self.action_space and self.observation_space. You can look at this file for an example: https://github.com/openai/gym/blob/master/gym/envs/classic_control/cartpole.py
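The advice above can be sketched as follows. To keep the example runnable without Gym installed, `Discrete` and `Box` here are simplified stand-ins written for this illustration, not the real `gym.spaces` classes, and `CounterEnv` is a hypothetical toy environment. In a real environment you would subclass `gym.Env` and use `gym.spaces.Discrete` and `gym.spaces.Box` instead.

```python
from dataclasses import dataclass
from typing import Dict, Sequence, Tuple


@dataclass(frozen=True)
class Discrete:
    """Stand-in for gym.spaces.Discrete: the integers 0 .. n-1."""
    n: int

    def contains(self, x: object) -> bool:
        return isinstance(x, int) and 0 <= x < self.n


@dataclass(frozen=True)
class Box:
    """Stand-in for gym.spaces.Box: a 1-D vector of bounded floats."""
    low: float
    high: float
    shape: Tuple[int, ...]

    def contains(self, x: Sequence[float]) -> bool:
        return len(x) == self.shape[0] and all(
            self.low <= v <= self.high for v in x
        )


class CounterEnv:
    """Toy environment: move a counter left or right until it hits a bound."""

    def __init__(self, limit: int = 5) -> None:
        self.limit = limit
        # Spaces only DESCRIBE what actions and observations look like.
        self.action_space = Discrete(2)
        self.observation_space = Box(low=-limit, high=limit, shape=(1,))
        # The state itself is a regular Python variable, not a space.
        self.position = 0

    def _observe(self) -> Tuple[float]:
        return (float(self.position),)

    def reset(self) -> Tuple[float]:
        self.position = 0
        return self._observe()

    def step(self, action: int) -> Tuple[Tuple[float], float, bool, Dict]:
        assert self.action_space.contains(action), "invalid action"
        self.position += 1 if action == 1 else -1
        done = abs(self.position) >= self.limit
        reward = 1.0 if self.position >= self.limit else 0.0
        return self._observe(), reward, done, {}


env = CounterEnv(limit=3)
obs = env.reset()
for _ in range(3):
    obs, reward, done, info = env.step(1)  # always move right
```

The state lives in `self.position`, an ordinary integer that can be assigned freely, while the two space attributes stay fixed after construction. The four-value return from `step` follows the Gym interface of the 0.20 era discussed in this thread.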
[N] Gym version 0.20.0, the largest single update since Gym was first released, is now out
1 project | reddit.com/r/MachineLearning | 14 Sep 2021
Release notes are here: https://github.com/openai/gym/releases/tag/v0.20.0
What are some alternatives?
scikit-learn - scikit-learn: machine learning in Python
tensorflow - An Open Source Machine Learning Framework for Everyone
MLflow - Open source platform for the machine learning lifecycle
ml-agents - Unity Machine Learning Agents Toolkit
Keras - Deep Learning for humans
BERTopic - Leveraging BERT and c-TF-IDF to create easily interpretable topics.
xgboost - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
LightFM - A Python implementation of LightFM, a hybrid recommendation algorithm.
NuPIC - Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
hebel - GPU-Accelerated Deep Learning Library in Python