Do we really need 300 floats to represent the meaning of a word? Representing words with words - a logical approach to word embedding using a self-supervised Tsetlin Machine Autoencoder.

This page summarizes the projects mentioned and recommended in the original post on /r/machinelearningnews

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • tmu

    Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features, drop clause, Type III Feedback, focused negative sampling, multi-task classifier, autoencoder, literal budget, and one-vs-one multi-class classifier. TMU is written in Python with wrappers for C and CUDA-based clause evaluation and updating.

  • Here is a new self-supervised machine learning approach that captures word meaning with concise logical expressions. The logical expressions consist of contextual words like “black,” “cup,” and “hot” to define other words like “coffee,” thus being human-understandable. I raise the question in the heading because our logical embedding performs competitively on several intrinsic and extrinsic benchmarks, matching pre-trained GLoVe embeddings on six downstream classification tasks. Thanks to my clever PhD student Bimal, we now have even more fun and exciting research ahead of us. Our long term research goal is, of course, to provide an energy efficient and transparent alternative to deep learning. You find the paper here: https://arxiv.org/abs/2301.00709 , an implementation of the Tsetlin Machine Autoencoder here: https://github.com/cair/tmu, and a simple word embedding demo here: https://github.com/cair/tmu/blob/main/examples/IMDbAutoEncoderDemo.py.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Vector Databases for generative AI applications

    1 project | dev.to | 6 May 2024
  • Fingerpaint – Draw using your laptop's touchpad

    1 project | news.ycombinator.com | 6 May 2024
  • I analyzed code review best practices for a year. This is what I learned.

    1 project | dev.to | 6 May 2024
  • From English Teacher to AWS Architect

    1 project | dev.to | 6 May 2024
  • Pure-PyTorch Implementation of Kolmogorov-Arnold Network (Kan)

    1 project | news.ycombinator.com | 6 May 2024