How to learn Categorical Embeddings in Unsupervised Learning?

This page summarizes the projects mentioned and recommended in the original post on /r/deeplearning

  • autoembedder

    PyTorch autoencoder with additional embeddings layer for categorical data 🚘

  • Solutions I found here and here propose saving the input batch in a variable after feeding it through the embedding layer (but before the AE) and using that as the target for the loss function.

  • attention-mixed-type-clustering

    Attention in Mixed-Type Clustering

  • I am an ML/DL beginner, but this sounds fishy to me, because the embeddings will not be trained by gradient descent. I tested this approach on a small tabular dataset against simply feeding the categorical data into the AE (no embeddings) and found that the first approach (saving the embedded columns as a variable) moderately degrades clustering accuracy and NMI score. (This is not representative, just a small test on a small dataset.) Here is my notebook.

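One way to sidestep the concern raised in the post is to keep the raw category indices as the reconstruction target and score the decoder with a per-column cross-entropy loss. The target is then fixed, and the embedding weights receive gradients through the encoder input. The sketch below only illustrates that idea under stated assumptions; it is not the method used by autoembedder or by the poster's notebook. The class name `CatAutoencoder`, the layer sizes, and the random data are all hypothetical.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)


class CatAutoencoder(nn.Module):
    """Hypothetical minimal model: embed each categorical column,
    compress to a latent code, then predict the original category indices."""

    def __init__(self, cardinalities, emb_dim=4, latent_dim=2):
        super().__init__()
        # One embedding table per categorical column.
        self.embeddings = nn.ModuleList(
            [nn.Embedding(n, emb_dim) for n in cardinalities]
        )
        in_dim = emb_dim * len(cardinalities)
        self.encoder = nn.Sequential(
            nn.Linear(in_dim, 8), nn.ReLU(), nn.Linear(8, latent_dim)
        )
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 8), nn.ReLU())
        # One classification head per column, sized to its cardinality.
        self.heads = nn.ModuleList([nn.Linear(8, n) for n in cardinalities])

    def forward(self, x):
        embedded = torch.cat(
            [emb(x[:, i]) for i, emb in enumerate(self.embeddings)], dim=1
        )
        hidden = self.decoder(self.encoder(embedded))
        return [head(hidden) for head in self.heads]


# Toy data (assumption): two categorical columns with 3 and 5 categories.
cardinalities = [3, 5]
x = torch.stack([torch.randint(0, n, (64,)) for n in cardinalities], dim=1)

model = CatAutoencoder(cardinalities)
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
before = model.embeddings[0].weight.detach().clone()

for _ in range(50):
    logits = model(x)
    # The targets are the integer category indices, not the embedded vectors.
    loss = sum(
        nn.functional.cross_entropy(col_logits, x[:, i])
        for i, col_logits in enumerate(logits)
    )
    opt.zero_grad()
    loss.backward()
    opt.step()

# True if the optimizer updated the first embedding table.
embeddings_changed = not torch.equal(before, model.embeddings[0].weight.detach())
```

After training, the latent code from `model.encoder` could be passed to a clustering algorithm. Whether this improves clustering accuracy or NMI over the approaches compared in the post is untested here.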
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

