-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Solutions I found here and here propose to save the Input Batch as a in a variable after feeding it into the Embeddings Layer (but before the AE) and use that as the target for the loss function.
I am a ML/DL beginner, but this sounds fishy to me, because the Embeddings will not be trained by gradient descent. I tested this approach on a small tabular dataset vs. just feeding the categorial data into the AE (no Embeddings) and found that using the first approach (saving embedded cols as variable) to moderatly degrade Clustering Accuracy and NMI Score (This is not representative - just a small test on a small dataset). Here is my Notebook.