Tranformer-based Denoising AutoEncoder for ST Unsupervised pre-training

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • tsdae

    Tranformer-based Denoising AutoEncoder for Sentence Transformers Unsupervised pre-training.

  • A new PyPI package for training sentence embedding models in just 2 lines.

    The acquisition of sentence embeddings often necessitates a substantial volume of labeled data. However, in many cases and fields, labeled data is rarely accessible, and the procurement of such data is costly. In this project, we employ an unsupervised process grounded in pre-trained Transformers-based Sequential Denoising Auto-Encoder (TSDAE), introduced by the Ubiquitous Knowledge Processing Lab of Darmstadt, which can realize a performance level reaching 93.1% of in-domain supervised methodologies.

    The TSDAE schema comprises two components: an encoder and a decoder. Throughout the training process, TSDAE translates tainted sentences into uniform-sized vectors, necessitating the decoder to reconstruct the original sentences utilizing this sentence embedding. For good reconstruction quality, the semantics must be captured well in the sentence embeddings from the encoder. Subsequently, during inference, the encoder is solely utilized to form sentence embeddings.

    GitHub : https://github.com/louisbrulenaudet/tsdae

    Installation :

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • AIM Weekly 03 June 2024

    19 projects | dev.to | 3 Jun 2024
  • Llama3V is suspected to have been stolen from the MiniCPM-Llama3-v2.5 project

    2 projects | news.ycombinator.com | 2 Jun 2024
  • Lama3-V project from a Stanford team plagiarized a lot from MiniCPM-Llama3-v2.5

    1 project | news.ycombinator.com | 3 Jun 2024
  • [2209.02842] ASR2K: Speech Recognition for Around 2000 Languages without Audio

    1 project | /r/speechtech | 10 Sep 2022
  • Text-to-Speech with Speaker Diarization

    1 project | news.ycombinator.com | 2 Jun 2024