Transformers: How to compare performance to base model?

This page summarizes the projects mentioned and recommended in the original post on /r/MLQuestions

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • simpleT5

    simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

  • Currently I just took ~42000 samples and trained a translation task directly on codeT5 with https://github.com/Shivanandroy/simpleT5. Validation loss and at least the qualitative results are not to bad. Im now going to try to compare it to the base codeT5-model with the *.loss function as suggested above.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [P] SimpleT5 : Train T5 models in just 3 lines of code

    1 project | /r/MachineLearning | 2 Jun 2021
  • has anyone here implemented Convolutional Vision Transformer (CvT)?

    2 projects | /r/pytorch | 16 May 2023
  • CvT: Introducing Convolutions to Vision Transformers

    1 project | /r/computervision | 30 Mar 2021
  • OpenAdapt: AI-First Process Automation with Large Multimodal Models

    1 project | news.ycombinator.com | 5 May 2024
  • Adapter between LMMs and traditional desktop and web GUI

    1 project | news.ycombinator.com | 1 May 2024