Seeking advice on fine-tuning Pythia for semantic search in a non-English language

This page summarizes the projects mentioned and recommended in the original post on /r/learnmachinelearning.

  • DALLE-mtf

    OpenAI's DALL-E for large-scale training in mesh-tensorflow.

  • My current idea is to use EleutherAI's Pythia (the base model behind Databricks Dolly). I would like to know whether translating the Dolly-15k dataset into the target language with a state-of-the-art machine translation service such as DeepL would be a viable way to fine-tune the Pythia base model. I want to use the model for semantic search, so perfection is not required.

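The translation step the post describes could be sketched roughly as follows. This is a minimal illustration, not the poster's method: it assumes each Dolly-15k record is a dict with the dataset's `instruction`, `context`, `response`, and `category` fields, and it takes the translator as a plain callable. `fake_translate` is a hypothetical stand-in; a real run would pass a function that wraps a machine translation client such as DeepL.

```python
from typing import Callable, Dict, Iterable, List

# Free-text fields of a Dolly-15k record. "category" is a label, so it is left untranslated.
TEXT_FIELDS = ("instruction", "context", "response")


def translate_record(record: Dict[str, str], translate: Callable[[str], str]) -> Dict[str, str]:
    """Return a copy of `record` with its non-empty text fields passed through `translate`."""
    translated = dict(record)  # copy so the original record is not mutated
    for field in TEXT_FIELDS:
        text = record.get(field, "")
        # Many records have an empty context; skipping blanks avoids wasted translation calls.
        if text.strip():
            translated[field] = translate(text)
    return translated


def translate_dataset(records: Iterable[Dict[str, str]], translate: Callable[[str], str]) -> List[Dict[str, str]]:
    """Translate every record in an iterable of Dolly-style records."""
    return [translate_record(record, translate) for record in records]


def fake_translate(text: str) -> str:
    """Hypothetical placeholder translator; swap in a real MT client call here."""
    return "[xx] " + text


sample = {
    "instruction": "What is semantic search?",
    "context": "",
    "response": "Search that matches meaning rather than exact keywords.",
    "category": "open_qa",
}
translated_sample = translate_record(sample, fake_translate)
```

Keeping the translator as an injected callable means the same loop can be dry-run with a stub before spending any translation quota, and the output keeps the original schema so it can be fed to whatever fine-tuning script is used afterwards.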


Related posts

  • The open source learning curve for AI researchers

    1 project | news.ycombinator.com | 20 Jul 2023
  • EleutherAI: Empowering Open-Source Artificial Intelligence Research

    1 project | news.ycombinator.com | 11 Jul 2023
  • Does anyone want to collaborate to make anti-capitalist AI?

    1 project | /r/antiwork | 17 May 2023
  • ChatGPT is bonkers.

    1 project | /r/Praise_AI_Overlords | 21 Apr 2023
  • My teacher has falsely accused me of using ChatGPT to use an assignment.

    1 project | /r/ChatGPT | 18 Apr 2023