Alpaca dataset translated into Polish [N] [R]

owca

2 21 5.0

The OWCA dataset is a Polish-translated dataset of instructions for fine-tuning the Alpaca model made by Stanford.

https://github.com/Emplocity/owca https://news.ycombinator.com/from?site=huggingface.co

kruk

1 77 5.9 Jupyter Notebook

Ukrainian instruction-tuned language models and datasets

Somewhat related: there is also a Ukrainian translation of the Alpaca dataset. It comes with UAlpaca, a LLaMA model fine-tuned on this translated data as well as on some other sources: https://github.com/robinhad/kruk https://huggingface.co/robinhad/ualpaca-7b-llama

stanford_alpaca

108 28,816 2.0 Python

Code and documentation to train Stanford's Alpaca models, and generate the data.

Yes, we also have a data_license, as you can see. But keep in mind that Stanford (whose original dataset we forked for translation and upgrade) changed their data_license to CC BY-NC 4.0 (non-commercial). When we started working on the dataset it was ODC-By, so we are in the clear. Still, I felt obliged to mention it: https://github.com/tatsu-lab/stanford_alpaca/commit/7ad0c6b4f75c7365aca85bda8ad8fbc24915c7ed https://twitter.com/abacaj/status/1643045717907218432

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

[P] fastLLaMa, A python wrapper to run llama.cpp
5 projects | /r/MachineLearning | 21 Mar 2023

LocalAI: Self-hosted OpenAI alternative reaches 2.14.0
1 project | news.ycombinator.com | 3 May 2024

Haystack DB – 10x faster than FAISS with binary embeddings by default
3 projects | news.ycombinator.com | 28 Apr 2024

Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
3 projects | news.ycombinator.com | 21 Apr 2024

Schedule-Free Learning – A New Way to Train
3 projects | news.ycombinator.com | 6 Apr 2024

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
Tags: instructions, NLP, alpaca, language-model, llama
Post date: 12 Apr 2023
