-
owca
The OWCA dataset is a polish translated dataset of instructions for fine-tuning the Alpaca model made by Stanford .
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
The OWCA dataset is a Polish-translated dataset of instructions for fine-tuning the Alpaca model made by Stanford. https://github.com/Emplocity/owca https://news.ycombinator.com/from?site=huggingface.co
Somewhat related, there's also a Ukrainian translation of the Alpaca dataset. It comes with UAlpaca -- a LLaMA fine-tuned on this translated data, as well as on some other sources: https://github.com/robinhad/kruk https://huggingface.co/robinhad/ualpaca-7b-llama
yes, we also have data_license as you can see. But keep in mind that Stanford ( which we forked original dataset for translation and upgrade) changed their data_license to cc 4.0 non commercial. When we started working on dataset it was ODC-By so we are clear. But I felt obliged to mention that : https://github.com/tatsu-lab/stanford_alpaca/commit/7ad0c6b4f75c7365aca85bda8ad8fbc24915c7ed https://twitter.com/abacaj/status/1643045717907218432