-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
langchain
Discontinued ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain] (by hwchase17)
-
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
The data pipeline here https://github.com/lamini-ai/lamini uses a seed dataset from self-instruct (Apache 2 license), and edited models from Pythia (Apache 2) and Dolly (Apache 2). We release our code and data under a CC-BY 4.0 license.
SQL data (https://github.com/lamini-ai/lamini-sql/)
If you do want to fine-tune on the generated data yourself, and are willing to bring your own GPU, etc. Consider one of many good open fine-tuning frameworks, e.g. Alpaca-Lora: https://github.com/tloen/alpaca-lora
how is lamini different from [LangChain](https://github.com/hwchase17/langchain)
I'd like to see the same benchmarks being used by other researchers iterating LLMs close to the state of the art. I mentioned a couple other papers already, but for a great example, take a look at the Evaluation section of the OpenLLama repository.