SDV
h2o-llmstudio
SDV | h2o-llmstudio | |
---|---|---|
59 | 13 | |
2,141 | 3,583 | |
2.4% | 2.8% | |
9.4 | 9.3 | |
7 days ago | 7 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SDV
-
Synthetic data generation for tabular data
Can someone help me understand the licensing of this?
https://github.com/sdv-dev/SDV/blob/main/LICENSE
It was MIT licensed up until 2022 where it was changed to what it is now, where they say that it will become MIT again 4 years after release... but is that from when the license was changed or the first release of the software in GitHub?
- SDV: NEW Data - star count:1441.0
- FLaNK Stack Weekly for 30 April 2023
- SDV: NEW Data - star count:1196.0
h2o-llmstudio
- Paid dev gig: develop a basic LLM PEFT finetuning utility
-
building LLM model to answer question
Vector databases are probably a good place to start, though you've already tried LlamaIndex. You might want to try https://github.com/h2oai/h2o-llmstudio and https://github.com/h2oai/h2ogpt.
- [P] Uptraining a pretrained model using company data?
-
Permissive LLaMA 7b chat/instruct model
Training framework: https://github.com/h2oai/h2o-llmstudio
-
Is what I need possible currently?
Check out LLM Studio for fine tuning LLMs. Open source: https://github.com/h2oai/h2o-llmstudio
- FLaNK Stack Weekly for 30 April 2023
- FLaNK Stack Weekly for 24April2023
- GitHub - h2oai/h2o-llmstudio: H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs
- New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio
- Can an average person learn how to build a LLM model?
What are some alternatives?
CTGAN - Conditional GAN for generating synthetic tabular data.
h2ogpt - Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
gretel-python-client - The Gretel Python Client allows you to interact with the Gretel REST API.
killport - A command-line tool to easily kill processes running on a specified port.
machine-learning-for-trading - Code for Machine Learning for Algorithmic Trading, 2nd edition.
HealthGPT - Query your Apple Health data with natural language 💬 🩺
tsfresh - Automatic extraction of relevant features from time series:
bark - 🔊 Text-Prompted Generative Audio Model
Copulas - A library to model multivariate data using copulas.
pandas-ai - Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
TimeSynth - A Multipurpose Library for Synthetic Time Series Generation in Python
ue5-llama-lora - A proof-of-concept project that showcases the potential for using small, locally trainable LLMs to create next-generation documentation tools.