datasetGPT vs botbots
| | datasetGPT | botbots |
|---|---|---|
| Mentions | 6 | 4 |
| Stars | 275 | 166 |
| Growth | - | - |
| Activity | 6.0 | 3.8 |
| Last commit | 8 months ago | about 1 year ago |
| Language | Python | |
| License | - | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datasetGPT
- datasetGPT is a command-line interface and a Python library for inferencing Large Language Models to generate textual datasets. (Regenerative feedback loops)
- [R] [P] I generated a 30K-utterance dataset by making GPT-4 prompt two ChatGPT instances to converse.
  A dataset consisting of dialogues between two instances of ChatGPT (gpt-3.5-turbo). The CLI commands and dialogue prompts themselves have been written by GPT-4. The dataset covers a wide range of contexts (questions and answers, arguing and reasoning, task-oriented dialogues) and downstream tasks (e.g., hotel reservations, medical advice). Texts have been generated with datasetGPT and the OpenAI API as a backend. Approximate cost for generation: $35.
- [P] two copies of gpt-3.5 (one playing as the oracle, and another as the guesser) performs poorly on the game of 20 Questions (68/1823).
  Last week I released a CLI that can do this at scale: https://github.com/radi-cho/datasetGPT. Will use personal funds to generate somewhat big task oriented dataset later today with gpt-3.5 or gpt-4. Will open source it along a way for people to contribute their own datasets so we can collect bigger ones. Would be helpful both for analysis of how LLMs work and for fine tuning downstream models (Alpaca-like).
- DatasetGPT - A command-line interface to generate textual and conversational datasets with LLMs.
- DatasetGPT – an open-source command line tool for generating datasets with LLMs
- [P] [D] datasetGPT - A command-line tool to generate datasets by inferencing LLMs. Supports OpenAI, Cohere, and Petals.
  GitHub: https://github.com/radi-cho/datasetGPT
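
The posts above describe building a dataset by letting two model instances talk to each other. A minimal sketch of that alternating loop might look like the following. It does not use datasetGPT's actual interface: `generate_dialogue`, `echo_agent`, and the `Responder` type are hypothetical names made up for illustration, and the toy responders stand in for real LLM backends.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for an LLM backend: it receives the conversation so
# far and returns the next utterance. A real setup would call an API here.
Responder = Callable[[List[Dict[str, str]]], str]


def generate_dialogue(agent_a: Responder, agent_b: Responder,
                      opening: str, max_turns: int = 4) -> List[Dict[str, str]]:
    """Alternate between two responders and collect every utterance."""
    # Speaker A opens the conversation with a fixed prompt.
    history = [{"speaker": "A", "text": opening}]
    # After the opening, B answers first, then the two keep alternating.
    speakers = [("B", agent_b), ("A", agent_a)]
    for turn in range(max_turns - 1):
        name, agent = speakers[turn % 2]
        history.append({"speaker": name, "text": agent(history)})
    return history


def echo_agent(label: str) -> Responder:
    """Toy responder that only reports how many utterances it has seen."""
    def respond(history: List[Dict[str, str]]) -> str:
        return f"{label} has seen {len(history)} utterance(s)"
    return respond


dialogue = generate_dialogue(echo_agent("guest"), echo_agent("clerk"),
                             opening="I would like to book a room.")
for utterance in dialogue:
    print(f'{utterance["speaker"]}: {utterance["text"]}')
```

Swapping the toy responders for functions that call a hosted model would turn the collected `history` lists into dialogue records of the kind the posts mention.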
botbots
What are some alternatives?
- isort - A Python utility / library to sort imports.
- Chat-GPT-4-Bing-AI-API - ChatGPT 4 Bing AI Chat API
- PyInquirer - A Python module for common interactive command line user interfaces
- awesome-generative-deep-art - A curated list of Generative AI tools, works, models, and references [Moved to: https://github.com/filipecalegario/awesome-generative-ai]
- pyreports - pyreports is a python library that allows you to create complex report from various sources
- awesome-gpt-prompt-engineering - A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.
- pytermgui - Python TUI framework with mouse support, modular widget system, customizable and rapid terminal markup language and more!
- AlignLLMHumanSurvey - Aligning Large Language Models with Human: A Survey
- isort - A Python utility / library to sort imports. [Moved to: https://github.com/PyCQA/isort]
- chatgpt-vscode - Your best AI pair programmer in VS Code