SaaSHub helps you find the best software and product alternatives Learn more →
datasetGPT Alternatives
Similar projects and alternatives to datasetGPT based on common topics and language
-
botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering various contexts and tasks (task-oriented dialogue systems, abstract reasoning, brainstorming).
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
pyreports
pyreports is a python library that allows you to create complex report from various sources
-
pytermgui
Python TUI framework with mouse support, modular widget system, customizable and rapid terminal markup language and more!
-
isort
Discontinued A Python utility / library to sort imports. [Moved to: https://github.com/PyCQA/isort] (by timothycrosley)
datasetGPT reviews and mentions
- datasetGPT is a command-line interface and a Python library for inferencing Large Language Models to generate textual datasets. (Regenerative feedback loops)
-
[R] [P] I generated a 30K-utterance dataset by making GPT-4 prompt two ChatGPT instances to converse.
A dataset consisting of dialogues between two instances of ChatGPT (gpt-3.5-turbo). The CLI commands and dialogue prompts themselves have been written by GPT-4. The dataset covers a wide range of contexts (questions and answers, arguing and reasoning, task-oriented dialogues) and downstream tasks (e.g., hotel reservations, medical advice). Texts have been generated with datasetGPT and the OpenAI API as a backend. Approximate cost for generation: $35.
-
[P] two copies of gpt-3.5 (one playing as the oracle, and another as the guesser) performs poorly on the game of 20 Questions (68/1823).
Last week I released a CLI that can do this at scale: https://github.com/radi-cho/datasetGPT. Will use personal funds to generate somewhat big task oriented dataset later today with gpt-3.5 or gpt-4. Will open source it along a way for people to contribute their own datasets so we can collect bigger ones. Would be helpful both for analysis of how LLMs work and for fine tuning downstream models (Alpaca-like).
- DatasetGPT - A command-line interface to generate textual and conversational datasets with LLMs.
- DatasetGPT – an open-source command line tool for generating datasets with LLMs
-
[P] [D] datasetGPT - A command-line tool to generate datasets by inferencing LLMs. Supports OpenAI, Cohere, and Petals.
GitHub: https://github.com/radi-cho/datasetGPT
-
A note from our sponsor - SaaSHub
www.saashub.com | 27 Apr 2024
Stats
The primary programming language of datasetGPT is Python.
Sponsored