-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Last week I released a CLI that can do this at scale: https://github.com/radi-cho/datasetGPT. Will use personal funds to generate somewhat big task oriented dataset later today with gpt-3.5 or gpt-4. Will open source it along a way for people to contribute their own datasets so we can collect bigger ones. Would be helpful both for analysis of how LLMs work and for fine tuning downstream models (Alpaca-like).
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
datasetGPT is a command-line interface and a Python library for inferencing Large Language Models to generate textual datasets. (Regenerative feedback loops)
-
DatasetGPT - A command-line interface to generate textual and conversational datasets with LLMs.
-
DatasetGPT – an open-source command line tool for generating datasets with LLMs
-
[R] [P] I generated a 30K-utterance dataset by making GPT-4 prompt two ChatGPT instances to converse.
-
Typer: Python library for building CLI applications