-
spider
scripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I've been looking for databases with real-world schema and faker data (eg 10,000 entries of fake users) to test my natural langaugae to SQL generative model, as well as the efficiency of the generated queries
The cloest thing I can find is annotated dataset like Spider (https://yale-lily.github.io/spider) but after digging more into it, it's not as real-world-ish as I've hoped for.
Are there any SaSS, paid services, etc, where I can have access databases with complex real-world(-ish) schemas (populated with real-world-ish data)?
Thanks!
Related posts
-
Show HN: FileKitty – Combine and label text files for LLM prompt contexts
-
Ask HN: Freelancer? Seeking freelancer? (May 2024)
-
More Low-Bit LLMs
-
Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments
-
Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping