How to create a 1M record table with a single query

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • Mergify - Updating dependencies is time-consuming.
  • Sonar - Write Clean Python Code. Always.
  • InfluxDB - Collect and Analyze Billions of Data Points in Real Time
  • zefaker

    zefaker is a command-line tool for generating CSV, Excel, JSON and SQL files from a Groovy DSL

    If you need another repeatable way to create random data you can export as SQL INSERTs (or CSV/Excel files) you may find a tool we built and use at work useful: https://github.com/creditdatamw/zefaker

    Needs a little Groovy but very convenient for generating random (or non-random) data

  • synth

    The Declarative Data Generator

    This looks convenient (and performant). But how does it scale as queries join across tables?

    If you need to create test data with complex business logic, referential integrity and constraints we've been working on declarative data generator that is build exactly for this: https://github.com/openquery-io/synth.

  • Mergify

    Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.

  • faker

    Faker is a Python package that generates fake data for you. (by joke2k)

    Creating realistic fake data is useful in lower environments and for load testing. Outside of SQL I like faker: https://github.com/joke2k/faker

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts