"We have great datasets"

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Our great sponsors
  • LearnThisRepo.com - Learn 300+ open source libraries for free using AI.
  • WorkOS - The modern API for authentication & user identity.
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • PolyFuzz

    Fuzzy string matching, grouping, and evaluation.

  • OpenRefine

    OpenRefine is a free, open source power tool for working with messy data and improving it

    Open Refine will get you about 70% there. It's FOSS

  • LearnThisRepo.com

    Learn 300+ open source libraries for free using AI. LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts