wimsey

Easy and flexible data contracts (by benrutter)

Wimsey Alternatives

Similar projects and alternatives to wimsey

  1. OpenRefine

    OpenRefine is a free, open source power tool for working with messy data and improving it

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. FlashLearn

    Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.

  4. Scrapling

    2 wimsey VS Scrapling

    🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

  5. hal9

    7 wimsey VS hal9

    Hal9 — Create and Share Generative Apps

  6. abacus-minimal

    A minimal event-based ledger in Python that follows accounting rules

  7. data-engineer-handbook

    3 wimsey VS data-engineer-handbook

    This is a repo with links to everything you'd ever want to learn about data engineering

  8. finstruments

    Financial instrument definitions built with Python and Pydantic

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better wimsey alternative or higher similarity.

wimsey discussion

Log in or Post with

wimsey reviews and mentions

Posts with mentions or reviews of wimsey. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-09.
  • Classic Data science pipelines built with LLMs
    5 projects | news.ycombinator.com | 9 Feb 2025
    I'm definitely biased because my day job is writing ETL pipelines and supporting software, and my current side project is a data contracts library for helping the above[0]. Still I'm not sure I see this happening.

    80% of the focus of an ETL pipeline is in ensuring edge cases are handled appropriately (i.e. not producing models from potentially erroneous data, dead letter queing unknown fields etc).

    I think an LLM would be great for "take this json and make it a pandas dataframe", but a lot less great for interact with this billing API to produce auditable payment tables.

    For areas that are reliability focused, LLMs still need a lot more improvments to be useful.

    [0] https://github.com/benrutter/wimsey

  • The Data Engineering Handbook
    2 projects | news.ycombinator.com | 19 Nov 2024
    Nice list! Although as somebody who works on open source tools for data engineering, it kills me a little to see "companies" as the the list header rather than, say, "projects".

    (also, shameless plug for my.latest project Wimsey which is non-company affiliated but does let you test data in a nice, lightweight way: https://github.com/benrutter/wimsey)

  • Wimsey: A flexible, lightweight data contracts library
    1 project | news.ycombinator.com | 15 Nov 2024
  • This Week In Python
    5 projects | dev.to | 1 Nov 2024
    wimsey – Easy and flexible data testing and documentation
  • A note from our sponsor - SaaSHub
    www.saashub.com | 24 Mar 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic wimsey repo stats
4
125
7.5
about 1 month ago

benrutter/wimsey is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of wimsey is Python.


Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Python is
the 2nd most popular programming language
based on number of references?