wimsey VS hal9

Compare wimsey vs hal9 and see what are their differences.

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
InfluxDB high-performance time series database
Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
influxdata.com
featured
wimsey hal9
4 7
128 165
3.9% 3.0%
7.3 9.5
16 days ago 3 days ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

wimsey

Posts with mentions or reviews of wimsey. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-09.
  • Classic Data science pipelines built with LLMs
    5 projects | news.ycombinator.com | 9 Feb 2025
    I'm definitely biased because my day job is writing ETL pipelines and supporting software, and my current side project is a data contracts library for helping the above[0]. Still I'm not sure I see this happening.

    80% of the focus of an ETL pipeline is in ensuring edge cases are handled appropriately (i.e. not producing models from potentially erroneous data, dead letter queing unknown fields etc).

    I think an LLM would be great for "take this json and make it a pandas dataframe", but a lot less great for interact with this billing API to produce auditable payment tables.

    For areas that are reliability focused, LLMs still need a lot more improvments to be useful.

    [0] https://github.com/benrutter/wimsey

  • The Data Engineering Handbook
    2 projects | news.ycombinator.com | 19 Nov 2024
    Nice list! Although as somebody who works on open source tools for data engineering, it kills me a little to see "companies" as the the list header rather than, say, "projects".

    (also, shameless plug for my.latest project Wimsey which is non-company affiliated but does let you test data in a nice, lightweight way: https://github.com/benrutter/wimsey)

  • Wimsey: A flexible, lightweight data contracts library
    1 project | news.ycombinator.com | 15 Nov 2024
  • This Week In Python
    5 projects | dev.to | 1 Nov 2024
    wimsey – Easy and flexible data testing and documentation

hal9

Posts with mentions or reviews of hal9. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-23.

What are some alternatives?

When comparing wimsey and hal9 you can also consider the following projects:

Scrapling - 🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping simple and easy again!

marimo - A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.

finstruments - Financial instrument definitions built with Python and Pydantic

mistreevous - A tool to build and execute behaviour trees in JavaScript and TypeScript

abacus-minimal - A minimal event-based ledger in Python that follows accounting rules

feed-visualizer - Feed Visualizer creates interactive visualizations by clustering RSS/Atom feed items based on semantic similarity. Feed Visualizer also attempts to automatically predict the labels for each cluster. This application will create a "semantic summary" of a website's contents by scanning its RSS/Atom feed, allowing for easy discovery and navigation to topics of interest. Feed Visualizer creates interactive visualizations in the form of static HTML and JS files, which may be edited and sent to a server.

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
InfluxDB high-performance time series database
Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
influxdata.com
featured

Did you know that Python is
the 2nd most popular programming language
based on number of references?