hamilton VS hamilton

Compare hamilton vs hamilton and see what are their differences.

hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does. (by DAGWorks-Inc)

hamilton

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton (by stitchfix)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
hamilton hamilton
21 26
1,321 878
3.7% -
9.8 8.1
6 days ago about 1 year ago
Jupyter Notebook Python
GNU General Public License v3.0 or later BSD 3-clause Clear License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

hamilton

Posts with mentions or reviews of hamilton. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-26.
  • Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines
    1 project | news.ycombinator.com | 2 May 2024
  • Building an Email Assistant Application with Burr
    6 projects | dev.to | 26 Apr 2024
    Note that this uses simple OpenAI calls — you can replace this with Langchain, LlamaIndex, Hamilton (or something else) if you prefer more abstraction, and delegate to whatever LLM you like to use. And, you should probably use something a little more concrete (E.G. instructor) to guarantee output shape.
  • Using IPython Jupyter Magic commands to improve the notebook experience
    1 project | dev.to | 3 Mar 2024
    In this post, we’ll show how your team can turn any utility function(s) into reusable IPython Jupyter magics for a better notebook experience. As an example, we’ll use Hamilton, my open source library, to motivate the creation of a magic that facilitates better development ergonomics for using it. You needn’t know what Hamilton is to understand this post.
  • FastUI: Build Better UIs Faster
    12 projects | news.ycombinator.com | 1 Mar 2024
    We built an app with it -- https://blog.dagworks.io/p/building-a-lightweight-experiment. You can see the code here https://github.com/DAGWorks-Inc/hamilton/blob/main/hamilton/....

    Usually we've been prototyping with streamlit, but found that at times to be clunky. FastUI still has rough edges, but we made it work for our lightweight app.

  • Show HN: On Garbage Collection and Memory Optimization in Hamilton
    1 project | news.ycombinator.com | 24 Oct 2023
  • Facebook Prophet: library for generating forecasts from any time series data
    7 projects | news.ycombinator.com | 26 Sep 2023
    This library is old news? Is there anything new that they've added that's noteworthy to take it for another spin?

    [disclaimer I'm a maintainer of Hamilton] Otherwise FYI Prophet gels well with https://github.com/DAGWorks-Inc/hamilton for setting up your features and dataset for fitting & prediction[/disclaimer].

  • Show HN: Declarative Spark Transformations with Hamilton
    1 project | news.ycombinator.com | 24 Aug 2023
  • Langchain Is Pointless
    16 projects | news.ycombinator.com | 8 Jul 2023
    I had been hearing these pains from Langchain users for quite a while. Suffice to say I think:

    1. too many layers of OO abstractions are a liability in production contexts. I'm biased, but a more functional approach is a better way to model what's going on. It's easier to test, wrap a function with concerns, and therefore reason about.

    2. as fast as the field is moving, the layers of abstractions actually hurt your ability to customize without really diving into the details of the framework, or requiring you to step outside it -- in which case, why use it?

    Otherwise I definitely love the small amount of code you need to write to get an LLM application up with Langchain. However you read code more often than you write it, in which case this brevity is a trade-off. Would you prefer to reduce your time debugging a production outage? or building the application? There's no right answer, other than "it depends".

    To that end - we've come up with a post showing how one might use Hamilton (https://github.com/dagWorks-Inc/hamilton) to easily create a workflow to ingest data into a vector database that I think has a great production story. https://open.substack.com/pub/dagworks/p/building-a-maintain...

    Note: Hamilton can cover your MLOps as well as LLMOps needs; you'll invariably be connecting LLM applications with traditional data/ML pipelines because LLMs don't solve everything -- but that's a post for another day.

  • Free access to beta product I'm building that I'd love feedback on
    1 project | /r/quants | 31 May 2023
    This is me. I drive an open source library Hamilton that people doing time-series/ML work love to use. I'm building a paid product around it at DAGWorks, and I'm after feedback on our current version. Can I entice anyone to:
  • IPyflow: Reactive Python Notebooks in Jupyter(Lab)
    5 projects | news.ycombinator.com | 10 May 2023
    From a nuts and bolts perspective, I've been thinking of building some reactivity on top of https://github.com/dagworks-inc/hamilton (author here) that could get at this. (If you have a use case that could be documented, I'd appreciate it.)

hamilton

Posts with mentions or reviews of hamilton. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-27.
  • Write production grade pandas (and other libraries!) with Hamilton
    2 projects | /r/Python | 27 Feb 2023
    And find the repository here: https://github.com/dagworks-inc/hamilton/
  • Useful libraries for data engineering in various programming languages
    1 project | /r/dataengineering | 16 Sep 2022
    Python - https://github.com/stitchfix/hamilton (author here). It's great if you want your code to be always unit testable and documentation friendly, and you want to be able to visualize execution. Blog post on using it with Pandas https://link.medium.com/XhyYD9BAntb.
  • Cognitive Loads in Programming
    5 projects | news.ycombinator.com | 31 Aug 2022
    Yes! As one of the creators of https://github.com/stitchfix/hamilton this was one of the aims. Simplifying the cognitive burden for those developing and managing data transforms over the course of years, and in particular for ones they didn't write!

    For example in Hamilton -- we force people to write "declarative functions" which then are stitched together to create a dataflow.

    E.g. example function -- my guess is that you can read and understand/guess what it does very easily.

  • Prefect vs other things question
    2 projects | /r/mlops | 3 Aug 2022
    For (1) there are quite a few options - prefect is one, metaflow is another, airflow, dagster, even https://github.com/stitchfix/hamilton (core contributor here), etc.
  • Field Lineage
    4 projects | /r/dataengineering | 2 Aug 2022
    If you're want to do more python https://github.com/stitchfix/hamilton allows you to model dependencies at a columnar (field) level.
  • Show HN
    1 project | news.ycombinator.com | 1 Aug 2022
  • [D] Is anyone working on interesting ML libraries and looking for contributors?
    4 projects | /r/MachineLearning | 17 Jun 2022
    Take a look at https://github.com/stitchfix/hamilton - we're after contributors who can help us grow the project, e.g. make documentation great, dog fooding features and suggesting/contributing usability improvements.
  • Useful Python decorators for Data Scientists
    1 project | /r/Python | 23 May 2022
    For a real world example of their power, we built an entire framework (https://github.com/stitchfix/hamilton) at Stitch Fix, where a lot of cool magic is provide via decorators - see https://hamilton-docs.gitbook.io/docs/reference/api-reference/available-decorators and these two source files (https://github.com/stitchfix/hamilton/blob/main/hamilton/function_modifiers_base.py, https://github.com/stitchfix/hamilton/blob/main/hamilton/function_modifiers.py ). Note we do some non-trivial stuff via them.
  • unit tests
    1 project | /r/mlops | 23 May 2022
    For data processing/transform code, I would recommend looking at https://github.com/stitchfix/hamilton, especially if you're trying to test pandas code. Short getting started here - https://towardsdatascience.com/how-to-use-hamilton-with-pandas-in-5-minutes-89f63e5af8f5 (disclaimer: I'm one of the authors).
  • Dealing with hundreds of customer/computed columns
    1 project | /r/dataengineering | 19 May 2022
    The python package, hamilton, from Stitch Fix (https://hamilton-docs.gitbook.io/docs/) can help manage transformations on pandas dataframes. This DAG of transformations is managed separately in a file - so it can be versioned, in case the transformations change. The memory required is reduced, because only the API call tables and mapping parameter table have to be in memory. The calculated columns can be produced as needed. Just like dbt, transformations are separate from the source tables - but hamilton can be used on any python object - not just dataframes. dbt is SQL based.

What are some alternatives?

When comparing hamilton and hamilton you can also consider the following projects:

dagster - An orchestration platform for the development, production, and observation of data assets.

prosto - Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

haystack - :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.

tree-of-thought-llm - [NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

plumbing - Prismatic's Clojure(Script) utility belt

snowpark-python - Snowflake Snowpark Python API

OpenLineage - An Open Standard for lineage metadata collection

aipl - Array-Inspired Pipeline Language

composer - Supercharge Your Model Training

vscode-reactive-jupyter - A simple Reactive Python Extension for Visual Studio Code

polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust