LaVague: Open-source Large Action Model to automate Selenium browsing

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • LaVague

    Copilot for web automation

  • browserpilot

    Natural language browser automation

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • skyvern

    Automate browser-based workflows with LLMs and Computer Vision

  • We're also working in the space and just open sourced Skyvern

    https://github.com/Skyvern-AI/Skyvern

  • Playwright

    Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

  • If you ever find that you need to automate some browsing and Selenium comes to your mind, banish that thought! :)

    Do yourself a favour, use Playwright instead.

    https://playwright.dev/

    It's a headless browser that's both faster and less flaky than Selenium.

  • open-interpreter

    A natural language interface for computers

  • I think openinterpreter [1] were one of the first teams in this space along with shroominic code interpreter api and afaik they started with selenium but have expanded to do a lot more os level work but wonder if having a more narrow specialization could help these newer projects be better at the one thing they are focused on.

    [1] https://openinterpreter.com/

  • mitta-community

    Community repository for MittaAI users.

  • I built this with Playwright and OpenAI's function calling stuff (sorry, no time for docs): https://github.com/MittaAI/mitta-community/tree/main/service...

    My thought was to put the results of this in a vector store, with any errors that resulted as opposed to wasting time training a model.

  • puterbot

    Discontinued AI-First Process Automation with Large [Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models [Moved to: https://github.com/OpenAdaptAI/OpenAdapt]

  • https://github.com/mldsai/puterbot is designed for all desktop applications, including browsers. We're also working on a chrome extension to support reading/writing directly to DOM: https://github.com/OpenAdaptAI/OpenAdapt/pull/364

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • OpenAdapt

    AI-First Process Automation with Large [Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

  • https://github.com/mldsai/puterbot is designed for all desktop applications, including browsers. We're also working on a chrome extension to support reading/writing directly to DOM: https://github.com/OpenAdaptAI/OpenAdapt/pull/364

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts