skyvern
OpenAdapt
skyvern | OpenAdapt | |
---|---|---|
8 | 22 | |
4,229 | 516 | |
26.8% | 46.1% | |
9.3 | 9.3 | |
1 day ago | 5 days ago | |
Python | Python | |
GNU Affero General Public License v3.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
skyvern
-
ScrapeGraphAI: Web scraping using LLM and direct graph logic
https://github.com/Skyvern-AI/skyvern
This is pretty much what we're building at Skyvern. The only problem is that inference cost is still a little bit too high for scraping, but we expect that to change in the next year
-
Show HN: Skyvern – open-source browser automation tool
This is a great point. This is something already on our roadmap. We call it "prompt caching", but I realize writing this that it's a terrible name. Will update! (https://github.com/Skyvern-AI/Skyvern?tab=readme-ov-file#fea...)
Thank you for this feedback
-
LaVague: Open-source Large Action Model to automate Selenium browsing
We're also working in the space and just open sourced Skyvern
https://github.com/Skyvern-AI/Skyvern
OpenAdapt
- Rabbit R1 can be run on a Android device
- OpenAdapt: AI-First Process Automation with Large Multimodal Models
- Adapter between LMMs and traditional desktop and web GUI
-
I Witnessed the Future of AI, and It's a Broken Toy
> Rabbit has said the device will be able to learn any app, if you teach it.
We're building this over at https://github.com/OpenAdaptAI/OpenAdapt. OpenAdapt learns to automate tasks in desktop apps by observing human demonstrations.
Early demo: https://twitter.com/abrichr/status/1784307190062342237 (more coming soon!)
The demo is overly simplistic to keep it short -- it also works with arbitrary applications and operations.
Also, we're open source. Contributions and feedback are welcome and encouraged :)
-
Memary is a cutting-edge long-term memory system based on a knowledge graph
Very interesting, thank you for making this available!
At OpenAdapt (https://github.com/OpenAdaptAI/OpenAdapt) we are looking into using pm4py (https://github.com/pm4py) to extract a process graph from a recording of user actions.
I will look into this more closely. In the meantime, could the authors share their perspective on whether Memary could be useful here?
-
Rabbit r1 source code [part 1]
See https://github.com/OpenAdaptAI/OpenAdapt for an alternative that works with desktop GUIs.
-
Survey Study on AI Agents Architectures(2024)
Not mentioned: learning from demonstration. This is the approach we are taking at https://github.com/OpenAdaptAI/OpenAdapt.
- AI-First Process Automation with LLMs/Action/Multimodal/Visual Language Models
-
Show HN: Skyvern – open-source browser automation tool
Congratulations on shipping!
Check out https://github.com/OpenAdaptAI/OpenAdapt for an open source (MIT license) alternative that also works on desktop (including Citrix!)
-
LaVague: Open-source Large Action Model to automate Selenium browsing
https://github.com/mldsai/puterbot is designed for all desktop applications, including browsers. We're also working on a chrome extension to support reading/writing directly to DOM: https://github.com/OpenAdaptAI/OpenAdapt/pull/364
What are some alternatives?
LaVague - Large Action Model framework to turn natural language into browser actions
ios-mail - Secure email that protects your privacy
browserpilot - Natural language browser automation
CogVLM - a state-of-the-art-level open visual language model | 多模态预训练模型
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
adept-inference - Inference code for Persimmon-8B
IfcOpenShell - Open source IFC library and geometry engine
strawberry - A GraphQL library for Python that leverages type annotations 🍓
vimGPT - Browse the web with GPT-4V and Vimium
obsidian-releases - Community plugins list, theme list, and releases of Obsidian.
apertium - Core tools (driver script, transfer, tagger, formatters) for the FOSS RBMT system Apertium
share-file-systems - Use a Windows/OSX like GUI in the browser to share files cross OS privately. No cloud, no server, no third party.