| | selenium-python-helium | waggle-dance |
|---|---|---|
| Mentions | 11 | 5 |
| Stars | 3,696 | 150 |
| Growth | - | - |
| Activity | 0.0 | 9.9 |
| Last commit | 6 months ago | 5 months ago |
| Language | Python | TypeScript |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
selenium-python-helium
-
Autotab – Boring AI Agents for real world tasks
I don't get why people still use XPaths, CSS selectors or HTML IDs to identify elements, even when they are "recorded". Please please please just use my https://github.com/mherrmann/selenium-python-helium instead. It makes so much more sense.
- Selenium-Python but Lighter: Helium
-
Automating some process with PyAutoGui?
You can, though it might not be the best tool for this. Automation of web entry is better done with selenium, or my favorite variation helium.
-
Would you be interested in POM generator for Selenium?
Regarding automatic selectors: check out my open source library https://github.com/mherrmann/selenium-python-helium.
-
Python Selenium Can't Grab CSS_Selector TEXT
quick example https://github.com/mherrmann/selenium-python-helium
-
Beginner scraping question for beautiful soup after series of 'clicks' on webpage
If this all fails you'll have to fall back on a web driver like selenium or one of its wrappers like helium.
-
Simplest code to load webpage in selenium, plaintext dump and send actions to certain elements
Simplest would probably be not using Selenium directly, but a wrapper like helium. Most of its functions you'd need are listed here.
-
PyAutoGUI with CSS Selector
If you're talking about CSS I reckon you want to click/input things on a website inside your browser. In this case you would use a web driver which can automate a web browser like Chrome or Firefox. Something like Helium, Selenium or pyppeteer.
- Playwright-test – cross-browser end-to-end test suite with Playwright
- Lookup ERROR while trying to log in instagram using Helium/Selenium
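Several of the comments above recommend Helium because it targets elements by their visible labels instead of XPaths or CSS selectors. The following is a minimal sketch of that style, not an excerpt from the project. The function name `fill_and_submit` and all of its parameters are hypothetical, and it assumes `helium` and a Chrome driver are installed.

```python
def fill_and_submit(url, field_label, text, button_label, headless=True):
    """Type `text` into the field labelled `field_label`, click a button,
    and return the resulting page title.

    Hypothetical helper for illustration; the labels must match text that
    is actually visible on the target page.
    """
    # Imported lazily so this module still loads where helium is missing.
    from helium import start_chrome, write, click, kill_browser

    # start_chrome returns the underlying Selenium WebDriver.
    driver = start_chrome(url, headless=headless)
    try:
        # Elements are located by visible label, not by selector.
        write(text, into=field_label)
        click(button_label)
        return driver.title
    finally:
        # Always close the browser, even if a label lookup fails.
        kill_browser()
```

If a label is not found on the page, Helium raises a `LookupError`, which is likely what the last thread title above refers to.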
waggle-dance
-
Show HN: Demystifying Advanced Rag Pipelines
This seems very similar to LangSmith’s trace monitoring, which I have been leaning on heavily for observability. You also mention LlamaIndex— how do you see your project fitting into the ecosystem?
This is a great README, but I don’t think I would be able to use this because it is serial.
In my experimental agent system, waggledance.ai, I have been working on a pre-agent step of [picking and synthesizing the right context and tools](https://github.com/agi-merge/waggle-dance/blob/main/packages...) for a given subtask of a larger goal, and it seems to be boosting results. It looks like now I have to try sub-question answering in the mix as well.
-
Autotab – Boring AI Agents for real world tasks
This is amazing. I will try to have it automate my system of agents web app (turtles all the way down) (shameless plug: https://github.com/agi-merge/waggle-dance)
-
Ask HN: Show me your half baked project
- source code: https://github.com/agi-merge/waggle-dance
-
Language Agent Tree Search Unifies Reasoning Acting and Planning in LMs
Any advice for trying to implement this in my project over at https://github.com/agi-merge/waggle-dance
Currently I am creating different agent types for planned subtasks using langchain, so perhaps implementing a custom AgentExecutor? Or would I need to lift it up higher in the logic stack? I am not sure that I understand how the graph search and thought-action-reflection selection process is deciding when and how to reflect if a branch fails, and how it backpropagates the failure to other nodes?
-
Why AutoGPT engineers ditched vector databases
I have been working on a system of agents over at https://github.com/agi-merge/waggle-dance - I already split problems up into subtasks for agents to work on independently. I give agents access to vector databases, using a simple global key for now, but soon a context/parent/child key. Access to the vector DBs is proxied via tools (agents have to “call” saveMemory or retrieveMemory). I also check for looping/repetition FREQUENTLY using in-memory vector databases of the langchain agent callback events.
My opinion on this: eh, who cares? AutoGPT and similar are non-standard use cases for Vector DBs right now, and Vector DBs are useful for RAG.
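The comment above describes two ideas: memory access proxied through tools (`saveMemory` / `retrieveMemory`) under a shared key, and loop detection by comparing new agent events against an in-memory store of earlier ones. A rough sketch of both follows. It is not waggle-dance's code: the class names are hypothetical, and a bag-of-words cosine similarity stands in for a real embedding model and vector database.

```python
import math
import re
from collections import Counter
from typing import List


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class MemoryStore:
    """Hypothetical in-memory store exposed to agents as two tools."""

    def __init__(self):
        self._items = []  # (key, text, vector) triples

    def save_memory(self, key: str, text: str) -> None:
        self._items.append((key, text, embed(text)))

    def retrieve_memory(self, key: str, query: str, k: int = 1) -> List[str]:
        # Only memories saved under the same key are visible.
        q = embed(query)
        scored = [(cosine(q, vec), text) for ky, text, vec in self._items if ky == key]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [text for _, text in scored[:k]]


class LoopDetector:
    """Flags an event that is nearly identical to an earlier one."""

    def __init__(self, threshold: float = 0.9):
        self.threshold = threshold
        self._seen = []

    def is_repeat(self, event: str) -> bool:
        vec = embed(event)
        repeated = any(cosine(vec, old) >= self.threshold for old in self._seen)
        self._seen.append(vec)
        return repeated
```

Moving from a single global key to the context/parent/child keys mentioned in the comment would, in this sketch, only change what is passed as `key`.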
What are some alternatives?
undetected-chromedriver - Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)
playwright-chrome-recorder - playwright-chrome-recorder
SeleniumBase - 📊 Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.
webdriver-bidi - Bidirectional WebDriver protocol for browser automation
Playwright - Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
RVS_GTDriver - A "Pure Swift" Low-Level SDK for Bluetooth Low-Energy Devices (Work In Progress)
pyppeteer - Headless chrome/chromium automation library (unofficial port of puppeteer)
rag-demystified - An LLM-powered advanced RAG pipeline built from scratch
helium - Selenium-python but lighter: Helium is the best Python library for web automation. [Moved to: https://github.com/mherrmann/selenium-python-helium]
paperless-ngx - A community-supported supercharged version of paperless: scan, index and archive all your physical documents
captcha_bypass - Captcha bypassing methods for Selenium
pyleniumio - Bring the best of Selenium and Cypress into a single Python package