Cascadia.jl
HTTP.jl
Our great sponsors
Cascadia.jl | HTTP.jl | |
---|---|---|
2 | 7 | |
116 | 623 | |
0.0% | 1.3% | |
3.2 | 7.7 | |
almost 2 years ago | 8 days ago | |
Julia | Julia | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Cascadia.jl
-
I Need to Convert HTML Files to CSV
https://github.com/Algocircle/Cascadia.jl is a julia library for css-style queries on Gumbo.jl parsed HTML.
-
Recommendations on how to start web scraping with julia for price updates? (if possible)
I haven't seen that tutorial, but I agree that HTTP.jl, Gumbo.jl, and Cascadia.jl are the way. I used them to export public wishlists from bookdepository, which has no API nor a built in exporting tool.
HTTP.jl
-
Machine learning with Julia - Solve Titanic competition on Kaggle and deploy trained AI model as a web service
The req.url field contains the URL of the received request, the req.method field contains request method, like GET or POST, the req.body field contains the POST body of the request in binary format. HTTP request object contains much other information. All this you can find in HTTP.jl documentation. Our web application will only check the request method. If the received request is a POST request, it will parse req.body to JSON object and send the data from this object to the isSurvived function to make a prediction and return it to the client browser. For all other request types, it will just return the content of the index.html file, to display the web interface. This is how the whole source of titanic.jl web service looks:
-
How can I use Julia to search on the web automatically?
If you want to just get the html of a website whose url you already have you can make requests from the http.jl package. https://juliaweb.github.io/HTTP.jl/stable/
-
Automate the boring stuff with Julia?
HTTP.jl and Gumbo.jl for web-scraping
- PyTorch: Where we are headed and why it looks a lot like Julia (but not exactly)
-
Recommendations on how to start web scraping with julia for price updates? (if possible)
I haven't seen that tutorial, but I agree that HTTP.jl, Gumbo.jl, and Cascadia.jl are the way. I used them to export public wishlists from bookdepository, which has no API nor a built in exporting tool.
-
Why not Julia?
I find some of the library documentation hard to understand. Compare http.jl with python's requests, for example. Something as core as HTTP requests should have clear docs with tonnes of examples. Part of this is also a personal dislike of documenter.jl styling. Idk why the contrast is so low – would prefer a standard readthedocs theme.
- Julia 1.6: what has changed since Julia 1.0?
What are some alternatives?
Gumbo.jl - Julia wrapper around Google's gumbo C library for parsing HTML
geni-performance-benchmark
autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python
julia - The Julia Programming Language
dude - dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
DaemonMode.jl - Client-Daemon workflow to run faster scripts in Julia
Huginn - Create agents that monitor and act on your behalf. Your agents are standing by!
JET.jl - An experimental code analyzer for Julia. No need for additional type annotations.
gazpacho - 🥫 The simple, fast, and modern web scraping library
BinaryBuilder.jl - Binary Dependency Builder for Julia
PackageCompiler.jl - Compile your Julia Package