goskyr
soup
goskyr | soup | |
---|---|---|
2 | 4 | |
32 | 2,128 | |
- | - | |
8.7 | 0.0 | |
1 day ago | 6 months ago | |
Go | Go | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
goskyr
-
No code command line webscraper
I am currently building a webscraper, called goskyr, that can be run from the command line and is supposed to be easily configurable. So instead of having to write code to scrape a website you'd just write a configuration snippet and run the scraper. I realize that there are a number of gui based scraping services that make it extremely easy to setup a scraping process for any website, so for people having no coding experience whatsoever that would probably be the easiest solution. I'm trying to come close to those gui based solutions in terms of functionality by providing a 'smart' way of finding potentially interesting data/fields and letting the user select a subset in a terminal based ui. Also date extraction & parsing and the newly added machine learning capability is probably worth mentioning. Still, those other, gui based solutions are really awesome, eg octoparse or scrapestorm.
-
Crowdsourced concert scraping project
I am currently working on a configurable command line webscraper, called goskyr and my first use case is collecting as much concert data as possible for this website idea I had, croncert.ch I am hoping that people other than me are willing to contribute to the scraper configuration file in this repository, https://github.com/jakopako/croncert-config, which also contains a github action to regularly run the scraper. What do you think? Could this work? How should I spread the word?
soup
-
Beautiful Soup: We called him Tortoise because he taught us
> Does anyone know if there as a good equivalent for Go
Yes: https://github.com/anaskhan96/soup
It works well.
-
Web Scraping in Golang
Web scraping is a handy tool to have in a data scientist's skill set. It can be useful in a variety of situations to gather data, such as when a website does not provide an API. We will be using this golang package github.com/anaskhan96/soup. It performs the same as beautifulsoup of python. This is the webpage we are going to be scraping.
-
How to generate excel using html + css on data?
Check this library: https://github.com/anaskhan96/soup to parse html
- Golang for Browser Automation
What are some alternatives?
croncert-config - configuration and github actions for concertcloud.live (fka croncert.ch), a website that shows you concerts in various cities
google-maps-scraper - scrape data data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, reviews number, latitude and longitude, reviews,email and more for each place
colly - Elegant Scraper and Crawler Framework for Golang
get-sauce - A command line program to download Hentai videos and images from multiple websites
fitter - New way for collect information from the API's/Websites
Looking for Maintainer - Selenium/Webdriver client for Go
lux - 👾 Fast and simple video download library and CLI tool written in Go
Rendora - dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites
go - The Go programming language
Geziyor - Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.
Ferret - Declarative web scraping