Preferred Programming Languages for Web Scraping

This page summarizes the projects mentioned and recommended in the original post on /r/programming

  • mojo

    Mojolicious - Perl real-time web framework (by mojolicious)

    I'm a bit biased, but I like Perl's Mojolicious. It's not that it's Perl so much as the particular organization of its framework that makes it really easy to see what's going on (where Python's requests, for example, isn't so forthcoming). BeautifulSoup is nice, but I really like that Mojolicious has CSS4 selectors (including some of the experimental things). Since Mojolicious provides the complete environment, everything cooperates nicely (where another Perl framework, LWP, isn't so well integrated).

  • colly

    Elegant Scraper and Crawler Framework for Golang

    In addition, the article says that "Also, C++ is not good for creating web crawlers, is programming language is great for simple web scraping", but in reality, when you need a fast crawler, C++ will beat languages like Ruby and Python on performance. The same goes for Go, which will outperform most of the languages covered in the article and has a number of very good libraries such as http://go-colly.org/, in addition to https://pkg.go.dev/net/http, which is part of the standard library.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
