crux
Crux offers a flexible plugin-based API & implementation to extract interesting information from Web pages. (by chimbori)
unfurl is not very different from your gist. It is slightly more comprehensive: it supports both the Open Graph Protocol and Twitter Cards, and falls back to reading HTML head tags. It is also extensible for websites that use JavaScript and can't be scraped, such as Twitter.
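To make the fallback order described above concrete, here is a minimal sketch in Python using only the standard library. It is not unfurl's or crux's actual code or API: the names `HeadMetadataParser` and `extract_preview` are hypothetical, and the exact precedence (Open Graph first, then Twitter Cards, then plain head tags) is an assumption based on the description in the comment.

```python
from html.parser import HTMLParser


class HeadMetadataParser(HTMLParser):
    """Collects <meta> tag contents and the <title> text from an HTML document."""

    def __init__(self):
        super().__init__()
        self.meta = {}        # lowercased property/name -> content (first occurrence wins)
        self.title = None     # text of the <title> element, if any
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            # Open Graph uses property="og:...", Twitter Cards use name="twitter:..."
            key = attrs.get("property") or attrs.get("name")
            content = attrs.get("content")
            if key and content and key.lower() not in self.meta:
                self.meta[key.lower()] = content
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title and data.strip():
            self.title = (self.title or "") + data.strip()


def extract_preview(html):
    """Hypothetical helper: returns title, description, and image for a link preview."""
    parser = HeadMetadataParser()
    parser.feed(html)

    def first(*keys):
        # Return the content of the first key that is present, else None.
        for key in keys:
            if key in parser.meta:
                return parser.meta[key]
        return None

    return {
        # Assumed precedence: Open Graph, then Twitter Cards, then plain head tags.
        "title": first("og:title", "twitter:title") or parser.title,
        "description": first("og:description", "twitter:description", "description"),
        "image": first("og:image", "twitter:image"),
    }
```

A page with no Open Graph tags still yields a usable preview, because each field falls through to the next source. Pages that render their metadata with JavaScript would need a site-specific extractor instead, which is the extensibility point the comment mentions.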
Nice, was looking for something like this. Currently using https://github.com/chimbori/crux
I actually don't lol. This is old code that I extracted out from Dank.