TWINT
gallery-dl
Our great sponsors
TWINT | gallery-dl | |
---|---|---|
77 | 187 | |
13,272 | - | |
- | - | |
0.0 | - | |
almost 2 years ago | - | |
Python | ||
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
TWINT
-
Twitter will be purging accounts with no activity for several years soon. We need to archive as many as we can. Any ideas on Methods
twint is a project that can scrape twitter data via the webpages rather than the twitter API, which means that it can get more than the last 3200 tweets of an account. Unfortunately it seems that the repo was archived and is no longer in development, so I'm not sure if it even still works. It's also a bit heavy on dependencies and is written in Python, neither of which make it easier to install and use.
- How Do I Use Twint?
- NYC's transport authority will no longer post service alerts on Twitter
-
New OSINT tool
The tool doesn't work anymore since Twitter changed its APIs, but a good example is twint. Most people in OSINT are not highly technical and don't know their way around a CLI. On the other hand, a CLI tool is one of the quickest, lowest (dev) cost ways to release a tool to the public, and many developers who build tools for the OSINT community do so for free (open source).
- Show HN: Twitter API Reverse Engineered
-
What’s currently the best method to archive a twitter account?
You can try twint which is extensive and should be able to do that. Another is via this twitter downloader but might require multiple runs depending on what you want to archive.
-
Gbf.life will be gone at the end of April
They do have examples that don't specify a username such as number 3 on this page or this one on the main page: "twint -g="48.880048,2.385939,1km" -o file.csv --csv - Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file."
- Do I have to pay now for the Twitter API if I want to use it for data analysis?
-
Twitter’s $42,000-per-Month API Prices Out Nearly Everyone | Tiers will start at $500,000 a year for access to 0.3 percent of the company’s tweets. Researchers say that’s too much for too little data
This will motivate researchers to web scrape to circumvent these restrictions. Twint can scrape tweets and it supports proxies. It can also be multi threaded. A huge hassle and it's prone to breaking when the site changes.
-
Basically the current state of granblue
The comment I saw said they used this: https://github.com/twintproject/twint
gallery-dl
- Open Source Instagram Scraper?
-
How to download all tweets of other accounts?
If you want to grab all the images and videos take a look at gallery-dl, found here https://github.com/mikf/gallery-dl
-
Can someone help me with gallery-dl (i am halfway there)
My question is about the docs. Here is the link to gallery-dl on github. from what i see it's kept up-to-date. I'm trying to download a post from ig through any method. Also thinking of creating a configuration-file if necessary. My OS: windows 10, i have worked with a cli before but on linux and my knowledge is very superficial. I have managed to use this tool on other websites like pinterest but can't think of any solutions for instagram as of now. The errors i get are about using the authentication method and when trying to set the cookies-from-browser option.
-
Gothub: Alternative front-end for GitHub written with Go
I could set up a redirect to the '/raw/' pages but then the syntax highlighting is gone.
The same page is perfectly viewable over plain html on gothub[2] though.
Github also seems to be hiding their "Assets" (binaries et al) on the "/releases" page for some projects behind javascript(especially older versions).[3] Something else that wasn't the case about ~1.5 years ago.
Would be great if gothub could unshackle the links to those as well[4], but that appears to not be working at the moment[5] .
This project appears to be a more performant(measurably so), more privacy friendly(as Microsoft won't have a record of your interest in certain projects) alternative front-end for "non logged in" github users.
I like it, but it still needs work.
[1] https://github.com/mackyle/sqlite/blob/18cf47156abe94255ae14...
[2] https://gh.bloatcat.tk/mackyle/sqlite/blob/18cf47156abe94255...
[3] https://github.com/mikf/gallery-dl/releases
-
I don't publish to Instagram but I want to follow the posts of people that are only on that network. What are my options?
since it's primarily a picture/video platform, you can use gallery-dl to scrape the feeds you want to follow if you provide authentication, and you should be able to write the captions and stuff to metadata files, but not sure how you'd display it in a useful way.
-
How to download TikTok slideshow pics?
you could use gallery-dl to do this as well.
-
If I have a subscription to a paid adult site that allows subscribers to download all galleries and videos at full resolution, etc, what's the best way to rip the entire site? Any thoughts appreciated. Thanks.
gallery-dl (CLI based) focuses on images.
-
Gallery-dl (custom folder configs, reddit "galleries" and archive-format)
Maybe there is a way to set specific configs only for reddit gallery images, but I'm not sure how it works. Gallery-dl's documentation did not help me, and I also tried to find information from the source code.
-
Thumbnail
I use https://github.com/mikf/gallery-dl
-
What is the best way to get images for a dataset
You can try image scraping with something like gallery-dl or grabber. I use grabber for stuff off of booru style sites (anime weeb shit), and grabber for everything else. The challenge with these tools is:
What are some alternatives?
snscrape - A social networking service scraper in Python
instaloader - Download pictures (or videos) along with their captions and other metadata from Instagram.
Scweet - A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers, user info, images...
bulk-downloader-for-reddit - Downloads and archives content from reddit
newspaper - newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
RedditDownloader - Scrapes Reddit to download media of your choice.
twitterscraper - Scrape Twitter for Tweets
imgbrd-grabber - Very customizable imageboard/booru downloader with powerful filenaming features.
trafilatura - Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
twitter_scraping - Grab all a user's tweets (and get past 3200 limit)
htmldate - Fast and robust date extraction from web pages, with Python or on the command-line
hakuneko - Manga & Anime Downloader for Linux, Windows & MacOS