-
TWINT
Discontinued An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Twitter data tends to get gigantic, and coupled with the one-week-limit of their API it is tricky to acquire suitable historic data. The API actually allows searches further in the past, e.g. https://twitter.com/search?lang=en&q=(%23metoo)%20until%3A2020-01-01%20since%3A2019-01-01&src=typed_query%20until%3A2020-01-01%20since%3A2019-01-01&src=typed_query) But that will not give you the tweets in a form suitable for processing. Tools like TwitterScraper attempt to do this automatically, but Twitter has gotten more strict in its rate limits, and in my experience it basically does not work any more.
You need Twint. Get it here: https://github.com/twintproject/twint
Related posts
-
Twitter will be purging accounts with no activity for several years soon. We need to archive as many as we can. Any ideas on Methods
-
How Do I Use Twint?
-
NYC's transport authority will no longer post service alerts on Twitter
-
What’s currently the best method to archive a twitter account?
-
Do I have to pay now for the Twitter API if I want to use it for data analysis?