psaw
Python Pushshift.io API Wrapper (for comment/submission search) (by dmarx)
PrawWrapper
A wrapper around PRAW for easier unit testing (by Watchful1)
| | psaw | PrawWrapper |
|---|---|---|
| Mentions | 20 | 4 |
| Stars | 311 | 7 |
| Growth | - | - |
| Activity | 0.0 | 5.5 |
| Last commit | over 2 years ago | about 2 months ago |
| Language | Python | Python |
| License | BSD 2-clause "Simplified" License | MIT License |
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
psaw
Posts with mentions or reviews of psaw.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-12-14.
-
"Unable to connect to pushshift.io."
PSAW is deprecated (see https://github.com/dmarx/psaw); try PMAW instead. Be aware, though, of the well-documented, ongoing issues with Pushshift itself; some of the wrappers aren't working as expected. I suspect that once the API itself is functioning normally there may be further updates to the wrappers.
-
I've been getting Response status code 404 since Monday morning. Is this due to the system update? Should I be changing my script in some way to access the updated API?
This information is contained in the README on GitHub but is not on the Read the Docs page for some reason.
-
How to make the bot respond based on invocation and not subreddit
You could use https://github.com/dmarx/psaw to monitor keywords. I haven't personally used it but it is a popular method.
-
How to collect top submissions per day of a specific subreddit?
You can do the first part easily with PSAW if you use Python: it lets you get submissions from Pushshift and then updates them with the current data from the Reddit API. Then you would have to sort them, which is also fairly easy in Python.
-
PRAW - How do I get more responses
Most of PRAW's methods have a limit argument, usually with a default of 100. Set it to None, which actually sets the limit to 1000. If you want more than 1000 items, you'd have to resort to other APIs, such as https://github.com/dmarx/psaw.
-
Most posts in search results are only showing a score of 1.
To get live scores or other metadata, you should incorporate access to the Reddit API into your workflow. One easy way to do this is to use the third-party Pushshift wrapper called PSAW. See the note about setting r = praw.Reddit(...) and api = PushshiftAPI(r).
-
[OC] Modelling /r/CryptoCurrency's Time Variant Subconcious Using Deep Learning!
PSAW: https://github.com/dmarx/psaw
-
Removing deleted/archived posts
Try using PSAW. That will query Pushshift first and copy over updated data from Reddit.
-
PSAW user question
Try without the `asc` sort parameter. From the source code we can see that it can cause issues: https://github.com/dmarx/psaw/blob/master/psaw/PushshiftAPI.py#L162-L164
-
Question/Help - Getting data about user flairs on r/Hololive
I changed the way I collect data again. It is now adapted from https://deepnote.com/@deepnote/Mining-and-Exploring-Reddit-Data-using-Python-rfZ7TRRAT2unpCqU6egaKA and uses PSAW (Python Pushshift.io API Wrapper) (https://github.com/dmarx/psaw).
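For the question above about collecting top submissions per day, the sorting step that the answer calls "fairly easy" might look like the following. This is only a minimal sketch: it assumes the submissions have already been fetched (for example through a Pushshift wrapper) and reduced to plain dicts with `created_utc` and `score` keys. The function name `top_submissions_per_day` and the sample data are hypothetical and not part of PSAW.

```python
from collections import defaultdict
from datetime import datetime, timezone


def top_submissions_per_day(submissions, n=3):
    """Group submissions by UTC calendar day and keep the n highest-scoring per day.

    Hypothetical helper: `submissions` is assumed to be an iterable of dicts
    with `created_utc` (epoch seconds) and `score` keys.
    """
    by_day = defaultdict(list)
    for sub in submissions:
        day = datetime.fromtimestamp(sub["created_utc"], tz=timezone.utc).date()
        by_day[day].append(sub)
    # Sort days chronologically, and each day's submissions by descending score.
    return {
        day: sorted(subs, key=lambda s: s["score"], reverse=True)[:n]
        for day, subs in sorted(by_day.items())
    }


# Made-up sample data standing in for already-fetched submissions.
sample = [
    {"id": "a1", "created_utc": 1609459200, "score": 10},  # 2021-01-01 00:00 UTC
    {"id": "a2", "created_utc": 1609462800, "score": 50},  # 2021-01-01 01:00 UTC
    {"id": "b1", "created_utc": 1609545600, "score": 7},   # 2021-01-02 00:00 UTC
]
top = top_submissions_per_day(sample, n=1)
```

Grouping by UTC date keeps the day boundaries independent of the machine's local time zone; a different boundary would need a different `tz` argument.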
PrawWrapper
Posts with mentions or reviews of PrawWrapper.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-12-14.
-
I've been getting Response status code 404 since Monday morning. Is this due to the system update? Should I be changing my script in some way to access the updated API?
It's also worth mentioning that u/RemindMeBot is currently operational, and it relies on Pushshift via a custom PRAW wrapper. That PRAW wrapper has its own Pushshift client object. You could try figuring out how to use that, although there's no documentation for it.
-
Unit tests for praw?
The wrapper is here, the test version is here and some example tests using it are here. I also do the same thing with my sqlite database store.
-
Pushshift is back up
I use this function in u/RemindMeBot to get the current lag. It's abstracted a bunch since I set it up to also work with the Pushshift beta API, but if you follow the logic it's not all that complicated. Basically, just get the latest comment and check how old it is.
-
Scan for new arrivals to a subreddits 'top' category
There are a couple of approaches here, but you've got the right general idea. The first thing to do is use a "queue", which is like a store of ids, but it automatically kicks out old ones. I use an implementation here; you can look further down that file for places where it gets used.
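The "queue" idea in the last answer, a store of ids that automatically kicks out old ones, can be sketched in a few lines of Python. This is an illustrative sketch only, not the implementation the poster links to: the class name `SeenQueue`, its methods, and the example ids are all assumptions made for this example.

```python
from collections import deque


class SeenQueue:
    """Remember the most recent `max_size` ids and forget older ones.

    Hypothetical helper for illustration; not taken from PrawWrapper.
    """

    def __init__(self, max_size):
        self._order = deque()    # ids in insertion order, oldest on the left
        self._members = set()    # same ids, for fast membership checks
        self._max_size = max_size

    def add(self, item_id):
        """Return True if item_id is new, False if it is already remembered."""
        if item_id in self._members:
            return False
        self._order.append(item_id)
        self._members.add(item_id)
        if len(self._order) > self._max_size:
            # Evict the oldest id so the store never grows past max_size.
            oldest = self._order.popleft()
            self._members.discard(oldest)
        return True

    def __contains__(self, item_id):
        return item_id in self._members


# Usage: only ids that return True from add() need to be processed.
seen = SeenQueue(max_size=2)
results = [seen.add(x) for x in ["t3_a", "t3_b", "t3_a", "t3_c"]]
```

Pairing a deque with a set keeps both eviction and lookup cheap. A bot scanning a subreddit's 'top' listing could call `add()` on each submission id and act only when it returns True.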
What are some alternatives?
When comparing psaw and PrawWrapper you can also consider the following projects:
Pushshift API - Pushshift API
RemindMeBot - u/RemindMeBot on reddit
Removeddit - View deleted stuff from reddit
PRAW - PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
huggingface_hub - The official Python client for the Huggingface Hub.
SubredditDownloader - This Python tool allows downloading all submissions from a subreddit using Pushshift (API/Files) and the Reddit API
reddit-flair-popularity