| NLP-CNN-Subreddit-Sorter-Heroku-App | newsemble
---|---|---
Mentions | 4 | 12
Stars | 1 | 44
Growth | - | -
Activity | 0.0 | 0.0
Last commit | about 2 years ago | almost 2 years ago
Language | Jupyter Notebook | Python
License | - | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
NLP-CNN-Subreddit-Sorter-Heroku-App
- The outputs of my Jupyter notebooks inside GitHub repos only show half of what they used to. Why did this happen, and how can I fix it? I am certain that the outputs used to show in full when viewed on GitHub, and I have not re-uploaded the notebooks to the repos since then.
- I created an app (CNNet, URL in description) that tells you which subreddit to post to based on your title. I used r/Python, r/learnmachinelearning, r/compsci and r/datascience.
  This app could be expanded to include other similar technical subreddits and serve as a way to decide where to crosspost, or for moderators to auto-flag posts that are off topic. Here is the repo: https://github.com/djthorne333/NLP-CNN-Subreddit-Sorter-Application, and a link to the app: https://datascience-reddit-post-sorter.herokuapp.com/. I think I found a way to derive from the dataset the optimal number of filters to use for each filter size in the CNN. There are still some typos to fix, but it is essentially done. Please let me know what you think and give me any advice, as I am trying to break into data science.
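The post does not describe the network itself, so the following is only a rough sketch of what "filters per filter size" usually means in a text CNN: each filter spans a fixed number of consecutive tokens, slides over the title, and contributes one max-pooled feature. The embedding dimension, the filter counts in `counts`, and every function name are hypothetical, and the weights are random rather than trained.

```python
import random

def conv_max_pool(embedded, filt):
    """Slide one filter over the token sequence and keep its strongest response."""
    width = len(filt)  # filter size = number of consecutive tokens covered
    if len(embedded) < width:
        return 0.0  # title shorter than the filter: no valid window
    responses = []
    for start in range(len(embedded) - width + 1):
        window = embedded[start:start + width]
        # Dot product between the window and the filter, followed by ReLU.
        score = sum(
            w * x
            for filt_row, token_row in zip(filt, window)
            for w, x in zip(filt_row, token_row)
        )
        responses.append(max(0.0, score))
    return max(responses)  # max-over-time pooling

def random_filters(counts, dim, seed=0):
    """Build counts[size] random filters for every filter size (untrained)."""
    rng = random.Random(seed)
    return {
        size: [
            [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(size)]
            for _ in range(n)
        ]
        for size, n in counts.items()
    }

def extract_features(embedded, filters_per_size):
    """One pooled feature per filter, concatenated across all filter sizes."""
    return [
        conv_max_pool(embedded, filt)
        for size in sorted(filters_per_size)
        for filt in filters_per_size[size]
    ]

# Hypothetical setup: a 6-token title with 4-dimensional embeddings.
EMBED_DIM = 4
rng = random.Random(1)
title = [[rng.uniform(-1, 1) for _ in range(EMBED_DIM)] for _ in range(6)]

# Assumed filter counts per filter size; the post says these were derived
# from the dataset but does not give the values or the method.
counts = {2: 3, 3: 2, 4: 1}
features = extract_features(title, random_filters(counts, EMBED_DIM))
print(len(features))  # 3 + 2 + 1 = 6 features
```

The length of the feature vector is the sum of the filter counts, which is why choosing a count per filter size directly sets how much capacity the classifier gives to short versus long phrases.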
newsemble
- [Project] Newsemble: An API to fetch current news data
- Newsemble: An API to fetch current news data
  I read through the documentation and tinkered around with it; great work! One recommendation I would make, particularly if you're hoping that this will be useful long-term for NLP, is not to delete previously scraped data. For instance, http://www.newsemble.ml/news only contains 129 results, which is nowhere near enough data for statistically meaningful NLP work.
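One way to follow the recommendation above is to append each scrape to a persistent archive and skip articles already stored, rather than replacing the previous results. The sketch below uses SQLite from the Python standard library; the table layout, field names, and example URLs are assumptions for illustration and do not reflect how Newsemble actually stores its data.

```python
import sqlite3

def open_archive(path=":memory:"):
    """Open (or create) the archive; pass a file path to persist across runs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles ("
        "url TEXT PRIMARY KEY, title TEXT, scraped_at TEXT)"
    )
    return conn

def count_articles(conn):
    return conn.execute("SELECT COUNT(*) FROM articles").fetchone()[0]

def store_batch(conn, articles):
    """Add newly scraped articles, keeping everything stored by earlier runs."""
    before = count_articles(conn)
    # The URL is the primary key, so re-scraped articles are silently skipped.
    conn.executemany(
        "INSERT OR IGNORE INTO articles (url, title, scraped_at) "
        "VALUES (:url, :title, :scraped_at)",
        articles,
    )
    conn.commit()
    return count_articles(conn) - before  # number of genuinely new rows

# Hypothetical scrape results from two consecutive runs.
run_1 = [
    {"url": "https://example.com/a", "title": "Story A", "scraped_at": "2021-01-01"},
    {"url": "https://example.com/b", "title": "Story B", "scraped_at": "2021-01-01"},
]
run_2 = [
    {"url": "https://example.com/b", "title": "Story B", "scraped_at": "2021-01-02"},
    {"url": "https://example.com/c", "title": "Story C", "scraped_at": "2021-01-02"},
]

archive = open_archive()
added_first = store_batch(archive, run_1)
added_second = store_batch(archive, run_2)
total = count_articles(archive)
print(added_first, added_second, total)  # 2 1 3
```

Because earlier rows are never deleted, the archive grows with every run, and an endpoint could still serve only the most recent articles by filtering on `scraped_at`.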
What are some alternatives?
CSGO-Pro-Gear-Performance-and-EDA - Modeling Professional (CS:GO) Gamer's Accuracy Performance Based on Gear and Settings, and Exploratory Data Analysis.
deemix-foobar2000 - Converts foobar2000 corrupted text list to Deezer album URL with Deezer API.
fellowship-prediction - Analyzes your GitHub Profile and presents you with a report on how likely you are to become the next MLH Fellow!
pycraigslist - Craigslist API wrapper
MLOps - End to End toy example of MLOps
screpe - High-level Python web scraping.
extractnet - A fork of Dragnet that also extracts author, headline, date, and keywords from context, with built-in metadata extraction, all in one package
steam-library-year-visualizer - Creates a bar chart of release year of games in library from a given steam profile link.
gutenberg-search - Web application to search the Gutenberg Project's 📖 database, made with Python Flask and MongoDB
python-client - Newsdata.io Official Python Client
random-memer - Returns random meme images scraped from Memedroid