Top 7 Jupyter Notebook Webscraping Projects
-
datadoubleconfirm
Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
-
Data-extraction-and-text-analysis
The objective of this assignment is to extract article text from a URL and perform text analysis to compute variables.
-
NLP-CNN-Subreddit-Sorter-Heroku-App
End-to-end development of an application using a convolutional neural network that suggests to users/moderators which technical subreddit a post actually belongs to. Novel method to determine # of CNN filters. Custom Word2vec embeddings. The subreddits chosen are all technical and similar, and benefit users/moderators interested in data science and related fields. (Exploratory data analysis, feature engineering, custom word2vec embeddings, convolutional neural network, deployment via flask to
-
Goodreads-Review-Webscraping-and-Text-Analysis
Goodreads review data along with scraping functions and machine learning models for spam detection
-
Project mention: Control the browser using GPT-4 vision by AgentGPT team | news.ycombinator.com | 2023-11-12
Something like this solution: GitHub ipynb file
Index
What are some of the best open-source Webscraping projects in Jupyter Notebook? This list will help you:
# | Project | Stars |
---|---|---|
1 | tarsier | 486 |
2 | datadoubleconfirm | 55 |
3 | Data-extraction-and-text-analysis | 11 |
4 | newsapi | 1 |
5 | PokeData | 1 |
6 | NLP-CNN-Subreddit-Sorter-Heroku-App | 1 |
7 | Goodreads-Review-Webscraping-and-Text-Analysis | 0 |