awesome-public-real-time-datasets vs RedfinScraper
Metric | awesome-public-real-time-datasets | RedfinScraper
---|---|---
Mentions | 8 | 5
Stars | 366 | 54
Growth | 10.4% | -
Activity | 5.1 | 6.9
Last commit | 10 days ago | 10 months ago
Language | - | Python
License | Creative Commons Zero v1.0 Universal | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
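The page does not publish the formulas behind these numbers, so the following is only a rough sketch of how metrics of this shape could be computed. The exponential decay, the 30-day half-life, and all function names are assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def month_over_month_growth(stars_now, stars_month_ago):
    """Percentage change in stars relative to the count one month earlier."""
    if stars_month_ago <= 0:
        raise ValueError("need a positive baseline to compute growth")
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago


def recency_weighted_commits(commit_dates, today, half_life_days=30.0):
    """Sum of per-commit weights, where recent commits count more.

    Assumption: a commit loses half of its weight every half_life_days.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_score(raw, all_raw):
    """Map a raw score to a 0-10 scale by the share of tracked projects it beats."""
    below = sum(1 for other in all_raw if other < raw)
    return round(10.0 * below / len(all_raw), 1)


# Ten hypothetical tracked projects with raw scores 0.0 .. 9.0.
tracked = [float(n) for n in range(10)]
print(activity_score(9.0, tracked))  # 9.0
```

Under these assumptions, the project that outscores 9 of the 10 tracked projects gets 9.0, which matches the "top 10%" reading in the legend, and a project going from 100 to 110 stars in a month shows 10% growth.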
awesome-public-real-time-datasets
- List of publicly available datasets with real-time data
- FLaNK Stack Weekly for 20 Nov 2023
- Bytewax: Stream processing library built using Python and Rust
- Public Real-Time Datasets and Sources
- What are some good publicly available real-time data sources?
  Added for now - https://github.com/bytewax/awesome-public-real-time-datasets/commit/94ca4a3d40dc212690c6cdc22c107289b4268661
  I am attempting to source via the wisdom of the crowd here. I often find it hard to find good real-time data sources for learning about streaming, prototyping, or building hobby projects. I started researching and then created an "Awesome List" in a GitHub repo - https://github.com/bytewax/awesome-public-real-time-datasets.
- Ask HN: What are some public real-time data sources?
  I started an awesome list with real-time data sources here: https://github.com/bytewax/awesome-public-real-time-datasets. Have any datasets or data sources I should add to this list? Comment below or PRs welcome :).
RedfinScraper
- What are some good publicly available real-time data sources?
  Shameless plug for RedfinScraper
- Are there gov sites I can download/scrape real estate data from? Sale prices, property taxes etc?
  Yup, this seems like a good idea. There is a repo that scrapes Redfin using its unofficial API.
- Scrape Thousands of Records of Housing Data Using Python [Self-Promotion]
  So, here's an actual dataset of CA housing data I generated using the RedfinScraper library. Scraping these 47,000 records took just over 3 minutes.
- Scrape Thousands of Housing Records in Minutes! [Self-Promotion]
  RedfinScraper is a scalable Python library that leverages Redfin's unofficial Stingray API to quickly scrape thousands of housing records.
- Scrape Thousands of Housing Records in Minutes!
What are some alternatives?
datagen - Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.
screenshot-to-code - Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
owl-shop
superset - Apache Superset is a Data Visualization and Data Exploration Platform
eventsim - Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
mockingbird - Mockingbird is a mock streaming data generator
depthai-python - DepthAI Python Library
Scada-LTS - Scada-LTS is an Open Source, web-based, multi-platform solution for building your own SCADA (Supervisory Control and Data Acquisition) system.
torchgeo - TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
ai-exploits - A collection of real world AI/ML exploits for responsibly disclosed vulnerabilities
memq - MemQ is an efficient, scalable cloud native PubSub system