fma
mac-miller-lyrics-dataset
Metric | fma | mac-miller-lyrics-dataset
---|---|---
Mentions | 1 | 2
Stars | 2,108 | 2
Growth | - | -
Activity | 0.0 | 1.8
Last commit | over 1 year ago | about 3 years ago
Language | Jupyter Notebook | Jupyter Notebook
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
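The page does not give the formula behind the activity number, so the following is only a minimal sketch of one way such a score could work. It assumes an exponential decay with a 90-day half-life for the commit weighting and a percentile rank scaled to 0-10 for the relative part. The function names `recency_weighted_score` and `activity_from_rank`, and the `half_life_days` parameter, are hypothetical.

```python
from bisect import bisect_left
from datetime import date


def recency_weighted_score(commit_dates, today, half_life_days=90.0):
    """Sum per-commit weights; a commit's weight halves every `half_life_days`.

    The exponential decay and the 90-day default are assumptions for
    illustration, not the tracker's actual weighting.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_from_rank(score, all_scores):
    """Map a raw score to a 0-10 scale by its percentile among all tracked scores."""
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # projects with a strictly lower score
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    today = date(2024, 1, 1)
    recent = recency_weighted_score([date(2023, 12, 31)], today)
    old = recency_weighted_score([date(2023, 1, 1)], today)
    print(recent > old)  # a recent commit outweighs an old one
    print(activity_from_rank(9, list(range(10))))
```

Under these assumptions, a project that outscores 90% of the tracked projects lands at 9.0, which matches the "top 10%" reading described above.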
fma
Analyzing music to determine subgenre?
This dataset seems worth looking into: https://github.com/mdeff/fma. I think you'll have a hard time identifying subgenres, since even people don't know what subgenre a song belongs to. It's a very subjective classification compared to distinguishing between main genres, e.g. rock, rap, and country. Also, from my work with the Spotify API, there are a lot of seemingly synonymous subgenres, which will make this task even more tedious (what is the difference between "pop dance" and "dance pop"?).
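One way to surface candidate synonyms like the pair mentioned in the post is to group labels that contain the same words in a different order. This is a rough sketch only: treating word order as irrelevant is an assumption, and `group_by_token_set` is a hypothetical helper, not part of the Spotify API or the fma repository. Any group it reports would still need a human decision on whether the labels really mean the same thing.

```python
from collections import defaultdict


def group_by_token_set(labels):
    """Group genre labels that use the same words regardless of order.

    Returns only groups with more than one label, i.e. candidate synonyms.
    """
    groups = defaultdict(list)
    for label in labels:
        key = frozenset(label.lower().split())  # order-insensitive key
        groups[key].append(label)
    return [names for names in groups.values() if len(names) > 1]


if __name__ == "__main__":
    print(group_by_token_set(["pop dance", "dance pop", "rock"]))
```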
mac-miller-lyrics-dataset
What are some alternatives?
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating anomaly detection algorithms.
covid-chestxray-dataset - We are building an open database of COVID-19 cases with chest X-ray or CT images.
toiletmap - API/UI server for the Great British Public Toilet Map
COVID-CT - COVID-CT-Dataset: A CT Scan Dataset about COVID-19
essentia - C++ library for audio and music analysis, description and synthesis, including Python bindings
raccoon_dataset - The dataset is used to train my own raccoon detector and I blogged about it on Medium
covid19za - Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
100daysofpractice-dataset - Data from Instagram posts with the hashtag #100daysofpractice.
clusterdata - cluster data collected from production clusters in Alibaba for cluster management research
datasets - 🎁 5,400,000+ Unsplash images made available for research and machine learning
TheVault - [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
cpi - Quickly adjust U.S. dollars for inflation using the Consumer Price Index (CPI)