psych-verbs
world-languages
Our great sponsors
psych-verbs | world-languages | |
---|---|---|
1 | 2 | |
0 | 0 | |
- | - | |
0.0 | 0.0 | |
almost 3 years ago | about 3 years ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
psych-verbs
-
4 tips for creating an impressive data science portfolio on GitHub
For example, for my research project on psych-verbs I wrote a paper-like README targeted at academics/fellow linguists, whereas for my movie recommender system I wrote an informal short description and included a screencast, aimed at a general audience.
world-languages
-
4 tips for creating an impressive data science portfolio on GitHub
So instead of investing more time on these datasets, pick a new one of your own interest, apply different models and answer questions that you'd find insightful. I personally focused on projects that reflect my interest in Linguistics and NLP – you can explore data related to your experience or the industry you'd like to work in.
-
Data analysis of endangered languages with pandas
This exploratory analysis is only a starting point, there are many other questions you can explore from this dataset. For example, find what dialects are critically endangered, what is the geographic distribution of endangered languages, or maybe analyse and visualise the data with other libraries than pandas and matplotlib. Have a look at my Jupyter notebook and play around with the data!
What are some alternatives?
Fake-News-Detection - To combat the spread of fake news, it’s critical to determine the information’s legitimacy, which this Data Science project can help with. To do so, Python can be used, and a model is created using TfidfVectorizer.LogisticRegression model is used to train and test the data ,numpy,pandas and some other packages are used in this project.
apartment_recommender_streamlit_app - Streamlit App that recommends apartments in Seattle using the Airbnb kaggle dataset: https://www.kaggle.com/code/rdaldian/airbnb-content-based-recommendation-system/data?select=listings.csv
speech-emotion-recognition - A program that uses neural networks to detect emotions from pre-recorded and real-time speech
dtreeviz - A python library for decision tree visualization and model interpretation.
cheatsheets - Official Matplotlib cheat sheets
Empirical_Study_of_Ensemble_Learning_Methods - Training ensemble machine learning classifiers, with flexible templates for repeated cross-validation and parameter tuning
open_data_covid_analysis - Analysing Covid19 using publicly available datasets
linear-tree - A python library to build Model Trees with Linear Models at the leaves.
MouseView.js - Attentional mouse tracking. Alternative to online eye tracking. Eye tracking without the eyes!
movie-recommender - Movie recommender system based on Non-Negative Matrix Factorization and Singular Value Decomposition, with a Flask web interface
social-perception - Studying sociopolitical attitudes and moving the human perspective using psychographic and sociodemographic data from the European Social Survey.