wikipedia_abuse_checker
mlcourse.ai
wikipedia_abuse_checker | mlcourse.ai | |
---|---|---|
2 | 85 | |
3 | 9,400 | |
- | - | |
7.8 | 3.4 | |
8 days ago | 4 months ago | |
Python | Python | |
- | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wikipedia_abuse_checker
-
Which wikipedia pages in India were abused the most in 2021?
The code for the project is available at https://github.com/shijithpk/wikipedia_abuse_checker.
-
Which Wikipedia pages in India are abused the most?
Hi, my name is Shijith, and I'm a freelance data journalist from India (Worked previously at Hindustan Times and IndiaSpend).
Just posting a data story I did recently about wikipedia abuse in India. Such abuse is an old problem, but it's getting more media attention with users distorting facts on pages about the Delhi riots or farmer protests. Sometimes users engage in straight out vandalism where they delete whole sections from a page.
I tried to determine which wikipedia pages faced the most abuse this year, and also introduce a twitter account that allows people to track wikipedia abuse weekly.
This is the twitter account for tracking wikipedia abuse every week: http://twitter.com/abuse_checker
And here's the python code I used for the project: https://github.com/shijithpk/wikipedia_abuse_checker
(Am in the process of re-working the code. Right now it's querying the wikipedia API every week for edit histories of over 150k articles, and the whole run is taking 2 days now. Discovered an API endpoint for recent changes that should make things more efficient.)
Have any questions or feedback, do let me know below!
mlcourse.ai
What are some alternatives?
attractors - Package for simulation and visualization of strange attractors.
napari - napari: a fast, interactive, multi-dimensional image viewer for python
concrete-numpy - Concrete-Numpy: A library to turn programs into their homomorphic equivalent.
GreyNSights - Privacy-Preserving Data Analysis using Pandas
hiitpi - A workout trainer Dash/Flask app that helps track your HIIT workouts by analyzing real-time video streaming from your sweet Pi using machine learning and Edge TPU..
quaternion - Add built-in support for quaternions to numpy
Grafana - The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
WaveNCC - An app to compute the normalization coefficients of a given set of orthogonal 1D complex wave functions.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
julia - The Julia Programming Language
open-data-anonymizer - Python Data Anonymization & Masking Library For Data Science Tasks
H2O - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.