Hi, my name is Shijith, and I'm a freelance data journalist from India (I previously worked at Hindustan Times and IndiaSpend).
I'm posting a data story I did recently about Wikipedia abuse in India. Such abuse is an old problem, but it's getting more media attention as users distort facts on pages about the Delhi riots or the farmer protests. Sometimes users engage in outright vandalism, deleting whole sections from a page.
I tried to determine which Wikipedia pages faced the most abuse this year, and I'm also introducing a Twitter account that lets people track Wikipedia abuse weekly.
This is the Twitter account for tracking Wikipedia abuse every week: http://twitter.com/abuse_checker
And here's the Python code I used for the project: https://github.com/shijithpk/wikipedia_abuse_checker
(I'm in the process of reworking the code. Right now it queries the Wikipedia API every week for the edit histories of over 150k articles, and a full run takes 2 days. I've discovered an API endpoint for recent changes that should make things more efficient.)
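To give a rough idea of what the recent-changes approach could look like, here is a minimal sketch. It is not the code in the repo. It assumes the MediaWiki Action API's `list=recentchanges` module on English Wikipedia, and the function names, the 5000-byte threshold and the User-Agent string are all hypothetical choices made for illustration. Instead of one history request per article, it pulls the week's edits in pages of up to 500 and keeps only those touching tracked titles.

```python
import json
import urllib.parse
import urllib.request

# Assumption: English Wikipedia's Action API endpoint.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_params(newest, oldest):
    """Query parameters for list=recentchanges (results come newest first)."""
    return {
        "action": "query",
        "format": "json",
        "list": "recentchanges",
        "rcstart": newest,      # later timestamp, e.g. "2021-03-08T00:00:00Z"
        "rcend": oldest,        # earlier timestamp
        "rcnamespace": "0",     # article namespace only
        "rctype": "edit",       # skip page creations and log entries
        "rcprop": "title|ids|sizes|timestamp|user|comment",
        "rclimit": "500",
    }


def large_removals(changes, tracked_titles, min_bytes_removed=5000):
    """Keep edits to tracked pages that shrank the page by a lot.

    The threshold is an arbitrary illustration, not a tested cutoff.
    """
    hits = []
    for change in changes:
        removed = change.get("oldlen", 0) - change.get("newlen", 0)
        if change.get("title") in tracked_titles and removed >= min_bytes_removed:
            hits.append(change)
    return hits


def fetch_recent_changes(newest, oldest):
    """Yield every matching change, following the API's continuation tokens."""
    params = build_params(newest, oldest)
    # Hypothetical User-Agent; replace with real contact details.
    headers = {"User-Agent": "abuse-checker-sketch/0.1 (hypothetical contact)"}
    while True:
        url = API_URL + "?" + urllib.parse.urlencode(params)
        request = urllib.request.Request(url, headers=headers)
        with urllib.request.urlopen(request) as response:
            data = json.load(response)
        yield from data.get("query", {}).get("recentchanges", [])
        if "continue" not in data:
            break
        params.update(data["continue"])
```

A weekly run would then pass the output of `fetch_recent_changes(...)` to `large_removals(...)` along with the set of tracked article titles. Size alone won't catch subtle fact distortion, so this would only be a first filter for the section-deletion kind of vandalism.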
If you have any questions or feedback, do let me know below!