Python Dump

Open-source Python projects categorized as Dump
Related topics: #Wikipedia #Python #XML #Export #Wiki

Python Dump Projects

  • GitHub repo wikiteam

    Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2020, WikiTeam has preserved more than 250,000 wikis.

    Project mention: [Censorship] Fandom Wiki (formerly Wikia) is deleting wikis on sexual topics November 24, such as the Monster Girl Encyclopedia wiki | reddit.com/r/KotakuInAction | 2021-11-18

    HTTrack is a good choice for keeping a local copy of the wiki that you can browse personally. If you ever need to back up a wiki in a format suitable for migrating to another wiki site, though, something like ArchiveTeam's WikiTeam tool is a better fit. It also has a built-in tool to upload the resulting backup to archive.org, as someone has done with the MGQ wiki here.

  • GitHub repo witokit

    A Python toolkit to generate a tokenized dump of Wikipedia for NLP

    Project mention: Download Wikipedia Text Dump? | reddit.com/r/LanguageTechnology | 2021-10-01

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-18.

Index

#  Project   Stars
1  wikiteam  453
2  witokit   7