Top 15 Kiwix Open-Source Projects
-
wikipedia-mirror
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
-
good-karma-kit
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
-
kiwix-js-pwa
Kiwix JS Offline Browser implemented as a Progressive Web App (PWA), and packaged as Electron, NWJS and UWP apps for Windows and Linux.
Thanks for checking that. So it seems we have an issue with UTF-8 search for multi-byte character sets. I've opened this issue on GitHub to investigate further: Search appears to be broken with Chinese characters (possibly all UTF-8 multi-byte characters).
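The cause of that bug is not stated here, so the following is only a minimal sketch of why multi-byte characters commonly trip up search code: character counts and byte counts diverge, and slicing at a byte offset can cut a character in half. The sample query and the `safe_prefix` helper are hypothetical and are not taken from the Kiwix code base.

```python
# Illustration only: how UTF-8 byte offsets differ from character offsets.
query = "维基百科"  # sample query, "Wikipedia" in Chinese
encoded = query.encode("utf-8")

char_count = len(query)    # number of Unicode code points
byte_count = len(encoded)  # number of UTF-8 bytes (3 per character here)
print(char_count, byte_count)

# Cutting the byte string at an arbitrary offset can split a character.
try:
    encoded[:4].decode("utf-8")
    split_ok = True
except UnicodeDecodeError:
    split_ok = False
print("decoding a 4-byte prefix succeeded:", split_ok)


def safe_prefix(data: bytes, max_bytes: int) -> str:
    """Hypothetical helper: decode a byte prefix, dropping any partial character."""
    return data[:max_bytes].decode("utf-8", errors="ignore")


print(safe_prefix(encoded, 4))
```

Code that indexes, truncates, or compares text by byte position has to respect these character boundaries, which is one place a search bug limited to multi-byte input could plausibly come from.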
I meant the Kiwix dump (https://download.kiwix.org/zim/wikipedia_en_all_nopic.zim – careful, 60GB!).
At first glance, the Wikimedia XML dump does not look substantially different from what Kiwix/ZIM does with compressed HTML: They're both compressed (bz2 for the Wikimedia dump, zstd or LZMA for Kiwix/ZIM), and both compress multiple files at once, so inter-file redundancy should hopefully be significantly reduced.
HTML seems a bit more verbose than the MediaWiki syntax (plus the XML header for each article), but I'd be surprised if that actually accounted for a 3x difference in size.
Then again, Kiwix seems to have experimented with shared dictionary brotli compression, which supposedly yields a >2x improvement: https://github.com/openzim/libzim/issues/144
I wonder if their current zstd implementation also uses shared dictionaries. If not, that might just be the reason: If ZIM compression chunks are much smaller than the bz2 streams of the Wikimedia dumps, there would still be a lot of redundancy between chunks.
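To make the chunk-size argument concrete, here is a minimal sketch using Python's standard `zlib` module as a stand-in for zstd or brotli (an assumption made so that the example needs no third-party packages). The "articles" are synthetic and the template, sizes, and names are hypothetical; the sketch says nothing about how libzim actually lays out its clusters.

```python
import zlib

# Synthetic "articles": mostly shared HTML boilerplate, very little unique text.
TEMPLATE = (
    '<html><head><meta charset="utf-8"><title>{title}</title></head>'
    '<body><div class="mw-parser-output"><h1>{title}</h1>'
    "<p>{body}</p></div></body></html>"
)
articles = [
    TEMPLATE.format(title=f"Article {i}", body=f"Unique text number {i}.").encode("utf-8")
    for i in range(200)
]
# Preset dictionary: the boilerplate that every article shares.
shared_dict = TEMPLATE.encode("utf-8")


def compress(data: bytes, zdict: bytes = b"") -> bytes:
    """Deflate `data`, optionally primed with a preset (shared) dictionary."""
    if zdict:
        comp = zlib.compressobj(level=9, zdict=zdict)
    else:
        comp = zlib.compressobj(level=9)
    return comp.compress(data) + comp.flush()


# 1. Every article is its own tiny chunk: redundancy between chunks is lost.
independent_size = sum(len(compress(a)) for a in articles)
# 2. One big chunk: the compressor sees the repetition across articles.
solid_size = len(compress(b"".join(articles)))
# 3. Tiny chunks again, but each one is primed with the shared dictionary.
dict_size = sum(len(compress(a, shared_dict)) for a in articles)

print("independent chunks:", independent_size)
print("single large chunk:", solid_size)
print("chunks + shared dictionary:", dict_size)

# A reader needs the same dictionary to decompress a chunk.
decomp = zlib.decompressobj(zdict=shared_dict)
restored = decomp.decompress(compress(articles[0], shared_dict))
```

On this toy data both the single large chunk and the dictionary-primed small chunks come out far smaller than the independently compressed small chunks, which is the effect speculated about above. Real Wikipedia HTML is much less uniform, so the actual gain would have to be measured.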
WikiMed by Kiwix is now at v2.7.4. The packages for 64-bit and 32-bit Linux and Windows contain the October 2023 WikiMed English ZIM (mdwiki_en_all_maxi_2023-10), together with the changes in the CHANGELOG. The Electron app uses Electron v22.3.25.
OK, I found this in the CHANGELOG:
Well here is the neat part: you don't.
Kiwix related posts
-
Installing hotspot installer on Windows
-
kiwix android3.8.1 cannt to search
-
Cant add Zim file to app.
-
WikiReader
-
[Release] WikiMed by Kiwix (Linux/Windows) v2.7.4
-
How do I salvage old screens for my rasp pi
-
Best Android E-Ink tablet for Kiwix?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 3 May 2024
Index
What are some of the best open-source Kiwix projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | kiwix-android | 798 |
2 | kiwix-apple | 431 |
3 | kiwix-tools | 380 |
4 | wikipedia-mirror | 324 |
5 | good-karma-kit | 295 |
6 | kiwix-js | 268 |
7 | libzim | 158 |
8 | kiwix-js-pwa | 148 |
9 | libkiwix | 108 |
10 | kiwix-hotspot | 69 |
11 | kiwix-zim-updater | 66 |
12 | ifixit | 23 |
13 | kiwings | 12 |
14 | anki-zim-reader | 7 |
15 | base-image | 3 |