sotoki
codequestion
sotoki | codequestion | |
---|---|---|
4 | 15 | |
215 | 511 | |
0.9% | 1.0% | |
6.6 | 5.5 | |
7 days ago | 8 months ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sotoki
-
The Overflow Offline Project β Stack Overflow Blog
http://download.kiwix.org/zim/stack_exchange/
It's not clear to me why the data set shrank between 2019/3 and 2022/6; was something excluded? Compression improvements?
> stackoverflow.com_en_all_2019-02.zim 2019-03-12 19:53 134G
-
StackOverflow is again available for download and offline use via Kiwix
SO was really hard to manage with its 21 million questions. For this reason, all StackExchanges will update monthly while SO will be every quarter only (until we fix that memory leak: there is a ticket opened to try and fix it, please have a look!).
- Can someone pls upload the Layer 2 platform to another domain and prove this as debunked
-
Updated Stack Overflow zim file 2021-09-06
The Openzim/Kiwix folks are working on a rewrite of the scraper, sotoki , as it's all but unworkable. But there hasn't been a recent successful rerun yet. Back in June/July I managed to scrape together a process to get a working zim file until they finish and posted a version based off that DB dump.
codequestion
-
Introducing the Overflow Offline project
GitHub | Article
-
The Overflow Offline Project β Stack Overflow Blog
There was a recent HN Post for codequestion which builds an offline semantic index on the Stack Overflow dumps on archive.org - https://news.ycombinator.com/item?id=33110219
GitHub: https://github.com/neuml/codequestion
Article: https://medium.com/neuml/find-answers-with-codequestion-2-0-...
-
[P] Stack Overflow Semantic Search
Release Announcement - https://medium.com/neuml/find-answers-with-codequestion-2-0-50b2cfd8c8fe Release Notes - https://github.com/neuml/codequestion/releases/tag/v2.0.0 GitHub - https://github.com/neuml/codequestion
-
Semantic search of Stack Overflow with codequestion
Release Announcement - https://medium.com/neuml/find-answers-with-codequestion-2-0-50b2cfd8c8fe Release Notes - https://github.com/neuml/codequestion/releases/tag/v2.0.0
- Show HN: Semantic search of Stack Overflow with codequestion
What are some alternatives?
zim-plugin-instantsearch - Search as you type in Zim, in similar manner to OneNote Ctrl+E.
txtai - π‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
nautilus - Turns a collection of documents into a browsable ZIM file
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
ifixit - iFixit to ZIM scraper
tldrstory - π Semantic search for headlines and story text
kiwings - A better alternative to Kiwix for macOS
tika-python - Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.
youtube - Create a ZIM file from a Youtube channel/username/playlist
paperai - π π€ Semantic search and workflows for medical/scientific papers
freeCodeCamp - freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.