English Wikipedia dump from May 2022 (wikipedia_en_all_maxi_2022-05, pre-ChatGPT)

wikipedia_en_all_maxi_2022-05.zim 95.20GB
Type: Dataset

Bibtex:
@article{,
title= {English Wikipedia dump from May 2022 (wikipedia_en_all_maxi_2022-05, pre-ChatGPT)},
journal= {},
author= {},
year= {},
url= {},
abstract= {English-language Wikipedia dump published on May 2022 by the Kiwix Project ( https://kiwix.org/ ).
It contains all articles, complete with extra data-files (images, audio present in the articles).
This snapshot was taken before the common spread of LLM technologies, and might provide a point of reference in the future.
It is also suitable for launching private/local/offline wikipedia mirrors.

To read ZIM files in a browser, launch a kiwix-serve server ( https://wiki.kiwix.org/wiki/Kiwix-serve ).
Alternatively, pick a suitable ZIM reader app for your platform ( https://kiwix.org/en/applications/ ).
Documentation for the file-format at https://wiki.openzim.org/wiki/ZIM_file_format .

Current kiwix dumps available at https://library.kiwix.org/ . Older kiwix dumps mirrored at https://archive.org/search?query=wikipedia_en_all_maxi .},
keywords= {Wikipedia, encyclopedia, wiki, wikimedia, zim, kiwix},
terms= {},
license= {CC BY-SA ( https://en.wikipedia.org/wiki/Wikipedia:Copyrights , https://creativecommons.org/licenses/by-sa/4.0/ )},
superseded= {}
}

Hosted by users

Send Feedback Start
   0.000006
DB Connect
   0.000490
Lookup hash in DB
   0.000408
Get torrent details
   0.000137
Get torrent details, finished
   0.000235
Get authors
   0.000001
Select authors
   0.000166
Parse bibtex
   0.000088
Write header
   0.000207
get stars
   0.000104
home tab
   0.000163
render right panel
   0.000007
render ads
   0.000428
fetch current hosters
   0.000278
related datasets
   0.004937
Done