English Wikipedia dump from May 2022 (wikipedia_en_all_maxi_2022-05, pre-ChatGPT)

wikipedia_en_all_maxi_2022-05.zim 95.20GB
Type: Dataset

Bibtex:
@article{,
title= {English Wikipedia dump from May 2022 (wikipedia_en_all_maxi_2022-05, pre-ChatGPT)},
journal= {},
author= {},
year= {},
url= {},
abstract= {English-language Wikipedia dump published on May 2022 by the Kiwix Project ( https://kiwix.org/ ).
It contains all articles, complete with extra data-files (images, audio present in the articles).
This snapshot was taken before the common spread of LLM technologies, and might provide a point of reference in the future.
It is also suitable for launching private/local/offline wikipedia mirrors.

To read ZIM files in a browser, launch a kiwix-serve server ( https://wiki.kiwix.org/wiki/Kiwix-serve ).
Alternatively, pick a suitable ZIM reader app for your platform ( https://kiwix.org/en/applications/ ).
Documentation for the file-format at https://wiki.openzim.org/wiki/ZIM_file_format .

Current kiwix dumps available at https://library.kiwix.org/ . Older kiwix dumps mirrored at https://archive.org/search?query=wikipedia_en_all_maxi .},
keywords= {Wikipedia, encyclopedia, wiki, wikimedia, zim, kiwix},
terms= {},
license= {CC BY-SA ( https://en.wikipedia.org/wiki/Wikipedia:Copyrights , https://creativecommons.org/licenses/by-sa/4.0/ )},
superseded= {}
}

Hosted by users
No stats to report yet.

Send Feedback Start
   0.000007
DB Connect
   0.000507
Lookup hash in DB
   0.000469
Get torrent details
   0.000151
Get torrent details, finished
   0.000237
Get authors
   0.000001
Select authors
   0.000170
Parse bibtex
   0.000112
Write header
   0.000244
get stars
   0.000122
home tab
   0.000171
render right panel
   0.000018
render ads
   0.000451
fetch current hosters
   0.000333
Start get stats
   0.002078
End get stats
   0.000002
related datasets
   0.005034
Done