From 811946498edc472d91e5ca8d41a4a0568e0d6e8f Mon Sep 17 00:00:00 2001 From: Terry Truong Date: Fri, 3 Jun 2022 11:03:25 +1000 Subject: Adjust enwiki dump-index-db and lookup script to include wiki-ids --- backend/data/enwiki/README.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) (limited to 'backend/data/enwiki/README.md') diff --git a/backend/data/enwiki/README.md b/backend/data/enwiki/README.md index cdabf50..c9615ef 100644 --- a/backend/data/enwiki/README.md +++ b/backend/data/enwiki/README.md @@ -17,7 +17,9 @@ Generated Files - dumpIndex.db
Holds data from the enwiki dump index file. Generated by genDumpIndexDb.py, and used by lookupPage.py to get content for a - given page title. + given page title.
+ Tables:
+ - offsets: title TEXT PRIMARY KEY, id INT UNIQUE, offset INT, next\_offset INT - enwikiData.db
Holds data obtained from the enwiki dump file, in 'pages', 'redirects', and 'descs' tables. Generated by genData.py, which uses -- cgit v1.2.3