From f8fa9ae3dd1571fa2912067b6eed010ea5d928e9 Mon Sep 17 00:00:00 2001 From: Terry Truong Date: Wed, 8 Jun 2022 12:34:57 +1000 Subject: Update READMEs, refactor getEnwikiImgData.py --- backend/data/enwiki/README.md | 7 +++++++ 1 file changed, 7 insertions(+) (limited to 'backend/data/enwiki/README.md') diff --git a/backend/data/enwiki/README.md b/backend/data/enwiki/README.md index c9615ef..ea97c9a 100644 --- a/backend/data/enwiki/README.md +++ b/backend/data/enwiki/README.md @@ -28,3 +28,10 @@ Generated Files - pages: id INT PRIMARY KEY, title TEXT UNIQUE - redirects: id INT PRIMARY KEY, target TEXT - descs: id INT PRIMARY KEY, desc TEXT +- enwikiImgs.db
+ Holds infobox-images obtained for some set of wiki page-ids. + Generated by running getEnwikiImgData.py, which uses the enwiki dump + file and dumpIndex.db.
+ Tables:
+ - page\_imgs: page\_id INT PRIMAY KEY, img\_name TEXT + - imgs: name TEXT PRIMARY KEY, license TEXT, artist TEXT, credit TEXT, restrictions TEXT, url TEXT -- cgit v1.2.3