aboutsummaryrefslogtreecommitdiff
path: root/backend/data/enwiki/README.md
diff options
context:
space:
mode:
authorTerry Truong <terry06890@gmail.com>2022-06-08 12:34:57 +1000
committerTerry Truong <terry06890@gmail.com>2022-06-08 12:34:57 +1000
commitf8fa9ae3dd1571fa2912067b6eed010ea5d928e9 (patch)
treedf9c6a7fb8a0b1a47b9a971259d65c1bd414846d /backend/data/enwiki/README.md
parent4ad3b444bb8f63c75be3bf3126598732b6b0416a (diff)
Update READMEs, refactor getEnwikiImgData.py
Diffstat (limited to 'backend/data/enwiki/README.md')
-rw-r--r--backend/data/enwiki/README.md7
1 files changed, 7 insertions, 0 deletions
diff --git a/backend/data/enwiki/README.md b/backend/data/enwiki/README.md
index c9615ef..ea97c9a 100644
--- a/backend/data/enwiki/README.md
+++ b/backend/data/enwiki/README.md
@@ -28,3 +28,10 @@ Generated Files
- pages: id INT PRIMARY KEY, title TEXT UNIQUE
- redirects: id INT PRIMARY KEY, target TEXT
- descs: id INT PRIMARY KEY, desc TEXT
+- enwikiImgs.db <br>
+ Holds infobox-images obtained for some set of wiki page-ids.
+ Generated by running getEnwikiImgData.py, which uses the enwiki dump
+ file and dumpIndex.db. <br>
+ Tables: <br>
+ - page\_imgs: page\_id INT PRIMAY KEY, img\_name TEXT
+ - imgs: name TEXT PRIMARY KEY, license TEXT, artist TEXT, credit TEXT, restrictions TEXT, url TEXT