diff options
| author | Terry Truong <terry06890@gmail.com> | 2022-06-08 12:34:57 +1000 |
|---|---|---|
| committer | Terry Truong <terry06890@gmail.com> | 2022-06-08 12:34:57 +1000 |
| commit | f8fa9ae3dd1571fa2912067b6eed010ea5d928e9 (patch) | |
| tree | df9c6a7fb8a0b1a47b9a971259d65c1bd414846d /backend/data/enwiki/README.md | |
| parent | 4ad3b444bb8f63c75be3bf3126598732b6b0416a (diff) | |
Update READMEs, refactor getEnwikiImgData.py
Diffstat (limited to 'backend/data/enwiki/README.md')
| -rw-r--r-- | backend/data/enwiki/README.md | 7 |
1 files changed, 7 insertions, 0 deletions
diff --git a/backend/data/enwiki/README.md b/backend/data/enwiki/README.md index c9615ef..ea97c9a 100644 --- a/backend/data/enwiki/README.md +++ b/backend/data/enwiki/README.md @@ -28,3 +28,10 @@ Generated Files - pages: id INT PRIMARY KEY, title TEXT UNIQUE - redirects: id INT PRIMARY KEY, target TEXT - descs: id INT PRIMARY KEY, desc TEXT +- enwikiImgs.db <br> + Holds infobox-images obtained for some set of wiki page-ids. + Generated by running getEnwikiImgData.py, which uses the enwiki dump + file and dumpIndex.db. <br> + Tables: <br> + - page\_imgs: page\_id INT PRIMAY KEY, img\_name TEXT + - imgs: name TEXT PRIMARY KEY, license TEXT, artist TEXT, credit TEXT, restrictions TEXT, url TEXT |
