From 56369bccd977ac726bef70895883e79da4e1edd8 Mon Sep 17 00:00:00 2001 From: Terry Truong Date: Mon, 2 Jan 2023 14:51:53 +1100 Subject: Adjust wikidata event specifiers Do minor refactors: - Swap fmt=1 and fmt=2 in 'events' table - Make documentation consistently use BC and AD - import argparse at start of scripts --- backend/hist_data/enwiki/README.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) (limited to 'backend/hist_data/enwiki/README.md') diff --git a/backend/hist_data/enwiki/README.md b/backend/hist_data/enwiki/README.md index 262ebdb..76d33e5 100644 --- a/backend/hist_data/enwiki/README.md +++ b/backend/hist_data/enwiki/README.md @@ -33,11 +33,11 @@ This directory holds files obtained/derived from [English Wikipedia](https://en. # Image Files - `gen_img_data.py`
- Used to find infobox image names for page IDs, and store them into a database. + Finds infobox image names for page IDs, and stores them into a database. - `download_img_license_info.py`
- Used to download licensing metadata for image names, via wikipedia's online API, and store them into a database. + Downloads licensing metadata for image names, via wikipedia's online API, and stores them into a database. - `img_data.db`
- Used to hold metadata about infobox images for a set of page IDs. + Holds metadata about infobox images for a set of page IDs. Generated using `gen_img_data.py` and `download_img_license_info.py`.
Tables:
- `page_imgs`: `page_id INT PRIMARY KEY, title TEXT UNIQUE, img_name TEXT`
@@ -47,7 +47,7 @@ This directory holds files obtained/derived from [English Wikipedia](https://en.
Might lack some matches for `img_name` in `page_imgs`, due to licensing info unavailability. - `download_imgs.py`
- Used to download image files into imgs/. + Downloads image files into imgs/. # Description Files - `gen_desc_data.py`
-- cgit v1.2.3