From 4cb5ec14bcfb2db4574c0b0b0d4d4aff59e24c8a Mon Sep 17 00:00:00 2001 From: Terry Truong Date: Wed, 18 Jan 2023 20:21:22 +1100 Subject: Adjust backend docs after another db regeneration --- backend/hist_data/README.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) (limited to 'backend/hist_data/README.md') diff --git a/backend/hist_data/README.md b/backend/hist_data/README.md index 50108e0..9fe2d0e 100644 --- a/backend/hist_data/README.md +++ b/backend/hist_data/README.md @@ -69,7 +69,9 @@ Some of the scripts use third-party packages: script variable to identify yourself to the online API (this is expected [best practice](https://www.mediawiki.org/wiki/API:Etiquette)). 1. In enwiki/, run `download_imgs.py`, which downloads images into enwiki/imgs/. Setting the - USER_AGENT variable applies here as well. + USER_AGENT variable applies here as well.
+ In some rare cases, the download won't produce an image file, but a text file containing + 'File not found: ...'. These can simply be deleted. 1. Run `gen_imgs.py`, which creates resized/cropped images in img/, from images in enwiki/imgs/. Adds the `imgs` and `event_imgs` tables.
The output images might need additional manual changes: -- cgit v1.2.3