diff options
| author | Terry Truong <terry06890@gmail.com> | 2023-01-13 18:56:04 +1100 |
|---|---|---|
| committer | Terry Truong <terry06890@gmail.com> | 2023-01-13 18:56:04 +1100 |
| commit | 3029a2f866b240856518cfa944b9e00ef37455db (patch) | |
| tree | 0c9dcfaa065d346060de69779523aef66d62d6d4 /backend/hist_data/README.md | |
| parent | d335df85ea815d10df62a126067482e178e3670a (diff) | |
Re-enable images
Diffstat (limited to 'backend/hist_data/README.md')
| -rw-r--r-- | backend/hist_data/README.md | 5 |
1 files changed, 4 insertions, 1 deletions
diff --git a/backend/hist_data/README.md b/backend/hist_data/README.md index 4cc7103..2a9475a 100644 --- a/backend/hist_data/README.md +++ b/backend/hist_data/README.md @@ -68,10 +68,13 @@ Some of the scripts use third-party packages: USER_AGENT variable applies here as well. 1. Run `gen_imgs.py`, which creates resized/cropped images in img/, from images in enwiki/imgs/. Adds the `imgs` and `event_imgs` tables. <br> - The output images may need additional manual changes: + The output images might need additional manual changes: - An input image might have no output produced, possibly due to data incompatibilities, memory limits, etc. - An input x.gif might produce x-1.jpg, x-2.jpg, etc, instead of x.jpg. + - An input image might produce output with unexpected dimensions. + This seems to happen when the image is very large, and triggers a + decompression bomb warning. ## Generate Description Data 1. In enwiki/, run `gen_desc_data.py`, which extracts page descriptions into a database. |
