| field | value | date |
|---|---|---|
| author | Terry Truong <terry06890@gmail.com> | 2022-10-04 23:58:08 +1100 |
| committer | Terry Truong <terry06890@gmail.com> | 2022-10-04 23:58:08 +1100 |
| commit | b1d4c709cb2793745e61d85c337514b9c6c85603 | |
| tree | 5e26d56ec90e810862d1aba8d0ce03abb0e8cc27 /backend/hist_data/README.md | |
| parent | 07b7ef49b07242014f288652980f5b15bfc087f1 | |
Add gen_picked_data.py
Add unit test
Update READMEs and .gitignore
Diffstat (limited to 'backend/hist_data/README.md')
| -rw-r--r-- | backend/hist_data/README.md | 5 |
1 files changed, 5 insertions, 0 deletions
diff --git a/backend/hist_data/README.md b/backend/hist_data/README.md
index 517259c..3fcb8df 100644
--- a/backend/hist_data/README.md
+++ b/backend/hist_data/README.md
@@ -68,3 +68,8 @@ Some of the scripts use third-party packages:
 1. In enwiki/, run `gen_desc_data.py`, which extracts page descriptions into a database.
 1. Run `gen_desc_data.py`, which adds the `descs` table, using data in enwiki/,
    and the `events` and `images` tables (only adds descriptions for events with images).
+
+## Optionally Add Extra Event Data
+1. Additional events can be described in `picked/events.json`, with images for them put
+   in `picked` (see the README for details).
+1. Can run `gen_picked_data.py` to add those described events to the database.
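
The added README section says that events described in `picked/events.json` can be added to the database by `gen_picked_data.py`. The commit shown here does not include that script or the JSON format, so the following is only a minimal sketch of the general idea, not the project's implementation. The function name `add_picked_events`, the JSON keys `title` and `start`, and the simplified `events` table schema are all assumptions made for illustration.

```python
import json
import sqlite3


def add_picked_events(db: sqlite3.Connection, events_json: str) -> int:
    """Insert events from a JSON array into a (hypothetical) `events` table.

    Assumed input shape: [{"title": str, "start": int}, ...].
    Returns the number of newly inserted rows.
    """
    # Hypothetical, simplified schema; the real `events` table may differ.
    db.execute(
        "CREATE TABLE IF NOT EXISTS events "
        "(id INTEGER PRIMARY KEY, title TEXT UNIQUE, start INTEGER)"
    )
    added = 0
    for event in json.loads(events_json):
        title, start = event["title"], event["start"]
        # Skip events already present, so re-running the import is harmless.
        exists = db.execute(
            "SELECT 1 FROM events WHERE title = ?", (title,)
        ).fetchone()
        if exists:
            continue
        db.execute(
            "INSERT INTO events (title, start) VALUES (?, ?)", (title, start)
        )
        added += 1
    db.commit()
    return added


if __name__ == "__main__":
    # Example data is invented for the demonstration.
    sample = json.dumps([
        {"title": "Example Event A", "start": 1900},
        {"title": "Example Event B", "start": 1950},
    ])
    conn = sqlite3.connect(":memory:")
    print(add_picked_events(conn, sample))
```

Checking for an existing title before inserting keeps the import idempotent, which seems a reasonable property for an optional step that may be run more than once.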
