path: root/backend/hist_data/README.md
author Terry Truong <terry06890@gmail.com> 2022-10-04 23:58:08 +1100
committer Terry Truong <terry06890@gmail.com> 2022-10-04 23:58:08 +1100
commit b1d4c709cb2793745e61d85c337514b9c6c85603
tree 5e26d56ec90e810862d1aba8d0ce03abb0e8cc27 /backend/hist_data/README.md
parent 07b7ef49b07242014f288652980f5b15bfc087f1
Add gen_picked_data.py
Add unit test
Update READMEs and .gitignore
Diffstat (limited to 'backend/hist_data/README.md')
-rw-r--r-- backend/hist_data/README.md | 5
1 files changed, 5 insertions, 0 deletions
diff --git a/backend/hist_data/README.md b/backend/hist_data/README.md
index 517259c..3fcb8df 100644
--- a/backend/hist_data/README.md
+++ b/backend/hist_data/README.md
@@ -68,3 +68,8 @@ Some of the scripts use third-party packages:
 1. In enwiki/, run `gen_desc_data.py`, which extracts page descriptions into a database.
 1. Run `gen_desc_data.py`, which adds the `descs` table, using data in enwiki/,
 and the `events` and `images` tables (only adds descriptions for events with images).
+
+## Optionally Add Extra Event Data
+1. Additional events can be described in `picked/events.json`, with images for them put
+ in `picked` (see the README for details).
+1. Run `gen_picked_data.py` to add those described events to the database.
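
The added section describes reading event descriptions from `picked/events.json` and adding them to the database. The diff shows neither the JSON layout nor the table schema, so the following is only a minimal sketch of that kind of step. The field names `title` and `year`, the column layout of `events`, and the function `add_picked_events` are hypothetical and are not taken from `gen_picked_data.py`.

```python
import json
import sqlite3


def add_picked_events(events_json: str, db: sqlite3.Connection) -> int:
    """Insert events from a JSON array into an `events` table.

    The JSON fields and table columns here are assumptions for illustration.
    Returns the number of rows actually added.
    """
    events = json.loads(events_json)
    cur = db.cursor()
    # Hypothetical schema: the real `events` table may look different.
    cur.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "id INTEGER PRIMARY KEY, title TEXT UNIQUE, year INTEGER)"
    )
    before = cur.execute("SELECT COUNT(*) FROM events").fetchone()[0]
    for ev in events:
        # OR IGNORE skips events whose title is already present,
        # so re-running the step does not create duplicates.
        cur.execute(
            "INSERT OR IGNORE INTO events (title, year) VALUES (?, ?)",
            (ev["title"], ev["year"]),
        )
    db.commit()
    after = cur.execute("SELECT COUNT(*) FROM events").fetchone()[0]
    return after - before


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    sample = '[{"title": "Example event", "year": 1900}]'
    print(add_picked_events(sample, conn))
```

The sketch uses an in-memory database and an inline JSON string so it runs without touching any project files. A real run would presumably read `picked/events.json` and open the project's own database instead.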