| author | Terry Truong <terry06890@gmail.com> | 2022-05-17 10:41:12 +1000 |
|---|---|---|
| committer | Terry Truong <terry06890@gmail.com> | 2022-05-17 10:41:12 +1000 |
| commit | 29940d51eb8b6b220d53940ecbc212cea78159ae (patch) | |
| tree | bfa698c17525de7876b80ad37d8f7777b9505ba0 /backend/data/README.md | |
| parent | a840a16c6bd5aef906bd5cbce8293fc863cb5a5d (diff) | |
Improve enwiki description extraction
Adjust enwiki code to handle single dump file, and add scripts for
'convenient' page-content lookup.
Diffstat (limited to 'backend/data/README.md')
| -rw-r--r-- | backend/data/README.md | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/backend/data/README.md b/backend/data/README.md
index cb9cd42..576c70e 100644
--- a/backend/data/README.md
+++ b/backend/data/README.md
@@ -24,7 +24,7 @@ File Generation Process
 - Supplementing with Wikipedia dump
 1 Obtain data in enwiki/, as specified in it's README.
 2 Run genEnwikiData.py, which adds to the 'descs' table, using data in
- enwiki/enwikiData.db, reducedTol/names.txt, and the 'nodes' table.
+ enwiki/enwikiData.db, and the 'nodes' table.
 5 Reduced Tree Structure Data
 1 Run genReducedTreeData.py, which adds 'r_nodes' and 'r_edges' tables to
 data.db, using reducedTol/names.txt, and the 'nodes' and 'names' tables.
