aboutsummaryrefslogtreecommitdiff
path: root/backend/data/README.md
diff options
context:
space:
mode:
authorTerry Truong <terry06890@gmail.com>2022-05-17 10:41:12 +1000
committerTerry Truong <terry06890@gmail.com>2022-05-17 10:41:12 +1000
commit29940d51eb8b6b220d53940ecbc212cea78159ae (patch)
treebfa698c17525de7876b80ad37d8f7777b9505ba0 /backend/data/README.md
parenta840a16c6bd5aef906bd5cbce8293fc863cb5a5d (diff)
Improve enwiki description extraction
Adjust enwiki code to handle single dump file, and add scripts for 'convenient' page-content lookup.
Diffstat (limited to 'backend/data/README.md')
-rw-r--r--backend/data/README.md2
1 files changed, 1 insertions, 1 deletions
diff --git a/backend/data/README.md b/backend/data/README.md
index cb9cd42..576c70e 100644
--- a/backend/data/README.md
+++ b/backend/data/README.md
@@ -24,7 +24,7 @@ File Generation Process
- Supplementing with Wikipedia dump
1 Obtain data in enwiki/, as specified in it's README.
2 Run genEnwikiData.py, which adds to the 'descs' table, using data in
- enwiki/enwikiData.db, reducedTol/names.txt, and the 'nodes' table.
+ enwiki/enwikiData.db, and the 'nodes' table.
5 Reduced Tree Structure Data
1 Run genReducedTreeData.py, which adds 'r_nodes' and 'r_edges' tables to
data.db, using reducedTol/names.txt, and the 'nodes' and 'names' tables.