diff options
| author | Terry Truong <terry06890@gmail.com> | 2022-05-26 13:22:36 +1000 |
|---|---|---|
| committer | Terry Truong <terry06890@gmail.com> | 2022-05-26 13:22:36 +1000 |
| commit | 07397961bfb113bd9c03883f2b24e6d287f989ca (patch) | |
| tree | a5a4fc18b54689497eae85f269e9467e1a0068aa /backend/data/README.md | |
| parent | 2d67e54dc91708eaf89eca9dca27cec126f7f465 (diff) | |
Add some enwiki redirect data as alt-names
Diffstat (limited to 'backend/data/README.md')
| -rw-r--r-- | backend/data/README.md | 17 |
1 files changed, 9 insertions, 8 deletions
diff --git a/backend/data/README.md b/backend/data/README.md index 8cfa960..d444e4f 100644 --- a/backend/data/README.md +++ b/backend/data/README.md @@ -21,14 +21,15 @@ File Generation Process which uses 'nodes', 'edges', 'eol_ids', and 'images', to associate nodes without images to child images. 4 Node Description Data - - Using DBpedia - 1 Obtain data in dbpedia/, as specified in it's README. - 2 Run genDbpData.py, which adds a 'descs' table to data.db, using - data in dbpedia/dbpData.db, dbpPickedLabels.txt, and the 'nodes' table. - - Supplementing with Wikipedia dump - 1 Obtain data in enwiki/, as specified in it's README. - 2 Run genEnwikiData.py, which adds to the 'descs' table, using data in - enwiki/enwikiData.db, and the 'nodes' table. + 1 Obtain data in dbpedia/, as specified in it's README. + 2 Run genDbpData.py, which adds a 'descs' table to data.db, using + data in dbpedia/dbpData.db, dbpPickedLabels.txt, and the 'nodes' table. +5 Supplementary Name/Description Data + 1 Obtain data in enwiki/, as specified in it's README. + 2 Run genEnwikiDescData.py, which adds to the 'descs' table, using data in + enwiki/enwikiData.db, and the 'nodes' table. + 3 Run genEnwikiNameData.py, which adds to the 'names' table, using data in + enwiki/enwikiData.db, and the 'names' and 'descs' tables. 5 Reduced Tree Structure Data 1 Run genReducedTreeData.py, which adds 'r_nodes' and 'r_edges' tables to data.db, using reducedTol/names.txt, and the 'nodes' and 'names' tables. |
