aboutsummaryrefslogtreecommitdiff
path: root/backend/tol_data/dbpedia/README.md
blob: a708122c2c58cf1975b5282b3fd214ae4929fe9f (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
This directory holds files obtained/derived from [Dbpedia](https://www.dbpedia.org).

# Downloaded Files
-   `labels_lang=en.ttl.bz2` <br>
    Obtained via https://databus.dbpedia.org/dbpedia/collections/latest-core.
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/labels/2022.03.01/labels_lang=en.ttl.bz2>.
-   `page_lang=en_ids.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/page/2022.03.01/page_lang=en_ids.ttl.bz2>
-   `redirects_lang=en_transitive.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/redirects/2022.03.01/redirects_lang=en_transitive.ttl.bz2>.
-   `disambiguations_lang=en.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/disambiguations/2022.03.01/disambiguations_lang=en.ttl.bz2>.
-   `instance-types_lang=en_specific.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/mappings/instance-types/2022.03.01/instance-types_lang=en_specific.ttl.bz2>.
-   `short-abstracts_lang=en.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/vehnem/text/short-abstracts/2021.05.01/short-abstracts_lang=en.ttl.bz2>.

# Other Files
-   `gen_desc_data.py` <br>
    Used to generate a database representing data from the ttl files.
-   `desc_data.db` <br>
    Generated by `gen_desc_data.py`. <br>
    Tables: <br>
    -   `labels`:          `iri TEXT PRIMARY KEY, label TEXT `
    -   `ids`:             `iri TEXT PRIMARY KEY, id INT`
    -   `redirects`:       `iri TEXT PRIMARY KEY, target TEXT`
    -   `disambiguations`: `iri TEXT PRIMARY KEY`
    -   `types`:           `iri TEXT, type TEXT`
    -   `abstracts`:       `iri TEXT PRIMARY KEY, abstract TEXT`