This directory holds files obtained from, or generated using, [DBpedia](https://www.dbpedia.org).

# Downloaded Files
-   `labels_lang=en.ttl.bz2` <br>
    Obtained via <https://databus.dbpedia.org/dbpedia/collections/latest-core>.
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/labels/2022.03.01/labels_lang=en.ttl.bz2>.
-   `page_lang=en_ids.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/page/2022.03.01/page_lang=en_ids.ttl.bz2>.
-   `redirects_lang=en_transitive.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/redirects/2022.03.01/redirects_lang=en_transitive.ttl.bz2>.
-   `disambiguations_lang=en.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/generic/disambiguations/2022.03.01/disambiguations_lang=en.ttl.bz2>.
-   `instance-types_lang=en_specific.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/dbpedia/mappings/instance-types/2022.03.01/instance-types_lang=en_specific.ttl.bz2>.
-   `short-abstracts_lang=en.ttl.bz2` <br>
    Downloaded from <https://databus.dbpedia.org/vehnem/text/short-abstracts/2021.05.01/short-abstracts_lang=en.ttl.bz2>.
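
The downloaded files are bz2-compressed and can be read as text without unpacking them first. The following is a minimal sketch of reading the labels file, assuming each line holds one triple of the form `<subject IRI> <predicate IRI> "literal"@en .`. The names `parse_label_line` and `iter_labels` are hypothetical and are not necessarily how `genDescData.py` does it; escape sequences inside the literal are kept as-is rather than decoded.

```python
import bz2
import re

# Assumed line shape: <subject IRI> <predicate IRI> "literal"@en .
# The literal may contain backslash-escaped characters such as \".
LABEL_LINE = re.compile(r'<([^>]+)> <[^>]+> "((?:[^"\\]|\\.)*)"@en \.')


def parse_label_line(line):
    """Return (iri, label) for a matching line, or None otherwise."""
    match = LABEL_LINE.fullmatch(line.strip())
    if match is None:
        return None
    return match.group(1), match.group(2)


def iter_labels(path):
    """Yield (iri, label) pairs from a bz2-compressed file, skipping other lines."""
    with bz2.open(path, mode="rt", encoding="utf-8") as file:
        for line in file:
            pair = parse_label_line(line)
            if pair is not None:
                yield pair
```

Iterating over `iter_labels("labels_lang=en.ttl.bz2")` would then stream the pairs one at a time, so the whole file never needs to be held in memory.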

# Other Files
-   `genDescData.py` <br>
    Generates a database holding the data from the ttl files.
-   `descData.db` <br>
    Generated by `genDescData.py`. <br>
    Tables: <br>
    -   `labels`:          `iri TEXT PRIMARY KEY, label TEXT`
    -   `ids`:             `iri TEXT PRIMARY KEY, id INT`
    -   `redirects`:       `iri TEXT PRIMARY KEY, target TEXT`
    -   `disambiguations`: `iri TEXT PRIMARY KEY`
    -   `types`:           `iri TEXT, type TEXT`
    -   `abstracts`:       `iri TEXT PRIMARY KEY, abstract TEXT`
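
As an illustration of the table layout above, here is a minimal sketch that creates the same tables with Python's `sqlite3` module and looks up an abstract. It assumes the database is SQLite, and that following a single redirect is enough because the redirects file is the transitive variant. The names `create_tables` and `lookup_abstract` are hypothetical and do not come from `genDescData.py`.

```python
import sqlite3

# Column definitions copied from the table list above.
SCHEMA = """
CREATE TABLE labels (iri TEXT PRIMARY KEY, label TEXT);
CREATE TABLE ids (iri TEXT PRIMARY KEY, id INT);
CREATE TABLE redirects (iri TEXT PRIMARY KEY, target TEXT);
CREATE TABLE disambiguations (iri TEXT PRIMARY KEY);
CREATE TABLE types (iri TEXT, type TEXT);
CREATE TABLE abstracts (iri TEXT PRIMARY KEY, abstract TEXT);
"""


def create_tables(connection):
    """Create the six tables in an empty database."""
    connection.executescript(SCHEMA)


def lookup_abstract(connection, iri):
    """Return the abstract for iri, following one redirect if present, else None."""
    row = connection.execute(
        "SELECT target FROM redirects WHERE iri = ?", (iri,)
    ).fetchone()
    if row is not None:
        iri = row[0]  # Assumption: the target is already the final page.
    row = connection.execute(
        "SELECT abstract FROM abstracts WHERE iri = ?", (iri,)
    ).fetchone()
    return row[0] if row is not None else None
```

Calling `create_tables(sqlite3.connect(":memory:"))` gives an empty in-memory database with this layout, which is handy for experimenting without touching `descData.db`.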