Skip to content

Commit

Permalink
release 2023-07-07
Browse files Browse the repository at this point in the history
  • Loading branch information
simongray committed Jul 7, 2023
1 parent eb47f48 commit 5ed7f65
Show file tree
Hide file tree
Showing 11 changed files with 34 additions and 20 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,7 @@ Significant changes
-------------------

### New schema, prefixes, URIs
DanNet uses a new schema, [available in this repository](resources/schemas/internal/dannet-schema-2022.ttl) and also at https://wordnet.dk/dannet/schema.
DanNet uses a new schema, [available in this repository](resources/schemas/internal/dannet-schema.ttl) and also at https://wordnet.dk/dannet/schema.

DanNet uses the following URI prefixes for the dataset instances, concepts (the range of `dns:ontologicalFacet` and `dns:ontologicalType`) and the schema itself:

Expand Down
2 changes: 1 addition & 1 deletion pages/about-da.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ DanNet er baseret på [Ontolex][Ontolex]-standarden med [tilføjelser][GWA RDF]

DanNet er også integreret med [COR][COR] og [DDS][DDS], samt det [engelske WordNet][OEWN] (du kan hente deres datasæt fra deres egen side) som vi har udvidet med RDF-etiketter, der minder om dem brugt i DanNet. Disse alternative RDF-datasæt kan ligeledes downloades her på siden ([COR-integration][COR-integration], [DDS-integration][DDS-integration], [OEWN-udvidelse][OEWN-extension]). Vi har også inkluderet [CILI][CILI]-data i vores database, da dette bruges som et fælles integrationspunkt for forskellige WordNets.

Du kan derudover også hente en [komplet kopi][complete] af al data der kan tilgås på wordnet.dk/dannet, inklusiv logisk udledt data og tilknyttede RDF-skemaer.
I tidligere versioner af DanNet, kunne du derudover også hente en komplet kopi af al data der kunne tilgås på wordnet.dk/dannet, inklusiv logisk udledt data og tilknyttede RDF-skemaer. Desværre har dette vist sig at være for at ressourcekrævende at generere som en del af en almindelig DanNet-udgivelse. Vi vil forsøge at ændre på dette i fremtiden.

## Dokumentation
Følgende dokumenter er kun tilgængelige på engelsk og primært tiltænkt udviklere:
Expand Down
2 changes: 1 addition & 1 deletion pages/about-en.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ DanNet is based on the [Ontolex][Ontolex] standard with [additions][GWA RDF] fro

DanNet is integrated with [COR][COR] and [DDS][DDS] too, as well as the [English WordNet][OEWN] (you may download their data set from that page) which we have extended with RDF labels resembling those used in DanNet. These alternative RDF data sets can also be downloaded on this page ([COR integration][COR-integration], [DDS integration][DDS-integration], [OEWN extension][OEWN-extension]). We have also included the [CILI][CILI] data in our database, as it is used as a common integration point for different WordNets.

In addition, you may also download a [complete copy][complete] of all the data that can be found on wordnet.dk/dannet, including logically inferred data and associated RDF schemas.
In earlier releases, you could also download a complete copy of all the data that can be found on wordnet.dk/dannet, including logically inferred data and associated RDF schemas. However, this has proven too resource-intensive to generate as part of a regular DanNet release. We will try to remedy this in the future.

## Documentation
The following documents are only available in English and mostly for developers:
Expand Down
7 changes: 7 additions & 0 deletions pages/releases-da.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,13 @@
# Versioner
De nye DanNet-versioner bruger udgivelsesdatoen som versionsnummer, formatteret som `YYYY-MM-DD`.

## **2023-07-07**: Tusinder af nye links samt skemaopdateringer
* DanNet har nu omkring 10K nye links til [CILI](https://github.com/globalwordnet/cili), som også linker til OEWN og andre WordNets.
* Flere nye relationer ( `dns:eqHypernym`, `dns:eqHyponym` og `dns:eqSimilar` ) er blevet tilføjet, da `wn:ili` og `wn:eqSynonym` ikke var nok til at beskrive de relationer vi har mellem forskellige WordNets nu.
* DanNet-synsets har nu også de rå DDO-domæneværdier fra DSL, der fandtes i gamle versioner af DanNet. Disse repræsenteres med `dc:subject`-relationen.
* Derudover er kønsdata fra de gamle versioner af DanNet også nu inkluderet. Det kan findes via den nye `dns:gender`-relation.
* For bedre at kunne facilitere navigation af grafen på DanNet-hjemmesiden er en ny relation, `dns:linkedConcept`, blevet tilføjet til DanNet-skemaet. Denne relation er den omvendte relation af `wn:ili` og kan udledes i den store graf der kan udforskes på wordnet.dk/dannet.

## **2023-06-01**: ~5000 links til Open English WordNet
* Skemaoversættelserne er blevet opdateret.
* Omtrent 5000 links er blevet tilføjet, som linker DanNet med [Open English WordNet](https://github.com/globalwordnet/english-wordnet) eller indirekte via [CILI](https://github.com/globalwordnet/cili).
Expand Down
7 changes: 7 additions & 0 deletions pages/releases-en.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,13 @@
# Releases
The newer DanNet releases use the release date as the version number, formatted as `YYYY-MM-DD`.

## **2023-07-07**: Thousands of new links and schema updates
* DanNet now has around 10K new links to the [CILI](https://github.com/globalwordnet/cili) which is also linked with the OEWN and other WordNets.
* Some new relations (`dns:eqHypernym`, `dns:eqHyponym`, and `dns:eqSimilar`) have been added since `wn:ili` and `wn:eqSynonym` were not sufficient to cover the inter-WordNet links we now have.
* DanNet synsets now also have the raw DDO domain values from DSL that were present in the older versions of DanNet. These are represented with the `dc:subject` relation.
* Furthermore, the sex/gender data from the older versions of DanNet has also been included. It is available via the new `dns:gender` relation.
* To better facilitate graph navigation on the DanNet website, a new relation called `dns:linkedConcept` has been added to the DanNet schema. This relation is the inverse of `wn:ili` and is inferred in the large graph that can be queried on wordnet.dk/dannet.

## **2023-06-01**: ~5000 links to the Open English WordNet
* The schema translations have been updated.
* Around 5000 links have been added which link DanNet to the [Open English WordNet](https://github.com/globalwordnet/english-wordnet) or indirectly via the [CILI](https://github.com/globalwordnet/cili).
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
@prefix wn: <https://globalwordnet.github.io/schemas/wn#> .

<https://wordnet.dk/dannet/schema> a owl:Ontology ;
dc:issued "2023-06-01" ;
dc:issued "2023-07-07" ;
dc:title "DanNet-skema"@da, "DanNet schema"@en;
dc:description "Skema for DanNet-specifikke relationer."@da, "Schema for DanNet-specific relations."@en ;
dc:publisher "<https://cst.dk>";
Expand Down Expand Up @@ -74,12 +74,6 @@
rdfs:comment "Subjektets køn."@da ;
rdfs:range :Gender .

:dslDomain a owl:AnnotationPropety ;
rdfs:label "DDO domain"@en ;
rdfs:label "DDO-domæne"@da ;
rdfs:comment "The domain given for the concept in DDO (\"Den Danske Ordbog\")."@en ;
rdfs:comment "Domæne angivet for begrebet i Den Danske Ordbog."@da .

:sentiment a owl:ObjectProperty ;
rdfs:range marl:Opinion ;
rdfs:label "sentiment"@en ;
Expand Down
13 changes: 11 additions & 2 deletions src/main/dk/cst/dannet/db/bootstrap.clj
Original file line number Diff line number Diff line change
Expand Up @@ -95,7 +95,7 @@
"2023-06-01")

(def current-release
(str old-release "-SNAPSHOT"))
(str "2023-07-07"#_#_old-release"-SNAPSHOT"))

(defn assert-expected-dannet-release!
"Assert that the DanNet `model` is the expected release to boostrap from."
Expand Down Expand Up @@ -274,6 +274,7 @@
[synset-id]
(keyword "dn" (str "synset-" (subs synset-id 3 (- (count synset-id) 3)))))

;; TODO: remove for next release
(h/def new-english-link-triples
(delay
(with-open [reader (clojure.java.io/reader "bootstrap/other/dannet-new/wordnetloom/synset_relation.csv")]
Expand All @@ -295,6 +296,14 @@
rel
(@synset-id->ili-id child-synset-id)]))))
(remove nil?)
;; A few corrections for flipped triples in the dataset.
(map (fn [[s p o :as triple]]
(if (= "ili" (namespace s))
(cond
(= p :wn/ili) [o p s]
(= p :dns/eqHypernym) [o :dns/eqHyponym s]
:else triple)
triple)))
(doall)))))

;; TODO: move to separate ns
Expand Down Expand Up @@ -337,7 +346,7 @@
"female" :dns/Female}]
(cond
(= rel "domain")
#{[synset :dns/dslDomain (da v)]}
#{[synset :dc/subject (da v)]}

(= rel "sex")
(when-let [gender (sex->gender v)]
Expand Down
4 changes: 0 additions & 4 deletions src/main/dk/cst/dannet/db/export/rdf.clj
Original file line number Diff line number Diff line change
Expand Up @@ -165,8 +165,4 @@
(->> (q/table-query g '[?synset ?definition ?ontotype ?sense] op/csv-synsets)
(map csv-row)
(take 10)))

;; Export DanNet as CSV
(export-csv! dannet)
(export-csv! @dk.cst.dannet.web.resources/db)
#_.)
9 changes: 5 additions & 4 deletions src/main/dk/cst/dannet/prefix.cljc
Original file line number Diff line number Diff line change
Expand Up @@ -89,21 +89,22 @@
:export #{'dn 'dnc 'dns
'rdf 'rdfs 'owl
'wn 'ontolex 'skos 'lexinfo
'dcat 'vann 'foaf 'dc}
'dcat 'vann 'foaf 'dc
'ili 'en}
:download {"rdf" {:default "dannet.zip"
"merged" "dannet-dds-cor.zip"
"complete" "dannet-complete.zip"}
"csv" {:default "dannet-csv.zip"}}}
'dnc {:uri (str dannet-root "concepts/")
:resource (str "<" dannet-root "concepts>")
:alt "schemas/internal/dannet-concepts-2022.ttl"}
:alt "schemas/internal/dannet-concepts.ttl"}
'dns {:uri (str dannet-root "schema/")
:resource (str "<" dannet-root "schema>")
:alt "schemas/internal/dannet-schema-2022.ttl"}
:alt "schemas/internal/dannet-schema.ttl"}

;; Various en->da translations included as additional data.
'tr {:uri (str dannet-root "translations/")
:alt "schemas/internal/dannet-translations-2022.ttl"}})
:alt "schemas/internal/dannet-translations.ttl"}})

(def oewn-extension
"Our extension of the OEWN containing labels for words, senses, synsets."
Expand Down

0 comments on commit 5ed7f65

Please sign in to comment.