Added IRIs as extra field properties in datamodels #434

quaat · 2024-02-27T10:10:32Z

Description:

Type of change:

Bug fix.
New feature.
Documentation update.

Checklist for the reviewer:

This checklist should be used as a help for the reviewer.

Is the change limited to one issue?
Does this PR close the issue?
Is the code easy to read and understand?
Do all new feature have an accompanying new test?
Has the documentation been updated as necessary?

codecov · 2024-02-27T10:15:01Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 82.79%. Comparing base (f5587a6) to head (3f9b721).
Report is 1 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #434   +/-   ##
=======================================
  Coverage   82.79%   82.79%           
=======================================
  Files          14       14           
  Lines         616      616           
=======================================
  Hits          510      510           
  Misses        106      106

Flag	Coverage Δ
linux	`82.79% <ø> (ø)`
linux-strategies	`82.79% <ø> (ø)`
windows	`82.14% <ø> (ø)`
windows-strategies	`82.14% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

CasperWA

Implementing the IRI key the way you have is deprecated in pydantic v2, see https://github.com/EMMC-ASBL/oteapi-core/actions/runs/8063034440/job/22024046478?pr=434#step:7:224.

I have made the according suggested changes in one of the files, but this should be repeated throughout all the config files, where IRI is implemented/added.
Also, I think you can remove the "type: ignore" comment afterwards.

oteapi/models/datacacheconfig.py

oteapi/models/filterconfig.py

jesper-friis · 2024-02-28T22:30:20Z

oteapi/models/datacacheconfig.py

@@ -36,6 +38,5 @@ class DataCacheConfig(AttrDict):
    tag: Optional[str] = Field(
        None,
        description="Tag assigned to the downloaded content, typically "
-        "identifying a session. Used with the `evict()` method to clean up a "
-        "all cache entries with a given tag.",
+        "identifying a session. Used with the `evict()` method to clean up all cache entries with a given tag.",


Please ahead to the PEP8 Python conventions. Max line length is 80 characters.

jesper-friis · 2024-02-28T23:02:21Z

oteapi/models/filterconfig.py

+    query: Optional[str] = Field(
+        None,
+        description="Define a query operation.",
+        IRI="http://www.w3.org/1999/02/22-rdf-syntax-ns#Statement",  # type: ignore


rdf:Statement refer to a RDF statement as a triple. That is not the meaning of the query field.

I haven't looked much into the details of the data cube vocabulary, but qb:slice seems to be a possible property one could refer to, since slicing is about selecting a subpart of a dataset, witch is exactly the purpose of a filter strategy. However, a big issue with data cube vocabulary is that it is not related to dcat, so I will not recommend to use it.

jesper-friis · 2024-02-28T23:28:18Z

oteapi/models/filterconfig.py

The suggested properties for the query, condition and limit fields seems arbitrary chosen. Since there exists no properties in the DCAT and related W3C vocabularies that correspond to these fields, I think that it is better that we define our them in the OTE Interface Ontology (OTEIO).

You are absolutely right about the IRIs being a bit random for certain properties, but I would much rather go to more generic concepts in established ontologies, than making up our own concepts in a custom ontology that nobody has adopted.

jesper-friis · 2024-02-28T23:52:38Z

oteapi/models/mappingconfig.py

    )
    prefixes: Optional[Dict[str, str]] = Field(
        None,
        description=(
            "Dictionary of shortnames that expands to an IRI given as local "
            "value/IRI-expansion-pairs."
        ),
+        IRI="http://www.w3.org/2004/02/skos/core#notation",  # type: ignore


If using skos:notation for prefixes, then we need a to define a custom datatype, like oteio:prefixType, such that we can serialise a prefix as:

:mappingFilter skos:notation "rdfs: <http://www.w3.org/2000/01/rdf-schema#>"^^oteio:prefixType .

However, I think it would be simpler to define our own oteio:prefix, such that we can express the above as:

:mappingFilter oteio:prefix "rdfs: <http://www.w3.org/2000/01/rdf-schema#>" .

jesper-friis · 2024-02-29T00:01:20Z

oteapi/models/parserconfig.py

+        description="Type of registered parser strategy.",
+        IRI="http://purl.org/dc/terms/type",
+    )  # type: ignore
+    entity: AnyHttpUrl = Field(


Entity is a difficult name. It can mean anything. I am not found of freely mixing different vocabularies, but if we really want to identify this with schema:url, then I think the field should be named url.

jesper-friis · 2024-02-29T00:02:58Z

oteapi/models/resourceconfig.py

-        None, description="Type of registered resource strategy."
+        None,
+        description="Type of registered resource strategy.",
+        IRI="http://purl.org/dc/terms/type",  # type: ignore
    )

    downloadUrl: Optional[HostlessAnyUrl] = Field(


Please spell the field name as downloadURL to be consistent with DCAT.

jesper-friis · 2024-02-29T00:03:35Z

oteapi/models/resourceconfig.py

@@ -42,6 +45,7 @@ class ResourceConfig(GenericConfig, SecretConfig):
            " type of the distribution is defined in IANA "
            "[[IANA-MEDIA-TYPES](https://www.w3.org/TR/vocab-dcat-2/#bib-iana-media-types)]."
        ),
+        IRI="http://www.w3.org/ns/dcat#mediaType",  # type: ignore
    )
    accessUrl: Optional[HostlessAnyUrl] = Field(


Please spell the field name as accessURL to be consistent with DCAT.

oteapi/models/transformationconfig.py

jesper-friis · 2024-02-29T00:45:07Z

oteapi/models/transformationconfig.py

-        None, description="Time when the transformation process started. Given in UTC."
+        None,
+        description="Time when the transformation process started. Given in UTC.",
+        IRI="http://purl.org/dc/terms/date",  # type: ignore


startTime and finishTime cannot have the same IRI. However, DCAT has dcat:startDate and dcat:endDate which are suitable here.

The only issue with dcat:startDate and dcat:endDate are that they have domain dcterms:PeriodOfTime. Since a TransformationStatus describes a time period, but isn't a time periode itself, the RDF serialisation cannot be straight forward, like

:transformation_status1 dcat:startTime "2024-02-29 08:45" ; dcat:endTime "2024-02-29 09:00" .

but has to has to be expressed as:

:transformation_status1 dcterms:temporal [ a dcterms:PeriodOfTime ; dcat:startTime "2024-02-29 08:45" ; dcat:endTime "2024-02-29 09:00" ; ] .

Co-authored-by: Casper Welzel Andersen <[email protected]>

Co-authored-by: Jesper Friis <[email protected]>

Added IRIs as extra field properties in datamodels

c05eb23

quaat requested review from CasperWA and Treesarj February 27, 2024 10:10

quaat linked an issue Feb 27, 2024 that may be closed by this pull request

IRI in Pydantic Datamodels for RDF serialization #433

Open

reset DataCacheConfig signature

8fb9de3

CasperWA requested changes Feb 28, 2024

View reviewed changes

jesper-friis reviewed Feb 28, 2024

View reviewed changes

jesper-friis reviewed Feb 29, 2024

View reviewed changes

oteapi/models/transformationconfig.py Outdated Show resolved Hide resolved

jesper-friis reviewed Feb 29, 2024

View reviewed changes

CasperWA mentioned this pull request Feb 29, 2024

Added additional descriptive fields to ResourceConfig #427

Open

8 tasks

quaat and others added 6 commits February 29, 2024 13:00

Update oteapi/models/filterconfig.py

70c113b

Co-authored-by: Casper Welzel Andersen <[email protected]>

Update oteapi/models/datacacheconfig.py

c7d1974

Co-authored-by: Casper Welzel Andersen <[email protected]>

Update oteapi/models/filterconfig.py

f00b31e

Co-authored-by: Casper Welzel Andersen <[email protected]>

Update oteapi/models/filterconfig.py

37551c7

Co-authored-by: Casper Welzel Andersen <[email protected]>

Update oteapi/models/filterconfig.py

216d717

Co-authored-by: Casper Welzel Andersen <[email protected]>

Update oteapi/models/transformationconfig.py

3f9b721

Co-authored-by: Jesper Friis <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added IRIs as extra field properties in datamodels #434

Added IRIs as extra field properties in datamodels #434

quaat commented Feb 27, 2024

codecov bot commented Feb 27, 2024 •

edited

Loading

CasperWA left a comment

jesper-friis Feb 28, 2024

jesper-friis Feb 28, 2024 •

edited

Loading

jesper-friis Feb 28, 2024 •

edited

Loading

quaat Feb 29, 2024

jesper-friis Feb 28, 2024

jesper-friis Feb 29, 2024

jesper-friis Feb 29, 2024

jesper-friis Feb 29, 2024

jesper-friis Feb 29, 2024 •

edited

Loading

Added IRIs as extra field properties in datamodels #434

Are you sure you want to change the base?

Added IRIs as extra field properties in datamodels #434

Conversation

quaat commented Feb 27, 2024

Description:

Type of change:

Checklist for the reviewer:

codecov bot commented Feb 27, 2024 • edited Loading

Codecov Report

CasperWA left a comment

Choose a reason for hiding this comment

jesper-friis Feb 28, 2024

Choose a reason for hiding this comment

jesper-friis Feb 28, 2024 • edited Loading

Choose a reason for hiding this comment

jesper-friis Feb 28, 2024 • edited Loading

Choose a reason for hiding this comment

quaat Feb 29, 2024

Choose a reason for hiding this comment

jesper-friis Feb 28, 2024

Choose a reason for hiding this comment

jesper-friis Feb 29, 2024

Choose a reason for hiding this comment

jesper-friis Feb 29, 2024

Choose a reason for hiding this comment

jesper-friis Feb 29, 2024

Choose a reason for hiding this comment

jesper-friis Feb 29, 2024 • edited Loading

Choose a reason for hiding this comment

codecov bot commented Feb 27, 2024 •

edited

Loading

jesper-friis Feb 28, 2024 •

edited

Loading

jesper-friis Feb 28, 2024 •

edited

Loading

jesper-friis Feb 29, 2024 •

edited

Loading