Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prominently indicate when a data package is essentially a duplicate of an external version #2541

Open
robyngit opened this issue Sep 30, 2024 · 1 comment · May be fixed by #2546
Open

Prominently indicate when a data package is essentially a duplicate of an external version #2541

robyngit opened this issue Sep 30, 2024 · 1 comment · May be fixed by #2546
Assignees
Labels
ADC CI-14 Data search and display improvements (ADC deliverable) arctic data center dataset landing page Enhancements and bugs for the dataset landing pages enhancement Priority: High semantics
Milestone

Comments

@robyngit
Copy link
Member

robyngit commented Sep 30, 2024

MetacatUI currently doesn't clearly distinguish between duplicate datasets that have been copied to ADC from an external source. This can lead to confusion, especially when both the original and derived datasets have different DOIs but are essentially the same. NSIDC and other data centers are concerned that differing citations can impact usage metrics and funding. The goal is to make it easier for users to identify the original dataset by displaying it prominently when it exists. This way, users can easily reference and credit the original source, helping to avoid issues with misrepresentation and citation confusion.

Planned Solution

  • When annotations include both schema.org:sameAs and prov:derivedFrom reference the same DOI or UUID, MetacatUI should prominently display the original source data.
  • The display should retrieve citation info directly from the DOI using a service (e.g., DOI or metrics service).
  • In the "Cite this dataset" modal, show both the original and the derived dataset, clearly distinguishing them and giving users discretion on which to cite.

Related issues: #2051, #122

@robyngit robyngit added enhancement semantics Priority: High dataset landing page Enhancements and bugs for the dataset landing pages arctic data center ADC CI-14 Data search and display improvements (ADC deliverable) labels Sep 30, 2024
@robyngit robyngit added this to the 2.32.0 milestone Sep 30, 2024
@robyngit robyngit self-assigned this Sep 30, 2024
@robyngit
Copy link
Member Author

robyngit commented Oct 2, 2024

Test dataset with wasDerivedFrom and sameAs annotations both with the same DOI url as the value: https://demo.arcticdata.io/view/urn%3Auuid%3Aea3f0ce4-a084-47b7-ae60-7cbe808c57a0

<annotation>
  <propertyURI label="wasDerivedFrom">http://www.w3.org/ns/prov#wasDerivedFrom</propertyURI>
  <valueURI label="https://doi.org/doi:10.1038/sdata.2015.39">https://doi.org/doi:10.1038/sdata.2015.39</valueURI>
</annotation>
<annotation>
  <propertyURI label="sameAs">http://www.w3.org/2002/07/owl#sameAs</propertyURI>
  <valueURI label="https://doi.org/doi:10.1038/sdata.2015.39">https://doi.org/doi:10.1038/sdata.2015.39</valueURI>
</annotation>

robyngit added a commit that referenced this issue Oct 3, 2024
- Add a new view, CanonicalDatasetHandlerView, to handle identifying and displaying the canonical dataset for a given dataset
- The view removes the annotations that identify the canonical dataset from the metadata view and adds a field to display the canonical dataset url
- The view has shell methods to fill in that will make further modifications to the metadata view

Issue #2541
@robyngit robyngit linked a pull request Oct 3, 2024 that will close this issue
robyngit added a commit that referenced this issue Oct 3, 2024
- Deprecate the template that rendered the icons previously
- Add new methods to MetadataView to render the icons
- Call new methods to render the duplicate icon with the CanonicalDatasetHandlerView

Issue #2541
robyngit added a commit that referenced this issue Oct 3, 2024
- Add a new view, CanonicalDatasetHandlerView, to handle identifying and displaying the canonical dataset for a given dataset
- The view removes the annotations that identify the canonical dataset from the metadata view and adds a field to display the canonical dataset url
- The view has shell methods to fill in that will make further modifications to the metadata view

Issue #2541
robyngit added a commit that referenced this issue Oct 3, 2024
- Deprecate the template that rendered the icons previously
- Add new methods to MetadataView to render the icons
- Call new methods to render the duplicate icon with the CanonicalDatasetHandlerView

Issue #2541
robyngit added a commit that referenced this issue Oct 10, 2024
The metadata view re-renders when a users' logged in status changes. Depending on the timing of this change,
re-rendering may occur in the middle of a previous render. This causes elements of the view to break, especially the
new CanonicalDatasetHandlerView. This commit fixes the issue by adding a check to see if the view is already
rendering before re-rendering.

Issue #2541
robyngit added a commit that referenced this issue Oct 10, 2024
And show the citation modal when the "duplicate" info icon is clicked.

Issue #2541
robyngit added a commit that referenced this issue Oct 14, 2024
robyngit added a commit that referenced this issue Oct 14, 2024
and add rendering of issue & volume for journal articles in the citation APA template

Issue #2541
robyngit added a commit that referenced this issue Oct 14, 2024
using the new CrossRef model

Issue #2541
robyngit added a commit that referenced this issue Oct 14, 2024
using the new CrossRef model

Issue #2541
robyngit added a commit that referenced this issue Oct 14, 2024
Handle datasets that have multiple annotations with the sameAs and/or derivedFrom property.

Issue #2541
robyngit added a commit that referenced this issue Oct 14, 2024
In Metadata view, show the full citation for the canonical dataset in the "Canonical Dataset" section in the general metadata view.

Issue #2541
@robyngit robyngit linked a pull request Oct 14, 2024 that will close this issue
robyngit added a commit that referenced this issue Oct 17, 2024
Canonical dataset citation is now before the "this version" citation in the citation modal on dataset landing pages.

Issue #2541
robyngit added a commit that referenced this issue Oct 17, 2024
In the MetadataView. This ensures that if the logged in status changes, the view will re-render to show the correct information.

Issue #2541
robyngit added a commit that referenced this issue Oct 17, 2024
In the MetadataView. This ensures that if the logged in status changes, the view will re-render to show the correct information.

Issue #2541
robyngit added a commit that referenced this issue Oct 17, 2024
Remove the Canonical Dataset field in the General section of dataset landing pages, and instead show a more prominent blue info box at the top of the page with the citation.

Issue #2541
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ADC CI-14 Data search and display improvements (ADC deliverable) arctic data center dataset landing page Enhancements and bugs for the dataset landing pages enhancement Priority: High semantics
Projects
1 participant