You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Licensing of Composite Scientific Datasets
Problem Statement
Metadata standards such as DataCite define licensing at the dataset (DOI) level.
In practice, however, many datasets:
This creates a structural mismatch between metadata representation and legal reality.
1. Dataset-Level License (Metadata Perspective)
In DataCite:
rights/rightsURIapply to the dataset as a whole.This works well if:
It becomes problematic if licensing differs across resources.
2. Common Licensing Scenarios
Scenario A – Uniform License
All components:
→ Straightforward.
→ Dataset-level license accurately reflects legal situation.
Scenario B – Dual Licensing (True Dual License)
The same dataset is offered under two alternative licenses:
Conditions:
This is legally coherent and still dataset-level uniform.
Scenario C – Multi-License Aggregation
The dataset contains different components under different licenses:
This is not dual licensing.
This is a composite dataset with heterogeneous licensing.
In this case:
Scenario D – Public-Domain Content + New Structure
Example:
Legal structure:
The dataset-level license may apply to:
But not to the underlying public-domain texts.
3. Core Distinction
It is essential to distinguish between:
These are conceptually different layers.
4. Legal Risks of Oversimplification
If a dataset-level license implies uniformity while components differ:
Transparency is legally safer than simplification.
5. Recommended Model: Layered Licensing
Layer 1 – Dataset-Level License (Container)
Define a primary license that governs:
This is the license declared in DataCite.
Example wording:
Layer 2 – Resource-Level Licensing
For each resource:
This should be documented in:
Layer 3 – Explicit Exceptions
Include a clarification statement such as:
6. Terminological Precision
Distinguish clearly between:
7. Practical Recommendation
For composite scientific datasets:
Conclusion
Metadata systems are DOI-centric and assume uniform licensing.
Real-world datasets are often composite legal objects.
The robust solution is a layered licensing model:
Beta Was this translation helpful? Give feedback.
All reactions