Curated Vocabularies
Using curated, controlled vocabularies developed through national and international earth-science semantic networks is a key step in data curation.
Standardized vocabularies ensure data is:
- Consistent: terms mean the same thing across datasets
- Unambiguous: reduces confusion in interpretation
- Interoperable: reusable across systems and repositories
Simply put, they provide a common language for describing data.
Standardized Terms Used by CanWIN
Below are the main controlled vocabularies you'll encounter when publishing or viewing data in CanWIN.
Keywords
Keywords highlight the main ideas in your research, making data easier to find in the Data Catalogue or other repositories.
CanWIN maintains a curated list of standardized keywords for Arctic and freshwater data.
Primary source vocabulary: Polar Data Catalogue (PDC)
CanWIN's Curated Keywords
Variable Descriptors
Variable descriptors add context to tabular data or data dictionaries, helping users understand variables more clearly.
Primary source vocabulary: EPA & USGS Water Quality eXchange (WQX)
CanWIN's Variable Descriptors
Variable Names
Standardized variable names link measurements to permanent, clearly defined terms and definitions.
This removes ambiguity and makes data easier to reuse across disciplines.
Primary source vocabularies:
- BODC NERC Vocabulary
- CF Standard Names
CanWIN's Standardized Vocabulary App
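As a rough illustration of how standardized variable names remove ambiguity, the sketch below renames local column headers to CF Standard Names. The local headers (`Temp`, `Sal`), the mapping, and the helper `standardize_columns` are hypothetical and chosen only for this example; they are not taken from CanWIN's Standardized Vocabulary App.

```python
# Hypothetical mapping from local column headers to CF Standard Names.
# The local headers are invented for illustration; a real mapping should
# be built from the vocabularies listed above.
LOCAL_TO_STANDARD = {
    "Temp": "sea_water_temperature",
    "Sal": "sea_water_practical_salinity",
}


def standardize_columns(row, mapping):
    """Rename the keys of one record using the mapping.

    Returns the renamed record and a list of column names that had no
    standardized equivalent, so they can be reviewed by a curator.
    """
    renamed = {}
    unmapped = []
    for name, value in row.items():
        if name in mapping:
            renamed[mapping[name]] = value
        else:
            # Keep the original name, but flag it for review.
            renamed[name] = value
            unmapped.append(name)
    return renamed, unmapped


row = {"Temp": 4.2, "Sal": 31.5, "Notes": "ice cover"}
renamed, unmapped = standardize_columns(row, LOCAL_TO_STANDARD)
print(renamed)
print(unmapped)
```

Flagging unmapped columns instead of dropping them keeps the original data intact while showing which variables still need a standardized term.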
Why This Matters
- Improves searchability in CanWIN and global repositories
- Ensures interoperability across platforms and disciplines
- Supports data reuse by providing clear definitions
- Aligns with FAIR principles (Findable, Accessible, Interoperable, Reusable)
Tip
When preparing metadata, always check CanWIN's curated vocabularies first.
This ensures your dataset is aligned with international standards and easier to discover.
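One way to apply this tip before submitting metadata is to compare your keywords against the curated list. The sketch below is a minimal example under stated assumptions: `CURATED_KEYWORDS` is a made-up excerpt, not CanWIN's actual keyword list, and `check_keywords` is a hypothetical helper.

```python
# Hypothetical excerpt of a curated keyword list (NOT CanWIN's actual list).
CURATED_KEYWORDS = {"sea ice", "water quality", "lake winnipeg"}


def check_keywords(keywords, curated):
    """Split keywords into those found in the curated list and those not.

    Matching ignores case and extra whitespace; the original spelling of
    each keyword is preserved in the returned lists.
    """
    accepted, rejected = [], []
    for kw in keywords:
        normalized = " ".join(kw.lower().split())
        if normalized in curated:
            accepted.append(kw)
        else:
            rejected.append(kw)
    return accepted, rejected


accepted, rejected = check_keywords(
    ["Sea Ice", "  water  quality", "H2O stuff"], CURATED_KEYWORDS
)
print("In curated list:", accepted)
print("Needs review:", rejected)
```

Keywords reported as needing review can then be replaced with the closest curated term.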