π Best Practices for Data
π Why Do We Need Good Data Practices?
Good research data management (RDM) practices are essential for keeping data organized, minimizing errors, preventing data loss, and avoiding ambiguity.
Strong practices ensure your data is FAIR once published β Findable, Accessible, Interoperable, and Reusable A.
They also support CARE (Collective Benefit, Authority to Control, Responsibility, Ethics) and OCAP (Ownership, Control, Access, Possession) principles of Indigenous Data Sovereignty B C.
π§ Recommended Best Practices
1. π Choose File Formats Wisely
- Proprietary formats (e.g., Excel) can limit interoperability.
- CSV files are widely supported and machineβreadable.
β Best practice: Export Excel sheets to CSV at the end of analysis, saving each sheet as its own file D.
2. ποΈ Version Your Data
- Track changes across the data lifecycle.
- Prevent information loss and enable backtracking.
β Best practices:
- Maintain separate folders for raw vs. processed files.
- Use platforms like GitLab for advanced version control E.
3. π Structure Tables & Spreadsheets Clearly
- Arrange variables as columns, samples as rows.
- Ensure consistency in variable names, column order, and data types.
β Best practices:
- Avoid spaces/special characters in headers where possible; use underscores
_. - Use consistent NULL values (blank space preferred).
- Standardize date formats (preferably UTC).
- Do not merge cells or leave empty rows/columns D.
4. π Adopt Naming Conventions
- Clear, consistent names make files easier to identify and reuse.
β Best practices:
- Include dates or descriptive keywords.
- Avoid special characters:
< > : " / \ | ? * { } # & $ ! = | \' "` - Use underscores
_instead of spaces D.
5. π Ensure Secure Storage & Backup
- Store data in secure, accessβcontrolled environments.
- Maintain multiple backups (local + cloud).
β Best practices:
6. π·οΈ Document Metadata Thoroughly
- Metadata makes data discoverable and reusable.
β Best practices:
- Record provenance (how data was collected/processed).
- Note constraints (usage rights, licenses).
- Credit contributors.
- Use standardized vocabularies and ontologies A D.
7. π€ Respect Indigenous Data Sovereignty
- Apply CARE and OCAP principles when working with Indigenous data.
β Best practices:
- Ensure communities retain ownership and control.
- Share data in ways that provide collective benefit.
- Follow cultural protocols for access and use B C G.
8. π€ Plan for Sharing & Preservation
- Deposit data in trusted repositories for longβterm access.
- Ensure compliance with funder and institutional policies.
β Best practices:
- Use repositories aligned with FAIR principles (e.g., CanWIN DataHub).
- Include persistent identifiers (DOIs) for datasets H I.
π References & Extra Sources
- Best Practices in Data Organization Using Spreadsheets
- McGill RDM Guidelines F
- CIHR Data Management Plan Guidance H
- Smithsonian RDM Best Practices D
- GO FAIR Principles A
- FNIGC OCAP Principles B
Tip
See our Data File Best Practices primer for a quick guide to keep with you!