📑 Best Practices for Data

🌟 Why Do We Need Good Data Practices?

Good research data management (RDM) practices are essential for keeping data organized, minimizing errors, preventing data loss, and avoiding ambiguity.
Strong practices help ensure your data is FAIR (Findable, Accessible, Interoperable, and Reusable) once published A. They also support the CARE (Collective Benefit, Authority to Control, Responsibility, Ethics) and OCAP (Ownership, Control, Access, Possession) principles of Indigenous Data Sovereignty B C.


1. 📂 Choose File Formats Wisely

  • Proprietary formats (e.g., Excel) can limit interoperability.
  • CSV files are widely supported and machine‑readable.

✅ Best practice: Export Excel sheets to CSV at the end of analysis, saving each sheet as its own file D.
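The one-file-per-sheet step can be sketched roughly as follows. This is a minimal illustration using only Python's standard library, and it assumes the sheets have already been read into memory as lists of rows (for example with a spreadsheet library, which is not shown). The function name `write_sheets_as_csv` and the `<workbook>_<sheet>.csv` naming pattern are hypothetical choices, not part of the original guidance.

```python
import csv
from pathlib import Path


def write_sheets_as_csv(sheets, out_dir, workbook_stem):
    """Write each sheet (a list of rows) to its own CSV file.

    `sheets` maps a sheet name to its rows; how the rows were read from
    the workbook is outside this sketch (assumption).
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for sheet_name, rows in sheets.items():
        # Replace spaces so the file name follows the underscore convention.
        safe_name = sheet_name.strip().replace(" ", "_")
        target = out_dir / f"{workbook_stem}_{safe_name}.csv"
        # newline="" lets the csv module control line endings itself.
        with target.open("w", newline="", encoding="utf-8") as handle:
            csv.writer(handle).writerows(rows)
        written.append(target)
    return written
```

Calling `write_sheets_as_csv({"Site A": rows_a, "Site B": rows_b}, "processed", "survey")` would produce `survey_Site_A.csv` and `survey_Site_B.csv`, one file per sheet.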


2. 🗂️ Version Your Data

  • Track changes across the data lifecycle.
  • Prevent information loss and enable backtracking.

✅ Best practices:

  • Maintain separate folders for raw vs. processed files.
  • Use platforms like GitLab for advanced version control E.
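As a lightweight complement to a version control platform, one way to notice unintended changes to raw files is to record a checksum for each file and compare later. The sketch below is an illustration under that assumption, not a replacement for GitLab or a similar tool; the function names and the idea of keeping a manifest are hypothetical additions.

```python
import hashlib
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(raw_dir):
    """Map each file name directly inside raw_dir to its checksum."""
    return {
        path.name: sha256_of(path)
        for path in sorted(Path(raw_dir).iterdir())
        if path.is_file()
    }


def changed_files(raw_dir, manifest):
    """List files that are new or whose contents differ from the manifest."""
    current = build_manifest(raw_dir)
    return [name for name, digest in current.items() if manifest.get(name) != digest]
```

Building the manifest right after data collection and rerunning `changed_files` before each analysis gives a quick signal that a raw file was edited when it should not have been.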

3. 📊 Structure Tables & Spreadsheets Clearly

  • Arrange variables as columns, samples as rows.
  • Ensure consistency in variable names, column order, and data types.

✅ Best practices:

  • Avoid spaces/special characters in headers where possible; use underscores _.
  • Use one consistent representation for missing (NULL) values; an empty cell is preferred.
  • Standardize date formats (preferably ISO 8601, e.g., YYYY-MM-DD, with times recorded in UTC).
  • Do not merge cells or leave empty rows/columns D.
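Two of the practices above, cleaning headers and standardizing dates, can be sketched in a few lines of Python. This is a minimal illustration: the function names are hypothetical, the choice of ISO 8601 as the output format is an assumption, and `to_iso_utc` assumes the input timestamp is already expressed in UTC.

```python
import re
from datetime import datetime, timezone


def clean_header(name):
    """Replace runs of spaces and special characters in a header with '_'."""
    cleaned = re.sub(r"[^0-9A-Za-z]+", "_", name.strip())
    return cleaned.strip("_")


def to_iso_utc(value, fmt):
    """Parse a timestamp (assumed to be in UTC already) into ISO 8601 text."""
    parsed = datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)
    return parsed.isoformat()


print(clean_header("Water Temp (C)"))                     # Water_Temp_C
print(to_iso_utc("03/07/2021 14:30", "%d/%m/%Y %H:%M"))   # 2021-07-03T14:30:00+00:00
```

If the source timestamps are in local time, they would need to be converted to UTC before this step; that conversion is not covered here.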

4. 📝 Adopt Naming Conventions

  • Clear, consistent names make files easier to identify and reuse.

✅ Best practices:

  • Include dates or descriptive keywords.
  • Avoid special characters: < > : " / \ | ? * { } # & $ ! = ' `
  • Use underscores _ instead of spaces D.
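A small helper can apply these naming rules consistently. The sketch below is one possible approach: the function name `build_filename`, the date-first ordering, and the ISO-style date are assumptions made for illustration, and the set of removed characters mirrors the list above.

```python
from datetime import date

# Characters to strip from file names, taken from the list above.
FORBIDDEN = set('<>:"/\\|?*{}#&$!=\'`')


def build_filename(keyword, day, extension):
    """Build '<date>_<keyword>.<extension>' without spaces or special characters."""
    kept = "".join(ch for ch in keyword if ch not in FORBIDDEN)
    # Collapse any whitespace into single underscores.
    stem = "_".join(kept.split())
    return f"{day.isoformat()}_{stem}.{extension}"


print(build_filename("lake temp: site #3", date(2021, 7, 3), "csv"))
# 2021-07-03_lake_temp_site_3.csv
```

Putting the date first in year-month-day order means files sort chronologically in a directory listing, which is a design choice rather than a requirement of the guidance.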

5. 🔐 Ensure Secure Storage & Backup

  • Store data in secure, access‑controlled environments.
  • Maintain multiple backups (local + cloud).

✅ Best practices:

  • Follow institutional policies for storage and encryption.
  • Automate backups where possible F E.
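An automated backup step might look roughly like the sketch below, which copies a file to a backup folder under a timestamped name and then verifies the copy byte for byte. The function name and the naming scheme are hypothetical, and the sketch does not cover encryption or cloud storage, which should follow your institution's policies.

```python
import filecmp
import shutil
from datetime import datetime, timezone
from pathlib import Path


def backup_file(source, backup_dir):
    """Copy `source` into `backup_dir` with a UTC timestamp and verify the copy."""
    source = Path(source)
    backup_dir = Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)
    stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    target = backup_dir / f"{source.stem}_{stamp}{source.suffix}"
    shutil.copy2(source, target)  # copy2 also preserves file timestamps
    # shallow=False forces a comparison of the actual file contents.
    if not filecmp.cmp(source, target, shallow=False):
        raise OSError(f"Backup of {source} does not match the original")
    return target
```

A scheduler (for example cron or a task scheduler) could call `backup_file` on a regular interval; which scheduler to use is left open here.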

6. 🏷️ Document Metadata Thoroughly

  • Metadata makes data discoverable and reusable.

✅ Best practices:

  • Record provenance (how data was collected/processed).
  • Note constraints (usage rights, licenses).
  • Credit contributors.
  • Use standardized vocabularies and ontologies A D.
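One simple way to keep this information with a dataset is a metadata "sidecar" file stored next to the data file. The sketch below writes such a file as JSON. The field names and the `.metadata.json` suffix are hypothetical and do not follow any particular metadata standard; a real deposit should use the schema and vocabularies your repository requires.

```python
import json
from pathlib import Path


def write_metadata(data_file, provenance, license_name, contributors, keywords):
    """Write a JSON sidecar describing a data file and return its path."""
    data_file = Path(data_file)
    record = {
        "file": data_file.name,
        "provenance": provenance,      # how the data was collected/processed
        "license": license_name,       # usage rights and constraints
        "contributors": contributors,  # people to credit
        "keywords": keywords,          # ideally terms from a standard vocabulary
    }
    sidecar = data_file.with_name(data_file.name + ".metadata.json")
    sidecar.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return sidecar
```

Because the sidecar shares the data file's name, the two stay together when files are sorted, copied, or archived.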

7. 🤝 Respect Indigenous Data Sovereignty

  • Apply CARE and OCAP principles when working with Indigenous data.

✅ Best practices:

  • Ensure communities retain ownership and control.
  • Share data in ways that provide collective benefit.
  • Follow cultural protocols for access and use B C G.

8. 📤 Plan for Sharing & Preservation

  • Deposit data in trusted repositories for long‑term access.
  • Ensure compliance with funder and institutional policies.

✅ Best practices:

  • Use repositories aligned with FAIR principles (e.g., CanWIN DataHub).
  • Include persistent identifiers (DOIs) for datasets H I.

📚 References & Extra Sources


Tip

See our Data File Best Practices primer for a quick guide to keep with you!