Research Data Management

inSileco & ArcticNet

2026-02-26

About Us

David Beauchesne & Kevin Cazelles

Co-founders, inSileco

Research data management
Automated workflows
Augmented analytics
Decision-support tools

Collaborations:

ArcticNet, Sentinelle Nord, ECCC, DFO, W8banaki, Wendake, IAAC, CEM

About Us

inSileco & ArcticNet (since 2023)

Develop criteria for project Data Management
Review and provide feedback on project Data Management Plans
Support researchers with RDM practices and tools
Maintain and expand ArcticNet’s long-term data archive
Deliver training and capacity building (e.g. this webinar)

Webinar Structure

Context
Workflow
Organizational considerations
Practice: let’s draft your DMP
Future
Q&A

Context

RDM, Data & Open Science

Research Data

Researchers transform facts into knowledge
- collect, analyze and archive research data

Broad definitions:
- recorded information supporting research findings
  - structured collections of bytes

What is RDM?

RDM: Research Data Management

Active management of research data
Includes planning, documentation, storage, sharing
Encompasses both technical practices and governance

Research Data are Big

GBIF: Global Biodiversity Information Facility

Powerful technologies enabling unprecedented data collection
Ex: GBIF
- 125 million records in 2007
- 1.6 billion in 2020
- 1,150% increase in just 13 years

A modern LLM is typically trained with 1x10^13 two-byte tokens, which is 2x10^13 bytes. (Yann Lecun, X, 2024)

Research Data are Heterogeneous

Data vary widely across and within disciplines
Different technologies, different formats,
New research questions, new data

Research Data are Valuable

We need reliable data to better understand and predict
- anticipate/mitigate future changes
- Ex: good assessment of temperature and precipitation change
Some data are hard and expensive to collect
- Arctic Data are good examples
Past and today data are crucial for future generations
- We cannot collect past data
- considerable past public money has been spent to collect them

So,

Research data are big and heterogeneous
- ➡️ hard to manage

Research data are extremely valuable
- ➡️ must be managed

We need to take good care of all collected datasets

Data Workflow

Data Workflow in Research Projects

In a nutshell

Collect

Use the best protocols
Choose the right data format
Use secured storage

Analyse

Report all data manipulations
Save processed data
Share your code

In a Nutshell

Reuse

Future you or others
License your data
Report secondary datasets usage

Document to Contextualize

Documentation & Metadata

Metadata?

Data about data
Describe the context not the content
Provide consistent fields for who, what, where, when, how
Standard Examples:
- Dublin Core ➡️ general-purpose descriptors
- ISO 19115 ➡️ geospatial metadata
- Darwin Core ➡️ biodiversity metadata
- DataCite Schema ➡️ dataset metadata for DOIs

Metadata standards vs Data standards

Metadata standards describe the data itself (context & discovery), while data standards define how the data is structured. Together, they ensure interoperability and reuse.

Documentation & Metadata

Dublin Core?

A generic metadata standard used across disciplines
Focused on: who, what, where, when
Syntax is machine-readable (XML, JSON) but also human-readable

Example Record

<record>
  <dc:title>Water Sampling Data 2025</dc:title>
  <dc:creator>Smith, J.</dc:creator>
  <dc:subject>Oceanography</dc:subject>
  <dc:date>2025-04-15</dc:date>
  <dc:format>CSV</dc:format>
  <dc:identifier>doi:10.12345/abcd</dc:identifier>
</record>

Storing your Data

Use a secured storage
One copy on one computer is not enough
Multiple copies, using cloud
If possible use the 3-2-1 backup rule
How sensitive are your data?

Archiving your Data

Select a Digital Repository

Data repositories

National/Institutional ➡️ Federated Research Data Repository (FRDR), Borealis
Disciplinary ➡️ GBIF, OBIS, GenBank, PANGAEA
General-purpose ➡️ Zenodo, Dryad, Figshare, Dataverse

Choose a repository that is:

Appropriate for your data type & community
Trusted (certified, long-term sustainability)
FAIR and TRUST-aligned (metadata standards, PIDs)

FAIR Principles

(F) Findable
(A) Accessible
(I) Interoperable
(R) Reusable

Goals:

Make data easy to discover through rich metadata
Ensure data can be accessed under clear conditions
Promote interoperability across disciplines & tools
Enable reuse through licenses & clear documentation

Persistent Identifiers (PIDs)

TRUST Principles

(T) Transparency
(R) Responsibility
(U) User Focus
(S) Sustainability
(T) Technology

Goals:

Build confidence in digital repositories
Ensure authenticity, integrity, and reliability of data
Prioritize the needs of user communities
Guarantee long-term preservation and accessibility
Provide secure, persistent, and interoperable infrastructure

Polar Data Catalogue (PDC)

CoreTrustSeal Certified Repository: data is secure, properly managed, and adheres to FAIR principles.
Standardized metadata
Supports ArcticNet researchers

Borealis

Tip

OCUL = Ontario Council of University Libraries

In Canada, Borealis is a national instance of the Dataverse repository hosted by OCUL’s Scholars Portal at the University of Toronto.

file size limit of 5GB

Licensing Your Data

A license tells others how they can use your data
Common choices:
- CC-BY ➡️ use with attribution
- CC0 ➡️ no restrictions (public domain)
- Custom agreements ➡️ for sensitive, Indigenous, or commercial data

Clearly state the license in your metadata, README, or repository record

Ethical and Legal Aspects

CARE Principles

(C) Collective Benefit
(A) Authority to Control
(R) Responsibility
(E) Ethics

Goals:

People- and purpose-oriented
First Nations data rights and governance
Inspired from OCAP®
Complement FAIR Principles

Anticipate

Think about all of this process before doing it

Write a DMP!

DMP: Data Management Plan

Could be relatively quick

Organisational considerations

RDM in IRPs

IRP: Interdisciplinary Research Program

Scale & complexity: multiple projects, teams, and disciplines
Continuity: long program lifespans require robust preservation
Opportunities: well-managed data fosters reuse, integration, new collaborations and new insights

The network will set and promote best RDM practices

Same process x number of projects!
Efforts required in coordination

Tri-Agency RDM Policy (2021)

Applies across NSERC, SSHRC, CIHR
Institutions must develop and publish institutional RDM strategies
Researchers are expected to:
- Prepare and maintain Data Management Plans
- Deposit data in trusted repositories when appropriate
Ensures Canadian research aligns with international open science practices
Compliance increasingly linked to funding requirements

ArcticNet Principles

In other words: what is expected of you as a researcher

ArcticNet funded data = a public good ➡️ as open as possible, as closed as necessary
Researchers must ensure:
- Timely sharing ➡️ data made publicly available quickly, unless restricted
- Publish metadata ➡️ publish and share your metadata (e.g. Polar Data Catalogue)
- Respect for Indigenous rights ➡️ uphold Inuit, First Nations, and Métis ownership, access, and control (CARE, OCAP®, NISR)
- Citable & preserved ➡️ data should be publishable, citable, and preserved when appropriate
- Interoperability & connectivity ➡️ link with Canadian & international Arctic data systems, avoid duplication
- Best practices ➡️ follow ethical, legal, cultural, and funder requirements; use existing infrastructure where possible
- Support & guidance ➡️ researchers engage with training, outreach, and resources provided

Timeline and dual responsibilities

Network: balance autonomy & coordination

Timeline and dual responsibilities

Researchers: manage and document project data responsibly

How to Guide

Building your Data Management Plan

What is a DMP?

DMP: Data Management Plan

A Data Management Plan (DMP) is a formal document, typically 1-2 pages long, that outlines how data will be handled during and after a project.

What is a DMP?

DMP: Data Management Plan

Benefits:

Required by many funders, including Tri-Agency
Ensures feasibility of research proposals
Demonstrates responsible stewardship of public funds
Sets expectations for storage, sharing, and preservation
Foundation for good collaboration and reuse
Easier compliance with certain journals
Improved visibility and citations for datasets

Why DMPs matter in IRPs

IRPs = distributed ecosystems ➡️ diverse goals, data, practices
Collective DMPs give visibility into expected outputs
Enable early coordination of standards and tools
Reveal overlaps, synergies, and cost-sharing opportunities
Reduce duplication and improve program coherence

Key sections of a DMP

Answer these questions with substance and you will have a complete DMP:

Data collection ➡️ What data, formats, volume, protocols?
Documentation & metadata ➡️ How will data be described? Which standards?
Storage & protection ➡️ Where will working data live, and how is it protected?
Data Analysis ➡️ How will the data be analyzed?
Preservation & archiving ➡️ Which repository, which formats for long-term?
Sharing & reuse ➡️ Who can access it, when, under what license?
Legal & ethics ➡️ How are legal, privacy, consent, Indigenous data rights addressed?

Tools and templates

Use available tools if possible
- DMP Assistant (Canada’s online tool)
- DMP Tool
Network may provide a template tailored to your program
Examples and guidance available from:
- Harvard DMP resources
- McGill videos

Good practices

DMP Tips

Start early ➡️ draft DMP in the proposal stage
Treat it as a living document ➡️ update as project evolves
Reuse existing metadata forms / standards where possible (more on this later)
Keep it concise but actionable
Align with FAIR, CARE & TRUST principles

Practical Guide

Goal

Equip researchers with concrete steps to manage data responsibly, efficiently, and in line with network & funder expectations.

At the end, you should know what steps to undertake to prepare and update an adequate Data Management Plan

Goal

Let us create a DMP together. Let’s start here.

Checklist

Data collection

Documentation & metadata

Storage & protection

Data Analysis

Preservation & archiving

Sharing & reuse

Legal & ethics

Data collection

Guiding Questions

What kinds of data will I collect?
- observational, experimental, computational, derived
Which instruments, sensors, or methods will I use?
- field protocols, sensors, lab assays, software pipelines
How will I ensure quality control before, during, and after collection?
- calibration, duplicate samples, error-checking
How will I organize and label files?
- consistent file/folder naming, controlled vocabularies

Data collection

Quality Assurance / Quality Control (QA/QC)

Before collection ➡️ instrument calibration, standardized protocols
During collection ➡️ duplicate/triplicate samples, control samples, field blanks
After collection ➡️ validation checks, error detection, version tracking

Data collection

Organization & naming

Use consistent, descriptive file & folder names
Avoid spaces/special characters ➡️ use _ or -.
Include versioning & dates (e.g., projectA_samples_2025-03-01_v1.csv)
Organize folders by project/study/site/date rather than by researcher’s preference
Use controlled vocabularies / ontologies where available ➡️ interoperability

Do & Don’t

✅ lakeC_fieldnotes_2025-03-01_v2.csv
❌ data latest & updated.xlsx

Documentation & metadata

Guiding Questions

How will I document my data so that others (or my future self) can understand it?
- lab/field notebooks, data dictionaries, README files, protocols
Which metadata standard(s) will I use?
- Dublin Core, ISO 19115, Darwin Core, DataCite Schema
When will metadata be created and updated?
- start at project onset, update regularly, finalize at archiving

Documentation & metadata

ArcticNet’s requirements

Starting year 2, researchers must provide links to metadata records in recognized repositories
Metadata must be openly accessible
Funding will be withheld if metadata records are missing or inaccessible
For Indigenous-owned data ➡️ researchers must identify the organization responsible for storing and managing it
- Metadata publication is still required

ArcticNet’s commitment

Role:

Define metadata standards for projects
Support researchers in preparing metadata
Provide tools/templates to ease metadata submission

Initiatives

Working with Polar Data Catalogue (PDC) to host project metadata
Providing a PDC metadata template
Offering support for preparation and submission

Storage & protection

Guiding Questions

Where will data be stored during the project?
- institutional servers, certified cloud storage, external media
How often and where are backups made, and are they automated?
- frequency, number of copies, locations
Who can access the data, and how is access protected?
- permissions, authentication, encryption
How much storage will be required?
- projected storage needs (GB/TB)
Who provides, pays for, and maintains storage and backup?
- who pays and what infrastructure is provided

Storage & protection

Good Practices

Prefer institutional or certified storage over personal laptops/USBs
Use encrypted storage for sensitive data
Automate backups whenever possible
Document storage practices clearly in the DMP
Plan ahead for long-term preservation (more on this soon)

Special Considerations

Sensitive / Indigenous data ➡️ use community-approved safeguards, respect sovereignty
Large volumes / “big data” ➡️ address infrastructure, costs, specialized servers
Fieldwork constraints ➡️ describe temporary solutions (field laptops, portable drives) and how data will be secured until upload

Data analysis

Guiding questions

What software, tools, or pipelines will be used?
- R, Python, MATLAB, ArcGIS, QGIS, etc
How will analysis steps be documented?
- scripts, Jupyter notebooks, R Markdown, Quarto
How will you ensure reproducibility?
- version control (GitHub, GitLab), containers (Docker)

Data analysis

Good Practices

Prefer open-source tools when feasible
Share analysis scripts with your datasets
Keep raw and processed data separate.
Document assumptions, parameters, and software versions

Builds trust, efficiency, and long-term usability of results

A note on reproducibility

Preservation & archiving

Guiding Questions

Which trusted repository will be used for long-term preservation?
- Borealis, FRDR, Dryad, Zenodo, GBIF, OBIS
In which formats will data be archived?
- CSV, NetCDF, GeoTIFF, JSON (avoid lossy formats like JPEG, MP3)
How will datasets be persistently identified and cited?
- DOI, URI
For how long will the data be preserved?
- typically ≥ 5–10 years, ideally indefinite

Goal: ensure your data remain usable and accessible well beyond the project

Preservation & archiving

Good Practices

Deposit data at publication time, not years later
Archive raw and processed data, link to analysis scripts
Use repository versioning features instead of manual file names
Ensure alignment with FAIR & CARE principles

Special Considerations

Sensitive data ➡️ anonymization, restricted access, secure long-term storage
Indigenous data sovereignty ➡️ respect CARE, OCAP®, NISR, community protocols
Large volumes ➡️ consider specialized repositories, HPC, or cloud archives

Preservation & archiving

Some notes on file formats

Avoid Proprietary & Unsuitable Formats

Not all formats are sustainable for long-term research data. Avoid using:
Proprietary formats: require specific software that may become unavailable (ex. .xlsx, .shp, .sav, .psd, .docx with macros)
Formats with strong version-dependence: older/newer versions may be unreadable without exact software (ex. ArcGIS-only file types)
Compressed / lossy formats: reduce data quality and limit reuse (ex. .jpg, .mp3)
Encrypted or password-protected files: block discovery, reuse, and preservation workflows

Rule of thumb: if a file requires special software, or might lose information when saved, it’s not a good archival format.

Preferred Open Formats by Data Type

Tabular data ➡️ CSV, Parquet
Spatial data ➡️ GeoPackage, GeoTIFF, NetCDF
Images ➡️ TIFF (uncompressed), PNG
Audio / Video ➡️ WAV, MP4 (H.264 codec)
Text / Documents ➡️ TXT, PDF/A, XML, JSON
Metadata ➡️ XML, JSON, standardized schemas (e.g., ISO 19115, Darwin Core)
Choose formats that are:
- Open & non-proprietary
- Well-documented & widely supported
- Sustainable for long-term preservation

Preservation & archiving

ArcticNet’s requirements

No centralized ArcticNet repository ➡️ projects choose suitable long-term repository
Prefer certified, open-access options (PDC, Nordicana-D, GBIF, OBIS, FRDR)
Deposit all data and metadata supporting results
Plan early, use non-proprietary formats (CSV, TIFF, NetCDF)
Retain data as long as required by stakeholders and funders
State in DMP what will be preserved and any restrictions

ArcticNet’s guidance

Researchers decide the most appropriate repository for their discipline and data type
Focus on repository sustainability, DOIs, and open access
Preservation can include raw, processed, and derived data when valuable
Sensitive or Indigenous data may need restricted access or safeguards
Rationale for retention and preservation must be clear in the DMP

Legal & ethics

Guiding Questions

Are ethics approvals required?
- Research Ethics / Institutional Review Boards, community review
Will Indigenous knowledge or data be collected?
- CARE, OCAP®, NISR, community agreements
Who owns the data and how will IP be handled?
- ownership, licensing, industry agreements
Are there legal restrictions on the data?
- Tri-Council, privacy acts, international obligations

Legal & ethics

Good Practices

Sensitive or Indigenous data ➡️ respect CARE, OCAP®, and community protocols
Clearly explain how participant rights are protected
Use written data sharing agreements when applicable
Consult community-led governance for Indigenous research
Be transparent about data that cannot be shared and why

Special Considerations

Multiple institutions ➡️ align ethics and legal requirements
Indigenous partners may require community-based repositories or controlled access
Consider cross-border data transfer and compliance

How to Guide

Building your Data Management Plan

What is a DMP?

DMP: Data Management Plan

A Data Management Plan (DMP) is a formal document, typically 1-2 pages long, that outlines how data will be handled during and after a project.

What is a DMP?

DMP: Data Management Plan

Benefits:

Required by many funders, including Tri-Agency
Ensures feasibility of research proposals
Demonstrates responsible stewardship of public funds
Sets expectations for storage, sharing, and preservation
Foundation for good collaboration and reuse
Easier compliance with certain journals
Improved visibility and citations for datasets

Why DMPs matter in IRPs

IRPs = distributed ecosystems ➡️ diverse goals, data, practices
Collective DMPs give visibility into expected outputs
Enable early coordination of standards and tools
Reveal overlaps, synergies, and cost-sharing opportunities
Reduce duplication and improve program coherence

Key sections of a DMP

Answer these questions with substance and you will have a complete DMP:

Data collection ➡️ What data, formats, volume, protocols?
Documentation & metadata ➡️ How will data be described? Which standards?
Storage & protection ➡️ Where will working data live, and how is it protected?
Data Analysis ➡️ How will the data be analyzed?
Preservation & archiving ➡️ Which repository, which formats for long-term?
Sharing & reuse ➡️ Who can access it, when, under what license?
Legal & ethics ➡️ How are legal, privacy, consent, Indigenous data rights addressed?

Tools and templates

Use available tools if possible
- DMP Assistant (Canada’s online tool)
- DMP Tool
Network may provide a template tailored to your program
Examples and guidance available from:
- Harvard DMP resources
- McGill videos

Good practices

DMP Tips

Start early ➡️ draft DMP in the proposal stage
Treat it as a living document ➡️ update as project evolves
Reuse existing metadata forms / standards where possible (more on this later)
Keep it concise but actionable
Align with FAIR, CARE & TRUST principles

Practical Guide

Goal

Equip researchers with concrete steps to manage data responsibly, efficiently, and in line with network & funder expectations.

At the end, you should know what steps to undertake to prepare and update an adequate Data Management Plan

Goal

Let us create a DMP together. Let’s start here.

Checklist

Data collection

Documentation & metadata

Storage & protection

Data Analysis

Preservation & archiving

Sharing & reuse

Legal & ethics

Data collection

Guiding Questions

What kinds of data will I collect?
- observational, experimental, computational, derived
Which instruments, sensors, or methods will I use?
- field protocols, sensors, lab assays, software pipelines
How will I ensure quality control before, during, and after collection?
- calibration, duplicate samples, error-checking
How will I organize and label files?
- consistent file/folder naming, controlled vocabularies

Data collection

Quality Assurance / Quality Control (QA/QC)

Before collection ➡️ instrument calibration, standardized protocols
During collection ➡️ duplicate/triplicate samples, control samples, field blanks
After collection ➡️ validation checks, error detection, version tracking

Data collection

Organization & naming

Use consistent, descriptive file & folder names
Avoid spaces/special characters ➡️ use _ or -.
Include versioning & dates (e.g., projectA_samples_2025-03-01_v1.csv)
Organize folders by project/study/site/date rather than by researcher’s preference
Use controlled vocabularies / ontologies where available ➡️ interoperability

Do & Don’t

✅ lakeC_fieldnotes_2025-03-01_v2.csv
❌ data latest & updated.xlsx

Documentation & metadata

Guiding Questions

How will I document my data so that others (or my future self) can understand it?
- lab/field notebooks, data dictionaries, README files, protocols
Which metadata standard(s) will I use?
- Dublin Core, ISO 19115, Darwin Core, DataCite Schema
When will metadata be created and updated?
- start at project onset, update regularly, finalize at archiving

Documentation & metadata

ArcticNet’s requirements

Starting year 2, researchers must provide links to metadata records in recognized repositories
Metadata must be openly accessible
Funding will be withheld if metadata records are missing or inaccessible
For Indigenous-owned data ➡️ researchers must identify the organization responsible for storing and managing it
- Metadata publication is still required

ArcticNet’s commitment

Role:

Define metadata standards for projects
Support researchers in preparing metadata
Provide tools/templates to ease metadata submission

Initiatives

Working with Polar Data Catalogue (PDC) to host project metadata
Providing a PDC metadata template
Offering support for preparation and submission

Storage & protection

Guiding Questions

Where will data be stored during the project?
- institutional servers, certified cloud storage, external media
How often and where are backups made, and are they automated?
- frequency, number of copies, locations
Who can access the data, and how is access protected?
- permissions, authentication, encryption
How much storage will be required?
- projected storage needs (GB/TB)
Who provides, pays for, and maintains storage and backup?
- who pays and what infrastructure is provided

Storage & protection

Good Practices

Prefer institutional or certified storage over personal laptops/USBs
Use encrypted storage for sensitive data
Automate backups whenever possible
Document storage practices clearly in the DMP
Plan ahead for long-term preservation (more on this soon)

Special Considerations

Sensitive / Indigenous data ➡️ use community-approved safeguards, respect sovereignty
Large volumes / “big data” ➡️ address infrastructure, costs, specialized servers
Fieldwork constraints ➡️ describe temporary solutions (field laptops, portable drives) and how data will be secured until upload

Data analysis

Guiding questions

What software, tools, or pipelines will be used?
- R, Python, MATLAB, ArcGIS, QGIS, etc
How will analysis steps be documented?
- scripts, Jupyter notebooks, R Markdown, Quarto
How will you ensure reproducibility?
- version control (GitHub, GitLab), containers (Docker)

Data analysis

Good Practices

Prefer open-source tools when feasible
Share analysis scripts with your datasets
Keep raw and processed data separate.
Document assumptions, parameters, and software versions

Builds trust, efficiency, and long-term usability of results

A note on reproducibility

Preservation & archiving

Guiding Questions

Which trusted repository will be used for long-term preservation?
- Borealis, FRDR, Dryad, Zenodo, GBIF, OBIS
In which formats will data be archived?
- CSV, NetCDF, GeoTIFF, JSON (avoid lossy formats like JPEG, MP3)
How will datasets be persistently identified and cited?
- DOI, URI
For how long will the data be preserved?
- typically ≥ 5–10 years, ideally indefinite

Goal: ensure your data remain usable and accessible well beyond the project

Preservation & archiving

Good Practices

Deposit data at publication time, not years later
Archive raw and processed data, link to analysis scripts
Use repository versioning features instead of manual file names
Ensure alignment with FAIR & CARE principles

Special Considerations

Sensitive data ➡️ anonymization, restricted access, secure long-term storage
Indigenous data sovereignty ➡️ respect CARE, OCAP®, NISR, community protocols
Large volumes ➡️ consider specialized repositories, HPC, or cloud archives

Preservation & archiving

Some notes on file formats

Avoid Proprietary & Unsuitable Formats

Not all formats are sustainable for long-term research data. Avoid using:
Proprietary formats: require specific software that may become unavailable (ex. .xlsx, .shp, .sav, .psd, .docx with macros)
Formats with strong version-dependence: older/newer versions may be unreadable without exact software (ex. ArcGIS-only file types)
Compressed / lossy formats: reduce data quality and limit reuse (ex. .jpg, .mp3)
Encrypted or password-protected files: block discovery, reuse, and preservation workflows

Rule of thumb: if a file requires special software, or might lose information when saved, it’s not a good archival format.

Preferred Open Formats by Data Type

Tabular data ➡️ CSV, Parquet
Spatial data ➡️ GeoPackage, GeoTIFF, NetCDF
Images ➡️ TIFF (uncompressed), PNG
Audio / Video ➡️ WAV, MP4 (H.264 codec)
Text / Documents ➡️ TXT, PDF/A, XML, JSON
Metadata ➡️ XML, JSON, standardized schemas (e.g., ISO 19115, Darwin Core)
Choose formats that are:
- Open & non-proprietary
- Well-documented & widely supported
- Sustainable for long-term preservation

Preservation & archiving

ArcticNet’s requirements

No centralized ArcticNet repository ➡️ projects choose suitable long-term repository
Prefer certified, open-access options (PDC, Nordicana-D, GBIF, OBIS, FRDR)
Deposit all data and metadata supporting results
Plan early, use non-proprietary formats (CSV, TIFF, NetCDF)
Retain data as long as required by stakeholders and funders
State in DMP what will be preserved and any restrictions

ArcticNet’s guidance

Researchers decide the most appropriate repository for their discipline and data type
Focus on repository sustainability, DOIs, and open access
Preservation can include raw, processed, and derived data when valuable
Sensitive or Indigenous data may need restricted access or safeguards
Rationale for retention and preservation must be clear in the DMP

Legal & ethics

Guiding Questions

Are ethics approvals required?
- Research Ethics / Institutional Review Boards, community review
Will Indigenous knowledge or data be collected?
- CARE, OCAP®, NISR, community agreements
Who owns the data and how will IP be handled?
- ownership, licensing, industry agreements
Are there legal restrictions on the data?
- Tri-Council, privacy acts, international obligations

Legal & ethics

Good Practices

Sensitive or Indigenous data ➡️ respect CARE, OCAP®, and community protocols
Clearly explain how participant rights are protected
Use written data sharing agreements when applicable
Consult community-led governance for Indigenous research
Be transparent about data that cannot be shared and why

Special Considerations

Multiple institutions ➡️ align ethics and legal requirements
Indigenous partners may require community-based repositories or controlled access
Consider cross-border data transfer and compliance

Future

Emerging Trends & Opportunities

The Future of RDM in IRPs

IRPs: diverse teams, methods, data types, and data cultures
Fully centralized governance is impractical
Need for flexible approaches that:
- preserve project autonomy
- enable coordination
Solution: modular, interoperable tools
Next step for RDM in IRPs: interoperable metadata

Interoperable Metadata Hubs

ArcticNet Data Hub

Regional Assessment Hub

From Metadata to Interoperability

Metadata interoperability = path to data interoperability
Combined with PIDs: discoverable, linkable, and reusable assets
Unlocks:
- cross-project discovery and synthesis
- automated workflows
- dashboards, validation, and reuse tracking
- AI / LLM applications (knowledge graphs, assisted discovery, Q&A)

Thank you!

If you’re working on data management plans, metadata harmonization, or research program coordination, we’re happy to continue the discussion.

We would also greatly appreciate your feedback!

Research Data Management

About Us

About Us

Webinar Structure

Context

Research Data

What is RDM?

Research Data are Big

Research Data are Heterogeneous

Research Data are Valuable

So,

Data Workflow

Data Workflow in Research Projects

In a nutshell

Collect

Analyse

In a Nutshell

Archive

Reuse

Document to Contextualize

Documentation & Metadata

Documentation & Metadata

Storing your Data

Storing your Data

Archiving your Data

Select a Digital Repository

FAIR Principles

TRUST Principles

Polar Data Catalogue (PDC)

Borealis

Licensing Your Data

Sharing your (meta)data

Ethical and Legal Aspects

CARE Principles

Anticipate

Organisational considerations

RDM in IRPs

Tri-Agency RDM Policy (2021)

ArcticNet Principles

Timeline and dual responsibilities

Timeline and dual responsibilities

How to Guide

What is a DMP?

What is a DMP?

Why DMPs matter in IRPs

Key sections of a DMP

Tools and templates

Good practices

Practical Guide

Goal

Goal

Checklist

Data collection

Data collection

Data collection

Data collection

Documentation & metadata

Documentation & metadata

Documentation & metadata

Storage & protection

Storage & protection

Storage & protection

Data analysis

Data analysis

Data analysis

Data analysis

Preservation & archiving

Preservation & archiving

Preservation & archiving

Preservation & archiving

Preservation & archiving

Sharing & reuse

Sharing & reuse

Sharing & reuse

Sharing & reuse

Legal & ethics

Legal & ethics

Legal & ethics

How to Guide

What is a DMP?