Open Collections

UBC Library and Archives

Portaging Along: Developing a Collaborative National Research Data Management Network in Canada
Barsky, Eugene; Wilson, Lee
Apr 9, 2018

Full Text

Portaging Along: Developing a Collaborative National Research Data Management Network in Canada
Eugene Barsky, UBC
Lee Wilson, ACENET/Portage
Contact - eugene.barsky@ubc.ca
Spring 2018

Outline
● Background
● Tri-Agencies’ directions in Research Data Management (RDM)
● Portage’s national work
● Focus on Data Repositories and Discovery
● Federated Research Data Repository (FRDR) - a national discovery layer for research data

Data rich
Soccer clubs, like Arsenal, record on average 10 data points per second for every player on the field, or about 1.4 million data points per game.

Defining research data
Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results.
Source: CASRAI Glossary

Why data management
● In the USA*
* From Developing data services: a tale from two Oregon universities

Timeline
● Tri-Council to finalize RDM policy in April or May 2018.
● Public consultation for a period of two to three months.
● Six months after the policy has been publicly available, institutions will be expected to enact RDM policies.
● Realistic timeline: Fall 2019 for compliance.

Tri-Agency expectations for RDM
Institutions:
● Institutional Data Strategy
● Provide researchers access to repositories that securely preserve, curate and provide access to research data
● Provide researchers with guidance to properly manage their data, including Data Management Plans (DMPs)

Tri-Agency expectations for RDM
Researchers:
● Incorporate RDM best practices (in their discipline), including Data Deposit for publications
● Develop Data Management Plans (DMPs)
● Follow institutional policies and standards

Tri-Agency expectations for RDM
Funders:
● Develop policy and requirements that facilitate responsible data management
● Provide clear guidance for fulfilling RDM requirements
● Promote the importance of excellent RDM
● Provide peer reviewers with guidance for assessing applications

What is the Portage Network?
● “Portage is a national, library-based research data management network that coalesces initiatives in research data management to build capacity and to coordinate activities better”
● Goals:
○ Build a community of practice for research data management (RDM)
○ Engage and advocate for research data management with stakeholder communities
○ Facilitate and provide leadership in the development of RDM infrastructure

Portage Network of Experts
Expert Groups:
● Data Management Planning
● Curation
● Data Discovery
● Preservation
● Training
● Research Intelligence
Working Groups:
● Dataverse North
● FRDR Service Model
● Institutional Strategies
● Ethical Treatment of Sensitive Data

Regional Stakeholders

Part of a Larger RDM Ecosystem

Focus on Data Discovery

FRDR Overview
● As you know, there are many research data repositories in Canada
● For instance, UBC Abacus Dataverse, Open Data Canada, Hakai Institute, and dozens more…
● We have worked to create a national research data discovery layer with the Federated Research Data Repository (FRDR): a scalable, federated platform for digital research data management and the discovery of Canadian research data

FRDR Stakeholders
● Partnership between Compute Canada (CC) and the Canadian Association of Research Libraries (CARL)
● Hosted on Compute Canada hardware and infrastructure, with CC providing development and technical support
● Service operated by Portage, including curation and data management support, with steering and input from CARL, the Network of Experts, and individual institutions

FRDR Discovery
FRDR’s harvester indexes data repositories across Canada to make research data held in many repositories discoverable from a single platform
Currently supports OAI-PMH, CKAN, CSW, and MarkLogic, with plans to add more
Goals:
● supplement existing repository sites
● improve discovery
● break down repository siloing
● avoid being “just another repository”

FRDR Discovery
● Portage’s Data Discovery Expert Group identified and mapped 13 well-used and mature metadata standards to FRDR’s metadata model (Dublin Core/DataCite)
● Crosswalk emphasizes core elements across all standards, allowing varied discipline-specific metadata to be displayed in a single discovery interface
● Some detail/granularity lost when crosswalking to general standards (e.g., Dublin Core)
● Future work will explore more advanced ways of linking contextual metadata to FRDR (linked data approach)

FRDR Deposit
● A place for Canadian researchers to deposit large datasets
– Big data transfer using Globus File Transfer
● A place to deposit datasets if a researcher does not have a local or domain-specific option
● Support for custom metadata schemas
● Designed for scalability
● Storage may be distributed or managed centrally through infrastructure providers (e.g., Compute Canada)

FRDR Data Preservation
● Archivematica integration: Digital preservation processing for long-term usability of datasets
– Converting file formats into future-friendly formats (e.g., docx to PDF)
– Creating Archival Information Packages (AIPs)
● Scalable, automated Archivematica processing for datasets up to 300 GB or 25,000 files (distributed over multiple VMs in CC Cloud)

FRDR - Feature List
● Direct deposit and download of datasets through Globus File Transfer
● Direct download of small datasets through HTTPS
● Automatic processing of datasets with Archivematica
● Support for custom metadata schemas
● Embargo support
● API for automated deposit
● Issuing DOIs through DataCite
● Bilingual user interface for both repository and discovery
● Indexing items from selected Canadian repositories
● Support for multiple licenses
● Faceted search in the discovery interface
● ORCID integration

● Steering committee: Dugan O’Neil, Jason Hlady, Jeff Moon, Umar Qasim, Lee Wilson, John Simpson, Jay Brodeur
● CARL/Portage experts: DDEG / CEG / PEG
● Portage Secretariat: Jeff Moon, Shahira Khair, Julie Morin, Lee Wilson
● CARL: Susan Haigh, Donna Bourne-Tyson, Kathleen Shearer
● UBC and the Open Collections team: Eugene Barsky, Schuyler Lindberg
● Compute Canada: Cloud East and Cloud West teams, Communications team, Translators, Support
● FRDR Development team: Alex Garnett, Keith Jeffrey, Todd Trann, Mike Winter, Adam McKenzie
● And a special thanks to the former Portage Director, Chuck Humphrey

Further Information
Production site:
Demonstration site:
More information:

Thanks! Questions?
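
The crosswalk described in the FRDR Discovery slides maps discipline-specific metadata onto a small set of shared elements, at the cost of some detail. A minimal sketch of that idea follows, assuming a source record that uses CKAN-style field names (title, notes, author, license_title, tags) in simplified form. The mapping table, the crosswalk function, and the sample record are all hypothetical illustrations and are not FRDR's actual crosswalk.

```python
# Illustrative mapping from CKAN-style field names to Dublin Core elements.
# This table is an assumption for the sketch, not FRDR's real crosswalk.
CKAN_TO_DC = {
    "title": "title",
    "notes": "description",
    "author": "creator",
    "license_title": "rights",
    "tags": "subject",
}


def crosswalk(record, mapping):
    """Return (dublin_core, unmapped) for one source record.

    Fields without a Dublin Core target are returned separately, which
    makes the loss of granularity mentioned on the slide visible.
    """
    dublin_core = {}
    unmapped = {}
    for field, value in record.items():
        target = mapping.get(field)
        if target is None:
            unmapped[field] = value
        else:
            # Dublin Core elements are repeatable, so store values as lists.
            values = value if isinstance(value, list) else [value]
            dublin_core.setdefault(target, []).extend(values)
    return dublin_core, unmapped


# Hypothetical harvested record; every value here is made up.
sample = {
    "title": "Kelp forest survey",
    "notes": "Dive transect counts.",
    "author": "Example Researcher",
    "tags": ["kelp", "ecology"],
    "sampling_depth_m": 12,  # discipline-specific field with no DC target
}

dc, lost = crosswalk(sample, CKAN_TO_DC)
print(dc)
print(lost)
```

Returning the unmapped fields alongside the Dublin Core record is one way to keep track of what a general-purpose crosswalk drops, so that contextual metadata could later be linked back to the discovery record.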

