Advanced Deposit in UBC Dataverse Collection
Before you continue, please make sure you have a good understanding of the deposit basics in Basic Deposit in UBC Dataverse Collection.
In this guide, we expand further on data discovery, file formats, DOIs, Private URLs, access control, licenses, file size, version control and more.
Table of Content
Looking for a cheat sheet? Check out our one-pager!
Recommended File formats
Any kind of file can be uploaded to Dataverse, but extra functionality is supported for some filetypes:
Tabular data
Tabular data (Stata, SPSS, Excel, R, & CS) is normalized to .tab format on upload–a non-proprietary archival format that a variety of programs can read.
Normalization is important for long-term preservation of digital data. The deposited files can be downloaded in multiple formats, but always including the original.
In Dataverse, tabular normalization also allows you to perform statistical data exploration and visualization right in your browser. Click the Data Explore
button to see what it can do.
Compressed Files (.zip, tar)
Compressed files are the preferred method for uploading large datasets or many files to Dataverse.
Zip files are automatically extracted on upload, and the contents will appear as a list under the Files tab. Folder structure and file hierarchy within the zip file are maintained on extraction.
Sometimes, it is a good idea to deposit the zipped folder to preserve the content as it is, especially if you need your files to remain together. In this case, please double-zip your directory as the software will unzip it once upon upload.
File Size
Borealis is not intended to handle Big Data. Current file size limits are:
- For upload: 5 Gb per file (unlimited number of files in a dataset). If you are depositing files larger than 5GB, double-zip your directory.
- For tabular normalization: 500 Mb per file. Tabular files over this size will remain in their original format, e.g. Excel.
While there is no cap on the overall number of files that you can upload, if your data exceeds 10 Gb, please contact research.data@ubc.ca
to discuss the best repository options, as we have other solutions (e.g. FRDR).
Access Control
Sensitive files can be Restricted or Embargoed so they are not freely available to download. Files can either be fully restricted, or set up so users can send access requests to the contact email for review.
To restrict, unrestrict, or embargo a file:
In the popup window, you may describe the conditions of use or license for this file in the Terms of Access box.
Check the Enable access request
box to allow users to send the contact person an email asking for permission to download a restricted file. Click Save Changes
to finish.
Files can also be unrestricted if the terms change–for example, once an embargo period has passed.
Version control
Every change made to a dataset—adding files, editing metadata, etc—is saved as a new version of the dataset. This allows you to track the change history of the project, which can be viewed under the Versions
tab. This is useful if you need to roll back to a previous version, or find out who made what changes and when.
Sharing Data
Where can UBC Dataverse Collection datasets be discovered?
Data deposited in UBC Dataverse Collection is indexed by, and integrated with, many services on the Web, including:
- DataCite
- ORCID
- Lunaris: a scalable, national research data discovery service
- Google/Google Scholar/Google Data
- OpenAIRE
- Any APIs
Social Media
Spread the word about your research and improve your altmetrics by sharing your linked data on social media!
Dataverse provides a Share
button for
- Each Dataverse collection
- Each dataset
This button gives you the option to create a post with a link to your data on Facebook, Twitter, Linkedin.
Linking to your dataset
There are two main ways to direct people to your dataset:
1 DOI
A a DOI (Digital Object Identifier) is assigned to the dataset by Borealis when the first Draft is created. Once the dataset is Published, anyone can use the DOI link to find it. While the dataset is still unpublished, the DOI can only be used by Dataverse account holders with permission to view that Dataverse.
Use DOIs in citations, publications, on your personal/research group websites–anywhere you want the link to your data stable over time.
Click here for more information about UBC DOIs.
2 Private URL
A temporary link for use with unpublished data. The dataset can only be seen by those who have the link, and users do not need a Dataverse account. Great for giving pre-publication access to journals, reviewers, and collaborators.
To create a Private URL, click Private URL
. The generated URL can be access from this location until the dataset is published, so you can copy it again and again as needed. The Private URL will automatically disappear once the dataset is published.
Licenses and Terms
You have control over how your data can be used. Dataverse allows for a variety of licenses and terms of use.
Built-in License Templates
Templates can be selected at dataset creation, and changed at any time. These automatically apply the right information about the license to the dataset’s metadata.
You can use UBC Library Copyright page to decide what license to use with your dataset. We recommend CC-BY or CC-0 license.
Custom license
One-size-fits-all licenses don’t suit every dataset. You can create customized terms and conditions of use.
To customize terms:
If you need help creating a custom license, contact research.data@ubc.ca
.
Congrats!
You are an expert in data deposit in the UBC Dataverse collection now!
If you need have more options to customize and control the way your Dataverse collections look and act, you need to know more! Check out our Admin Access in the UBC Dataverse collection Workshop.
Need help?
Please reach out to research.data@ubc.ca
for assistance with any of your research data questions.