Data storage occurs throughout a research project as you collect, clean, and analyze your data. While data storage can include long term storage, data deposit describes the more specific final step of storing your data and metadata in a repository for secondary researchers to access. This guide offers considerations and suggestions for managing your data through these two contexts.
During active data gathering and analysis, it is vital to regularly back up your files. Computers fail or may be hacked, with serious repercussions. While the locations of your backups needs to be convenient, so you use them, those locations also needs to be secure, especially if you are working with sensitive files.
The rule of 3-2-1 provides a useful guide to backing up your data: Store 3 copies of your data and documentation in at least 2 storage media, with 1 of those media housed off-site.
Encryption
Encryption is a process of securing digital information so that it is only readable by people with the correct authorization. Typically this authorization is a password. Medium or high risk data (containing confidential or individually identifiable information) stored on a device connected to the internet should be encrypted. Data stored on a cloud storage platform must be encrypted.
Storage options for solo research
Storage options for collaborative research:
At the conclusion of your research project, you can deposit your final dataset and its documentation into a data repository. Data deposit may be required by funders or by journal publishers, so that other researchers can replicate, reproduce, or build on your work. But you should also consider sharing your data and documentation because it increases the value and the profile of your work. Your data is a valuable research output that provides your research community with deeper and different value than the articles or chapters or books you might generate from your data.
Data repositories are online services purpose-built to handle the special considerations of storing data. Data repositories may be discipline specific – in the field of chemistry, for example – or they may be run by a journal publisher or a research funding body.
If the location of your data deposit is not mandated by your funder or publisher, choosing where to deposit your data should be driven by increasing the possibility for discovery of your data by other researchers. How easily will other researchers, looking for data like yours, be able to find it? If you are looking for help with this decision, please contact the Camosun library [insert mailto link] for help.
Discipline-specific repositories are developed and managed by research communities. Storing data in a discipline-specific repository may improve the likelihood of discovery. A useful tool for finding discipline-specific repositories around the world is re3data.org. Search Re3data for your chosen discipline to see what repositories exist.
If you are not interested in a discipline-specific repository, you may prefer an open, general purpose repository such as:
Alternatively, you can deposit your data in Camosun's Institutional Repository, CCspace. While CCspace is not purpose-built to hold data, it is certainly a viable option with the added benefit that it connects your work to the College.