4 open access practices to try, rated from easy – hard

The following 4 practices span different aspects of the research cycle (conceptualization, design, analysis, reporting, dissemination) and involve a range of difficulty levels for newcomers:

1. Open Science Journal Club (Level: Easy)

What? Organise a journal club with other students and staff to discuss issues surrounding reproducibility and open science. Usually, these take the format where one person leads the discussion each session after everyone has read the selected paper. They can range in how formal they are, from a presentation with slides followed by discussion to a completely open-ended discussion (with or without a moderator). It may even be possible that one already exists in your department that you could join! We rated beginning an open science journal club as easy because it requires minimal prior learning, you only need one other person to start a club, and there are already many reading lists and existing structures available to follow.

Why? Before you can engage in open science practices you need to understand the lay of the land by becoming familiar with major works and issues. Journal clubs are a great way to do this, wherein you can learn about a new topic and critically engage with your colleagues. It can get lonely reading and working alone, so meeting in this way can create an environment in which you can socialize while also learning with others and building a community around open science. In addition to discussing the papers themselves, each article can serve as a conversation starter about open science and reproducibility more generally. Seeing who attends the journal club can help locate others who are interested in or knowledgeable about open science, effectively establishing a network for collaborators or support. They can help you think about how you can approach conversations with your advisor or other members in your department who may not be as interested in engaging with open science practices. More generally, organising a journal club and presenting are both transferable skills for most jobs.

How? Contact colleagues in your department to enquire if anyone would be interested in participating. You only need one other person to get started! Once set up, the journal club can expand beyond your department to the wider university, facilitating interdisciplinary exchange. In addition to club attendees presenting, you can also invite external speakers to present either in person or remotely. It is also possible to organize these clubs completely remotely if you are unable to meet in person. One example of a very successful framework for an open science journal club is the ReproducibiliTea initiative: (https://reproducibilitea.org/), which has spread to over 100 institutions all over the world and provides a great starter pack including a list of papers to potentially discuss (https://osf.io/3qrj6/).

Worries. Students have different relationships with their advisors with regards to how much permission they would need to set up a journal club. However, in most cases journal clubs can absolutely be student-generated and student-run. It may be worth telling your advisor and inviting them to attend, yet, making it clear they have no obligation to attend. Also, you may feel that learning about open science practices is taking you away from time that could be spent on your own research. This is a common worry when engaging in any “extracurricular activities.” Although it does take time, it can be a relatively small time investment for how much you can learn. You can choose how often to hold the journal clubs and when to start and stop holding them, so they could be anything from weekly to termly, whatever works well for you and your colleagues.


2. Project Workflow and Documentation (Level: Easy)

What? Project workflow refers to how you organise projects and move through the various stages of the research cycle. This includes your file folder structure, document naming conventions, version control, data storage, and other details. It also includes the choice of who has access to the project (e.g., collaborators, the public) and when in the process they have access (e.g., at all times, upon publication). We rate creating a project workflow and documentation as easy because, even though there are many considerations to think through on the front end, it is primarily about organisation (folder and storage use) and recordkeeping, which are likely processes students are already using to some extent. Moreover, developing a clear project workflow is much easier for students than later career scholars, who have many more projects to organize and may be more entrenched in their methods (or lack thereof).

Why? Having a dedicated project workflow system helps keep your research organised, enhancing reproducibility, minimising mistakes, and facilitating collaborations with others and future-you. Making your project open to your advisors and any other collaborators (even if not open to the public) through working in shared folders can ensure everyone has access to everything in an organised fashion and saves the hassle of emailing infinite versions of documents. If you do choose to make the project public at any point, everything will be almost complete, and you will not have to create a system from scratch. Having an organised workflow is beneficial in most jobs, as well as in your everyday life – no more scribbled shopping lists on scrap pieces of paper!

How? Many research teams will not want to make all of their work public from the get-go, but even just imagining that the project will be public can encourage taking an outside perspective that will lead to improvements in organisation. When joining a new lab as a graduate student or beginning a new collaboration, ask the project leaders about their project workflow. It is entirely possible that they do not have a formally specified workflow, and just asking about it could initiate new ways of doing things. If they are not as receptive as you would like, find a compromise between what you would find most useful and what they are used to, and if you are a new student, then it might be useful to set some review dates for when you will discuss if the current approach is working well. For additional resources visit the UK Data Service’s guide to data documentation https://ukdataservice.ac.uk/help/new-users/data-documentation/.

Worries. You may feel some apprehension at the idea of having your workflow process or documentation public. You definitely do not have to make your project page public right away. You can wait until the project is complete, if you choose, and can clean up your project page if/when you eventually make it public. However, having a clear and intentional process for file management from the get-go will alleviate these worries as well as the need to clean things up after the fact, which just adds more work. You may also be unclear on what you are allowed to share. We discuss the issue of data sharing in a subsequent section, but whether you are sharing data, materials, analysis code, or anything else related to the project, it is important to consult with your supervisor and collaborators to ensure sharing is permitted and desired. Lastly, you may be concerned that if other people use your materials (e.g., survey design, code) that this is detrimental, as someone else is “profiting” from your hard work, but actually you can gain credit yourself in the form of citations.


3. Preprints and Depositing in the UWL Repository (Level: Easy)

What? The term preprint originally referred to a version of a manuscript that was publicly available prior to being submitted for peer-review. Although that still remains true, preprints now also include manuscripts that are under review, or author-accepted manuscripts (which follow peer review but precede publisher modifications such as copyediting and typesetting). We rate posting preprints and accepted manuscripts as easy because in essence it simply requires uploading a file you already have to a website. In fact, this may be the lowest effort open science behaviour that one could engage in, and yet it is associated with many potential benefits.

Why? Posting a manuscript before submitting to a journal allows for a wider range of feedback than what is afforded through peer review and can help improve a paper prior to submission by identifying any major flaws. Posting an article after submission, but before acceptance, gets a version of the paper out as soon as possible for sharing findings and interest, as well as keeping a record of what the paper looked like before it underwent the review process. Posting a manuscript after it has been accepted to a journal allows for the paper to be shared faster than it may be published and allows for an open access version of the paper to be shared. Preprints are also a great way to share work that does not continue to publication, providing greater access to the full body of available literature. Using preprints in this way can be useful if you choose not to stay in academia and do not get a chance to publish your research, but would still like to have it available as part of the scientific record.

How? There are many different available preprint hosts with varying levels of moderation (e.g., arXiv, bioRxiv, SSRN) and emphasis on specific disciplines (see https://osf.io/preprints/). There are also general purpose repositories such as Figshare (https://figshare.com/) and Zenodo (https://zenodo.org/), which hosts both literature and data deposits.

If you are a UWL PhD student or member of academic staff you can also take advantage of your institutional access to the UWL Repository (https://repository.uwl.ac.uk/) to deposit your research publications and scholarly material. Your deposits are then reviewed by one of the repository’s administrators to ensure that all material conforms to the correct publisher policies on copyright (see further details on publisher copyright and self-archiving policies at https://v2.sherpa.ac.uk/romeo/).

For guidance on depositing in the UWL Repository, view our step-by-step YouTube tutorial at: https://www.youtube.com/watch?v=1kL0Q-HsmKQ

You can also follow our written guidance on how to make a successful deposit at the following link: https://repository.uwl.ac.uk/guide.pdf

As mentioned, you can post a preprint at different points of the publication timeline. If you are posting an article that has already been accepted for publication at a journal, in most cases you are able to post an author accepted version to a preprint server, but not the final publisher-formatted version (see Worries below for details).

Worries. Most of the worries around posting preprints pertain to what is allowable and how doing so will impact the peer-review process. Many authors worry that posting a preprint will be treated as “published” and thus they will not be able to submit their manuscript for publication in a journal; however, this is not true in most cases. Before posting a preprint, authors should consult Sherpa Romeo (http://sherpa.ac.uk/romeo), which tracks the restrictions and rules for most journals, indicating whether journals allow posting of preprints pre-submission (usually yes), posting of author accepted manuscripts (usually yes), and posting of publisher-formatted accepted articles (usually no). To alleviate any remaining anxieties, if you know the journal that you wish to submit to, you can contact the editor directly in order to get in writing whether preprints of the work are allowed before submission to the journal. There may also be concern that posting a preprint will decrease the number of times an article is cited; however, this is not the case as preprints have actually been found to increase the number of citations. Another concern with posting a preprint prior to submission is that someone else could “scoop” you, stealing your idea and running the study themselves before you are able to publish your work. Although this is possible, it is also very unlikely. Moreover, all articles are posted with a date/time stamp and therefore there is a clear temporal record.

Finally, students may have concerns that their advisors or other collaborators will not be open to posting preprints. Of course, you should always consult with coauthors prior to posting preprints. There should be little concern with posting author accepted manuscripts of outputs, but some may be more skittish about posting prior to submission to a journal. There may be worries about posting a version of paper that will later be changed; however, you are able to post as many updated revisions to your preprint as needed. Overall, we recommend engaging in a conversation to determine the source of the concern and then providing the preceding explanations for the commonly expressed concerns.


4. Sharing Data (Level: Medium)

What? Sharing data pertains to making the de-identified dataset used for a project available to other researchers. Importantly, this means posting the data on a data repository for researchers to download and use, or establishing a formal system through which others can access the data (useful for sensitive data). There is enough evidence to indicate that stating “data available on request” is not sufficient to constitute engaging in the process of data sharing. We rated sharing data as medium as it takes some forethought on the front end with consent forms and organisation at the backend in terms of how to organise the data and share it with others in a way that aligns with ethical responsibilities.

Why? There are several compelling reasons for sharing your data. First, data sharing allows others to reproduce the analyses reported in a paper, providing checks on quality and accuracy, and to expand on the analyses through fitting alternative models and conducting robustness tests. Second, most datasets have use beyond what is reported in a paper. This includes secondary data analysis that addresses different questions altogether and inclusion in meta-analyses, where researchers having access to the raw data is a major benefit. Third, sharing data may be required by the funding source for the project or the journal the article is published in. Sharing your data upon submission to a journal can be looked upon favorably by reviewers, even if they choose not to do anything with it, as it indicates a commitment to transparency. When applying for non-academic jobs involving data (e.g. data science), it can be useful to have an example of data that you have shared along with a codebook, to show that you are organised with your data.

How? The how of data sharing is why we conceptualise it as medium difficulty. There are a lot of complexities associated with data sharing, and so we direct readers to the UK Data Service’s resources (https://ukdataservice.ac.uk/learning-hub/research-data-management/) as well as the DMP’s website (https://dmponline.dcc.ac.uk/) rather than attempting to cover all of those complexities here. Research students and staff should read that article and share it with their supervisors and research team, as it addresses many of the commonly expressed concerns. It is also important for all researchers to become familiar with local and regional laws governing the protection of certain kinds of data (e.g., the General Data Protection Regulation in the European Union). Because the possibility of sharing must be indicated in the consent forms provided for participants, it can sometimes be difficult to publicly share data after the fact. Thus, a good time to initiate discussions about sharing data is during the project design phase. At that point, you can be sure to include appropriate clauses in the consent forms and ethics protocols that describe both your intent and your plan for sharing the raw data in an ethical way. Importantly, including these details does not require you to share your data, but rather allows for the option—if your supervisor is uncertain about sharing the data, you can revisit this with them once you have finished the project. Whether sharing your data or not—but especially if you are—it is critical to provide a data codebook that includes information on the structure of the dataset (e.g., what variable names correspond to, measurement levels); data sharing is only useful if it is understandable to an outsider. Finally, there are loads of different platforms or repositories where you can share your data, so it can be a bit overwhelming to choose. For simplicity’s sake, we suggest sharing data on Zenodo or browsing disciplinary data repositories on the registry of research data repositories (https://www.re3data.org/).

Worries. There are three major worries with data sharing. First, as mentioned above, are ethical concerns, for which we again direct readers to UK Data Service’s website. Sharing the data publicly may not be consistent with what was stated in the consent form completed by the participants, and so it may not be possible to post the raw data. An additional ethical concern is the risk of reidentification, especially for under-represented populations. There are a variety of strategies for handling these ethical concerns, including posting data without demographic information included. Moreover, it is important for researchers to understand that there are a variety of ways of making data open beyond making it freely available (e.g., having a specified process for interested researchers to securely access the data).

Finally, there is sometimes concern about other researchers benefiting from all the hard work you put in to collecting the data, and that you will not get credit for subsequent use. However, by applying at least a CC BY license to the data set, anyone who uses the data is obligated to attribute the data to you (e.g. through citing the associated paper or through using a DOI). In fact, Colavizza et al. (2020) found that sharing data actually increases the citation impact of articles by 25% on average.

Acknowledgements:

This blog post is heavily adapted from the following paper licensed under Creative Commons Attribution 4.0 which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited:

Easing Into Open Science: A Guide for Graduate Students and Their Advisors

Citation: Ummul-Kiram Kathawalla, Priya Silverstein, Moin Syed; Easing Into Open Science: A Guide for Graduate Students and Their Advisors. Collabra: Psychology 4 January 2021; 7 (1): 18684. doi: https://doi.org/10.1525/collabra.18684

Leave a Reply