Opening Your Research Using GitHub and OSF

Opening Your Research Using GitHub & the Open Science Framework


Vicky Steeves | October 25, 2017



Looking at scholarship holistically


We do a lot of very different things in very different ways during our research. But the final conclusion is always just a paper-- let's challenge that.

A Reminder of Some Open Access Principles

  1. Green vs. Gold Access
    • Green: freely accessible, self-archived
    • Gold: researcher-paid freely available
    • Close link between publication and data archiving (i.e. archiving by journals) means data archiving may be green/gold depending on journal

Today, we're talking about GREEN open access. You will be self-archiving your work!

Some Minimum Requirements


Data and code must be:

  • Persistently linked (At least URI/URL...preferabally DOI)
  • Searchable and browsable
  • Retrieved or downloaded easily
  • Citable

The Why of
Open Data & Code


Not selfishly: it helps advance your field of research!

Selfishly: citations!


Data and code should be cited within our work for the same reasons journal articles are cited:

to give credit where credit is due (original author/producer) and to help other researchers find the material.

Getting Data & Code Cited


Advantages to Tracking Citations:

  • Demonstrate to funders/promotion committees you & your data make big impacts in your field!
    • they judge merit based on intellectual merit and wider impact
    • tangible evidence to weigh against the cost of research

  • Monitor usage of datasets!
    • You can know what forms of data prep and data publication are most effective for sharing/open science!
    • Uncover opportunities for collaboration amongst peers

So...let's make your research citable!


Step 1: Write/version code in GitHub.

Step 2: Use the Open Science Framework to store data, documentation, etc.

Step 3: Publish & reap all your analytics!

Let's check out a brief example before getting into the details!

So...GitHub!


GitHub is a website where you can upload a copy of your Git repository. It allows people to collaborate via Git along with some other features.

GitHub is great because it allows folks to

Resources For Learning More


I teach it at Bobst! Check it out.

Git

GitHub

What is the Open Science Framework?


A free and open source project management tool that connects researchers to the tools they are already using to make management easier through the research cycle.


Why Use the OSF?


  • Preserve your current workflow, file types, and standard operating procedures
  • Enhance those workflows by better documentation, sharing, and discovery of your research materials
  • Gather together group permissions, various storage options, bibliographic management, and publication in one place
  • OSF is the work of a non-profit, the Center for Open Science, dedicated to supporting open science using open-source tools

Integrations with Research Tools

Getting Started: OSF at NYU

Sign up at osf.io using your NetID and create your account.
Click on "Sign In" and at the bottom of the sign-in box, the link for "Login Through Your Institution."

Setting Up a Project

Go to https://osf.io or just click "My Projects" on the top bar.

Click "Create Project" and fill in the form.

Don't Be Afraid of How Blank It Looks...

Let's Add Collaborators

Click "add" button next to the title of the page.

Let's Add Collaborators

Search for the person to your right and add them.

Setting Up Addons

Try adding an account of your own by clicking on your profile, then "Settings," then "Configure Add-on Account." Click on "Connect Account" next to an addon.

Back to Your Project:
Select Your Project Add-ons

Go to project "Settings," then "Select Add-ons"

Adding & Authorizing Addons

Choose Your Repo
or make a new one!

Working with Your Files

Click on "Files" on Your Project Top Menu

From here, you can:

  • See all files from OSF storage and any configured addons
  • Drag and drop files into any kind of storage, OSF or otherwise
  • Create folders in OSF storage for better organization

Tagging Files

When you click on a file in OSF, it renders right in-browser. On the bottom right is the "tag" field, where you can enter whatever you want.

Try adding 2 tags to your file and compare with the person to your right

Versioning Files

When you click on a file in OSF storage, you can also see and download all the versions of that file that have been uploaded.

THE CATCH: The file has to be uploaded with the same name!

Documenting with the Wiki

This also has a robust versioning. And you can compare versions side-by-side.

Best Practices with the Wiki

Use the "Home" wiki page as a table of contents listing project goals, personnel, sub-components, and links to important files.

Components

Components are essentially "sub-projects" that can have their own set of collaborators, add-ons, and access controls.

Sharing

Everything (files, subcomponents, wiki docs) gets a short permalink in OSF. That makes it easy to share via e-mail, Twitter, pastebins, etc.

You can also share projects via a view-only link, including an option to anonymize contributors for blind peer review.

Before publishing: Prep Your Code!

Step 1: Comment & style your code—

Before publishing: Prep Your Data!

Step 1: Put your data into an open format!

  • Be software agnotic!

Step 2: Package Your Materials

  • Data files & scripts/analysis tools
  • Documentation & description in OSF wiki or upload metadata

Publishing

When you want to publish your final product, you register it. All the files are pulled into an archive on OSF storage, and the project becomes read-only.

You can get a DOI for this project, and include it in a "Supplementary Materials Section" of a journal article.

Analytics in the OSF

All OSF projects start private. We can make them public and publish them, we reap some benefits, like built-in analytics!

Remember the analytics stuff we talked about earlier...

To sum it up:


  • GitHub is a great place to version & collaborate on code.
  • The OSF is a great central location to document research, store data, and make it available.
  • Connecting the two let's you open up your scholarship holistically!
  • You then LINK your registered project via the DOI that OSF gives you, in your publication!
  • Your research gets more exposure, YOU get more exposure, and you benefit science!

Questions?


Email me: vicky.steeves@nyu.edu

Learn more about RDM: guides.nyu.edu/data_management

Get this presentation: guides.nyu.edu/data_management/resources

Make an appointment: guides.nyu.edu/appointment