OHSUBD2K/BDK12-Data-annotation-and-curation

Name: BDK12-Data-annotation-and-curation

Owner: OHSU Big Data to Knowledge (BD2K) Educational Materials

Description: BD2K Module 12

Created: 2016-06-16 15:29:17.0

Updated: 2017-06-28 20:36:19.0

Pushed: 2017-06-29 05:03:51.0

Homepage: null

Size: 119286

Language: null

GitHub Committers

UserMost Recent Commit# Commits
Nicole Vasilevsky2017-06-28 23:27:43.02

Other Committers

UserEmailMost Recent Commit# Commits
Bjorn Pedersonpedersbj@ohsu.edu2017-06-29 05:03:39.03

README

BD2K Open Educational Resources

BD2K OER Materials Blueprint

Module Number: BDK12

Module Title: Data annotation and curation

Module Description:

Data preparation, developing standardized quality assurance processes and pipelines

Team Lead(s): Nicole Vasilevsky Team Members: Nicole Vasilevsky

Module Objectives:

At the completion of this component, the learner will be able to:

  1. Apply data preparation and planning best practices
  2. Describe data annotation and biocuration
  3. Apply data standards to research data sets using manual methods

Module Prerequisites: None

Module Units
Unit 1: Data preparation and planning

Description: This unit describes best practices for data preparation and planning including deciding the best formats to store data, directory and file naming conventions, basic metadata considerations, and data sharing considerations.

Unit 1 Slides: BDK12-1.pptx

Unit 1 Audio: BDK12-1.mp3 - Full lecture, Audio File - Individual Slides

Example: online presentation

Unit 2: File and Directory Naming

Description: This unit describes best practices for digital file and directory naming.

Unit 2 Slides: BDK12-2.pptx

Unit 2 Audio: BDK12-2.mp3 - Full lecture, Audio File - Individual Slides

Unit 1 & 2 Exercise: BDK12_Exercise01.docx

Example: online presentation

Unit 3: Annotating and Curating Data

Description: This unit describes professional biocuration and how researchers can better annotate their data to become biocurators themselves.

Unit 3 Slides: BDK12-3.pptx

Unit 3 Audio: BDK12-3.mp3 - Full lecture, Audio File - Individual Slides

Unit 3 Exercise: BDK12_Exercise02.docx (in BDK12_exercises.zip)

Unit 3 Exercise: Read the blog post: Ontological Annotation of Data and complete BDK12_Exercise03.docx

Example: online presentation

Module Supplemental Materials

Exercises: BDK12_exercises.zip Glossary: BDK12_GlossaryTerms.pdf

References & Resources: BDK12_Ref.pdf

Recommended readings:
References cited:

References cited in lecture:

A note on Figures and Images

Nothing makes a learning session more engaging than fabulous visuals. While many in the education realm are accustomed to using a variety of rich images under the educational use exception, materials presented in an online educational resource (OER) format that are freely available and allow for users to remix, tweak and build upon the OERs present a unique problem. Images used in these circumstances must carry stringent CC BY-NC-SA (Creative Commons: Attribution ? Non-Commercial ? Share Alike) copyright.

As a result, the materials provided here have limited imagery as we intend for the users to remix, tweak and make these modules their own. At points in this module I have suggested inserting images of your choosing, not only to help create visual interest, but also to help tailor the educational experience to your audience. For examples, images that are being produced by researchers on your campus or in your department will drive a point home more effectively than generic or stock photos.

How does all of this copyright stuff work? For more information on copyright and fair use, I recommend a couple of resources.

When should you look to add additional images? When you see the clipboard icon, please consider identifying relevant images to the presentation. Suggested images may be hyperlinked, but not embedded in the presentation. Use your creativity when identifying images!

Where do I find images? There are several sources that might be available to you. Depending on how you plan on using the BD2K modules, you may have more flexibility to locate images. Once you have identify the license that you wish to use, you can search with those restrictions in mind.

This material was developed by Oregon Health & Science University, supported by the National Institute of General Medical Sciences, funded by the NIH Big Data to Knowledge Initiative, under Award Number 1R25GM114820.

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.