22 October 2013
Tom, Steph H, Kristin
CEE working group break-out
- Lesson 01: Why Data Management
- Module: http://www.dataone.org/sites/all/documents/L01_DataManagement.pptx
- Activity: 'Data Search' : Student comes up with a hypothesis of interest that requires aggregating data that already exists. Must find 5 papers (using standard search methods e.g. WoS, GoogleScholar. JStor) from distinct sources (not same author) that have used data that would be helpful for testing the student's hypothesis. For each, give citation, and how to access the data (if possible) e.g. data archive, published in the paper, contact the author. If contact the author, student must document how (current address affiliation). All class members report out for discussion. May want to include a note that some subfields do not share much data - instructors may want to consider that topics including some genetic data, climate data, geographical (e.g. LANDSAT) data are likely to have available data online, may help to provide some diversity rather than just all zeroes.
- Now posted in folder on DataONE sharing site https://docs.dataone.org/member-area/working-groups/community-engagement-and-education/working-documents/hands-on-exercises-for-data-management
- Lesson 02: Data Sharing
- Lesson 03: Data Management Planning
- Lesson 04: Data Entry and Manipulation
- Lesson 05: Data Quality Control and Assurance
- Lesson 06: Data Protection and Backups Steph is working on this one because the docs.dataone site is not working!
- Module: http://www.dataone.org/sites/all/documents/L06_DataProtectionBackups.pptx
- Activity: Community of Practice in data protection, backup, archive. Students interview a researcher (their research advisor, a committee member, other researcher). Ask a series of scripted questions - file naming practices and backup/versioning, backups on and off site, archiving practices. Students share what they find in small group or as class, discuss strengths & weaknesses of practices. In a workshop setting, could be set up as a paired activity, interviewing each other if everyone has some experience in research. If no one has a research background (e.g. computer scientists learning environmental data management), the workshop organizer may want to plan to bring in several researchers in person or by skype for small group interviews.
- Lesson 07: Metadata
- Lesson 08: How to Write Good Quality Metadata
- Lesson 09: Data Citation
- Module: http://www.dataone.org/sites/all/documents/L09_DataCitation.pptx
- Activity: Have students go to www.doi.org and use the DOI resolver to go to a couple of datasets. Have examples of persistent identifier in ARK and DOI form for students to try. Maybe form a discussion around the Duke and Porter paper on ethics of reusing data - public versions are legally available. The exercise could summarize some of the guidelines, pose some central questions for discussion, and provide a pointer to the paper's DOI and a URL where one of the free legal versions is currently availalbe (although URLs change over time).
- Lesson 10: Analysis and Workflows
- Module: http://www.dataone.org/sites/all/documents/L10_AnalysisWorkflows.pptx
- Activity: Have students make a work flow plan from a scenario (get one of standard scenarios used) Have students use the hypothesis that was sketched in Modules 1 & 2, and draft a workflow model to complete an investigation to test the hypothesis using public data or data provided by others.
TEMPLATE
Hands-on Activity X: TITLE
Authors:
Background Lecture:
Objectives:
Outcomes:.
Time Needed:
URLs:
Additional Files Needed:
Key Reading:
Notes for Instructors:
Student Instructions: