DataONE Users Group
Jul 7th - 8th 2013
Chapel Hill, NC
Roundtable 6: Open Access / Data Sharing
Session Facilitators: Bill Corey, Debora Drucker, Mike Frame
http://epad.dataone.org/dug2013-RT6
Participants: Jane Greenberg, Angela Murillo, Stephen Richard, Brian Wee, Steve Aulenbach, Matt Jones, Karl Benedict, Soren Scott, John Cobb, Filimon, Patrick West, Mary Beth West, Tracy, Steve Morris
Talking points / Guiding Questions
- What are the main challenges?
- What solutions currently exist?
- What contribution can / should DataONE provide in this landscape? How can that be best achieved?
Main Challenges
Open Access - what is it?
"shades of grey" -
licenses issue - CC commons not defined for data
- 3 categories:
- open for reuse
- open for attribution
- open for distribution
- attribution stacking problem
- International differences in copyright law and inclusion of data as intellectual property
- Legal issues - differences between countries
- Differences on how institutions look at data sharing
- How to ensure attribution for aggregator's role in data sharing
Solutions
- Public Domain Dedication and License (PDDL) Public domain for data/databases
- Open Database License (ODC-ODbL) Attribution share-alike for data/databases
- Attribution License (ODC-By) Attribution for data/databases
- Community Norms Document Use with ODC-ODbL, ODC-By, CC0 Use as a stand alone document to identify community norms and expectations, best practices
- CC0 Creative Commons Zero "No Rights Reserved" no attribution requirement
- CC BY Attribution
- CC BY-SA Attribution share-alike
- Are licenses enforcable...will they stand up in court?
- WC3 - rights ontology - IPROnto
- Experience on researchers encouraged to share when they know who is downloading the data
DataONE contributions
- linking licenses to data sets allowing filtering which licenses do you use? Do you specify a license, or a set of licenses?
- Make them structured/machine-readable licenses attached to data
- Could also be a 'license thesaurus' (see W3C Rights ontology; IPROnto)
- Be sure to cover all three axes
- Re-Use policy
- Attribution policy
- Redistribution policy
- Make recommendations about how to machine link licenses into current metadata standards
- FGDC, CSDGM/BDP, ISO 19115, EML, etc.
- Right to redistribute is related to current ReplicationPolicy in SystemMetadata
- Provenance system can help avoid attribution stacking problem
- Can also help with showing role of aggregators in curating data objects
- Provide a machine-readable mechanism to link provenace models into existing metadata standards
- Or, provide separate mechanism to link prov and metadata (e.g., through ORE)
- Host a provenance trace construction service
- Provenance should include the licensing schemes
- When searching for data, add ability to filter by license type
- Organize community activities to come to agreement on licensing and data policies
- Show advantages of sharing data
- Rights in multiple directions
- Short / medium / long term