Why are certain prominent data sources (e.g. Smithsonian, USDA Plants Database, other federal sources) not accessible via DataONE? --> this is an important and obvious question that we need an answer for. The lack of these resources will frustrate users. What kind of data can researchers find in DataONE? - "Scientific data" - from the ONEMercury tag line - "Earth observational data" - from the last sentence of the first paragraph under What is DataONE - (It might be a good idea to indicate what kind of data researchers will NOT find in DataONE.) What can I do with data I discover through DataONE? - Many data products can be downloaded. "DataONE has some Best Practices for working with data" that you should pay attention to. Also please read the help and other descriptive files associated with the data set that will contain any use restrictions as well as citation guidelines. Can I store my data with DataONE? --> Data can not be stored with DataONE in the way you would store data in a cloud data storage service. However you can deposit your data wiht a Member Node that participatews in DataONE thus making your data discoverable via DataONE. What resources are required for my library to access or utilize DataONE? Is there a membership fee? -->You can search for data about earth sciences and the environment from DataONE's home page: http://www.dataone.org/ using the search box, ONEMercury. We have an institutional repository; should we consider being a DataONE member node? Why or why not? -->DataONE is a federated data network that improves access to, and preserves data about, life on Earth and the environment that sustains it. If your institutional repository contains such data, you may wish to consider DataONE as part of your preservation strategy. These are future areas for development. Look at the Member Node pages for additional information and contacts:http://www.dataone.org/member-nodes or contact us: http://www.dataone.org/contact We are working with the Office of Research to create a data management program. Do you have a checklist of considerations or things we need to accomplish to ensure our program meets federal requirements? --> While there are common elements to data management plans, different agencies have varying requirements. The DMPTool is a good place to begin:http://www.dataone.org/data-management-planning 3. How do I search for data on the DataOne Website? You can use the OneMercury search engine available on the DataOne website to find one or more datasets. You can also refine your search options using four filtering categories namely filter by member node, author, project or keywords. - actually doesn't ONEMercury search metadata records not datasets? How should I prepare my data for addition to a repository? * The details of how to submit data to a member node depend on the Member Node and the software that particular repository is running. For Member Nodes running Metacat, the submission process can be as easy as using Morpho to save the data to your repository's Metacat instance. Other repositories, such as the ORNL DAAC, have much more involved data review and curation practices http://www.dataone.org/contribute-data * Best practices for data are available for all stages of your project (Plan, Collect, Assure, Describe, Preserve, Discover, Integrate, Analyze) at http://www.dataone.org/best-practices Am I required to use a particular metadata schema? * Metadata standards are not set by DataONE itself, but may be determined by the software that a particular repository is using. For example, member node Dryad uses the Dublin Core Metadata Initiative Abstract Model (DCAM). Member node USGS Core Sciences Clearinghouse Member Node uses the Federal Geographic Data Committee's Content Standards for Digital Geospatial Metadata with a biological data profile (BDP). Overviews of current member nodes and their metadata standards (including provenances) are available at http://www.dataone.org/current-member-nodes For more details about member node's provision of metadata to DataONE, reference http://www.dataone.org/sites/all/documents/DataONEMNFACTSheetFormattedDec1.pdf Am I required to convert my metadata profile to a profile compatible with DataONE? * Different repositories, such as the ORNL DAAC, have various data review and curation practices. http://www.dataone.org/contribute-data Software and tools such as "cross walks" are accessible through the DataONE Software Tools database to assist you in converting your metadata, if your member node uses a metadata profile different from your own: http://www.dataone.org/software-tools/ What taxonomic standard (animals) is being used? * DataONE does not mandate a particular taxonomimc standard; however, the member node which you submit your data through may adhere to a particular convention or metadata standard. For best practices on idenfication of any species represented in a data set, visit http://www.dataone.org/best-practices/document-taxonomic-information Questions/Answers by Personas - Feel free to suggest new categories A. Individuals 1. Citizen Scientists 1a. Project organizers & data managers <-actual citizen scientist participants would realistically never interact with DataONE; the target should be project organizers 1b. Project participants and other citizen scientists 2. Science Educators 2a. Higher Ed Faculty 2b. K-12 Teachers 3. Librarians 3a. Data experts. Able to directly facilitate data management, etc. 3b. General knowledge. May be able to direct to appropriate resources. 4.Scientists (use data) 4a. For exploration, casual analysis, research planning. 4b. Power user. Novel, data-intensive research. 5. Scientists (share data) 5a. Individual datasets. Ad hoc deposit. 5b. Batch deposit. Deeper integration of D1 in research process. 5c. Contrast scientists who already have ties to oneor more member nodes versus those seeking to interact with D1 more directly. 6. Policy Makers B. Institutions 1. Member Node/Coordinating Note (actual or prospective) 1a. The only person in this category that is likely to refer to the FAQ is a representative of a prospective member node. Existing member node reps and potential coordinating node reps are not likely to refer to the FAQs regarding member nodes. 2. Libraries, Offices of Research Support, Other Administrative Support Functions 2a. The crucial question here is whether the institution and D1 have a more formal relationship (e.g., adoption, sponsorship, etc) than simple usage by support staff. ********************************************************************************************************** Submit Questions and Answers Below Here (A question bank is further down.) These questions transcend user types What is DataONE? What is OneMercury? What is DataUp? What is the DMP Tool? What is [Fill in name of branded tool or service within DataONE]? ********************************************************************************************************* A. Individuals 1a. Citizen Science project organizers and data managers a. Where is the right place to deposit my data? b. How can I get data packaged for informal science education uses? c. 1b. project participants and other citizen scientists a. how do I go to DataONE to see my data and visualize the results? ANS -- If the data collected by your Citizen Science project has been uploaded by your group to a ***Member Node*** that is a member of DatONE, then ***see text on ONEMercury and searching -- no login required for many data sets -- b. how do I use DataONE to find other similar data collected by others? ANS -- see text on ONEMercury and searching -- no login required for many data sets -- c. how do I get other types of data that might be useful in understanding or interpreting the things our group's data show? ANS -- see text on ONEMercury and searching -- no login required for many data sets -- Suggestion for workflow: User answers a series of questions in a wizard-like decision tree that guides them to a selection of appropriate repositories. Preferably with comparative info to help them decide which one. (THIS is what I hear people asking for. They can't parse long lists of member nodes, and they don't care about member nodes, that just scares and confuses. They really need hand-holding to make these decisions. List of repositories will just scare them off from it entirely, and they don't know what criteria to compare on.) 2. Science Educators 2a. Higher Ed Faculty a. Natural Science Faculty i. How can I use D1 in my teaching and work with my research students? ANS -- Teaching using real data can enhance your students' learning in your courses. For some course topics, instead of lecturing, you can send students to D1 to find, visualize, analyze, and synthesize data as a means for them to learn specific course content. For example, to teach climate change, you might send your students to D1 to find evidentiary bases to specific claims such as those made by you, your course textbook, or perhaps a claim made in a current issue of a for-profit media outlet. Not only will your students benefit from improved technical skills in data manipulation and analysis, but research has shown that teaching activities involving synthesis improve students' higher level critical thinking and scientific reasoning skills. In addition, D1 tutorials and tools can improve your students' basic skills in data management and preservation for their research projects whether these are course projects or extra-curricular from semester length or summer projects to thesis research. This would also improve project continuty among semesters of successive cohorts of your research students who are pursuing a research direction in your lab. ii. How can D1 help me to improve my students' skills in data management in my field and lab classes? iii. Can my research students upload their data to D1? vi. Do my students have to "log in" to get D1 data? v. How can D1 help me to improve my students' skills in data analysis and visualization? vi. How can D1 help me to find really neat teaching activities that use large data sets to teach the content of my course insead of lecturing? vii. How can D1 help me to publish and disseminate any neat teaching activities I develop from D1 data sets? b. Information Science Facuty i. How can I use D1 in my teaching? ii. How can D1 help me to improve my students' skills in data management in my methods classes? 2b. K-12 Teachers a. How can I use D1 in my teaching? b. Do my students have to "log in" to get D1 data? c. 3. Librarians Abby - Data Librarian a. What resources are offered to help me point researchers to an appropriate repository for their data? b. What resources are available to help me educate researchers about data management? c. How should researchers cite data? d. What kind of data can researchers find in DataONE? How do they search for data? What can they do with data I discover through DataONE? e. Can DataONE offer advice on what groups on campus I should be reaching out to and strategies for that outreach? 4. Scientists (use data) a. b. c. 5. Scientists (share data) 5a. Individual datasets. Ad hoc deposit. These questions seem appropriate as FAQs for scientists with ad hoc data to share: ******************************************************************************************************** This section details User Stories specific to scientists with ad hoc data to share User story: User is an individual scientist who has data to share. Has heard of DataONE and is interested in participating (sharing or depositing their data). > User navigates to DataONE.org. Question is: How can I deposit my data with DataONE? 1st Path: - User clicks Data > Contribute - This page is really oriented to potential member nodes, not individual data contributors. The section on "Submitting data to a member node" is not at all helpful and contains way too much jargon for a typical user. See some of the further suggestions for possible solutions. 2nd Path: - User clicks Participate > DataONE Users Group - For someone who is seeking to contribute data, this page will seem daunting as it makes it seem that the user must first join DataONE in some capacity (as a "member", according to the sidebar). Further, it is not at all clear why someone would "join" the DUG. How is that different from being part of the WGs, or involved in member nodes, or ...? b. c. 6. Policy Makers a. b. c. B. Institutions 1. Member Node (prospective) These questions seem appropriate as FAQs for potential member nodes: What is a "member node"? ANS: Should be at http://www.dataone.org/member-nodes We recommend using the text at: http://www.dataone.org/sites/all/documents/DataONE_MN_Partner_Guidelines.pdf What are the benefits of sharing my data repository with DataONE? ANS: Should be at http://www.dataone.org/member-nodes Better text available at: http://www.dataone.org/sites/all/documents/DataONEMNFACTSheetFormattedDec1.pdf What does it cost? ANS: Should be at http://www.dataone.org/become-member-node Better text available at: http://www.dataone.org/sites/all/documents/DataONEMNFACTSheetFormattedDec1.pdf What is our responsibility as a member node? ANS: Should be at http://www.dataone.org/become-member-node Some text available at: http://www.dataone.org/sites/all/documents/DataONEMNFACTSheetFormattedDec1.pdf What restrictions can we have on our data? ANS: Should be at http://www.dataone.org/become-member-node Some text available at: http://www.dataone.org/sites/all/documents/DataONEMNFACTSheetFormattedDec1.pdf Who do I contact? ANS: ??? There should be a dedicated Member Node contact (or at least a dedicated email address). The generic contact form will not make a potential member node feel that they are going to get a response. ******************************************************************************************************** This section details User Stories specific to potential Member Nodes User story: User is a rep for an existing repository. Has heard of DataONE and is interested in participating (sharing or depositing their data and otherwise partnering with DataONE). NOTE: key problem for this person is to understand that they represent a potential member node in the D1 parlance. > User navigates to DataONE.org. Question is: How can we deposit our data with DataONE? 1st Path: - User clicks Data > Contribute (link to this page which describes Member Nodes) - OK (links to existing page/documentation) 2nd Path: - User clicks Participate > DataONE Users Group - Very confusing... not clear how this is different. Need to either change category heading or add explanation to the DUG page. 3rd Path: - User clicks Participate > Member Node - OK (links to existing page/documentation) 4th Path: - Clicks on rotating graphic on home page (middle graphic which says "Deposit Data" and shows member node) which takes the user to the http://www.dataone.org/depositing-data-into-dataone page. - This video is not about depositing data into DataONE; it is about the DataONE data model. It does not help anyone who wants to learn about being a member node and seems aimed at the individual who wants to understand the DataONE process. Even then, it does not really help an individual to understand how to make sense of getting data into DataONE. The video is mistitled. Further suggestions: Change categories in "Participate" column to actions: e.g. "Join DataONE User Group" "Become a Member Node" etc. 2. Libraries Library Administrator (Representing the Institution) a. Are the data in this repository accessible through any universal discovery tools? Do the discovery tools integrate with existing discovery tools? b. What resources are required for my library to access or utilize DataONE? Is there a membership fee? c. What value does DataONE bring to my library operations? d. What does it mean for my institution to participate in DataONE? e. Is DataONE part of the University of New Mexico or is it a separate organization? f. What is the fiscal stability/business plan of DataONE? g. We have an institutional repository; should we consider being a DataONE member node? Why or why not? h. We are working with the Office of Research to create a data management program. Do you have a checklist of considerations or things we need to accomplish to ensure our program meets federal requirements? While there are common elements to data management plans, different agencies have varying requirements. The DMPTool is a good place to begin:http://www.dataone.org/data-management-planning i. What are the technical requirements/infrastructure needs to support access to DataONE? C. Question Bank These questions are relevant for: Member Nodes, ...? 1. What is a member node? 2. Who can be a member node? 3. Who are the current member nodes? 4. What is a coordinating node? 5. Why should I become a member node? What benefits do I get as a member node? 6. What hardware is required? 7. What software is required? 8. What infrastructure does D1 run on? 9. Who will have access to the data I upload? 10. Can I access usage statistics? 11. Does DataONE provide technical assistance/training for institutions interested in becoming member nodes or coordinating nodes? 12. What discovery system does DataONE use? 13. What is OneMercury? 14. What is DataUp? 15, How much data is DataONE capable of handling/processing? 16. What is the DMP Tool? 17. What backups are we required to provide? 18.How will I know about who is using my data? 19. Once my institution becomes a member node/coordinating node how long are we obligated to remain part of the system? 20. Can my institution place restrictions on data/dataviews? 21. What information do I need to provide to DataONE if my instition's system is hacked or experiences some other event that could impair the data and users? When do I need to provide that information? 22. How often do we have to update system authorization codes? 23. Is there any membership fee involved in becoming a Dataone member or co-ordinating node? What data can I get through DataONE? 2. How do I download data through DataONE? 3. Who manages D1? 4. How will D1 help me do my research? 5. How do I search through D1? 6. How long will D1 be available? 7. Can I visualize data through DataONE? 8. How will D1 make my life/work easier? 9. Where can I learn about metadata? What is a good introduction to metadata for scientists? 10. What non-technical tools does DataONE have for scientists? 11. Which metadata standards are being used for the datasets? 12. What is linked science? 13. Where do I go to get help with a data management plan? 14. I have data, how do I contribute to D1? 15. My organization has data, how do I contribute to D11? 16. Can non-scientists use D1? How, for what? 17. How is the data vetted or verified by D1? quality control 18. What discovery system does DataONE use? 19. Does D1 preserve data? 20. Should I be responsible for supplying metadata for my datasets? 21.Can DataONE identify/make available different versions of the same dataset? 22.What are the benefits of using D1 for sharing or contributing data? 23. How I become an authorized user? 24. Will I have access to all the data captured in the DataONE infrastructure? 25. Can I use DataONE for data preservation even if I am not interested in sharing my data? 26. Can I use my institution access credentials to access data? 27. How do I merge my datasets with others' datasets? 28. How do I become involved personally with the DataONE Project? 29. What is the investigator toolkit? 30. How is my data backed up in the system? 31.What happens if I no longer want to make my data available? 32. Who is currently using D1?? 33. Are there any publishers involved in D1? 34. Are there any professional societies involved in D1? 35. Can I place restrictions on the use of my data? 36. Will I have access to all DataONE data? 37. How do I cite data found through D1? 38. What do I do if I think my data is being misused or used in ways that threaten the integrity of the data? 39. How does DataONE store and provide access to different versions of the same data? (persistance & mutability) Non-science stakeholders 1.I'm a teacher, how can I use D1 in my classes? 2. How can academic librarians facilitate faculty technical use of DataONE? 3. What non-technical DataONE tools can librarians make available to faculty? 4. My instition is a member of a institutional repository consortium, how can the consortium get involved in DataONE? Qs from the UA review of the user-testing: ONEMercury How many datasets are available for searching? What databases are included? What are metadata? What is the nature of data files? How are the results ordered? Why do some records have data files whereas others do not? Who has ensured these data are accurate, valid, etc.? Why is the search tool called ONEMercury? Does DataONE own ONEMercury? Is ONEMercury different from Mercury? Can I embed ONEMercury in my own data site? How do I add data to ONEMercury? Can I save the results of my search? I found a result with multiple files. Can I download them all at once? I found a result with multiple files. Is there any way to learn more about the specific content in each of these files without having to download them first? I don't see any data files. Does this mean that the data are not actually available? Can I get the data files in other formats? I have found an error (dead link, wrong data file, etc). How can I alert DataONE to this problem? (trying to think like someone who has just learned about DataONE, mostly from a viewpoint of a scientist wanting to share or use data -jwc) What is DataONE? jwc How many bytes of storage does DataONE have? jwc How can I store my data in DataONE? jwc How much does DataONE charge to store my data? jwc How do I know that DataONE will not lose my data? jwc How can I get my data into DataONE? jwc How fast is DataONE (data retreival, metadata query, ...) jwc Who is using DataONE? jwc ---------------------- Bruce, Miriam D, Tanner = Sun Research Scientists Persona: Sun = Contributing Scientist Method: Go around the life cycle of the steps * Plan * collect * assure * describe * deposit * preserve * discovery * integrate * analyze What is DataONE and why is it valuable to me. Why should I share my data? NSF requires data management plans for grants. How can DataONE help me meet this requirement? one answer: preserve how might data one help to preserve my data beyond the life of my project? What is meant by the data life cycle? How can i learn more about best data management practices, and quality assurance for my data? (Cleaning up data, QAQC, minimize, detect during collection, correct, use controlled vocabulary) How can I guarantee I'm collecting quality data? How do I disclose quality for combined datasets? Down the road - how can I be sure that my data is correct after upload How would I learn more about metadata? What are metadata? What is a member node? Deposit: how do I upload my data (what add-ons are available, what is the method, submit a CD or DVD) What if my institution is not a member node in DataONE What is data description? The major categories of questions from the Ed modules could be plucked out and become these contributor questions (in a way, they are FAQs). Why should I describe my data? Who's going to look at my data Can i control access for my data Will I get credit for people using my data (attribution) How will I get credit When should I upload my data What if I want to make changes to my data after I upload it (edits, additions, versions) What types of data can I upload? (Photos, Videos, Ethnographies, genetic data, digital records) Can I remove data after it is uploaded? (If I have shared it, can i ever get it back if I have changed my mind?) Uploading live data (like a stream) wind, water, etc. Data use and re-use (discovery) Is dataONE only interested in primary data or synthesis data? Can I upload "mashups?" Is there an accomodation for the originator? Concatenation is not the same thing as synthesis. ACCESS SIDE William, Plant Taxonomist CCan DataONE help me digitize my data? I'm nearing retirement and have a large amount of analog data. Can I ship you my research notebooks? Will you scan them and incorporate them into DataONE? How can I find publications that have cited my data? How can I suggest data to be integrated in the DataONE system that I know of but do not own and is not in a contributing node? Do I have access to all data I find in DataONE? DISCOVERY How does a search in DataONE differ from Google Search or Google Scholar, GBIF, other major data repositories? I already uploaded it at one location, how will DataONE prevent duplication of data? How will data versioning be controlled? What repositories for different disciplines are available? What Data is DataONE looking How often is it Data uploaded? I did my search yesterday, will there be new results today? How frequently will new datasets be added? What is the five-year plan? How do I find data How would I use dataONE to find environmental data at my study site? How would I use dataONE to find comparable studies to mine elsewhere in the U.S. or the world? What taxonomic standard is being used? How will my data cross-walk with other standards? Would data I download and use, would they be notified that I snagged their data? What data will be collected about me when I use DataONE? How do I correctly cite other people's data? How do I ascertain the quality of various datasets? How do I download data that I have found? How can I synthesize it? How can I filter my search? (Tutorial, AVI, mercury features) I searched and did not find anything. Why can't I find it? How smart is the search engine regarding synonyms or mis-spellings? What is the scope? What are the major data types? Is it living things? Is there weather data, climatological data? What is not covered? Can I find literature? Is the publication DOI an attribute of the data set? How can I find publications that have been cited? Why are certain prominent data sources not accessible via DataONE? How can I find data related to a dataset I have already found? How can I learn more about the Mercury Search? INTEGRATE How can I use data in my classrooms? Pre-college Steps going forward: catalog questions, picpick two or three keywords, might be useful to create a google doc. ------------------------------------- Is there a limitation to the kinds of datasets that DataONE will house? / are there requirements? (Or, CAN I store my data with DataONE?) kcl ppsr Are there requirements? What are the hardware requirements? What are the software requirements for sharing data with DataONE? What format is my data required to be in to submit / share it with DataONE? (Can I submit physical copies, photographs, DVDs, CDs to DataONE) Am I required to submit a verified metadata profile for the data I upload? Am I required to use a particular metadata schema? Am I required to share the data management plan created for my project? Am I required give up all rights to my data as a condition of sharing it with data one? Am I required to share data with everyone who uses DataONE? Are there restrictions? I use sensitive data from a third party in my dataset. Am I restricted from uploading a derivative work to DataONE? I use a specific metadata profile unique to my field. Am I restricted from sharing this dataset? Am I required to convert my metadata profile to a profile compatible with DataONE? I have historic datasets that are in an archaic format. Am I restricted from sharing this dataset? Am I required to convert my dataset to a more modern format? I am not the creator of a dataset but I have permission on behalf of the creator to upload a dataset. Am I restricted from sharing this dataset? Am I required to provide documentation that I am uploading the data on behalf of the data's author? I work with data about the physical environment (meteorology, climatology, soil science, geology, paleontology). Am I restricted from uploading this data? Is there a restriction for corrupted or potentially corrupted files? Are there restrictions for files that may pose a security risk to DataONE's infrastructure or community? Can I use DataONE to visualize multiple datasets with the same tool? kcl ppsr Why should I store data with DataONE (as opposed to somewhere else)? kcl ppsr How should I prepare my data for addition to a repository? Are there data fields I should not share? kcl ppsr How can I share data that has been collected by volunteers? What steps do I need to take in order to hide personally identifiable information about volunteers, such as names, e-mail addresses, physical location? kcl ppsr Can I store data that are about my ppsr project (such as # of volunteers, volunteer hours, volunteer demographics) or am I limited to earth observation data? kcl ppsr