http://epad.dataone.org/20110513-LT-VTC

Attending: Amber, Viv, Trisha, Bertram, John K., Dave, Carol, Matt, Suzie, John C, Bill, Mike, Todd, Rebecca, Steve
Regrets: Bob Cook, Bruce, Stephanie

Agenda for 2011-05-13

1. Report on Joint U&A and Sociocultural WG Meeting (Allard/Tenopir/Frame)

  Groups working on tasks had members from both WGs.

  Policy and Best Practices group:
    Based on USGS NBII and NASA DAAC; sent on to Bob Sandusky and Matt for review
    Also created some guiding principles - may be useful to S&G WG
    Created workflow for where policies may be needed

  Personas Group:
    Mapped personas onto the data life cycle

  All draft materials are at: https://docs.dataone.org/member-area/working-groups/usability-and-assessment/joint-ua-socio-working-group-mtg-knoxville-may-3-5-2011

  Tools (Mike):
    Looked through existing surveys and reports for mentions of specific tools used in all parts of the data life cycle
    Briefing on the data life cycle, so all groups framed their reports on it

  Assessment (Carol):
    Looked at the NSF Review timeline and discussed librarian & libraries assessment; altered to get to government libraries
    Also looking to send to more academic libraries, beyond research libraries
    Next assessment will be data managers (not on timeline, so it needs to be changed) - in particular from not-for-profits (e.g., Nature Conservancy, government agencies)
    Started talking about middle school and high school educators
    Carol and Suzie are working on a 3-page summary

  What worked well:
    Update on priorities from CCIT and LT
    Clear deliverables expected by the end of the meeting
    Optional field trip to ORNL Thursday afternoon
    Allowing participants to choose which sub-group and task to work on
    Framework of data life cycle for all tasks

  Personas Subgroup (Kevin)
    Being developed on Google doc: http://bit.ly/D1Personas
    Developed:
      Primary personas
        Research scientist
          Sun: Early-career herpetologist
          Jean: Agricultural scientist at a field station
          Laura: Mid-career oceanographer
          Andreas: Biochemical modeller
          William: Late-career plant taxonomist
        Abby: Science data librarian
      Secondary personas
        Rick: Citizen scientist
        Elizabeth: University administrator
    Additional personas that might be developed include:
      Primary personas
        Joon: Member node manager
      Secondary personas
        Mr. McMillin: High school science teacher
        David: Graduate student
        Renee: College educator
        Anne: Resource manager
        Simon: Software developer
        Wilbur: Naturalist
        Paloma: Journalist
        Tim: Citizen science project leader
        Ajit: Consultant
      Buyer personas
        Robert: Academic society manager
    Research scientist personas were developed to span multiple dimensions:
      Work setting: Academic (tenure and non-tenure track), government/tribal, private
      Career stage
      Subject/discipline
      Single discipline vs. use of multi-disciplinary data
      Research setting: Field, lab, modeller
      Data: Human vs. machine-collected
      Data management skills: novice to expert

  Policies & Best Practices (Suzie & Kimberly)
    https://docs.dataone.org/member-area/working-groups/usability-and-assessment/joint-ua-socio-working-group-mtg-knoxville-may-3-5-2011/policies-best-practices-subgroup
    Created a Terms & Conditions document with notations marking key points for LT or S&G WG discussion/decision (Terms & Conditions draft 5.5.11)
    Created DataONE guiding principles, which may be useful for high-level discussion and for the S&G WG
    Developing workflow to look at key points for policy development: https://docs.dataone.org/member-area/working-groups/usability-and-assessment/joint-ua-socio-working-group-mtg-knoxville-may-3-5-2011/policies-best-practices-subgroup/work-flow-and-policy

2. Report on Best Practices Workshop (Budden/Hutchison)

  40 attendees (about 10 from DMP Tool group)

  Day 1: DMP
    DMP Tool profiled and feedback received; 4 new exemplar DMPs created and 2 existing reviewed; comprehensive data repository list, community standards and metadata standards lists created.
    Hoping to have beta version available at ESA in August

  Day 2: Best Practices
    55 new Best Practices created and edited for the D1pedia
    Created a DataONE Best Practices Primer that was reviewed at the meeting

  Day 3: Software Tools
    150 new Software Tools created and edited for the D1pedia
    Group struggled with the existing categories and made suggestions for new categories - need a way to tag tools with multiple purposes

  Some excellent feedback and suggestions for moving forward with D1pedia to improve its functionality and accessibility.
  Materials currently in a dropbox but will be added to plone shortly.

  Questions:
    Data repositories - how many are targeted as DataONE MNs?
      Not sure yet, need to go through the list. Some of the repositories are targeted to the DMP Tool, not necessarily DataONE. One of the action items is to contact each repository manager to verify the information collected on the repository at the meeting.
    Who will keep the links on the tools up to date?
      Some groups thought it should be a wiki where the community could keep it up to date. Others wanted ratings on tools.
    Question about duplication of list of tools: http://ebmtools.org, http://ebmtoolsdatabase.org/, http://www.smartgrowthtools.org/ebmtools/index.php
      For example, here's the page for Kepler: http://ebmtoolsdatabase.org/tool/kepler-scientific-workflow-system-0

3. CI Status (Vieglais/Jones)

  Progressing according to plan and have started working on authentication and access control.
  Needed extra technologies for the CNs.
  Two and a half meetings coming up: CCIT meeting and Semantics WG (partial overlap with CCIT)
  Physical locations for equipment at CNs - need to have this nailed down within a week so equipment can be ordered
  Trying to have h/w ordered by the end of the month
  Research Storage Consortium at UNM is having a vendor presentation next Tuesday and a decision on vendor by May 18 (Wednesday)
  TeraGrid collaboration is moving forward. EVA is becoming high profile at NSF.

4. Feedback from NSF (Michener)

  NSF (finally) provided feedback on our external review. In essence, the NSF PDs felt that we did an outstanding job and that we were the "poster example" for the entire DataNet program.
  Alan Blatecky presented a summary of our progress to the National Science Board and it was very favorably received.
  No word on the next round of DataNets. Bill will be in DC next week, so he is hoping to have a conversation with OCI while there.

5. Preparation for meetings with USGS and NASA (Michener/Frame)

  Bill, Mike, and Bob are working on a presentation for USGS and NASA Program Managers on DataONE and potential agency interests.
  Want to address agencies' concerns (NASA's in particular) about how credit is assigned.
  Will include material produced at the S&G WG meeting in April.
  Bill will share the presentation in a dropbox early next week.
  Wednesday afternoon will visit USGS and discuss collaborations, specifically on tools.
  Bob is working on bullets on the advantages of being a MN.
  Working on one graphic (animated) that illustrates the advantage of becoming a MN - shows connections to DataONE, tools, etc. Would appreciate feedback on how to make this more powerful, and any other ideas for making this point.
  From another angle, it would be good to show the relationship between DataONE community work and the activities of the Community for Data Integration at USGS. Viv is leading a Data Management Working Group that directly parallels (on purpose), and is greatly influenced by, the DataONE effort. Over 50 people call into the USGS Data Mgmt WG each month, and there are 2 active sub-groups looking at USGS Best Practices and Data Policy. A slide could emphasize the success of building community within organizations as well as across organizations, and the collaborative activities that creates.

6. Around the Room

  Bertram: Small ProvWG meeting scheduled for June 8th at UC Davis; Summer Internship day is the day before, June 7th. ProvWG will focus on D-OPM; inviting people who can represent provenance issues for certain systems (Kepler, Taverna, Vistrails, Galaxy, R, ...). Probably only a small subset for the June meeting; full WG meeting at DataONE AHM in the Fall. Summer Internship project #8 (Provenance Repository) is scheduled June 1st to July 31st. As part of the FilteredPush project (w/ Harvard), will be participating in the http://research.calacademy.org/spnhc meeting (demo camp on curation workflow). Harvard folks will be visiting UCD before the SPNHC meeting. Need to connect with DataONE?

  John K.: Presented yesterday on DataONE and data curation at PASIG (Preservation and Archiving Special Interest Group). Interesting conversation with Mark Leggott; he heads up Islandora, which is a glue layer between the Drupal CMS and the Fedora repository stack. He showed a bunch of examples of sites built with Islandora, including data visualization with R scripts. He also said they have plans to incorporate Taverna and Kepler, although I'm not sure exactly how he means it. Anyway, we will follow up with him with an eye to wrapping Islandora around the Merritt repository.

  Viv: Aside from helping with the Best Practices Workshop this week, will be meeting with DataONE intern Melody Basham next week in Denver to develop a work plan for the summer. Her summer project will focus on the CEE work on development of an online suite of data management learning modules.

  Mike: Nothing much to report. Have been working on the OMB Briefing all week; the briefing will mention some of the results from the Scientist assessment which came out of DataONE.

  John C: Continued work on the combined DataONE and TeraGrid Nugget highlight to NSF (OCI and CISE) connected with the SOTB 2011 release.
  Matt: Working on identity, authorization and authentication systems; TeraGrid node deployment/feedback; EVA use case in prep for summer runs on TeraGrid.

  Steve: Bob is starting the second version of the EVA WG on carbon cycling. Student intern coming to the lab on Monday; Paul Allen will get him started on the summer's work. Steve is going to Edinburgh on Monday to present the EVA bird work. The 2011 State of the Birds Report has been released. Lots of buzz from federal agencies on the content of the report.

  Amber: Intern face-to-face meetings begin next week; internship program officially starts at the end of the month. Heading to NY at the end of next week for the first PPSR working group meeting. The co-chairs implemented all the changes required by the LT, including removal of Steve Kelling and Kevin Crowston from the WG participant list. Now that BP is over, focusing more heavily on DUG planning, so expect a full update next week.

  Todd: Speaking to the ORCID participant meeting in Boston next week. Any points to convey regarding researcher identity needs for DataONE?
    Matt: InCommon integration, verified identity support (e.g., InCommon Silver), and identity equivalence mapping are all important.

  Dave: Met with the Genomic Standards Consortium Monday-Tuesday. Some tools may be of relevance to DataONE - Ontogrator and Terminizer, to provide a faceted browse / discovery interface (from NERC Environmental Bioinformatics Center, Norman Morrison). Did anyone notice: http://cityroom.blogs.nytimes.com/2011/05/07/bird-week-watching-and-counting/ ?

  Carol: The UT office of research has given us some end-of-year funds to increase our GRAs' hours this summer, so Lei and Arsev will have more time to devote to DataONE. Mithu, the post-doc who was to start May 23, has accepted a tenure-track position, so we are looking at our applicants again.

  Trisha: Nothing to report.

  Suzie: Nothing else.

  Todd again: Encourage folks to take a look at the RSC initiative: http://rcsproject.wordpress.com/2011/05/13/royal-society-to-investigate-open-science/