Attendees: Carol, Suzie, Bob, Amber, Mike, Trisha, Bertram, Stephanie, John K., Viv
Regrets: Bill, Todd, Matt, John Cobb, Dave Vieglais

http://epad.dataone.org/20110429-LT-VTC

Agenda for 2011-04-29

1. Report from USGS/DataONE tools workshop (Vieglais/Jones/Frame)
   Couple day meeting in Denver this week
   Matt & Dave from CCIT and developers and technical leads from USGS side

2. Update on Best Practices Workshop (Budden)

3. Request for someone to attend Open Debate at XLDB-Europe 2011 (Koskela)

Dear Bill,

Martin Kersten, Alex Szalay and I are organising a pair of workshops in Edinburgh from 7th to 10th June this year. We are very pleased to have brought XLDB to Europe. It is a meeting of the leaders in handling eXtremely Large Data(Bases) with a bias towards solutions for scientific applications. We want to show those from outside Europe the major undertakings in Europe, particularly the ESFRI projects, and we want to steer the Database Researchers and Companies to deliver solutions relevant to these ESFRI projects. Their provisional programme outlines are described below.

I would very much like you to be one of the people putting forward a position in the Open Debate: "What is really needed to share research data globally?"

The idea of the three debates is that we persuade a representative and diverse mix of experts to make short but strong position statements. There is then a discussion. From this we try to distill clarification of issues and sharp conclusions in the workshop report. We will work with those who participated in the debate to agree the conclusions.

In the Second Open Debate I want to get representative positions from international projects that indicate what their real requirements are for global data sharing. I thought your experience in DataONE inter alia would equip you to be an outstanding protagonist.
Donatella Castelli, Information Science and Technologies of the Italian National Research Council (ISTI-CNR), has agreed to chair this debate -- she has both the experience and the independence needed.

If you say "Yes", which I very much hope you will, we will book and pay for your local accommodation, waive the registration fee for the pair of workshops and (if you need it) contribute to your travel costs.

If you have to say "no" for some reason, I would be very grateful if you could suggest an alternative speaker. Whoever comes, I hope it will prove profitable to DataONE, by persuading more data-intensive researchers and solution providers to help with the challenges.

Hoping to enjoy listening to you at this workshop.

With best wishes and hoping you're enjoying a relaxing weekend.

Malcolm

--------- Current state of Planning --------

The final event in the e-Science Institute Data-Intensive Research theme is being juxtaposed with the XLDB-Europe workshop, see http://wiki.esi.ac.uk/Final_Outreach_Workshop and http://www.xldb.eu/xldb_europe_2011/index.html. These will be held in the week, Monday 6th to Friday 10th June 2011. They are deliberately timed to precede SIGMOD in Athens the following week.

You are all invited to the DIR workshop and strongly encouraged to apply for an invitation to XLDB-Europe 2011. We intend that these be very exciting events demonstrating recent progress in Data-Intensive methods and their application. Only the final two days, XLDB-Europe has a preference for large-scale data; otherwise we want to encourage multi-scale thinking -- adapting the approach to the scale and sophistication that is needed on each occasion.

Please consider contributing material to the workshops.
There are three invitations to contribute on the web site:
1) DIR updates - substantial recent progress
2) DIR Research Village - presenting your group's achievements, methods and technology
3) DIR lightning talks - presenting the gist of important new ideas

If you can think of other ways of contributing, please let us know at DIR-leaders@inf.ed.ac.uk.

The programmes for both workshops are well developed. You can view the DIR workshop programme here http://wiki.esi.ac.uk/Final_Outreach_Workshop#Outline_Programme as it develops.

4. Around the Room


1. Report from USGS/DataONE tools workshop (Vieglais/Jones/Frame)

Actions/Next Steps:

*******Configure the Uploader to use metadata development tools (such as Drupal Metadata Tool and possibly Metavist). Establish Workflow (Above)
Lead: Giri and Viv
Date: ______
Other participants: Tim Kern, Brad, Dave Rugg, Ranjeet
Establish components for Data Uploader so that it can be a Plug-in

***Evaluate DataONE Auth. DataONE will develop the initial prototype and let people know when it is ready for evaluation.
Lead: Matt (DataONE)
Date: End of June a prototype will be available.
Potential collaboration on DataONE Auth. Method (Browser handling of certificate renewals). Fort Collins will participate in this development activity.
Membership in the InCommon network for USGS needs to be evaluated as a potential method for Authentication into DataONE. Propose alternative options for organizations that can't/won't join InCommon.
Lead: Jeff, Mike

******Establishment of BIP Node. Begin with the NPS Veg Data.
Lead: Giri
Date: Dec 31, 2011
Other participants: Jeff F, Tim M., Tim K stay involved

BIP Metadata Clearinghouse needs to address the issue of Identifiers. Establish DOI, EZID service, for Clearinghouse.
Lead: Giri
Date:

Become DataONE MN for ScienceBase. Use Generic MN toolkit.
Lead: Tim Kern; Giri, Jeff, Viv.
Date: Dec 31, 2011

*******FUSE/Windows Drive effort. Supports required functionality for storing metadata back.
Define a Layer, using Python, shared high-level API (Drag and Drop, Identifiers already in use)
Semantics (Requirements from Scientists and Metadata Editors) of the DataONE drive and how it will be supported.
Creation of a UI for Mounting of the Drive.
Lead: Dave, Ryan
Date:

*ArcGIS Plug-in Catalog support for exporting data/metadata to DataONE. Curtis, Collin, Chris (NPS), EGIS, Tim K, Mike M to mention at USGS GIS Workshop. Use ArcGIS to upload to any MN.
Date: Initial Briefings at May

Development of potential Repositories (MNs) within USGS. General stats and potential sources that could contribute to DataONE.

MetaVist: Resolve the bugs in 2.x MetaVist.
Action: Viv to talk with Dave R. about which version to build upon, what needs to be done to resolve 2.x bugs, etc. Identify method for additional DataONE Features. Need to generate Unique Identifiers from MetaVist.

VisTrails Support. Load and Save Data from DataONE from MN, plus metadata generation and saving for derived data products in VisTrails.

R support for DataONE data loading and saving, and metadata generation and saving for derived data products in R.

Common vocabularies for common terms - hydrology, ecosystems, etc. (Potential use of NBII BCT Thesaurus, and USGS Thesaurus)
Develop broader characterizations for the DataONE keyword search from the multiple USGS Thesauri that are available. Look at involving Peter, Lisa, Dave G, in this activity.

---- Below here must be done ----

Implement reserveIdentifier() for CN service to delegate to the EZID service for ARKs and DOIs.

Release data uploader as open source via SVN with OSI license, Tim Kern

Draft a CDI funds proposal for continued collaborations between Fort, CBI, and CSI. Mike, Tim, Viv, Tim M., Mulligan

Provide talking points related to areas of involvement for the May 18, DataONE Briefing at the CSS SLT.
Lead: Frame, Tim Kern
Date: May 15


2. Update on Best Practices Workshop (Budden)

Workshop runs week after next, May 9th - 11th, Santa Fe, NM.
3 day meeting. Day 1 - DMP, Day 2 - Best Practices, Day 3 - Software Tools.

A large amount of preparation and revision of DataONEpedia was done in advance by the BP Planning Team, in particular Carly Strasser and Laura Arguelles.

Participants are primarily environmental scientists with experience in large data management practices and librarians working with digital collections.

An afternoon pre-meeting in advance (Monday afternoon) is being held by the DMP Tool Group.

Primary headache has been in coordinating accommodations in a mid/high season tourist area. Most of our attendees failed to meet the room block deadline, leaving us scrambling for rooms. Lessons learned for me - you cannot remind people enough. Currently chasing up the last couple of people; however, the hotel is fully booked for the last night. May have to ask DataONE team to switch hotels on the last night to avoid transferring this headache to outside participants.


3. Request for someone to attend Open Debate at XLDB-Europe 2011 (Koskela)

At some point it would be good to get DataONE Working Group members involved in representing DataONE at various venues.


4. Around the room:

Bertram:
Looking for a scientific-workflow/provenance user from the DataONE community. Users using workflows already or getting ready to use workflows - who are these people and how does the WG contact them? Bob suggested Jeff Morrisett (sp?)
Another comment & question re. online teaching materials: I found the materials here to be nicely presented (for tech folks): http://software-carpentry.org/
Is there a link/group for the (planned? existing?) DataONE education materials (slides to online courses?)

Amber:
Busy with BP and DUG planning. DUG invites going out today.
Mentor information calls now complete so everyone should be on the same page wrt the initial process of working with their interns.
Put in the application for a booth space to ESA (to be shared with Dryad). Will start working on design activities next month.
PPSR WG are satisfied with the LT changes to the charter and will adopt these. I spoke to Steve Kelling in person and he is not disappointed about the recommendation to be removed from the WG.
One of our EIM instructors (workflows) may need to cancel his teaching commitment (June 2nd). We don't currently have a back-up. Recommendations appreciated.

Steph:
Viv and Carly have continued dialog among Best Practices side of CEE working group - Viv can update?... Nothing else to report, gotta leave in a few minutes for a dentist appt... :)

Suzie and Carol:
The joint U&A/SC WGs meeting will be held next Tu-Th May 3-5 at UT Knoxville. We have finalized the agenda and confirmed participants. The meeting includes the opportunity for a 1/2 tour at ORNL organized and hosted by Bob Cook. The meeting is focused on five areas identified by the LT/CCIT as key areas to concentrate on. Potential deliverables have been identified to build on these areas.
The SCWG has officially installed Kimberly Douglass as co-chair (per the LT last week) and Maribeth Manoff has moved back into being a full WG member.
An interesting article by one of the 2010 interns, Valerie Enriquez: http://futureready365.sla.org/04/25/beyond-books-what-does-research-mean-to-you/
The Special Libraries Association (SLA) has a year-long program to discuss how information professionals will interact with the future. This blog is encouraging a different professional to post each day.

Trisha:
Nothing to report.

Mike:
Nothing to report beyond what's above.

Viv:
--This week was all about the USGS/DataONE meeting hosted in Denver.
--For CEEWG, we are making progress on the data management modules - survey questions have been developed to gather feedback, and there is a looming deadline to complete the edits and annotations for each of the slides.
--Group identified places to make enhancements and improvements in Wikipedia - Carly is organizing the group to make those edits by a date in June.
--The DataONE intern is confirmed and work will begin with Melody Basham soon - working on making a face-to-face meeting plan in May.
--Also - working on redefining a proposal for a USGS data management, one part of which is data management education. Obvious overlap with DataONE CEEWG work being done.

John K.:
Did panel on NSF data management requirements for grad student audience at UC Davis. Lots of questions about incentives for sharing given effort and loss of competitive edge for sharing data that could still be exploited for further publication. Also, who "owns" (and what does that mean?) data generated by grad students?

Bob:
Still off the Net at ORNL - chasing down the last of the malware - may be back as early as next Monday.
Getting ready for Best Practices Workshop in Santa Fe.
Working on revised EVA charter - hope to have it ready for review shortly.
Arranged for NASA briefing in DC on Friday, May 20th with Woody Turner and Martha Maiden.