Member Node Forum - 18 April 2013 1. Please join my meeting. https://www1.gotomeeting.com/join/318278809 (Occurs every 2 weeks on Thursday 2pm EST, 1pm CST, noon MST, 11am PST, 10am AK) Upcoming meetings 16 May (can't meet 2 May due to UA/SC joint WG mtg) 2. Use your microphone and speakers (VoIP) - a headset is recommended. Or, call in using your telephone. Dial +1 (213) 289-0010 Access Code: 318-278-809 Audio PIN: Shown after joining the meeting Meeting ID: 318-278-809 Previous epad(s): epad.dataone.org/MemberNodeForum-20120307 (note typo on year-sorry) epad.dataone.org/MemberNodeForum-20130321 epad.dataone.org/MemberNodeForum-20130404 Attending: Laura, Amber O., Ranjeet, Alex Thompson, Dave V, Mark S, Matt, Rebecca, Steve K, Amber B, Andrea Matsunaga, Greg Traub Regrets: Inna, Kavitha, John, Action items from last time: Action items from previous meetings (follow-up): 1. Issue: Develop a implementation checklist that MN implementation teams can use (specifically on CCIT side). https://redmine.dataone.org/issues/3678 QUESTION: is this ready for distribution? Matt, Chris, and Laura are working on creating a cradle-to-grave checklist/procedure, which we hope to have publicly available by the SC/US Joint WG meeting 30 April-2 May. Currently available at: http://mule1.dataone.org/OperationDocs/member_node_deployment_2/mn_checklist.html This is in editing and will change. We'll notify everyone in this forum when it is completed. 2. Issue: support SEAD for April 22 NSF demo. UPDATE: Inna and Kavitha are practicing for the NSF review today. They pass on their thanks to the group; Matt and Chris have been instrumental in getting them ready for their demo. They are demo-ing the stage deployment for NSF. We need to move the activity for SEAD to production; John/Laura to monitor/assist. SEAD needs to determine if their data is in a suitable position to move forward to production. 3. We need to create a "compatible versions" for current version of DataONE API's and SW products (and maybe MN versions as well). (i.e. available documentation needs to match currently released s/w version) UPDATE: (Source: Mike Frenock - production documentation points to trunk, no links to currently running versions of s/w. Mule only has trunk-level documentation. See redmine ticket which requests current released s/w and documentation match. Take offline - Laura to monitor.) 4. Question about the type/availability of usage statistics, granularity, etc. Also relates to the "dashboard" being developed for the Member Node page at http://www.dataone.org/member-nodes Dave: every access to DataONE MN is logged, stats are aggregated at Coordinating Nodes; we do not currently have summarization of that data available. SteveK: would like to review offline with Dave et. al. how eBird is currently collecting data usage stats Dave: would be helpful in the broader effort Rebecca: EUDAT, http://www.eudat.eu/ (European Data Infrastructure - big data infrastructure project in Europe) folks are also interested in collaborating on this effort Welcome to Alex Thompson from iDigBio! Alex: We've just started our efforts looking into DataONE; iDigBio is (will add text later) Coordinates digitization efforts and helps mobilize biological data. Located at Florida. Long-term storage/accessibility of data and images (very large files). Agenda: 1. Network-wide update and issues affecting MN's as a whole Dave: nothing right now; a couple internal maintenance things going on; a maintenance activity at the ORNL Coordinating Node was transparent to users. 2. Open issues: MNs please add anything you would like to address here. SAVE THE DATE: The DataONE Users' Group meeting is July 7-8, 2013 at Chapel Hill, NC. Agenda and registration details to follow. We expect to have two slots of special interest to Member Nodes: - A MN showcase speed session where each MN can present a short (~3 slide) summary of the MN and interesting activities - A poster session one evening where longer, informal discussions can occur. Preliminary agenda and registration are available on the DataONE website: http://www.dataone.org/dataone-users-group 3. Feature requests/opportunities for collaboration: MNs please add items here. MikeF: was looking at SOLR stuff from Matt Matt: query interface is same for MN and CN, can choose which query engine to implement, Coordinating Nodes have chosen to use SOLR, MNs can/often do choose something different; implementing SOLR with MetaCat as we speak, plan to be available in release after next. 4. MN showcase: If you would like to share what your MN is about, please add here. MikeF: working a lot w/D1 APIs vs old Metacat; scientists use Matlab, working on incorporating transition from Matlab data (Metacat) to D1 APIs Matt: would you be willing to make that publicly available MikeF: yes, currently in git; working on removing Metacat references, plan to make it public repository later MarkS: update on PASTA, testing interaction between generic MN stack and PASTA; next move to staging to see how data flows from PASTA into DataONE. Still working on data access as far as LTER is concerned; trying to relax access rules. Matt: planning on using verified user form, or only make metadata available? MarkS: some contributors want more controlled access, Science Council meeting in May to discuss more open access to data, usage tracking, etc. SteveK: switched over to level 3 access to full eBird data; processed ~900 requests for access to eBird data, see who is using the data and for what purpose (some people using data for for-profit applications!); half of requests from academics/students specifically for some project in ecology/GIS class; has been great to get this information; about to release an updated dataset to DataONE (117M records) MarkS: question: is access an opt-in or required for new download/access? SteveK: can control access to data, duration of access (students ~a month, power users unlimited open access) Matt: see EML access control, ticket-based system, anyone with ticket code can access the data; we didn't put that functionality in DataONE because it appeared no one was using it; would like to talk with SteveK re: their access control system and what mechanism(s) should be made available in DataONE SteveK: type/quality of data made available varies on the intended use/user; seeing need for versioning (maintaining old datasets while making new data available) Alex: would be nice to collaborate with SteveK re: their experiences with DataONE so far in aiding iDigBio in their implementation. Dave: documentation is a big focus right now Matt: data packages - if you're interested in this discussion, please contact Matt and other CCIT folks Dave: ask.dataone.org is a convenient way to ask questions for the DataONE team and broader community to address. It's very much a dynamic site; everyone should feel free to ask questions, clarify answers to existing questions, etc. Focus isn't necessarily technical but covers all aspects of the project. Matt: if you have a DataONE account, you may post questions/answers. OTHER INTERESTING THINGS: Upcoming deadlines: ASIST annual meeting Beyond the Cloud: Rethinking Information Boundaries November 1-6, 2013, Centre Sheraton, Montreal, Quebec, Canada https://www.asis.org/asist2013/am13cfp.html Deadline for posters, demos, video is July 1.