LT Attendees: Rebecca, Amber, Suzie, Matt, Bruce,


LT Regrets:  Deborah, Bill, Viv, Steph, Bertram (notes below), Dave, Bob


DataONE LT Call:  9am AK/10am PT/11am MT/noon CT/1pm ET

 We will also use the epad: http://epad.dataone.org/2013Apr19-LT-VTC if participants can get to it.
 
If you have items to add, let me know.

Agenda for 2013-04-19

1) CI Update (Vieglais - traveling so report by email)


CI activities for the last week,

- the usual maintenance and progression through minor issues

- reviewing draft for supporting mutable content through series  identifiers. SIDs are optional identifiers that can be assigned to the  family of objects the conceptually refer to the same thing (e.g. a  series of revisions to a metadata document), and by default, the SID  will refer to the latest revision of the series. CCIT discussion on the  matter scheduled for Monday
{http://mule1.dataone.org/ArchitectureDocs-current/design/ContentMutability.html}

- working on replication auditing services that check the number of  replicas for an object, verifies the availability of those replicas, and  balances the replicas as necessary

- Discussion about object packaging and how best to represent as  internal structures within the various tools, in the search results  pages, and the best mechanisms for serializing all objects in a package  to be downloaded. BagIt is considered the most viable standard solution  for constructing downloadable packages. There is some concern over  conflict with this recommendation and existing approaches that might be  employed by MNs.

- SEAD MN in staging. The MN is technically ready for production,  however before moving to production, a broader evaluation of the content  available should be performed, and there are some related activities to  be performed on the SEAD side of the implementation. e.g. there is  currently only a small subset of total content available in the staging  node.

- Semantics WG meeting this week identified several goals for short and  mid-term activities. Short-term goals involve evaluation of topic maps  as a mechanism for high level grouping of content and as a possible  approach to assist with keyword normalization, term expansion in query  construction, and possible guidance in high level aspects of ontology  development as it relates to the diverse content in DataONE.

Matt adds:  The Taiwan Forestry Research Institute (TFRI) node has been tested in Stage and its looking good.  We discovered some bugs in handling of Chinese language characters in ONEMercury, and Skye is working on fixing those.  TFRI is starting the production install, and will register and wait for us to fix the Chinese character encoding issues in ONEMercury.

2) CEE Update  (Budden)
Pat L  (USFW) - they are building requirements for informatics for agency; training and education materials are further down the road. Amber put him in contact with Mike Frame and Karl Benedict for help with the informatics portion.

AGU proposals for sessions

ESA is dedicating time for demos - DataONE has signed up to this.
DMPTool - Sloan and IMLS funding for the DMPTool. CDL is interviewing developers for the Sloan funding. Hoping to have hires in place by next month. Research Advisory Board and Administrators Advisory Board

3) Update on EUDAT/DataONE meeting (Koskela)

Areas for possible collaboration:
-          Having nodes for replication for testing/prototyping
-          Metadata exchange
-          Convince Dave to participate in DFT WG in RDA
-          Unifying site/node registry
-          Exchange of educational/training material
-          Make use of BP Primer
-          Exchange about DMP tool (will connect them with Andrew Sallans)

4) Around the Room (if time)

Deborah  (I will be in transit during this meeting):
-  Data Integration and Semantics Working group had a Face to Face meeting  at NCEAS from Tuesday - Thursday (with some preparation meetings  before).  CCIT members Vieglas and Leinfelder joined us for portions as  did postdoc Stacy Rebich Hespanha and postdoc Ben Adams for portions
- The group refined summer internship plans, semantic search plans, and our driving hydro-eco use case descriptions
- Postdoc Seyed provided an overview of the Semantics working group for EUDAT/DataONE meeting
- McGuinness provided an introduction to Web Science and Semantic Data integration for the Women in Science "Red Chair" series

Bertram:  ProvWG meeting planned for June 24&25 at NYU Poly. Bertram gave an  overview on ProvWG at EUDAT/DataONE meeting. Victor Cuevas and Saumen  Dey attended NorCal Database day at Stanford.

Bruce: Regrets.  Will not be able to attend.  Have interviewed and made a verbal offer to a grad student for CI for next year, and I have a verbal acceptance.  Will work to get this official in the next week. Spent some time working on the UTK institutional repository description effort.

Suzie: Mike and I will be meeting with Andrew Sallans this afternoon. He was visiting the UT library to talk about research data.  We have the IRB and final permission from FigShare community so that survey is in the field as of yesterday.

Steve: nothing from me

Reminder that this is a combined LT call and Working Group Quarterly Report meeting. LT the first half hour and WG reports for an hour starting at 
9:30 AK/10:30 PDT/11:30 MDT/12:30 PM CDT/ 1:30 PM EDT.

1.  Please join my meeting, Apr 19, 2013 at 11:00 AM MDT.
https://www1.gotomeeting.com/join/515461896

2.  Use your microphone and speakers (VoIP) - a headset is recommended. Or, call in using your telephone.

Dial  1 (267) 507-0012
Access Code: 515-461-896
Audio PIN: Shown after joining the meeting

Meeting ID: 515-461-896

GoToMeeting®
Online Meetings Made Easy®

========================================================================
Working Group Quarterly Reports

WG Attendees:


WG Regrets:  Bill, Bob

Working Group Reports:
========================================================================
   Public Participation in Scientific Research WG 
   
   Overall Objective:
Identify the scope, scale, and diversity of PPSR data used in scientific research and barriers to broader use of these data. Provide recommendations for improving quality, quantity, and accessibility of these data; generate recommendations and/or tools to advance integration of data in conventional science.
 
Milestones for next 12 months:
 
 
Accomplishments from past 12 months:
 
 
Products
 
======================================================================

Working Group: Sustainability and Governance 
Co-chairs: William Michener & Patricia Cruse
Date: April 19, 2013   
 
Overall Objective: 
- Develop sustainability and governance plans  
 
Milestones for next 12 months: 
- May 13-17, 2013 – Strategic planning and proposal preparation 
- July 15-19, 2013 – Strategic planning and proposal preparation 
- August/September – revise Marketing Plan
- November 2013 – submit NSF follow-on proposal
 
Accomplishments from past 6 months: 
- January 29, 2013 – Meeting with Mellon Foundation
- January 28-30, 2013 – RSV and proposal planning
- February 27-March 1 – Sustainability presentation for Reverse Site Visit 
- December 2012 – S&G presentation to and feedback from External Advisory Board
- December 2012 – completion and acceptance of sustainability report from Kim Thanos and Partners 
 
Products:
-   Sustainability report completed by Kim Thanos and Partners
 
========================================================================
Working Group:  Data Integration and Semantics
Co-chairs: Deborah McGuinness, Jeff Horsburgh
Overall Objective: The mission of the Integration and Semantics Working Group is to guide the specification, adoption, and implementation of semantics technologies, broadly defined, which will enable DataONE to sustainably achieve its objectives for the seamless discovery, integration, and dissemination of Earth observational data.

Milestones for next 12 months:
Accomplishments from past 6 months:
Products:
========================================================================

Working Group: Community Engagement & Education 
Co-chairs: Stephanie Hampton, Amber Budden (interim)
Overall Objective: The Working Group is chartered to determine effective means for engaging with DataONE’s stakeholders to improve DataONE technical tools and build community capacity for sharing and using data. This activity requires deep analysis of existing literature in order to make evidence-based recommendations, and thus should lead to peer-reviewed publications that have impact beyond DataONE activity, in addition to guiding DataONE efforts.

Milestones for next 12 months: 
Spring 2013
Summer 2013
Fall 2013
Accomplishments from past 6 months: 
Products:
***********************************************************

Working Group: Provenance in Scientific Workflows (ProvWG)
Co-chairs: Bertram Ludaescher & Paolo Missier
Date: April 19, 2013

Overall Objective: 
- Deliver the value of provenance metadata to the DataONE user community, specifically: develop an open and extensible provenance management architecture for scientific data processing systems (e.g., workflows and scripting languages such as R).  

Specific Goals and Products:
- DataONE Provenance Model (D-OPM/D-PROV), 
- suitable query languages and prototypes (e.g. based on RPQ queries),
- prototype workflows (with EVA WG: VisTrails/UV-CDAT workflows)
- generic tools (e.g., ProvenanceExplorer)

Milestones for next 12 months:
- finalizing D-OPM/D-PROV models; publish as technical report and/or full paper (journal)
- PBase summer internship project 
- prototyping some basic R + Provenance capabilities
 
Accomplishments from past 3 months:
* Tool/demo development for NSF Reverse Site Visit:
-- EVA climate science workflows (Yaxing Wei); ProvEx (Provenance Explorer) ; provenance mappers from VisTrails to D-PROV; indexing of provenance terms for ONEMercury; packaging of workflows and provenance for DataONE.
* Organized BigProv at EDBT/ICDT 2013 (Bertram, Paolo). The BigProv workshop and associated ProvBench (Paolo, Khalid) trace collection initiative took place on March 22nd, 2012, co-located with the EDBT conference. Traces are available at:  github.com/provbench.
* Paolo presented D-PROV paper at TaPP’13.
* Saumen presented PhD Workshop paper and GraphQ paper, both co-located with EDBT’13 in Genova.

Products
* D-PROV: extending the PROV provenance model with workflow structure. Paolo Missier, Saumen Dey, Khalid Belhajjame, Victor Cuevas-Vicenttín, Bertram Ludäscher, TAPP’13 workshop. April 2, Lombard, IL.
* Provenance Analyzer: Exploring Provenance Semantics with Logic Rules. Saumen Dey, Sean Riddle, and Bertram Ludäscher. TaPP’13 workshop. April 2, Lombard, IL.
* On Implementing Provenance-Aware Regular Path Queries with Relational Query Engines. Saumen Dey, Victor Cuevas, Sven Koehler, Bertram Ludäscher. Workshop on Querying Graph Structured Data (GraphQ), Intl. Conf. on Extending Database Technology (EDBT), Genova, Italy, March 2013, 
* A Declarative Approach for Publishing Customized, Policy-Aware Provenance. Saumen Dey. EDBT PhD Workshop Genova, Italy, March 2013.

======================================================================
Working Group: Exploration, Visualization, and Analysis
Co-chairs: Steve Kelling & Bob Cook
Date:  April 19, 2013  
 
N.B.:  Updates only
 
 
Overall Objectives: 
No Change
 
Milestones for next 12 months
May 2013
Enhancements to UV-CDAT code made by Jorge Poco (DataONE EVA) are publicly available through the UV-CDAT binary release
 
May 2013
Contribute exploration, visualization, and analysis exemplars from DataONE and Terrestrial Biosphere Modeling for the EarthCube Building Blocks Brokering Proposal to NSF
 
August 2013
With DataONE Summer Intern, Fei Du, begin pilot development of core components for the  "Provenance-aware Model Exploration, Evaluation, and Benchmarking Cyber-infrastructure.”  This development is a joint activity of the DataONE Provenance Working Group, the EVA Working Group, and the DataONE CyberInfrastructure Team.  
 
November 2013
Prepare a draft manuscript on the high-dimensional data analysis using a combination of machine-learning and visualization activity.
 
January 2014
Prepare a draft manuscript on an expanded study of visualization of complex model output by soliciting more examples from the carbon modeling community and provide directed input on how to improve carbon model visualizations
 
 
Accomplishments
from past three months
February 20, 2013:
o   Held a teleconference to discuss making UV-CDAT code that Jorge Poco generated available and the subgroup doing the evaluation / critical review of visualization techniques for climate community experts has modif functionality for data integration and analysis, and visualization built into Ultra-visualization Climate Data Analysis Tools UV-CDAT and to plan next EVA Working Group Meeting.
 
April 2013
o   Initiated a sub-group looking at High-dimensional data analysis using a combination of machine learning and visualization.  Lead:  Claudio Silva and Aritra Dasgupta
 
 
Products
o   Dasgupta, A., J.M. Poco, Y. Wei, R.B. Cook, E. Bertini, and C.T. Silva.  Submitted. An Exploratory Study of Visualization Usage For Climate Data Analysis.   IEEE Conference on Visualization.  Summary:  a critical review of visualization approaches used by terrestrial biosphere modelers and solutions to present information.  March 31, 2013.
 
o   Demonstration.  Wei, Y., E. Boldrini, M. Santoro, R.B. Cook, S. Nativi.  2013.  Use Access Broker in the North American Carbon Program Model Intercomparison Project.  Given at NEON Headquarters, Boulder, CO, March 21, 2013.
 
o   Seminar / Webinar:  Cook, R.B., Y. Wei, J.M. Poco, C.T. Silva. D.N. Huntzinger, and A. Michalak.  Exploratory Visualization of Terrestrial Biosphere Model Data.  Presented to EUDAT Visitors, April 17, 2013. 
 ======================================================================
Working Group: __Sociocultural  / Usability & Assessment_______
Co-chairs: Suzie Allard & Kimberly Douglass  / Mike Frame & Carol Tenopir
Date: April 30, 2013
Milestones for next 12 months: 
·         Identification of key stakeholders and description of their relationships in the research support/ data services ecosystem of academic and federal institutions.
·         Development of FAQs for DataONE.org and ONEMercury. 
·         Dissemination of DataONE personas and scenarios through sharing with other DataNets and website visibility.
·         Facilitation of internal and external DataONE communication.
·         In collaboration with UAWG:
o   Work with member node coordinator: Identify and describe relationships between DataONE, Member Nodes and Coordinating Nodes.
o   Conduct, analyze and disseminate research on the DataONE Working Group model
o   Develop a strategy for capturing high priority usage metrics and statistics.
o   Co-host joint Usability and Assessment and Sociocultural Working Group meeting April 30 – May 2, 2013 Knoxville, TN
 
Accomplishments from past 6 months: 
·         Participated in successful NSF Reverse Site Visit.
·         Represented DataONE at DataNet Federation Consortium User Requirements Meeting and developed strategy for collaborating on development of several white papers.
·         Designed a strategy for creating FAQs for DataONE.org and ONEMercury, developed first draft of FAQs, submitted first draft to Leadership Team for feedback, [DM1] revised FAQs based on feedback and submitted revised versions to ask.dataone.org.
·         Updated guidance to all faculty, staff and students re NSF requirements concerning Responsible Conduct of Research.  Compliance is continuously monitored and records kept.  
·         Participated in DataONE External Advisory Board Meeting. 
·         Scheduled proposal planning meeting.
·         Conducted network analysis on DataONE WG structure and membership.[DM2] 
·          
·         Submitted set of six potential survey questions to UAWG for assessment of who, where and how support is provided for research and data services for scientists.
·         Submitted set of four potential survey questions to UAWG for assessment of how scientists search for and choose to re-use data sets.  
·         Submitted DataONE Terms and Conditions to Sustainability and Governance Working Group for review.
·         Submitted DataONE Five Principles to DataONE Leadership Team for review.
·         Developed system for tracking research and scholarship opportunities concerning key sociocultural issues related to the intersection of earth, environmental and information sciences.
·         Developed Digital Orientation for new Working Group members.
·         Refined strategy for communication between Working Group and Leadership Team.
·         Initiated design of research project to identify key stakeholders and describe their relationships in the research support / data services ecosystem of academic and federal institutions.

·         In conjunction with UAWG:
o   Planned Annual Joint UA / SC WG Meeting to be held April 30 – May 2 in Knoxville, TN.
o   Developed, reviewed and suggested revisions for scientists/educators follow up assessment.
o   Developed and submitted IRB for assessment of early adopter stakeholders.
o   Developed online survey instrument for assessment of early adopters.
o   Completed initial analysis of DataONE members’ satisfaction with WG model, process and relationship to DataONE survey.
o   Completed initial analysis of DataONE members’ assessment of WG model for ecoinformatic enterprises like DataONE.
o   Co-hosted Dr. Robert Chadduck, NSF Program Director, for NSF Site Visit.
o   Prioritized stakeholders for further assessment.
o   Developed assessments strategy.
o   Developed assessments schedule for final two project years.
o   Developed schedule for reporting completed baseline assessment results.
o   Reviewed metrics capture plans from project management plan.
o   Reviewed current system for capturing metrics from DataONE.org.
o   Initiated plans for statistical portal / dashboard.

Products 
·         Tentative agenda for 2012 Joint UA/SC WG meeting.
·         Potential DataONE FAQs list.
·         Ten vetted FAQs for use on DataONE.org.
·         Sociocultural issues research and scholoarship opportunities tracking system.  
·         Digital Orientation for new working group members.
 
·         In collaboration with Usability & Assessments WG Team Members
o   Two DataONE Working Group Model Assessment Instruments.
o   First draft analysis of results of DataONE Working Group model research.
o   DataONE Internal Communication Recommendations
o   Updated assessments schedule.
o   Follow up assessment instrument for scientists/educators follow-up.
o   Baseline assessments reporting schedule.
o   Summary report and action items from DataONE 2012 All Hands Meeting.

===================================================================
Working Group: Preservation and Metadata 
Co-chairs: John Kunze and Jane Greenberg
Quarterly Report – Date: 2013.04.19  

Overall Objectives: 
·         To create and periodically to review DataONE preservation strategies (ending August 2014).
·         To assist DataONE in recording and maintaining metadata to support discovery, life-cycle management, citation, and general interoperation 
 
Milestones for next 12 months:
 
·       May/June 2013 – Face meeting to be held in Chicago.
·       June 2013 – Submit updated results of Murillo, et al, to PLOS.
·       Spring/summer – Continue to explore funding/grant options, GSoC and RCN/NSF.
·       Summer 2013 – Mentor student intern pursue registry prototype.
 
Accomplishments in past 3 months:
 
·       March/April 2013 – Summer intern project approved; intern hired.
·       April 2013 – Dublin Core/potential RDA CAMP-4-DATA* workshop submitted to DCMI 2013, and workshop was approved.
 
 
*CAMP-4-DATA was pursued as DC-SCIENCE and METADATA (SAM)/RDA collaboration, but integrated DataONE PAMWG goals.