Over the past several decades, considerable progress has been made in the development of metadata standards and formats for publishing Earth and environmental data. Repositories for scientific data products have also proliferated. In the ecological research community, much of this development has centered on publishing tabular data. As data products become more complex, voluminous, and distributed, there is a clear need to develop new methods to handle the creation, publication, and discovery of datasets that don’t fit within the standards and workflows established for the most common cases. In this session we address several special cases that have become increasingly common in Earth and environmental datasets, such as drone data, sequencing and genomic data, image or document collections, and datasets distributed across multiple repositories. Presentations will focus on emerging best-practices, metadata standards, specialized repositories, provenance metadata, and methods for linking datasets with disparate structures, metadata, or published locations. A breakout group session will foster collaboration and coordination among the numerous groups that are leading efforts in this area.
Session goals and preparation:To prepare for the session, please think about what types of Earth, environmental, or ecological datasets you work with in your research or data management activities, and consider where there are gaps in your knowledge of how to publish, access, or use these datasets. In general, are session goals are:
- Introduce participants to recent developments around publishing, discovering, and re-using modern ecological and environmental datasets.
- Answer the question: Should this community be doing more to develop best practices for publishing modern datasets as they become more complex and varied? (And what? how? where?)
During the session participants will enter notes and questions in a Collaborative Doc.
Session Agenda:View Notes