Bring a specific marine observation from a dataset and tell us how you choose an appropriate controlled vocabulary!
Attendees will help explore and develop a decision tree to help choose which controlled vocabulary/vocabularies a data manager/scientist should use to ensure their marine data follow the FAIR principles. We will be testing out an innovative approach to community source these decisions through a combination of Google Sheets, Python, and a decision tree generator (
scikit-learn and
dtreeviz). The session will start off with an overview of the effort followed by breakout groups, where participants will draft out their thought process for selecting a controlled vocabulary for a specific marine observation of their choosing. We will then reconvene and start consolidating the information from the breakout groups which will start to populate the
marinedata-vocabulary-guidance repository.
The goals of the session are to use an innovative approach to community source information to:
- Identify the recommended vocabularies to choose from for marine observations.
- Identify the entry points into the decision tree.
- Identify the key questions to ask when selecting a vocabulary.
- Start a vocabulary guidance for marine data document.
View Notes