This section gives a synopsis of the recommended (i.e. minimal) OMTD-SHARE metadata schema for corpora, that is, the subset of Mandatory and strongly Recommended metadata elements. Additional elements required for managing metadata records (e.g. metadataCreationDate, metadataCreator) are not presented here, as they are to be handled by the OpenMinTeD platform.

You can find more information on the full OMTD-SHARE metadata schema and examples of metadata records for corpora here.

These elements have been selected so as to help

  • identify the corpus and provide information about it (e.g. resourceIdentifier, resourceName, version, description)
  • describe the legal terms for using the corpus (e.g. licence or rightsStatement, nonStandardLicenceTermsURL)
  • encode technical features that help tools and services interoperate (e.g. dataFormat, language)
  • give access to the contents (e.g. distributionLocation)
  • classify the corpus along a variety of criteria that end-users can apply for locating corpora of interest for their research (e.g. domain, keyword)
  • contribute to attribution, citation and reproducibility of research processes and outputs (e.g. resourceCreator, creationDate, userQuery).

For annotated corpora, see here.

OMTD-SHARE element | Usage
resourceType | Mandatory
resourceName | Mandatory
description | Mandatory
resourceIdentifier | Mandatory
public | Mandatory
version | Mandatory
contactPoint | Mandatory
contactType | Mandatory
contactPerson | Recommended
contactGroup | Recommended
licence | Mandatory
rightsStatement | Mandatory
nonStandardLicenceName and nonStandardLicenceTermsURL | Mandatory when applicable
distributionMedium | Mandatory
distributionLocation | Mandatory
resourceDocumentationInfo | Recommended
resourceCreator | Recommended
corpusSubtype | Mandatory
mediaType | Mandatory
lingualityType | Mandatory
multilingualityType | Mandatory when applicable
language | Mandatory
size & sizeUnit | Mandatory
dataFormat (& dataFormatOther) | Mandatory
keyword | Recommended
domain | Recommended
userQuery | Mandatory when applicable
relationType | Recommended
relatedResource | Mandatory when applicable
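
As a rough illustration of how the usage levels listed above could be applied, the following Python sketch checks a metadata record for missing elements. It rests on several assumptions that are not part of OMTD-SHARE: the record is modelled as a flat dictionary keyed by element name (the actual schema is more structured), combined rows such as "size & sizeUnit" are split into separate keys, dataFormatOther is left out, and any either/or relationship between elements (for example between licence and rightsStatement) is not modelled. The names USAGE, is_present and check_record are hypothetical helpers, and the example values are placeholders rather than values from the schema's controlled vocabularies.

```python
MANDATORY = "Mandatory"
RECOMMENDED = "Recommended"
CONDITIONAL = "Mandatory when applicable"

# Element names and usage levels copied from the list above.
# Assumption: combined rows are split into separate keys.
USAGE = {
    "resourceType": MANDATORY,
    "resourceName": MANDATORY,
    "description": MANDATORY,
    "resourceIdentifier": MANDATORY,
    "public": MANDATORY,
    "version": MANDATORY,
    "contactPoint": MANDATORY,
    "contactType": MANDATORY,
    "contactPerson": RECOMMENDED,
    "contactGroup": RECOMMENDED,
    "licence": MANDATORY,
    "rightsStatement": MANDATORY,
    "nonStandardLicenceName": CONDITIONAL,
    "nonStandardLicenceTermsURL": CONDITIONAL,
    "distributionMedium": MANDATORY,
    "distributionLocation": MANDATORY,
    "resourceDocumentationInfo": RECOMMENDED,
    "resourceCreator": RECOMMENDED,
    "corpusSubtype": MANDATORY,
    "mediaType": MANDATORY,
    "lingualityType": MANDATORY,
    "multilingualityType": CONDITIONAL,
    "language": MANDATORY,
    "size": MANDATORY,
    "sizeUnit": MANDATORY,
    "dataFormat": MANDATORY,
    "keyword": RECOMMENDED,
    "domain": RECOMMENDED,
    "userQuery": CONDITIONAL,
    "relationType": RECOMMENDED,
    "relatedResource": CONDITIONAL,
}


def is_present(record, name):
    """True if the element exists and is not None, an empty string, or an empty list."""
    return record.get(name) not in (None, "", [])


def check_record(record):
    """Return (missing_mandatory, missing_recommended) as lists of element names.

    "Mandatory when applicable" elements are not reported, because whether
    they apply cannot be decided from the record alone.
    """
    missing_mandatory = [
        name for name, usage in USAGE.items()
        if usage == MANDATORY and not is_present(record, name)
    ]
    missing_recommended = [
        name for name, usage in USAGE.items()
        if usage == RECOMMENDED and not is_present(record, name)
    ]
    return missing_mandatory, missing_recommended


if __name__ == "__main__":
    # Placeholder values, for illustration only.
    example = {
        "resourceName": "Sample corpus",
        "language": "en",
        "keyword": ["text mining"],
        "public": False,
    }
    missing_mandatory, missing_recommended = check_record(example)
    print("Missing mandatory elements:", ", ".join(missing_mandatory))
    print("Missing recommended elements:", ", ".join(missing_recommended))
```

A check like this only reports absent elements; it does not validate values against the schema, which would require the full OMTD-SHARE schema definition.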
