6.5 VRE Schema Description

Last modified by publicadmin on 2026/04/08 06:48

{under construction}

The VRE Schema is a collection of metadata schemas that describe a VRE Dataset.  Before a Dataset can be created and saved, it must be annotated with a minimal set of metadata that can optionally be later used to register the Dataset in the VRE knowledge graph.  For more information, see Annotations and Metadata Schemas

VRE Schema annotations can be provided in the VRE Portal Datasets graphical user interface (GUI).  User provided annotations (key:value pairs) are stored in JSON file format and can be exported for reuse. 

The following table describes the fields in the VRE Default Schema:

VRE Schema NameVRE Portal GUI field nameVRE Schema field nameField TypeConstraintsDescriptionRequired or OptionalMapped element in DATS_coreMapped element in openMINDS_core
EssentialTitledataset_titlestringlength <=100 charactersName of the dataset.REQUIRED for any dataset  
EssentialDataset Codedataset_codestringformat: lowercase letter and/or number without whitespace, length <=32 charactersPlatform-wise dataset unique ID.REQUIRED for any dataset  
EssentialAuthorsdataset_authorsstring, multiple entrieslength <= 50 characters, maxItems: 10One or more authors of the dataset.REQUIRED for any dataset  
EssentialTypedataset_typecheckbox, single selectselection in (GENERAL or BIDS)Whether the Dataset type is GENERAL or BIDS.  BIDS is selected for datasets that require use of the VRE BIDS validator tool.OPTIONAL (if no selection, default is GENERAL)  
EssentialDescriptiondataset_descriptionstringlength <= 5000 charactersA textual narrative comprised of one or more statements describing the datasetREQUIRED for any dataset  
EssentialModalitydataset_modalitydropdown, multi-selectselections in (anatomical approach, behavioral approach, cell counting, cell morphology, cell population, characterization, cell population imaging, computational modeling, electrophysiology, histological approach, microscopy, molecular expression approach, molecular expression characterization, morphological approach, multimodal approach, neural connectivity, neuroimaging, physiological approach)One or several modalities represented in the dataset.OPTIONAL  
EssentialCollection Methoddataset_collection_methodstring

length <= 20 characters

maxItems: 10

One or several collection methods of the dataset.OPTIONAL  
EssentialLicensedataset_licensestringlength <= 20 charactersThe license under which this dataset is shared. The use of license name abbreviations is suggested for specifying a license. Please visit Creative Commons to choose the right license.OPTIONAL  
EssentialTagsdataset_tagsstring

length <= 20 characters

maxItems: 10

format: 

Tags associated with the dataset, which will help in its discovery. These should be well known terms by the research community.OPTIONAL  
EssentialNumber of Subjectsdataset_subject_numberintegerintegerThe number of subjects in the dataset.OPTIONAL  
EssentialDataset Identifierdataset_identifierstring A code uniquely identifying the publication locally to a system or globally. Provide a Document Object Identifier (DOI) if have one.OPTIONAL  
EssentialDataset Identifier Sourcedataset_identifier_sourcestring The identifier source represents information about the organisation/namespace responsible for minting the identifiers.OPTIONAL  
EssentialDerived Fromdataset_derived_fromstring If the dataset is a derived dataset, name of the source dataset used to generate this dataset. The original dataset must also be included as a submodule in the derived dataset.OPTIONAL  
EssentialParent Dataset Identifierparent_dataset_identifierstring Unique identifier of the source dataset used to generate this dataset. Provide a Document Object Identifier (DOI) if have one.OPTIONAL  
EssentialParent Dataset Identifier Sourceparent_dataset_identifier_sourcestring The identifier source represents information about the organisation/ namespace responsible for minting the identifiers.OPTIONAL  
EssentialRelated Publication Titledataset_publication_titlestring The name of the publication of this dataset.OPTIONAL  
EssentialRelated Publication Identifierdataset_publication_identifierstring A code uniquely identifying the publication locally to a system or globally. Provide a Document Object Identifier (DOI) if you have one.OPTIONAL  
EssentialRelated Publication Identifier Sourcedataset_publication_identifier_sourcestring Information about the organisation/namespace responsible for minting the identifier.OPTIONAL  
SubjectsSubject IDsubject_idstring The ID of the subject.REQUIRED if schema is used  
SubjectsSubject Sexsubject_sexdropdown, single-selectselection in (Female, Male, Unknown, Other)The sex of the subject.REQUIRED if schema is used  
SubjectsSubject Speciessubject_speciesdropdown, single-selectselection in (Homo sapiens, Macaca fascicularis, Macaca mulatta, Mus musculus, Mustela putorius, Rattus norvegicus, Other)The species of the subject.REQUIRED if schema is used  
SubjectsSubject Age Categorysubject_agecategorydropdown, single-selectselection in (“Neonate, Infant, Juvenile, Young adult, Adult, Unknown, Other”)The age category of the subject.REQUIRED if schema is used  
DiseaseDisease Namedataset_disease_namestring The name of the disease.REQUIRED if schema is used  
DiseaseDisease Diagnosis Datedaatset_disease_datesstringformat: date-timeRelevant dates, such as the diagnosis date of the disease, may be provided.OPTIONAL  
DiseaseDisease Statusdataset_disease_statusstring A value (and ideally also an ontology annotation provided as a valueIRI) for the disease status.OPTIONAL  
DiseaseIdentifierdataset_disease_identifierstring A code uniquely identifying an entity locally to a system or globally.OPTIONAL  
DiseaseIdentifier Sourcedataset_disease_identifier_sourcestring The identifier source represents information about the organisation/namespace responsible for minting the identifiers.OPTIONAL  
DistributionDataset Distribution Technical Formatdataset_distribution_formatarray of string Technical format of dataset distribution, i.e., "NIfTI", "DICOM". OPTIONAL  
DistributionDataset Distribution Access Landing Pagedataset_distribution_landing_pagestringformat: uriWeb address (URL) where the original dataset can be found. Default value points to the README.md for the dataset.REQUIRED if schema is used  
DistributionDataset Distribution Access Authorizationdataset_distribution_authorizationdropdown, single-selectselection in (“Public", "Registered" or "Private")This must be one of "Public", "Registered" or "Private". When this field is absent the value will be treated as "Public". "Public" = available to anyone; "Registered" = available to user authorized by the creator; "Private" = available only to the creator.OPTIONAL  
ContributorsContributor Type 

dataset_contributors

array of stringselection in ("Person", "Organization")One or more Persons or Organizations can be provided as the dataset's contributor(s).REQUIRED if schema is used  
ContributorsEmail

dataset_contributor_person_email , or 

dataset_contributor_organization_email

stringformat: emailEmail address of the contributor.REQUIRED if schema is used  
ContributorsLast Name

dataset_contributor_person_lastname , or 

dataset_contributor_organization_lastname

string Last name of the contributor.REQUIRED if schema is used  
ContributorsFirst Name

dataset_contributor_person_firstname , or 

dataset_contributor_organization_firstname

string First name of the contributor.REQUIRED if schema is used  

https://i.creativecommons.org/l/by-sa/4.0/88x31.png

Copyright © 2022, Indoc Research. This work is licensed under a Creative Commons Attribution-ShareAlike 4.0  International License.