Data Publication

The NIST Extensible Resource Data Model (NERDm): JSON schemas for rich description of data resources

Raymond L. Plante Author's orcid, Gretchen Greene Author's orcid, Deoyani Nandrekar-Heines Author's orcid, Chuan Lin
Contact: Raymond Plante.
Identifier: doi:10.18434/mds2-1870
Version: 1.0... First Released: 2025-02-13 Revised: 2025-02-13
The NIST Extensible Resource Data Model (NERDm) is a set of schemas for encoding in JSON format metadata that describe digital resources. The variety of digital resources it can describe includes not only digital data sets and collections, but also software, digital services, web sites and portals, and digital twins. It was created to serve as the internal metadata format used by the NIST Public Data Repository and Science Portal to drive rich presentations on the web and to enable discovery; however, it was also designed to enable programmatic access to resources and their metadata by external users. Interoperability was also a key design aim: the schemas are defined using the JSON Schema standard, metadata are encoded as JSON-LD, and their semantics are tied to community ontologies, with an emphasis on DCAT and the US federal Project Open Data (POD) models. Finally, extensibility is also central to its design: the schemas are composed of a central core schema and various extension schemas. New extensions to support richer metadata concepts can be added over time without breaking existing applications.

Validation is central to NERDm's extensibility model. Consuming applications should be able to choose which metadata extensions they care to support and ignore terms and extensions they don't support. Furthermore, they should not fail when a NERDm document leverages extensions they don't recognize, even when on-the-fly validation is required. To support this flexibility, the NERDm framework allows documents to declare what extensions are being used and where. We have developed an optional extension to the standard JSON Schema validation (see ejsonschema below) to support flexible validation: while a standard JSON Schema validater can validate a NERDm document against the NERDm core schema, our extension will validate a NERDm document against any recognized extensions and ignore those that are not recognized.

The NERDm data model is based around the concept of resource, semantically equivalent to a schema.org Resource, and as in schema.org, there can be different types of resources, such as data sets and software. A NERDm document indicates what types the resource qualifies as via the JSON-LD "@type" property. All NERDm Resources are described by metadata terms from the core NERDm schema; however, different resource types can be described by additional metadata properties (often drawing on particular NERDm extension schemas). A Resource contains Components of various types (including DCAT-defined Distributions) that are considered part of the Resource; specifically, these can include downloadable data files, hierachical data collecitons, links to web sites (like software repositories), software tools, or other NERDm Resources. Through the NERDm extension system, domain-specific metadata can be included at either the resource or component level. The direct semantic and syntactic connections to the DCAT, POD, and schema.org schemas is intended to ensure unambiguous conversion of NERDm documents into those schemas.

As of this writing, the Core NERDm schema and its framework stands at version 0.7 and is compatible with the "draft-04" version of JSON Schema. Version 1.0 is projected to be released in 2025. In that release, the NERDm schemas will be updated to the "draft2020" version of JSON Schema. Other improvements will include stronger support for RDF and the Linked Data Platform through its support of JSON-LD.
Research Areas
NIST R&D: Information Technology: Data and informatics
Keywords: metadataschemadata managementdata repositoriesJSON-LDShow more...
These data are public.
Data and related material can be found at the following locations:
  The NERDm JSON Schema Files
This directory contains the latest (and previous) versions of the core NERDm Schema and various extensions. All files with names of the form, "*-schema*.json" are JSON Schema definition files; those that do not include a version in the file name represent the latest versions. The latest version of the core schema is called "nerdm-schema.json", and schemas with names of the form, "nerdm-[ext]-schema.json", contain extension schemas. All NERDm schemas here are documented internally, including semantic definitions of all terms.
  ejsonschema: Software for Validating JSON supporting extension schemas
This software repository provides Python software that extends the community software library, python-jsonschema (https://github.com/python-jsonschema/jsonschema) to support NERDm's extension framework. Use the scripts/validate script to validate NERDm documents on the command line. (Type "validate -h" for more information.)
  Example NERDm Documents
This folder contains example NERDm documents that illustrate the NERDm data model and use of extension schemas. These all can be validated using the ejsonschema validate script.
  NERDm Support Software
This software repository includes a Python package, "nistoar.nerdm", that aids in creating and handling NERDm documents. In particular, it includes converters that convert NERDm instances into other formats (like POD, schema.org, and DCAT). It can also transform NERDm documents conforming to earlier versions of the schemas to that of the latest versions.
  README
A general overview of the NERDm schema design and files
Version: 1.0... First Released: 2025-02-13 Revised: 2025-02-13
Cite this dataset
Raymond L. Plante, Gretchen Greene, Deoyani Nandrekar-Heines, Chuan Lin (2017), The NIST Extensible Resource Data Model (NERDm): JSON schemas for rich description of data resources, National Institute of Standards and Technology, https://doi.org/10.18434/mds2-1870 (Accessed 2025-07-18)
Repository Metadata
Machine-readable descriptions of this dataset are available in the following formats:
NERDm
Access Metrics
Metrics data is not available for all datasets, including this one. This may be because the data is served via servers external to this repository.