DDI-RDF Discover Vocabulary (Disco)
An RDF vocabulary for publishing metadata about studies and data sets.
Some of the relevant classes seem to be (see also Appendix A - Vocabulary Reference):
disco:Study
: Represents a process by which a data set was generated or created. In our case this would be the Ctrl + All. Computing the Social project.
disco:LogicalDataSet
: Describes a logical data set. Logical in the sense that it is not tied to a specific physical representation.
disco:DataFile
: A physical representation of a logical data set.
There are more classes, e.g. for describing the instruments that were used to collect or generate data, as well as others that do not seem relevant to us.
The namespace is http://rdf-vocabulary.ddialliance.org/discovery#
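To make the three classes concrete, here is a minimal sketch of how a study, its logical data set, and one data file could be described with this namespace. It uses plain Python (no RDF library) and prints N-Triples. The `http://example.org/` URIs are hypothetical placeholders, and the linking properties `disco:product` and `disco:dataFile` are recalled from the Disco specification and should be checked against Appendix A before relying on them.

```python
# Namespace given above, plus the standard rdf:type property.
DISCO = "http://rdf-vocabulary.ddialliance.org/discovery#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical base URI and resource names, for illustration only.
EX = "http://example.org/"
study = EX + "study/computing-the-social"
dataset = EX + "dataset/1"
datafile = EX + "dataset/1/file.csv"

triples = [
    (study, RDF_TYPE, DISCO + "Study"),
    (dataset, RDF_TYPE, DISCO + "LogicalDataSet"),
    (datafile, RDF_TYPE, DISCO + "DataFile"),
    # Assumed linking properties; verify the names in Appendix A.
    (study, DISCO + "product", dataset),
    (dataset, DISCO + "dataFile", datafile),
]


def to_ntriples(triples):
    """Serialize triples whose terms are all URIs as N-Triples lines."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


print(to_ntriples(triples))
```

Keeping the logical data set and the data file as separate resources means that a second physical representation of the same data can be added later without touching the description of the data set itself.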
Relationship to other Vocabularies
Data Catalog Vocabulary (DCAT)
disco:LogicalDataSet is a subclass of dcat:Dataset, and disco:DataFile is a subclass of dcat:Distribution.
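The practical consequence of these subclass relations is that a DCAT-aware consumer can treat Disco resources as DCAT ones. The following sketch illustrates this with a hand-written lookup table rather than a real RDFS reasoner. The function name `inferred_types` is made up for this example, and the table contains only the two relations stated above.

```python
DISCO = "http://rdf-vocabulary.ddialliance.org/discovery#"
DCAT = "http://www.w3.org/ns/dcat#"

# The two rdfs:subClassOf relations mentioned above, and nothing else.
SUBCLASS_OF = {
    DISCO + "LogicalDataSet": DCAT + "Dataset",
    DISCO + "DataFile": DCAT + "Distribution",
}


def inferred_types(rdf_type):
    """Return the asserted type followed by its superclasses from SUBCLASS_OF."""
    types = [rdf_type]
    # Walk up the (single-parent) subclass chain until no parent is known.
    while types[-1] in SUBCLASS_OF:
        types.append(SUBCLASS_OF[types[-1]])
    return types


for t in inferred_types(DISCO + "LogicalDataSet"):
    print(t)
```

In a real setup this inference would be left to a triple store or reasoner that loads the Disco ontology; the table only makes the idea visible.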
The Disco specification has a section on how it differs from DCAT:
DCAT is richer for the description of collections and catalogue. Disco supports richer descriptions of groups of datasets or individual datasets. In this spec, some of our examples are partially based on DCAT (and we will indicate when this is the case).
Seems to be a case of W3C bureaucracy.
RDF Data Cube Vocabulary (QB)
Data Cube represents both the structure of a data set and the data itself, whereas Disco only describes the metadata.
From the Disco specification:
Data Cube provides for the description of the structure of such cubes, but also for the representation of the cube data itself, that is, the observations that make up the cube dataset. This is not the case for the Discovery Vocabulary, which only describes the structure of a dataset, but is not concerned with representing the actual data in it. The actual data is assumed to sit in a data file (e.g., a CSV file, or in a proprietary statistics package file format) that is not represented in RDF.