PHDD is an RDF vocabulary for the description of the physical properties of data in rectangular format including comma-separated values (CSV) or similar. It focuses purely on the physical properties of files and variables like case quantity and records per case or data type and start position. PHDD enables the publication of data as Linked Data in the Semantic Web. PHDD could be used in a standalone way (i.e., adding relevant information to CSV files) or in a more comprehensive description of data together with the possibilities of DDI-RDF Discovery (DISCO) and Data Catalog Vocabulary (DCAT) for providing a data catalog for discovery purposes. The presentation will provide an overview of the most important components of PHDD, the mapping of the properties to related elements of DDI XML Lifecycle and Codebook, and the interplay of PHDD with DISCO and DCAT. The PHDD vocabulary is currently work in progress. It is planned for publication as a specification of the DDI Alliance.
Joachim Wackerow and Thomas Bosch (GESIS – Leibniz Institute for the Social Sciences)
Copyright is held by the author(s).
Member of collection