Published in

Proceedings of the 27th Annual ACM Symposium on Applied Computing - SAC '12

DOI: 10.1145/2245276.2245343

Links

Tools

Export citation

Search in Google Scholar

Compression of RDF dictionaries

Proceedings article published in 2012 by Miguel A. Martínez Prieto, Javier D. Fernández, Rodrigo Cánovas ORCID
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

The use of dictionaries is a common practice among those applications performing on huge RDF datasets. It allows long terms occurring in the RDF triples to be replaced by short IDs which reference them. This decision greatly compacts the dataset and thus mitigates its scalability issues. However, the dictionary size is not negligible and the techniques used for its representation also suffer from scalability limitations. This paper focuses on this scenario by adapting compression techniques for string dictionaries to the case of RDF. We propose a novel technique: Dcomp, which can be tuned to represent the dictionary in compressed space (22--64%) and to perform in a few microseconds (1--50μs).