Map-side merge joins for scalable SPARQL BGP processing

Abstract: In recent times, it has been widely recognized that, due to their inherent scalability, frameworks based on MapReduce are indispensable for so-called "Big Data" applications. However, for Semantic Web applications using SPARQL, there is still a demand for sophisticated MapReduce join techniques for processing basic graph patterns, which are at the core of SPARQL. Renowned for their stable and efficient performance, sort-merge joins have become widely used in DBMSs. In this paper, we demonstrate the adaptation of merge joins for SPARQL BGP processing with MapReduce. Our technique supports both n-way joins and sequences of join operations by applying merge joins within the map phase of MapReduce while the reduce phase is only used to fulfill the preconditions of a subsequent join iteration. Our experiments with the LUBM benchmark show an average performance benefit between 15% and 48% compared to other MapReduce based approaches while at the same time scaling linearly with the RDF dataset size

Location
Deutsche Nationalbibliothek Frankfurt am Main
Extent
Online-Ressource
Language
Englisch
Notes
IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom), 2013 : 2 - 5 Dec. 2013, Bristol, United Kingdom ; [including workshops] / sponsors: IEEE Computer Society .... Piscataway, NJ: IEEE, 2013, Seite 631-749. DOI 10.1109/CloudCom.2013.9, isbn: 978-0-7695-5095-4
cc_by_nc_nd http://creativecommons.org/licenses/by-nc-nd/4.0/deed.de cc

Classification
Informatik
Keyword
Hadoop
RDF
Semantic Web
SPARQL

Event
Veröffentlichung
(where)
Freiburg
(who)
Universität
(when)
2013
Contributor
Technische Fakultät
Institut für Informatik
Albert-Ludwigs-Universität Freiburg

DOI
10.1109/CloudCom.2013.9
URN
urn:nbn:de:bsz:25-freidok-122735
Rights
Der Zugriff auf das Objekt ist unbeschränkt möglich.
Last update
25.03.2025, 1:50 PM CET

Data provider

This object is provided by:
Deutsche Nationalbibliothek. If you have any questions about the object, please contact the data provider.

Associated

Time of origin

  • 2013

Other Objects (12)