Sempala: Interactive SPARQL Query Processing on Hadoop
Abstract: Driven by initiatives like Schema.org, the amount of semantically annotated data is expected to grow steadily towards massive scale, requiring cluster-based solutions to query it. At the same time, Hadoop has become dominant in the area of Big Data processing, with large infrastructures already deployed and used in manifold application fields. For Hadoop-based applications, a common data pool (HDFS) provides many synergy benefits, making it very attractive to use these infrastructures for semantic data processing as well. Indeed, existing SPARQL-on-Hadoop (MapReduce) approaches have already demonstrated very good scalability; however, query runtimes are rather slow due to the underlying batch processing framework. While this is acceptable for data-intensive queries, it is not satisfactory for the majority of SPARQL queries, which are typically much more selective and require only small subsets of the data. In this paper, we present Sempala, a SPARQL-over-SQL-on-Hadoop approach designed with selective queries in mind. Our evaluation shows performance improvements by an order of magnitude compared to existing approaches, paving the way for interactive-time SPARQL query processing on Hadoop.
- Location: Deutsche Nationalbibliothek Frankfurt am Main
- Extent: Online resource
- Edition: Postprint
- Language: English
- Notes:
  - Mika P. et al. (eds) The Semantic Web – ISWC 2014. ISWC 2014. Lecture Notes in Computer Science, vol 8796, DOI 10.1007/978-3-319-11964-9_11, ISBN 978-3-319-11963-2
  - CC BY-NC-ND 4.0, http://creativecommons.org/licenses/by-nc-nd/4.0/deed.de
- Classification: Computer science
- Keywords: Hadoop, RDF, SPARQL, Impala, Semantic Web
- Event: Publication
- (where): Freiburg
- (who): University
- (when): 2014
- Creator
- Contributing persons and organizations:
  - Albert-Ludwigs-Universität Freiburg
  - Institut für Informatik
  - Technische Fakultät
- DOI: 10.1007/978-3-319-11964-9_11
- URN: urn:nbn:de:bsz:25-freidok-122758
- Rights information: Access to the object is unrestricted.
- Last updated: 25 March 2025, 13:46 CET
Data partner
Deutsche Nationalbibliothek. If you have questions about the object, please contact the data partner.
Contributors
- Schätzle, Alexander
- Przyjaciel-Zablocki, Martin
- Neu, Antony
- Lausen, Georg
- Albert-Ludwigs-Universität Freiburg
- Institut für Informatik
- Technische Fakultät
- University
Created
- 2014