header

SemaPlorer - Interactive Semantic Exploration of Data and Media Based on a Federated Cloud Infrastructure

18 Pages Posted: 9 Jul 2018 Publication Status: Accepted

See all articles by Simon Schenk

Simon Schenk

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST)

Carsten Saathoff

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST)

Steffen Staab

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST); University of Southampton - Faculty of Engineering, Science and Mathematics

Ansgar Scherp

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST)

Abstract

SemaPlorer is an easy to use application that allows end users to interactively explore and visualize a very large, mixed-quality and semantically heterogeneous distributed semantic data set in real-time. Its purpose is to acquaint oneself about a city, touristic area, or other area a user is interested in. By visualizing the data using a map, media, and different context views, SemaPlorer advances beyond simple storage and retrieval of large numbers of triples, as the interaction with the large data set is driven by the user. SemaPlorer leverages different semantic data sources such as DBpedia, GeoNames, WordNet, and personal FOAF files. These make a significant portion of the data provided for the Billion Triple Challenge. SemaPlorer intriguingly connects with a large Flickr data set converted to RDF. The storage infrastructure bases on Amazon's Elastic Computing Cloud (EC2) and Simple Storage Service. We apply NetworkedGraphs as a conceptual layer on top of EC2, realizing a large, federated data infrastructure for semantically heterogeneous data sources from within and outside of the cloud. Therefore, the application is scalable with respect to the amount of distributed components working together as well as the number of triples managed overall. Hence, SemaPlorer is exible enough to leverage for exploration of almost arbitrary additional data sources that might be added in the future. We conducted a formative evaluation of the SemaPlorer application with 20 test subjects. The results of this evaluation are analyzed and their implication to future work discussed. SemaPlorer won the first prize at the Billion Triple Challenge of the International Semantic Web Conference in Karlsruhe, 2008.

Keywords: Heterogeneous Semantic Data, Real-time Exploration and Visualization, Linked Open Data, Faceted Browsing, Amazon EC2, Amazon S3, Billion Triple Challenge

Suggested Citation

Schenk, Simon and Saathoff, Carsten and Staab, Steffen and Scherp, Ansgar, SemaPlorer - Interactive Semantic Exploration of Data and Media Based on a Federated Cloud Infrastructure (2009). Available at SSRN: https://ssrn.com/abstract=3199457 or http://dx.doi.org/10.2139/ssrn.3199457

Simon Schenk (Contact Author)

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST) ( email )

Universitaetsstrasse 1, Gebäude B
Campus Koblenz
Koblenz, 56070
Germany

Carsten Saathoff

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST) ( email )

Universitaetsstrasse 1, Gebäude B
Campus Koblenz
Koblenz, 56070
Germany

Steffen Staab

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST) ( email )

Universitaetsstrasse 1, Gebäude B
Campus Koblenz
Koblenz, 56070
Germany

University of Southampton - Faculty of Engineering, Science and Mathematics ( email )

United Kingdom

Ansgar Scherp

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST) ( email )

Universitaetsstrasse 1, Gebäude B
Campus Koblenz
Koblenz, 56070
Germany

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
28
Abstract Views
606
PlumX Metrics