header

Semantically Enhanced Information Retrieval: An Ontology-Based Approach

29 Pages Posted: 6 Jul 2018 Publication Status: Accepted

See all articles by Miriam Fernandez

Miriam Fernandez

The Open University - Knowledge Media Institute

Ivan Cantador

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática

Vanesa López

The Open University - Knowledge Media Institute

David Vallet

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática

Pablo Castells

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática

Enrico Motta

The Open University - Knowledge Media Institute; The Open University - Centre for Research in Computing

Abstract

Currently, techniques for content description and query processing in Information Retrieval (IR) are based on keywords, and therefore provide limited capabilities to capture the conceptualizations associated with user needs and contents. Aiming to solve the limitations of keyword-based models, the idea of conceptual search, understood as searching by meanings rather than literal strings, has been the focus of a wide body of research in the IR field. More recently, it has been used as a prototypical scenario (or even envisioned as a potential “killer app”) in the Semantic Web (SW) vision, since its emergence in the late nineties. However, current approaches to semantic search developed in the SW area have not yet taken full advantage of the acquired knowledge, accumulated experience, and technological sophistication achieved through several decades of work in the IR field. Starting from this position, this work investigates the definition of an ontology-based IR model, oriented to the exploitation of domain Knowledge Bases to support semantic search capabilities in large document repositories, stressing on the one hand the use of fully-fledged ontologies in the semantic-based perspective, and on the other hand the consideration of unstructured content as the target search space. The major contribution of this work is an innovative, comprehensive semantic search model, which extends the classic IR model, addresses the challenges of the massive and heterogeneous Web environment, and integrates the benefits of both keyword and semantic-based search. Additional contributions include: an innovative rank fusion technique that minimizes the undesired effects of knowledge sparseness on the yet juvenile SW, and the creation of a large-scale evaluation benchmark, based on TREC IR evaluation standards, which allows a rigorous comparison between IR and SW approaches. Conducted experiments show that our semantic search model obtain comparable and better performance results (in terms of MAP and P@10 values) than the best TREC automatic system.

Keywords: Semantic Web, Information Retrieval, Semantic Search

Suggested Citation

Fernandez, Miriam and Cantador, Ivan and López, Vanesa and Vallet, David and Castells, Pablo and Motta, Enrico, Semantically Enhanced Information Retrieval: An Ontology-Based Approach (2011). Available at SSRN: https://ssrn.com/abstract=3199535 or http://dx.doi.org/10.2139/ssrn.3199535

Miriam Fernandez (Contact Author)

The Open University - Knowledge Media Institute ( email )

Walton Hall
Milton Keynes
United Kingdom

Ivan Cantador

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática ( email )

Madrid
Spain

Vanesa López

The Open University - Knowledge Media Institute

Walton Hall
Milton Keynes
United Kingdom

David Vallet

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática

Madrid
Spain

Pablo Castells

Universidad Autónoma de Madrid - Departamento de Ingeniería Informática

Madrid
Spain

Enrico Motta

The Open University - Knowledge Media Institute ( email )

Walton Hall
Milton Keynes
United Kingdom

The Open University - Centre for Research in Computing ( email )

Walton Hall
Milton Keynes
United Kingdom

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
75
Abstract Views
783
PlumX Metrics