Dr Ikechukwu Nkisi-Orji i.nkisi-orji@rgu.ac.uk
Chancellor's Fellow
Ontology driven information retrieval.
Nkisi-Orji, Ikechukwu
Authors
Contributors
Professor Nirmalie Wiratunga n.wiratunga@rgu.ac.uk
Supervisor
Dr Stewart Massie s.massie@rgu.ac.uk
Supervisor
Dr Kit-ying Hui k.hui@rgu.ac.uk
Supervisor
Rachel Heaven
Supervisor
Abstract
Ontology-driven information retrieval deals with the use of entities specified in domain ontologies to enhance search and browse. The entities or concepts of lightweight ontological resources are traditionally used to index resources in specialised domains. Indexing with concepts is often achieved manually and reusing them to enhance search remains a challenge. Other challenges range from the difficulty in merging multiple ontologies for use in retrieval to the problem of integrating concept-based search into existing search systems. We mainly encounter these challenges in enterprise search environments, which have not kept pace with Web search engines and mostly rely on full-text search systems. Full-text search systems are keyword-based and suffer from well-known vocabulary mismatch problems. Ontologies model domain knowledge and have the potential for use in understanding the unstructured content of documents. In this thesis, we investigate the challenges of using domain ontologies for enhancing search in enterprise systems. Firstly, we investigate methods for annotating documents by identifying the best concepts that represent their contents. We explore ways to overcome the challenges of insufficient textual features in lightweight ontologies and introduce an unsupervised method for annotating documents based on generating concept descriptors from external resources. Specifically, we augment concepts with descriptive textual content by exploiting the taxonomic structure of an ontology to ensure that we generate useful descriptors. Secondly, the need often arises for cross-ontology reasoning when using multiple ontologies in ontology-driven search. Once again, we attempt to overcome the absence of rich features in lightweight ontologies by exploring the use of background knowledge for the alignment process. We propose novel ontology alignment techniques which integrate string metrics, semantic features, and term weights for discovering diverse correspondence types in supervised and unsupervised ontology alignment. Thirdly, we investigate different representational schemes for queries and documents and explore semantic ranking models using conceptual representations. Accordingly, we propose a semantic ranking model that incorporates the knowledge of concept relatedness and a predictive model to apply semantic ranking only when it is deemed beneficial for retrieval. Finally, we conduct comprehensive evaluations of the proposed methods and discuss our findings.
Citation
NKISI-ORJI, I. 2019. Ontology driven information retrieval. Robert Gordon University [online], PhD thesis. Available from: https://openair.rgu.ac.uk
Thesis Type | Thesis |
---|---|
Deposit Date | Oct 10, 2019 |
Publicly Available Date | Oct 10, 2019 |
Keywords | Information retrieval; Enterprise information retrieval; Enterprise search; Document annotation; Ontologies; Search algorithms |
Public URL | https://rgu-repository.worktribe.com/output/638312 |
Award Date | May 31, 2019 |
Files
NKISI-ORJI 2019 Ontology driven information retrieval
(2.2 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc/4.0/
Copyright Statement
© The Author.
You might also like
Taxonomic corpus-based concept summary generation for document annotation.
(2017)
Presentation / Conference Contribution
Ontology alignment based on word embedding and random forest classification.
(2019)
Presentation / Conference Contribution
Clood CBR: towards microservices oriented case-based reasoning.
(2020)
Presentation / Conference Contribution
Counterfactual explanations for student outcome prediction with Moodle footprints.
(2021)
Presentation / Conference Contribution
DisCERN: discovering counterfactual explanations using relevance features from neighbourhoods.
(2021)
Presentation / Conference Contribution
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search