BNOSA: A Bayesian network and ontology based semantic annotation framework

Author Affiliation

Quratulain Rajput is Assistant Professor at Institute of Business Administration (IBA), Karachi

Sajjad Haider is Associate Professor at Institute of Business Administration (IBA), Karachi

Faculty / School

Faculty of Computer Sciences (FCS)


Department of Computer Science

Was this content written or created while at IBA?


Document Type


Source Publication

Journal of Web Semantics




Econometrics | Economics | Finance


The paper presents a semantic annotation framework that is capable of extracting relevant information from unstructured, ungrammatical and incoherent data sources. The framework, named BNOSA, uses ontology to conceptualize a problem domain and to extract data from the given corpora, and Bayesian networks to resolve conflicts and to predict missing data. The framework is extensible as it is capable of dynamically extracting data from any problem domain given a pre-defined ontology and a corresponding Bayesian network. Experiments have been conducted to analyze the performance of BNOSA on several problem domains. The sets of corpora used in the experiments belong to selling-purchasing websites where product information is entered by ordinary web users in a structure-free format. The results show that BNOSA performs reasonably well to find location of the data of interest using context keywords provided as part of the domain ontology. In case of more than one value being extracted for an attribute or if the value is missing, Bayesian networks identify the most appropriate value for that attribute.

Indexing Information

HJRS - Y Category, Scopus, Web of Science - Science Citation Index Expanded (SCI)

Publication Status