Date of Submission

2024

Supervisor

Dr. Sajjad Haider, Professor, Department of Computer Science, Institute of Business Administration, Karachi

Committee Member 1

Dr. Sajjad Haider, Supervisor, Department of Computer Science, Institute of Business Administration, Karachi

Committee Member 2

Dr. Quratulain Rajput, Examiner – I, Department of Computer Science, Institute of Business Administration, Karachi

Committee Member 3

Dr. Tariq Mahmood, Examiner – II, Department of Computer Science, Institute of Business Administration, Karachi

Degree

Master of Science in Computer Science

Department

Department of Computer Science

School

School of Mathematics and Computer Science (SMCS)

Keywords

Ontology knowledge, Natural language processing, Pattern matching rules, Feature extraction, Clinical notes

Abstract

Feature extraction from clinical narratives involves the application of natural language processing (NLP) techniques and knowledge extraction. However, it has been observed that integrating NLP techniques with an ontology-guided approach in clinical notes has not been widely used in the feature extraction pipeline. Consequently, the utilization of ontology knowledge in feature extraction has not been extensively explored.

This thesis demonstrates the feasibility of integrating ontology knowledge into feature extraction alongside NLP techniques. The approach involves building a system that incorporates ontology knowledge using owlready2 on the Harvard medical dataset. To enable pattern matching, rules are formulated using regular expressions.

The findings reveal that the extracted features, while sufficient for demonstrating feasibility, can be effectively employed in any analytical model. The pattern matching rules are specifically tailored to the Harvard medical dataset, suggesting potential modifications may be required to apply this system to other datasets. Although this thesis focuses on a subset of medical ontologies, it establishes that the system is adaptable to incorporating additional medical ontologies.

Document Type

Restricted Access

Submission Type

Thesis

Available for download on Monday, June 11, 2029

Share

COinS