Data Mining: Analyzing impact of outliers' detection and removal from the test sample in Blind Source Extraction using Multivariate Calibration Techniques

Abstract/Description

Blind source extraction (BSE) is an essential but challenging task when multiple sources are convolved and/or time delayed. In this article we discuss the performance of multivariate calibration techniques comprising classical least squares (CLS), inverse least squares (ILS), principal component regression (PCR), and partial least squares (PLS) regression in achieving this task in robust speech recognition systems at varying signal-to-noise ratios (SNR). We specifically analyze two methods for identifying and removing outliers from the sample: outlier sample removal (OSR) for classical least squares and descriptor selection (DS) for factor-based regression, both of which yield higher correlation between the predicted and expected results. Our experiments suggest that factor-based methods produce more reliable results than classical least squares regression; however, classical least squares is considerably more immune to white noise than the factor-based regressions. Our results show that successful detection and removal of outliers from the sample under test (SUT) can improve prediction by 37% and 56% with classical least squares and principal component regression, respectively.
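
To make the calibration techniques concrete, the following is a minimal Python sketch (not the authors' implementation) using scikit-learn and synthetic mixture data: it fits PCR and PLS models, applies a simple residual-threshold pass as a stand-in for the outlier sample removal (OSR) step named above, and reports the correlation between predicted and expected values. The data, model sizes, and 3-sigma threshold are illustrative assumptions only.

# Hypothetical sketch of factor-based calibration (PCR, PLS) with a simple
# residual-based outlier-sample-removal pass; synthetic data replaces the
# speech mixtures used in the paper.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.cross_decomposition import PLSRegression

rng = np.random.default_rng(0)

# Synthetic "mixture" data: 100 samples, 50 channels, driven by 3 latent sources.
n_samples, n_channels, n_sources = 100, 50, 3
S = rng.normal(size=(n_samples, n_sources))                  # latent source activity
A = rng.normal(size=(n_sources, n_channels))                 # mixing matrix
X = S @ A + 0.1 * rng.normal(size=(n_samples, n_channels))   # observed mixtures with noise
y = S[:, 0]                                                  # target: recover the first source

# Factor-based calibration: PCR (PCA scores + ordinary least squares) and PLS.
pcr = make_pipeline(PCA(n_components=n_sources), LinearRegression())
pls = PLSRegression(n_components=n_sources)
pcr.fit(X, y)
pls.fit(X, y)

# Outlier-sample-removal stand-in: drop samples whose calibration residual
# exceeds 3 standard deviations, then refit (threshold choice is an assumption).
resid = y - pcr.predict(X)
keep = np.abs(resid) < 3 * resid.std()
pcr.fit(X[keep], y[keep])
pls.fit(X[keep], y[keep])

# Correlation between predicted and expected values after refitting.
for name, model in [("PCR", pcr), ("PLS", pls)]:
    y_hat = np.asarray(model.predict(X)).ravel()
    print(name, "correlation:", round(float(np.corrcoef(y_hat, y)[0, 1]), 3))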

Session Theme

Data Mining

Session Type

Other

Session Chair

Dr. Sajjad Haider

Start Date

15-8-2009 5:35 PM

End Date

15-8-2009 5:55 PM
