A Bicluster-Based Sequential Interpolation Imputation Method for Estimation of Missing Values in Microarray Gene Expression Data

Chandra      Das; Shilpi      Bose; Samiran      Chattopadhyay; Matangini      Chattopadhyay; Alamgir      Hossain

doi:10.2174/1574893612666170106102019

Abstract

Background: Gene expression matrix produced by DNA microarray technology inexorably contains multiple missing entries due to experimental problems. Prediction of missing values in gene expression matrix is essential as algorithms analyzing gene expression typically need a matrix without missing values.

Objective: The objective of this paper is to present a novel bicluster-based sequential interpolation imputation method to predict missing values in gene expression data.

Method: For each missing entry, this method first generates a bicluster by selecting a number of correlated genes and samples for that missing position and then applies interpolation based approximation technique on that bicluster. This method starts imputation from the gene with the minimum number of missing values and continues imputation by reusing the already imputed values.

Results: The result of the proposed method is compared with seven well known existing estimation techniques over nine different data sets. The metric used to compare the performance are normalized root mean square error (NRMSE) and average distance between partition errors (ADBPE).

Conclusion: Performance of the proposed method is observed to be better than the well-known methods in a variety of data sets. The novelty of this approach lies in applying interpolation technique in the identified local structure (bicluster) for predicting missing values.

Keywords: Biclustering, DNA microarray, gene expression data, missing value estimation.

« Previous Next »

Graphical Abstract

Rights & Permissions Print Cite

Article Metrics

20

4

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/1574893612666170106102019	Print ISSN 1574-8936
Publisher Name Bentham Science Publisher	Online ISSN 2212-392X

Current Bioinformatics

A Bicluster-Based Sequential Interpolation Imputation Method for Estimation of Missing Values in Microarray Gene Expression Data

Abstract Play Pause

Graphical Abstract

Related Journals

Related Books

Abstract