Preprocessing / Impute missing values
Description
Replaces missing values in the data with estimated values.
Parameters
- Imputation method (mean, median, knn) [knn]
- Number of neighbors (1...10000) [5]
- Maximum number of missing values on a row (0...100) [20]
- Maximum number of missing values on a column (0...100) [20]
Details
Missing values can be replaced with the mean or median of the array or the gene, or they can be
estimated from the specified number of closest neighbors (knn). If the number of missing values
on any row or column exceeds the selected maximum, the missing values will not be estimated.
This protects users from imputing too large a fraction of their dataset, which would probably
affect the results drastically.
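
The behaviour described above can be illustrated with a minimal Python sketch. It is not the tool's actual implementation. It assumes that rows are genes and columns are arrays, that missing values are stored as NaN, and that the two maximum parameters are percentages (suggested by their 0...100 range, but not stated here). The function names exceeds_limits, impute_simple and impute_knn are hypothetical, and the neighbor search is a plain Euclidean-style distance over the columns two genes share, which may differ from what the tool uses.

```python
import numpy as np


def exceeds_limits(data, max_row_pct=20, max_col_pct=20):
    """True if any row or column has more missing values than allowed.

    Assumption: the limits are percentages of the row/column length.
    """
    missing = np.isnan(data)
    row_pct = missing.mean(axis=1) * 100
    col_pct = missing.mean(axis=0) * 100
    return bool((row_pct > max_row_pct).any() or (col_pct > max_col_pct).any())


def impute_simple(data, method="mean", axis=1):
    """Fill NaNs with the mean or median of the row (axis=1) or column (axis=0)."""
    stat = np.nanmean if method == "mean" else np.nanmedian
    fill = stat(data, axis=axis, keepdims=True)
    return np.where(np.isnan(data), fill, data)


def impute_knn(data, k=5):
    """Fill each NaN with the average of that column over the k nearest rows."""
    out = data.copy()
    for i in np.flatnonzero(np.isnan(data).any(axis=1)):
        # Mean squared difference over the columns observed in both rows.
        diff = data - data[i]
        shared = ~np.isnan(diff)
        count = shared.sum(axis=1)
        squared = np.where(shared, diff ** 2, 0.0).sum(axis=1)
        dist = np.where(count > 0, squared / np.maximum(count, 1), np.inf)
        dist[i] = np.inf  # a row is never its own neighbor
        order = np.argsort(dist)
        for j in np.flatnonzero(np.isnan(data[i])):
            # Keep only comparable rows that have a value in column j.
            candidates = [r for r in order
                          if np.isfinite(dist[r]) and not np.isnan(data[r, j])]
            neighbors = candidates[:k]
            if neighbors:
                out[i, j] = data[neighbors, j].mean()
    return out


# Toy matrix: 3 genes x 3 arrays, one missing value.
example = np.array([[1.0, 2.0, 3.0],
                    [1.0, 2.0, np.nan],
                    [10.0, 20.0, 30.0]])

if exceeds_limits(example, max_row_pct=50, max_col_pct=50):
    result = example  # too many missing values: leave them unestimated
else:
    result = impute_knn(example, k=1)
```

With the default limits of 20, this toy matrix would be left untouched, because one of three values is missing on the second row and on the third column. That is the safeguard the Details paragraph describes.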
Output
A tabular text file with intensity values for the genes.