
Automatic Imputation
auto_impute.RdThis function automatically selects and applies the most suitable imputation method for the given dataset. If Quality Control (QC) samples are present, the method that best stabilizes them (i.e., yields the lowest median coefficient of variation) is chosen. Otherwise, it defaults to a sample-size-based strategy:
less than 30 samples: Sample minimum imputation
between 30 and 100 samples: Minimum probability imputation
more than 100 samples: MissForest imputation
Arguments
- exp
- group_col
The column name in sample_info for groups. Default is "group". Can be NULL when no group information is available.
- qc_name
The name of QC samples in the
group_colcolumn. Default is "QC". Only used whengroup_colis not NULL.- to_try
Imputation functions to try. A list. Default includes:
impute_zero(): zero imputationimpute_sample_min(): sample minimum imputationimpute_half_sample_min(): half sample minimum imputationimpute_sw_knn(): sample-wise KNN imputationimpute_fw_knn(): feature-wise KNN imputationimpute_bpca(): BPCA imputationimpute_ppca(): PPCA imputationimpute_svd(): SVD imputationimpute_min_prob(): minimum probability imputationimpute_miss_forest(): MissForest imputation
- info
Internal parameter used by
auto_clean().