Impute the missing values in the peak table of the object using a
random forest. The estimated error in the imputation is logged.
It is recommended to set the seed number for reproducibility
(it is called random forest for a reason).
This a wrapper around missForest.
Use parallelize = "variables" to run in parallel for faster testing.
NOTE: running in parallel prevents user from setting a seed number.
Arguments
- object
a
SummarizedExperimentobject- all_features
logical, should all features be used? If FALSE (the default), flagged features are removed before imputation.
- assay.type
character, assay to be used in case of multiple assays
- name
character, name of the resultant assay in case of multiple assays
- ...
passed to
missForest
See also
missForest for detail about the algorithm
and the parameters
Examples
data(toy_notame_set)
missing <- mark_nas(toy_notame_set, 0)
set.seed(38)
imputed <- impute_rf(missing)
#> INFO [2025-10-29 12:03:01]
#> Starting random forest imputation at 2025-10-29 12:03:01.805751
#> INFO [2025-10-29 12:03:02] Out-of-bag error in random forest imputation: 0.466
#> INFO [2025-10-29 12:03:02] Random forest imputation finished at 2025-10-29 12:03:02.782594
#>