parallelization.Rd
% Generated by roxygen2: do not edit by hand
% Please edit documentation in R/parallelization.R
\name{parallelization}
\alias{parallelization}
\title{Supported parallelization methods}
\description{
mlr enables parallel computing through its integration with the \link[parallelMap:parallelMap]{parallelMap::parallelMap} package, which supports all major parallelization backends for R.
You can start parallelization with \code{\link[parallelMap]{parallelStart}*}, where \code{*} is replaced by the name of the chosen backend (e.g. \code{Socket}, \code{Multicore}, \code{MPI} or \code{BatchJobs}).
\link[parallelMap:parallelStop]{parallelMap::parallelStop} stops the parallelization backend again.
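
For example, a minimal sketch of starting and stopping a backend (the socket backend and the number of workers are illustrative choices):
\preformatted{
library(parallelMap)

parallelStartSocket(2)        # start a socket cluster with 2 workers
# parallelStartMulticore(2)   # alternative: forked processes (not available on Windows)

# ... parallelized mlr calls go here ...

parallelStop()                # shut the backend down again
}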

Parallelization is divided into different levels and is automatically carried out for the first level that occurs. For example, if you call \code{\link{resample}} after \link[parallelMap:parallelStart]{parallelMap::parallelStart}, each resampling iteration is a parallel job, and possible underlying calls such as parameter tuning are not parallelized further.
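
A sketch of this behavior, assuming a socket backend with two workers and \code{classif.rpart} as an illustrative learner:
\preformatted{
library(mlr)
library(parallelMap)

parallelStartSocket(2)
rdesc = makeResampleDesc("CV", iters = 10)
# each of the 10 cross-validation iterations is executed as a parallel job;
# any nested steps (e.g. tuning) would not be parallelized further
r = resample("classif.rpart", iris.task, rdesc)
parallelStop()
}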

The supported levels of parallelization are:
\describe{
\item{\code{"mlr.resample"}}{Each resampling iteration (a train/test step) is a parallel job.}
\item{\code{"mlr.benchmark"}}{Each experiment "run this learner on this data set" is a parallel job.}
\item{\code{"mlr.tuneParams"}}{Each evaluation in hyperparameter space "resample with these parameter settings" is a parallel job.
How many of these can be run independently in parallel depends on the tuning algorithm.
For grid search or random search there is no limit, but for other tuners it depends on how many points to evaluate are produced in each iteration of the optimization.
If a tuner works in a purely sequential fashion, we cannot work magic and the hyperparameter evaluation will also run sequentially. But note that you can still parallelize the underlying resampling.}
\item{\code{"mlr.selectFeatures"}}{Each evaluation in feature space "resample with this feature subset" is a parallel job. The same comments as for \code{"mlr.tuneParams"} apply here.}
\item{\code{"mlr.ensemble"}}{For all ensemble methods, the training and prediction of each individual learner is a parallel job.
Supported ensemble methods are the \link{makeBaggingWrapper}, \link{makeCostSensRegrWrapper}, \link{makeMulticlassWrapper}, \link{makeMultilabelBinaryRelevanceWrapper} and the \link{makeOverBaggingWrapper}.}
}
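
For example, to restrict parallelization to the tuning level (the learner, parameter set and resampling below are illustrative):
\preformatted{
library(mlr)
library(parallelMap)

# only the "mlr.tuneParams" level is parallelized; everything else runs sequentially
parallelStartSocket(2, level = "mlr.tuneParams")

ps = makeParamSet(
  makeDiscreteParam("cp", values = c(0.01, 0.05, 0.1))
)
ctrl = makeTuneControlGrid()
rdesc = makeResampleDesc("CV", iters = 3)

# each of the 3 grid points is evaluated as a parallel job
res = tuneParams("classif.rpart", iris.task, rdesc,
  par.set = ps, control = ctrl)
parallelStop()
}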
}