\name{snowfall-tools} \alias{snowfall-tools} \alias{sfLibrary} \alias{sfSource} \alias{sfExport} \alias{sfExportAll} \alias{sfRemove} \alias{sfRemoveAll} \alias{sfCat} \alias{sfClusterSplit} \alias{sfClusterCall} \alias{sfClusterEval} \alias{sfClusterEvalQ} \alias{sfClusterSetupRNG} \alias{sfClusterSetupRNGstream} \alias{sfClusterSetupSPRNG} \alias{sfTest} \title{Cluster tools} \usage{ sfLibrary( package, pos=2, lib.loc=NULL, character.only=FALSE, warn.conflicts=TRUE, keep.source=getOption("keep.source.pkgs"), verbose=getOption("verbose"), version, stopOnError=TRUE ) sfSource( file, encoding = getOption("encoding"), stopOnError = TRUE ) sfExport( ..., list=NULL, local=TRUE, debug=FALSE ) sfExportAll( except=NULL, debug=FALSE ) sfRemove( ..., list=NULL, master=FALSE, debug=FALSE ) sfRemoveAll( except=NULL, debug=FALSE ) sfCat( ..., sep=" ", master=TRUE ) sfClusterSplit( seq ) sfClusterCall( fun, ..., stopOnError=TRUE ) sfClusterEval( expr, stopOnError=TRUE ) sfClusterSetupRNG( type="RNGstream", ... ) sfClusterSetupRNGstream( seed=rep(12345,6), ... ) sfClusterSetupSPRNG( seed=round(2^32*runif(1)), prngkind="default", para=0, ... ) sfTest() } \arguments{ \item{expr}{expression to evaluate} \item{seq}{vector to split} \item{fun}{function to call} \item{list}{character vector with names of objects to export} \item{local}{a logical indicating if variables should taken from local scope(s) or only from global.} \item{debug}{a logical indicating extended information is given upon action to be done (e.g. print exported variables, print context of local variables etc.).} \item{except}{character vector with names of objects not to export/remove} \item{sep}{a character string separating elements in x} \item{master}{a logical indicating if executed on master as well} \item{...}{additional arguments to pass to standard function} \item{package}{name of the package. Check \code{library} for details.} \item{pos}{position in search path to load library.} \item{warn.conflicts}{warn on conflicts (see "library").} \item{keep.source}{see "library".} \item{verbose}{enable verbose messages.} \item{version}{version of library to load (see "library").} \item{encoding}{encoding of library to load (see "library").} \item{lib.loc}{a character vector describing the location of the R library trees to search through, or 'NULL'. Check \code{library} for details.} \item{character.only}{a logical indicating package can be assumed to be a character string. Check \code{library} for details.} \item{file}{filename of file to read. Check \code{source} for details} \item{stopOnError}{a logical indicating if function stops on failure or still returns. Default is \code{TRUE}.} \item{type}{a character determine which random number generator should be used for clusters. Allowed values are "RNGstream" for L'Ecuyer's RNG or "SPRNG" for Scalable Parallel Random Number Generators.} \item{para}{additional parameters for the RNGs.} \item{seed}{Seed for the RNG.} \item{prngkind}{type of RNG, see snow documentation.} } \description{ Tools for cluster usage. Allow easier handling of cluster programming. } \details{ The current functions are little helpers to make cluster programming easier. All of these functions also work in sequential mode without any further code changes. \code{sfLibrary} loads an R-package on all nodes, including master. Use this function if slaves need this library, too. Parameters are identically to the R-build in funtion \code{\link{library}}. If a relative path is given in \code{lib.loc}, it is converted to an absolute path.\\ As default \code{sfLibrary} stops on any error, but this can be prevented by setting \code{stopOnError=FALSE}, the function is returning \code{FALSE} then. On success \code{TRUE} is returned. \code{sfSource} loads a sourcefile on all nodes, including master. Use this function if the slaves need the code as well. Make sure the file is accessible on all nodes under the same path. The loading is done on slaves using \code{source} with fixes parameters: \code{local=FALSE, chdir=FALSE, echo=FALSE}, so the files is loaded global without changing of directory.\\ As default \code{sfSource} stops on any error, but this can be prevented by setting \code{stopOnError=FALSE}, the function is returning \code{FALSE} then. On success \code{TRUE} is returned. \code{sfExport} exports variables from the master to all slaves. Use this function if slaves need acccess to these variables as well. \code{sfExport} features two execution modes: local and global. If using local mode (default), variables for export are searched backwards from current environment to \code{globalenv()}. Use this mode if you want to export local variables from functions or other scopes to the slaves. In global mode only global variables from master are exported.\\ \emph{Note: all exported variables are \emph{global} on the slaves!}\\ If you have many identical named variables in different scopes, use argument \code{debug=TRUE} to view the context the exported variable is coming from.\\ Variables are given as their names or as a character vector with their names using argument \code{list}. \code{sfExportAll} exports all global variables from the master to all slaves with exception of the given list. Use this functions if you want to export mostly all variables to all slaves.\\Argument \code{list} is a character vector with names of the variables \emph{not} to export. \code{sfRemove} removes a list of global (previous exported or generated) variables from slaves and (optional) master. Use this function if there are large further unused variables left on slave. Basically this is only interesting if you have more than one explicit parallel task in your program - where the danger is slaves memory usage exceed.\\ If argument \code{master} is given, the variables are removed from master as well (default is FALSE).\\ Give names of variables as arguments, or use argument \code{list} as a character vector with the names. For deep cleaning of slave memory use \code{sfRemoveAll}. \code{sfRemoveAll} removes all global variables from the slaves. Use this functions if you want to remove mostly all variables on the slaves. Argument \code{list} is a character vector with names of the variables \emph{not} to remove. \code{sfCat} is a debugging function printing a message on all slaves (which appear in the logfiles). \code{sfClusterSplit} splits a vector into one consecutive piece for each cluster and returns as a list with length equal to the number of cluster nodes. Wrapper for \pkg{snow} function \code{clusterSplit}. \code{sfClusterCall} calls a function on each node and returns list of results. Wrapper for \pkg{snow} function \code{clusterCall}. \code{sfClusterEvalQ} evaluates a literal expression on all nodes. Wrapper for \pkg{snow} function \code{clusterEvalQ}. \code{sfTest} is a simple unit-test for most of the build in functions. It runs tests and compares the results for the correct behavior. Note there are some warnings if using, this is intended (as behavior for some errors is tested, too). use this if you are not sure all nodes are running your R-code correctly (but mainly it is implemented for development). } \keyword{package} \seealso{ See \pkg{snow} documentation for details on wrapper-commands: \code{\link[snow:snow-package]{snow}} } \examples{ \dontrun{ sfInit( parallel=FALSE ) ## Now works both in parallel as in sequential mode without ## explicit cluster handler. sfClusterEval( cat( "yummie\n" ) ); ## Load a library on all slaves. Stop if fails. sfLibrary( tools ) sfLibrary( "tools", character.only=TRUE ) ## Alternative. ## Execute in cluster or sequential. sfLapply( 1:10, exp ) ## Export global Var gVar <- 99 sfExport( "gVar" ) ## If there are local variables with same name which shall not ## be exported. sfExport( "gVar", local=FALSE ) ## Export local variables var1 <- 1 ## Define global var2 <- "a" f1 <- function() { var1 <- 2 var3 <- "x" f2 <- function() { var1 <- 3 sfExport( "var1", "var2", "var3", local=TRUE ) sfClusterCall( var1 ) ## 3 sfClusterCall( var2 ) ## "a" sfClusterCall( var3 ) ## "x" } f2() } f1() ## Init random number streams (snows functions, build upon ## packages rlecuyer/rsprng). sfClusterCall( runif, 4 ) sfClusterSetupRNG() ## L'Ecuyer is default. sfClusterCall( runif, 4 ) sfClusterSetupRNG( type="SPRNG", seed = 9876) sfClusterCall( runif, 4 ) ## Run unit-test on main functions. sfTest() } }