Managing intermediate results when using R/sparklyr

Win-Vector Blog 2017-06-22

Summary:

In our latest “R and big data” article we show how to manage intermediate results in non-trivial Apache Spark workflows using R, sparklyr, dplyr, and replyr. Handle management Many Sparklyr tasks involve creation of intermediate or temporary tables. This can be through dplyr::copy_to() and through dplyr::compute(). These handles can represent a reference leak and eat … Continue reading Managing intermediate results when using R/sparklyr

Link:

http://www.win-vector.com/blog/2017/06/managing-intermediate-results-when-using-rsparklyr/

From feeds:

Statistics and Visualization » Win-Vector Blog

Tags:

Authors:

John Mount

Date tagged:

06/22/2017, 18:23

Date published:

06/09/2017, 11:37