Run R programs at scale using Apache Spark's distributed computing engine with familiar R syntax
SparkR is a tool for running R on Spark. It follows the same principles as all of Spark’s other language bindings. To use SparkR, we simply import it into our environment and run our code. It’s all very similar to the Python API except that it follows R’s syntax instead of Python. For the most part, almost everything available in Python is available in SparkR.
Subscribe to our blog and get the latest posts delivered to your inbox.