Hadoop installation and configuration for Microsoft R Server
Microsoft R Server is a scalable data analytics server that can be deployed on a single-user workstation, on a local network of connected servers, or on a Hadoop cluster in the cloud. On Hadoop, R Server requires MapReduce, Hadoop Distributed File System (HDFS), and Apache YARN. With Microsoft R Server 9.1, Spark versions 1.6 through 2.0 are optionally supported.
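Before installing, it can help to confirm that the Hadoop components listed above are reachable from the node you plan to use. The following is a minimal sketch, not part of the R Server installer. It assumes the standard Hadoop and Spark client commands (`mapred`, `hdfs`, `yarn`, `spark-submit`) are the ones your distribution puts on the PATH; the function names `find_components` and `missing_required` are hypothetical helpers introduced only for this illustration.

```python
import shutil

# Assumption: these are the client command names on a typical Hadoop node.
# Some distributions may install them elsewhere or under different names.
REQUIRED = {"MapReduce": "mapred", "HDFS": "hdfs", "YARN": "yarn"}
OPTIONAL = {"Spark": "spark-submit"}


def find_components(commands, which=shutil.which):
    """Map each component name to the resolved path of its command, or None."""
    return {name: which(cmd) for name, cmd in commands.items()}


def missing_required(found):
    """Return the sorted names of components whose command was not found."""
    return sorted(name for name, path in found.items() if path is None)


if __name__ == "__main__":
    required = find_components(REQUIRED)
    optional = find_components(OPTIONAL)
    for name, path in {**required, **optional}.items():
        print(f"{name}: {path or 'not found on PATH'}")
    missing = missing_required(required)
    if missing:
        print("Missing required components: " + ", ".join(missing))
```

This only checks that the commands exist on the PATH. It does not verify component versions or that the cluster services are running, so treat a clean result as a first sanity check rather than proof that the prerequisites are met.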
Platforms and Dependencies
- Supported operating systems for Microsoft R Server
- Package dependencies for Microsoft R Server installations on Linux and Hadoop
Step-by-Step
- Command line installation for any supported platform
- Install an R package parcel using Cloudera Manager
- Offline installation
- Manual package installation
- Configure R Server to operationalize R code and host analytic web services
- Uninstall Microsoft R to upgrade to a newer version
Configuration
- Adjust your Hadoop cluster configuration for R Server workloads
- Enforcing YARN queue usage on R Server for Hadoop
- Manage your R installation on Linux