New-AzureHDInsightStreamingMapReduceJobDefinition
[This topic is pre-release documentation and is subject to change in future releases. Blank topics are included as placeholders.]
Defines a new streaming MapReduce job.
Syntax
Parameter Set: Default
New-AzureHDInsightStreamingMapReduceJobDefinition [-Arguments <String[]> ] [-CmdEnv <String[]> ] [-Combiner <String> ] [-Defines <Hashtable> ] [-Files <String[]> ] [-InputPath <String> ] [-JobName <String> ] [-Mapper <String> ] [-OutputPath <String> ] [-Reducer <String> ] [-StatusFolder <String> ] [ <CommonParameters>]
Detailed Description
Creates a new job definition object that represents the parameters of a Hadoop streaming job.
This topic describes the cmdlet in the 0.8.1 version of the Microsoft Azure PowerShell module. To find out the version of the module you're using, from the Azure PowerShell console, type (get-module azure).version.
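The version check above only returns a value when the Azure module is already loaded in the session. As a minimal sketch, assuming the module is installed under the name Azure, the following also finds a module that has not been imported yet. The variable names are illustrative.

```powershell
# Look for the Azure module among all installed modules, newest version first.
$azureModule = Get-Module -Name Azure -ListAvailable |
    Sort-Object -Property Version -Descending |
    Select-Object -First 1

if ($azureModule) {
    $message = "Azure module version: $($azureModule.Version)"
} else {
    $message = "The Azure module was not found on this machine."
}

$message
```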
Parameters
-Arguments<String[]>
The arguments for the Hadoop job. They are passed as command-line arguments to each task.
Aliases | Args
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-CmdEnv<String[]>
The command-line environment variables to set on the data nodes during job execution.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
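A possible way to pass environment variables with -CmdEnv is sketched below. It assumes each entry uses the NAME=value form that Hadoop streaming's -cmdenv option accepts; this topic does not state the expected format. The variable names WC_MODE and WC_VERBOSE are hypothetical.

```powershell
# Hypothetical environment variables in NAME=value form (an assumption,
# borrowed from Hadoop streaming's -cmdenv option).
$cmdEnv = @("WC_MODE=words", "WC_VERBOSE=1")

# Create the definition only when the Azure module's cmdlet is available.
if (Get-Command New-AzureHDInsightStreamingMapReduceJobDefinition -ErrorAction SilentlyContinue) {
    $jobWithEnv = New-AzureHDInsightStreamingMapReduceJobDefinition `
        -Mapper "cat.exe" `
        -Reducer "wc.exe" `
        -CmdEnv $cmdEnv
}
```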
-Combiner<String>
Combiner executable name.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-Defines<Hashtable>
The Hadoop configuration values to set during job execution.
Aliases | Params
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
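Because -Defines takes a hashtable, configuration values can be collected in one place before the definition is created. The sketch below is illustrative only: the two property names are standard Hadoop settings chosen as examples, not values this topic prescribes, and whether a given cluster honors them is an assumption.

```powershell
# Example Hadoop configuration values (illustrative, not prescribed by this topic).
$defines = @{
    "mapreduce.job.reduces"  = "1"        # number of reduce tasks
    "mapreduce.task.timeout" = "600000"   # task timeout in milliseconds
}

# Create the definition only when the Azure module's cmdlet is available.
if (Get-Command New-AzureHDInsightStreamingMapReduceJobDefinition -ErrorAction SilentlyContinue) {
    $jobWithDefines = New-AzureHDInsightStreamingMapReduceJobDefinition `
        -Mapper "cat.exe" `
        -Reducer "wc.exe" `
        -Defines $defines
}
```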
-Files<String[]>
The collection of files associated with the MapReduce job.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-InputPath<String>
The WASB path to the location of the input files.
Aliases | Input
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-JobName<String>
The name of the MapReduce job. Specifying the name is optional.
Aliases | Name
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-Mapper<String>
Mapper executable name.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-OutputPath<String>
The WASB path to the location where the job output is written.
Aliases | Output
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-Reducer<String>
Reducer executable name.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
-StatusFolder<String>
The folder that contains the standard output and standard error for the job, including its exit code and task logs.
Aliases | none
Required? | false
Position? | named
Default Value | none
Accept Pipeline Input? | false
Accept Wildcard Characters? | false
<CommonParameters>
This cmdlet supports the common parameters: -Verbose, -Debug, -ErrorAction, -ErrorVariable, -OutBuffer, and -OutVariable. For more information, see about_CommonParameters (https://go.microsoft.com/fwlink/p/?LinkID=113216).
Inputs
The input type is the type of the objects that you can pipe to the cmdlet.
Outputs
The output type is the type of the objects that the cmdlet emits.
Examples
Define a new streaming MapReduce job.
Create a MapReduce job definition for the streaming job.
PS C:\> $subscriptionName = "<SubscriptionName>" # Azure subscription name
$clusterName = "<ClusterName>" # HDInsight cluster name
# Create a MapReduce job definition for the streaming job.
$streamingWC = New-AzureHDInsightStreamingMapReduceJobDefinition -Files "/example/apps/wc.exe", "/example/apps/cat.exe" -InputPath "/example/data/gutenberg/davinci.txt" -OutputPath "/example/data/StreamingOutput/wc.txt" -Mapper "cat.exe" -Reducer "wc.exe"
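The example above defines $subscriptionName and $clusterName but stops before submitting the job. The following sketch shows one way the definition might be built with splatting and then run. It assumes the Start-AzureHDInsightJob, Wait-AzureHDInsightJob, and Get-AzureHDInsightJobOutput cmdlets from the same Azure module are available, that the placeholders have been replaced with real values, and that a one-hour timeout is acceptable. None of this is confirmed by this topic.

```powershell
$subscriptionName = "<SubscriptionName>"   # Azure subscription name
$clusterName = "<ClusterName>"             # HDInsight cluster name

# Same values as the example above, collected in a hashtable for splatting.
$jobParams = @{
    Files      = "/example/apps/wc.exe", "/example/apps/cat.exe"
    InputPath  = "/example/data/gutenberg/davinci.txt"
    OutputPath = "/example/data/StreamingOutput/wc.txt"
    Mapper     = "cat.exe"
    Reducer    = "wc.exe"
}

# Run only when the Azure module's cmdlets are available in this session.
if (Get-Command New-AzureHDInsightStreamingMapReduceJobDefinition -ErrorAction SilentlyContinue) {
    $streamingWC = New-AzureHDInsightStreamingMapReduceJobDefinition @jobParams

    # Submit the job, wait for it to finish, and print its standard error output.
    Select-AzureSubscription $subscriptionName
    $streamingWC |
        Start-AzureHDInsightJob -Cluster $clusterName |
        Wait-AzureHDInsightJob -WaitTimeoutInSeconds 3600 |
        Get-AzureHDInsightJobOutput -Cluster $clusterName -StandardError
}
```

Splatting keeps the long parameter list readable and makes it easy to reuse the same values for a second definition.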