Inline Dictation Commands
I’ve heard from a number of sources that there isn’t any good documentation about the ‘inline’ dictation commands. These commands can be uttered in the middle of a dictation stream (in other words, you don’t have to stop speaking to use these commands), and are used to guide the Speech Recognition Engine to produce the desired results.
Command | Description |
tab | Inserts a <tab> character. |
new-line | Inserts a new line character and forces the next word to be capitalized. |
new-paragraph | Inserts two new line characters and forces the next word to be capitalized. |
caps <word> | Capitalizes the first letter of <word>. |
no caps <word> | Lowercases <word>. |
all caps <word> | Uppercases <word>. |
no space | Does not insert a space before the next word. |
literal <argument> | Does not perform any inverse text normalization on <argument>. |
numeral <argument> | Forces <argument> into numeric form, if possible. |
Simple Examples
Tab, new-line, new-paragraph, caps, no caps, all caps, and no space are pretty straightforward:
User Input | Recognition Result |
king tab county | king<tab>county |
this is a test new-line this is another test | This is a test<new-line>This is another test |
this is a test new-paragraph this is another test | This is a test<new-line><new-line>This is another test |
no caps C I A | cia |
C I A | CIA |
I have caps in my closet | I have In my closet |
I have all caps nothing | I have NOTHING |
I have no space available | I haveavailable |
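To make the rows above concrete, here is a toy Python simulation of the formatting commands. It is only a sketch of the behavior described in the tables, not how the Speech Recognition Engine is actually implemented. The function name `apply_inline_commands` and the `WHITESPACE` table are my own inventions for illustration, and the sketch does not model inverse text normalization (so it will not turn "C I A" into "CIA") or capitalize the start of the dictation.

```python
# Hypothetical toy model of the inline formatting commands; not the real engine.
WHITESPACE = {"tab": "\t", "new-line": "\n", "new-paragraph": "\n\n"}

def apply_inline_commands(spoken: str) -> str:
    words = spoken.split()
    out = []
    case_cmd = None        # "caps", "no caps" or "all caps", waiting for its <word>
    suppress_space = True  # no space before the very first word
    i = 0
    while i < len(words):
        # Two-word commands are checked first.
        pair = " ".join(words[i:i + 2])
        if pair in ("no caps", "all caps"):
            case_cmd = pair
            i += 2
            continue
        if pair == "no space":
            suppress_space = True
            i += 2
            continue
        word = words[i]
        i += 1
        if word == "caps":
            case_cmd = "caps"
            continue
        if word in WHITESPACE:
            out.append(WHITESPACE[word])
            suppress_space = True
            if word != "tab":
                # new-line and new-paragraph force the next word to be capitalized
                case_cmd = "caps"
            continue
        # Ordinary word: apply any pending case command, then emit it.
        if case_cmd == "caps":
            word = word[:1].upper() + word[1:]
        elif case_cmd == "no caps":
            word = word.lower()
        elif case_cmd == "all caps":
            word = word.upper()
        case_cmd = None
        out.append(word if suppress_space else " " + word)
        suppress_space = False
    return "".join(out)

print(repr(apply_inline_commands("king tab county")))            # 'king\tcounty'
print(repr(apply_inline_commands("I have caps in my closet")))   # 'I have In my closet'
print(repr(apply_inline_commands("I have no space available")))  # 'I haveavailable'
```

Notice that a command word is consumed rather than written out, which is why "caps" and "no space" disappear from the results.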
Examples using <literal>
In order to understand what <literal> does, you need to know what Inverse Text Normalization does. Inverse Text Normalization is the process of converting spoken forms into a preferred textual representation. Some examples are:
User Input | Recognition Result |
doctor smith | Dr. Smith |
period | . |
united states of america | United States of America |
three hundred and five | 305 |
The <literal> command prevents that conversion from occurring, and forces the text to be input exactly as spoken.
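As a rough illustration, Inverse Text Normalization can be pictured as a lookup from spoken forms to written forms, with literal skipping the lookup. The sketch below is a toy model under that assumption. The names `ITN_TABLE` and `recognize` are hypothetical, the table holds only the examples listed above, and treating everything after the word "literal" as its argument is my simplification rather than documented engine behavior.

```python
# Hypothetical toy model: ITN as a phrase lookup, with "literal" bypassing it.
ITN_TABLE = {
    "doctor smith": "Dr. Smith",
    "period": ".",
    "united states of america": "United States of America",
    "three hundred and five": "305",
}

def recognize(spoken: str) -> str:
    prefix = "literal "
    if spoken.startswith(prefix):
        # Assumption: the rest of the utterance is the <argument>, kept as spoken.
        return spoken[len(prefix):]
    # No command: convert the spoken form if the table knows it.
    return ITN_TABLE.get(spoken, spoken)

print(recognize("period"))                          # .
print(recognize("literal period"))                  # period
print(recognize("literal three hundred and five"))  # three hundred and five
```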
Examples using <numeral>
The best way to describe the <numeral> command is to give some examples:
User Input | Recognition Result |
numeral two | 2 |
two | two |
numeral twelve | 12 |
twelve | 12 |
numeral four eight seven six two three | 487623 |
four eight seven six two three | four eight seven six two three |
numeral one and a half million | 1,500,000 |
numeral one point five million | 1.5 million |
numeral one point four five | 1.45 |
If the phrase cannot be interpreted as a number, then the numeral command does nothing:
User Input | Recognition Result |
numeral I want to go to the store | I want to go to the store |
Note that this phrase has a possible number (to -> two -> 2), but the intervening words deactivate the command.
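The "all or nothing" behavior of numeral can be sketched in a few lines of Python. This is a toy model, not the engine's logic: the `apply_numeral` function and `DIGITS` table are hypothetical names, and the sketch only understands digit-by-digit phrases such as "four eight seven six two three". It does not handle "twelve", "one and a half million", or the default normalization that happens without the command.

```python
# Hypothetical toy model of "numeral": convert only if the whole argument is digits.
DIGITS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}

def apply_numeral(spoken: str) -> str:
    words = spoken.split()
    if not words or words[0] != "numeral":
        return spoken  # no command, leave the phrase alone
    argument = words[1:]
    if argument and all(w in DIGITS for w in argument):
        return "".join(DIGITS[w] for w in argument)
    # The argument is not a number: the command does nothing.
    return " ".join(argument)

print(apply_numeral("numeral two"))                              # 2
print(apply_numeral("numeral four eight seven six two three"))   # 487623
print(apply_numeral("numeral I want to go to the store"))        # I want to go to the store
```

Because the check is applied to the whole argument, a stray word that merely sounds like a number ("to") never gets converted on its own.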