Partager via


Note

Please see Azure Cognitive Services for Speech documentation for the latest supported speech solutions.

Pronunciation Lexicon Reference (Microsoft.Speech)

Pronunciation lexicons contain the mapping between the pronunciations and the written representations of words or short phrases. You can use lexicons to improve the accuracy of speech recognition or to customize the vocabulary and pronunciations of a synthesized voice.

The implementation of lexicons in Microsoft.Speech is based on World Wide Web Consortium (W3C) Pronunciation Lexicon Specification (PLS) Version 1.0, which defines the structure and syntax for XML-based lexicon documents.

The following table lists and describes elements from the PLS specification that are implemented in Microsoft.Speech. Elements are listed in the order in which they occur in a PLS document,

PLS Element

Description

Usage

Attributes

lexicon

The root element in a lexicon document that contains all the other elements.

Required

version, xml:base, xmlns, xml:lang, alphabet

meta

Specifies information about the document.

Optional

name, http-equiv, content

metadata

Contains information about the document in a metadata schema.

Optional

-

lexeme

Must contain one or more grapheme elements, one or more pronunciations (each in a separate phoneme element), and can optionally contain one or more example elements.

Optional

xml:id

grapheme

A child element of the lexeme element, it contains the written representation of a word or phrase in a lexeme element.

Optional

-

phoneme

A child element of the lexeme element, it contains a single phonetic pronunciation of a word or phrase in a lexeme element.

Optional

prefer, alphabet

example

A child element of the lexeme element, it contains an example sentence that illustrates an occurrence of a lexeme.

Optional

-

Remarks

Although the PLS specification does not require that lexicon documents contain lexeme elements, a lexicon is not useful for pronunciation without them.

The meta and metadata elements must occur before the first lexeme element.

Currently, Microsoft does not support the alias element that is defined in the PLS specification.

PLS lexicons used in Microsoft.Speech are not required to contain an XML prolog, for example: <?xml version="1.0" encoding="UTF-8"?>

See Also

Concepts

About Lexicons and Phonetic Alphabets (Microsoft.Speech)

Phonetic Alphabet Reference (Microsoft.Speech)