使用語音轉換文字顯示文字格式

發行項
10/16/2024

語音轉換文字提供一系列格式化功能，以確保轉錄的文字清晰易讀。如需如何使用每個功能來改善最終文字輸出的整體清晰度，請參閱以下章節。

ITN

反向文字正規化 (ITN) 是一種程序，可將說出口的話轉換成書寫形式。例如，說出 "four" 即會轉換成書寫形式「4」。語音轉換文字服務會完成此流程，且無法設定。部分支援的文字格式包括日期、時間、小數點、貨幣、地址、電子郵件和電話號碼。您可以自然地說話，服務會如預期般將文字格式化。下表顯示套用至文字輸出的 ITN 規則。

已辨識的語音	顯示文字
`that will cost nine hundred dollars`	`That will cost $900.`
`my phone number is one eight hundred, four five six, eight nine ten`	`My phone number is 1-800-456-8910.`
`the time is six forty five p m`	`The time is 6:45 PM.`
`I live on thirty five lexington avenue`	`I live on 35 Lexington Ave.`
`the answer is six point five`	`The answer is 6.5.`
`send it to support at help dot com`	`Send it to support@help.com.`

大寫

語音轉換文字模型可辨識應大寫的字組，以改善可讀性、正確度和文法。例如，語音服務會自動將適當名詞和句子開頭的字大寫。下表顯示一些範例。

已辨識的語音	顯示文字
`i got an x l t shirt`	`I got an XL t-shirt.`
`my name is jennifer smith`	`My name is Jennifer Smith.`
`i want to visit new york city`	`I want to visit New York City.`

移除不流暢情況

我們在說話時很可能會結巴、重複字詞，以及說出像是「嗯」或「呃」的填充詞。語音轉換文字可以辨識這類不流暢情況，並將其從顯示文字中移除。移除不流暢情況的功能非常適合用來轉錄現場無腳本語音，以進行後續校對。下表顯示一些範例。

已辨識的語音	顯示文字
`i uh said that we can go to the uhmm movies`	`I said that we can go to the movies.`
`its its not that big of uhm a deal`	`It's not that big of a deal.`
`umm i think tomorrow should work`	`I think tomorrow should work.`

標點符號

語音轉換文字可自動為您的文字加標點符號，以改善清晰度。標點符號有助於校對通話或交談的轉錄內容。下表顯示一些範例。

已辨識的語音	顯示文字
`how are you`	`How are you?`
`we can go to the mall park or beach`	`We can go to the mall, park, or beach.`

當您使用語音轉換文字搭配連續辨識功能時，您可以設定語音服務來辨識明確的標點符號。如此一來，您可以大聲說出標點符號，讓您的文字更清晰易讀。這個功能特別適用於當您想要使用複雜的標點符號，又不想事後併入的情況下。下表顯示一些範例。

已辨識的語音	顯示文字
`they entered the room dot dot dot`	`They entered the room...`
`i heart emoji you period`	`I <3 you.`
`the options are apple forward slash banana forward slash orange period`	`The options are apple/banana/orange.`
`are you sure question mark`	`Are you sure?`

當您使用語音轉換文字搭配連續辨識功能時，請使用語音 SDK 來啟用聽寫模式。此模式會使語音設定執行個體解譯句子結構的字組描述，例如標點符號。

speechConfig.EnableDictation();

speechConfig->EnableDictation();

speechConfig.EnableDictation()

speechConfig.enableDictation();

speechConfig.enableDictation();

[self.speechConfig enableDictation];

self.speechConfig!.enableDictation()

speech_config.enable_dictation()

不雅內容篩選

您可以指定要在最終的轉錄文字中遮罩、移除或顯示不雅內容。遮罩會以星號 (*) 字元取代不雅字組，讓您既可保留文字的原始情感，在特定場合中也能更適切

注意

Microsoft 也保留遮罩或移除任何視之為不適當文字的權利。不論您是否啟用不雅內容篩選，語音服務都不會傳回這類字詞。

不雅內容篩選選項如下：

Masked：以星號 (*) 字元取代不雅字詞的字母。預設選項為「已遮罩」。
Raw：一字不差地包含不雅字詞。
Removed：移除不雅字詞。

例如，若要移除語音辨識結果中的不雅字詞，請將不雅內容篩選設定為 Removed，如下所示：

speechConfig.SetProfanity(ProfanityOption.Removed);

speechConfig->SetProfanity(ProfanityOption::Removed);

speechConfig.SetProfanity(common.Removed)

speechConfig.setProfanity(ProfanityOption.Removed);

speechConfig.setProfanity(sdk.ProfanityOption.Removed);

[self.speechConfig setProfanityOptionTo:SPXSpeechConfigProfanityOption.SPXSpeechConfigProfanityOption_ProfanityRemoved];

self.speechConfig!.setProfanityOptionTo(SPXSpeechConfigProfanityOption_ProfanityRemoved)

speech_config.set_profanity(speechsdk.ProfanityOption.Removed)

spx recognize --file caption.this.mp4 --format any --profanity masked --output vtt file - --output srt file -

不雅內容篩選會套用至結果 Text 和 MaskedNormalizedForm 屬性。不雅內容篩選不套用至結果 LexicalForm 和 NormalizedForm 屬性。兩個篩選都不套用至文字層級結果。

共用方式為

使用語音轉換文字顯示文字格式

ITN

大寫

移除不流暢情況

標點符號

不雅內容篩選

意見反應

其他資源

共用方式為

使用語音轉換文字顯示文字格式

ITN

大寫

移除不流暢情況

標點符號

不雅內容篩選

相關內容

意見反應

其他資源