Hello @Alex Del Giudice , thank you for your time and patience throughout this issue.
The product team confirmed that this behavior is by design.
Also, without a reference text, it would be difficult to determine which words are insertions or omissions. The Pronunciation Assessment tool relies on the reference text to compare the transcript with the expected transcript.
I hope this helps.
Regards,
Vasavi
-Please kindly accept the answer and vote 'yes' if you feel helpful to support the community, thanks.