@Alex Sorry for the misunderstanding and unclear document, this table you shared is not about the baseline model, it's about the lanauage supports for each feature. I know the description is confused, I will contact the content author to modify it.
The table you shared is related to input data and data type as below screenshots, this means, for languages in this table, the audio data input is supported:
In the studio, it reflect below:
Sorry again for the misunderstanding.
I hope this helps.
Regards,
Yutong
-Please kindly accept the answer if you feel helpful, thanks a lot.