@Jade Nameless Thanks for the question. Could you please share a link to the transcription code and the API you are using? Please also add more details about the intermediate results you are getting.
Please follow the threads on requesting word-level timestamps in the speech config, for example:
"To Generate Timestamps in STT model"
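
In case it helps while you gather those details, here is a minimal sketch of reading word-level timestamps from a detailed recognition result. It assumes you are using the Azure Speech SDK for Python, which your post does not confirm. There, word timings are requested by calling `request_word_level_timestamps()` on the `SpeechConfig`, and the detailed JSON is read from the result properties. The helper name `word_timestamps` and the sample payload are made up for illustration; only the `NBest` / `Words` / `Offset` / `Duration` field layout is taken from the service's detailed output format.

```python
import json

# Assumption: offsets and durations are reported in 100-nanosecond ticks,
# as in the Azure Speech detailed output format.
TICKS_PER_SECOND = 10_000_000

# With the Azure Speech SDK (assumed, not confirmed by the question),
# the JSON parsed below would be obtained roughly like this:
#   speech_config.request_word_level_timestamps()
#   raw = result.properties.get(
#       speechsdk.PropertyId.SpeechServiceResponse_JsonResult)


def word_timestamps(result_json: str):
    """Return (word, start_seconds, end_seconds) for the top hypothesis.

    Hypothetical helper: returns an empty list when the payload has no
    NBest entry or no word-level details.
    """
    payload = json.loads(result_json)
    nbest = payload.get("NBest") or []
    if not nbest:
        return []
    words = nbest[0].get("Words") or []
    return [
        (
            w["Word"],
            w["Offset"] / TICKS_PER_SECOND,
            (w["Offset"] + w["Duration"]) / TICKS_PER_SECOND,
        )
        for w in words
    ]


# Made-up sample payload, for illustration only.
SAMPLE_JSON = json.dumps({
    "RecognitionStatus": "Success",
    "NBest": [{
        "Display": "Hello world.",
        "Words": [
            {"Word": "hello", "Offset": 5000000, "Duration": 3000000},
            {"Word": "world", "Offset": 9000000, "Duration": 4000000},
        ],
    }],
})

if __name__ == "__main__":
    for word, start, end in word_timestamps(SAMPLE_JSON):
        print(f"{word}: {start:.2f}s to {end:.2f}s")
```

If the `Words` list comes back empty in your case, please include the raw JSON result in your reply so we can check whether the timestamp request reached the service.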