Hello there,
You can use third party tools like resemble to create voice of your own.
SV2TTS is a deep learning framework in three stages. In the first stage, one creates a digital representation of a voice from a few seconds of audio. In the second and third stages, this representation is used as reference to generate speech given arbitrary text.
https://github.com/CorentinJ/Real-Time-Voice-Cloning
Hope this resolves your Query !!
--If the reply is helpful, please Upvote and Accept it as an answer--