Jessie Chen, I'm glad that your issue is resolved, and thank you for posting your solution so that others experiencing the same thing can easily reference it!
Since the Microsoft Q&A community has a policy that question authors cannot accept their own answers, only answers posted by others, I'll repost your solution in case you'd like to accept the answer.
Error Message:
The prompt I used to ask this question was only 16 tokens, yet I received the error "Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 57 seconds.'}}". I searched the internet for this error, and the results suggested that I should scale my services. I scaled my search service to the maximum of 36/36, but I still encountered the same error.
Solution:
I resolved the issue by adjusting the Tokens per Minute rate limit to a higher value. Initially, it was set to the default of about 10,000.
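For anyone who hits the same 429 before they are able to raise the limit, here is a minimal Python sketch of waiting for the period the error message asks for and then retrying. It is only an illustration under assumptions: `RateLimitError`, `retry_after_seconds`, and `call_with_retry` are hypothetical names that are not part of any SDK, and the sketch assumes the error text contains the phrase "Try again in N seconds" as in the message above.

```python
import re
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever exception your client raises on a 429."""


def retry_after_seconds(message, default=1.0):
    """Return the wait time from text like 'Try again in 57 seconds.', else default."""
    match = re.search(r"Try again in (\d+) seconds?", message)
    return float(match.group(1)) if match else default


def call_with_retry(func, max_attempts=3, sleep=time.sleep):
    """Call func(); on RateLimitError, wait as long as the message asks, then retry."""
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except RateLimitError as exc:
            if attempt == max_attempts:
                raise  # give up after the final attempt
            sleep(retry_after_seconds(str(exc)))
```

You would wrap your own request function with `call_with_retry` and raise `RateLimitError` (or catch your client's equivalent) when the server responds with status 429. Retrying only works around brief spikes; if the limit itself is too low for your workload, raising the Tokens per Minute value as described above is still the actual fix.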
If you have any other questions, please let me know. Thank you again for your time and patience throughout this issue.