HI @santai
There is a token breakdown in the playground that you can check. see below.
The main reason i am guess is that you might have chat history in the request which will be included in the token. i.e. multiple chat turns in the chat window.
on top of this break down, you can also
- use the view code option and switch to curl. run the
curl -i
in the bash to see the actual quota headers that shows remaining token and request allowed - use the switch to JSON object to inspect the full actual api request payload (that you shall see all chat histories)