@Dhilip Swaminathan If you are running continuous recognition, the first 15 seconds is the approximate timeout limit when no utterance is detected. This is also documented here for reference. There is also a thread that details how the timeouts are set at the SDK level, which might help app developers change them accordingly.
With respect to billing, charges are based on the audio that is processed, billed per second. They are not based on whether voice is detected in your input. Please see the pricing page for the billing criteria.
If you are running continuous recognition and there is no utterance, the 15 seconds of audio still count as processed audio.
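To get a rough feel for how silent sessions add up, here is a minimal sketch of the arithmetic. It is not the service's actual billing logic: the function names are made up for illustration, the rounding of each session up to a whole second is an assumption, and the hourly rate is a placeholder (please take the real rate from the pricing page).

```python
import math


def billed_seconds(session_durations_s):
    """Total billable seconds for a list of session lengths (in seconds).

    Assumption: each session is rounded up to a whole second.
    The real rounding rule may differ; check the pricing page.
    """
    return sum(math.ceil(d) for d in session_durations_s)


def estimated_cost(session_durations_s, rate_per_hour):
    """Estimated cost given a hypothetical price per audio hour."""
    return billed_seconds(session_durations_s) * rate_per_hour / 3600


if __name__ == "__main__":
    # 100 continuous-recognition sessions that each ran ~15 s with no speech.
    silent_sessions = [15.0] * 100
    placeholder_rate = 1.0  # hypothetical currency units per audio hour
    print("billable seconds:", billed_seconds(silent_sessions))
    print("estimated cost:", estimated_cost(silent_sessions, placeholder_rate))
```

The point is only that silence is counted like any other audio, so many short idle sessions can add up to a noticeable amount of processed audio.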
Based on your scenario, instead of using continuous recognition to listen for input, consider using keyword recognition. Once a keyword is detected, use the audio that follows the keyword to convert to text and process it further based on the input. See this QuickStart on creating a keyword, and the sample at the end of that page, which uses the keyword and saves the result as audio. I hope this helps!
If this answers your query, please click Accept Answer and Yes for "Was this answer helpful". And if you have any further queries, do let us know.