An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Hi,
maybe you can try to lower the temperature setting. A lower temperature value (e.g., 0.2 or 0.3) will produce more focused and deterministic responses, which can help prevent excessive elaboration.
You can also experiment with top_p, which controls the diversity of the generated text. Lowering it (e.g., 0.5) might help reduce the length by making the model focus more on higher-probability responses.
Regards,
Luc.