@Rman-1501 Based on my understanding of the service, I think the following would happen for the scenarios mentioned.
- Would it be fine if the sum of all sent rewards is more than 1?
Per the documentation, the reward value is a scalar between 0 and 1, inclusive, and it represents the quality of the action that was taken: a higher value indicates a better action. With the "Sum" aggregation method, all rewards sent for the same event are added together, so the total can end up above 1. As far as I understand, that is fine: the documentation notes that a summed reward may fall outside the expected range and that this does not make the service fail.
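To make the aggregation behaviour concrete, here is a minimal local sketch of how "Sum" and "Earliest" would combine several rewards sent for one event. It is only an illustration of the idea, not the service's implementation: the function name `aggregate_rewards` and the fallback of 0 when no reward arrives are my own assumptions.

```python
from typing import Sequence


def aggregate_rewards(rewards: Sequence[float], method: str = "sum") -> float:
    """Combine the rewards received for a single event (hypothetical helper)."""
    if not rewards:
        # Assumption: with no reward received, fall back to a default of 0.
        return 0.0
    if method == "sum":
        # "Sum" adds every reward received for the event.
        return sum(rewards)
    if method == "earliest":
        # "Earliest" keeps the first reward and discards the rest.
        return rewards[0]
    raise ValueError(f"unknown aggregation method: {method}")


# Two individual rewards, each within [0, 1], for the same event.
print(aggregate_rewards([0.5, 0.75], "sum"))       # 1.25
print(aggregate_rewards([0.5, 0.75], "earliest"))  # 0.5
```

Each individual reward stays within 0 to 1 here; only the summed total goes above 1.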
- What happens if I send a negative reward? Sometimes the user does something that shows they became less interested. Since I use the "Sum" aggregation method, if I send a negative amount, will it be subtracted from the final reward?
You can send a reward score of zero instead of a negative score. According to the documentation, a negative score is possible only in certain scenarios and should only be used if you are experienced with reinforcement learning (RL). Personalizer trains the model to achieve the highest possible sum of rewards over time.
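If you still want "less interested" signals to count against the reward, one possible approach is to combine the signals in your own application and send a single reward that never drops below zero. The sketch below is an assumption on my part, not something taken from the Personalizer documentation, and `net_reward` is a hypothetical helper name.

```python
from typing import Iterable


def net_reward(signals: Iterable[float]) -> float:
    """Combine positive and negative signals locally (hypothetical helper).

    The result is clamped to [0, 1] so that the value sent to the service
    is never negative and never above the usual upper bound.
    """
    total = sum(signals)
    return max(0.0, min(1.0, total))


# A positive signal followed by a smaller negative one.
print(net_reward([0.5, -0.25]))  # 0.25
# The negative signal outweighs the positive one, so the reward floors at 0.
print(net_reward([0.25, -0.5]))  # 0.0
```

With this approach you would send the one combined value for the event rather than each signal separately, so the aggregation method no longer matters for that event.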
- What if the sum of all sent rewards becomes a negative number? What does a negative reward mean for the algorithm in Azure Personalizer?
Personalizer trains the model to achieve the highest possible sum of rewards over time. So, if the sum becomes negative, it might be treated as if the user did not perform any actions.
I hope this helps!!
If this answers your query, please click Accept Answer and Yes for "Was this answer helpful". If you have any further queries, do let us know.