I have a hybrid worker group setup on one Azure Arc-connected machine. The hybrid worker extension is installed, and initially, it works great.
However, after roughly one week, I get the error with Event ID 15105 in Event Viewer: - The task category is HybridErrorWhilePullingQueue
"Error while polling the hybrid notification channel <accountid, runbookworkergroup, machinename, etc>"
"}] [Exception=JobRuntimeData.Client.JobRuntimeDataServiceClientException: The remote server returned an error: (403) Forbidden."
In Azure Automation, the jobs get suspended. If I click into the jobs itself, the exception tab shows "Job was suspended as the job limit for a Hybrid Worker exceeded. Add more Hybrid Workers to the Hybrid Worker group to overcome this issue, see: https://aka.ms/HRW-polling-limit-exceed"
However, I only have one job. And it's not repeating, it's trigger based, and I can see that the trigger hasn't fired. The hybrid worker has a "last seen time" in Azure that's just a few minutes old.
Re-installing the extension WORKS. But it's been doing this for a few weeks now, where it will continute to work for about one week, before requiring a re-install to get up and running again. How can I prevent this? It's not feasible to have to re-install it every single week.
Running the troubleshooter, this is the only part that fails.
RuleId : HybridExtensionLogCheck
RuleGroupId : Extensionhealth
RuleName : Hybrid Extension log check
RuleGroupName : Hybrid Extension log Checks
RuleDescription : Analyze latest error event logs of hybrid extension
CheckResult : Failed
CheckResultMessage : There are no hybrid extension logs, Extension installation may have some issue. Please re
install the extension.
CheckResultMessageId : HybridExtensionLogCheck.Failed.There are no hybrid extension logs
CheckResultMessageArguments : {}