Azure Redis Cache connection timeout from AKS workloads

Question

Azure Redis Cache connection timeout from AKS workloads

Anonymous

Dears,
we are facing connection timeout to Redis Cache (PaaS) on our AKS workloads.
Before moving to the Redis PaaS solution, we were using our own Redis deploy (K8s pods).
We were facing the same issue: connection timeout.

One can think that the issue is caused by our application and not Redis itself.

But the point here is that we have many pods in separate namespaces, with different configurations and they all face Redis disconnections in the same time frame (approx 1 hour).

At this point my guess is that the underlying issue comes from the AKS node timesync.

One clue for this assumption is that all the pods facing the issue are on the same node, despite we have many replicas on other pool nodes.
Another clue is that while the issue is going on, we have no other issues in the cluster and node, all metrics are fine: CPU usage, IO, memory, nw bandwith...

My questions are:
1- is there any evidence that AKS has timesync issues in the current OS node version for k8s vers. 1.21.2 ?
2- how can I investigate on my own if timesync is occuring while I have the Redis timeouts ?

thanks
Marco

3 answers

Your answer

Answer 1

Anonymous

Just for the sake of completeness:
we were not able to troubleshoot the issue: after excluding the timesync issue on the AKS nodes, we didn't find any other anomalies that could lead to a Redis fault.

So we moved to the Redis Cache managed solution offered by Azure and it seems to work properly.

vipullag-MSFT 26,487 Reputation points Moderator

2022-04-12T14:11:09.877+00:00

@Anonymous

Thanks for sharing this for benefit of community.

Answer 2

shiva patpi 13,366 Microsoft Employee Moderator

Hello @Anonymous ,
To validate your first query can you login to the node and run the command sudo timedatectl status:

w.r.t timeouts below document should help you out:

https://learn.microsoft.com/en-us/azure/azure-cache-for-redis/cache-troubleshoot-timeouts
https://learn.microsoft.com/en-us/azure/azure-cache-for-redis/cache-troubleshoot-client
https://learn.microsoft.com/en-us/azure/azure-cache-for-redis/cache-troubleshoot-server

Answer 3

Anonymous

Hi @shiva patpi ,

all the pool nodes report system clock sync: Yes

This excludes that there is a time sync issue, correct?

shiva patpi 13,366 Reputation points Microsoft Employee Moderator

2022-03-10T05:04:52.593+00:00

Correct, No issue with time sync !

Share via

Azure Redis Cache connection timeout from AKS workloads

3 answers

Your answer