Chunk cache too many timeout errors #8004
Replies: 2 comments
-
The exporter not being able to connect to Memcached from within the same pod indicates some kind of network configuration error. You can verify this by attempting to connect to Memcached from any other pod in your infrastructure using telnet. Example:
From within that pod use telnet to connect to Memcached and run a
|
Beta Was this translation helpful? Give feedback.
-
Hi Everyone, I've got the same issue. Lots of warnings in store-gateway logs "failed to fetch items from Memcached", i/o timeout TCP read from <store-gateway_IP>:port to <chunks-cache_IP>:11211 and in logs of chunks-cache: All Mimir pods are deployed in the Mimir namespace, so there is no network policy to mistakingly prevent network traffic between pods. I would be so happy if anyone has found any solution |
Beta Was this translation helpful? Give feedback.
-
Describe the bug
chunks cache too many timeouts.
To Reproduce
Steps to reproduce the behavior:
days.
Expected behavior
No Timeouts also faster response from store-gateway.
Environment
Additional Context
We observed many timeout errors for chunks. also chunks cache hit ratio is
also below 10%.
We tried to scale up the memcache and also scaled up the store gateway but
no improvement. Also tried to increase the timeout to 4s, and we saw some
improvement.
Memory Limit : 32Gib
Memory used: 6 Gib
maxConnection Limit: 16k;
total connections ~= 1k;
Requests/sec : 2.5k ops/seconds;
Latency : 500ms(99th percentile) 255ms (average)
Timeout: 450ms;
store gateway logs
Memcache export logs:
Beta Was this translation helpful? Give feedback.
All reactions