Enhance MimirRequestLatency runbook with more advice #1967

Merged · 21 commits · Jun 3, 2022

Changes from 5 commits

Commits (21)
83dbd76
Enhance MimirRequestLatency runbook with more advice
aknuds1 May 31, 2022
4e8fd08
Merge remote-tracking branch 'origin/main' into chore/enhance-mimir-r…
aknuds1 May 31, 2022
ca12851
Merge remote-tracking branch 'origin/main' into chore/enhance-mimir-r…
aknuds1 Jun 1, 2022
77cff05
Remove part on OOM-ing
aknuds1 Jun 1, 2022
ef623c6
Tweak advice regarding queriers and query sharding
aknuds1 Jun 1, 2022
06d1f38
Update docs/sources/operators-guide/mimir-runbooks/_index.md
aknuds1 Jun 2, 2022
27472ff
Preparation of e2eutils for Thanos indexheader unit tests. (#1982)
stevesg Jun 1, 2022
90893c1
Make propagation of forwarding errors configurable (#1978)
replay Jun 1, 2022
9aea9b7
Release the mimir-distributed-beta helm chart (#1948)
krajorama Jun 1, 2022
a3ecb22
Copy Thanos block/indexheader package (#1983)
stevesg Jun 1, 2022
1252c06
Prepare mimir beta chart release (#1995)
krajorama Jun 1, 2022
67711ac
Bump version of helm chart (#1996)
krajorama Jun 1, 2022
b80e560
Revise query sharding advice
aknuds1 Jun 2, 2022
8fb0833
Fix link
aknuds1 Jun 2, 2022
00a45b4
Add example Memcached timeout query
aknuds1 Jun 2, 2022
9e1ce54
Merge remote-tracking branch 'origin/main' into chore/enhance-mimir-r…
aknuds1 Jun 2, 2022
2e7b272
Merge remote-tracking branch 'origin/main' into chore/enhance-mimir-r…
aknuds1 Jun 2, 2022
071a1a4
Update docs/sources/operators-guide/mimir-runbooks/_index.md
aknuds1 Jun 3, 2022
cec869f
Fix binary_reader.go header text. (#1999)
stevesg Jun 2, 2022
97e63ce
Merge remote-tracking branch 'origin/main' into chore/enhance-mimir-r…
aknuds1 Jun 3, 2022
5293bc6
Address feedback
aknuds1 Jun 3, 2022
8 changes: 8 additions & 0 deletions docs/sources/operators-guide/mimir-runbooks/_index.md
@@ -222,6 +222,14 @@ How to **investigate**:
- Check `Memcached Overview` dashboard
- If memcached eviction rate is high, then you should scale up memcached replicas. Check the recommendations on the `Mimir / Scaling` dashboard and make reasonable adjustments as necessary.
- If memcached eviction rate is zero or very low, then it may be caused by "first time" queries
- Cache query timeouts
- Check store-gateway logs and look for warnings about timed out Memcached queries
- If many Memcached queries are indeed timing out, consider whether the store-gateway Memcached timeout setting (`-blocks-storage.bucket-store.chunks-cache.memcached.timeout`) is high enough
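As an example of surfacing these timeouts from metrics rather than logs, a query along the following lines can help. The metric and label names (`thanos_memcached_operation_failures_total`, `reason="timeout"`) are assumptions based on the Thanos memcached client used by the store-gateway; verify them against your Mimir version.

```promql
# Rate of Memcached operations failing with a timeout, per cache and operation.
# Metric and label names are assumptions based on the Thanos memcached client.
sum by (name, operation) (
  rate(thanos_memcached_operation_failures_total{reason="timeout"}[1m])
)
```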
- If queries are waiting in queue due to busy queriers
Collaborator:

We're not saying how to check it. The Mimir / Queries dashboard has panels named "Queue length". The goal is to keep that queue length at 0 (except for a few sporadic spikes). If the queue length stays above 0 for some time, then we need to scale up queriers.

Contributor (author):

Done.
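A sketch of the check described in the comment above, assuming the query-scheduler is deployed and exposes the `cortex_query_scheduler_queue_length` gauge (when the query-frontend does the queueing, the equivalent gauge is `cortex_query_frontend_queue_length`); treat the metric names as assumptions to verify against your deployment:

```promql
# Number of queries waiting in queue, summed across query-scheduler replicas.
# Should hover around 0; a sustained value above 0 suggests scaling up queriers.
sum(cortex_query_scheduler_queue_length)
```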

- Consider scaling up the number of queriers if they're not auto-scaled; if they are auto-scaled, check the auto-scaling parameters
- If queries are not waiting in queue due to busy queriers
- Consider enabling query sharding if not already enabled, to increase query parallelism
aknuds1 marked this conversation as resolved.
- If query sharding is already enabled, consider increasing the total number of query shards (`query_sharding_total_shards`) for tenants submitting slow queries, so that their queries can be parallelized further
Contributor:

I seem to recall that tuning the number of shards isn't exactly as straightforward as it seems. Is there an existing doc we could link people to that describes how to pick a number of shards?

Collaborator:

All the documentation we have is at docs/sources/operators-guide/architecture/query-sharding/index.md. I think the main feedback here is to just increase it and see if it improves things. We could be more specific and say to consider doubling the query shards and checking whether that reduces high-cardinality query latency: if it doesn't, roll back.
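As a sketch of that "double it and see" approach, per-tenant shard counts can be raised through Mimir's runtime overrides configuration. The tenant ID and numbers below are hypothetical; only the `query_sharding_total_shards` limit name comes from the diff above.

```yaml
# Runtime overrides file: hypothetical tenant "tenant-a", doubling its query
# shards from 16 to 32 as an illustration. Roll back if high-cardinality query
# latency does not improve.
overrides:
  tenant-a:
    query_sharding_total_shards: 32
```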


#### Alertmanager
