Sometime between 1.97.2 and 1.97.3, a change was introduced that appears to be causing memory exhaustion in the API service. Increasing the available memory from 350 MB to 500 MB does not appear to have helped.
The Worker service also appears to be affected, while the Game/SP services do not appear to be.
<--- Last few GCs --->
[27:0xffff9436c3c0] 13975962 ms: Mark-sweep 246.6 (258.9) -> 245.5 (258.9) MB, 222.2 / 0.0 ms (average mu = 0.984, current mu = 0.129) allocation failure scavenge might not succeed
[27:0xffff9436c3c0] 13976247 ms: Mark-sweep 246.0 (258.9) -> 245.5 (259.1) MB, 258.3 / 0.0 ms (average mu = 0.965, current mu = 0.093) allocation failure GC in old space requested
<--- JS stacktrace --->
FATAL ERROR: Reached heap limit Allocation failed - JavaScript heap out of memory
error Command failed with signal "SIGABRT".
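In the GC lines above, the heap sits at roughly 246 MB used of about 259 MB committed when the process aborts, which is below both the 350 MB and 500 MB container figures. One possible reading, not confirmed anywhere in this issue, is that the V8 heap limit rather than the container limit is the ceiling being hit. A minimal sketch for checking this inside the affected service, using Node's built-in `v8.getHeapStatistics()` and `process.memoryUsage()` (the `heapReport` helper name is hypothetical):

```javascript
const v8 = require('node:v8');

const MB = 1024 * 1024;

// Hypothetical helper: report the V8 heap ceiling next to current usage, in MB.
function heapReport() {
  const stats = v8.getHeapStatistics();
  const usage = process.memoryUsage();
  return {
    // Maximum size V8 will let the heap grow to before aborting.
    heapLimitMB: Math.round(stats.heap_size_limit / MB),
    // Heap currently in use and currently committed.
    heapUsedMB: Math.round(usage.heapUsed / MB),
    heapTotalMB: Math.round(usage.heapTotal / MB),
    // Resident set size: what the container's memory accounting sees.
    rssMB: Math.round(usage.rss / MB),
  };
}

console.log(heapReport());
```

If `heapLimitMB` turns out to be well below the container's memory, raising the container limit alone would not be expected to change the crash point; Node's `--max-old-space-size` flag is the usual knob for the heap ceiling. That would only delay the failure if the underlying cause is a leak.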
Running git diff -w --stat 1.97.2..1.97.3 server shows relatively few changes (about 100 lines added and removed). However, because the backend services also pull in app/sdk (and others), the culprit change may lie outside server.
Memory usage for the last three days looks like this:
The drop towards the end was caused by increasing the available memory, which lowers the utilization rather than the amount of memory actually used.
This lines up with the work on replays here: #163 #164
Before this, the last deployment was on 10/16 at ~10AM UTC, so these PRs could also be involved: #157 #158 #160 #161 #162
We can revert these one at a time, rebuilding a hotfix container locally and testing each one, to find the change responsible.
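To make the per-revert test comparable across builds, one option is to run the same workload against each hotfix build and record how much heap remains in use afterwards. The following is only a sketch under stated assumptions: `sampleHeap` and `heapGrowthMB` are hypothetical helpers, and the workload is a placeholder for whatever API or Worker code path is suspected.

```javascript
const MB = 1024 * 1024;

// Hypothetical helper: run `workload` `rounds` times and record heap usage (MB)
// after each round. Forces a GC first when Node was started with --expose-gc,
// so that the samples reflect retained memory rather than pending garbage.
async function sampleHeap(workload, rounds) {
  const samples = [];
  for (let i = 0; i < rounds; i++) {
    await workload(i);
    if (typeof global.gc === 'function') global.gc();
    samples.push(process.memoryUsage().heapUsed / MB);
  }
  return samples;
}

// Hypothetical helper: growth between the first and last sample, in MB.
function heapGrowthMB(samples) {
  if (samples.length < 2) return 0;
  return samples[samples.length - 1] - samples[0];
}

// Placeholder workload; replace with a call into the suspected code path.
sampleHeap(async () => {}, 5).then((samples) => {
  console.log('heap growth (MB):', heapGrowthMB(samples).toFixed(2));
});
```

A build whose growth keeps climbing with the number of rounds, while the build with one PR reverted stays flat, would point at that PR. Running with `node --expose-gc` makes the numbers less noisy.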