You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Once a session is restarted, Backend.AI performs the followings:
In the Agent:* Destroys the container.
Recreates the container reusing the prior allocation map and kernel resource spec.
In the Manager:* Just updates the kernel/session status (RUNNING → RESTARTING → RUNNING).
Though, the restarted session is broken:
Connecting to in-container apps does not work, via both AppProxy and manager's own proxy like ./backend.ai app command.* Probably because the port mapping is changed but not propagated to Manager (and App Proxy).
Once a session is restarted, Backend.AI performs the followings:
In the Agent:* Destroys the container.
Recreates the container reusing the prior allocation map and kernel resource spec.
In the Manager:* Just updates the kernel/session status (RUNNING → RESTARTING → RUNNING).
Though, the restarted session is broken:
Connecting to in-container apps does not work, via both AppProxy and manager's own proxy like
./backend.ai app
command.* Probably because the port mapping is changed but not propagated to Manager (and App Proxy).GPU config env-vars (Add intrinsic GPU config env-vars for AI runtimes #3253) have empty values.
The
CID
field in/home/config/resource.txt
gets duplicated after restarted. (Refactor container preparation step #3266)maybe more?
Due to these issues, we need to restart the model service process in feat(BA-441): restart model service process #3282, instead of cleanly restarting the container.
For future work, we may need to consider:
The text was updated successfully, but these errors were encountered: