Make it easier to add custom backends (implementation and documentation)
Research
Important
Research and feedback are required!
Metrics: Research whether dstack should collect certain metrics out of the box (at least hardware utilization) or if it should be integrated with more enterprise-grade tools.
Fault-tolerant training: Research how dstack can be used for fault-tolerant training of massive models.
This issue outlines the major items planned for Q3 2024.
For major bugs, see the issues labeled major.
Core features
dstack apply
dstack apply should be in the attached mode by default for any configuration type #1571
… --detach anymore with dstack apply #1671
… dstack apply #1326
… resources with the gateway configuration type #1664
Supported architectures
Examples
Important
Community help is welcome!
Improvements
[Feature]: Support H100 with AWS #1238
[Feature]: Support H100 with GCP #1240
[Feature]: Support H100 with Azure #1239
… g6.* instance type) #1235
dstack server must collect logs from gateways and instances #1673