Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

integrate non-node resources into flux resource tools #6477

Open
garlick opened this issue Dec 5, 2024 · 1 comment
Open

integrate non-node resources into flux resource tools #6477

garlick opened this issue Dec 5, 2024 · 1 comment

Comments

@garlick
Copy link
Member

garlick commented Dec 5, 2024

Problem: although rabbits (c.f. flux-coral2) can be drained, they have no visibility in flux native tools.

For example, flux resource status and flux resource drain do not list the state of rabbits.

We should think about how to integrate non-node resources into the core toolset and perhaps the resource module. This may also involve monitoring for availability.

Related rabbit integration issues:

@jameshcorbett
Copy link
Member

This flux-coral2 issue flux-framework/flux-coral2#229 is relevant.

For rabbits, I'm just spitballing but I'm imagining something like the following.

$ flux resource list --extras
     STATE QUEUE  NNODES NCORES NGPUS NODELIST
      free pci         3    192    24 tioga[14,16-17]
 allocated pci         1     64     8 tioga15
 allocated pdebug     18   1152   144 tioga[18-21,23-24,28-31,33-39,41]
 allocated mi300a      2    192     8 tioga[42-43]
      down pdebug      6    384    48 tioga[22,25-27,32,40]
 
RABBIT
    STATE NODELIST       CAPACITY
    up       rabbit[1-4,7]    1058
    down  rabbit[5-6]       0

where the --extras flag triggers some plugin, in this case a flux-coral2 plugin, to return some custom fields. I was imagining that the flux-coral2 plugin could send an RPC to a service that has this information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants