Cgroups: check for cpuset.cpus before set the value #254

HuKeping · 2015-09-08T13:05:33Z

Suppose we take some CPUs offline and then bring them back online. Because
there is no mechanism that lets docker refresh the state of the CPUs,
no container started afterwards will ever be able to use those CPUs again.

So we should always check whether the set of CPUs has changed before
we assign it to the container.

Signed-off-by: Hu Keping <hukeping@huawei.com>

HuKeping · 2015-09-08T13:10:40Z

Here is an example:

Assume my host has 4 CPUs. At first, the following works fine:

docker run -ti --name hkp_ubuntu --cpuset-cpus=0-3 ubuntu bash

Then I take CPU1 offline and bring it back online:

echo 0 > /sys/devices/system/cpu/cpu1/online

echo 1 > /sys/devices/system/cpu/cpu1/online

Now check cpuset.cpus of the docker cgroup:

cat /sys/fs/cgroup/cpuset/docker/cpuset.cpus
0,2-3

Then run the same docker command again, and an error occurs:

Error response from daemon: Cannot start container 0abdc40d8a96d66318340b863cff56177e4958540839440fb25edb8a57430c51: [8] System error: write /sys/fs/cgroup/cpuset/docker/0abdc40d8a96d66318340b863cff56177e4958540839440fb25edb8a57430c51/cpuset.cpus: invalid argument

This is because when we shut down CPU1, the cgroup system updates cpuset.cpus in all sub-cgroups, but when CPU1 comes back it does not do the same thing.
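
The mismatch can be checked without touching /sys. The cpuset documentation requires a child's CPUs to be a subset of its parent's, so a request for 0-3 cannot fit under a parent that now holds 0,2-3. The following is a minimal sketch of that comparison, not runc or kernel code; the helper names `parse_cpu_list` and `fits_in_parent` are hypothetical and only operate on strings in the kernel's list format.

```python
def parse_cpu_list(text):
    """Expand a kernel-style CPU list such as "0,2-3" into a set of ints."""
    cpus = set()
    for part in text.strip().split(","):
        if not part:
            continue  # an empty string means "no CPUs"
        first, _, last = part.partition("-")
        cpus.update(range(int(first), int(last or first) + 1))
    return cpus


def fits_in_parent(requested, parent):
    """Return True if every requested CPU is also present in the parent."""
    return parse_cpu_list(requested) <= parse_cpu_list(parent)


# Values taken from the example: the docker cgroup lost CPU1 and kept "0,2-3".
print(fits_in_parent("0-3", "0,2-3"))   # False
print(fits_in_parent("0,2", "0,2-3"))   # True
print(sorted(parse_cpu_list("0-3") - parse_cpu_list("0,2-3")))  # [1]
```

The last line shows which CPU is missing from the parent, which is the one that was taken offline and brought back.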


hqhq · 2015-09-09T01:52:39Z

I don't think it's a bug.

For cgroup, it's reasonable that cgroup updates cpuset.cpus of all sub-cgroups when some CPUs go offline, because those CPUs are no longer available. It's also reasonable that cgroup doesn't do the same when the CPUs come back, because cgroup doesn't know whether those cpuset.cpus values were assigned by the user or changed by cgroup itself (except for the root cgroup, which should always enable all CPUs). So I think it's intended. Ping @lizf-os (kernel cgroup and cpuset maintainer), can you confirm that?

For the same reason, a Docker container can't expand cpuset.cpus for its parent, because Docker doesn't know whether the value was assigned by the user or changed by a CPU going offline.

HuKeping · 2015-09-09T02:59:14Z

Isn't it weird that all the CPUs are available but docker --cpuset-cpus fails?

hqhq · 2015-09-09T03:52:46Z

It is weird in your scenario, but think about another scenario: if I set the docker cgroup to use only CPUs 0-1, then a container with CPUs 0-4 could still start and change the CPU config of its parent, just because the root cgroup is configured as 0-4. Isn't that even weirder?

I think the solution would be on the kernel side: let the kernel record every change to cpuset.cpus. If a value was changed by cgroup itself when CPUs were shut down, and was never changed again before the CPUs came back up, then cgroup should change that cpuset.cpus back. Not sure if that's worth doing.
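
The bookkeeping proposed here can be illustrated with a toy model. This is only a sketch of the idea as described in the comment, not kernel code; the `Cpuset` class and its method names are hypothetical.

```python
class Cpuset:
    """Toy model of one cgroup's cpuset.cpus (hypothetical, not kernel code)."""

    def __init__(self, cpus):
        self.cpus = set(cpus)
        self.dropped_by_hotplug = set()  # CPUs removed automatically on offline

    def write(self, cpus):
        # An explicit write by the user cancels any pending automatic restore.
        self.cpus = set(cpus)
        self.dropped_by_hotplug.clear()

    def cpu_offline(self, cpu):
        if cpu in self.cpus:
            self.cpus.discard(cpu)
            self.dropped_by_hotplug.add(cpu)

    def cpu_online(self, cpu):
        # Restore only what hotplug itself took away.
        if cpu in self.dropped_by_hotplug:
            self.dropped_by_hotplug.discard(cpu)
            self.cpus.add(cpu)


docker = Cpuset({0, 1, 2, 3})
docker.cpu_offline(1)
docker.cpu_online(1)
print(sorted(docker.cpus))  # [0, 1, 2, 3]

pinned = Cpuset({0, 1, 2, 3})
pinned.cpu_offline(1)
pinned.write({0, 2})        # the user deliberately excludes CPU1 while it is offline
pinned.cpu_online(1)
print(sorted(pinned.cpus))  # [0, 2]
```

In the first case the CPU comes back because nothing else touched the value. In the second the user's explicit choice wins, which is the distinction the comment says cgroup cannot make today.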

HuKeping · 2015-09-09T04:03:12Z

The docker cgroup is set up when the docker daemon starts, and the daemon uses all the available CPUs. So there is no scenario in which the docker cgroup is set to use only CPUs 0-1.

Besides, even if you set the docker cgroup to use only CPUs 0-1 with some script like

echo 0-1 | sudo tee /sys/fs/cgroup/cpuset/docker/cpuset.cpus

What if we turn CPU1 off and then on? All the following containers will never have a chance to use CPU1 again.

lizf-os · 2015-09-09T04:06:10Z

This is a long-standing issue. It has been fixed, but only for the unified hierarchy, which isn't supported yet by docker or any distro.
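
For context, in current kernels the unified hierarchy (cgroup v2) keeps the requested CPUs and the CPUs actually granted in separate files, cpuset.cpus and cpuset.cpus.effective, so a hotplug event changes the effective set without overwriting the request. These file names come from today's cgroup v2 interface, not from this thread. The sketch below only reads the two files; the function name `read_cpusets` and whatever directory you pass to it are assumptions.

```python
from pathlib import Path


def read_cpusets(cgroup_dir):
    """Return (requested, effective) CPU lists for a cgroup v2 directory.

    Assumes the cpuset controller is enabled for that cgroup, so that both
    files exist; otherwise read_text() raises FileNotFoundError.
    """
    base = Path(cgroup_dir)
    requested = (base / "cpuset.cpus").read_text().strip()
    effective = (base / "cpuset.cpus.effective").read_text().strip()
    return requested, effective
```

If the two values differ, the cgroup is currently getting fewer CPUs than were requested, for example because a CPU is offline or the parent is more restrictive.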

HuKeping · 2015-09-09T04:12:13Z

Thanks @lizf-os, but I care more about Docker's ability to discover the CPUs that are really available.

hqhq · 2015-09-09T04:13:01Z

> Besides, even if you set the docker cgroup to use only CPUs 0-1 with some script like

Yes, this is the usage, and I know a lot of people use it this way (from before Docker supported this usage itself).

> What if we turn CPU1 off and then on? All the following containers will never have a chance to use CPU1 again.

That would be a problem, but there is nothing you can do on the Docker side; your PR would break things, as I said before.

As @lizf-os said, it's fixed in the kernel but only for the unified hierarchy. Why can't this be fixed for multiple hierarchies as well?

lizf-os · 2015-09-09T06:13:39Z

Because we don't want people to stick with legacy hierarchies forever, all new development has been done for the unified hierarchy only.

crosbymichael · 2015-09-14T18:23:28Z

Yeah, I'm not sure this is a problem we can solve at this level.

mrunalp · 2015-09-18T19:32:45Z

Yep, this should be solved at a higher level.

GordonTheTurtle added the status/0-triage label Sep 8, 2015

HuKeping force-pushed the master branch from 26547ec to 5c1a661 Compare September 8, 2015 13:31

HuKeping mentioned this pull request Sep 8, 2015

check for cpuset.cpus before set the value moby/moby#16141

Closed

crosbymichael closed this Sep 18, 2015

hqhq mentioned this pull request Oct 18, 2016

Add CPU hotplug support moby/moby#27453

Open

ddingel mentioned this pull request Oct 18, 2016

Add CPU hotplug support #1119

Closed

stefanberger pushed a commit to stefanberger/runc that referenced this pull request Sep 8, 2017

Merge pull request opencontainers#254 from hqhq/hq_fix_golint

8d66fdd

Fix golint warnings
