
fix mount propagation on rootfs for containerized node #13327

Merged: 1 commit into openshift:master on Mar 21, 2017

Conversation

@sjenning (Contributor) commented Mar 9, 2017

xref https://bugzilla.redhat.com/show_bug.cgi?id=1427807

For the containerized node, the host's rootfs is mounted at /rootfs in the node container. Back in the docker 1.10.3 days, the default propagation mode was rslave; however, in 1.12 it is rprivate.

This creates a problem when the node process nsenters the host mount namespace to mount volumes. If the node service is restarted (i.e. the container is stopped, removed, then started again), any volume mount points on the host are mounted rprivate under /rootfs in the node container. When any pods using the volumes are deleted, the node removes the mount point on the host, but the corresponding /rootfs mount point in the node container is not updated, because it is rprivate, and this prevents the volume from detaching with a "device is busy" error.

This PR enforces rslave on the /rootfs volume so that volume detach can complete.
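For anyone wanting to verify which mode a mount actually got, the propagation state is visible in the optional fields of /proc/self/mountinfo. A minimal sketch, where the sample line is illustrative and stands in for the real /rootfs entry on an affected host (a mount can carry both shared: and master: fields; this sketch keeps it simple):

```shell
# Propagation shows up in the optional fields (between field 6 and the
# "-" separator) of /proc/self/mountinfo: "shared:N" means shared,
# "master:N" means slave, and neither means private.
# Sample entry standing in for: grep ' /rootfs ' /proc/self/mountinfo
line='652 651 253:0 / /rootfs rw,relatime master:1 - xfs /dev/mapper/root rw'
case "$line" in
  *" shared:"*) mode=shared ;;
  *" master:"*) mode=slave ;;
  *)            mode=private ;;
esac
echo "/rootfs propagation: $mode"
```

With the fix from this PR, the /rootfs entry inside the node container should carry a master: field (slave) rather than no propagation field at all (private).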

@derekwaynecarr @eparis @rhatdan @gnufied @chao007 @wongma7

@derekwaynecarr (Member)

@csrwng -- do you do the same w/ oc cluster up?

@csrwng (Contributor) commented Mar 9, 2017

@derekwaynecarr we currently mount the volumes dir with :rslave, but do not specify a propagation mode for /rootfs. Inspecting my local RHEL machine (running docker-common-1.12.5-14.el7.x86_64), the /rootfs mount is set to rprivate propagation mode. So we have two mounts:

-v /var/lib/origin/openshift.local.volumes:/var/lib/origin/openshift.local.volumes:rslave
-v /:/rootfs (which ends up being :rprivate)

@eparis (Member) commented Mar 9, 2017

@sdodson does this need to be fixed somewhere in ansible too? or does ansible get it from here?

@csrwng (Contributor) commented Mar 9, 2017

Sounds like the same change needs to be made in the cluster up code.
https://github.com/openshift/origin/blob/master/pkg/bootstrap/docker/openshift/helper.go#L252

@rhvgoyal commented Mar 9, 2017

Shouldn't the container first exit/stop before we try to detach the volume?

@sdodson (Member) commented Mar 9, 2017

@giuseppe Also, need to check for this on system containers

@derekwaynecarr (Member)

I want to make sure we all think this is the right fix. @pmorie, can you weigh in as the resident propagation-mode expert?

@pmorie (Contributor) commented Mar 9, 2017

We should probably have specified this in the systemd unit file always instead of relying on it being the default. This change LGTM.
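A sketch of what specifying this in the unit file might look like. Everything here other than the `-v /:/rootfs:rslave` flag (the unit name, image, and surrounding options) is hypothetical, not the actual origin-node service:

```ini
# Hypothetical origin-node.service fragment; only the
# -v /:/rootfs:rslave flag reflects the change in this PR.
[Service]
ExecStart=/usr/bin/docker run --rm --name origin-node \
  --privileged --pid=host --net=host \
  -v /:/rootfs:rslave \
  openshift/node
```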

@giuseppe (Member) commented Mar 9, 2017

@sdodson thanks for tagging me. For the system container I am already using rslave for the rootfs propagation: https://github.com/openshift/origin/blob/master/images/node/system-container/config.json.template#L318

@gnufied (Member) commented Mar 9, 2017

Why do we need to mount all of "/" inside the container? Don't we just need "/proc"?

@sjenning (Contributor, Author) commented Mar 9, 2017

@gnufied it needs at least /rootfs/sys and /rootfs/var/run, which are explicitly checked.

https://github.com/openshift/origin/blob/master/pkg/cmd/server/kubernetes/node.go#L75-L85

Strangely, /rootfs/proc is not listed, but it is definitely used to enter the host mount namespace.

Since we use `--pid=host` on the docker run line, we could probably just `mount -t proc none /proc` inside the container rather than using the one from the host, but that is a different discussion.
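A sketch of that alternative. This requires a privileged container and is not meant to be run outside one; the point is only that with a shared host PID namespace, a container-local procfs still resolves to host processes:

```shell
# Inside a container started with --pid=host, mounting a fresh procfs
# yields a /proc that reflects the host PID namespace, so paths like
# /proc/1/ns/mnt used for nsenter still refer to host processes,
# without needing the bind-mounted /rootfs/proc.
mount -t proc none /proc
```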

@derekwaynecarr (Member)

[merge]

@sjenning (Contributor, Author)

Merge failed on openshift/openshift-ansible#3603. Can someone re-merge this?

@eparis (Member) commented Mar 16, 2017

[merge] because of the flake he pointed out

@sjenning (Contributor, Author) commented Mar 16, 2017

@stevekuznetsov last merge run failed at

13:44:37 TASK [validate-public : Validate the public address] ***************************
13:44:37 Thursday 16 March 2017  17:44:37 +0000 (0:00:00.379)       0:16:18.595 ******** 
13:44:37 
fatal: [localhost]: FAILED! => {"changed": false, "content": "", "failed": true, "msg": "Status code was not [200]: An unknown error occurred: ''", "redirected": false, "status": -1, "url": "https://api.prtest-5a37c28-323.origin-ci-int-gce.dev.rhcloud.com:443/healthz/ready"}

Any help? Not sure if this is a flake or would happen again if someone bumped it.

@sdodson (Member) commented Mar 16, 2017

flake openshift/origin-gce#15

@stevekuznetsov (Contributor)

Flake is openshift/origin-gce#15

@sjenning (Contributor, Author)

@sdodson @stevekuznetsov thanks. Can I get a merge again?

@eparis (Member) commented Mar 20, 2017

[merge] again

@openshift-bot (Contributor)

[Test]ing while waiting on the merge queue

@openshift-bot (Contributor)

Evaluated for origin test up to c9174c2

@openshift-bot (Contributor)

continuous-integration/openshift-jenkins/test SUCCESS (https://ci.openshift.redhat.com/jenkins/job/test_pull_request_origin/327/) (Base Commit: 612dcfb)

@sjenning (Contributor, Author)

flaaakes
#13183
#12544

@eparis (Member) commented Mar 20, 2017

you heard him, we [merge] after flakes!

@eparis (Member) commented Mar 21, 2017

I really should write myself a bot to auto re-tag [merge] on these things.
https://ci.openshift.redhat.com/jenkins/job/merge_pull_request_origin/154/

@eparis (Member) commented Mar 21, 2017

I filed #13485
[merge] while I look for the other.

@openshift-bot (Contributor)

Evaluated for origin merge up to c9174c2

@openshift-bot (Contributor)

continuous-integration/openshift-jenkins/merge FAILURE (https://ci.openshift.redhat.com/jenkins/job/merge_pull_request_origin/167/) (Base Commit: 0343989)

@eparis (Member) commented Mar 21, 2017

I'm done with this PR.

@eparis eparis merged commit 1e2d71b into openshift:master Mar 21, 2017
@stevekuznetsov (Contributor)

We don't have a full Ansible containerized deployment test on the merge queue for this... so @sdodson, FYI, this may impact you guys?

@sdodson (Member) commented Mar 22, 2017

Yes, the bug actually gets fixed in the installer anyway:
openshift/openshift-ansible#3727 master
openshift/openshift-ansible#3728 release-1.5
openshift/openshift-ansible#3729 release-1.4

@stevekuznetsov (Contributor)

Nice
