Container creation fails because of "Failed create pod sandbox" #17047

Closed
Ocimum-basilicum opened this issue Oct 26, 2017 · 35 comments
Assignees
Labels
component/kubernetes kind/bug Categorizes issue or PR as related to a bug. lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. priority/P1

Comments

@Ocimum-basilicum

Pods are not getting created anymore

Version

oc v3.6.173.0.7
kubernetes v1.6.1+5115d708d7
features: Basic-Auth

Server https://api.starter-ca-central-1.openshift.com:443
openshift v3.7.0-0.143.7
kubernetes v1.7.0+80709908fd

Steps To Reproduce
  1. create an application (e.g. redis (persistent) from the catalog; see the sketch after these steps)
  2. check pod/container creation
  3. wait for timeouts
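
For reference, a rough sketch of the reproduction with the oc CLI; the redis-persistent template name is my assumption about the catalog entry, not something verified against the cluster:

# create the application from the catalog template (assumed name: redis-persistent)
oc new-app --template=redis-persistent
# watch pod/container creation and wait for the timeouts described below
oc get pods -w
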
Current Result

Warning messages on the pod:
1:33:46 PM | Normal | Sandbox changed | Pod sandbox changed, it will be killed and re-created. 2 times in the last 5 minutes
1:33:42 PM | Warning | Failed create pod sand box | Failed create pod sandbox. 2 times in the last 5 minutes
--> pod is not created

The only real error I could grab was:
Failed kill pod | error killing pod: failed to "KillPodSandbox" for "c4c2ec61-ba29-11e7-8b2c-02d8407159d1" with KillPodSandboxError: "rpc error: code = 2 desc = NetworkPlugin cni failed to teardown pod \"redis-1-deploy_instantsoundbot\" network: CNI request failed with status 400: 'Failed to execute iptables-restore: exit status 4 (Another app is currently holding the xtables lock. Perhaps you want to use the -w option?\n)\n'"
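
For context, the quoted error is about contention on the xtables lock, and the message itself points at the -w/--wait flag. The actual fix mentioned later in this thread was rolled out on the cluster side; the lines below are only a sketch of what that flag does on a node, with /tmp/rules.v4 as a made-up example path:

# wait for the xtables lock instead of failing with "Another app is currently holding the xtables lock"
iptables-restore -w < /tmp/rules.v4
# the same wait flag exists for plain iptables invocations
iptables -w -L -n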

Expected Result

Pods should start up as they used to...

Additional Information

Couldn't get oc adm diagnostics working at the moment (the usual invocation is sketched below).
I guess it could have to do with the introduction of #15880
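
For anyone trying the same, the usual 3.x diagnostics invocation looks roughly like the sketch below; it generally needs cluster-admin, which is not available on the Online starter tier, so I could not verify it here:

# run the full diagnostics suite
oc adm diagnostics
# or only the network diagnostic, if the client and cluster support it
oc adm diagnostics NetworkCheck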

@tumido

tumido commented Oct 26, 2017

I'm facing the same in the starter-us-east-1.openshift.com environment. It's been unstable for a couple of days already...

@pweil- pweil- added component/kubernetes kind/bug Categorizes issue or PR as related to a bug. priority/P2 priority/P1 and removed priority/P2 labels Oct 26, 2017
@pweil-

pweil- commented Oct 26, 2017

/cc @jupierce

@sjenning
Contributor

This is a known issue that has a fix, and it is being rolled out to the starter clusters presently.

@skjolber

I think I'm facing the same issue at starter-ca-central-1.openshift.com:

10:04:10 AM | Normal | Deadline exceeded | Pod was active on the node longer than the specified deadline
10:00:03 AM | Normal | Sandbox changed | Pod sandbox changed, it will be killed and re-created. 14 times in the last 58 minutes
9:59:40 AM | Warning | Failed create pod sand box | Failed create pod sandbox. 14 times in the last 58 minutes

When will the fix finish rolling out?
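
In case it helps with triage, the events above can be pulled per pod with the oc CLI; a small sketch with the project and pod name as placeholders:

# recent events in the project, oldest first
oc get events -n <project> --sort-by=.lastTimestamp
# full event stream and status for one failing pod
oc describe pod <pod-name> -n <project>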

@14yannick

Facing the same issue; not possible to roll out anything on starter-ca-central-1.openshift.com. Hope it will be fixed soon.

@osamahassan245

osamahassan245 commented Oct 30, 2017

I got the same issue. I tried to create an application using Tomcat 8 and to build the source code at this path:
https://github.com/osamahassan245/samplepp

I got a build error; when I tried to check the log, I got this:

container "sti-build" in pod is not available

@Artod

Artod commented Oct 30, 2017

Same problem on starter-ca-central-1.openshift.com

error streaming logs from build pod: sii-test/app-5-build container: , container "sti-build" in pod "app-5-build" is not available

@osamahassan245

Issue solved. I tried to use "Red Hat JBoss Web Server 3.1 Tomcat 8 1.0", and it's working fine now.

@edevyatkin

The issue is still present on starter-us-east-1.openshift.com

@izderadicka

Still a problem on ca-central.

@brianHollingsworth

Glad I'm not the only one seeing this issue. It's been occurring for me on console.starter-us-west-1.openshift.com since last weekend (11/4).

@sothawo

sothawo commented Nov 12, 2017

still seeing this on starter-ca-central-1.openshift.com

@mavajsunco

I have the same issue: error streaming logs from build pod: mavajsunco-website/mavajsunco-msc-6-build container: , container "sti-build" in pod "mavajsunco-msc-6-build" is not available

@axl8713

axl8713 commented Nov 21, 2017

Same issue deploying rhscl/mysql-57-rhel7 on starter-us-east-1.

@sjenning sjenning assigned dcbw and unassigned sjenning Nov 29, 2017
@sjenning
Contributor

@dcbw this is the all too familiar iptables-restore issue. You are closer to this than I am and hopefully can provide better feedback about the progress.

@warmchang
Contributor

👍

@nevadascout

Still having this problem on starter-us-west-2.

I've got 7 failed deployments in a row for this error message.

@skoorupa

skoorupa commented Feb 4, 2018

^same

@DanyC97
Contributor

DanyC97 commented Feb 21, 2018

@dcbw @sjenning any input as to where the issue might be?

@jamestenglish

Seeing this on pro-us-east-1

@jherson

jherson commented Feb 22, 2018

Seeing this for the last couple of days on pro-us-east-1 as well

@shreyasgombi

Same here!!! Observing on pro-us-east-1.

@saurabhdevops

Hey folks! Any update on this one? Do you have a fix already in the openshift or openshift-ansible repos that I can pick up? Is there a temporary workaround for this issue? We are facing the same issue with our OpenShift cluster on AWS.

Version
OpenShift Master:
v3.7.0+7ed6862
Kubernetes Master:
v1.7.6+a08f5eeb62

@saurabhdevops

@pweil-, @jupierce, are you still looking into this issue? Is there any progress or a workaround available?

@pweil-

pweil- commented Mar 26, 2018

@dcbw @knobunc ping

@agajdosi

I am facing a similar issue using OCP v3.9.30 with CDK. In my case I have Che deployed on OpenShift, and when I start a new workspace, its node crashes with Sandbox changed events:

11:52:32 AM 	Normal 	Killing  	Killing container with id docker://container:Need to kill Pod
11:52:30 AM 	Normal 	Sandbox Changed  	Pod sandbox changed, it will be killed and re-created.
11:52:28 AM 	Normal 	Started  	Started container
11:52:28 AM 	Normal 	Created  	Created container

Is there any update on this issue @dcbw?

@14yannick

14yannick commented Jun 12, 2018

I used OpenShift for more than 5 years. Spent a lot of time making my app run on v2 again. In the end, traffic was just not routed anymore. Moved to Heroku; it took me 2 hours to migrate all my data (db) and make the necessary source code changes. Since then, no more problems. Sorry, OpenShift.

@openshift-bot
Contributor

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot openshift-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 10, 2018
@openshift-bot
Contributor

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci-robot openshift-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Oct 10, 2018
@ghost

ghost commented Oct 31, 2018

Seeing this (or something similar) currently on OpenShift Online starter-us-west-1. Unable to build or deploy because of it. No logs from pods that have this issue. Status page says all green.

@jhaohai

jhaohai commented Nov 13, 2018

We still see this issue on OKD 3.7.1

@openshift-bot
Contributor

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

@openshift-ci-robot

@openshift-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@matthewmcneilly

matthewmcneilly commented May 10, 2019

I am seeing this issue, or something similar, when deploying the 3scale API Management Platform on OpenShift, in particular with system-sidekiq.

Failed create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox container "2cc1e1d064082f2a2b8cd7a10efb7d135a8a150e7d95fb7b939d6368e1717309" network for pod "system-sidekiq-6-deploy-debug": NetworkPlugin cni failed to set up pod "system-sidekiq-6-deploy-debug_mmcneilly-3scale-onprem" network: CNI request failed with status 400: 'pods "system-sidekiq-6-deploy-debug" not found '

Can this issue be reopened?
/reopen

@openshift-ci-robot

@matthewmcneilly: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
