[localhost] openshift_web_console : Verify that the web console is running #18569
Comments
@spadgett @tnozicka according to the controller logs, the replica set reached 1 replica and there is no indication of a deployment error. It would help if we can dump the Also perhaps dumping the webconsole container logs to see what is going on there might help.
I see this failure permanently blocking 3.7 #18524
Still looking into it, but could this new health check be failing? https://github.com/openshift/origin/pull/18411/files
I didn't see anything broken on the controller side for this in the master logs; it looks like the pod is there, just not up.
The webconsole container is crash looping.
We need the console container logs to debug. @stevekuznetsov do you know if the console container logs are available or if that is something we can capture? Typically the container would crash loop either because:
I'm considering disabling (2) and trying to determine if the TSB is enabled by looking at the service classes in the browser. cc @deads2k
I don't think it's the health check. The console was periodically flaking before that change. Would that result in CrashLoopBackOff status?
We shouldn't be trying to install the console in the 3.7 branch :( That will always fail.
You could also update to not fail if the check fails on a non-404. The error ought to be in your logs though.
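One reading of the non-404 suggestion, as a minimal Python sketch rather than the console server's actual code: treat a 404 from the TSB check as a definite "not enabled", and treat any other failure as "unknown" instead of exiting. The function name and the three outcome labels are hypothetical, chosen only for illustration.

```python
def classify_tsb_check(status_code):
    """Map the HTTP status of a (hypothetical) TSB discovery request to an outcome.

    Assumption: a 2xx response means the broker API is present, a 404 means it
    is definitely absent, and anything else is inconclusive and must not be fatal.
    """
    if 200 <= status_code < 300:
        return "enabled"
    if status_code == 404:
        return "disabled"
    # Non-404 failure (403, 500, 503, ...): report it, but do not crash.
    return "unknown"


if __name__ == "__main__":
    for code in (200, 404, 503):
        print(code, classify_tsb_check(code))
```

With this shape, a transient error from the API server during startup would be logged and retried later rather than putting the container into a crash loop.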
I plan to
The five PRs to change the TSB discovery are openshift/origin-web-catalog#642 It's not clear whether this is the cause of the flake without container logs, but it's a change we wanted to make anyway, and it makes things simpler.
We won't be able to confidently know we've fixed this or troubleshoot future problems without having the console pod logs when install fails. @stevekuznetsov can you help with that? /assign @stevekuznetsov
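For a crash-looping container, the useful logs are usually those of the previous, terminated instance. A minimal Python sketch of collecting them with `oc logs --previous` follows. The helper names are hypothetical, and the pod name and namespace are placeholders the caller must supply; this is not what the CI job or the installer currently does.

```python
import subprocess


def previous_logs_command(pod, namespace, container=None):
    """Build the oc argv for reading logs from the previous container instance."""
    cmd = ["oc", "logs", pod, "--namespace", namespace, "--previous"]
    if container:
        # Only needed when the pod has more than one container.
        cmd += ["--container", container]
    return cmd


def fetch_previous_logs(pod, namespace, container=None):
    """Run the command and return stdout plus stderr, without raising on failure."""
    result = subprocess.run(
        previous_logs_command(pod, namespace, container),
        capture_output=True,
        text=True,
        check=False,  # a failed lookup should still give us oc's error text
    )
    return result.stdout + result.stderr
```

Calling `fetch_previous_logs(pod, namespace)` from a failure handler would attach the crash output to the job artifacts, assuming `oc` is on the path and logged in to the cluster.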
I have an openshift-ansible PR open to log more details when console install fails:
@sdodson do we want to grab logs in the installer if this fails or do we want this only for the CI?
I'll defer to @sdodson but it might make sense just to add it to the installer. This would help anyone troubleshoot console install failures. For instance, if I set an incorrect
I don't see any instances of this in snowstorm in the last 48 hours. @mfojtik let me know if I'm looking at it correctly. Note that openshift/origin-web-console-server#31 recently merged, which could have fixed the problem. openshift/openshift-ansible#7132 is in the merge queue and should give us more detail if it happens again.
openshift/openshift-ansible#7108 also disables the console install for releases before 3.9. Closing this issue. /close
See: https://openshift-gce-devel.appspot.com//build/origin-ci-test/pr-logs/pull/18524/test_pull_request_origin_extended_conformance_install/7274/
Flaked 40 times in the last 10 days, marking as P0. The controller logs for the webconsole deployment: https://gist.github.com/mfojtik/4206e4d49253dba05137d0159345eef3