test_pull_request_origin_unit sometimes fails without an explanation or (obvious) reason #18497

Closed
tnozicka opened this issue Feb 7, 2018 · 13 comments
Assignees
Labels
area/tests lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. priority/P1 sig/continuous-delivery

Comments

@tnozicka
Contributor

tnozicka commented Feb 7, 2018

It says:

[WARNING] While the jUnit report found no failed tests, the `go test` process failed.
[WARNING] This usually means that the unit test suite failed to compile.
[ERROR] hack/test-go.sh exited with code 1 after 00h 36m 10s
make: *** [test-unit] Error 1
[ERROR] PID 12927: hack/lib/build/environment.sh:172: `return "${exitcode}"` exited with status 2.
[INFO] 		Stack Trace: 
[INFO] 		  1: hack/lib/build/environment.sh:172: `return "${exitcode}"`
[INFO] 		  2: hack/lib/build/environment.sh:281: os::build::environment::withsource
[INFO] 		  3: hack/env:42: os::build::environment::run
[INFO]   Exiting with code 2.
++ export status=FAILURE
++ status=FAILURE
+ set +o xtrace
########## FINISHED STAGE: FAILURE: RUN UNIT TESTS [00h 38m 35s] ##########

Started to see this here https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/18495/test_pull_request_origin_unit/9196/
Went through the S3 artifacts but couldn't see what failed :/ (Apparently something in the test image does.)

Then I saw the same just now with https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/18495/test_pull_request_origin_unit/9196/ from https://github.com/openshift/origin/pull/18495/files, which is very unlikely to stop the unit tests from compiling, given how small the change is and where it's located.

@tnozicka
Contributor Author

tnozicka commented Feb 7, 2018

/cc @stevekuznetsov

@0xmichalis 0xmichalis removed their assignment Feb 7, 2018
@tnozicka
Contributor Author

tnozicka commented Feb 7, 2018

/cc @mfojtik

@stevekuznetsov
Contributor

Yeah, I've noticed this as well but wasn't really sure what to do here... the process had no output to stderr and just exited non-zero. Wasn't really sure how to debug this in any way. @tnozicka any thoughts?

@tnozicka
Contributor Author

tnozicka commented Feb 7, 2018

Not really sure what that could be, I guess we hide the error somewhere. Can we run with bash -x for a while inside the image?

@stevekuznetsov
Contributor

We could, but I'm not sure that would help; the stderr redirect is below that level:

https://github.com/openshift/origin/blob/master/hack/test-go.sh#L140

@tnozicka
Contributor Author

tnozicka commented Feb 7, 2018

For some reason that error file is empty in the Jenkins artifacts. My thinking was that we would see the exact command that failed as well as its arguments; that might help uncover whatever is causing the flake, or not.
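
As an illustration of what tracing would surface, here is a minimal sketch, not taken from hack/test-go.sh. The function `run_tests` and its `--some-flag` argument are hypothetical stand-ins for a command that fails silently. With xtrace enabled, the shell writes each command and its arguments to stderr before running it, so the failing invocation is recorded even when the command itself prints nothing.

```shell
# Hypothetical stand-in for a test command that fails without printing anything.
run_tests() {
  return 2
}

# Run the command in a subshell with xtrace on and capture the trace,
# which the shell writes to stderr, into a variable.
trace="$( (set -x; run_tests --some-flag) 2>&1 )"
status=$?

echo "exit status: ${status}"
echo "trace:"
echo "${trace}"
```

The trace shows the command line that was executed, but it cannot explain why the command returned a non-zero status if the command does not report that itself.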

@stevekuznetsov
Contributor

It is empty because go test had no output to stderr

@tnozicka
Contributor Author

tnozicka commented Feb 7, 2018

Badly phrased; I meant the same thing. The question is why there is no error output but an exit status of 2.

bash -x was really all I had :(
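
To make the symptom concrete, here is a small sketch under stated assumptions: `silent_failure` is a hypothetical function standing in for the failing process, and the temporary file plays the role of the captured error file. It is not the logic from hack/test-go.sh. It shows that a non-zero exit status and an empty stderr capture can coexist, and how a wrapper could flag that combination explicitly.

```shell
# Hypothetical stand-in for a process that exits with status 2 and writes nothing.
silent_failure() {
  return 2
}

# Capture stderr in a temporary file, the way a wrapper script might.
err_file="$(mktemp)"
silent_failure 2>"${err_file}"
status=$?

# -s is true only when the file exists and is non-empty.
if [ "${status}" -ne 0 ] && [ ! -s "${err_file}" ]; then
  verdict="exit ${status} with empty stderr"
else
  verdict="exit ${status}"
fi

echo "${verdict}"
rm -f "${err_file}"
```

Flagging this case separately at least distinguishes "failed and said why" from "failed silently" in the CI log.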

@stevekuznetsov
Contributor

See also #17881

@openshift-bot
Contributor

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot openshift-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 15, 2018
@openshift-bot
Contributor

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci-robot openshift-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jun 14, 2018
@stevekuznetsov
Contributor

/close


6 participants