[Feature:Prometheus][Feature:Builds] Prometheus when installed to the cluster should start and expose a secured proxy and verify build metrics [Suite:openshift/conformance/paralle] #17694

enj · 2017-12-08T17:54:41Z

https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/17617/test_pull_request_origin_extended_conformance_gce/12491/

/go/src/github.com/openshift/origin/_output/local/go/src/github.com/openshift/origin/test/extended/prometheus/prometheus_builds.go:108
Expected
    <map[string]error | len:1>: {
        "openshift_build_active_time_seconds": {
            s: "query openshift_build_active_time_seconds for tests []prometheus.metricTest{prometheus.metricTest{labels:map[string]string{\"name\":\"frontend-1\"}, greaterThan:true, value:0, success:false}} had results {\"status\":\"success\",\"data\":{\"resultType\":\"vector\",\"result\":[{\"metric\":{\"__name__\":\"openshift_build_active_time_seconds\",\"instance\":\"10.142.0.2:8444\",\"job\":\"kubernetes-controllers\",\"name\":\"mydockertest-1\",\"namespace\":\"extended-test-build-valuefrom-w6mm6-dfgl8\",\"phase\":\"Running\"},\"value\":[1512582511.727,\"1512582414\"]}]}}",
        },
    }
to be empty
/go/src/github.com/openshift/origin/_output/local/go/src/github.com/openshift/origin/test/extended/prometheus/prometheus_builds.go:171

The text was updated successfully, but these errors were encountered:

bparees · 2017-12-08T17:57:18Z

@gabemontero is this another race where something else produced metrics in the cluster that we didn't expect to see in our test?

gabemontero · 2017-12-12T16:33:46Z

no it is something else @bparees

the test logic should have ignored the mydockertest-1 build that is in the result data in the error message in the description

and I see the frontend-1 build running and completing during the test's polling window

and I see a prometheus-0 pod up and running (though I thought prometheus was converted to a stateful set ... though maybe that entails a pod under the covers)

I think we need more debug in our ext test on the failure
a) perhaps dump the intermediate results scan results
b) and similar to our jenkins testing, dump the prometheus pod, and see if there was any issue with it scrapping for data from the build controller

gabemontero · 2017-12-12T17:34:55Z

confirmed the stateful set still maps to a pod (instantiated the example prometheus template manually)

bparees · 2017-12-12T18:32:06Z

(though I thought prometheus was converted to a stateful set ... though maybe that entails a pod under the covers)

it does. pods are the fundamental unit of any workload. (well, containers are, but you can't have a container w/o a pod)

Automatic merge from submit-queue (batch tested with PRs 17734, 17550, 17647, 17761, 17564). add debug for build prometheus extended test failures debug for #17694 @openshift/sig-developer-experience fyi / ptal

gabemontero · 2017-12-21T12:38:48Z

PR with fix merged ... not sure why bot did not close this.

bparees · 2017-12-21T16:49:53Z

@gabemontero
#17717 (comment)

you had the wrong issue referenced.

bparees · 2017-12-21T16:50:52Z

(well maybe not wrong but it looks like we had two open?)

miminar · 2018-01-19T07:34:51Z

Seen it here: https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/17783/test_pull_request_origin_extended_conformance_install/5804/

gabemontero · 2018-01-19T17:28:40Z

the failure in https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/17783/test_pull_request_origin_extended_conformance_install/5804/ was the same test as this issue is tracking, but the failure was different than the symptoms originally reported in this issue, and not related to the prometheus build stats being verified.

The sample app build we launched started and failed quickly (deps download glitch I believe), and the active stat query failed.

Separate from the original, precise symptoms which lead me to leave this issue open,
it does beg I question I've asked myself before ... automated testing for the active stat has been tricky, even when developing it, given the batching intervals for prometheus querying the buidl controller. How many flakes need to occur for the active stats to cry uncle and remove it?

Pondering ....

gabemontero · 2018-01-19T18:07:28Z

If I do delete the active build query, I'll do it in a separate item. Reclosing this issue per original symptom.

gabemontero · 2018-01-19T18:13:21Z

separate item - #18193

enj added the kind/test-flake Categorizes issue or PR as related to test flakes. label Dec 8, 2017

enj assigned bparees Dec 8, 2017

bparees assigned gabemontero and unassigned bparees Dec 8, 2017

bparees added component/build priority/P1 labels Dec 8, 2017

gabemontero mentioned this issue Dec 12, 2017

add debug for build prometheus extended test failures #17734

Merged

gabemontero mentioned this issue Dec 14, 2017

sync prometheus ext tests running in parallel #17717

Merged

gabemontero closed this as completed Dec 21, 2017

miminar reopened this Jan 19, 2018

miminar mentioned this issue Jan 19, 2018

Additional registry whitelisting #17783

Merged

gabemontero closed this as completed Jan 19, 2018

gabemontero mentioned this issue Jan 19, 2018

delete prometheus bld active bld metric test #18193

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature:Prometheus][Feature:Builds] Prometheus when installed to the cluster should start and expose a secured proxy and verify build metrics [Suite:openshift/conformance/paralle] #17694

[Feature:Prometheus][Feature:Builds] Prometheus when installed to the cluster should start and expose a secured proxy and verify build metrics [Suite:openshift/conformance/paralle] #17694

enj commented Dec 8, 2017

bparees commented Dec 8, 2017

gabemontero commented Dec 12, 2017

gabemontero commented Dec 12, 2017

bparees commented Dec 12, 2017

gabemontero commented Dec 21, 2017

bparees commented Dec 21, 2017

bparees commented Dec 21, 2017

miminar commented Jan 19, 2018 •

edited

Loading

gabemontero commented Jan 19, 2018

gabemontero commented Jan 19, 2018

gabemontero commented Jan 19, 2018

[Feature:Prometheus][Feature:Builds] Prometheus when installed to the cluster should start and expose a secured proxy and verify build metrics [Suite:openshift/conformance/paralle] #17694

[Feature:Prometheus][Feature:Builds] Prometheus when installed to the cluster should start and expose a secured proxy and verify build metrics [Suite:openshift/conformance/paralle] #17694

Comments

enj commented Dec 8, 2017

bparees commented Dec 8, 2017

gabemontero commented Dec 12, 2017

gabemontero commented Dec 12, 2017

bparees commented Dec 12, 2017

gabemontero commented Dec 21, 2017

bparees commented Dec 21, 2017

bparees commented Dec 21, 2017

miminar commented Jan 19, 2018 • edited Loading

gabemontero commented Jan 19, 2018

gabemontero commented Jan 19, 2018

gabemontero commented Jan 19, 2018

miminar commented Jan 19, 2018 •

edited

Loading