Failures of default liveness / readiness probes

Several times we have received reports of failing default liveness and readiness probes which leads to runners not reaching ready state:
```
  Normal   Created    3m2s                   kubelet            Created container k6
  Normal   Started    3m2s                   kubelet            Started container k6
  Warning  Unhealthy  2m34s (x7 over 3m1s)   kubelet            Readiness probe failed: Get "http://10.1.0.75:6565/v1/status": dial tcp 10.1.0.75:6565: connect: connection refused
  Warning  Unhealthy  2m34s (x3 over 2m54s)  kubelet            Liveness probe failed: Get "http://10.1.0.75:6565/v1/status": dial tcp 10.1.0.75:6565: connect: connection refused
  Normal   Killing    2m34s                  kubelet            Stopping container k6
```
Such error does not allow to start the test and the test is stuck (at the moment, but this part will likely change as result of https://github.com/grafana/k6-operator/issues/260).

Default probes are the same defaults that are set by Kubernetes, i.e. k6-operator does not add anything by itself. k6 HTTP starts pretty quickly for most tests and it should be well reachable with default settings normally. The most confounding part is that this failures seemed to have been reported while using simple and basic setups of Kubernetes, even local clusters.

So far it is not clear how to repeat this case. If someone knows a trick how to repeat this, please share in this issue.

cc @dgzlopes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Failures of default liveness / readiness probes #306

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Failures of default liveness / readiness probes #306

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions