Commit b9dcb58

validating changes
1 parent 2c57c65 commit b9dcb58

4 files changed: +158 -532 lines changed

cloudbuild/run-presubmit-on-k8s.sh

Lines changed: 14 additions & 4 deletions
@@ -28,7 +28,7 @@ spec:
 image: "${IMAGE}"
 resources:
   requests:
-    memory: "8G"
+    memory: "4G"
     cpu: "6000m"
 env:
   - name: COMMIT_SHA
@@ -66,11 +66,21 @@ kubectl wait --for=condition=Ready "pod/${POD_NAME}" --timeout=15m
 
 # To mitigate problems with early test failure, retry kubectl logs
 sleep 10s
-while ! kubectl describe "pod/${POD_NAME}" | grep -q Terminated; do
-  # Try to stream logs, but primary log capture is now in the trap
+while true; do
+  if ! kubectl describe "pod/${POD_NAME}" > /dev/null 2>&1; then
+    echo "Pod ${POD_NAME} not found, assuming it has been deleted."
+    break # Exit the loop if the pod doesn't exist
+  fi
+
+  if kubectl describe "pod/${POD_NAME}" | grep -q Terminated; then
+    echo "Pod ${POD_NAME} is Terminated."
+    break # Exit the loop if the pod is Terminated
+  fi
+
+  # Try to stream logs
   kubectl logs -f "${POD_NAME}" --since-time="${LOGS_SINCE_TIME}" --timestamps=true || true
   LOGS_SINCE_TIME=$(date --iso-8601=seconds)
-  sleep 2 # Short sleep to avoid busy waiting if logs -f exits
+  sleep 2
 done
 
 # Final check on the pod exit code
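
The rewritten loop runs `kubectl describe` twice per iteration: once to detect a deleted pod and once to look for `Terminated`. A possible consolidation is sketched below. It is not part of this commit; `pod_state` is a hypothetical helper name, and the sketch assumes, as the new loop does, that a non-zero exit from `kubectl describe` means the pod no longer exists.

```shell
# Sketch only: pod_state is a hypothetical helper, not part of the commit.
# It classifies a pod using a single `kubectl describe` call.
pod_state() {
  local pod_name="$1"
  local description

  # Assumption: any failure of `kubectl describe` means the pod is gone.
  if ! description="$(kubectl describe "pod/${pod_name}" 2>/dev/null)"; then
    echo "missing"
    return 0
  fi

  # Same text match the loop in the diff uses.
  if printf '%s\n' "${description}" | grep -q Terminated; then
    echo "terminated"
  else
    echo "running"
  fi
}
```

With such a helper, the loop body could branch once on the result, for example `case "$(pod_state "${POD_NAME}")" in missing|terminated) break ;; esac`, before falling through to `kubectl logs -f`.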

gpu/README.md

Lines changed: 2 additions & 2 deletions
@@ -26,7 +26,7 @@ Refer to internal arrays in `install_gpu_driver.sh` for the full matrix.)*
 
 CUDA | Full Version | Driver | cuDNN | NCCL | Tested Dataproc Image Versions
 -----| ------------ | --------- | --------- | -------| ---------------------------
-11.8 | 11.8.0 | 525.147.05| 9.5.1.17 | 2.21.5 | 2.0, 2.1 (Debian/Ubuntu/Rocky)
+11.8 | 11.8.0 | 525.147.05| 9.5.1.17 | 2.21.5 | 2.0, 2.1 (Debian/Ubuntu/Rocky); 2.2 (Ubuntu 22.04)
 12.0 | 12.0.1 | 525.147.05| 8.8.1.3 | 2.16.5 | 2.0, 2.1 (Debian/Ubuntu/Rocky); 2.2 (Rocky 9, Ubuntu 22.04)
 12.4 | 12.4.1 | 550.135 | 9.1.0.70 | 2.23.4 | 2.1 (Ubuntu 20.04, Rocky 8); Dataproc 2.2+
 12.6 | 12.6.3 | 550.142 | 9.6.0.74 | 2.23.4 | 2.1 (Ubuntu 20.04, Rocky 8); Dataproc 2.2+
@@ -324,4 +324,4 @@ handles metric creation and reporting.
   Debian-based systems, including handling of archived backports repositories
   to ensure dependencies can be met.
 * Tested primarily with Dataproc 2.0+ images. Support for older Dataproc
-  1.5 images is limited.
+  1.5 images is limited.
