This repository was archived by the owner on Oct 22, 2024. It is now read-only.

Distributed provisioning #838

Merged 24 commits on Jan 20, 2021

Commits
b5b4fb7
update runtime deps with more recent Intel-internal IDs
pohly Dec 7, 2020
5e68dc4
test: add command for dumping PVs and PVCs
pohly Nov 27, 2020
c1c81d8
setup-deployment.sh: better error message when files are missing
pohly Nov 5, 2020
14cb92e
pmem-csi-driver: remove redundant driver mode check
pohly Oct 30, 2020
64ccb96
pmem-csi-driver: support KUBECONFIG
pohly Nov 5, 2020
fb67626
pmem-csi-driver: refactor storage capacity metrics code
pohly Nov 5, 2020
2467c36
distributed provisioning
pohly Oct 30, 2020
cd67c2e
pmem-csi-driver: configurable QPS+burst
pohly Dec 8, 2020
390d993
Revert "test: disable stress test because of volume leaks"
pohly Dec 18, 2020
19d3f57
scheduler extension: simpler error reporting
pohly Dec 20, 2020
1ea911d
deploy: allow pod scheduling when webhook is down
pohly Dec 20, 2020
802c8b6
test: comment fix
pohly Jan 14, 2021
0d36389
deploy: consistent object naming, app.kubernetes.io labels
pohly Jan 14, 2021
4ea8803
operator: distributed provisioning, scheduler extensions
pohly Jan 12, 2021
61351f1
test: add delete-deployment.sh
pohly Jan 14, 2021
513b89b
test: run gotests with less verbose output
pohly Jan 15, 2021
4029bfd
operator test: avoid race around access to events
pohly Jan 17, 2021
ded3e36
operator: update to controller-runtime v0.8.0
pohly Jan 17, 2021
fbd56a0
test: copy initial operator test objects
pohly Jan 18, 2021
2bd9fd4
test: workaround for write/write map access in Patch
pohly Jan 18, 2021
e260ed1
test: fix operator event checking
pohly Jan 18, 2021
a03a700
deploy: use allowedTopologies in storage classes
pohly Jan 15, 2021
7c4ee86
deploy: use allowedTopologies in storage classes, II
pohly Jan 18, 2021
2b24963
deploy: reschedule PVCs on nodes with no PMEM-CSI driver
pohly Jan 19, 2021
deploy: allow pod scheduling when webhook is down
This is based on a risk assessment. There's no perfect solution.
For testing we still require the webhook to be up and running.
pohly committed Jan 17, 2021
commit 1ea911df4ec181f327a0f959675c17b05846f7fa
7 changes: 6 additions & 1 deletion deploy/kustomize/webhook/webhook.yaml
@@ -14,7 +14,12 @@ webhooks:
 - key: pmem-csi.intel.com/webhook
 operator: NotIn
 values: ["ignore"]
-failurePolicy: Fail
+# For pods with PMEM volumes using late binding, not activating the
+# scheduler extension is okayish (but see https://github.com/kubernetes-csi/external-provisioner/issues/544).
+# It's worse for CSI ephemeral volumes because there is no recovery.
+# Nonetheless, failing pod scheduling entirely when PMEM-CSI is down
+# seems worse, so we let the scheduler continue despite failures.
+failurePolicy: Ignore
 clientConfig:
 service:
 name: pmem-csi-scheduler
3 changes: 3 additions & 0 deletions test/setup-deployment.sh
@@ -239,6 +239,9 @@ EOF
 - op: replace
 path: /webhooks/0/clientConfig/service/namespace
 value: ${TEST_DRIVER_NAMESPACE}
+- op: replace
+path: /webhooks/0/failurePolicy
+value: Fail # This is not the default anymore in PMEM-CSI, but for testing we want it.
 EOF
 ;;
 esac
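
Note on the test override: the three lines added to test/setup-deployment.sh are a JSON patch operation written in YAML form. The following is a minimal sketch of generating that operation for either policy, assuming only the two values that Kubernetes accepts for failurePolicy (Fail and Ignore). The function name failure_policy_patch is hypothetical and is not part of setup-deployment.sh.

```shell
#!/bin/sh
# Hypothetical helper (not in the PR): print the JSON patch operation,
# in YAML form, that sets failurePolicy on the first webhook entry.
failure_policy_patch() {
    policy="$1"
    # Reject anything other than the two policies Kubernetes accepts.
    case "$policy" in
        Fail|Ignore) ;;
        *) echo "unsupported failurePolicy: $policy" >&2; return 1 ;;
    esac
    # Unquoted heredoc delimiter so that ${policy} is expanded.
    cat <<EOF
- op: replace
  path: /webhooks/0/failurePolicy
  value: ${policy}
EOF
}

# Example: failure_policy_patch Fail   # emits the override used for testing
```

Calling it with Ignore would emit the operation matching the new default in deploy/kustomize/webhook/webhook.yaml. How the fragment is wired into the kustomize patch is outside the lines shown in this diff.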