Add startup taint removal functionality to azurelustre CSI driver
The taint key follows the pattern: {driverName}/agent-not-ready
e.g., azurelustre.csi.azure.com/agent-not-ready
This allows users to apply startup taints to prevent scheduling before the CSI driver is ready,
addressing potential race conditions during node startup.
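
As a rough illustration of the key pattern above, the following Python sketch builds the taint key from a driver name. The helper names `startup_taint_key` and `startup_taint` are hypothetical and not part of the driver; the `NoSchedule` effect is taken from the documentation added in this commit, and the dictionary shape mirrors the `key`/`effect` fields of a Kubernetes node taint.

```python
def startup_taint_key(driver_name: str) -> str:
    """Build the startup taint key: {driverName}/agent-not-ready."""
    return f"{driver_name}/agent-not-ready"


def startup_taint(driver_name: str, effect: str = "NoSchedule") -> dict:
    """Return a taint shaped like an entry in a node's spec.taints list."""
    return {"key": startup_taint_key(driver_name), "effect": effect}


taint = startup_taint("azurelustre.csi.azure.com")
print(f"{taint['key']}:{taint['effect']}")
# prints "azurelustre.csi.azure.com/agent-not-ready:NoSchedule"
```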
docs/driver-parameters.md
23 additions & 0 deletions
@@ -4,6 +4,29 @@ These are the parameters to be passed into the custom StorageClass that users mu
 
 For more information, see the [Azure Managed Lustre Filesystem (AMLFS) service documentation](https://learn.microsoft.com/en-us/azure/azure-managed-lustre/) and the [AMLFS CSI documentation](https://learn.microsoft.com/en-us/azure/azure-managed-lustre/use-csi-driver-kubernetes).
 
+## CSI Driver Configuration Parameters
+
+These parameters control the behavior of the Azure Lustre CSI driver itself and are typically configured during driver installation rather than in StorageClass definitions.
+
+### Node Startup Taint Management
+
+Name | Meaning | Available Value | Default Value | Configuration Method
+--- | --- | --- | --- | ---
+remove-not-ready-taint | Controls whether the CSI driver automatically removes startup taints from nodes when the driver becomes ready. This ensures pods are only scheduled to nodes where the CSI driver is fully operational and Lustre filesystem capacity is available. Nodes should have a taint of the form: `azurelustre.csi.azure.com/agent-not-ready:NoSchedule` | `true`, `false` | `true` | Command-line flag `--remove-not-ready-taint` in driver deployment
+
+#### Startup Taint Details
+
+When enabled (default), the Azure Lustre CSI driver will:
+
+1. **Monitor Node Readiness**: Check if the CSI driver is fully initialized on the node
+2. **Remove Blocking Taint**: Automatically remove the `azurelustre.csi.azure.com/agent-not-ready:NoSchedule` taint when ready
+
+This mechanism prevents pods requiring Azure Lustre storage from being scheduled to nodes where:
+
+- Lustre kernel modules are not yet loaded
+- CSI driver components are not fully initialized
+- Network connectivity to Lustre filesystems is not established
+
 ## Dynamic Provisioning (Create an AMLFS Cluster through AKS) - Public Preview
 
 > **Public Preview Notice**: Dynamic provisioning functionality is currently in public preview. Some features may not be supported or may have constrained capabilities.
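
The startup-taint behavior documented in the hunk above can be sketched roughly as follows. This is an illustration in Python, not the driver's implementation: it works on a plain list of taint dictionaries instead of a Kubernetes client, `remove_startup_taint` is a hypothetical name, matching on the taint key alone is an assumption, and the diff does not show how the driver decides it is ready or how it patches the node.

```python
AGENT_NOT_READY_KEY = "azurelustre.csi.azure.com/agent-not-ready"


def remove_startup_taint(taints, driver_ready, remove_not_ready_taint=True):
    """Return the taints a node should carry after the startup check.

    taints: list of dicts shaped like entries in a node's spec.taints.
    driver_ready: result of a (hypothetical) driver readiness check.
    remove_not_ready_taint: models the --remove-not-ready-taint flag.
    """
    if not remove_not_ready_taint or not driver_ready:
        # Feature disabled or driver still starting: leave taints untouched.
        return list(taints)
    # Drop only the startup taint (assumed: match on key); keep all others.
    return [t for t in taints if t.get("key") != AGENT_NOT_READY_KEY]


node_taints = [
    {"key": AGENT_NOT_READY_KEY, "effect": "NoSchedule"},
    {"key": "example.com/dedicated", "value": "gpu", "effect": "NoSchedule"},
]
print(remove_startup_taint(node_taints, driver_ready=False))
print(remove_startup_taint(node_taints, driver_ready=True))
```

With the flag set to `false`, the function returns the taints unchanged even when the driver is ready, which corresponds to an operator choosing to manage the taint by other means.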
docs/install-csi-driver.md
20 additions & 0 deletions
@@ -121,3 +121,23 @@ The CSI driver deployment includes automated **exec-based readiness probes** for
 
 **Important**: The enhanced validation ensures the driver reports ready only when LNet is fully functional for Lustre operations. Wait for all CSI driver node pods to pass enhanced readiness checks before creating PersistentVolumes or mounting Lustre filesystems.
 
+## Startup Taints
+
+When the CSI driver starts on each node, it automatically removes the following taint if present: