You can manage process counts in either the database configuration or in the process counts in the cluster spec. In most of these examples, we will only manage process counts through the database configuration. This is simpler, and it ensures that the number of processes we launch fits the number of processes that we are telling the database to recruit.
To explicitly set process counts, you could configure the cluster as follows:
apiVersion: apps.foundationdb.org/v1beta2
kind: FoundationDBCluster
metadata:
name: sample-cluster
spec:
version: 7.1.26
processCounts:
storage: 6
log: 5
stateless: 4
This will configure 6 storage processes, 5 log processes, and 4 stateless processes. This is fewer stateless processes than we had by default, which means that some processes will be running multiple roles. This is generally something you want to avoid in a production configuration, as it can lead to high activity on one role starving another role of resources.
By default, the operator will provision processes with the following process types and counts:
storage
. Equal to the storage count in the database configuration. If no storage count is provided, this will be2*F+1
, whereF
is the desired fault tolerance. For a double replicated cluster, the desired fault tolerance is 1.log
. Equal to theF+max(logs, remote_logs)
. Thelogs
andremote_logs
here are the counts specified in the database configuration. By default,logs
is set to 3 andremote_logs
is set to either-1
orlogs
.stateless
. Equal to the sum of all other roles in the database configuration +F
. Currently, this ismax(proxies+resolvers+4, log_routers)
. The4
is for the master, cluster controller, data distributor, and ratekeeper processes. This may change in the future as we add more roles to the database. By default,proxies
is set to 3,grv_proxies
is set to 1,commit_proxies
is set to 2,resolvers
is set to 1, andlog_routers
is set to -1. If the version of FDB in the cluster is less than 7.0.0, the process counts will use the value of proxies set (or the default) and FDB will apply a ratio to how the individual proxy roles are distributed. If version of FDB is greater than 7.0.0 and the separated proxy counts are set while the oldproxies
are not, we will useproxies=sum(grv_proxies + commit_proxies)
.
You can also set a process count to -1 to tell the operator not to provision any processes of that type.
Instead of setting the process counts directly, let's update the counts of recruited roles in the database configuration:
apiVersion: apps.foundationdb.org/v1beta2
kind: FoundationDBCluster
metadata:
name: sample-cluster
spec:
version: 7.1.26
databaseConfiguration:
storage: 6
logs: 4 # default is 3
proxies: 5 # default is 3
resolvers: 2 # default is 1
This will provision 1 additional log process and 3 additional stateless processes. After launching those processes, it will change the database configuration to recruit 1 additional log, 2 additional proxies, and 1 additional resolver.
NOTE: If you add additional storage processes, it can take some time until the data is evenly distributed again.
You can shrink a cluster by changing the database configuration or process count, just like when we grew a cluster:
apiVersion: apps.foundationdb.org/v1beta2
kind: FoundationDBCluster
metadata:
name: sample-cluster
spec:
version: 7.1.26
databaseConfiguration:
storage: 4
The operator will determine which processes to remove and record them as needing removal in the processGroups
field in the cluster status. This will make sure the choice of removal stays consistent across repeated runs of the reconciliation loop. Once the processes are in the removal list, we will exclude them from the database, which moves all of the roles and data off of the process. Once the exclusion is complete, it is safe to remove the processes, and the operator will delete both the pods and the PVCs. Once the processes are shut down, the operator will re-include them to make sure the exclusion state doesn't get cluttered. It will also remove the process from the processGroups
list.
The exclusion can take a long time, and any changes that happen later in the reconciliation process will be blocked until the exclusion completes.
If one of the removed processes is a coordinator, the operator will recruit a new set of coordinators before shutting down the process.
Any changes to the database configuration will happen before we exclude any processes.
You can change the replication mode in the database by changing the field in the database configuration:
apiVersion: apps.foundationdb.org/v1beta2
kind: FoundationDBCluster
metadata:
name: sample-cluster
spec:
version: 7.1.26
databaseConfiguration:
redundancy_mode: triple
storage: 5
This will run the configuration command on the database, and may also add or remove processes to match the new configuration.
You can continue on to the next section or go back to the table of contents.