title | summary | category |
---|---|---|
Troubleshoot TiDB in Kubernetes |
Learn how to diagnose and resolve issues when you use TiDB in Kubernetes. |
how-to |
This document describes some common issues and solutions when you use a TiDB cluster in Kubernetes.
When a Pod is in the CrashLoopBackoff
state, the containers in the Pod quit continually. As a result, you cannot use kubectl exec
or tkctl debug
normally, making it inconvenient to diagnose issues.
To solve this problem, TiDB in Kubernetes provides the Pod diagnostic mode for PD, TiKV, and TiDB components. In this mode, the containers in the Pod hang directly after starting, and will not get into a state of repeated crash. Then you can use kubectl exec
or tkctl debug
to connect to the Pod containers for diagnosis.
To use the diagnostic mode for troubleshooting:
-
Add an annotation to the Pod to be diagnosed:
{{< copyable "shell-regular" >}}
kubectl annotate pod ${pod_name} -n ${namespace} runmode=debug
The next time the container in the Pod is restarted, it detects this annotation and enters the diagnostic mode.
-
Wait for the Pod to enter the Running state.
{{< copyable "shell-regular" >}}
watch kubectl get pod ${pod_name} -n ${namespace}
-
Start the diagnosis.
Here's an example of using
kubectl exec
to get into the container for diagnosis:{{< copyable "shell-regular" >}}
kubectl exec -it ${pod_name} -n ${namespace} -- /bin/sh
-
After finishing the diagnosis and resolving the problem, delete the Pod.
kubectl delete pod ${pod_name} -n ${namespace}
After the Pod is rebuilt, it automatically returns to the normal mode.
TiDB Operator uses PV (Persistent Volume) and PVC (Persistent Volume Claim) to store persistent data. If you accidentally delete a cluster using helm delete
, the PV/PVC objects and data are still retained to ensure data safety.
To restore the cluster at this time, use the helm install
command to create a cluster with the same name. The retained PV/PVC and data are reused.
{{< copyable "shell-regular" >}}
helm install pingcap/tidb-cluster -n ${release_name} --namespace=${namespace} --version=${version} -f values.yaml
After creating a cluster using helm install
, if the Pod is not created, you can diagnose it using the following commands:
{{< copyable "shell-regular" >}}
kubectl get tidbclusters -n ${namespace}
kubectl get statefulsets -n ${namespace}
kubectl describe statefulsets -n ${namespace} ${release_name}-pd
In a TiDB cluster, you can access most Pods by using the Pod's domain name (allocated by the Headless Service). The exception is when TiDB Operator collects the cluster information or issues control commands, it accesses the PD (Placement Driver) cluster using the service-name
of the PD service.
When you find some network connection issues between Pods from the log or monitoring metrics, or you find the network connection between Pods might be abnormal according to the problematic condition, you can follow the following process to diagnose and narrow down the problem:
-
Confirm that the endpoints of the Service and Headless Service are normal:
{{< copyable "shell-regular" >}}
kubectl -n ${namespace} get endpoints ${release_name}-pd kubectl -n ${namespace} get endpoints ${release_name}-tidb kubectl -n ${namespace} get endpoints ${release_name}-pd-peer kubectl -n ${namespace} get endpoints ${release_name}-tikv-peer kubectl -n ${namespace} get endpoints ${release_name}-tidb-peer
The
ENDPOINTS
field shown in the above command should be a comma-separated list ofcluster_ip:port
. If the field is empty or incorrect, check the health of the Pod and whetherkube-controller-manager
is working properly. -
Enter the Pod's Network Namespace to diagnose network problems:
{{< copyable "shell-regular" >}}
tkctl debug -n ${namespace} ${pod_name}
After the remote shell is started, use the
dig
command to diagnose the DNS resolution. If the DNS resolution is abnormal, refer to Debugging DNS Resolution for troubleshooting.{{< copyable "shell-regular" >}}
dig ${HOSTNAME}
Use the
ping
command to diagnose the connection with the destination IP (the ClusterIP resolved usingdig
):{{< copyable "shell-regular" >}}
ping ${TARGET_IP}
-
If the
ping
check fails, refer to Debugging Kubernetes Networking for troubleshooting. -
If the
ping
check succeeds, continue to check whether the target port is open by usingtelnet
:{{< copyable "shell-regular" >}}
telnet ${target_ip} ${target_port}
If the
telnet
check fails, check whether the port corresponding to the Pod is correctly exposed and whether the applied port is correctly configured:{{< copyable "shell-regular" >}}
# Checks whether the ports are consistent. kubectl -n ${namespace} get po ${pod_name} -ojson | jq '.spec.containers[].ports[].containerPort' # Checks whether the application is correctly configured to serve the specified port. # The default port of PD is 2379 when not configured. kubectl -n ${namespace} -it exec ${pod_name} -- cat /etc/pd/pd.toml | grep client-urls # The default port of PD is 20160 when not configured. kubectl -n ${namespace} -it exec ${pod_name} -- cat /etc/tikv/tikv.toml | grep addr # The default port of TiDB is 4000 when not configured. kubectl -n ${namespace} -it exec ${pod_name} -- cat /etc/tidb/tidb.toml | grep port
-
The Pending state of a Pod is usually caused by conditions of insufficient resources, such as:
- The
StorageClass
of the PVC used by PD, TiKV, Monitor Pod does not exist or the PV is insufficient. - No nodes in the Kubernetes cluster can satisfy the CPU or memory resources requested by the Pod
- The number of TiKV or PD replicas and the number of nodes in the cluster do not satisfy the high availability scheduling policy of tidb-scheduler
You can check the specific reason for Pending by using the kubectl describe pod
command:
{{< copyable "shell-regular" >}}
kubectl describe po -n ${namespace} ${pod_name}
-
If the CPU or memory resources are insufficient, you can lower the CPU or memory resources requested by the corresponding component for scheduling, or add a new Kubernetes node.
-
If the
StorageClass
of the PVC cannot be found, changestorageClassName
in thevalues.yaml
file to the name of theStorageClass
available in the cluster; runhelm upgrade
; and delete Statefulset and the corresponding PVCs. Run the following command to get theStorageClass
available in the cluster:{{< copyable "shell-regular" >}}
kubectl get storageclass
-
If a
StorageClass
exists in the cluster but the available PV is insufficient, you need to add PV resources correspondingly. For Local PV, you can expand it by referring to Local PV Configuration. -
tidb-scheduler has a high availability scheduling policy for TiKV and PD. For the same TiDB cluster, if there are N replicas of TiKV or PD, then the number of PD Pods that can be scheduled to each node is
M=(N-1)/2
(if N<3, then M=1) at most, and the number of TiKV Pods that can be scheduled to each node isM=ceil(N/3)
(if N<3, then M=1;ceil
means rounding up) at most. If the Pod's state becomesPending
because the high availability scheduling policy is not satisfied, you need to add more nodes in the cluster.
A Pod in the CrashLoopBackOff
state means that the container in the Pod repeatedly aborts, in the loop of abort - restart by kubelet
- abort. There are many potential causes of CrashLoopBackOff
. In this case, the most effective way to locate it is to view the log of the Pod container:
{{< copyable "shell-regular" >}}
kubectl -n ${namespace} logs -f ${pod_name}
If the log fails to help diagnose the problem, you can add the -p
parameter to output the log information when the container was last started:
{{< copyable "shell-regular" >}}
kubectl -n ${namespace} logs -p ${pod_name}
After checking the error messages in the log, you can refer to Cannot start tidb-server
, Cannot start tikv-server
, and Cannot start pd-server
for further troubleshooting.
When the "cluster id mismatch" message appears in the TiKV Pod log, it means that the TiKV Pod might have used old data from other or previous TiKV Pod. If the data on the local disk remain uncleared when you configure local storage in the cluster, or the data is not recycled by the local volume provisioner due to a forced deletion of PV, an error might occur.
If you are confirmed that the TiKV should join the cluster as a new node and that the data on the PV should be deleted, you can delete the TiKV Pod and the corresponding PVC. The TiKV Pod automatically rebuilds and binds the new PV for use. When configuring local storage, delete local storage on the machine to avoid Kubernetes using old data. In cluster operation and maintenance, manage PV using the local volume provisioner and do not delete it forcibly. You can manage the lifecycle of PV by creating, deleting PVCs, and setting reclaimPolicy
for the PV.
In addition, TiKV might also fail to start when ulimit
is insufficient. In this case, you can modify the /etc/security/limits.conf
file of the Kubernetes node to increase the ulimit
:
root soft nofile 1000000
root hard nofile 1000000
root soft core unlimited
root soft stack 10240
If you cannot confirm the cause from the log and ulimit
is also a normal value, troubleshoot it further by using the diagnostic mode.
If you cannot access the TiDB service, first check whether the TiDB service is deployed successfully using the following method:
-
Check whether all components of the cluster are up and the status of each component is
Running
.{{< copyable "shell-regular" >}}
kubectl get po -n ${namespace}
-
Check the log of TiDB components to see whether errors are reported.
{{< copyable "shell-regular" >}}
kubectl logs -f ${pod_name} -n ${namespace} -c tidb
If the cluster is successfully deployed, check the network using the following steps:
-
If you cannot access the TiDB service using
NodePort
, try to access the TiDB service using the service domain orclusterIP
on the node. If theserviceName
orclusterIP
works, the network within the Kubernetes cluster is normal. Then the possible issues are as follows:- Network failure exists between the client and the node.
- Check whether the
externalTrafficPolicy
attribute of the TiDB service isLocal
. If it isLocal
, you must access the client using the IP of the node where the TiDB Pod is located.
-
If you still cannot access the TiDB service using the service domain or
clusterIP
, connect using<PodIP>:4000
on the TiDB service backend. If thePodIP
works, you can confirm that the problem is in the connection between the service domain andPodIP
or betweenclusterIP
andPodIP
. Check the following items:-
Check whether the DNS service works well.
{{< copyable "shell-regular" >}}
kubectl get po -n kube-system -l k8s-app=kube-dns dig ${tidb_service_domain}
-
Check whether
kube-proxy
on each node is working.{{< copyable "shell-regular" >}}
kubectl get po -n kube-system -l k8s-app=kube-proxy
-
Check whether the TiDB service rule is correct in the
iptables
rules.{{< copyable "shell-regular" >}}
iptables-save -t nat |grep ${clusterIP}
-
Check whether the corresponding endpoint is correct.
-
-
If you cannot access the TiDB service even using
PodIP
, the problem is on the Pod level network. Check the following items:- Check whether the relevant route rules on the node are correct.
- Check whether the network plugin service works well.
- Refer to network connection failure between Pods section.
Normally, when a TiKV Pod is in a healthy state (Running
), the corresponding TiKV store is also in a healthy state (UP
). However, concurrent scale-in or scale-out on TiKV components might cause part of TiKV stores to fall into the Tombstone
state abnormally. When this happens, try the following steps to fix it:
-
View the state of the TiKV store:
{{< copyable "shell-regular" >}}
kubectl get -n ${namespace} tidbcluster ${release_name} -ojson | jq '.status.tikv.stores'
-
View the state of the TiKV Pod:
{{< copyable "shell-regular" >}}
kubectl get -n ${namespace} po -l app.kubernetes.io/component=tikv
-
Compare the state of the TiKV store with that of the Pod. If the store corresponding to a TiKV Pod is in the
Offline
state, it means the store is being taken offline abnormally. You can use the following commands to cancel the offline process and perform recovery operations:-
Open the connection to the PD service:
{{< copyable "shell-regular" >}}
kubectl port-forward -n ${namespace} svc/${cluster_name}-pd ${local_port}:2379 &>/tmp/portforward-pd.log &
-
Bring online the corresponding store:
{{< copyable "shell-regular" >}}
curl -X POST http://127.0.0.1:2379/pd/api/v1/store/${store_id}/state?state=Up
-
-
If the TiKV store with the latest
lastHeartbeatTime
that corresponds to a Pod is in aTombstone
state, it means that the offline process is completed. At this time, you need to re-create the Pod and bind it with a new PV to perform recovery by taking the following steps:-
Set the
reclaimPolicy
value of the PV corresponding to the store toDelete
:{{< copyable "shell-regular" >}}
kubectl patch $(kubectl get pv -l app.kubernetes.io/instance=${release_name},tidb.pingcap.com/store-id=${store_id} -o name) -p '{"spec":{"persistentVolumeReclaimPolicy":"Delete"}}
-
Remove the PVC used by the Pod:
{{< copyable "shell-regular" >}}
kubectl delete -n ${namespace} pvc tikv-${pod_name} --wait=false
-
Remove the Pod, and wait for it to be re-created:
{{< copyable "shell-regular" >}}
kubectl delete -n ${namespace} pod ${pod_name}
After the Pod is re-created, a new store is registered in the TiKV cluster. Then the recovery is completed.
-
Load balancers often set the idle connection timeout. If no data is sent over a connection for a specific period of time, load balancer closes the connection.
If a long query is interrupted when you use TiDB, check the middleware program between the client and the TiDB server.
If the idle timeout is not long enough for your query, try to set the timeout to a larger value. If you cannot reset it, enable the tcp-keep-alive
option in TiDB.
In Linux, the keepalive probe packet is sent every 7,200 seconds by default. To shorten the interval, configure sysctls
via the podSecurityContext
field.
-
If
--allowed-unsafe-sysctls=net.*
can be configured for kubelet in the Kubernetes cluster, configure this parameter for kubelet and configure TiDB in the following way:{{< copyable "" >}}
tidb: ... podSecurityContext: sysctls: - name: net.ipv4.tcp_keepalive_time value: "300"
-
If
--allowed-unsafe-sysctls=net.*
cannot be configured for kubelet, configure TiDB in the following way:{{< copyable "" >}}
tidb: annotations: tidb.pingcap.com/sysctl-init: "true" podSecurityContext: sysctls: - name: net.ipv4.tcp_keepalive_time value: "300" ...
Note:
The configuration above requires TiDB Operator 1.1 or later version.