├── Kubernetes Master Class - Kubernetes Troubleshooting.pptx ├── LICENSE ├── README.md ├── etcd-endpoints ├── grab_kubeconfig.sh ├── kube-apiserver-check-etcd ├── kube-apiserver-responsiveness ├── kube-controller-manager ├── kube-scheduler ├── kubelet-stats └── metrics-server └── sanity-check.sh /Kubernetes Master Class - Kubernetes Troubleshooting.pptx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/mattmattox/k8s-troubleshooting/8cec95b8f318cb70a677d6b3e19c1dbe374296e5/Kubernetes Master Class - Kubernetes Troubleshooting.pptx -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Apache License 2 | Version 2.0, January 2004 3 | http://www.apache.org/licenses/ 4 | 5 | TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION 6 | 7 | 1. Definitions. 8 | 9 | "License" shall mean the terms and conditions for use, reproduction, 10 | and distribution as defined by Sections 1 through 9 of this document. 11 | 12 | "Licensor" shall mean the copyright owner or entity authorized by 13 | the copyright owner that is granting the License. 14 | 15 | "Legal Entity" shall mean the union of the acting entity and all 16 | other entities that control, are controlled by, or are under common 17 | control with that entity. For the purposes of this definition, 18 | "control" means (i) the power, direct or indirect, to cause the 19 | direction or management of such entity, whether by contract or 20 | otherwise, or (ii) ownership of fifty percent (50%) or more of the 21 | outstanding shares, or (iii) beneficial ownership of such entity. 22 | 23 | "You" (or "Your") shall mean an individual or Legal Entity 24 | exercising permissions granted by this License. 25 | 26 | "Source" form shall mean the preferred form for making modifications, 27 | including but not limited to software source code, documentation 28 | source, and configuration files. 29 | 30 | "Object" form shall mean any form resulting from mechanical 31 | transformation or translation of a Source form, including but 32 | not limited to compiled object code, generated documentation, 33 | and conversions to other media types. 34 | 35 | "Work" shall mean the work of authorship, whether in Source or 36 | Object form, made available under the License, as indicated by a 37 | copyright notice that is included in or attached to the work 38 | (an example is provided in the Appendix below). 39 | 40 | "Derivative Works" shall mean any work, whether in Source or Object 41 | form, that is based on (or derived from) the Work and for which the 42 | editorial revisions, annotations, elaborations, or other modifications 43 | represent, as a whole, an original work of authorship. For the purposes 44 | of this License, Derivative Works shall not include works that remain 45 | separable from, or merely link (or bind by name) to the interfaces of, 46 | the Work and Derivative Works thereof. 47 | 48 | "Contribution" shall mean any work of authorship, including 49 | the original version of the Work and any modifications or additions 50 | to that Work or Derivative Works thereof, that is intentionally 51 | submitted to Licensor for inclusion in the Work by the copyright owner 52 | or by an individual or Legal Entity authorized to submit on behalf of 53 | the copyright owner. For the purposes of this definition, "submitted" 54 | means any form of electronic, verbal, or written communication sent 55 | to the Licensor or its representatives, including but not limited to 56 | communication on electronic mailing lists, source code control systems, 57 | and issue tracking systems that are managed by, or on behalf of, the 58 | Licensor for the purpose of discussing and improving the Work, but 59 | excluding communication that is conspicuously marked or otherwise 60 | designated in writing by the copyright owner as "Not a Contribution." 61 | 62 | "Contributor" shall mean Licensor and any individual or Legal Entity 63 | on behalf of whom a Contribution has been received by Licensor and 64 | subsequently incorporated within the Work. 65 | 66 | 2. Grant of Copyright License. Subject to the terms and conditions of 67 | this License, each Contributor hereby grants to You a perpetual, 68 | worldwide, non-exclusive, no-charge, royalty-free, irrevocable 69 | copyright license to reproduce, prepare Derivative Works of, 70 | publicly display, publicly perform, sublicense, and distribute the 71 | Work and such Derivative Works in Source or Object form. 72 | 73 | 3. Grant of Patent License. Subject to the terms and conditions of 74 | this License, each Contributor hereby grants to You a perpetual, 75 | worldwide, non-exclusive, no-charge, royalty-free, irrevocable 76 | (except as stated in this section) patent license to make, have made, 77 | use, offer to sell, sell, import, and otherwise transfer the Work, 78 | where such license applies only to those patent claims licensable 79 | by such Contributor that are necessarily infringed by their 80 | Contribution(s) alone or by combination of their Contribution(s) 81 | with the Work to which such Contribution(s) was submitted. If You 82 | institute patent litigation against any entity (including a 83 | cross-claim or counterclaim in a lawsuit) alleging that the Work 84 | or a Contribution incorporated within the Work constitutes direct 85 | or contributory patent infringement, then any patent licenses 86 | granted to You under this License for that Work shall terminate 87 | as of the date such litigation is filed. 88 | 89 | 4. Redistribution. You may reproduce and distribute copies of the 90 | Work or Derivative Works thereof in any medium, with or without 91 | modifications, and in Source or Object form, provided that You 92 | meet the following conditions: 93 | 94 | (a) You must give any other recipients of the Work or 95 | Derivative Works a copy of this License; and 96 | 97 | (b) You must cause any modified files to carry prominent notices 98 | stating that You changed the files; and 99 | 100 | (c) You must retain, in the Source form of any Derivative Works 101 | that You distribute, all copyright, patent, trademark, and 102 | attribution notices from the Source form of the Work, 103 | excluding those notices that do not pertain to any part of 104 | the Derivative Works; and 105 | 106 | (d) If the Work includes a "NOTICE" text file as part of its 107 | distribution, then any Derivative Works that You distribute must 108 | include a readable copy of the attribution notices contained 109 | within such NOTICE file, excluding those notices that do not 110 | pertain to any part of the Derivative Works, in at least one 111 | of the following places: within a NOTICE text file distributed 112 | as part of the Derivative Works; within the Source form or 113 | documentation, if provided along with the Derivative Works; or, 114 | within a display generated by the Derivative Works, if and 115 | wherever such third-party notices normally appear. The contents 116 | of the NOTICE file are for informational purposes only and 117 | do not modify the License. You may add Your own attribution 118 | notices within Derivative Works that You distribute, alongside 119 | or as an addendum to the NOTICE text from the Work, provided 120 | that such additional attribution notices cannot be construed 121 | as modifying the License. 122 | 123 | You may add Your own copyright statement to Your modifications and 124 | may provide additional or different license terms and conditions 125 | for use, reproduction, or distribution of Your modifications, or 126 | for any such Derivative Works as a whole, provided Your use, 127 | reproduction, and distribution of the Work otherwise complies with 128 | the conditions stated in this License. 129 | 130 | 5. Submission of Contributions. Unless You explicitly state otherwise, 131 | any Contribution intentionally submitted for inclusion in the Work 132 | by You to the Licensor shall be under the terms and conditions of 133 | this License, without any additional terms or conditions. 134 | Notwithstanding the above, nothing herein shall supersede or modify 135 | the terms of any separate license agreement you may have executed 136 | with Licensor regarding such Contributions. 137 | 138 | 6. Trademarks. This License does not grant permission to use the trade 139 | names, trademarks, service marks, or product names of the Licensor, 140 | except as required for reasonable and customary use in describing the 141 | origin of the Work and reproducing the content of the NOTICE file. 142 | 143 | 7. Disclaimer of Warranty. Unless required by applicable law or 144 | agreed to in writing, Licensor provides the Work (and each 145 | Contributor provides its Contributions) on an "AS IS" BASIS, 146 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or 147 | implied, including, without limitation, any warranties or conditions 148 | of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A 149 | PARTICULAR PURPOSE. You are solely responsible for determining the 150 | appropriateness of using or redistributing the Work and assume any 151 | risks associated with Your exercise of permissions under this License. 152 | 153 | 8. Limitation of Liability. In no event and under no legal theory, 154 | whether in tort (including negligence), contract, or otherwise, 155 | unless required by applicable law (such as deliberate and grossly 156 | negligent acts) or agreed to in writing, shall any Contributor be 157 | liable to You for damages, including any direct, indirect, special, 158 | incidental, or consequential damages of any character arising as a 159 | result of this License or out of the use or inability to use the 160 | Work (including but not limited to damages for loss of goodwill, 161 | work stoppage, computer failure or malfunction, or any and all 162 | other commercial damages or losses), even if such Contributor 163 | has been advised of the possibility of such damages. 164 | 165 | 9. Accepting Warranty or Additional Liability. While redistributing 166 | the Work or Derivative Works thereof, You may choose to offer, 167 | and charge a fee for, acceptance of support, warranty, indemnity, 168 | or other liability obligations and/or rights consistent with this 169 | License. However, in accepting such obligations, You may act only 170 | on Your own behalf and on Your sole responsibility, not on behalf 171 | of any other Contributor, and only if You agree to indemnify, 172 | defend, and hold each Contributor harmless for any liability 173 | incurred by, or claims asserted against, such Contributor by reason 174 | of your accepting any such warranty or additional liability. 175 | 176 | END OF TERMS AND CONDITIONS 177 | 178 | APPENDIX: How to apply the Apache License to your work. 179 | 180 | To apply the Apache License to your work, attach the following 181 | boilerplate notice, with the fields enclosed by brackets "[]" 182 | replaced with your own identifying information. (Don't include 183 | the brackets!) The text should be enclosed in the appropriate 184 | comment syntax for the file format. We also recommend that a 185 | file or class name and description of purpose be included on the 186 | same "printed page" as the copyright notice for easier 187 | identification within third-party archives. 188 | 189 | Copyright [yyyy] [name of copyright owner] 190 | 191 | Licensed under the Apache License, Version 2.0 (the "License"); 192 | you may not use this file except in compliance with the License. 193 | You may obtain a copy of the License at 194 | 195 | http://www.apache.org/licenses/LICENSE-2.0 196 | 197 | Unless required by applicable law or agreed to in writing, software 198 | distributed under the License is distributed on an "AS IS" BASIS, 199 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 200 | See the License for the specific language governing permissions and 201 | limitations under the License. 202 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # k8s Troubleshooting 2 | 3 | ## kube-scheduler 4 | 5 | ### Finding the current leader 6 | Command(s): `curl https://raw.githubusercontent.com/mattmattox/k8s-troubleshooting/master/kube-scheduler | bash` 7 | 8 | **Example Output of a healthy cluster** 9 | ```bash 10 | kube-scheduler is the leader on node a1ubk8slabl03 11 | ``` 12 | 13 | ## etcd-troubleshooting 14 | 15 | ### Check etcd members 16 | Command(s): `docker exec etcd etcdctl member list` 17 | 18 | **Example Output of a healthy cluster** 19 | ```bash 20 | 2f080bc6ec98f39b, started, etcd-a1ubrkeat03, https://172.27.5.33:2380, https://172.27.5.33:2379,https://172.27.5.33:4001, false 21 | 9d7204f89b221ba3, started, etcd-a1ubrkeat01, https://172.27.5.31:2380, https://172.27.5.31:2379,https://172.27.5.31:4001, false 22 | bd37bc0dc2e990b6, started, etcd-a1ubrkeat02, https://172.27.5.32:2380, https://172.27.5.32:2379,https://172.27.5.32:4001, false 23 | ``` 24 | 25 | ### Check etcd endpoints 26 | Command(s): `curl https://raw.githubusercontent.com/mattmattox/etcd-troubleshooting/master/etcd-endpoints | bash ` 27 | 28 | **Example Output of a healthy cluster** 29 | ```bash 30 | Validating connection to https://172.27.5.33:2379/health 31 | {"health":"true"} 32 | Validating connection to https://172.27.5.31:2379/health 33 | {"health":"true"} 34 | Validating connection to https://172.27.5.32:2379/health 35 | {"health":"true"} 36 | ``` 37 | 38 | ### Common errors 39 | 40 | `health check for peer xxx could not connect: dial tcp IP:2380: getsockopt: connection refused` 41 | 42 | A connection to the address shown on port 2380 cannot be established. Check if the etcd container is running on the host with the address shown. 43 | 44 | 45 | `xxx is starting a new election at term x` 46 | 47 | The etcd cluster has lost it’s quorum and is trying to establish a new leader. This can happen when the majority of the nodes running etcd go down/unreachable. 48 | 49 | 50 | `connection error: desc = "transport: Error while dialing dial tcp 0.0.0.0:2379: i/o timeout"; Reconnecting to {0.0.0.0:2379 0 }` 51 | 52 | The host firewall is preventing network communication. 53 | 54 | 55 | `rafthttp: request cluster ID mismatch` 56 | 57 | The node with the etcd instance logging rafthttp: request cluster ID mismatch is trying to join a cluster that has already been formed with another peer. The node should be removed from the cluster, and re-added. 58 | 59 | 60 | `rafthttp: failed to find member` 61 | 62 | The cluster state (/var/lib/etcd) contains wrong information to join the cluster. The node should be removed from the cluster, the state directory should be cleaned and the node should be re-added. 63 | 64 | ### Enabling debug logging 65 | `curl -XPUT -d '{"Level":"DEBUG"}' --cacert $(docker exec etcd printenv ETCDCTL_CACERT) --cert $(docker exec etcd printenv ETCDCTL_CERT) --key $(docker exec etcd printenv ETCDCTL_KEY) https://localhost:2379/config/local/log` 66 | 67 | ### Disabling debug logging 68 | `curl -XPUT -d '{"Level":"INFO"}' --cacert $(docker exec etcd printenv ETCDCTL_CACERT) --cert $(docker exec etcd printenv ETCDCTL_CERT) --key $(docker exec etcd printenv ETCDCTL_KEY) https://localhost:2379/config/local/log` 69 | 70 | ### Getting etcd metrics 71 | `curl -X GET --cacert $(docker exec etcd printenv ETCDCTL_CACERT) --cert $(docker exec etcd printenv ETCDCTL_CERT) --key $(docker exec etcd printenv ETCDCTL_KEY) https://localhost:2379/metrics` 72 | 73 | 74 | **wal_fsync_duration_seconds (99% under 10 ms)** 75 | 76 | A wal_fsync is called when etcd persists its log entries to disk before applying them. 77 | 78 | 79 | **backend_commit_duration_seconds (99% under 25 ms)** 80 | 81 | A backend_commit is called when etcd commits an incremental snapshot of its most recent changes to disk. 82 | 83 | # kube-apiserver troubleshooting 84 | 85 | Run the following script on each controlplane node 86 | 87 | `https://raw.githubusercontent.com/mattmattox/k8s-troubleshooting/master/kube-apiserver-check-etcd` 88 | 89 | ## kubelet troubleshooting 90 | 91 | **Check kubelet logging** 92 | 93 | As this is the node agent, it will contain the most information regarding operations that it is executing based on scheduling requests 94 | 95 | 96 | **Check kubelet stats** 97 | 98 | `https://raw.githubusercontent.com/mattmattox/k8s-troubleshooting/master/kubelet-stats` 99 | -------------------------------------------------------------------------------- /etcd-endpoints: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | for endpoint in $(docker exec etcd /bin/sh -c "etcdctl member list | cut -d, -f5"); 3 | do 4 | echo "Validating connection to ${endpoint}/health"; 5 | docker run --net=host -v $(docker inspect kubelet --format '{{ range .Mounts }}{{ if eq .Destination "/etc/kubernetes" }}{{ .Source }}{{ end }}{{ end }}')/ssl:/etc/kubernetes/ssl:ro appropriate/curl -s -w "\n" --cacert $(docker exec etcd printenv ETCDCTL_CACERT) --cert $(docker exec etcd printenv ETCDCTL_CERT) --key $(docker exec etcd printenv ETCDCTL_KEY) "${endpoint}/health"; 6 | done 7 | -------------------------------------------------------------------------------- /grab_kubeconfig.sh: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | echo "Building cluster_recovery.yml..." 3 | echo "Working on Nodes..." 4 | echo 'nodes:' > cluster_recovery.yml 5 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r .desiredState.rkeConfig.nodes | yq r - | sed 's/^/ /' | \ 6 | sed -e 's/internalAddress/internal_address/g' | \ 7 | sed -e 's/hostnameOverride/hostname_override/g' | \ 8 | sed -e 's/sshKeyPath/ssh_key_path/g' >> cluster_recovery.yml 9 | echo "" >> cluster_recovery.yml 10 | 11 | echo "Working on services..." 12 | echo 'services:' >> cluster_recovery.yml 13 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r .desiredState.rkeConfig.services | yq r - | sed 's/^/ /' >> cluster_recovery.yml 14 | echo "" >> cluster_recovery.yml 15 | 16 | echo "Working on network..." 17 | echo 'network:' >> cluster_recovery.yml 18 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r .desiredState.rkeConfig.network | yq r - | sed 's/^/ /' >> cluster_recovery.yml 19 | echo "" >> cluster_recovery.yml 20 | 21 | echo "Working on authentication..." 22 | echo 'authentication:' >> cluster_recovery.yml 23 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r .desiredState.rkeConfig.authentication | yq r - | sed 's/^/ /' >> cluster_recovery.yml 24 | echo "" >> cluster_recovery.yml 25 | 26 | echo "Working on systemImages..." 27 | echo 'system_images:' >> cluster_recovery.yml 28 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r .desiredState.rkeConfig.systemImages | yq r - | sed 's/^/ /' >> cluster_recovery.yml 29 | echo "" >> cluster_recovery.yml 30 | 31 | echo "Building cluster_recovery.rkestate..." 32 | kubectl --kubeconfig kube_config_cluster.yml -n kube-system get configmap full-cluster-state -o json | jq -r .data.\"full-cluster-state\" | jq -r . > cluster_recovery.rkestate 33 | 34 | echo "Running rke up..." 35 | rke up --config cluster_recovery.yml 36 | -------------------------------------------------------------------------------- /kube-apiserver-check-etcd: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | 3 | for i in $(docker exec -it kube-apiserver sh -c 'ps aux | grep kube-apiserver' | awk -F'--etcd-servers' '{print $2}' | awk -F ' ' '{print $1}' | tr -d '=' | tr , "\n" | sed '/^$/d' | awk -F '/' '{print $3}' | awk -F ':' '{print $1}') 4 | do 5 | echo -n "Checking $i " 6 | curl -k -X GET --cacert /etc/kubernetes/ssl/kube-ca.pem --cert /etc/kubernetes/ssl/kube-node.pem --key /etc/kubernetes/ssl/kube-node-key.pem https://"$i":2379/health 7 | echo "" 8 | done 9 | -------------------------------------------------------------------------------- /kube-apiserver-responsiveness: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | for cip in $(kubectl get nodes -l "node-role.kubernetes.io/controlplane=true" -o jsonpath='{range.items[*].status.addresses[?(@.type=="InternalIP")]}{.address}{"\n"}{end}'); 3 | do 4 | kubectl --server https://${cip}:6443 get nodes -v6 2>&1| grep round_trippers; 5 | done 6 | -------------------------------------------------------------------------------- /kube-controller-manager: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | NODE="$(kubectl -n kube-system get endpoints kube-controller-manager -o jsonpath='{.metadata.annotations.control-plane\.alpha\.kubernetes\.io/leader}' | jq . 2>/dev/null | grep 'holderIdentity' | awk '{print $2}' | tr -d ",\"" | awk -F '_' '{print $1}')" 3 | echo "kube-controller-manager is the leader on node $NODE" 4 | -------------------------------------------------------------------------------- /kube-scheduler: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | NODE="$(kubectl -n kube-system get endpoints kube-scheduler -o jsonpath='{.metadata.annotations.control-plane\.alpha\.kubernetes\.io/leader}' | jq . 2>/dev/null | grep 'holderIdentity' | awk '{print $2}' | tr -d ",\"" | awk -F '_' '{print $1}')" 3 | echo "kube-scheduler is the leader on node $NODE" 4 | -------------------------------------------------------------------------------- /kubelet-stats: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | curl -sLk --cacert /etc/kubernetes/ssl/kube-ca.pem --cert /etc/kubernetes/ssl/kube-service-account-token.pem --key /etc/kubernetes/ssl/kube-service-account-token-key.pem https://127.0.0.1:1020/stats 3 | -------------------------------------------------------------------------------- /metrics-server/sanity-check.sh: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | 3 | MKTEMP_BASEDIR="" 4 | 5 | while getopts ":d:k:" opt; do 6 | case $opt in 7 | d) 8 | MKTEMP_BASEDIR="-p ${OPTARG}" 9 | ;; 10 | k) 11 | KUBECONFIG="-k ${OPTARG}" 12 | ;; 13 | \?) 14 | echo "Invalid option: -$OPTARG" >&2 15 | exit 1 16 | ;; 17 | :) 18 | echo "Option -$OPTARG requires an argument." >&2 19 | exit 1 20 | ;; 21 | esac 22 | done 23 | 24 | # Create temp directory 25 | TMPDIR=$(mktemp -d $MKTEMP_BASEDIR) 26 | 27 | echo "Collecting Cluster level info..." 28 | mkdir -p $TMPDIR/cluster 29 | kubectl cluster-info > $TMPDIR/cluster/cluster-info 2>&1 30 | kubectl get nodes -o wide > $TMPDIR/cluster/nodes-wide 2>&1 31 | 32 | echo "Collecting k8s components..." 33 | mkdir -p $TMPDIR/k8s-components 34 | 35 | echo "Working on metrics-server..." 36 | mkdir -p $TMPDIR/k8s-components/metrics-server 37 | kubectl -n kube-system get pods -o wide -l k8s-app=metrics-server > $TMPDIR/k8s-components/metrics-server/pods-wide 2>&1 38 | mkdir -p $TMPDIR/k8s-components/metrics-server/describe/pod 39 | for pod in $(kubectl -n kube-system get pods -o NAME -l k8s-app=metrics-server); 40 | do 41 | kubectl -n kube-system describe $pod > $TMPDIR/k8s-components/metrics-server/describe/$pod 2>&1 42 | done 43 | mkdir -p $TMPDIR/k8s-components/metrics-server/logs/pod 44 | for pod in $(kubectl -n kube-system get pods -o NAME -l k8s-app=metrics-server); 45 | do 46 | kubectl -n kube-system logs $pod > $TMPDIR/k8s-components/metrics-server/logs/$pod 2>&1 47 | done 48 | kubectl get endpoints -n kube-system metrics-server -o wide > $TMPDIR/k8s-components/metrics-server/endpoints-wide 2>&1 49 | kubectl describe endpoints -n kube-system metrics-server > $TMPDIR/k8s-components/metrics-server/endpoints-describe 2>&1 50 | 51 | echo "Checking metrics health..." 52 | kubectl -n cattle-system exec -it `kubectl -n cattle-system get pods -o NAME -l app=cattle-agent | head -n1` -- curl -k -X GET --cacert /etc/kubernetes/ssl/kube-ca.pem --cert /etc/kubernetes/ssl/kube-node.pem --key /etc/kubernetes/ssl/kube-node-key.pem https://`kubectl -n kube-system describe endpoints metrics-server | grep Addresses: | grep -v NotReadyAddresses: | awk '{print $2}'`:443/healthz > $TMPDIR/k8s-components/metrics-server/healthz 2>&1 53 | echo "" >> $TMPDIR/k8s-components/metrics-server/healthz 2>&1 54 | echo "Checking metrics responce..." 55 | TOKEN="$(kubectl -n cattle-system exec -it `kubectl -n cattle-system get pods -o NAME -l app=cattle-agent | head -n1` -- cat /run/secrets/kubernetes.io/serviceaccount/token)" 56 | kubectl -n cattle-system exec -it `kubectl -n cattle-system get pods -o NAME -l app=cattle-agent | head -n1` -- curl -k -H "Authorization: Bearer $TOKEN" https://`kubectl -n kube-system describe endpoints metrics-server | grep Addresses: | grep -v NotReadyAddresses: | awk '{print $2}'`:443/metrics > $TMPDIR/k8s-components/metrics-server/responce 2>&1 57 | 58 | FILEDIR=$(dirname $TMPDIR) 59 | FILENAME="$(hostname)-$(date +'%Y-%m-%d_%H_%M_%S').tar" 60 | tar cf $FILEDIR/$FILENAME -C ${TMPDIR}/ . 61 | 62 | if $(command -v gzip >/dev/null 2>&1); then 63 | echo "Compressing archive to ${FILEDIR}/${FILENAME}.gz" 64 | gzip ${FILEDIR}/${FILENAME} 65 | FILENAME="${FILENAME}.gz" 66 | fi 67 | 68 | echo "Created ${FILEDIR}/${FILENAME}" 69 | echo "You can now remove ${TMPDIR}" 70 | --------------------------------------------------------------------------------