5 Tips for Troubleshooting Kubernetes Deployments

As DevOps teams embrace containerization to support the speed and flexibility needed for CI/CD (continuous integration/continuous delivery) implementations, Kubernetes has quickly become the most popular container orchestration platform. In Kubernetes, individual containers are packaged together as pods, which you can automatically deploy, scale, and manage across entire clusters.

When we hear the term “automatic,” we tend to think “easy,” but Kubernetes disproves this notion. There are numerous errors you could run into while deploying a Kubernetes pod that could cause it to fail. Let’s discuss 5 tips for troubleshooting Kubernetes deployments.

Troubleshooting Kubernetes Deployments

So, you’ve deployed a Kubernetes pod, but how do you know if it’s up and running? The first step for troubleshooting Kubernetes deployments is to proactively verify deployed pods are in a ‘ready’ state. To do this, run the command:

kubectl get pods

This command lists every pod in the current namespace, whether or not it is running. The output tells you how many of the pod’s containers are ready, its current status, how many times it has restarted, and its age (the time since the pod was created). If your pod isn’t running, the STATUS column will usually show the reason, such as one of the errors covered below.
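When a namespace holds many pods, it can help to narrow the listing down to the ones that need attention. The sketch below is one way to do that, assuming the default column layout of kubectl get pods (NAME, READY, STATUS, and so on). The helper name not_ready is made up for this example and is not part of kubectl.

```shell
# not_ready: hypothetical helper, not part of kubectl.
# Reads `kubectl get pods --no-headers` output on stdin and prints the
# NAME, READY, and STATUS columns of every pod that is not fully ready.
# Pods in Completed status are skipped because they finished normally.
#
# Usage: kubectl get pods --no-headers | not_ready
not_ready() {
  awk '{
    split($2, ready, "/")   # READY looks like "1/2"
    if ($3 != "Completed" && (ready[1] != ready[2] || $3 != "Running"))
      print $1, $2, $3
  }'
}
```

An empty result suggests every pod is either running with all containers ready or has completed.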

If you need to troubleshoot a specific pod, you should run:

kubectl get pod [podname]

However, keep in mind that if your pod is part of a Kubernetes deployment, its name will have the replica set and pod IDs appended. You can learn this information from the “get pods” command or, if you only want a list of the replica sets, run:

kubectl get rs

Here are five of the most common Kubernetes pod deployment error messages, along with tips for troubleshooting and fixing the underlying issues that cause them.

Kubernetes Deployment Error #1: CreateContainerConfigError

If the status of your pod is CreateContainerConfigError, that usually means Kubernetes couldn’t find the Secret or ConfigMap. A Secret is a Kubernetes object that stores sensitive information the pod needs to authenticate, like database credentials. A ConfigMap is pretty much what it sounds like—a “map” containing the pod’s configuration information. If your new pod can’t mount the Secrets or ConfigMaps specified, it won’t be able to deploy.

To find out what’s missing, run the command:

kubectl describe pod [name]

Where [name] is the name of your pod without the brackets.

If a Secret or ConfigMap is missing, you’ll receive an error that looks like this:

Error: configmap "configmap-1" not found

Error: secret "secret-1" not found

This tells you which ConfigMap or Secret you need to find or replace. To see if the missing Secret or ConfigMap exists in your cluster, you can run a version of this command (substituting the correct info as needed):

kubectl get configmap configmap-1

If the ConfigMap doesn’t exist, kubectl returns a NotFound error instead of listing it. In that case, you either need to create it or correct the name of the ConfigMap that your pod requests during deployment. The same check works for a Secret if you use kubectl get secret instead.
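If you need to check several references, the lookup can be wrapped in a small function. This is only a sketch: check_ref is a hypothetical name, and the namespace falls back to “default” when you don’t pass one, which may not match your cluster.

```shell
# check_ref: hypothetical helper that reports whether a ConfigMap or
# Secret exists. It relies on `kubectl get` exiting with a non-zero
# status when the object is not found.
#
# Usage: check_ref configmap configmap-1 [namespace]
#        check_ref secret secret-1 [namespace]
check_ref() {
  kind="$1"
  name="$2"
  ns="${3:-default}"   # assumption: fall back to the "default" namespace
  if kubectl get "$kind" "$name" -n "$ns" >/dev/null 2>&1; then
    echo "$kind/$name exists in namespace $ns"
  else
    echo "$kind/$name is missing from namespace $ns"
    return 1
  fi
}
```

A non-zero return value makes the function easy to use in a script that should stop before deploying when a reference is missing.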

Bonus tip: make sure you double-check the name and spelling of the missing ConfigMap or Secret in your error message. Sometimes the CreateContainerConfigError is caused by a simple typo in the name!

Kubernetes Deployment Error #2: CrashLoopBackOff

Another common Kubernetes deployment error is CrashLoopBackOff. It means a container in the pod starts, crashes, and is restarted by Kubernetes over and over, with a growing delay (the “back-off”) between attempts. This error could, unfortunately, mean quite a few things, including an issue with mounting the volume or insufficient resources on the node. (A pod that can’t be scheduled at all because the cluster lacks resources stays in the Pending state instead.) If you get the CrashLoopBackOff error, you’ll need to troubleshoot your Kubernetes deployment by running through a list of common causes and eliminating them one by one.

To narrow down the possible causes of your CrashLoopBackOff error, you can look in the pod details for clues. Run the command:

kubectl describe pod [name]

If you see “Liveness probe failed” and “Back-off restarting failed container” messages, the application didn’t answer its liveness probe in time, so Kubernetes restarted the container. One possible reason is that the container doesn’t have enough CPU or memory to respond. While you may be tempted to fix this problem by raising “periodSeconds” or “timeoutSeconds” to give your application a longer window of time to respond before it is restarted, doing so could mask legitimate performance issues. You should check the configuration of the liveness probe to see what triggered the failure and whether the probe is properly configured for your use case.
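To review the probe settings without opening the manifest, you can ask kubectl to print them from the live pod. The function below is a sketch that uses kubectl’s jsonpath output. The name show_liveness_probe and the “default” namespace fallback are assumptions made for this example.

```shell
# show_liveness_probe: hypothetical helper that prints each container's
# name followed by its livenessProbe settings (periodSeconds,
# timeoutSeconds, and so on). A container without a probe prints an
# empty value after its name.
#
# Usage: show_liveness_probe <pod> [namespace]
show_liveness_probe() {
  kubectl get pod "$1" -n "${2:-default}" \
    -o jsonpath='{range .spec.containers[*]}{.name}{": "}{.livenessProbe}{"\n"}{end}'
}
```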

If you don’t get the “Liveness probe failed” output from your pod details, then you’ll need to keep digging. The pod details may contain more clues: under the “Last State: Terminated” section, check the Reason, Message, and Exit Code to see if there’s any helpful information. For example, if the Reason is “OOMKilled,” then the container was killed for using more memory than it was allowed, either because it exceeded its own memory limit or because the node ran out of memory. To resolve this, you would have to either specify or adjust the memory requests and limits for this pod.
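The Exit Code can be read at a glance once you know the usual convention: 0 is a clean exit, values above 128 mean the process was killed by a signal (the code minus 128), and anything else is an error reported by the application itself. The following sketch applies that convention. Both function names are hypothetical, and last_exit_code assumes you care about the first container in the pod.

```shell
# explain_exit_code: hypothetical helper that gives a first reading of a
# container exit code taken from "Last State: Terminated".
explain_exit_code() {
  code="$1"
  if [ "$code" -eq 0 ]; then
    echo "exited cleanly"
  elif [ "$code" -gt 128 ]; then
    # Codes above 128 mean "killed by signal (code - 128)".
    # 137 is signal 9 (SIGKILL), which is what an OOM kill looks like.
    echo "killed by signal $((code - 128))"
  else
    echo "application error (exit code $code)"
  fi
}

# last_exit_code: prints the exit code of the previous run of the
# first container in the pod (assumption: index 0 is the one you want).
#
# Usage: last_exit_code <pod> [namespace]
last_exit_code() {
  kubectl get pod "$1" -n "${2:-default}" \
    -o jsonpath='{.status.containerStatuses[0].lastState.terminated.exitCode}'
}
```

The two can be combined as explain_exit_code "$(last_exit_code [name])", where [name] is the name of your pod.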

The next step is to check the logs from your previous container instance to see if there are any clues in there. Once you have the pod’s name from kubectl get pods, you can use this command to see the last ten lines from the previous container’s log:

kubectl logs [name] --previous --tail=10

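You can also follow the logs of a deployment by running:

kubectl logs -f deploy/[deployment-name] -n [namespace]

Where [deployment-name] and [namespace] are the names of your deployment and its namespace. kubectl picks one of the deployment’s pods and streams its logs, which will tell you about issues at the application level. Volume mounting errors usually show up in the Events section of kubectl describe pod rather than in the logs.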
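To collect a crashing pod’s events and the previous container’s logs in one step, something like the following sketch may help. The function name crash_report is made up, and the namespace falls back to “default” when omitted.

```shell
# crash_report: hypothetical helper that prints the events recorded for
# a pod, oldest first, followed by the last ten log lines written by
# the previous (crashed) container instance.
#
# Usage: crash_report <pod> [namespace]
crash_report() {
  pod="$1"
  ns="${2:-default}"
  echo "== Events for $pod =="
  kubectl get events -n "$ns" \
    --field-selector "involvedObject.name=$pod" \
    --sort-by=.lastTimestamp
  echo "== Last 10 log lines from the previous container =="
  kubectl logs "$pod" -n "$ns" --previous --tail=10
}
```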

Kubernetes Deployment Error #3: ImagePullBackOff or ErrImagePull

If you see the ImagePullBackOff or ErrImagePull status, that means your pod couldn’t pull its container image from the registry. The underlying cause could be an incorrect image name or tag, or an authentication issue due to a problem with the Secret. To determine which it is, run the command:

kubectl describe pod [name]

You can expect the Events section of the output to point to one of these causes:

Wrong image name or tag: You likely typed the wrong name or tag in the pod manifest. To fix the issue, confirm the correct image name by using the “docker pull” command, then correct it in the pod manifest.

Authentication issue with the container registry: The pod either has a problem with the Secret that holds its registry credentials or doesn’t have the right permissions to pull the image. Correcting that problem should get rid of the ImagePullBackOff or ErrImagePull error.
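Before editing the manifest, it can be useful to print exactly which image references and pull Secrets the pod is using. The sketch below does that with kubectl’s jsonpath output. The name pod_images and the “default” namespace fallback are assumptions.

```shell
# pod_images: hypothetical helper that prints two lines for a pod:
#   line 1: the image reference of every container
#   line 2: the names of the imagePullSecrets (empty if none are set)
#
# Usage: pod_images <pod> [namespace]
pod_images() {
  kubectl get pod "$1" -n "${2:-default}" \
    -o jsonpath='{.spec.containers[*].image}{"\n"}{.spec.imagePullSecrets[*].name}{"\n"}'
}
```

You can then try each printed image reference with docker pull, and check each printed Secret name with kubectl get secret.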

Kubernetes Deployment Error #4: Unknown

If you see the status Unknown, that usually means the worker node has shut down or crashed, so all stateful pods that reside on the node are unavailable. After five minutes (by default), Kubernetes will change the status of all pods scheduled on that node to Unknown and will attempt to schedule those pods on a different node.

If this is your issue, when you run the “kubectl get pods -o wide” command, you’ll see two pods for the affected workload. One will have the status Unknown. The other will have the status ContainerCreating, indicating that Kubernetes is attempting to schedule your pod on a working node.

In that output, you can check the NODE column (shown because of the -o wide option) to see the name of both the crashed node and the working node. Using the name of the crashed node, run the command:

kubectl get node [name]

This output will likely indicate that the node is in NotReady status. If that’s the case, then the issue will probably resolve itself when the failed node is able to recover and rejoin the cluster. The original pod with the Unknown status will be deleted. The second pod that was automatically scheduled on a different node will finish deploying.
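If you aren’t sure which node failed, you can filter the full node list instead. The sketch below assumes the default column layout of kubectl get nodes (NAME, STATUS, and so on). The helper name not_ready_nodes is made up for this example.

```shell
# not_ready_nodes: hypothetical helper, not part of kubectl.
# Reads `kubectl get nodes --no-headers` output on stdin and prints the
# name and status of every node whose STATUS does not start with "Ready".
# Cordoned nodes ("Ready,SchedulingDisabled") are treated as ready.
#
# Usage: kubectl get nodes --no-headers | not_ready_nodes
not_ready_nodes() {
  awk '$2 !~ /^Ready/ { print $1, $2 }'
}
```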

If that process is taking too long, or if the node doesn’t automatically recover, then you can manually delete the failed node. This will allow Kubernetes to finish deploying the new pod on the successful node. Simply run the command:

kubectl delete node [name]

Kubernetes Deployment Error #5: No resources found

When you run your kubectl get pods command, you might get an output that just says:

No resources found.

That means there are no pods in the namespace you queried. If you just created a deployment, one possible cause is that its pods were rejected because they exceeded the CPU or memory limits set by your cluster administrator. You can verify those limits using the command:

kubectl describe limitrange

If this is the issue affecting your Kubernetes pod deployment, you have two options: ask your cluster administrator to increase the limit or reduce the requested resources to something below the existing limit.

Bonus tip: Make sure you have specified the correct namespace. If you don’t, kubectl get pod [name] only looks in your current namespace, which is “default” unless you’ve changed it. To look elsewhere, add -n [namespace] to the command.
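If you don’t know which namespace a pod ended up in, you can search all of them by name. This is a minimal sketch. find_pod is a hypothetical name, and the command only returns results for namespaces you have permission to list.

```shell
# find_pod: hypothetical helper that looks for a pod with the given
# name in every namespace and prints the matches, including the
# NAMESPACE column.
#
# Usage: find_pod <pod>
find_pod() {
  kubectl get pods --all-namespaces --field-selector "metadata.name=$1"
}
```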

More Help Troubleshooting Kubernetes Deployments

These are the issues you’re most likely to run into when you deploy your Kubernetes pods, but it’s far from an exhaustive list. If you can’t find the cause of your problem or if you need more extensive Kubernetes troubleshooting, then you shouldn’t be afraid to ask the experts for help. Copado’s team of DevOps experts is here to help you achieve digital transformation through containerization, Kubernetes orchestration, cloud-native architecture development, and more.

About The Author

Copado Team

#1 DevOps Platform for Salesforce

We Build Unstoppable Teams By Equipping DevOps Professionals With The Platform, Tools And Training They Need To Make Release Days Obsolete. Work Smarter, Not Longer.
