ECK: Document recovery from failed volume upsize #3459

naemono · 2025-10-13T19:19:30Z

In elastic/cloud-on-k8s#4467 it's noted that some users are dealing with volume expansion failure issues, and documenting how to recover from this situation would be helpful. This is the attempt to update that documentation.

After merge

Close the old ECK issue

Signed-off-by: Michael Montgomery <mmontg1@gmail.com>

Copilot

Pull Request Overview

This PR adds documentation to help users recover from failed Elasticsearch volume expansion operations in ECK (Elastic Cloud on Kubernetes). The change addresses user issues where volume expansion failures can leave deployments in an unrecoverable state.

Adds a new troubleshooting section for volume expansion failures
Documents the recommended recovery approach using nodeSet renaming
Provides specific error message and solution context

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

troubleshoot/deployments/cloud-on-k8s/common-problems.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Signed-off-by: Michael Montgomery <mmontg1@gmail.com>

github-actions · 2025-10-14T14:35:11Z

🔍 Preview links for changed docs

troubleshoot/deployments/cloud-on-k8s/common-problems.md

kilfoyle · 2025-10-14T14:47:27Z

troubleshoot/deployments/cloud-on-k8s/common-problems.md

+
+## If a volume expansion failed [k8s-common-problems-volume-failed-expansion]
+
+If you attempted an expansion of an Elasticsearch volume via its [volume claim template](/deploy-manage/deploy/cloud-on-k8s/volume-claim-templates.md#k8s-volume-claim-templates-update), you may have encountered scenarios where the operation failed such as Azure not allowing volume expansion without shutting down the Virtual Machine to which it is attached. If you try to adjust the volume claim template back to the original size you will encounter a failure:


Suggested change

If you attempted an expansion of an Elasticsearch volume via its [volume claim template](/deploy-manage/deploy/cloud-on-k8s/volume-claim-templates.md#k8s-volume-claim-templates-update), you may have encountered scenarios where the operation failed such as Azure not allowing volume expansion without shutting down the Virtual Machine to which it is attached. If you try to adjust the volume claim template back to the original size you will encounter a failure:

If you attempted an expansion of an {{es}} volume via its [volume claim template](/deploy-manage/deploy/cloud-on-k8s/volume-claim-templates.md#k8s-volume-claim-templates-update), you may have encountered scenarios where the operation failed such as Azure not allowing volume expansion without shutting down the Virtual Machine to which it is attached. If you try to adjust the volume claim template back to the original size you will encounter a failure:

kilfoyle · 2025-10-14T14:48:06Z

troubleshoot/deployments/cloud-on-k8s/common-problems.md

+Failed to apply spec change: handle volume expansion: decreasing storage size is not supported: an attempt was made to decrease storage size for claim elasticsearch-data
+```
+
+In this scenario the best course of action is to rename the existing `nodeSet` to a new name while simultaneously updating the volume claim template to the original size. This operation will bring a new `StatefulSet` online while moving all existing indices to the new volumes and will delete the old `StatefulSet` and its volumes once the operation is complete.


Suggested change

In this scenario the best course of action is to rename the existing `nodeSet` to a new name while simultaneously updating the volume claim template to the original size. This operation will bring a new `StatefulSet` online while moving all existing indices to the new volumes and will delete the old `StatefulSet` and its volumes once the operation is complete.

In this scenario the best course of action is to rename the existing `nodeSet` to a new name while simultaneously updating the volume claim template to the original size. This operation will bring a new `StatefulSet` online while moving all existing indices to the new volumes, and will delete the old `StatefulSet` and its volumes once the operation is complete.

kilfoyle

LGTM! 🚢
Just two super nit-picky comments. :-)

naemono added 2 commits October 13, 2025 13:46

wip

b7b6d96

Signed-off-by: Michael Montgomery <mmontg1@gmail.com>

Update wording.

30e8b12

Signed-off-by: Michael Montgomery <mmontg1@gmail.com>

naemono requested a review from a team as a code owner October 13, 2025 19:19

naemono requested review from barkbay, Copilot, kvalliyurnatt, pebrc and rhr323 October 13, 2025 19:19

Copilot AI reviewed Oct 13, 2025

View reviewed changes

troubleshoot/deployments/cloud-on-k8s/common-problems.md Outdated Show resolved Hide resolved

github-actions bot had a problem deploying to docs-preview October 13, 2025 19:20 Failure

Fix the url

382e907

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

github-actions bot had a problem deploying to docs-preview October 13, 2025 19:21 Failure

Add missing .md

9467f53

Signed-off-by: Michael Montgomery <mmontg1@gmail.com>

github-actions bot deployed to docs-preview October 14, 2025 14:32 View deployment

kilfoyle reviewed Oct 14, 2025

View reviewed changes

kilfoyle approved these changes Oct 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ECK: Document recovery from failed volume upsize #3459

ECK: Document recovery from failed volume upsize #3459

naemono commented Oct 13, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

github-actions bot commented Oct 14, 2025

Uh oh!

kilfoyle Oct 14, 2025

Uh oh!

kilfoyle Oct 14, 2025

Uh oh!

kilfoyle left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		## If a volume expansion failed [k8s-common-problems-volume-failed-expansion]

		If you attempted an expansion of an Elasticsearch volume via its [volume claim template](/deploy-manage/deploy/cloud-on-k8s/volume-claim-templates.md#k8s-volume-claim-templates-update), you may have encountered scenarios where the operation failed such as Azure not allowing volume expansion without shutting down the Virtual Machine to which it is attached. If you try to adjust the volume claim template back to the original size you will encounter a failure:

	In this scenario the best course of action is to rename the existing `nodeSet` to a new name while simultaneously updating the volume claim template to the original size. This operation will bring a new `StatefulSet` online while moving all existing indices to the new volumes and will delete the old `StatefulSet` and its volumes once the operation is complete.
	In this scenario the best course of action is to rename the existing `nodeSet` to a new name while simultaneously updating the volume claim template to the original size. This operation will bring a new `StatefulSet` online while moving all existing indices to the new volumes, and will delete the old `StatefulSet` and its volumes once the operation is complete.

ECK: Document recovery from failed volume upsize #3459

Are you sure you want to change the base?

ECK: Document recovery from failed volume upsize #3459

Conversation

naemono commented Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

After merge

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

github-actions bot commented Oct 14, 2025

🔍 Preview links for changed docs

Uh oh!

kilfoyle Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

kilfoyle Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

kilfoyle left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

naemono commented Oct 13, 2025 •

edited

Loading