Allow missing shard stats for restarted nodes for `_snapshot/_status` #4410

JeremyDahlgren · 2025-05-23T20:24:14Z

Adds a note explaining the change made in elasticsearch PR #128399 to reduce latency when getting stats for currently running snapshots.

Leaving as draft until elasticsearch PR #128399 has been merged.

Relates ES-10982

github-actions · 2025-05-23T20:25:45Z

Following you can find the validation results for the API you have changed.

API	Status	Request	Response
`snapshot.status`	🟢	2/2	2/2

You can validate this API yourself by using the make validate target.

github-actions · 2025-05-30T19:23:22Z

Following you can find the validation results for the API you have changed.

API	Status	Request	Response
`snapshot.status`	🟢	2/2	2/2

You can validate this API yourself by using the make validate target.

github-actions · 2025-05-30T19:48:01Z

Following you can find the validation results for the API you have changed.

API	Status	Request	Response
`snapshot.status`	🟢	2/2	2/2

You can validate this API yourself by using the make validate target.

Adds a note explaining the change made in elasticsearch PR #128399 to reduce latency when getting stats for currently running snapshots. Relates ES-10982

github-actions · 2025-06-10T17:29:32Z

Following you can find the validation results for the API you have changed.

API	Status	Request	Response
`snapshot.status`	🟢	2/2	2/2

You can validate this API yourself by using the make validate target.

DiannaHohensee

IIUC, the text was, and still is, identical across files?

Is there a way to preview what this looks like?

Is there someplace we need to change/add an API response example?

DiannaHohensee · 2025-06-12T23:45:06Z

output/openapi/elasticsearch-openapi.json

@@ -43923,7 +43923,7 @@
          "snapshot"
        ],
        "summary": "Get the snapshot status",
-        "description": "Get a detailed description of the current state for each shard participating in the snapshot.\n\nNote that this API should be used only to obtain detailed shard-level information for ongoing snapshots.\nIf this detail is not needed or you want to obtain information about one or more existing snapshots, use the get snapshot API.\n\nIf you omit the `<snapshot>` request path parameter, the request retrieves information only for currently running snapshots.\nThis usage is preferred.\nIf needed, you can specify `<repository>` and `<snapshot>` to retrieve information for specific snapshots, even if they're not currently running.\n\nWARNING: Using the API to return the status of any snapshots other than currently running snapshots can be expensive.\nThe API requires a read from the repository for each shard in each snapshot.\nFor example, if you have 100 snapshots with 1,000 shards each, an API request that includes all snapshots will require 100,000 reads (100 snapshots x 1,000 shards).\n\nDepending on the latency of your storage, such requests can take an extremely long time to return results.\nThese requests can also tax machine resources and, when using cloud storage, incur high processing costs.\n\n ## Required authorization\n* Cluster privileges: `monitor_snapshot`",
+        "description": "Get a detailed description of the current state for each shard participating in the snapshot.\n\nNote that this API should be used only to obtain detailed shard-level information for ongoing snapshots.\nIf this detail is not needed or you want to obtain information about one or more existing snapshots, use the get snapshot API.\n\nIf you omit the `<snapshot>` request path parameter, the request retrieves information only for currently running snapshots.\nThis usage is preferred.\nNote that if a node has been restarted or has left the cluster since completing a shard snapshot the stats for that shard will be unavailable.\nLoading the stats from the repository is an expensive operation (see the WARNING below), so to minimize latency for returning stats for currently\nrunning snapshots the stats values will be -1 for these shards even though the \"stage\" value will be \"DONE\".  A \"description\" field will be set\non these shard stats instances indicating why they are empty.  Note that the total stats for the index will be less than expected due to the\nmissing values from these shards.\nIf needed, you can specify `<repository>` and `<snapshot>` to retrieve information for specific snapshots, even if they're not currently running.\n\nWARNING: Using the API to return the status of any snapshots other than currently running snapshots can be expensive.\nThe API requires a read from the repository for each shard in each snapshot.\nFor example, if you have 100 snapshots with 1,000 shards each, an API request that includes all snapshots will require 100,000 reads (100 snapshots x 1,000 shards).\n\nDepending on the latency of your storage, such requests can take an extremely long time to return results.\nThese requests can also tax machine resources and, when using cloud storage, incur high processing costs.\n\n ## Required authorization\n* Cluster privileges: `monitor_snapshot`",


Is there a way to get a preview to see how the formatting looks?

I don't know what the expectations are for formatting. Currently, there's a new line after every sentence, not in the middle of any sentence, regardless of the sentence length. I have a rearrangement suggestion for your new text, which also avoids having to go outside the existing newline pattern (at least the new text doesn't have any sentences longer than the pre-existing text). I added spacing just to see what I was doing.

"Get a detailed description of the current state for each shard participating in the snapshot. \n\nNote that this API should be used only to obtain detailed shard-level information for ongoing snapshots. \nIf this detail is not needed or you want to obtain information about one or more existing snapshots, use the get snapshot API. \n\nIf you omit the `<snapshot>` request path parameter, the request retrieves information only for currently running snapshots. \nThis usage is preferred. >>> I moved this line up <<< \nIf needed, you can specify `<repository>` and `<snapshot>` to retrieve information for specific snapshots, even if they're not currently running. >>> New text (new paragraph, too) <<< \n\nNote that the stats will not be available for any shard snapshots in an ongoing snapshot completed by a node that (even momentarily) left the cluster. \nLoading the stats from the repository is an expensive operation (see the WARNING below). \nTherefore the stats values for such shards will be -1 even though the \"stage\" value will be \"DONE\", in order to minimize latency. \nA \"description\" field will be present on for a shard snapshot completed by a departed node explaining why the shard snapshot's stats results are invalid. \nConsequently, the total stats for the index will be less than expected due to the missing values from these shards. >>> <<< \n\nWARNING: Using the API to return the status of any snapshots other than currently running snapshots can be expensive. \nThe API requires a read from the repository for each shard in each snapshot. \nFor example, if you have 100 snapshots with 1,000 shards each, an API request that includes all snapshots will require 100,000 reads (100 snapshots x 1,000 shards). \n\nDepending on the latency of your storage, such requests can take an extremely long time to return results. \nThese requests can also tax machine resources and, when using cloud storage, incur high processing costs. \n\n ## Required authorization\n* Cluster privileges: `monitor_snapshot`",

DiannaHohensee · 2025-06-13T00:06:30Z

Is there someplace we need to change/add an API response example?

It looks like the allocation explain docs have two response examples -- I remembered see it once -- https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-cluster-allocation-explain, fwiw. Maybe we could add another example for the new description field in the same way allocation explain does it.

JeremyDahlgren added the specification label May 23, 2025

JeremyDahlgren mentioned this pull request May 23, 2025

Allow missing shard stats for restarted nodes for _snapshot/_status elastic/elasticsearch#128399

Merged

JeremyDahlgren mentioned this pull request May 23, 2025

Allow missing shard stats for restarted nodes for _snapshot/_status #4408

Closed

JeremyDahlgren force-pushed the jdahlgren/get-snapshot-status-missing-stats-for-restarted-nodes branch from d2d9990 to cc13ef7 Compare June 10, 2025 17:24

Allow missing shard stats for restarted nodes for _snapshot/_status

a85ad64

Adds a note explaining the change made in elasticsearch PR #128399 to reduce latency when getting stats for currently running snapshots. Relates ES-10982

JeremyDahlgren force-pushed the jdahlgren/get-snapshot-status-missing-stats-for-restarted-nodes branch from cc13ef7 to a85ad64 Compare June 10, 2025 17:27

JeremyDahlgren added the skip-backport This pull request should not be backported label Jun 10, 2025

JeremyDahlgren requested a review from DiannaHohensee June 10, 2025 17:32

JeremyDahlgren marked this pull request as ready for review June 10, 2025 17:32

DiannaHohensee reviewed Jun 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow missing shard stats for restarted nodes for `_snapshot/_status` #4410

Allow missing shard stats for restarted nodes for `_snapshot/_status` #4410

Uh oh!

JeremyDahlgren commented May 23, 2025

Uh oh!

github-actions bot commented May 23, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

github-actions bot commented Jun 10, 2025

Uh oh!

DiannaHohensee left a comment

Uh oh!

DiannaHohensee Jun 12, 2025 •

edited

Loading

Uh oh!

DiannaHohensee commented Jun 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Allow missing shard stats for restarted nodes for _snapshot/_status #4410

Are you sure you want to change the base?

Allow missing shard stats for restarted nodes for _snapshot/_status #4410

Uh oh!

Conversation

JeremyDahlgren commented May 23, 2025

Uh oh!

github-actions bot commented May 23, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

github-actions bot commented Jun 10, 2025

Uh oh!

DiannaHohensee left a comment

Choose a reason for hiding this comment

Uh oh!

DiannaHohensee Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DiannaHohensee commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Allow missing shard stats for restarted nodes for `_snapshot/_status` #4410

Allow missing shard stats for restarted nodes for `_snapshot/_status` #4410

DiannaHohensee Jun 12, 2025 •

edited

Loading

DiannaHohensee commented Jun 13, 2025 •

edited

Loading