Manager restore error (kms_error AccessDeniedException) for multiDC cluster with EAR enabled #3871

mikliapko · 2024-04-22T14:51:29Z

Issue description

Manager restore operation returns kms_error AccessDeniedException for multiDC cluster with EAR enabled.

Full error message:
Found exception: kms_error (AccessDeniedException: User: arn:aws:iam::797456418907:role/qa-scylla-manager-backup-role is not authorized to perform: kms:Decrypt on the resource associated with this ciphertext because the resource does not exist in this Region, no resource-based policies allow access, or a resource-based policy explicitly denies access)

The issue has been observed just recently after we tried to switch Managet SCT tests to run against Scylla 2024.1 instead of 2022.

Impact

The restore operation returns this error multiple times during one run but the whole process finishes successfully. The reason of that I suppose in kms availability of some nodes of the cluster.

How frequently does it reproduce?

Every restore operation performed in such configuration.

Installation details

SCT Version: 31ff1e87d830ce7fe2587e0c609d113d2f66f8a4
Scylla version (or git commit hash): 2024.1.3-20240401.64115ae91a55

Logs

argus job (API security #10): https://argus.scylladb.com/workspace?state=WyIxMzc2ODAyZC0xN2YxLTRiM2MtODhiMy0zNzcwOWI5NDMzZGYiXQ

The text was updated successfully, but these errors were encountered:

mikliapko · 2024-04-22T14:51:40Z

@fruch
The question here - is it a valid case to have a multiDC cluster with turned on Encryption on Rest feature and expect all the Manager's features like restore, etc fully functional? If yes, this issue should be address and fixed in scope of Manager's activities, otherwise, some fixes to SCT for these Manager's tests are required.

@karol-kokoszka @Michal-Leszczynski @rayakurl FYI

vponomaryov · 2024-04-22T15:43:44Z

@mikliapko

In SCT, MultiDC setups with enabled KMS work using regional keys for DB nodes. In your case first 2 DB nodes use one KMS key and third node (from another region) uses separate/another one.
The 2022.x scylla didn't have EaR KMS, so, you couldn't get this problem there.
The manager's feature is not related directly to the problem. The problem is how is it used.
It seems that wrong/unexpected (from different region?) encryption key gets applied against sstables.
If you backed up sstables as encrypted objects from one region and try to apply it to another region then it will not work.
Because, as it is said above, SCT uses per-region KMS keys.

So, as a first step check the sstables per-region limitation for violation by your scenario.
If above is false then need to provide more details about the steps that get done in scope of the mgmt restore operation.

mikliapko · 2024-04-23T09:33:51Z

Hey @vponomaryov,

Thanks for your reply.

Just to clarify, if keys are the same (I mean content) but uploaded to different regions they will be considered by AWS as different and it will basically result in restore issue I described above?
You mentioned that SCT uses per-regiod KMS keys. Then probably the approach with using a kind of multi-region key (https://docs.aws.amazon.com/kms/latest/developerguide/multi-region-keys-overview.html) should help to avoid this issue for multiDC setup. What do you think about it?

vponomaryov · 2024-04-23T09:52:18Z

Just to clarify, if keys are the same (I mean content) but uploaded to different regions they will be considered by AWS as different and it will basically result in restore issue I described above?

No. Scylla gets configured with the KMS keys by their aliases. It doesn't check keys equality.
Just need to understand that encrypted sstables may be decrypted only with proper private/encryption key.

You mentioned that SCT uses per-regiod KMS keys. Then probably the approach with using a kind of multi-region key (https://docs.aws.amazon.com/kms/latest/developerguide/multi-region-keys-overview.html) should help to avoid this issue for multiDC setup. What do you think about it?
Multi-region keys will help to workaround the incorrect approach for restore - yes.
But need to fix the approach for backup+restore operations in the test.
The question not in KMS keys, the question is in usage of sstables. Move sstables only in scope of a single region - for each of the regions.

Again, need to understand the steps done in the test and why it is so?
Is it expected that sstables are stored in encrypted state?
Why then it is stored without private/decryption key reference?

I don't see proofs that the mgmt test approach is correct.
The argument for the per-region KMS keys - if customer chooses this way, then he will not be able to use manager?
Doesn't sound serious.

mikliapko · 2024-04-23T10:10:28Z

Again, need to understand the steps done in the test and why it is so?
Is it expected that sstables are stored in encrypted state?
Why then it is stored without private/decryption key reference?

I got your point. Okay, I need to dig deeper into the test itself because what I did is only changed the version of Scylla (from 2022 to 2024) for the test_manager_sanity test which worked fine on 2022. So, the initial test didn't consider this EaR feature to be here. But now it should be adjusted to it properly.

I don't see proofs that the mgmt test approach is correct.
The argument for the per-region KMS keys - if customer chooses this way, then he will not be able to use manager?
Doesn't sound serious.

Agree.

mikliapko · 2024-04-23T12:58:43Z

@vponomaryov I'm wondering does the current implementation allow to disable Scylla Encryption?

From what I see in the code the encryption will be enabled by default for:

Enterprise version >= '2023.1.3';
AWS cluster;
non-mixed db type;
in absence of custom encryption options.
That's basically the configuration used in Manager tests and Encryption becomes enabled by default.

https://github.com/scylladb/scylla-cluster-tests/blob/caec2aae51d82dd3cbc0870ec59821ddec759401/sdcm/tester.py#L810

And I don't see any ways to correctly disable it except providing any non-equal to auto kms_host parameter value.

vponomaryov · 2024-04-24T05:57:06Z

@vponomaryov I'm wondering does the current implementation allow to disable Scylla Encryption?

From what I see in the code the encryption will be enabled by default for:

Enterprise version >= '2023.1.3';

AWS cluster;

non-mixed db type;

in absence of custom encryption options.
That's basically the configuration used in Manager tests and Encryption becomes enabled by default.

https://github.com/scylladb/scylla-cluster-tests/blob/caec2aae51d82dd3cbc0870ec59821ddec759401/sdcm/tester.py#L810

And I don't see any ways to correctly disable it except providing any non-equal to auto kms_host parameter value.

Just update the config the following way:

scylla_encryption_options: "{'key_provider': 'none'}"

mikliapko · 2024-04-24T07:41:30Z

Just update the config the following way:
scylla_encryption_options: "{'key_provider': 'none'}"

Got it, thanks

fruch · 2024-04-25T05:32:46Z

Just to emphasize, the scylla causing the issue is the one installed on the monitor node for manager

We never tested it with KMS enabled, since the SCT code is written to enable KMS by default in the supported versions, it got enabled.

There is no reason it shouldn't be working, I'm guessing it's just a configuration error picking the wrong region in scylla.yaml configuration, and should be fixed.

mikliapko · 2024-04-25T08:42:40Z

Just to emphasize, the scylla causing the issue is the one installed on the monitor node for manager

Hm, I thought that the problem is in cluster nodes (in one of the region) because while going through logs I've seen this error for manager-regression-manager--db-node-52191a6f-1, manager-regression-manager--db-node-52191a6f-2 which are located in one region and no errors for the node in another region. I'm looking into this pipeline, job 11.

@fruch How to understand that the problem here is related to monitor node?

fruch · 2024-04-25T10:46:32Z

Just to emphasize, the scylla causing the issue is the one installed on the monitor node for manager

Hm, I thought that the problem is in cluster nodes (in one of the region) because while going through logs I've seen this error for manager-regression-manager--db-node-52191a6f-1, manager-regression-manager--db-node-52191a6f-2 which are located in one region and no errors for the node in another region. I'm looking into this pipeline, job 11.

@fruch How to understand that the problem here is related to monitor node?

I take it back, I was confused cause of the name of node had manger in it

Yes it's the DB nodes, and yes the expectation is that the manager is putting the stables back in the same nodes or at least the same region

I don't know what the cloud did by default, but if it's not multi-region keys, restore for multi region setup would be broken

We can't disable KMS before understanding the situation

mikliapko · 2024-04-25T10:53:57Z

and yes the expectation is that the manager is putting the stables back in the same nodes or at least the same region

@karol-kokoszka Could you please elaborate on this?

karol-kokoszka · 2024-04-25T11:06:08Z

Yes it's the DB nodes, and yes the expectation is that the manager is putting the stables back in the same nodes or at least the same region

To make it working the way that SSTables are sent to the same region (DC), you must specify the DC when adding location to the restore task https://manager.docs.scylladb.com/stable/sctool/restore.html#l-location

"The format is [:]:. The parameter is optional. It allows you to specify the datacenter whose nodes will be used to restore the data from this location in a multi-dc setting, it must match Scylla nodes datacenter. By default, all live nodes are used to restore data from specified locations."

If the DC is not specified in the location, then it may be sent to any node.

I guess you must restore multiDC cluster, DC by DC when the encryption at rest is enabled.

mikliapko · 2024-04-25T11:23:18Z

I guess you must restore multiDC cluster, DC by DC when the encryption at rest is enabled.

It means that backup should be also done with --location option specified with every cluster's DC, right?

karol-kokoszka · 2024-04-25T11:31:57Z

It means that backup should be also done with --location ([:]:) option specified with every cluster's DC, right?

I don't think it's necessary.
It's needed when you want to have separate backup bucket per DC.

mikliapko · 2024-04-25T11:43:24Z

It means that backup should be also done with --location ([:]:) option specified with every cluster's DC, right?

I don't think it's necessary. It's needed when you want to have separate backup bucket per DC.

In such case I suppose, I need to know which DC to use during restoring and specifically which key was used to encrypt the SSTables during backup, right? Does sctool provides such opportunity?

karol-kokoszka · 2024-04-25T11:57:30Z

In such case I suppose, I need to know which DC to use during restoring and specifically which key was used to encrypt the SSTables during backup, right? Does sctool provides such opportunity?

SCTool does not concern itself with encryption at rest and is not aware of the keys used to encrypt SSTables. Therefore, it is unnecessary for you to know which keys were used during the backup process.

It is the responsibility of the Scylla server to manage decryption of the data. When SM (presumably Scylla Manager) employs the load & stream feature for restoration, it calls the Scylla server and passes the SSTable. Subsequently, Scylla is tasked with identifying the appropriate node to which the SSTable should be streamed.

I presume that Scylla must first decrypt the SSTable in order to determine the correct destination for streaming. In the scenario you described with this issue, there is a possibility that an SSTable encrypted with a key stored in a different region was sent to a node lacking access to the Key Management Service (KMS) in that region.

To mitigate this issue, it is advisable to restore data center (DC) by data center (DC), ensuring that SSTables encrypted with a specific key (e.g., key A) are decrypted with the corresponding key A.

mikliapko · 2024-04-25T12:03:52Z

To mitigate this issue, it is advisable to restore data center (DC) by data center (DC), ensuring that SSTables encrypted with a specific key (e.g., key A) are decrypted with the corresponding key A.

Thanks a lot for detailed explanation, I'll experiment

mikliapko · 2024-04-26T16:54:20Z

@karol-kokoszka could you please take a look?

I made an attempt to restore specifying two locations - one for every DC. sctool returns error that location specified multiple times.

Command: 'sudo sctool restore -c 39298668-5b05-4338-96e9-3f0b9425dff4 --restore-tables --location us-eastscylla_node_east:s3:manager-backup-tests-us-east-1,us-west-2scylla_node_west:s3:manager-backup-tests-us-east-1  --snapshot-tag sm_20240425231055UTC'
Exit code: 1
Stdout:
Stderr:
Error: create restore target, units and views: init target: location us-west-2scylla_node_west:s3:manager-backup-tests-us-east-1 is specified multiple times
Trace ID: GKvUjE6lRCWpY2NO5K1jfQ (grep in scylla-manager logs)

Michal-Leszczynski · 2024-04-29T07:30:56Z

@mikliapko this is somewhat of SM limitation/bug - that you can't specify given location with many DCs and other location with other DC.

How many DCs do you have in the restore destination cluster? If only mentioned 2, then you can run restore with a single location without DC specified (it will use all nodes with location access for restoring the data).

karol-kokoszka · 2024-04-29T07:33:07Z

@Michal-Leszczynski the goal is to restore DC by DC. And to send node data from DC A to the nodes from DC A.

@mikliapko Please just use separate restore tasks, one per DC.

Michal-Leszczynski · 2024-04-29T07:55:43Z

But something like this is not supported by SM right now. When location is specified, nodes with access (or nodes from specified DC with access) to it restore the whole backup data from this location. So one would need truly separate backup locations for this purpose.

karol-kokoszka · 2024-04-29T08:00:19Z

But something like this is not supported by SM right now. When location is specified, nodes with access (or nodes from specified DC with access) to it restore the whole backup data from this location. So one would need truly separate backup locations for this purpose.

If so, then this is a bug, that we must address in some of the upcoming releases.
It doesn't fit the backup specification that we advertise with our documentation https://manager.docs.scylladb.com/stable/backup/specification.html

Backup location structure, explicitly defines the tree path to exact DC.

Restore must take advantage of it. We may have problems with multiDC EaR without it.

Sanity tests on debian11, ubuntu22 and ubuntu24 switched to be run on multiDC cluster. This configuration was already in place some time ago before issue scylladb/scylla-manager#3871 was found. After that Manager jobs were switched to run only on singleDC cluster (scylladb#7435). And since the fix for Manager is ready now, multiDC setup can be brought back.

Now, even if host failed to restore given batch it can still try to restore batches originating from different dcs. This improves retries in general, but also should help with #3871.

mikliapko · 2024-10-18T10:59:31Z

Funny thing - I switched some of Manager tests back to run on multiDC cluster - they work, no failures.
I did a couple of runs with Manager version 3.2.6 and Scylla Enterprise 2024.1.3 (the same versions that failed some time ago) - no kms-related failures.

@vponomaryov I suppose you should know better if there were any changes to SCT that might have resulted in such behavior?
May we have the same keys configured for different regions that results in smooth multiDC restore operation in Manager?

mikliapko · 2024-10-21T09:24:09Z

@vponomaryov I suppose you should know better if there were any changes to SCT that might have resulted in such behavior? May we have the same keys configured for different regions that results in smooth multiDC restore operation in Manager?

@fruch Perhaps you know something about it?

fruch · 2024-10-21T10:39:03Z

@vponomaryov I suppose you should know better if there were any changes to SCT that might have resulted in such behavior? May we have the same keys configured for different regions that results in smooth multiDC restore operation in Manager?

@fruch Perhaps you know something about it?

I don't know, you'll have to look into the logs to confirm KMS was working at all.

We didn't change the way kms is setup since this issue was opened.

Maybe the test is now shorted and we haven't yet rotated the keys ? (I don't remember the initial issue here)

mikliapko · 2024-10-21T10:53:46Z

I don't know, you'll have to look into the logs to confirm KMS was working at all.

We didn't change the way kms is setup since this issue was opened.

Maybe the test is now shorted and we haven't yet rotated the keys ? (I don't remember the initial issue here)

It shouldn't be something related to rotation, I believe, since the initial issue was related to the single-region keys.
We have used single-region keys in SCT and for multiDC cluster with EaR enabled the next errors were raising during Manager restore:

Found exception: kms_error (AccessDeniedException: User: arn:aws:iam::797456418907:role/qa-scylla-manager-backup-role is not authorized to perform: kms:Decrypt on the resource associated with this ciphertext because the resource does not exist in this Region, no resource-based policies allow access, or a resource-based policy explicitly denies access)

Alright, I'll dig deeper into what might have changed since that time.

Sanity tests on debian11, ubuntu22 and ubuntu24 switched to be run on multiDC cluster. This configuration was already in place some time ago before issue scylladb/scylla-manager#3871 was found. After that Manager jobs were switched to run only on singleDC cluster (scylladb#7435). And since the fix for Manager is ready now, multiDC setup can be brought back.

mikliapko · 2024-10-21T19:09:35Z

@fruch @vponomaryov
The root cause is the following - during nodes configuration SCT defines the same aws_region for all nodes of multiDC cluster in scylla.yaml:

kms_hosts:
  auto:
    aws_region: us-east-1
    aws_use_ec2_credentials: true
    master_key: alias/testid-10224f75-e3b0-4879-a336-e1293bcb2d77

For my case, I have multiDC cluster with nodes in two regions - us-east-1 and us-west-2. You can take a look into at scylla.yaml files for run - aws_region: us-east-1 is defined for all nodes.

I suppose this issue was introduced here since changing the line from:

append_scylla_yaml["kms_hosts"][kms_host_name]["aws_region"] = self.vm_region

to:

append_scylla_yaml["kms_hosts"][kms_host_name]["aws_region"] = self.region

resolved configuration issue.

mikliapko · 2024-10-22T08:49:02Z

SCT issue reported - scylladb/scylla-cluster-tests#9025

* feat(restore): make batches retryable Now, if batch restoration failed on one node, it can still be retried by other nodes. Failed node is no longer used for the restore. Fixes #4065 * feat(restore_test): test batch retry This commit adds TestRestoreTablesBatchRetryIntegration, which injects errors during download and LAS step and validates that restore finished successfully despite them (thanks to batch retries). * refactor(restore): flatten workload structure After giving it some more thought, I decided to flatten workload structure. Instead of having location/table/dir layers, now everything operates on the dir layer. It makes the implementation easier, especially for the upcoming changes related to node retries. * feat(restore): add host retries Now, even if host failed to restore given batch it can still try to restore batches originating from different dcs. This improves retries in general, but also should help with #3871. * feat(restore): extend batch test with host retry This commit extends TestBatchDispatcher to include failures in its scenario and to validate host retry in a different datacenter. * refactor(restore): make batchDispatcher more comprehensible Previously Workload structure was created during indexing and was updated during batching in order to keep its progress. This was confusing, because it wasn't obvious whether size and SSTable fields were describing the initial Workload state or the updated one. This commit makes it so Workload structure is not changed during batching. Instead, workloadProgress was added to in order to store batching progress. Moreover, this commit also adds a lot of documentation about batchDispatcher internal behavior. * fix(restore): free host when it can't restore anymore Consider a scenario with parallel=1 and multi-dc and multi-location. Note that SM is using 'parallel.Run' for restoring in parallel. Note that previous batching changes made host hang in 'batchDispatcher.DispatchBatch' if there were no more SSTables to restore, because it was still possible that another node failed to restore some SSTables, so that the hanging host could be awakened and restore failed SSTables returned to batchDispatcher. All of this meant that batching process could hang, because 'parallel.Run' would allow only a single host to restore SSTables at the time, but batching mechanism wouldn't free it until all SSTables are restored. Another scenario when batching mechanism could fail would be that all hosts failed (with re-tries) to restore all SSTables. Because of that, I changed batching mechanism to be more DC oriented. Now, 'workloadProgress' keeps track of remaining bytes to be restored per DC, and it also keeps host DC access instead of location access (the assumption being that a single DC can be backed up to only single location). This information allow to free hosts that can't restore any SSTables because they either already failed to restore some SSTables from given DCs, or all SSTables from given DCs were already restored. * feat(restore): improve context cancelling during batching I'm not sure if previous behavior was bugged, but changes introduced in this commit should make it more clear that batching mechanism respects context cancellation. This commit also adds a simple test validating that pausing restore during batching ends quickly.

vponomaryov · 2024-10-22T17:05:34Z

SCT issue reported - scylladb/scylla-cluster-tests#9025

Fix: scylladb/scylla-cluster-tests#9026

github-actions bot assigned mikliapko Apr 22, 2024

mikliapko mentioned this issue Apr 24, 2024

fix(manager): upgrade Scylla version used in Manager to latest scylladb/scylla-cluster-tests#7365

Merged

4 tasks

karol-kokoszka mentioned this issue Apr 29, 2024

Give possibility for restoring DC using mapping sourceDC -> destinationDC #3829

Open

rayakurl transferred this issue from scylladb/scylla-cluster-tests May 28, 2024

rayakurl added the qa should be used for qa team testing tasks label May 28, 2024

Michal-Leszczynski mentioned this issue Oct 9, 2024

Add re-tries to restore #4065

Closed

Michal-Leszczynski mentioned this issue Oct 16, 2024

Restore improvement: batch retry #4071

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Manager restore error (kms_error AccessDeniedException) for multiDC cluster with EAR enabled #3871

Manager restore error (kms_error AccessDeniedException) for multiDC cluster with EAR enabled #3871

mikliapko commented Apr 22, 2024

mikliapko commented Apr 22, 2024 •

edited

Loading

vponomaryov commented Apr 22, 2024 •

edited

Loading

mikliapko commented Apr 23, 2024

vponomaryov commented Apr 23, 2024 •

edited by mikliapko

Loading

mikliapko commented Apr 23, 2024 •

edited

Loading

mikliapko commented Apr 23, 2024 •

edited

Loading

vponomaryov commented Apr 24, 2024 •

edited

Loading

mikliapko commented Apr 24, 2024

fruch commented Apr 25, 2024

mikliapko commented Apr 25, 2024

fruch commented Apr 25, 2024

mikliapko commented Apr 25, 2024

karol-kokoszka commented Apr 25, 2024 •

edited

Loading

mikliapko commented Apr 25, 2024 •

edited

Loading

karol-kokoszka commented Apr 25, 2024

mikliapko commented Apr 25, 2024

karol-kokoszka commented Apr 25, 2024

mikliapko commented Apr 25, 2024

mikliapko commented Apr 26, 2024

Michal-Leszczynski commented Apr 29, 2024

karol-kokoszka commented Apr 29, 2024 •

edited

Loading

Michal-Leszczynski commented Apr 29, 2024

karol-kokoszka commented Apr 29, 2024

mikliapko commented Oct 18, 2024

mikliapko commented Oct 21, 2024

fruch commented Oct 21, 2024

mikliapko commented Oct 21, 2024

mikliapko commented Oct 21, 2024

mikliapko commented Oct 22, 2024

vponomaryov commented Oct 22, 2024

Manager restore error (kms_error AccessDeniedException) for multiDC cluster with EAR enabled #3871

Manager restore error (kms_error AccessDeniedException) for multiDC cluster with EAR enabled #3871

Comments

mikliapko commented Apr 22, 2024

Issue description

Impact

How frequently does it reproduce?

Installation details

Logs

mikliapko commented Apr 22, 2024 • edited Loading

vponomaryov commented Apr 22, 2024 • edited Loading

mikliapko commented Apr 23, 2024

vponomaryov commented Apr 23, 2024 • edited by mikliapko Loading

mikliapko commented Apr 23, 2024 • edited Loading

mikliapko commented Apr 23, 2024 • edited Loading

vponomaryov commented Apr 24, 2024 • edited Loading

mikliapko commented Apr 24, 2024

fruch commented Apr 25, 2024

mikliapko commented Apr 25, 2024

fruch commented Apr 25, 2024

mikliapko commented Apr 25, 2024

karol-kokoszka commented Apr 25, 2024 • edited Loading

mikliapko commented Apr 25, 2024 • edited Loading

karol-kokoszka commented Apr 25, 2024

mikliapko commented Apr 25, 2024

karol-kokoszka commented Apr 25, 2024

mikliapko commented Apr 25, 2024

mikliapko commented Apr 26, 2024

Michal-Leszczynski commented Apr 29, 2024

karol-kokoszka commented Apr 29, 2024 • edited Loading

Michal-Leszczynski commented Apr 29, 2024

karol-kokoszka commented Apr 29, 2024

mikliapko commented Oct 18, 2024

mikliapko commented Oct 21, 2024

fruch commented Oct 21, 2024

mikliapko commented Oct 21, 2024

mikliapko commented Oct 21, 2024

mikliapko commented Oct 22, 2024

vponomaryov commented Oct 22, 2024

mikliapko commented Apr 22, 2024 •

edited

Loading

vponomaryov commented Apr 22, 2024 •

edited

Loading

vponomaryov commented Apr 23, 2024 •

edited by mikliapko

Loading

mikliapko commented Apr 23, 2024 •

edited

Loading

mikliapko commented Apr 23, 2024 •

edited

Loading

vponomaryov commented Apr 24, 2024 •

edited

Loading

karol-kokoszka commented Apr 25, 2024 •

edited

Loading

mikliapko commented Apr 25, 2024 •

edited

Loading

karol-kokoszka commented Apr 29, 2024 •

edited

Loading