libnvme: Fix the Reference Tag in copy command #455

Francis-Pravin · 2022-08-12T14:04:28Z

The elbt is declared as elbt[10] to store 80-bit refernce tag value. For now,
the Kernel supports 32-bit and 48-bit refernce tag only. Therefore the value is
received in the 64-bit variable. So elbt[2] to elbt[9] is used to store that 64-bit value.

Identify the type of copy format using args->format instead of structure size.
Also, use a single 64-bit ilbrt instead of using two ilbrt with different datatypes.

Signed-off-by: Francis Pravin Antony Michael Raj [email protected]
Signed-off-by: Jonathan Derrick [email protected]

codecov-commenter · 2022-08-12T14:55:25Z

Codecov Report

Merging #455 (1724f29) into master (0e4d1ce) will decrease coverage by 0.01%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master     #455      +/-   ##
==========================================
- Coverage   18.17%   18.16%   -0.02%     
==========================================
  Files          31       31              
  Lines        5211     5214       +3     
  Branches      998      999       +1     
==========================================
  Hits          947      947              
- Misses       3992     3995       +3     
  Partials      272      272

Impacted Files	Coverage Δ
src/nvme/ioctl.c	`0.00% <0.00%> (ø)`
src/nvme/util.c	`0.00% <0.00%> (ø)`

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

igaw

You need to find another way to address this. We can fix this properly when we do the next major version of the library until then we have to find ways around without breaking the API.

igaw · 2022-08-12T14:53:38Z

src/nvme/api-types.h

@@ -677,7 +675,7 @@ struct nvme_copy_args {
 	int fd;
 	__u32 timeout;
 	__u32 nsid;
-	__u32 ilbrt;
+	__u64 ilbrt;


This breaks the API

I have one more patch for nvme-cli to fix this issue. In that patch, this renamed ilbrt variable is handled. I hope the API will not break once it is merged.

As far I can tell, libnvme is not just used by nvme-cli. This is why I keep repeating we can't break the API for the 1.x series of the library. For 2.x sure, I don't have any problems to modify the API.

BTW, I would suggest to open an in issue with the exact work description and queue it up for the 2.x library. I fear if we don't write it down, this will be lost.

Hi @igaw
I missed to mention a point.
In struct nvme_copy_args, ilbrt and ilbrt_u64 are used to store the reference tag values of 32-bit and 64-bit respectively. The ilbrt_u64 is only assigned with cfg.ilbrt value in nvme-cli. Whereas ilbrt is not assigned with any value. But, both the variables are handled in nvme_copy() libnvme. It seems fishy for me. That's why I tried to use single ilbrt instead of two variable with different data type.

igaw · 2022-08-12T14:53:45Z

src/nvme/api-types.h

@@ -688,7 +686,6 @@ struct nvme_copy_args {
 	__u8 prinfow;
 	__u8 dtype;
 	__u8 format;
-	__u64 ilbrt_u64;


and this one too.

bjpaupor · 2022-08-12T16:36:20Z

src/nvme/util.c

 		copy[i].elbatm = cpu_to_le16(elbatms[i]);
 		copy[i].elbat = cpu_to_le16(elbats[i]);
+
+		for (j = 0; j < 8; j++)
+			copy[i].elbt[9 - j] = (eilbrts[i] >> (8 * j)) & 0xff;


Isn't this just a more complicated way of storing the eilbrt little-endian order in the last 8 bytes of elbt (as is removed above)?

I have verified the copy command with QEMU emulated NVMe device. On using previous logic, the controller received the eilbrt value as zero via format1. Hence, I stored the value byte by byte. Here observed the controller is receiving the appropriate value.

Which version of the nvme-cli and libnvme did you use? This might be caused by my brown bag release where I
crippled the nvme_init_copy_range_f1 function. The result was, calling nvme_init_copy_range_f1 was a no-op.

linux-nvme/nvme-cli@fd582b0

Always I use the latest version of nvme-cli and libnvme. The issue was observed even before the above mentioned changes.

That would have been too easy...

The elbt is declared as elbt[10] to store 80-bit refernce tag value. For now, the Kernel supports 32-bit and 48-bit refernce tag only. Therefore the value is received in the 64-bit variable. So elbt[2] to elbt[9] is used to store that 64-bit value. Identify the type of copy format using args->format instead of structure size. Also, use a single 64-bit ilbrt instead of using two ilbrt with different datatypes. Signed-off-by: Francis Pravin Antony Michael Raj <[email protected]> Signed-off-by: Jonathan Derrick <[email protected]>

igaw · 2022-08-17T06:50:07Z

If I understand the situation the main issue is that:

copy[i].elbt[2] = cpu_to_le64(eilbrts[i]);

is not working correctly. I suggest you just fix this using the existing API as we can't change it (only extend it).

jderrick · 2022-08-19T16:28:38Z

Hi @igaw ,

copy[i].elbt[2] = cpu_to_le64(eilbrts[i]);

Was the intent here to fill [2] - [9] ?
I have a question about this:

Shouldn't this start at [0]
Wouldn't this be non-little endian?

The same thing appears to be taking place in qemu:
https://elixir.bootlin.com/qemu/latest/source/hw/nvme/ctrl.c#L2658

igaw · 2022-08-22T08:55:31Z

I haven't really digged into the details about this feature until now. I think we have two things to address here.

First, is the handling of the API backwards compatibility, meaning how read the struct nvme_copy_args. The current version looks okay but it is awkwardly used. That means we have functional dependency on the version of struct. I think this is what you also tried to resolve and I said no.

        if (args->args_size == size_v1) {                                                                     
                cdw3 = 0;                                                                                     
                cdw14 = args->ilbrt;                                                                          
        } else {                                                                                              
                cdw3 = (args->ilbrt_u64 >> 32) & 0xffffffff;                                                  
                cdw14 = args->ilbrt_u64 & 0xffffffff;                                                         
        }

Second we have to handle various sizes of LBST/ELBST/ILBRT/EIBTR.

Command Set Specification
Figure 35: Copy – Source Range Entries Descriptor Format 1h

elbt:

This field specifies variable sized Expected Logical Block Storage Tag (ELBST) and
Expected Initial Logical Block Reference Tag (EILBRT) fields, which are defined in section
5.2.1.4.1, to be used for the read portion of the copy operation. If the namespace is not
formatted to use end-to-end protection information, then this field is ignored.

As I understand the current version of nvme_copy() is assuming a specific STS value. To address this I think we need to extend struct nvme_copy_args with a STS field like struct nvme_io_args and update nvme_copy() and nvme_init_copy_range_f1().

Comments?

bjpaupor · 2022-08-22T17:37:43Z

@igaw To the second point, there's a few problems with the current implementation and with the proposed fix here.

The current implementation only actually sets elbt[2] to eilbrt & 0xFF rather than elbt[2-9] to the full value. This could be fixed by using a 64-bit pointer or by setting it byte-wise like this proposed fix.
The proposed fix here nearly works as it does set the correct bytes, but the problem is that it's equivalent to properly setting elbt[2] = cpu_to_be64(elibrt), as a big-endian value.
Even adjusting the current implementation to use a 64-bit pointer gives a problem that it will start the value from elbt[2-X] rather than elbt[X-9].

Only way I see to properly set this value is by breaking the API to get the Storage Tag Size and using something like:
elbt << STS = cpu_to_le64(eilbrt)

Francis-Pravin · 2022-08-24T08:36:13Z

Thanks for the comments.
I totally agree it. To handle the various sizes of LBST/ELBST/ILBRT/EIBTR, STS and PIF values are required. Because, the size of the reference tag are based on those values. Hence We need to extend the struct nvme_copy_args like in struct nvme_io_args. I think it will breaks the API. As @igaw commented above we can't break the API for the 1.x series of the library.
Already an issue has been created in libnvme to fix this issue in 2.0 version. #459
Please add your suggestion on closing this PR.

igaw · 2022-08-25T06:39:27Z

I think there is a misunderstanding. I said, we can't break the API but this doesn't mean we cannot extend it. This is where the versioning of the struct args comes into play. If a struct arg is passed into a function we check the size of it and only access added members if it is possible, hence the size check. So in this case the STS member needs to be added after the last current member (plus pending if needed) and then we can use it in the function.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

libnvme: Fix the Reference Tag in copy command #455

libnvme: Fix the Reference Tag in copy command #455

Francis-Pravin commented Aug 12, 2022

codecov-commenter commented Aug 12, 2022

igaw left a comment

igaw Aug 12, 2022

Francis-Pravin Aug 12, 2022

igaw Aug 16, 2022 •

edited

Loading

Francis-Pravin Aug 26, 2022

igaw Aug 12, 2022

bjpaupor Aug 12, 2022

Francis-Pravin Aug 13, 2022

igaw Aug 16, 2022

Francis-Pravin Aug 16, 2022

igaw Aug 17, 2022

igaw commented Aug 17, 2022

jderrick commented Aug 19, 2022 •

edited

Loading

igaw commented Aug 22, 2022

bjpaupor commented Aug 22, 2022

Francis-Pravin commented Aug 24, 2022 •

edited

Loading

igaw commented Aug 25, 2022

bjpaupor commented Aug 25, 2022

igaw commented Aug 25, 2022

igaw commented Aug 25, 2022

Francis-Pravin commented Aug 26, 2022

igaw commented Aug 26, 2022 •

edited

Loading

igaw commented Jan 25, 2023

libnvme: Fix the Reference Tag in copy command #455

libnvme: Fix the Reference Tag in copy command #455

Conversation

Francis-Pravin commented Aug 12, 2022

codecov-commenter commented Aug 12, 2022

Codecov Report

igaw left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

igaw Aug 16, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

igaw commented Aug 17, 2022

jderrick commented Aug 19, 2022 • edited Loading

igaw commented Aug 22, 2022

bjpaupor commented Aug 22, 2022

Francis-Pravin commented Aug 24, 2022 • edited Loading

igaw commented Aug 25, 2022

bjpaupor commented Aug 25, 2022

igaw commented Aug 25, 2022

igaw commented Aug 25, 2022

Francis-Pravin commented Aug 26, 2022

igaw commented Aug 26, 2022 • edited Loading

igaw commented Jan 25, 2023

igaw Aug 16, 2022 •

edited

Loading

jderrick commented Aug 19, 2022 •

edited

Loading

Francis-Pravin commented Aug 24, 2022 •

edited

Loading

igaw commented Aug 26, 2022 •

edited

Loading