serialise: add deserialiseFullyOrFail #251

infinity0 · 2020-11-24T19:23:12Z

This is useful in protocol-related logic where you want to be strict as possible and don't allow an attacker to send junk after a valid piece of data.

dcoutts

What about being more radical and changing the behaviour of the existing deserialise functions? That is, perhaps we should just make this the default behaviour. Any reason not to other than possible worries about backwards compat?

dcoutts · 2021-02-23T01:03:03Z

serialise/src/Codec/Serialise.hs

+-- or get back a @'Either' 'DeserialiseFailure' 'ByteString'@.
+--
+-- @since 0.2.4.0
+deserialiseFullyOrFail :: Serialise a => BS.ByteString -> Either (Either CBOR.Read.DeserialiseFailure BS.ByteString) a


Do you think we really need the trailing data as a result? Why not just include it into the failure?

If someone really needs to decode a prefix, there are lower level interfaces they can use.

Including the trailing data is nice as then you can generate a custom error message with some extra details such as the length of the junk or a prefix of it.

As you pointed out, I just realised that Codec.CBOR.Read.deserialiseFromBytes decode will have nearly the same signature. However using this requires adding a direct dependency on cborg which may be surprising/inconvenient for users. The API in serialise otherwise provides no access to the leftover input, so it's nice to have this additional method. For example, the corresponding API in binary has it (decodeOrFail) although it is slightly less clean.

So, I could change the type signature (and definition) to be the same as Codec.CBOR.Read.deserialiseFromBytes decode if you prefer?

I deleted a previous comment agreeing with the "more radical" option, since as per what I said above, I now think it's good to have an API in serialise that exposes this information - and I suppose changing the existing type signature on deserialiseOrFail to express this, would not be acceptable as this would break compilation. Another option is to change DeserialiseFailure to include a variant for junk leftovers, which would allow us to preserve the type signature on deserialiseOrFail whilst changing its behaviour, but this would break people that are pattern-matching on DeserialiseFailure (with -Werror=incomplete-patterns or similar)

infinity0 · 2021-04-21T20:15:42Z

ping again on this :)

In case it wasn't clear, my last comment boils down to modifying this PR to instead have

tryDeserialise :: Serialise a => ByteString -> Either DeserialiseFailure (ByteString, a)
tryDeserialise = deserialiseFromBytes decode

the main purpose being to allow the user to avoid a direct dependency on cborg whilst also having full control over the decode process -- analogous to Data.Binary.decodeOrFail

dcoutts reviewed Feb 23, 2021

View reviewed changes

infinity0 force-pushed the deserialiseFullyOrFail branch 2 times, most recently from f20931b to 623ac1e Compare April 21, 2021 20:25

serialise: add tryDeserialise for a feature-complete API

306136f

infinity0 force-pushed the deserialiseFullyOrFail branch from 623ac1e to 306136f Compare April 21, 2021 20:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

serialise: add deserialiseFullyOrFail #251

serialise: add deserialiseFullyOrFail #251

infinity0 commented Nov 24, 2020

dcoutts left a comment

dcoutts Feb 23, 2021

infinity0 Feb 23, 2021 •

edited

Loading

infinity0 commented Apr 21, 2021 •

edited

Loading

serialise: add deserialiseFullyOrFail #251

Are you sure you want to change the base?

serialise: add deserialiseFullyOrFail #251

Conversation

infinity0 commented Nov 24, 2020

dcoutts left a comment

Choose a reason for hiding this comment

dcoutts Feb 23, 2021

Choose a reason for hiding this comment

infinity0 Feb 23, 2021 • edited Loading

Choose a reason for hiding this comment

infinity0 commented Apr 21, 2021 • edited Loading

infinity0 Feb 23, 2021 •

edited

Loading

infinity0 commented Apr 21, 2021 •

edited

Loading