Implement `nom::Offset` on `BitPtr`

Feature request by @fasterthanlime

Implementation notes:

- pointer-to-pointer arithmetic is, according to C++ and thus likely LLVM,
  defined only between two pointers known to be in the same allocation region.
  ([article 1]) I do not know at this time what Miri thinks of pointer-to-pointer
  arithmetic, but I strongly suspect, given Miri’s implementation of pointers in
  its evaluator model ([article 2]) that this is suspect at best and a candidate
  for rejection at worst.

  Given that the rest of the crate is also UB in Miri that the compiler merely
  happens to not yet reject, I am not too concerned about adding yet more
  pointer operations that cause errors in Miri.

  See @ralfjung’s blog article ([article 3]) for more about how Miri (and by
  extension, eventually, rustc) treats pointer manipulation.

- `BitPtr`-to-`BitPtr` arithmetic is required by the constraints above to only
  be meaningful within the same allocation region, and thus, with the same type.
  This means that the function signature is not required to generalize as

  ```rust
  fn BitPtr<T: BitStore>::ptr_diff<U: BitStore>(self, other: BitPtr<U)>) -> _;
  ```

  but may be kept specific as just

  ```rust
  fn BitPtr<T: BitStore>::ptr_diff(self, other: Self) -> _;
  ```

  We cannot enforce this in the type system, but this is an `unsafe fn` with the
  precondition that `other` and `self` be derived from the same overarching
  `BitSlice<_, T>` region. It is ***firmly*** undefined behavior, even by the
  UB-adjacent standards of this crate, to call `ptr_diff` with pointers from two
  different regions.

- Prior art in `fn store::BitStore::offset` indicates that the return type
  should be the anonymous record `{ elts: isize, bits: i8 }` (canonicalized as
  the tuple `(isize, i8)`). That function produces `(isize, BitIdx)` because it
  computes a jump value for `ptr::offset` and an absolute bit index in the
  element to which the jump value refers. This function produces a jump value,
  and a bit distance between the start pointer `self` and the end pointer
  `other`.

- `BitPtr<T: BitStore>` is notably missing a `C: Cursor` type parameter. It is
  defined behavior for two `BitSlice<C: Cursor, T: BitStore>` handles drawn from
  the same region to have different `C` type parameters. How should this be
  handled?

  - `BitPtr::ptr_diff` can only compute `BitIdx` differences. This will
    necessarily produce a bit distance that describes the *index* difference
    rather than the *electrical* difference, but, this is unavoidable with the
    information available.

  - ```rust
    fn BitSlice<C: Cursor, T>::offset<D: Cursor>(&self, other: &BitSlice<D, T>) -> _;
    ```

    could convert both `self` and `other`’s `BitIdx` values to `BitPos` using
    their provided `Cursor` implementations, then construct `BitPtr<T>` pointers
    from the `BitPos` values and call `ptr_diff` on them. This would compute the
    electrical bit distance between the two `BitSlice` handles.

Nom integration notes

- `bitvec` currently relies on behavior that is compiled as expected in current
  `rustc` but is marked as UB in Miri, and will likely eventually be rejected by
  the compiler. For this reason, I am hesitant to encourage the `nom` crate to
  begin depending on `bitvec` for its bit-stream parsing.

  However, it is advantageous to have `BitSlice` integrate with `nom`’s traits,
  so that clients who wish to use `bitvec` for bitstream `nom` parsing are able
  to drop `BitSlice` into any trait-driven `nom` functions.

- Once `bitvec` depends on `nom`, it becomes illegal for `nom` to also depend on
  `bitvec`. I don’t know the dependency rules offhand, but I am hoping that if
  `bitvec` depends on `nom` optionally, the default feature set of `bitvec` is
  dependable by `nom` without creating a dependency cycle.

  If `nom` elects to depend on `bitvec`, `bitvec`’s nom integration will be
  removed and placed in `nom` instead. This may constitute a major-breaking
  change if I release a `1.0`.

[article 1]: https://timsong-cpp.github.io/cppwp/n4140/expr.add#6
[article 2]: https://github.com/rust-lang/rust/blob/fefe81605d6111faa8dbb3635ab2c51d59de740a/src/librustc/mir/interpret/mod.rs#L121-L124
[article 3]: https://www.ralfj.de/blog/2018/07/24/pointers-and-bytes.html


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement `nom::Offset` on `BitPtr` #21

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Implement nom::Offset on BitPtr #21

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Implement `nom::Offset` on `BitPtr` #21