Improve readability of construct_internal #1552

sffc · 2022-01-27T21:05:17Z

Follow-up to #1540

sffc · 2022-01-27T21:05:57Z

I know readability is subjective, but I find this to match better with my mental model.

robertbastian · 2022-01-27T21:25:09Z

I don't like this. Currently all outgoing edges of a state are together; this puts them all over the place.

robertbastian · 2022-01-27T21:45:29Z

provider/core/src/resource.rs

+            state = match (state, content) {
+                (Start | Body0, Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')) => {
+                    Body0
                }
-                (AfterChar, Some(b'/')) => AfterSlash,
-                (AfterChar, _) => return Err(("[a-zA-z0-9=_/]", i)),
-
-                (AfterSlash, Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')) => {
-                    AfterCharAfterSlash
+                (Body0, Some(b'/')) => Slash,
+                (Slash | Body1, Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')) => {
+                    Body1
                }
-                (AfterSlash, _) => return Err(("[a-zA-Z0-9=_]", i)),
-
-                (
-                    AfterCharAfterSlash,
-                    Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'='),
-                ) => AfterCharAfterSlash,
-                (AfterCharAfterSlash, Some(b'/')) => AfterSlash,
-                (AfterCharAfterSlash, Some(b'@')) => AfterAt,
-                (AfterCharAfterSlash, _) => return Err(("[a-zA-z0-9=_/@]", i)),
-
-                (AfterAt, Some(b'0'..=b'9')) => AfterDigit,
-                (AfterAt, _) => return Err(("[0-9]", i)),
-
-                (AfterDigit, Some(b'0'..=b'9')) => AfterDigit,
-                (AfterDigit, Some(_)) => return Err(("[0-9]", i)),
-                (AfterDigit, None) => {
+                (Body1, Some(b'/')) => Slash,
+                (Body1, Some(b'@')) => AtSign,
+                (AtSign | Body2, Some(b'0'..=b'9')) => Body2,
+
+                // Success:
+                (Body2, None) => {
                    return Ok(Self {
                        path,
                        hash: ResourceKeyHash::compute_from_str(path),
                    })
                }
+
+                // Errors:
+                (Start | Slash, _) => return Err(("[a-zA-Z0-9=_]", i)),
+                (Body0, _) => return Err(("[a-zA-z0-9=_/]", i)),
+                (Body1, _) => return Err(("[a-zA-z0-9=_/@]", i)),
+                (AtSign | Body2, _) => return Err(("[0-9]", i)),


This would make it even more structured and nicer imho:

Suggested change

    state = match state {
        Start => match byte {
            Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=') => AfterChar,
            _ => return Err(("[a-zA-Z0-9=_]", i)),
        },
        AfterChar => match byte {
            Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=') => AfterChar,
            Some(b'/') => AfterSlash,
            _ => return Err(("[a-zA-z0-9=_/]", i)),
        },
        AfterSlash => match byte {
            Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=') => AfterCharAfterSlash,
            _ => return Err(("[a-zA-Z0-9=_]", i)),
        },
        AfterCharAfterSlash => match byte {
            Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=') => AfterCharAfterSlash,
            Some(b'/') => AfterSlash,
            Some(b'@') => AfterAt,
            _ => return Err(("[a-zA-z0-9=_/@]", i)),
        },
        AfterAt => match byte {
            Some(b'0'..=b'9') => AfterDigit,
            _ => return Err(("[0-9]", i)),
        },
        AfterDigit => match byte {
            Some(b'0'..=b'9') => AfterDigit,
            Some(_) => return Err(("[0-9]", i)),
            None => {
                return Ok(Self {
                    path,
                    hash: ResourceKeyHash::compute_from_str(path),
                })
            }
        },

The error states are much more obviously correct because they're the fallthrough from the lines directly above.

I find this more readable than what you did in #1540, but it still has the problem that the b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=' pattern is duplicated across several arms.
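
For illustration only, here is one way the nested-match layout could avoid repeating the character-class pattern: a minimal standalone sketch, assuming a hypothetical helper `is_segment_byte` used in match guards. Neither the helper nor `validate_nested` appears in the PR; the sketch returns `Ok(())` instead of building the key and its hash, and it spells the error strings with `A-Z` throughout.

```rust
/// State names follow the pre-#1552 enum shown in the diff.
#[derive(Clone, Copy)]
enum State {
    Start,
    AfterChar,
    AfterSlash,
    AfterCharAfterSlash,
    AfterAt,
    AfterDigit,
}

/// Hypothetical helper (not in the PR): the segment alphabet, written once.
const fn is_segment_byte(b: u8) -> bool {
    matches!(b, b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')
}

/// Validates `path`; on failure returns (expected character class, byte index).
fn validate_nested(path: &str) -> Result<(), (&'static str, usize)> {
    use State::*;
    let bytes = path.as_bytes();
    let mut state = Start;
    let mut i = 0;
    loop {
        // `None` marks end of input.
        let byte = bytes.get(i).copied();
        state = match state {
            Start => match byte {
                Some(b) if is_segment_byte(b) => AfterChar,
                _ => return Err(("[a-zA-Z0-9=_]", i)),
            },
            AfterChar => match byte {
                Some(b) if is_segment_byte(b) => AfterChar,
                Some(b'/') => AfterSlash,
                _ => return Err(("[a-zA-Z0-9=_/]", i)),
            },
            AfterSlash => match byte {
                Some(b) if is_segment_byte(b) => AfterCharAfterSlash,
                _ => return Err(("[a-zA-Z0-9=_]", i)),
            },
            AfterCharAfterSlash => match byte {
                Some(b) if is_segment_byte(b) => AfterCharAfterSlash,
                Some(b'/') => AfterSlash,
                Some(b'@') => AfterAt,
                _ => return Err(("[a-zA-Z0-9=_/@]", i)),
            },
            AfterAt => match byte {
                Some(b'0'..=b'9') => AfterDigit,
                _ => return Err(("[0-9]", i)),
            },
            AfterDigit => match byte {
                Some(b'0'..=b'9') => AfterDigit,
                Some(_) => return Err(("[0-9]", i)),
                None => return Ok(()),
            },
        };
        i += 1;
    }
}
```

Each state's fallthrough error still sits directly below its accepting arms, which is the property argued for above; whether a guard-based helper is acceptable in the real function is not something this thread settles.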

provider/core/src/resource.rs

sffc · 2022-01-27T21:50:33Z

This change keeps the enum and error values you introduced in #1540, but puts the flow of the state machine back to something closer to what it had been before. The flow was changed in the last commit to #1540, which I didn't have a chance to review before it got merged. I prefer the logic written out this way because:

It does not duplicate the patterns of accepted characters "all over the place"
It centers on destination states more than source states
It needs fewer lines of code, since there is less duplication
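
The points above can be tried out in isolation. The following is a standalone sketch of the flattened `(state, content)` match from the diff, where or-patterns such as `Start | Body0` group the source states that share a destination. The wrapper function `validate_flat`, its loop, and the `Ok(())` return value are assumptions made for the example (the real function builds the key and its hash), and the error strings are spelled with `A-Z` throughout.

```rust
/// State names as they appear in the diff for this PR.
#[derive(Clone, Copy)]
enum State {
    Start,
    Body0,
    Slash,
    Body1,
    AtSign,
    Body2,
}

/// Hypothetical wrapper: validates `path`, returning on failure the expected
/// character class and the byte index where validation stopped.
fn validate_flat(path: &str) -> Result<(), (&'static str, usize)> {
    use State::*;
    let bytes = path.as_bytes();
    let mut state = Start;
    let mut i = 0;
    loop {
        // `None` marks end of input.
        let content = bytes.get(i).copied();
        state = match (state, content) {
            // Transitions, grouped by destination state.
            (Start | Body0, Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')) => Body0,
            (Body0, Some(b'/')) => Slash,
            (Slash | Body1, Some(b'a'..=b'z' | b'A'..=b'Z' | b'0'..=b'9' | b'_' | b'=')) => Body1,
            (Body1, Some(b'/')) => Slash,
            (Body1, Some(b'@')) => AtSign,
            (AtSign | Body2, Some(b'0'..=b'9')) => Body2,

            // Success: end of input while reading version digits.
            (Body2, None) => return Ok(()),

            // Errors: anything not matched above, reported per source state.
            (Start | Slash, _) => return Err(("[a-zA-Z0-9=_]", i)),
            (Body0, _) => return Err(("[a-zA-Z0-9=_/]", i)),
            (Body1, _) => return Err(("[a-zA-Z0-9=_/@]", i)),
            (AtSign | Body2, _) => return Err(("[0-9]", i)),
        };
        i += 1;
    }
}
```

With this layout the segment alphabet appears twice rather than once per source state, at the cost of separating each state's error arm from its accepting arms.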

Co-authored-by: Robert Bastian <[email protected]>

Manishearth

yeah i like this too

Manishearth · 2022-01-27T23:18:03Z

provider/core/src/resource.rs

-            AfterCharAfterSlash,
-            AfterAt,
-            AfterDigit,
+            Body0,


suggestion (nb): while you're here, do you want to add some doc comments explaining each state qualitatively?
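
As a rough idea of what such doc comments might look like, here is a sketch using the state names from the diff. The descriptions are one reading of the transitions shown in this PR, not the wording that was eventually committed, and `is_accepting` is a hypothetical helper added only to make the example checkable.

```rust
/// States of the resource-key path validator (names taken from the diff).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum State {
    /// Nothing consumed yet; the first byte must be a segment character.
    Start,
    /// Inside the first segment, before any `/` has been seen.
    Body0,
    /// Just consumed a `/`; a segment character must follow.
    Slash,
    /// Inside a segment after at least one `/`; more segment characters,
    /// another `/`, or `@` may follow.
    Body1,
    /// Just consumed `@`; at least one version digit must follow.
    AtSign,
    /// Inside the version digits; only more digits or end of input may follow.
    Body2,
}

/// Hypothetical helper: end of input is only valid in `Body2`.
const fn is_accepting(state: State) -> bool {
    matches!(state, State::Body2)
}
```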

robertbastian

If you feel strongly about this go ahead

provider/core/src/resource.rs

robertbastian · 2022-02-01T14:05:15Z

I incorporated this in #1555

Improve readability of construct_internal

cf764eb

sffc requested a review from robertbastian January 27, 2022 21:05

sffc requested a review from Manishearth as a code owner January 27, 2022 21:05

sffc removed the request for review from Manishearth January 27, 2022 21:06

robertbastian reviewed Jan 27, 2022

Update provider/core/src/resource.rs

4ad9dad

Co-authored-by: Robert Bastian <[email protected]>

Manishearth approved these changes Jan 27, 2022

robertbastian approved these changes Jan 28, 2022

robertbastian closed this Feb 1, 2022

sffc deleted the construct_internal branch February 1, 2022 19:53
