Revise Simpson Desert tests #896

abyrd · 2023-10-18T16:44:58Z

These tests were using Mercator gridded destinations that did not align with the street intersections or transit stops. This added a somewhat unpredictable walking delay to the end of each itinerary. This also caused the tests to fail when changing the representative points of grid pixels where travel times are measured. This PR or similar changes are necessary to make tests pass on PR #894: ideally this PR should be merged first before ones changing anything about Mercator grids or travel time sample points.

While I was working on this I took care of some other cleanup items for these tests. These are the changes:

Use single destination instead of Mercator grid.
Expand and add Javadoc comments.
Add better measure of distribution goodness-of-fit.
Apply percentile check and goodness-of-fit in every test.
Increase default Monte Carlo draws to get smoother histograms.
Add Javadoc caveats on clone() overrides.
Adapt and document test task builder accordingly.

This PR removes the use of web Mercator grids as destinations, concentrating testing on transit routing. To the extent that we also want the Simpson Desert tests to serve as integration tests of the whole routing system, we will eventually want some tests to use gridded destinations, test that grid cell / pixel sample points are properly situated and introducing the correct amount of walk time. But that should be done in a controlled way, not by inducing unpredictable extra walk times on the end of these more precise tests.

Use single destination instead of Mercator grid. Expand and add Javadoc comments. Add better measure of distribution goodness-of-fit. Apply percentile check and goodness-of-fit in every test. Increase default Monte Carlo draws to get smoother histograms.

Add Javadoc caveats on clone() overrides. Adapt and document test task builder accordingly.

the predicted distribution for the multi-frequency test needs improvement

also remove unused opportunity density grids

ansoncfit

Good to have the linking time removed from consideration.

Separate concern to consider outside this PR: I noticed some pre-existing code (at least in Distribution) has 120 hard-coded as the number of cutoff minutes. If our standard two-hour cutoff might ever be changed, we might want to use a consistent symbolic constant (see also maxTripDurationMinutes)

src/main/java/com/conveyal/r5/analyst/TravelTimeComputer.java

src/test/java/com/conveyal/r5/analyst/network/GridSinglePointTaskBuilder.java

src/test/java/com/conveyal/r5/analyst/network/SimpsonDesertTests.java

ansoncfit · 2023-10-20T03:20:38Z

src/test/java/com/conveyal/r5/analyst/network/SimpsonDesertTests.java

+        // 10 minutes wait, 10 minutes ride, giving 31 to 51 minutes.
+        // This estimation logic could be better codified as something like TravelTimeEstimate.waitWithHeadaway(20) etc.
+
+        // TODO For some reason observed is off by 1 minute, figure out why.


Is board slack being applied at the second (transfer) boarding?

I would not expect it to be applied, as the purpose of slack is to allow at least N minutes (currently set to 1) between arrival at the transit stop and departure of the vehicle, not to add 1 minute to every boarding time. In a case where the transfer target vehicle always departs 10 minutes after the passenger arrives at the stop, the fixed 10 minute wait exceeds the 1 minute slack, so I would expect the total wait to always be 10 minutes. However FastRaptorWorker#BOARD_SLACK_SECONDS seems to only be used in the multi-criteria RAPTOR code, and there's another constant FastRaptorWorker#MINIMUM_BOARD_WAIT_SEC which serves a very similar purpose and comments advising that these two be merged. Uses of this MINIMUM_BOARD_WAIT_SEC imply that it would just search for any departure more than one minute later (so finding one 10 minutes later rather than 11). I will keep digging into this though to see if there's anywhere we'd just be adding the slack.

I moved some constants and updated some comments to make things a little clearer. Board slack (actually MINIMUM_BOARD_WAIT_SEC) seems to work as I expected, establishing a minimum wait but not always introducing an additional wait. My best guess now for the source of the extra minute is some kind of edge effect due to the fact that we're binning times into one-minute bins, and something may be pushing the travel time over the edge into the next bin. I'll have to look at the travel time in seconds instead of minutes to confirm this.

Co-authored-by: Anson Stewart <[email protected]>

abyrd · 2023-10-27T07:56:45Z

While debugging a one-minute (board slack?) discrepancy, I noticed that OneOriginResult.traveltimes has 5252 values even though there's only one destination point. Need to investigate.

trevorgerhardt · 2023-11-02T06:48:21Z

Comments seem to indicate this PR is still being worked on. Is this PR still in progress or ready for a review? @abyrd

abyrd · 2023-11-02T11:24:21Z

Comments seem to indicate this PR is still being worked on. Is this PR still in progress or ready for a review? @abyrd

I am taking a look at the OneOriginResult.traveltimes with 5252 values because I think it's the only thing really blocking this change, and this PR is needed before we can switch to pixel centers. Hoping to have it cleared up shortly.

The two-minute board slack (?) thing is not a regression, just an unclear explanation of why exactly the travel times are what they are. I haven't yet figured out the origin of this discrepancy but I think it has to do with things being binned into 1-minute bins. I am comfortable leaving it as-is until we have a full explanation.

Includes assertions to validate assumptions and ensure non-testing behavior is unchanged. This could later be used to generally allow any kind of PointSet as the destinations in travel time tasks.

ansoncfit

Looks good

Addresses #907, follow up to #896. Regional tasks already allow freeform destinations. Removed special code path required to do this with single point tasks. Comments added to better explain the constraints and expectations around single point and regional tasks, and how destinations are validated. Assertions added pending better general validation on TravelTimeReducer. Geographic dimensions and/or number of target points should be checked.

abyrd added 3 commits October 18, 2023 17:41

Update Simpson Desert tests

304d68e

Use single destination instead of Mercator grid. Expand and add Javadoc comments. Add better measure of distribution goodness-of-fit. Apply percentile check and goodness-of-fit in every test. Increase default Monte Carlo draws to get smoother histograms.

address problematic clone usage

8164da4

Add Javadoc caveats on clone() overrides. Adapt and document test task builder accordingly.

relax test for distribuion fit

b4375b0

the predicted distribution for the multi-frequency test needs improvement

abyrd requested review from trevorgerhardt and ansoncfit October 18, 2023 16:45

fix test code path for single destination

9a1d114

also remove unused opportunity density grids

abyrd mentioned this pull request Oct 19, 2023

Use pixel centers as points in web Mercator grids #894

Merged

ansoncfit reviewed Oct 20, 2023

View reviewed changes

abyrd and others added 2 commits October 26, 2023 19:11

Apply suggestions from code review

e9abaa9

Co-authored-by: Anson Stewart <[email protected]>

move board slack constant, clarify comments

96816a4

abyrd force-pushed the revise-simpson-desert branch from 7062835 to 96816a4 Compare October 27, 2023 07:27

abyrd and others added 2 commits November 2, 2023 19:24

Merge branch 'dev' into revise-simpson-desert

6cd2c97

add code path for testing

3b3d4ff

Includes assertions to validate assumptions and ensure non-testing behavior is unchanged. This could later be used to generally allow any kind of PointSet as the destinations in travel time tasks.

abyrd enabled auto-merge November 2, 2023 11:58

ensure destinations are present on test request

c7f69ab

ansoncfit mentioned this pull request Nov 2, 2023

Extra minute of wait time? #906

Open

ansoncfit approved these changes Nov 2, 2023

View reviewed changes

abyrd merged commit 7970690 into dev Nov 2, 2023
3 checks passed

abyrd deleted the revise-simpson-desert branch November 2, 2023 13:19

abyrd mentioned this pull request Nov 3, 2023

Unsupported Operation on single point with decay function #907

Closed

abyrd mentioned this pull request Nov 3, 2023

Use regional tasks in Simpson Desert tests #908

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revise Simpson Desert tests #896

Revise Simpson Desert tests #896

abyrd commented Oct 18, 2023 •

edited

Loading

ansoncfit left a comment

ansoncfit Oct 20, 2023

abyrd Oct 26, 2023

abyrd Oct 27, 2023

abyrd commented Oct 27, 2023

trevorgerhardt commented Nov 2, 2023

abyrd commented Nov 2, 2023

ansoncfit left a comment

Revise Simpson Desert tests #896

Revise Simpson Desert tests #896

Conversation

abyrd commented Oct 18, 2023 • edited Loading

ansoncfit left a comment

Choose a reason for hiding this comment

ansoncfit Oct 20, 2023

Choose a reason for hiding this comment

abyrd Oct 26, 2023

Choose a reason for hiding this comment

abyrd Oct 27, 2023

Choose a reason for hiding this comment

abyrd commented Oct 27, 2023

trevorgerhardt commented Nov 2, 2023

abyrd commented Nov 2, 2023

ansoncfit left a comment

Choose a reason for hiding this comment

abyrd commented Oct 18, 2023 •

edited

Loading