Fix discrepancies with legacy portal in station / history counts #83

rod-glover · 2021-12-23T00:31:35Z

Resolves #81
Resolves #43

This PR depends on pacificclimate/station-data-portal-backend#25, which adds reliable min_obs_time and max_obs_time attributes to station history responses. This enables us to compare times on the same basis as the legacy portal.

This PR includes the following changes:

Modify date filtering to match PDP.
Fix a bug in observation frequency filtering.
Place markers for every unique location in a station's history (at present, there are never more than 3), and link them with a polygon.
Add a tooltip that shows a station's name(s) when hovering over marker or polygon.
Update station popup to show unique locations, time periods, observation frequencies, and variables defined in all histories of a station.
Update Station Metadata tab to show the above. Add sorting on selected columns.
Try to improve performance. This includes adding a timing module that works like a Python decorator -- it wraps a function and times it. Too bad we don't have the equivalent of the Python with statement!

Notes:

I optimized the station filtering function and reduced the time spent in them by about 50%. This doesn't make much difference because the rendering time dominates, not the filtering time.
Part of sluggishness in rendering the map, meaning the station markers, is that the popups and tooltips are created in advance, for all markers. If they are commented out, re-renders are about twice as fast. A possible solution is to create these only on hover and click events. Each one seems to contribute roughly the same amount of overhead, though this will be worth checking more thoroughly.
I can't believe I'm writing this on Boxing Day.

jameshiebert

Just a few minor questions and comments. Looking great!

jameshiebert · 2021-12-23T18:11:58Z

src/components/info/StationMetadata/StationMetadata.js

 const formatDate = d => d ? d.toISOString().substr(0,10) : 'unknown';

+const lexCompare = (a, b) => {


This is for sorting in the metadata table?

Yes, it is for sorting lists of things, e.g., lists of names, obs freqs. These arise because a station can have many histories, hence many names, obs freqs, etc. (These lists are reduced to their unique elements.) lexCompare implements standard lexicographic ordering for lists.

I plan to document all this in the code / README before I merge.

jameshiebert · 2021-12-23T18:13:28Z

src/components/info/StationMetadata/StationMetadata.js

+    {
+      minWidth: 80,
+      maxWidth: 100,
+      id: 'Uniq Obs Freqs',


What exactly does the "Uniq" in this label refer to?

Stations can have multiple histories, hence multiple freqs; these are reduced to their unique elements (see also comment above).

src/utils/portals-common/portals-common.js

jameshiebert · 2021-12-23T18:34:02Z

src/utils/portals-common/portals-common.js

+  //  If we don't do this, and we use strict date matching, then a station with
+  //  several histories fully covering an interval will not be selected, even
+  //  though it should be. The question of what "adjacent" means is a bit tricky
+  //  ... would depend in part on history.freq to distinguish too-large gaps,


We don't care about gaps. The users of the data necessarily has to assume that there will be gaps... that's just the nature of weather data collection.

That's interesting. First, let's eliminate what I'm calling "strict" date matching, which is the condition
min_obs_time < uiStartDate < uiEndDate < max_obs_time, plus allowance for nils
meaning complete containment in the observation interval. This is currently not used.

Currently, in this app, legacy PDP matching is used, which is the looser condition
min_obs_time < uiEndDate && uiStartDate < max_obs_time, plus allowance for nils
meaning overlap with the observation interval

Currently, for both legacy PDP and this app, date matching is done separately for each history in a station. If this isn't right, it can be adjusted in this app.

What I meant about gaps is this: Consider a station with multiple, say 2, histories spanning dates A to B (hx1) and C to D (hx2).

Suppose the user specifies start and end dates s, e such that A < s < B < C < e < D. The legacy matching rule will operate as follows:

hx1: A < e && s < B === true: match

hx2 C < e && s < D === true: match

That's as desired.

But when we have, say, A < B < s < e < C < D then:

hx1: A < e && s < B === false: no match

hx2 C < e && s < D === false: no match

When start or end date fall into the gap between the histories' intervals, the matching fails. Even one of them in the gap means that one of the two histories will not match.

If we decide that the two histories really form one contiguous period, then this matching rule is incorrect. It will bite us if there are large gaps (C - B) into which a start or end date could easily fall. In that case, we'd want a test that used only A to D as the interval, and matched both histories on that basis. That's not hard, but it's different than we have now.

Also, it gets more complicated if we decide that a "large" gap (C - B) means something different than a small gap.

Also, maybe there's a complication with multiple histories that overlap: A < C < B < D or the like.

Questions:

Does the gap problem matter?

Legacy PDP's effective definition of "station" is "history", and effectively ignores meta_station records. SDP treats history records linked by a common station as related.

One further note: Because they are drawn from station_obs_stats_mv, which is updated directly from observations, min_obs_time and max_obs_time are never null (oops: except if there are no observations), and therefore the checking for nulls in the matching rules can (maybe) be simplified accordingly. This is true both for legacy PDP and for SDP.

But they are not like edate, in which null means "ongoing". Ongoing or not, these values are non-null except when there are no observations at all for that history.

I've moved this discussion to a separate issue.

rod-glover · 2021-12-24T18:17:54Z

I still have some multiple history-per-station updates to push. Just a couple of oversights.

I'm also looking at why the app is so sluggish. My timing results suggest that it is rendering, not internal computation (e.g., filtering stations) that is so slow. Got a couple of ideas of how to improve it.

rod-glover · 2022-01-04T23:21:56Z

The sluggishness is definitively in rendering the station markers. If StationMarkers (plural) returns null, the delay in rendering the map, and in the update of other items, e.g., the selected station count, is reduced to under 1 s.

If StationMarker (singular) returns null, the delay is about 2 s. This shows that generating and processing individual markers, even if they are trivial, is surprisingly time consuming.

If StationMarker returns only a CircleMarker, without the tooltip or popup, the delay is about 3 s.

With full marker content (tooltip + popup), the delay is about 6 s.

rod-glover · 2022-01-04T23:44:30Z

Further investigation shows that StationMap is rendering more often than expected. Specifically, when a polygon is drawn on the map, it renders immediately, then renders again after a delay of 2 s, then again after 2 s. When the polygon is deleted, it renders after 2 s, then again after another 2 s. I'm going to take this problem over to another issue, since it is not directly related to the core issue of this PR.

rod-glover · 2022-01-05T21:54:47Z

Demo

rod-glover · 2022-01-05T22:06:45Z

@jameshiebert

I addressed your questions above.
I tested the downloads and both the URLs and the results look good.
Any further comments or questions?

jameshiebert · 2022-01-05T23:12:57Z

If you think that it's in good shape, I trust you. A cursory review of the range filtering shows it to be working like I expect it to.

This is probably a separate issue, but I noticed that the delete feature on the map doesn't work? One can hit the garbage can, get a cursor that says "Click on a feature to remove." But clicking has no effect. "Clear All" works OK.

rod-glover · 2022-01-05T23:19:32Z

I'll put in an issue for the delete problem. Not sure why -- it works in CE, but not here. However, we are using different versions of React Leaflet and Leaflet in this app.

rod-glover added 20 commits December 21, 2021 15:17

Modify date filtering

536b98a

Fix freq filtering

ad9a349

Add logging to station-data-service

9a2f805

Add station counts to Station Filters tab

f2b4c20

Improve station metadata display

c8ba910

Improve station popup contents

c0da0aa

Add tooltip to station markers

ceb479d

Factor out station-info functions

6d359d5

Improve station popup contents

29c7388

Draw a station marker for every history WIP

0487bcc

Extract more into station-info utils

fb3fc98

Refactor and improve StationTooltip

4d286a4

Refactor and improve StationPopup

767dc2c

Refactor StationMarkers; add multi-location polygon

c8b8d13

Refactor StationMarkers; add multi-location polygon

902b9f0

Add local backend to .env

99879e7

Make station tooltip sticky

a3066e8

Curry some station-info fns

3bdfee6

Fix sorting on StationMetadata table

dc00504

Fix layout of StationPopup

4ee1204

jameshiebert reviewed Dec 23, 2021

View reviewed changes

rod-glover added 2 commits December 23, 2021 11:38

Add unionAll to utils/fp

7068d51

Refactor station filtering; handle histories everywhere

fc0ea55

rod-glover added 6 commits December 24, 2021 14:54

Add timing module

28a0776

Apply timing to filtering fns; optimize some

3ab6890

Apply timing to StationMap (not helpful)

d83ca51

Comment out debug code

d4b886e

Factor out StationData component

2a252f9

Make marker keys unique

cf18cb1

rod-glover added 5 commits January 4, 2022 13:50

Don't hide unselected stations in map

9ef1bf9

Tweak station marker code

8cc447c

Adjust VersionA portal for changes above

3ba06a5

Add timing to ObservationCounts

781a98f

Improve timing util

2701c30

rod-glover added 3 commits January 4, 2022 17:10

Fix map display of filtered vs selected stations

1184f53

Fix map display of filtered vs selected stations

0acbc97

Fix data download urls

c36ee1e

rod-glover mentioned this pull request Jan 5, 2022

Gap problem for date matching? #85

Open

rod-glover added 8 commits January 5, 2022 09:30

Update component template to functional style

76299b0

Add component SelectionCounts

7f63908

Add component SelectionCriteria

3484475

Use components SelectionCounts, SelectionCriteria

db1a860

Code cleanup: StationMetadata

a1b6252

Code cleanup: StationMap

06fe051

Code cleanup: Separate utilities

e4ed68f

Doc tweak

579adc9

rod-glover force-pushed the i81-station-history branch from 1643ace to 579adc9 Compare January 5, 2022 21:42

rod-glover merged commit 3249365 into master Jan 5, 2022

rod-glover deleted the i81-station-history branch January 5, 2022 23:33

rod-glover mentioned this pull request Jan 21, 2022

Show stations outside polygon on map #43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix discrepancies with legacy portal in station / history counts #83

Fix discrepancies with legacy portal in station / history counts #83

rod-glover commented Dec 23, 2021 •

edited

Loading

jameshiebert left a comment

jameshiebert Dec 23, 2021

rod-glover Dec 24, 2021 •

edited

Loading

jameshiebert Dec 23, 2021

rod-glover Dec 24, 2021

jameshiebert Dec 23, 2021

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Jan 5, 2022

rod-glover commented Dec 24, 2021

rod-glover commented Jan 4, 2022 •

edited

Loading

rod-glover commented Jan 4, 2022

rod-glover commented Jan 5, 2022

rod-glover commented Jan 5, 2022

jameshiebert commented Jan 5, 2022

rod-glover commented Jan 5, 2022

		const formatDate = d => d ? d.toISOString().substr(0,10) : 'unknown';

		const lexCompare = (a, b) => {

Fix discrepancies with legacy portal in station / history counts #83

Fix discrepancies with legacy portal in station / history counts #83

Conversation

rod-glover commented Dec 23, 2021 • edited Loading

jameshiebert left a comment

Choose a reason for hiding this comment

jameshiebert Dec 23, 2021

Choose a reason for hiding this comment

rod-glover Dec 24, 2021 • edited Loading

Choose a reason for hiding this comment

jameshiebert Dec 23, 2021

Choose a reason for hiding this comment

rod-glover Dec 24, 2021

Choose a reason for hiding this comment

jameshiebert Dec 23, 2021

Choose a reason for hiding this comment

rod-glover Dec 24, 2021 • edited Loading

Choose a reason for hiding this comment

rod-glover Dec 24, 2021 • edited Loading

Choose a reason for hiding this comment

rod-glover Dec 24, 2021 • edited Loading

Choose a reason for hiding this comment

rod-glover Jan 5, 2022

Choose a reason for hiding this comment

rod-glover commented Dec 24, 2021

rod-glover commented Jan 4, 2022 • edited Loading

rod-glover commented Jan 4, 2022

rod-glover commented Jan 5, 2022

rod-glover commented Jan 5, 2022

jameshiebert commented Jan 5, 2022

rod-glover commented Jan 5, 2022

rod-glover commented Dec 23, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover Dec 24, 2021 •

edited

Loading

rod-glover commented Jan 4, 2022 •

edited

Loading