Improved UVData to MirParser interface #1282

kartographer · 2023-04-04T19:04:02Z

Updates the interface between UVData.read method and the MirParser module.

Description

Overhauls the read_mir method inside of Mir. This includes being able to sub-select desired data on read (rather than after-the-fact), with keywords matched to those typically used inside of UVData.read. Additionally, some updates to the MirParser and Mir modules have been made to improve speed/memory performance (by up to a factor of two). The UVData.read and UVData.select have also had the catalog_names keyword added, which allows users to select sources/phase centers by their names rather than their ID numbers (the latter of which are sometimes arbitrarily assigned).

Motivation and Context

This PR covers a final suite of changes that are needed for moving support for MIR data for general users of SMA -- in this case, primarily via improving the interface between the UVData and MIR-type files. Previously, in order to handle MIR files without loading the whole dataset first, one needed to pass forward specific indexing codes that were not necessarily known by lay-users of the telescope.

After this PR is closed, I'd like to request a new pyuvdata version be spun up.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)

Checklist:

I have read the contribution guide.
My code follows the code style of this project.

Bug fix checklist:

My fix includes a new test that breaks as a result of the bug (if possible).
All new and existing tests pass.
I have updated the CHANGELOG.

New feature checklist:

I have added or updated the docstrings associated with my feature using the numpy docstring format.
I have updated the tutorial to highlight my new feature (if appropriate).
I have added tests to cover my new feature.
All new and existing tests pass.
I have updated the CHANGELOG.

…g select capabilities phase on phase center name.

… metadata arrays.

…epoch).

… for MIR files, on in weights normalization for MIR

…file errors/missing records

… object

…efunct code

codecov · 2023-04-04T19:16:30Z

Codecov Report

Merging #1282 (1d715c5) into main (505b35c) will increase coverage by 0.00%.
The diff coverage is 100.00%.

Additional details and impacted files

@@           Coverage Diff            @@
##             main    #1282    +/-   ##
========================================
  Coverage   99.91%   99.91%            
========================================
  Files          33       33            
  Lines       19328    19459   +131     
========================================
+ Hits        19312    19443   +131     
  Misses         16       16

Impacted Files	Coverage Δ
pyuvdata/uvdata/uvfits.py	`100.00% <ø> (ø)`
pyuvdata/uvdata/uvh5.py	`100.00% <ø> (ø)`
pyuvdata/uvdata/mir.py	`100.00% <100.00%> (ø)`
pyuvdata/uvdata/mir_meta_data.py	`100.00% <100.00%> (ø)`
pyuvdata/uvdata/mir_parser.py	`100.00% <100.00%> (ø)`
pyuvdata/uvdata/ms.py	`99.89% <100.00%> (+<0.01%)`	⬆️
pyuvdata/uvdata/uvdata.py	`100.00% <100.00%> (ø)`

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 505b35c...1d715c5. Read the comment docs.

e-koch

@kartographer -- Found just a few minor things!

e-koch · 2023-04-05T01:10:31Z

pyuvdata/uvdata/mir.py

+                self._add_phase_center(
+                    mir_data.codes_data["source"][sou_id],
+                    cat_type="sidereal",
+                    cat_lon=np.median(icrs_ra),


For future proofing, I wonder about setting a check that np.ptp of the ra, dec varies less than some reasonable amount.

I think this would only matter for concatenated tracks (in which case, is that already handled when a phase center has the same name but different coordinates?).

Fair point! I've added a check immediately after the code highlighted here, which raises a warning if a change of > 1 arcmin is seen (> 1 primary beam for SMA, which is pretty uncommon for standard sources).

pyuvdata/uvdata/mir_meta_data.py

e-koch · 2023-04-05T01:16:03Z

pyuvdata/uvdata/mir_meta_data.py

@@ -1144,6 +1154,8 @@ def get_value(
        idx_arr = self._index_query(use_mask, where, and_where_args, header_key, index)

        if isinstance(field_name, (list, set, tuple)):
+            if return_tuples is None:


Why is the default changed to None? Can the kwarg in the function call just be True?

So I'm using None here as a stand-in for what the typical usage is inside of MirParser, which is that if multiple fields have been requested, to return tuples (as they're often used for indexing via hashmap), but in cases where single field is requested, just to return the array of values. This is more of a convenience thing inside of MirParser rather than anything else -- it just helped to remove about two dozen extra keyword calls inside the code.

pyuvdata/uvdata/mir_parser.py

pyuvdata/uvdata/tests/test_mir.py

…ior.

kartographer · 2023-04-05T14:26:35Z

Thanks @e-koch! I think I've responded to everything you've commented on, but let me know if you see something that needs further attention.

bhazelton

Looks good to me, just a couple questions

pyuvdata/uvdata/uvdata.py

bhazelton

This looks good to me. Thanks for all the hard work @kartographer!

kartographer added 26 commits April 4, 2023 10:29

First pass at modifying mir read arguments to use new features, addin…

bad17f5

…g select capabilities phase on phase center name.

Fixing error in ref frame for spectral windows in MS

06c8740

Adding non-J2000 coord handling

fb75358

Adding helper attribute to differentiate length of masked vs unmasked…

f34d382

… metadata arrays.

Minor change of handling for non-J2000 coords

a63693d

Fixing one more minor bug in coord conversion call for non-J2000

2e908dc

One more workaround fix for non-J2000

5a78d7a

Fixing small bug in handling of ICRS frame coords (where there is no …

99ee53c

…epoch).

More minor cleanup in handling of MS frame/epoch.

f9aeca3

Cleaning up post-rebase

4ae2974

Couple of bug fixes, one that rendered select-on-read operations moot…

6d294cd

… for MIR files, on in weights normalization for MIR

Adding test coverage

5417143

Attempting bug fix to test

973671e

Adding some debugging messages to figure out why test breaks

ac21489

Fixing fixture that's causing broken test

31e3cf9

Adding new functionality to MirMetaData to make in less sensitive to …

876ed9f

…file errors/missing records

Moving MIR to future shapes

9c88e91

Reorganizing MIR read to lower memory footprint when filling a UVData…

b154ce7

… object

Minor bug fixes and optimizations

a677827

Minor tweak to naming convention for ants in MIR

f1abe93

More minor clean-up following memory optimizing, getting rid of now-d…

b8aa93d

…efunct code

Cleaning up docstrings, migrating MIR to future array shapes

d663327

More docstring clean-up

41b5b22

One more docstring fix!

64346c2

Never-ending docstring fixes

ec4a12e

Updating CHANGELOG

2297bd0

kartographer added the SMA Issues related to handling of SMA data label Apr 4, 2023

kartographer requested a review from e-koch April 4, 2023 19:05

Adding missing bit of test coverage

8ea11e3

One final test for one final uncovered line

b37d5a7

e-koch requested changes Apr 5, 2023

View reviewed changes

kartographer added 2 commits April 5, 2023 10:02

Making updates based on review comments

d0dbc71

Improving docstring in MirMetaData.get_value to clarify default behav…

d563d6c

…ior.

e-koch previously approved these changes Apr 5, 2023

View reviewed changes

Restoring full test coverage to mir_parser after tweak.

4516968

kartographer dismissed e-koch’s stale review via 4516968 April 5, 2023 16:27

e-koch previously approved these changes Apr 5, 2023

View reviewed changes

bhazelton reviewed Apr 5, 2023

View reviewed changes

pyuvdata/uvdata/uvdata.py Show resolved Hide resolved

pyuvdata/uvdata/uvdata.py Show resolved Hide resolved

Adding catalog_names keyword to read_uvh5 and read_uvfits

1d715c5

kartographer dismissed e-koch’s stale review via 1d715c5 April 6, 2023 02:45

bhazelton approved these changes Apr 10, 2023

View reviewed changes

bhazelton merged commit 8fdad84 into main Apr 10, 2023

bhazelton deleted the improved_read_mir branch April 10, 2023 22:38

bhazelton mentioned this pull request Apr 10, 2023

Update the changelog for a new version (v2.3.2) #1286

Merged

10 tasks

kartographer mentioned this pull request Feb 10, 2024

SMA data tutorial update #1342

Closed

kartographer mentioned this pull request Apr 30, 2024

Add options to select to select only particular phase centers #1129

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved UVData to MirParser interface #1282

Improved UVData to MirParser interface #1282

kartographer commented Apr 4, 2023

codecov bot commented Apr 4, 2023 •

edited

Loading

e-koch left a comment

e-koch Apr 5, 2023

kartographer Apr 5, 2023

e-koch Apr 5, 2023

kartographer Apr 5, 2023

kartographer commented Apr 5, 2023

bhazelton left a comment

bhazelton left a comment

Improved UVData to MirParser interface #1282

Improved UVData to MirParser interface #1282

Conversation

kartographer commented Apr 4, 2023

Description

Motivation and Context

Types of changes

Checklist:

codecov bot commented Apr 4, 2023 • edited Loading

Codecov Report

e-koch left a comment

Choose a reason for hiding this comment

e-koch Apr 5, 2023

Choose a reason for hiding this comment

kartographer Apr 5, 2023

Choose a reason for hiding this comment

e-koch Apr 5, 2023

Choose a reason for hiding this comment

kartographer Apr 5, 2023

Choose a reason for hiding this comment

kartographer commented Apr 5, 2023

bhazelton left a comment

Choose a reason for hiding this comment

bhazelton left a comment

Choose a reason for hiding this comment

codecov bot commented Apr 4, 2023 •

edited

Loading