Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset.__repr__ upgrade #431

Merged
merged 23 commits into from
Sep 4, 2024
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
23 commits
Select commit Hold shift + click to select a range
52f7fd6
Replace "pixel dims" for "array dims" in dataset repr, because that's
SolarDrew Aug 21, 2024
7bda76f
Add a little more info to dataset repr
SolarDrew Aug 21, 2024
9efe70e
Use pretty correlation matrix instead of plain one
SolarDrew Aug 21, 2024
9ddfce6
Tweak some output a little
SolarDrew Aug 23, 2024
d203862
Flip the ordering of world array indices to be correct
SolarDrew Aug 23, 2024
cfc8043
Improvements to pretty correlation matrix output
SolarDrew Aug 23, 2024
cfc0c5e
Include dataset ID
SolarDrew Aug 23, 2024
e6a8968
Add changelog
SolarDrew Aug 23, 2024
206ab93
Slight generalisation to make the tests pass and hopefully catch any
SolarDrew Aug 23, 2024
9b4ac7d
Slight tweaks
SolarDrew Aug 23, 2024
8d46818
Minor tweaks
SolarDrew Aug 29, 2024
43db5f5
Tweaks to make doc tests pass (mostly)
SolarDrew Aug 30, 2024
af5829d
Nope that still needed to be a remote test
SolarDrew Aug 30, 2024
f55dfcc
Calculate correct number of files for TiledDatasets
SolarDrew Aug 30, 2024
6960de8
Slicing TiledDatasets fives back a different kind of WCS
SolarDrew Aug 30, 2024
8bb0259
This needs to be REMOTE_DATA'd as well
SolarDrew Sep 2, 2024
d433c45
Correct/obfuscate sample data path in dataset repr test
SolarDrew Sep 2, 2024
caeb378
Don't need to run doctests on old releases
SolarDrew Sep 2, 2024
08edf43
Fine I'll just skip all of them if you're going to be like that, doctest
SolarDrew Sep 2, 2024
b18e98e
Update dkist/dataset/loader.py
Cadair Sep 3, 2024
b53eb3d
Merge branch 'main' into repr_upgrade
Cadair Sep 3, 2024
20037e9
Some reworking for tiled / not tiled englishing
Cadair Sep 3, 2024
cd0efaa
Tweak repr again and update loader docstring
SolarDrew Sep 4, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
49 changes: 25 additions & 24 deletions dkist/dataset/loader.py
Original file line number Diff line number Diff line change
Expand Up @@ -45,45 +45,46 @@ def load_dataset(target):
Examples
--------

>>> import dkist

>>> dkist.load_dataset("/path/to/VISP_L1_ABCDE.asdf") # doctest: +SKIP

>>> dkist.load_dataset("/path/to/ABCDE/") # doctest: +SKIP

>>> dkist.load_dataset(Path("/path/to/ABCDE")) # doctest: +SKIP

>>> from sunpy.net import Fido, attrs as a
>>> import dkist.net
>>> search_results = Fido.search(a.dkist.Dataset("AGLKO")) # doctest: +REMOTE_DATA
>>> files = Fido.fetch(search_results) # doctest: +REMOTE_DATA
>>> dkist.load_dataset(files) # doctest: +REMOTE_DATA
<dkist.dataset.dataset.Dataset object at ...>
This Dataset has 4 pixel and 5 world dimensions
>>> from dkist.data.sample import VISP_BKPLX
>>> print(dkist.load_dataset(VISP_BKPLX))
This VISP Dataset BKPLX has 4 pixel and 5 world dimensions and consists of 1700 frames
Files are stored in /home/drew/.local/share/dkist/VISP_BKPLX
SolarDrew marked this conversation as resolved.
Show resolved Hide resolved
<BLANKLINE>
dask.array<reshape, shape=(4, 1000, 976, 2555), dtype=float64, chunksize=(1, 1, 976, 2555), chunktype=numpy.ndarray>
The data are represented by a <class 'dask.array.core.Array'> object:
dask.array<reshape, shape=(4, 425, 980, 2554), dtype=float64, chunksize=(1, 1, 980, 2554), chunktype=numpy.ndarray>
<BLANKLINE>
Pixel Dim Axis Name Data size Bounds
Array Dim Axis Name Data size Bounds
0 polarization state 4 None
1 raster scan step number 1000 None
2 dispersion axis 976 None
3 spatial along slit 2555 None
1 raster scan step number 425 None
2 dispersion axis 980 None
3 spatial along slit 2554 None
<BLANKLINE>
World Dim Axis Name Physical Type Units
0 stokes phys.polarization.stokes unknown
1 time time s
4 stokes phys.polarization.stokes unknown
3 time time s
2 helioprojective latitude custom:pos.helioprojective.lat arcsec
3 wavelength em.wl nm
4 helioprojective longitude custom:pos.helioprojective.lon arcsec
1 wavelength em.wl nm
0 helioprojective longitude custom:pos.helioprojective.lon arcsec
<BLANKLINE>
Correlation between pixel and world axes:
<BLANKLINE>
Pixel Dim
World Dim 0 1 2 3
0 yes no no no
1 no yes no no
2 no yes no yes
3 no no yes no
4 no yes no yes

| PIXEL DIMENSIONS
| spatial | dispersion | raster scan | polarization
WORLD DIMENSIONS | along slit | axis | step number | state
------------------------- | ------------ | ------------ | ------------ | ------------
helioprojective longitude | x | | x |
wavelength | | x | |
helioprojective latitude | x | | x |
time | | | x |
stokes | | | | x
"""
known_types = _known_types_docs().keys()
raise TypeError(f"Input type {type(target).__name__} not recognised. It must be one of {', '.join(known_types)}.")
Expand Down
3 changes: 1 addition & 2 deletions dkist/dataset/utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,6 @@
instr = ds.inventory.get("instrumentName", "")
if instr:
instr += " "
nframes = len(ds.files)
dsID = ds.inventory.get("datasetId", "(no DatasetID)")

if is_tiled:
Expand All @@ -37,7 +36,7 @@
s += f"has {wcs.pixel_n_dim} pixel and {wcs.world_n_dim} world dimensions"

if ds.files:
s += f" and consists of {nframes} frames\n"
s += f" and consists of {len(ds.files)} frames\n"
SolarDrew marked this conversation as resolved.
Show resolved Hide resolved
s += f"Files are stored in {ds.files.basepath}\n\n"
s += f"The data are represented by a {type(ds.data)} object:\n{ds.data}\n\n"

Expand Down Expand Up @@ -119,7 +118,7 @@
maxlines = max([len(l) for l in wrapped])
for l in wrapped:
while len(l) < maxlines:
l.append("")

Check warning on line 121 in dkist/dataset/utils.py

View check run for this annotation

Codecov / codecov/patch

dkist/dataset/utils.py#L121

Added line #L121 was not covered by tests
header = np.vstack([[s.center(width) for s in wrapped[l]] for l, _ in enumerate(labels)]).T

mstr = np.insert(mstr, 0, header, axis=0)
Expand Down Expand Up @@ -154,7 +153,7 @@
----------
wcs : `BaseHighLevelWCS` or `BaseLowLevelWCS`
"""
print(_get_pp_matrix(wcs))

Check warning on line 156 in dkist/dataset/utils.py

View check run for this annotation

Codecov / codecov/patch

dkist/dataset/utils.py#L156

Added line #L156 was not covered by tests


def extract_pc_matrix(headers, naxes=None):
Expand Down
Loading