Get code up to date #37
base: main
Conversation
…g buffer size. Added EPIC_multi_gpu.py to have two copies of fft_grid run on two GPUs simultaneously
…ing. It includes Matt's numpy speedups, fits file writing and Hari's optimizations
Overall this PR looks pretty straightforward. Just a few cleanup tasks, and a couple quick discussions to have.
#try:
# bf_romein.execute(udata, gdata)
#except NameError:
# bf_romein = Romein()
# bf_romein.init(self.locs, gphases, self.grid_size, polmajor=False)
# bf_romein.execute(udata, gdata)
Remove blocks of commented code.
#bifrost.map(
# "a(i,j,k,l) += (b(i,j,k,l/2) * b(i,j,k,l%2).conj())",
# {"a": autocorrs, "b": udata, "t": self.ntime_gulp},
# axis_names=("i", "j", "k", "l"),
# shape=(self.ntime_gulp, nchan, nstand, npol ** 2),
#)
Remove blocks of commented code.
#bifrost.map(
# "a(i,j,p,k,l) += b(0,i,j,p/2,k,l)*b(0,i,j,p%2,k,l).conj()",
# {"a": crosspol, "b": gdata},
# axis_names=("i", "j", "p", "k", "l"),
# shape=(self.ntime_gulp, nchan, npol ** 2, self.grid_size, self.grid_size),
#)
Remove blocks of commented code.
#try:
# bf_romein_autocorr.execute(autocorrs_av, autocorr_g)
#except NameError:
# bf_romein_autocorr = Romein()
# bf_romein_autocorr.init(
# autocorr_lo, autocorr_il, self.grid_size, polmajor=False
# )
# bf_romein_autocorr.execute(autocorrs_av, autocorr_g)
Remove blocks of commented code.
"--duration", | ||
type=int, | ||
default=3600, | ||
help="Duration of EPIC (seconds)", |
help="Duration of EPIC (seconds)", | |
help="Total duration of EPIC observation (seconds)", |
- if args.removeautocorrs:
-     raise NotImplementedError(
-         "Removing autocorrelations is not yet properly implemented."
-     )
+ #if args.removeautocorrs:
+ #    raise NotImplementedError(
+ #        "Removing autocorrelations is not yet properly implemented."
+ #    )
There have been some changes to the autocorr code, but has it been tested? As far as I know we don't run with this option, so I'm not sure this NotImplementedError should be removed yet.
I'm not even sure if the auto-correlation removal ever worked.
self.iring.resize(igulp_size, buffer_factor= 8)
self.oring.resize(ogulp_size, buffer_factor= 128) # , obuf_size)
I think @jaycedowell had some concerns about this. IIRC, this looks like we're making large buffers to account for a slow startup, when really we should be aiming to fix the startup issue instead. I don't have a solution, just flagging this for discussion.
Yeah, that buffer_factor of 128 is really large. I had done some simple benchmarking and it looked like the problem was with the GenerateLocations call in MOFFCorrelatorOp.main being slow. I don't know if that was all of it, but it is a factor.
For solutions we could think about something like compute-and-cache. The first time a particular frequency range is encountered we take the hit on computing and cache the results to disk. On subsequent encounters with that frequency setup we use what is on disk.
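As a rough sketch of that pattern (the generate_locations callable and its (freqs, grid_size) arguments are placeholders here, not the actual GenerateLocations signature):

import hashlib
import os
import numpy as np

CACHE_DIR = "locations_cache"  # hypothetical on-disk cache directory

def cached_locations(freqs, grid_size, generate_locations):
    # Key the cache on the frequency setup and the grid size.
    key = hashlib.sha256(
        np.asarray(freqs).tobytes() + str(grid_size).encode()
    ).hexdigest()
    path = os.path.join(CACHE_DIR, key + ".npz")
    if os.path.exists(path):
        # Cache hit: reuse the locations computed on an earlier run.
        return np.load(path)["locs"]
    # Cache miss: take the compute hit once, then persist the result to disk.
    locs = generate_locations(freqs, grid_size)
    os.makedirs(CACHE_DIR, exist_ok=True)
    np.savez(path, locs=locs)
    return locs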
That makes a lot of sense. Perhaps we could also reorder so that even if we need to generate the locations it's done before the buffer is allocated/used, so we don't need to be juggling data while we're still setting up.
- self.iring.resize(igulp_size)
- self.oring.resize(ogulp_size, buffer_factor=5)
+ self.iring.resize(igulp_size, buffer_factor=128)
+ self.oring.resize(ogulp_size, buffer_factor=256)
Here is another large buffer in MOFFCorrelatorOp. The buffer_factor of 256 on the output ring could be one of the big things driving the GPU memory utilization.
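For a rough sense of scale, assuming the total ring allocation is roughly gulp size times buffer_factor (the 64 MiB gulp below is a made-up placeholder, not the real ogulp_size):

ogulp_size = 64 * 1024 * 1024  # hypothetical 64 MiB output gulp, for illustration only
for buffer_factor in (5, 128, 256):
    # Total ring allocation assumed to scale linearly with buffer_factor.
    print(f"buffer_factor={buffer_factor:3d} -> {ogulp_size * buffer_factor / 2**30:.1f} GiB")
# For this example gulp: ~0.3 GiB at 5, 8.0 GiB at 128, and 16.0 GiB at 256.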
- cores = [0, 2, 3, 4, 5, 6, 7]
- gpus = [0, 0, 0, 0, 0, 0, 0]
+ cores = [3, 4, 5, 6, 7]
+ gpus = [0, 0, 0, 0, 0]
Control of cores and gpus should probably be set by a command line flag. gpus could probably be reduced to just a single gpu argument that is used for everything.
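A minimal sketch of what that could look like (flag names and defaults here are placeholders, not existing EPIC options):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--cores",
    type=lambda s: [int(c) for c in s.split(",")],
    default=[3, 4, 5, 6, 7],
    help="Comma-separated CPU cores to bind the pipeline blocks to",
)
parser.add_argument(
    "--gpu",
    type=int,
    default=0,
    help="Single GPU device ID shared by all GPU-bound blocks",
)
args = parser.parse_args()
cores = args.cores
gpus = [args.gpu] * len(cores)  # one GPU id reused for every block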
This is an all-in-one PR to grab changes made over the last couple years. Some things included here:
This supersedes #20 and #36.