Add binary payload handling to PyLECO #82

BenediktBurger · 2024-05-22T15:58:40Z

Implements the transport of binary data in additional frames. Related to "How to transmit binary data" (pymeasure/leco-protocol#65).

Idea:

First payload frame is JSON
Second and any further frames are binary
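
A minimal sketch of this frame layout, using only the standard library. The helper names `build_payload` and `split_payload` are hypothetical and are not part of PyLECO; the sketch also assumes that a JSON frame is always present, which is one of the open questions below.

```python
import json
from typing import Any, List, Optional, Tuple


def build_payload(content: Any, additional_payload: Optional[List[bytes]] = None) -> List[bytes]:
    """Return the payload frames: one JSON frame followed by the raw binary frames."""
    frames = [json.dumps(content).encode()]
    frames.extend(additional_payload or [])
    return frames


def split_payload(frames: List[bytes]) -> Tuple[Any, List[bytes]]:
    """Decode the first frame as JSON and hand back the remaining frames untouched."""
    return json.loads(frames[0]), frames[1:]


# Hypothetical request content with two binary frames attached.
frames = build_payload({"id": 1, "method": "get_image"}, [b"\x00\x01", b"\xff"])
content, binary = split_payload(frames)
```

The binary frames are never passed through the JSON encoder, so no base64 step is needed.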

Implementation ideas:

MessageHandler offers the whole message to the RPC handler and adds the additional frames afterwards
Communicator can return the bytes
Communicator (and Director) can send bytes in an RPC request

Open questions:

Should a message with binary data have a different message type?
If not, how do we know whether the binary data is the desired response instead of the JSON response?
A return value of None with additional payload frames is a strong indicator, but not exclusive to this scenario. Is it sufficient?
The additional_payload of the Message: what should happen if no data is defined? Should there be an empty first data frame before the additional frames, or is the additional_payload the whole payload?

codecov · 2024-05-22T16:02:51Z

Codecov Report

Attention: Patch coverage is 98.79518% with 1 line in your changes missing coverage. Please review.

Project coverage is 89.52%. Comparing base (cd0bf05) to head (6ba4c19).
Report is 1 commits behind head on main.

Files	Patch %	Lines
pyleco/utils/listener.py	80.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #82      +/-   ##
==========================================
+ Coverage   89.04%   89.52%   +0.48%     
==========================================
  Files          36       36              
  Lines        2902     2931      +29     
  Branches      355      361       +6     
==========================================
+ Hits         2584     2624      +40     
+ Misses        267      256      -11     
  Partials       51       51

Flag	Coverage Δ
unittests	`89.52% <98.79%> (+0.48%)`	⬆️


BenediktBurger · 2024-05-23T07:27:37Z

@seb5g, in this PR I experiment with the transport of binary data via PyLECO.
As it is a feature you preferred over the base64 encoding, I want to make you aware.

BenediktBurger · 2024-05-24T17:05:25Z

Idea for:

If not, how do we know whether the binary data is the desired response instead of the JSON response?
A return value of None with additional payload frames is a strong indicator, but not exclusive to this scenario. Is it sufficient?

Just add a parameter that states whether to return a binary value.

BenediktBurger · 2024-05-28T15:25:27Z

@untzag, @ksunden, @VigneshVSV this could also be of interest for your own projects.

BenediktBurger · 2024-05-28T15:35:13Z

I ask for feedback regarding the current implementation:

This is how you call the method some_funny_method with additional binary payload (for the method):

# on the remote side (on MessageHandler):
def some_funny_method(self):
    additional_payload = self.current_message.payload[1:]
    # do something with additional_payload
    self.additional_response_payload = [b"whatever", b"to return", b"binary"]
    return None

# on the requester side:
ask("receiver", method="some_funny_method", additional_payload=[b"123", b"456"], extract_additional_payload=True)
# returns: [b"whatever", b"to return", b"binary"]

The parameter extract_additional_payload (independent of the additional payload on the way to the remote contact) will return the binary return value of some_funny_method, if the JSON response value is None.

VigneshVSV · 2024-05-28T18:59:52Z

I think it's a tricky question in general, as seen from some of the questions you raised in the issue.

Currently I allow returning either non-serialized data or serialized data only, not both. Technically a user could do:

def my_server_side_method_returning_bytes(self):
    return memoryview(self.my_large_numpy_array)

def my_server_side_method_normal(self):
    return 5, 'my name', self.foo_list

i.e. for the user, the usage of methods is the same and there is no need to access an additional request or response object. Although I don't see what is wrong with that either. Some HTTP servers that work by decorating functions, like Flask, allow access to the request object through a global variable. The same can then be done for the response.

Nevertheless, since I allow only one type (either one of bytes, bytearray or memoryview, or one of the other Python types, which needs to be serialized), it's a little easier for me to place them at different indices of the multipart message to know what the reply was. In this way, the client just checks which part of the multipart message containing return values has a size greater than 0: the user-given pre-serialized part or the part serialized by the RPC server.

Of course, the ideal and best case is to have both so that the user can return any damn return value. This is the right approach. Maybe it's best to reserve a tuple of size 2 as a fixed internal return value scheme. Just before serializing, the type can be checked to be a tuple and, if it's of length 2, one can proceed to inspect the type of each element. If one turns out to be a byte type, then don't serialize it again, as that will lead to a serialization error:

def my_mixed_returning_server_method(self):
    return { 1: self.some_foo_list, 3 : 42 }, memoryview(self.my_large_numpy_array)

Within the RPC server:

ret = func(*args, **kwargs)
if isinstance(ret, tuple) and len(ret) == 2:  # possibly one normal part and one byte part
    if isinstance(ret[1], (bytes, bytearray, memoryview)):  # check if it's really a byte part
        self.serializer.dumps(ret[0])  # ret[1] is already serialized
    else:
        # the return value was indeed a normal tuple of length 2
        self.serializer.dumps(ret)
elif not isinstance(ret, (bytes, bytearray, memoryview)):
    # a single return value of bytes type must not be serialized again
    self.serializer.dumps(ret)  # any other data, including tuples of lower or higher length

Of course, the return value of dumps must be assigned to a place within the multipart message.

This is the way I would do it.
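
The tuple-of-two scheme described above could be sketched as a standalone function. This is only an illustration under stated assumptions: `split_return_value` is a hypothetical name, JSON stands in for whatever serializer the RPC server uses, and an empty part marks "not used", following the "size greater than 0" check mentioned earlier.

```python
import json
from typing import Any, Tuple

BYTE_TYPES = (bytes, bytearray, memoryview)


def split_return_value(ret: Any) -> Tuple[bytes, bytes]:
    """Return (serialized_part, preserialized_part); an empty part means "not used"."""
    if isinstance(ret, BYTE_TYPES):
        # A single bytes-like return value is passed through without serialization.
        return b"", bytes(ret)
    if isinstance(ret, tuple) and len(ret) == 2 and isinstance(ret[1], BYTE_TYPES):
        # Reserved scheme: (normal value, bytes-like value).
        return json.dumps(ret[0]).encode(), bytes(ret[1])
    # Everything else, including ordinary tuples of any length, is serialized as a whole.
    return json.dumps(ret).encode(), b""


mixed = split_return_value(({"answer": 42}, memoryview(b"raw data")))
plain = split_return_value((5, "my name"))
only_bytes = split_return_value(b"raw data")
```

Unlike the snippet above, the sketch returns both parts so that the caller can place them at fixed indices of the multipart message.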

BenediktBurger · 2024-05-28T20:59:01Z

Thanks @VigneshVSV for your extensive answer.
Part of the challenge is the currently used rpc server. The server sees only the json request and returns a json response. I have to handle the binary things outside (with these external variables).

BenediktBurger · 2024-05-28T21:08:10Z

I have an idea (inspired by you): If the json frame is empty, the other frames are the return value.

BenediktBurger · 2024-05-29T10:17:53Z

Newest design:

In ask, the extract_additional_payload parameter decides on the return value: either the JSON return value, or a tuple of the JSON return value and the additional payload, which might be an empty list. Therefore it is always clearly defined what it returns.
The message handler offers a possibility to register a binary method:
- If a parameter is set, the method will receive the request's additional payload as a kwarg.
- Its return values will be filtered for binary data, which will be transmitted as additional payload.
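
A rough sketch of how such a registration could behave. Everything here is hypothetical and is not PyLECO's MessageHandler: the class `MiniHandler`, the names `register_binary_method`, `accept_binary_input` and `handle`, and the exact filtering rule in `filter_binary` are assumptions made for illustration only.

```python
import json
from typing import Any, Callable, Dict, List, Tuple

BYTE_TYPES = (bytes, bytearray, memoryview)


def filter_binary(result: Any) -> Tuple[Any, List[bytes]]:
    """Split a return value into its JSON part and a list of binary frames."""
    if isinstance(result, BYTE_TYPES):
        return None, [bytes(result)]
    if isinstance(result, tuple):
        binary = [bytes(r) for r in result if isinstance(r, BYTE_TYPES)]
        rest = [r for r in result if not isinstance(r, BYTE_TYPES)]
        # One remaining element is returned as is, several as a list, none as None.
        json_value = rest[0] if len(rest) == 1 else (rest or None)
        return json_value, binary
    return result, []


class MiniHandler:
    """Hypothetical stand-in for a message handler."""

    def __init__(self) -> None:
        self.methods: Dict[str, Callable[..., Tuple[Any, List[bytes]]]] = {}

    def register_binary_method(self, method: Callable[..., Any],
                               accept_binary_input: bool = False) -> None:
        def wrapper(request_frames: List[bytes], **kwargs: Any) -> Tuple[Any, List[bytes]]:
            if accept_binary_input:
                # Hand the request's additional frames to the method as a kwarg.
                kwargs["additional_payload"] = request_frames
            return filter_binary(method(**kwargs))

        self.methods[method.__name__] = wrapper

    def handle(self, name: str, params: Dict[str, Any],
               request_frames: List[bytes]) -> List[bytes]:
        """Call a registered method and build the response payload frames."""
        json_value, binary = self.methods[name](request_frames, **params)
        return [json.dumps(json_value).encode(), *binary]


def invert(additional_payload: List[bytes]) -> Tuple[int, bytes]:
    data = additional_payload[0]
    return len(data), bytes(255 - b for b in data)


handler = MiniHandler()
handler.register_binary_method(invert, accept_binary_input=True)
reply = handler.handle("invert", {}, [b"\x00\x0f"])
```

In this sketch the registered method itself stays an ordinary function; only the wrapper knows about frames.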

VigneshVSV · 2024-05-29T12:44:50Z

Regarding the ask RPC caller, I would still keep in mind that the server side method works transparently irrespective of whether it is called within another function or method, or from client side with the ask. I am not sure if the change you made affects this.

BenediktBurger · 2024-06-01T08:16:40Z

@VigneshVSV , thanks for your comments, they are helpful.
Registering a method makes it available via RPC. Therefore, you have to register it before you can call it via ask.

BenediktBurger · 2024-06-04T10:07:33Z

A small change:

You have to choose whether the method will return only JSON values or a tuple of a JSON value and a list of bytes.

Currently you have to tell the ask method whether to expect binary data or not. I do not consider this a major problem:

You know which method you called (and binary methods have a "(binary)" at the end of their docstring), so you know what to expect.
If in doubt, you can ask for the binary_code and mostly use the first (JSON) response value. Here it comes in handy that you know the return value (JSON or tuple of JSON and list) depending on the parameter.
Later on, we can still add a special MessageType which indicates a bytes payload.

seb5g · 2024-06-04T12:08:26Z

I'm not really using pyleco at the moment, as you are the one who implemented it in pymodaq... so I still have difficulties understanding how all this works. But for sure, everything within pymodaq can be binary serialized, so direct emission of such data (frame, as you call it?) would be better. I'm not sure how I could really help in the design...

BenediktBurger · 2024-06-04T21:00:24Z

Thanks for your comment @seb5g. I appreciate it.

BenediktBurger · 2024-06-19T10:56:45Z

Summary:

It is easier to add more payload frames (list of bytes) with the additional_payload parameter of the Message and DataMessage classes and of the ask_rpc methods.
interpret_rpc_result can return either just the content of the first payload frame (normally JSON) or that content plus the list of additional frames (maybe an empty list), depending on a parameter switch. ask_rpc also allows using that parameter, which makes it easy to retrieve the additional frames.
Finally, the MessageHandler offers to register binary methods, which can accept and/or return binary values as well.
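
The parameter switch in the second point can be illustrated with a small sketch. It is not PyLECO's interpret_rpc_result: the function name `interpret_result_sketch` is hypothetical, and the sketch assumes that the first payload frame holds a JSON-RPC 2.0 response object.

```python
import json
from typing import Any, List


def interpret_result_sketch(payload: List[bytes], extract_additional_payload: bool = False) -> Any:
    """Return the JSON result, or a tuple of the result and the additional frames."""
    response = json.loads(payload[0])
    if "error" in response:
        raise RuntimeError(response["error"])
    result = response.get("result")
    if extract_additional_payload:
        # Always a tuple; the list is empty if no additional frames were sent.
        return result, payload[1:]
    return result


json_frame = json.dumps({"jsonrpc": "2.0", "id": 1, "result": None}).encode()
only_json = interpret_result_sketch([json_frame, b"abc"])
with_binary = interpret_result_sketch([json_frame, b"abc"], extract_additional_payload=True)
```

Because the caller chooses the switch, the shape of the return value is known in advance and does not depend on what the remote side sent.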

BenediktBurger added 4 commits May 22, 2024 17:06

Make communicator return binary objects.

6ec8873

Make MessageHandler capable for binary objects.

4a49863

Add additional_payload to Message.

055d0fa

Refactor Director.ask_message

0db80a1

BenediktBurger added the enhancement New feature or request label May 22, 2024

BenediktBurger added 8 commits May 27, 2024 10:32

Modify formatting with ruff

df75c6a

Add parameter for extract binary response

7bc5c87

Add additional_payload option to ask_rpc

baa5797

Add additional payload options to director

0bbe03e

Add acceptance test for binary transfer.

17c9e2a

Return all additional payload frames.

a32402a

Update Director (and Fake) to changes.

ad66400

Tiny changes.

d970d39

BenediktBurger mentioned this pull request May 28, 2024

PyLECO binary transport PyMoDAQ/PyMoDAQ#308

Closed

Return either json value or json and binary

cb62581

Add a method to register binary methods.

d2d5ad8

BenediktBurger force-pushed the binary_payload branch from a8c0b08 to d2d5ad8 Compare May 29, 2024 11:19

Modify docstring of binary method.

c53d71c

BenediktBurger marked this pull request as ready for review June 1, 2024 10:59

BenediktBurger mentioned this pull request Jun 1, 2024

Add locking actor #84

Merged

BenediktBurger force-pushed the binary_payload branch from fa95894 to 938e3b0 Compare June 4, 2024 10:03

Explicitly state whether to return binary values.

6ba7917

BenediktBurger force-pushed the binary_payload branch from 938e3b0 to 6ba7917 Compare June 4, 2024 10:06

BenediktBurger added 5 commits June 5, 2024 09:15

State type of binary method in docstring.

453db24

Improve documentation

ef51d47

Make data_message similar to message

39ca4c5

Add binary sending to data publisher

8e3d82e

Fix creation of binary method

bbd1c53

BenediktBurger force-pushed the binary_payload branch from f1967fe to bbd1c53 Compare June 12, 2024 11:26

BenediktBurger mentioned this pull request Jun 12, 2024

Binary transfer via PyLECO PyMoDAQ/PyMoDAQ#319

Closed

Add changelog entry.

6ba4c19

BenediktBurger merged commit ca94ef5 into main Jun 19, 2024
20 checks passed

BenediktBurger deleted the binary_payload branch June 19, 2024 10:56
