You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It is true that streamz has specialised code for dealing with pandas dataframes and the dask distributed cluster.
It would not be hard to write similar wrappers around any other dataframe implementation, or to aggregate dataframes using custom logic alone - but I am not aware of anyone having done so. I am not sure how much of the existing dataframe code could be generalised to multiple backends.
(note that streamz does not use dask-dataframes)
Similarly, streamz only calls dask's submit and gather functions to dispatch work. The same pattern would work for any remote execution framework.
Does Streamz support modin or vaex? Besides pandas/dask, I'm thinking of trying a few more solutions.
The text was updated successfully, but these errors were encountered: