Return different matrix types for online serving #4714

franciscojavierarceo · 2024-10-29T02:17:57Z

Is your feature request related to a problem? Please describe.
We should allow Feature Views to return matrices/tensors natively, for example as a torch.Tensor.

At the moment, for some features we require the client to serialize the output into a matrix before running inference. Feast should support executing these transformations and serializing the data into matrices for both online and offline retrieval.

Describe the solution you'd like

features: torch.Tensor = store.get_online_features()

Describe alternatives you've considered
The alternative is the current state: no native support, which leaves users to write their own brittle logic to handle the various complexities.

Additional context
@HaoXuAI @tokoko I know we discussed sklearn pipelines in the past and I thought I'd share my thoughts.

HaoXuAI · 2024-10-29T06:05:25Z

A torch feature would be nice. I guess we need to relax the "timestamp" constraints in our APIs, since it probably doesn't make much sense to attach a timestamp to an embedding feature?

breno-costa · 2024-10-29T15:29:54Z

The method store.get_online_features(...) returns an OnlineResponse object that has conversion methods such as to_dict() and to_df(). Should this suggestion be implemented as another conversion method, like to_torch() or something similar?
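To make the conversion-method idea concrete, here is a minimal sketch of what such a helper might do. It assumes the response has already been turned into a column-oriented dict of lists (the shape to_dict() is understood to return); the function name features_to_rows, the feature names, and the sample values are hypothetical and not part of Feast.

```python
from typing import Any, Dict, List, Sequence


def features_to_rows(
    feature_dict: Dict[str, List[Any]],
    feature_names: Sequence[str],
) -> List[List[float]]:
    """Turn column-oriented feature output into a row-major matrix of floats.

    Hypothetical helper: one row per entity, one column per requested feature.
    """
    columns = [feature_dict[name] for name in feature_names]
    n_rows = len(columns[0]) if columns else 0
    if any(len(col) != n_rows for col in columns):
        raise ValueError("all feature columns must have the same length")
    return [[float(col[i]) for col in columns] for i in range(n_rows)]


# Made-up stand-in for the dict form of an online response.
online = {
    "driver_id": [1001, 1002],
    "conv_rate": [0.5, 0.25],
    "acc_rate": [0.9, 0.8],
}

# Select only the model inputs, in a fixed column order.
rows = features_to_rows(online, ["conv_rate", "acc_rate"])
```

If torch is installed, torch.tensor(rows) would turn the result into a 2x2 tensor. As the following comments point out, this is still a client-side serialization step rather than first-class tensor support.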

franciscojavierarceo · 2024-10-29T15:59:27Z

> A torch feature would be nice. I guess we need to relax the "timestamp" constraints in our APIs, since it probably doesn't make much sense to attach a timestamp to an embedding feature?

Agreed.

franciscojavierarceo · 2024-10-29T16:03:14Z

@breno-costa that code is a serialization step, though. We would want to treat Torch tensors (or xgb.DMatrix) as a first-class data type.

The concrete examples I'm thinking of are one-hot encoding and impact encoding. It'd be useful for us to handle these natively for MLEs, especially when handling unseen categories.
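As an illustration of the unseen-category problem, here is a minimal sketch of one-hot encoding that routes any value outside a fixed vocabulary to an extra "unknown" column. The function name one_hot, the vocabulary, and the extra-column convention are assumptions for illustration only; the thread does not specify how Feast would handle this.

```python
from typing import List, Sequence


def one_hot(values: Sequence[str], vocabulary: Sequence[str]) -> List[List[float]]:
    """One-hot encode values against a fixed vocabulary.

    Values not in the vocabulary land in a trailing "unknown" column
    (an assumed convention, one of several reasonable choices).
    """
    index = {category: i for i, category in enumerate(vocabulary)}
    unknown = len(vocabulary)       # position of the extra column
    width = len(vocabulary) + 1
    rows = []
    for value in values:
        row = [0.0] * width
        row[index.get(value, unknown)] = 1.0
        rows.append(row)
    return rows


# "hovercraft" was never seen when the vocabulary was built.
encoded = one_hot(["sedan", "suv", "hovercraft"], ["sedan", "suv", "truck"])
```

Here the third row has its single 1.0 in the last column, so the model receives a well-formed input of fixed width even for a category it has never seen.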

dandawg · 2024-10-29T16:52:40Z

This plus sparse tensors/sparse matrices could be a really cool optimization -- less data, faster I/O, more powerful API.
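To show why a sparse representation means less data, here is a minimal sketch that converts a dense, mostly-zero matrix into coordinate (COO) form, keeping only the non-zero entries. The function name to_coo and the sample matrix are hypothetical; this is not an existing Feast API.

```python
from typing import List, Sequence, Tuple


def to_coo(
    dense: Sequence[Sequence[float]],
) -> Tuple[List[Tuple[int, int]], List[float], Tuple[int, int]]:
    """Return (indices, values, shape) holding only the non-zero entries."""
    indices: List[Tuple[int, int]] = []
    values: List[float] = []
    for r, row in enumerate(dense):
        for c, value in enumerate(row):
            if value != 0:
                indices.append((r, c))
                values.append(value)
    shape = (len(dense), len(dense[0]) if dense else 0)
    return indices, values, shape


# A one-hot style matrix: 12 cells, only 3 of them non-zero.
dense = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
indices, values, shape = to_coo(dense)
```

Only three (row, column) pairs and three values need to be stored or sent instead of twelve cells, and the gap widens as the vocabulary grows. Libraries such as torch and scipy can build sparse tensors from this kind of (indices, values, shape) triple, though the exact index layout each expects differs.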

franciscojavierarceo · 2024-10-29T16:56:25Z

> This plus sparse tensors/sparse matrices could be a really cool optimization -- less data, faster I/O, more powerful API.

Exactly.

HaoXuAI · 2024-10-29T22:40:15Z

If we can leverage Arrow as our primary format, then it can be directly converted to pandas/torch with the Arrow APIs, I believe.

franciscojavierarceo · 2024-10-30T01:11:52Z

Cool, I'll check that out. This is basically the next step after vector support toward making NLP a first-class citizen.

franciscojavierarceo added the kind/feature New feature or request label Oct 29, 2024

franciscojavierarceo self-assigned this Nov 8, 2024
