Skip to content

How DataFusion could support other compute engines (libcudf, velox) #8498

Answered by alamb
spencerwilson asked this question in Q&A
Discussion options

You must be logged in to vote

👋 I'm wondering if anyone has contemplated how compute functions other than arrow::compute could be used with DataFusion

I think in general, many people do this today using the various extension APIs that come with DataFusion (docs):

DataFusion supports extension at many points:

read from any datasource (TableProvider)
define your own catalogs, schemas, and table lists (CatalogProvider)
build your own query language or plans (LogicalPlanBuilder)
declare and use user-defined functions (ScalarUDF, and AggregateUDF, WindowUDF)
add custom optimizer rewrite passes (OptimizerRule and PhysicalOptimizerRule)
extend the planner to use user-defined logical and physical nodes (QueryPlanner)
You c…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by spencerwilson
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants