Port spark sql tests to datafusion. #9693
Replies: 3 comments 11 replies
-
Spark functions do not yet map well to DataFusion functions so if those are included in the tests those will be an issue. I'm hoping that at some point in the (near?) future once the function migration is complete a new epic could be created to align function names, params and behaviour with the sql dialect being used (primarily I would expect PostgreSQL and Spark). That would generally be a pretty massive project though as anyone who has tried to mirror the behaviour of a different system can attest to - just talk to @andygrove. |
Beta Was this translation helpful? Give feedback.
-
I think it is a great idea to use existing test suites to improve compatibility / correctness of DataFusion. Thank you @liurenjie1024 |
Beta Was this translation helpful? Give feedback.
-
Let's just remain careful to follow standard SQL as default and not Spark SQL in terms of function names, syntaxes etc. We should obviously accept the latter if the user configures so, but there could be some work to do there to attain this level of flexibility. |
Beta Was this translation helpful? Give feedback.
-
Spark sql has been widely used in large scale etl, and we have see
comet donated to apache, which acts as an execution layer of spark sql.
To further improve datafusion's compatibility with spark sql, I propose to introduce port spark sql tests.
This will help to bring datafusion to next level in etl processing.
Beta Was this translation helpful? Give feedback.
All reactions