-
Notifications
You must be signed in to change notification settings - Fork 393
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update to Spark 2.4.3 + XGBoost 0.90 + MLeap 0.14 #327
Conversation
Codecov Report
@@ Coverage Diff @@
## master #327 +/- ##
=========================================
+ Coverage 86.68% 86.79% +0.1%
=========================================
Files 336 336
Lines 10821 10895 +74
Branches 359 562 +203
=========================================
+ Hits 9380 9456 +76
+ Misses 1441 1439 -2
Continue to review full report at Codecov.
|
… made to decision tree pruning in Spark 2.4. If nodes are split, but both child nodes lead to the same prediction then the split is pruned away. This updates the test so this doesn't happen for feature 'b'
LGTM |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
features/src/main/scala/com/salesforce/op/utils/spark/RichDataset.scala
Outdated
Show resolved
Hide resolved
LGTM |
Related issues
We would like to use XGBoost 0.90, but it requires Spark 2.4.x and also #184
Describe the proposed solution
PredictionModel.predict
(since 2.4 - [SPARK-10884][ML] Support prediction on single instance for regression and classification related models apache/spark#19381)spark-avro
dependency (since avro support is included with spark)Describe alternatives you've considered
NA