
fix: add metric example to xgboost docs #4917

Merged
merged 3 commits into main from add_metric_example_to_xgboost_docs on Oct 13, 2023

Conversation

djsauble
Contributor

Fixes: iterative/dvclive#721

The XGBoost docs have a dangling param variable that isn't defined anywhere. Since this is where you define your metric, and this is an important part of experimentation, I added an example of how to use it to set a metric via the eval_metric param.

@github-actions
Contributor

github-actions bot commented Oct 12, 2023

Link Check Report

There were no links to check!

@dberenbaum
Collaborator

Thanks @djsauble! As long as we are making changes, can we fix it up a bit more?

For example, can it follow a workflow that looks like the xgboost get started docs? It seems like that's what they see as their primary interface, and subjectively it looks easier to understand to me.

It would also be nice to flesh out the top example even more to show where other variables like dtrain and dval come from.

@djsauble
Contributor Author

djsauble commented Oct 13, 2023

> For example, can it follow a workflow that looks like the xgboost get started docs? It seems like that's what they see as their primary interface, and subjectively it looks easier to understand to me.

@dberenbaum Just to clarify, you're saying we should use something like this:

model = xgb.XGBClassifier()
model.fit(X_train, y_train)

…instead of xgb.train()?

> It would also be nice to flesh out the top example even more to show where other variables like dtrain and dval come from.

This is the working XGBoost code I ended up with after going through our existing docs + the hint about metrics from @mattseddon:

from dvclive import Live
from dvclive.xgb import DVCLiveCallback
import xgboost as xgb
from sklearn import datasets
from sklearn.model_selection import train_test_split

# Load the Iris dataset and split it 80/20 into train and test sets
iris = datasets.load_iris()

X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, train_size=0.8)

# Wrap the splits in XGBoost's DMatrix format
dtrain = xgb.DMatrix(X_train, label=y_train)
dtest = xgb.DMatrix(X_test, label=y_test)

with Live("results") as live:
    # DVCLiveCallback logs the eval_metric for the "eval_data" eval set each round
    xgb.train(
        {
            "eval_metric": "rmsle"
        },
        dtrain,
        100,  # num_boost_round
        early_stopping_rounds=5,
        callbacks=[DVCLiveCallback("eval_data")],
        evals=[(dtest, "eval_data")]
    )

    live.log_metric("summary_metric", 1.0, plot=False)

How much of this do you want to see in our docs? I don't really think we should go all the way back to loading a specific dataset. The focus here should be on the call to train() (or fit()) and the associated base parameters.

@dberenbaum
Collaborator

Fair points, @djsauble. I don't want to block the improvements you made, so I just left one minor comment to clarify and otherwise it looks good, thanks!

@djsauble
Contributor Author

@dberenbaum I actually agree with you that using .fit() makes sense. It's more familiar to me anyway. 🙂

I'll fix this up a bit more and ping you again for a quick review.

djsauble marked this pull request as draft on October 13, 2023, 16:22
@dberenbaum
Collaborator

> @dberenbaum I actually agree with you that using .fit() makes sense. It's more familiar to me anyway. 🙂

If we only show the sklearn-like .fit() interface, do you think it leaves the user confused about how to do it with xgb.train()?

@djsauble
Contributor Author

Let's choose one or the other. The sklearn interface is probably more familiar to people, but I don't feel strongly about this. It looks like both interfaces support callbacks, which is the only hard requirement for DVCLive.

Which interface do you want to go with?

@dberenbaum
Collaborator

Sounds good. Let's go with the sklearn interface then.

shcheklein temporarily deployed to dvc-org-add-metric-exam-3wx0bv on October 13, 2023, 17:53
djsauble marked this pull request as ready for review on October 13, 2023, 17:55
@djsauble
Contributor Author

@dberenbaum Switched to the sklearn interface and tested all code snippets locally. Think this is good now.
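
For reference, a rough sketch of what the sklearn-interface version might look like (assuming a recent xgboost where eval_metric and callbacks are constructor arguments and eval sets default to the name validation_0; the actual merged snippet may differ):

from dvclive import Live
from dvclive.xgb import DVCLiveCallback
import xgboost as xgb
from sklearn import datasets
from sklearn.model_selection import train_test_split

iris = datasets.load_iris()

X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, train_size=0.8)

with Live("results") as live:
    # In recent xgboost versions, eval_metric and callbacks are passed to the constructor
    model = xgb.XGBClassifier(
        n_estimators=100,
        eval_metric="mlogloss",
        callbacks=[DVCLiveCallback("validation_0")],
    )
    # eval_set entries are named validation_0, validation_1, ... by default
    model.fit(X_train, y_train, eval_set=[(X_test, y_test)])

    live.log_metric("summary_metric", 1.0, plot=False)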

shcheklein temporarily deployed to dvc-org-add-metric-exam-3wx0bv on October 13, 2023, 19:03
dberenbaum merged commit 3d6b6c3 into main on Oct 13, 2023
5 checks passed
dberenbaum deleted the add_metric_example_to_xgboost_docs branch on October 13, 2023, 19:22
Successfully merging this pull request may close these issues.

xgboost: clarify docs: make it easier to add metrics to an existing project