ML XGBoost Classification #484

PondiB · 2023-12-07T13:29:38Z

Specification for ML XGBoost for Classification.

Depends on #441

…ter_vector (#462)

…other processes. Default to numerical index instead of string. (#478)

* `filter_spatial`: Clarified that a masking get applied for the given geometries. #469 * `filter_bbox`: Clarified that the bounding box is reprojected to the CRS of the spatial data cube dimensions if required. --------- Co-authored-by: Stefaan Lippens <[email protected]>

…meter.

soxofaan · 2023-12-07T17:08:08Z

meta/subtype-schemas.json

+            "type": "object",
+            "subtype": "ml-model",
+            "title": "Machine Learning Model",
+            "description": "A machine learning model, accompanied with STAC metadata that implements the the STAC ml-model extension."


accompanied with STAC metadata that implements the the STAC ml-model extension

What does this practically mean here in this context of defining a JSON schema? Isn't that more a concern of a process like save_ml_model that actually "exports" the model to a more concrete form?

accompanied with STAC metadata that implements the the STAC ml-model extension

What does this practically mean here in this context of defining a JSON schema? Isn't that more a concern of a process like save_ml_model that actually "exports" the model to a more concrete form?

This was to remove the error due to the ml model being returned. I created a branch from the draft and not the ml branch.

proposals/ml_fit_class_xgboost.json

soxofaan · 2023-12-07T17:21:11Z

proposals/ml_fit_class_xgboost.json

+            }
+        },
+        {
+            "name": "seed",


a lot of parameters are listed here. Are these based on a particular xgboost implementation? And are we sure that they are translatable to other implementation? Otherwise I think it would make sense to drop a couple from this initial spec, and leave some wiggle room for backend implementers

I remember we had this same issue with the random forest process

Yeah, I had checked python and R libraries and these params can be specified : https://xgboost.readthedocs.io/en/stable/parameter.html

I can mabye remove 3 : early_stopping_rounds, nfolds and nrounds to be dealt with internally. I set default values for most params though.

m-mohr

@PondiB I think it would make sense to make PRs against the ml branch because otherwise all changes from the ML branch will also appear in this PR. This leads to confusion as you can see with @soxofaan's comments. Please rebase your changes against the ML branch if necessary and set the base branch of the PR to ml.

soxofaan · 2023-12-11T17:45:13Z

hmm this PR now has "27 files changed", most of the changes are irrelevant to the original issue (ML XGBoost)

PondiB · 2023-12-12T09:30:41Z

hmm this PR now has "27 files changed", most of the changes are irrelevant to the original issue (ML XGBoost)

Lol, Never noticed it in the evening. I did run npm test locally no idea how it led to this. I will rebase the commit and push again.

PondiB · 2023-12-12T10:37:08Z

Moved to #487

soxofaan and others added 13 commits September 30, 2023 09:21

Issue #460 doc crossreferences between filter_bbox/filter_spatial/fil…

0833d4e

…ter_vector (#462)

Move tests to dev

c130dd7

Merge remote-tracking branch 'origin/draft' into draft

836a84b

Use x \ y instead of a \ b

c2d77e2

sqrt: Clarified that NaN is returned for negative numbers #474 (#475)

13c3f85

clip: Throw an exception if min > max #472 (#477)

4fd92b2

array_append: Added number type for labels to be consistent with …

ab4a62e

…other processes. Default to numerical index instead of string. (#478)

between: Clarify that null is passed through

d8cf96a

eq and neq: Explicitly set the minimum value for the delta para…

899b824

…meter.

Clarify linear_scale_range

ab2e6c2

xgboost classification specification

a306cae

xgboost classification specification

b4068d6

soxofaan reviewed Dec 7, 2023

View reviewed changes

m-mohr changed the base branch from draft to ml December 8, 2023 10:33

m-mohr changed the base branch from ml to draft December 8, 2023 10:34

m-mohr requested changes Dec 8, 2023

View reviewed changes

PondiB changed the base branch from draft to ml December 8, 2023 13:34

reset

e98dd7f

PondiB force-pushed the ml-xgboost-class branch from b83aaad to e98dd7f Compare December 12, 2023 09:43

reset using ml branch

ff1599a

PondiB closed this Dec 12, 2023

PondiB deleted the ml-xgboost-class branch January 3, 2024 14:50

PondiB mentioned this pull request Feb 21, 2024

ML Generic API #497

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ML XGBoost Classification #484

ML XGBoost Classification #484

PondiB commented Dec 7, 2023 •

edited by m-mohr

Loading

soxofaan Dec 7, 2023

PondiB Dec 11, 2023

soxofaan Dec 7, 2023

PondiB Dec 11, 2023 •

edited

Loading

m-mohr left a comment

soxofaan commented Dec 11, 2023

PondiB commented Dec 12, 2023

PondiB commented Dec 12, 2023

ML XGBoost Classification #484

ML XGBoost Classification #484

Conversation

PondiB commented Dec 7, 2023 • edited by m-mohr Loading

soxofaan Dec 7, 2023

Choose a reason for hiding this comment

PondiB Dec 11, 2023

Choose a reason for hiding this comment

soxofaan Dec 7, 2023

Choose a reason for hiding this comment

PondiB Dec 11, 2023 • edited Loading

Choose a reason for hiding this comment

m-mohr left a comment

Choose a reason for hiding this comment

soxofaan commented Dec 11, 2023

PondiB commented Dec 12, 2023

PondiB commented Dec 12, 2023

PondiB commented Dec 7, 2023 •

edited by m-mohr

Loading

PondiB Dec 11, 2023 •

edited

Loading