-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
XGBSE error with sklearn pipeline and GridSearchCV #61
Labels
bug
Something isn't working
Comments
I stumbled into the same issue. Is there any workout to this, other than tuning hyperparameters with a vanilla XGBoost model and simply transferring those to the XGBSe one? |
We have developed a workaround
`After you define the sklearn compatible model things work out as normal
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Code sample
__
Problem description
__
Use of GridSearchCV is not possible because XGBSE requires hyperparameters to be unique and to be passed during model initiation. Furthermore, parameter vales in the parameter dict need to be without [], while GridSearchCV expects values in [].
XGBSE therefore seems to be incompatible with GridSearchCV.
Furthermore, XGBSE seems to be incompatible with sklearn's pipeline.
If the sklearn pipeline is fitted, the estimator XGBSE receives the X dataframe as a np.array in the last step of the pipeline, which misses an index. This gives a corresponding error because XGBSE fitting seems to require X.index.
Expected behavior
__
It would be expected that XGBSE can be used with GridSearchCV and pipeline.
Possible solutions
__
It would be required that hyperparameters could be defined and that fitting would allow X without an index (np.array).
The text was updated successfully, but these errors were encountered: