[python-package][PySpark] Expose Training and Validation Metrics #11132

ayoub317 · 2024-12-29T02:00:09Z

Unlike the JVM binding, where, after training the XGBoost Spark model, we can retrieve a summary of the training and evaluation sets passed, this functionality is not currently available in the PySpark Python XGBoost binding.

In the JVM package, this is defined through this class. We obtain the summary while training the model, which first passes through this method, and in the end reaches this line. After that, when creating the Spark[Classification|Regression|Ranker]Model, we pass through a constructor like this one.

I followed a similar approach to expose and retrieve these metrics in the PySpark XGBoost binding, and I will be submitting a review soon. Any feedback is welcome.

trivialfis · 2024-12-29T04:16:00Z

Thank you for the PR. Do you plan to work on the summary class? I think the JVM package is mimicking the spark ML summary structure, there are similar classes for pyspark ml.

ayoub317 · 2024-12-29T10:55:52Z

Hello Jiaming, thanks for your quick response. I implemented the summary class and pushed the review.

Indeed ! And in PySpark they implemente a wrapper around the JVM one . Here, I implemented everything from scratch because the PySpark XGBoost binding is not a wrapper around the XGBoost JVM package binding.

ayoub317 linked a pull request Dec 29, 2024 that will close this issue

[python-package][PySpark] Expose Training and Validation Metrics #11133

Open

trivialfis added the status: need update label Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python-package][PySpark] Expose Training and Validation Metrics #11132

[python-package][PySpark] Expose Training and Validation Metrics #11132

ayoub317 commented Dec 29, 2024

trivialfis commented Dec 29, 2024

ayoub317 commented Dec 29, 2024 •

edited

Loading

[python-package][PySpark] Expose Training and Validation Metrics #11132

[python-package][PySpark] Expose Training and Validation Metrics #11132

Comments

ayoub317 commented Dec 29, 2024

trivialfis commented Dec 29, 2024

ayoub317 commented Dec 29, 2024 • edited Loading

ayoub317 commented Dec 29, 2024 •

edited

Loading