Allows disabling refitting of CV estimators #30463

AhmedThahir · 2024-12-11T12:40:28Z

Reference Issues/PRs

Addresses "Allow disabling refitting of cross-validation estimators" (#30396).
Created PR as per @jeremiedbb's advice.

What does this implement/fix? Explain your changes.

What

Allows disabling the refit of cross-validation estimators (such as LassoCV, RidgeCV) on the full training set after the best hyperparameters have been found.
Users can toggle this behavior with the keyword argument refit.

Why
Users should not have to spend resources on refitting when they only want one or more of the following, none of which requires a refit:

optimal hyperparameter
cv_results_ for the different hyperparameters
best_score_ of all the hyperparameters

This is especially impactful for large datasets.

How

Added a refit argument; passing refit=False disables refitting.
The default is refit=True, so existing usage does not break.
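To make the intended semantics concrete, here is a minimal sketch in plain Python. It is not the code from this PR and not scikit-learn code: TinyRidgeCV is a hypothetical toy class (single feature, no intercept) that only mirrors the attribute names mentioned above. It shows that the best hyperparameter, the per-hyperparameter scores, and the best score are all available before any final fit, so the final fit can be skipped.

```python
# Toy illustration only: TinyRidgeCV is a hypothetical class, not scikit-learn code.
class TinyRidgeCV:
    """1-D ridge regression without intercept, tuned by k-fold cross-validation."""

    def __init__(self, alphas=(0.01, 1.0, 100.0), cv=3, refit=True):
        self.alphas = alphas
        self.cv = cv
        self.refit = refit

    @staticmethod
    def _solve(x, y, alpha):
        # Closed-form ridge solution for one feature: w = <x, y> / (<x, x> + alpha)
        return sum(a * b for a, b in zip(x, y)) / (sum(a * a for a in x) + alpha)

    def fit(self, x, y):
        n = len(x)
        # Interleaved folds: fold i holds samples i, i + cv, i + 2 * cv, ...
        folds = [list(range(i, n, self.cv)) for i in range(self.cv)]
        self.cv_results_ = {}
        for alpha in self.alphas:
            fold_mse = []
            for test_idx in folds:
                held_out = set(test_idx)
                train_idx = [i for i in range(n) if i not in held_out]
                w = self._solve(
                    [x[i] for i in train_idx], [y[i] for i in train_idx], alpha
                )
                errors = [(y[i] - w * x[i]) ** 2 for i in test_idx]
                fold_mse.append(sum(errors) / len(errors))
            # Mean squared error averaged over folds (lower is better here)
            self.cv_results_[alpha] = sum(fold_mse) / len(fold_mse)
        self.alpha_ = min(self.cv_results_, key=self.cv_results_.get)
        self.best_score_ = self.cv_results_[self.alpha_]
        if self.refit:
            # The only step that touches the full training set
            self.coef_ = self._solve(x, y, self.alpha_)
        return self


x = [float(i) for i in range(1, 13)]
y = [2.0 * v for v in x]  # noiseless data, so the smallest alpha wins

search_only = TinyRidgeCV(refit=False).fit(x, y)
full = TinyRidgeCV(refit=True).fit(x, y)
```

In this sketch search_only exposes alpha_, cv_results_, and best_score_ but has no coef_, while full has all four. Whether the real estimators would drop coef_ entirely or handle predict differently when refit=False is a design question for the PR itself.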

Any other comments?

This is my first (hopefully first of many) PR for scikit-learn. If you have any feedback on my implementation/PR documentation/etc, feel free to share - I'd really appreciate it.

Thanks to @paulAdrienMarie for all the support!

Addresses scikit-learn#30396

github-actions · 2024-12-11T12:41:52Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 59712ad.

AhmedThahir · 2024-12-11T14:47:55Z

I think I'm done from my side with the code changes and documentation.

Code changes for ElasticNetCV and MultiTaskElasticNetCV will be done by @paulAdrienMarie.

AhmedThahir · 2024-12-11T18:28:46Z

@kayo09, please do not make such reviews - it confuses us.

Only @paulAdrienMarie and I are assigned to work on this PR.

Edit: @kayo09, could you try to remove the review?

Update _coordinate_descent.py

AhmedThahir

@paulAdrienMarie The refit parameter was supposed to appear after the cv parameter. This is not a programming issue, but a conflict with the sklearn docs automation: I followed the scheme used by GridSearchCV and wrote the documentation in the same order, i.e. refit after cv.

Just sharing this as feedback so that it may be useful for you in the future - not as criticism.

This is the error message that pointed me to the problem:

AssertionError: Docstring Error:
In function: sklearn.linear_model._coordinate_descent.ElasticNetCV.__init__
There's a parameter name mismatch in function docstring w.r.t. function signature, at index 8 diff: 'cv' != 'refit'
Full diff:
['l1_ratio',
'eps',
'n_alphas',
'alphas',
'fit_intercept',
'precompute',
'max_iter',
'tol',
+  'refit',
'cv',
'copy_X',
-  'refit',
'verbose',
'n_jobs',
'positive',
'random_state',
'selection']
In function: sklearn.linear_model._coordinate_descent.MultiTaskElasticNetCV.__init__
There's a parameter name mismatch in function docstring w.r.t. function signature, at index 7 diff: 'cv' != 'refit'
Full diff:
['l1_ratio',
'eps',
'n_alphas',
'alphas',
'fit_intercept',
'max_iter',
'tol',
+  'refit',
'cv',
'copy_X',
-  'refit',
'verbose',
'n_jobs',
'random_state',
'selection']
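The kind of check behind this error can be approximated with the standard library. The sketch below is an assumption-laden illustration, not scikit-learn's actual test: the helper names (docstring_param_names, signature_param_names, first_mismatch) and the example_init function are hypothetical, and the parser only handles simple numpydoc-style "name : type" lines. It compares the parameter order in a docstring with the order in the signature and reports the first index where they differ.

```python
import inspect
import re

# Section titles that end the "Parameters" block in this simplified parser
_OTHER_SECTIONS = {"Returns", "Attributes", "See Also", "Notes", "Examples"}


def docstring_param_names(func):
    """Return parameter names listed in a numpydoc-style Parameters section."""
    doc = inspect.getdoc(func) or ""  # getdoc strips the common indentation
    names = []
    in_params = False
    for line in doc.splitlines():
        if line.strip() == "Parameters":
            in_params = True
            continue
        if not in_params:
            continue
        if line.strip() in _OTHER_SECTIONS:
            break
        # Parameter lines start at column 0; descriptions are indented
        match = re.match(r"^(\w+) : ", line)
        if match:
            names.append(match.group(1))
    return names


def signature_param_names(func):
    """Return parameter names in signature order, skipping self."""
    return [name for name in inspect.signature(func).parameters if name != "self"]


def first_mismatch(func):
    """Return (index, docstring_name, signature_name) or None if orders agree."""
    pairs = zip(docstring_param_names(func), signature_param_names(func))
    for index, (doc_name, sig_name) in enumerate(pairs):
        if doc_name != sig_name:
            return index, doc_name, sig_name
    return None


def example_init(tol=1e-4, refit=True, cv=None, copy_X=True):
    """Hypothetical constructor whose docstring order differs from its signature.

    Parameters
    ----------
    tol : float, default=1e-4
        Tolerance for the optimization.
    cv : int, default=None
        Number of folds.
    copy_X : bool, default=True
        Whether to copy X.
    refit : bool, default=True
        Whether to refit on the full training set.
    """


mismatch = first_mismatch(example_init)
```

Here mismatch identifies index 1, where the docstring lists cv but the signature lists refit. The fix is the same as in this thread: move the parameter in either the signature or the docstring so both use one order.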

Allow disabling refitting of CV estimators

e3b6399

Addresses scikit-learn#30396

github-actions bot added the module:linear_model label Dec 11, 2024

AhmedThahir mentioned this pull request Dec 11, 2024

Allow disabling refitting of cross-validation estimators #30396

Open

AhmedThahir added 4 commits December 11, 2024 18:15

Adding for Ridge, Lars, OrthogonalMatchingPursuitCV

777bb55

Documenting parameter

0650ec1

Update _least_angle.py

5204077

Formatting

3a0b781

paulAdrienMarie added a commit to paulAdrienMarie/scikit-learn that referenced this pull request Dec 11, 2024

scikit-learn#30463 Allowing disabling refitting of CV estimators

28bbd61

Setting parameter constraints

2b6557b

paulAdrienMarie added a commit to paulAdrienMarie/scikit-learn that referenced this pull request Dec 11, 2024

scikit-learn#30463 removing modifications on the test file

c0dcb89

Pop refit from path_params

8c8dacc

kayo09 approved these changes Dec 11, 2024


paulAdrienMarie and others added 4 commits December 11, 2024 19:42

Update _coordinate_descent.py

4759bb8

Merge pull request #1 from paulAdrienMarie/patch-1

ad719e8

Update _coordinate_descent.py

Merge

6177f9f

Reorder refit parameter

59712ad
