Propagate warnings to all workers in joblib #30380
Conversation
LGTM once the following are addressed.
@@ -0,0 +1,2 @@
- Filter warnings from the main process are propagated to joblib workers.
Suggested change:
- Filter warnings from the main process are propagated to joblib workers.
+ Warning filters from the main process are propagated to joblib workers.
sklearn/utils/parallel.py
Outdated
previous_filters = warnings.filters
warnings.filters = warning_filters
yield
warnings.filters = previous_filters
For the threading backend, this should effectively be a no-op, but since the warnings module is famously not thread-safe, I would rather test this case explicitly as part of our test suite.
I added a more explicit check here: 06aa344 (#30380)
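For illustration, a hypothetical version of such an explicit threading-backend check could look like the sketch below (this is not the content of commit 06aa344; the test and helper names are invented):

import warnings

from sklearn.utils.parallel import Parallel, delayed


def _probe_user_warning_is_error():
    # Return True when the currently active filters turn UserWarning into an error.
    try:
        warnings.warn("probe", UserWarning)
        return False
    except UserWarning:
        return True


def test_warning_filters_visible_with_threading_backend():
    # Filters set in the main thread should be in effect inside the tasks,
    # since all threads share the state of the warnings module.
    with warnings.catch_warnings():
        warnings.simplefilter("error", UserWarning)
        results = Parallel(n_jobs=2, backend="threading")(
            delayed(_probe_user_warning_is_error)() for _ in range(4)
        )
    assert all(results)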
sklearn/utils/parallel.py
Outdated
with (
    config_context(**config),
    _warning_filter_context(warning_filters=warning_filters),
):
Rather than creating our own context manager, maybe it would be slightly simpler to use warnings.catch_warnings and set warnings.filters = warning_filters, i.e. something like this:
with config_context(**config), warnings.catch_warnings():
    warnings.filters = warning_filters
    return self.function(*args, **kwargs)
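As an aside, the suggestion relies on the fact that warnings.catch_warnings() snapshots warnings.filters on entry and restores it on exit, so filters installed inside the block cannot leak into later tasks. A small standalone illustration (not part of the PR):

import warnings

before = list(warnings.filters)

with warnings.catch_warnings():
    # Simulate installing the filters captured in the main process.
    warnings.simplefilter("error", UserWarning)
    assert warnings.filters != before

# The previous filters are restored as soon as the block exits.
assert list(warnings.filters) == before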
sklearn/utils/parallel.py
Outdated
@@ -21,10 +22,10 @@
 _threadpool_controller = None


-def _with_config(delayed_func, config):
+def _with_config(delayed_func, config, warning_filters):
Maybe rename the function to _with_config_and_warning_filters to be explicit?
Maybe it is worth adding a test that makes sure that, for the loky backend, the filters do not leak into the workers as a side effect, something like:

def test_filter_warning_propagates_no_side_effect_with_loky_backend():
    with warnings.catch_warnings():
        warnings.simplefilter("error", category=ConvergenceWarning)
        Parallel(n_jobs=2, backend="loky")(
            delayed(time.sleep)(0) for i in range(10)
        )

    # Make sure that inside the loky workers the warning filters have been reset
    # to their original value: using joblib directly should not turn
    # ConvergenceWarning into an error.
    joblib.Parallel(n_jobs=2, backend="loky")(
        joblib.delayed(warnings.warn)("Convergence warning", ConvergenceWarning)
        for _ in range(10)
    )
Thanks, LGTM!
I pushed a small tweak and set this to automerge.
Reference Issues/PRs
Closes #29294
What does this implement/fix? Explain your changes.
This PR propagates the warning filters from the main process to each joblib worker. The filters are applied in the same place where the global scikit-learn configuration is already propagated.
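For reference, a simplified sketch of this mechanism (illustrative only: it mirrors the structure of sklearn/utils/parallel.py, but the method name and details below are not necessarily the merged code):

import warnings

from sklearn import config_context


class _FuncWrapper:
    """Carry the caller's scikit-learn config and warning filters into the worker."""

    def __init__(self, function):
        self.function = function

    def with_config_and_warning_filters(self, config, warning_filters):
        # Called in the main process at dispatch time to capture both states.
        self.config = config
        self.warning_filters = warning_filters
        return self

    def __call__(self, *args, **kwargs):
        # Executed in the worker: re-install the config and the warning filters,
        # letting catch_warnings restore the worker's previous filters afterwards.
        with config_context(**self.config), warnings.catch_warnings():
            warnings.filters = self.warning_filters
            return self.function(*args, **kwargs)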