API: Make np.squeeze always return an ndarray for scalar inputs by spandan11106 · Pull Request #31561 · numpy/numpy

spandan11106 · 2026-06-04T06:03:25Z

PR summary

Closes #30109.
Currently, np.squeeze behaves inconsistently with scalar-like inputs:

>>> np.squeeze(1)           # Python int → 0d array
array(1)
>>> np.squeeze(np.array(1)) # 0d ndarray → 0d array
array(1)
>>> np.squeeze(np.int_(1))  # np.generic scalar → scalar (unexpected behavior)
np.int64(1)

The issue is that np.generic subclasses have a .squeeze() method that returns self (the scalar itself). The current squeeze implementation in fromnumeric.py tries a.squeeze first, which succeeds for np.generic scalars and returns the scalar unchanged. This contradicts the docstring, which states:

"Note that if all axes are squeezed, the result is a 0d array and not a scalar."

This PR converts np.generic inputs to 0d arrays using ellipsis indexing a[...] before invoking a.squeeze.
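
The conversion described above can be sketched as a standalone wrapper. This is a minimal illustration and not the PR's actual diff: `squeeze_as_array` is a hypothetical name, and it wraps the public np.squeeze instead of patching numpy/_core/fromnumeric.py.

```python
import numpy as np


def squeeze_as_array(a, axis=None):
    """Hypothetical wrapper: squeeze `a`, never returning a NumPy scalar."""
    if isinstance(a, np.generic):
        # Ellipsis indexing turns a NumPy scalar into a 0-d ndarray.
        a = a[...]
    return np.squeeze(a, axis=axis)


# Python scalars, 0-d arrays and NumPy scalars all end up as 0-d arrays.
for value in (1, np.array(1), np.int_(1), np.float64(2.5)):
    result = squeeze_as_array(value)
    print(type(value).__name__, "->", type(result).__name__, result.shape)
```

The wrapper only special-cases np.generic; every other input still goes through np.squeeze unchanged.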

Changes:

Runtime Fix: Converts np.generic inputs to 0d arrays in numpy/_core/fromnumeric.py.
Type Stubs: Updates numpy/_core/fromnumeric.pyi to remove the scalar-returning overload of squeeze (so it falls back to returning NDArray[ScalarT]).
Tests: Adds tests in numpy/_core/tests/test_multiarray.py checking various scalar types, axis=0, and Python scalars.
Release Note: Adds a compatibility news fragment in doc/release/upcoming_changes/27709.compatibility.rst.

First time committer introduction

Hi everyone! I am Spandan. I use NumPy for development/data science work, and noticed the inconsistent behavior when squeezing NumPy scalars compared to Python scalars. I wanted to contribute to the repository to clean up this edge case.

AI Disclosure

Antigravity was used to assist in researching the issue and drafting the test cases.

mhvk

Thanks for the PR. Some comments on the tests inline, but before worrying about those, I guess we'll need to think about what the best behaviour is. As you note, the docstring is pretty clear that an array should be returned, but of course the actual implementation is to defer to an object's squeeze() method. And a problem with the approach here is that np.squeeze(scalar) and scalar.squeeze() will no longer do the same thing. I think that would be even more surprising.

I think we'll need to make a call. Either,

Adjust np.generic.squeeze() to return an array; or
Adjust the docstring of np.squeeze() to note explicitly it defers to .squeeze() and that numpy scalars remain scalars.

I'm not sure what is best, though in the face of doubt, I think I slightly prefer to adjust the docstring to match reality rather than the reverse. Let me ping @seberg, @mattip for other opinions.
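
The tension between the two options comes from np.squeeze forwarding to the input's own squeeze method. A small illustration of that forwarding follows; `Duck` is a made-up class that exists only to make the deferral visible, and the scalar lines reflect the behaviour described in this PR's summary.

```python
import numpy as np


class Duck:
    """Made-up object with its own squeeze method (not an array)."""

    def squeeze(self, axis=None):
        return "forwarded"


# np.squeeze hands the call to the object's own method when it has one.
print(np.squeeze(Duck()))

# NumPy scalars also define squeeze. In the behaviour described in this PR
# it returns the scalar itself, so np.squeeze passes the scalar through.
scalar = np.int_(1)
print(type(scalar.squeeze()).__name__)
```

Whatever the object's method returns is what np.squeeze returns, which is why changing only the function would make it disagree with scalar.squeeze().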

p.s. Unrelated, but I don't really like that np.squeeze(scalar, axis=0) does not error.
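
For reference, the axis handling in question can be reproduced on a plain 0-d array, which is what the PR turns a NumPy scalar into before squeezing. A short sketch, using only 0-d ndarray inputs:

```python
import numpy as np

zero_d = np.array(1)

# axis=0 is accepted for a 0-d array, even though it has no axes.
print(np.squeeze(zero_d, axis=0).shape)

# Any other axis is out of bounds and raises an AxisError,
# which is a subclass of ValueError.
try:
    np.squeeze(zero_d, axis=1)
except ValueError as exc:
    print(type(exc).__name__)
```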

mhvk · 2026-06-04T06:43:42Z

+            assert result[()] == scalar
+
+        # axis=0 should also work for scalar inputs
+        result = np.squeeze(np.int_(1), axis=0)


I don't know that I would bother to test what to me seems surprising behaviour (I would expect an error, like for axis=1...)

mhvk · 2026-06-04T06:46:28Z

        assert_raises(ValueError, a.squeeze, axis=(1,))
        assert_equal(a.squeeze(axis=(2,)), [[1, 2, 3]])

+    def test_squeeze_scalar_returns_0d_array(self):


To me, it would make more sense to move the tests to test_numeric.py, since we're testing a function from fromnumeric.

mhvk · 2026-06-04T06:46:41Z

+    def test_squeeze_scalar_returns_0d_array(self):
+        # np.squeeze should always return an ndarray, even for
+        # np.generic scalar inputs (gh-27709)
+        for scalar in [np.int_(1), np.float64(2.5), np.complex128(1+2j)]:


I'd use @pytest.mark.parametrize on the function. You can then include a python scalar too.

mhvk · 2026-06-04T06:47:29Z

+        # np.generic scalar inputs (gh-27709)
+        for scalar in [np.int_(1), np.float64(2.5), np.complex128(1+2j)]:
+            result = np.squeeze(scalar)
+            assert isinstance(result, np.ndarray), (


No need for the explanation in the assert.

seberg · 2026-06-04T07:31:36Z

> I'm not sure what is best, though in the face of doubt, I think I slightly prefer to adjust the docstring to match reality rather than the reverse. Let me ping @seberg, @mattip for other opinions.

Yeah, I asked the same on the issue. I think Joren would prefer to make the typing clear one way or another. For myself, it feels a bit silly for scalar.squeeze() to return an array, so I wouldn't mind just keeping things as is...
(Maybe the real trick would be to get away from the method forwarding behavior, in which case this silly function could actually use np.asarray() and the split becomes more sane, because this approach feels a bit ad-hoc to me. -- but of course that is a can of worms :/)
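
One way to read the np.asarray() idea is sketched below. Both `squeeze_via_asarray` and the `Tagged` subclass are hypothetical names introduced for illustration; the sketch also shows part of the "can of worms", since dropping the method forwarding means ndarray subclasses no longer survive the call.

```python
import numpy as np


class Tagged(np.ndarray):
    """Made-up ndarray subclass used to show what forwarding preserves."""


def squeeze_via_asarray(a, axis=None):
    """Hypothetical variant that ignores a.squeeze and always uses ndarray."""
    return np.asarray(a).squeeze(axis=axis)


tagged = np.zeros((1, 3)).view(Tagged)

# Forwarding keeps the subclass; converting with np.asarray drops it.
print(type(np.squeeze(tagged)).__name__)
print(type(squeeze_via_asarray(tagged)).__name__)

# NumPy scalars become 0-d arrays without any special-casing.
print(squeeze_via_asarray(np.int_(1)).shape)
```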

EDIT: We should probably move to the issue, but I am not sure we can move this PR forward until we have a decision there.

mhvk · 2026-06-04T07:55:06Z

@seberg - oops, I should have seen that this was to fix an issue; I've now moved the discussion there. FWIW, I'm now leaning even more towards just having the docstring match the implementation instead of the reverse.

spandan11106 added 2 commits June 1, 2026 04:15

API: Make np.squeeze always return an ndarray for scalar inputs

4669dd4

Fixed underline issue

5283804

mhvk reviewed Jun 4, 2026


seberg added the "57 - Close?" label (Issues which may be closable unless discussion continued) Jun 4, 2026

spandan11106 wants to merge 2 commits into numpy:main from spandan11106:squeezeing-scalars


3 participants
