I can not achieve inplace scaling by using sklearn.preprocessing.minmax_scale #27307

guanjiesun · 2023-09-06T09:57:49Z

Describe the bug

By setting the copy=False, ndarray data has not changed unexpectedly

Steps/Code to Reproduce

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import sklearn.preprocessing as pre

np.random.seed(10)
data = np.random.randint(1, 10, size=(5, 3))
print(data)
pre.minmax_scale(data, feature_range=(0, 1), axis=0, copy=False)
print(data)

Expected Results

A reasonable explanation about the copy parameter of minmax_scala funciton

Actual Results

There are no warings and errors, just the result is not wrong!

Versions

Python dependencies:
      sklearn: 1.3.0
          pip: 23.2.1
   setuptools: 65.5.0
        numpy: 1.25.2
        scipy: 1.11.2
       Cython: None
       pandas: 2.0.3
   matplotlib: 3.7.2
       joblib: 1.3.2
threadpoolctl: 3.2.0

lesteve · 2023-09-06T14:18:52Z

copy=False only works if the input array dtype is a float dtype, i.e. float64, float32 or float16 right now. I guess maybe the documentation could be improved to mention this?

In your case the input array dtype is an int dtype.

TaiJuWu · 2023-09-07T15:05:31Z

After this line
There is a new instance of X and the data type of new one is float.
So maybe you should modify your code to below.
data=pre.minmax_scale(data, feature_range=(0, 1), axis=0, copy=False)

guanjiesun · 2023-09-11T11:58:56Z

copy=False only works if the input array dtype is a float dtype, i.e. float64, float32 or float16 right now. I guess maybe the documentation could be improved to mention this?

In your case the input array dtype is an int dtype.

Yes, thanks for you answer! The official doc really needs an improvement.

guanjiesun · 2023-09-11T12:27:07Z

After this line There is a new instance of X and the data type of new one is float. So maybe you should modify your code to below. data=pre.minmax_scale(data, feature_range=(0, 1), axis=0, copy=False)

Thanks for you reply, but I think there should't return anything of the funtion and just make the data normalized inplacely.

After I set the dtype of data to np.float, data is normalized inplace, but still return a useless copy of data, i.e., data_copy, this does make nonse and not consistent with the official doc" copy =False would avoid a copy of data".

Official doc about the copy parameter of function minmax_scale

karthic25 · 2023-10-13T04:45:30Z

/take

konstantinos-p · 2023-10-31T13:24:58Z

This issue had stalled to the best of my knowledge.

guanjiesun added Bug Needs Triage Issue requires triage labels Sep 6, 2023

lesteve added Documentation help wanted and removed Bug Needs Triage Issue requires triage labels Sep 7, 2023

github-actions bot assigned karthic25 Oct 13, 2023

github-actions bot removed the help wanted label Oct 13, 2023

konstantinos-p mentioned this issue Oct 31, 2023

DOC improve documentation of copy=False in preprocessing functions #27691

Merged

lesteve closed this as completed in #27691 Dec 7, 2023

lesteve mentioned this issue Apr 10, 2024

Unexpected behavior of sklearn.feature_selection.mutual_info_regression if copy=False #28793

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I can not achieve inplace scaling by using sklearn.preprocessing.minmax_scale #27307

I can not achieve inplace scaling by using sklearn.preprocessing.minmax_scale #27307

guanjiesun commented Sep 6, 2023

lesteve commented Sep 6, 2023

TaiJuWu commented Sep 7, 2023

guanjiesun commented Sep 11, 2023

guanjiesun commented Sep 11, 2023

karthic25 commented Oct 13, 2023

konstantinos-p commented Oct 31, 2023

I can not achieve inplace scaling by using sklearn.preprocessing.minmax_scale #27307

I can not achieve inplace scaling by using sklearn.preprocessing.minmax_scale #27307

Comments

guanjiesun commented Sep 6, 2023

Describe the bug

By setting the copy=False, ndarray data has not changed unexpectedly

Steps/Code to Reproduce

Expected Results

A reasonable explanation about the copy parameter of minmax_scala funciton

Actual Results

There are no warings and errors, just the result is not wrong!

Versions

lesteve commented Sep 6, 2023

TaiJuWu commented Sep 7, 2023

guanjiesun commented Sep 11, 2023

guanjiesun commented Sep 11, 2023

Thanks for you reply, but I think there should't return anything of the funtion and just make the data normalized inplacely.

After I set the dtype of data to np.float, data is normalized inplace, but still return a useless copy of data, i.e., data_copy, this does make nonse and not consistent with the official doc" copy =False would avoid a copy of data".

Official doc about the copy parameter of function minmax_scale

karthic25 commented Oct 13, 2023

konstantinos-p commented Oct 31, 2023