Torchvision API - ColorJitter and Grayscale operators by mdabek-nvidia · Pull Request #6272 · NVIDIA/DALI

mdabek-nvidia · 2026-03-22T12:49:07Z

Category:

New feature

Description:

Implementation of Torchvision OO API operators:

ColorJitter
Grayscale

Implementation of Torchvision functional operators:

rgb_to_grayscale
to_grayscale

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

Signed-off-by: Marek Dabek <[email protected]>

dali/python/nvidia/dali/experimental/torchvision/v2/color.py

mdabek-nvidia · 2026-03-22T21:42:40Z

!build

mdabek-nvidia · 2026-03-22T21:43:17Z

@greptileai - please review

dali-automaton · 2026-03-22T21:45:18Z

CI MESSAGE: [46735031]: BUILD STARTED

greptile-apps · 2026-03-22T21:47:12Z

Greptile Summary

This PR adds ColorJitter and Grayscale operators (OO API) and to_grayscale / rgb_to_grayscale functional API entries to the experimental torchvision compatibility layer. As part of the change, layout-dimension extraction logic previously duplicated in resize.py and centercrop.py is consolidated into two new helpers in operator.py (get_HWC_from_layout_dynamic and get_HWC_from_layout_pipeline), and both existing operators are updated to use the new 4-tuple unpacking.

Key points:

ColorJitter delegates to fn.color_twist and supports both CPU and GPU devices. Parameters are validated in __init__ via VerificationBCS / VerificationHue before the DALI pipeline is constructed.
Grayscale uses fn.color_space_conversion (RGB→GRAY), fn.cat (1→3 channels), or fn.hsv with saturation=0 (3→3 desaturate), covering all four input/output channel combinations.
Validation gap in VerificationHue: a negative scalar hue (e.g. hue=-0.1) silently passes validation but causes __init__ to produce an inverted range (0.1, -0.1). This in turn causes fn.random.uniform(range=(0.1, -0.1)) to fail at runtime. The fix is a simple non-negativity guard on the scalar branch, mirroring torchvision's own check.
Dead else branch in _create_param: returns a bare float instead of a list, which would break _get_BCSH if reached (it is currently unreachable).
The test suite is otherwise solid, covering all device/channel combinations and invalid-parameter cases — just missing the negative-scalar-hue case.
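The four Grayscale input/output channel combinations listed above can be sketched as a small dispatch table. This is only an illustration of the branching described in the summary: `grayscale_plan` is a hypothetical helper, not a function from the PR, and the returned strings merely name the DALI operator the summary associates with each case.

```python
def grayscale_plan(num_output_channels: int, input_channels: int) -> str:
    """Name the branch taken for a given (output, input) channel pair.

    Hypothetical helper for illustration; the PR's operator calls DALI
    functions directly instead of returning a description.
    """
    if num_output_channels not in (1, 3):
        raise ValueError("num_output_channels must be 1 or 3")
    if input_channels not in (1, 3):
        raise ValueError("input must have 1 or 3 channels")

    plans = {
        (1, 3): "fn.color_space_conversion RGB->GRAY",
        (1, 1): "no-op",
        (3, 1): "fn.cat x3 on the channel axis",
        (3, 3): "fn.hsv with saturation=0",
    }
    return plans[(num_output_channels, input_channels)]


if __name__ == "__main__":
    for out_c in (1, 3):
        for in_c in (1, 3):
            print(f"{out_c}x{in_c}: {grayscale_plan(out_c, in_c)}")
```

Keeping the mapping in one table makes it easy to see that every combination is covered and that unsupported channel counts are rejected up front.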

Confidence Score: 4/5

Safe to merge once the negative-scalar hue validation gap in VerificationHue is addressed — all other changes are clean refactors or well-tested new features.
The PR is well-structured and previous review concerns (integer inputs, HW layout IndexError) have been resolved. One targeted P1 fix remains: negative scalar hue bypasses validation and creates an inverted range at runtime. That path is easy to reproduce and easy to fix, making this a 4 rather than a 5.
dali/python/nvidia/dali/experimental/torchvision/v2/color.py — specifically VerificationHue.verify and the _create_param else-branch.

Important Files Changed

Filename	Overview
dali/python/nvidia/dali/experimental/torchvision/v2/color.py	New file implementing ColorJitter and Grayscale operators. Contains a P1 bug: negative scalar hue (e.g. hue=-0.1) passes VerificationHue but creates an inverted range (0.1, -0.1) in __init__, causing fn.random.uniform to fail at runtime. Also has a dead else-branch in _create_param that would return a scalar instead of a list.
dali/python/nvidia/dali/experimental/torchvision/v2/operator.py	Adds VerifyIfNonNegative, get_input_shape_dynamic, get_HWC_from_layout_dynamic, and get_HWC_from_layout_pipeline helpers, consolidating layout-parsing logic previously duplicated across resize.py and centercrop.py. The new get_HWC_from_layout_dynamic correctly handles the "HW" layout by returning c=1; the pipeline variant does not, but that path is unreachable from adjust_input.
dali/python/nvidia/dali/experimental/torchvision/v2/functional/color.py	New functional API file exposing to_grayscale and rgb_to_grayscale. Both delegate to the same _grayscale helper, correctly decorated with @adjust_input. Logic mirrors the Grayscale operator and looks correct.
dali/test/python/torchvision/test_tv_color.py	New test file covering Grayscale and ColorJitter. Good coverage for PIL inputs across devices, but missing a test case for negative scalar hue, which is currently an undetected validation gap. GPU path for functional API (to_grayscale/rgb_to_grayscale) is not exercised in the parametrised tests.
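The layout handling described for operator.py can be illustrated with a short sketch. It assumes a layout string such as "HWC", "CHW", or "HW" and a plain shape sequence; `hwc_from_layout` is a hypothetical stand-in, not the PR's get_HWC_from_layout_dynamic, which according to the summary returns a 4-tuple that also carries the tensor.

```python
from typing import Sequence, Tuple


def hwc_from_layout(layout: str, shape: Sequence[int]) -> Tuple[int, int, int]:
    """Return (height, width, channels) for a layout string and shape.

    Hypothetical helper: a layout without a "C" axis (e.g. "HW") is
    treated as single-channel, matching the c=1 behaviour described
    in the review summary.
    """
    if len(layout) != len(shape):
        raise ValueError(f"layout {layout!r} does not match shape {tuple(shape)}")
    if "H" not in layout or "W" not in layout:
        raise ValueError(f"layout {layout!r} has no H or W axis")

    h = shape[layout.index("H")]
    w = shape[layout.index("W")]
    c = shape[layout.index("C")] if "C" in layout else 1
    return h, w, c


if __name__ == "__main__":
    print(hwc_from_layout("HWC", (480, 640, 3)))
    print(hwc_from_layout("CHW", (3, 480, 640)))
    print(hwc_from_layout("HW", (480, 640)))
```

Looking axes up by name instead of by fixed position is what lets one helper serve both channel-last and channel-first inputs.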

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["User input\n(PIL.Image / torch.Tensor)"] --> B["adjust_input decorator\ntransform_input()"]
    B --> C{"Input type"}
    C -->|"PIL.Image"| D["ndd.Tensor\nlayout=HWC"]
    C -->|"torch.Tensor 3D"| E["ndd.Tensor\nlayout=CHW"]
    C -->|"torch.Tensor >3D"| F["ndd.Batch\nlayout=CHW"]

    D & E & F --> G{"Operator"}

    G -->|"ColorJitter"| H["VerificationBCS + VerificationHue\n(in super().__init__)"]
    H --> I["_create_param(brightness/contrast/saturation)\n→ list[float, float]"]
    I --> J["hue scalar → (-hue, hue) tuple"]
    J --> K["_kernel: _get_BCSH\nsamples random factors"]
    K --> L["fn.color_twist()\nbright/contrast/sat/hue"]

    G -->|"Grayscale"| M["VerificationGSOutputChannels\n(in super().__init__)"]
    M --> N["preprocess_data:\nget_HWC_from_layout_pipeline\n→ (h, w, c, tensor)"]
    N --> O{"num_output_channels × c"}
    O -->|"1×3: RGB→Gray"| P["fn.color_space_conversion\nRGB→GRAY"]
    O -->|"1×1: no-op"| Q["pass"]
    O -->|"3×1: replicate"| R["fn.cat × 3 on C axis"]
    O -->|"3×3: desaturate"| S["fn.hsv saturation=0"]

    L & P & Q & R & S --> T["output ndd.Tensor/Batch"]
    T --> U["adjust_output\n→ PIL.Image or torch.Tensor"]
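The `_create_param` step in the flowchart turns a brightness, contrast, or saturation argument into a two-element range. A minimal sketch of that conversion follows, assuming the torchvision convention that a scalar `v` means the range `[max(0, 1 - v), 1 + v]`. The function name `jitter_range` is hypothetical and the PR's actual `_create_param` may differ in detail.

```python
from typing import List, Sequence, Union

Number = Union[int, float]


def jitter_range(value: Union[Number, Sequence[Number]], center: float = 1.0) -> List[float]:
    """Convert a ColorJitter-style argument into [low, high].

    Assumption: mirrors torchvision's documented semantics for
    brightness/contrast/saturation; not taken from the PR's code.
    """
    if isinstance(value, (int, float)):
        if value < 0:
            raise ValueError("scalar value must be non-negative")
        low = max(center - float(value), 0.0)  # factors cannot go below 0
        high = center + float(value)
    elif isinstance(value, (list, tuple)) and len(value) == 2:
        low, high = float(value[0]), float(value[1])
    else:
        raise TypeError("value must be a number or a sequence of two numbers")

    if not 0.0 <= low <= high:
        raise ValueError(f"invalid range [{low}, {high}]")
    # Always return a list, including in the scalar branch.
    return [low, high]


if __name__ == "__main__":
    print(jitter_range(0.5))
    print(jitter_range(2.0))
    print(jitter_range((0.8, 1.2)))
```

Returning a list from every branch avoids the scalar-versus-list mismatch the review flags in the dead else-branch.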

Comments Outside Diff (1)

dali/python/nvidia/dali/experimental/torchvision/v2/color.py, line 79-85 (link)

Negative scalar hue bypasses validation and creates an inverted range

When a negative scalar is passed (e.g., hue=-0.1), VerificationHue.verify converts it to [-0.1, -0.1] and then checks hue[0] < -0.5 or hue[1] > 0.5 — both conditions are false, so validation passes silently.

Meanwhile, in __init__ the scalar is converted via (-float(hue), float(hue)), producing (0.1, -0.1) — an inverted range where min > max. When _get_BCSH later calls fn.random.uniform(range=(0.1, -0.1)), the inverted bounds will cause a runtime error.

Torchvision enforces 0 <= hue <= 0.5 for scalar inputs, and the verification should mirror this check.

The test suite in test_tv_color.py doesn't include hue=-0.1 among the invalid-param cases, so this path is currently untested.
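A minimal sketch of the guard, assuming the scalar and two-element forms described above. `verify_hue` is a hypothetical standalone function, not the PR's VerificationHue.verify, and the error messages are illustrative.

```python
from typing import Sequence, Tuple, Union

Number = Union[int, float]


def verify_hue(hue: Union[Number, Sequence[Number]]) -> Tuple[float, float]:
    """Validate a hue argument and return it as a (low, high) range.

    Hypothetical helper: a scalar must satisfy 0 <= hue <= 0.5 and maps
    to (-hue, hue); a pair must satisfy -0.5 <= low <= high <= 0.5.
    """
    if isinstance(hue, (int, float)):
        if not 0.0 <= hue <= 0.5:
            # Rejects hue=-0.1 before it can become the inverted range (0.1, -0.1).
            raise ValueError(f"scalar hue must be in [0, 0.5], got {hue}")
        return (-float(hue), float(hue))

    if isinstance(hue, (list, tuple)) and len(hue) == 2:
        low, high = float(hue[0]), float(hue[1])
        if not -0.5 <= low <= high <= 0.5:
            raise ValueError(f"hue range must satisfy -0.5 <= low <= high <= 0.5, got {hue}")
        return (low, high)

    raise TypeError("hue must be a number or a sequence of two numbers")


if __name__ == "__main__":
    print(verify_hue(0.1))
    try:
        verify_hue(-0.1)
    except ValueError as err:
        print("rejected:", err)
```

Adding `hue=-0.1` to the invalid-parameter cases in the test suite would exercise the scalar branch of such a check.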

Reviews (4): Last reviewed commit: "Review fixes"

dali-automaton · 2026-03-23T00:44:25Z

CI MESSAGE: [46735031]: BUILD PASSED

mdabek-nvidia · 2026-03-23T08:41:18Z

!build

dali-automaton · 2026-03-23T08:45:33Z

CI MESSAGE: [46765427]: BUILD STARTED

dali-automaton · 2026-03-23T15:47:08Z

CI MESSAGE: [46765427]: BUILD FAILED

mdabek-nvidia · 2026-03-24T13:21:32Z

@greptileai please re-review

mdabek-nvidia · 2026-03-24T13:21:43Z

!build

dali-automaton · 2026-03-24T13:26:37Z

CI MESSAGE: [46875703]: BUILD STARTED

dali-automaton · 2026-03-24T17:46:14Z

CI MESSAGE: [46875703]: BUILD PASSED

mdabek-nvidia added 2 commits March 19, 2026 13:30

Center crop operator

e09d04b

Signed-off-by: Marek Dabek <[email protected]>

Review fixes

1b043c0

Signed-off-by: Marek Dabek <[email protected]>

github-advanced-security bot found potential problems Mar 22, 2026

dali/python/nvidia/dali/experimental/torchvision/v2/color.py (fixed)

mdabek-nvidia force-pushed the torchvision_color branch from d236be4 to 1a74665 Compare March 22, 2026 21:10

greptile-apps bot reviewed Mar 22, 2026

mdabek-nvidia force-pushed the torchvision_color branch 2 times, most recently from ac68f8c to 994315b Compare March 23, 2026 08:40

mdabek-nvidia and others added 7 commits March 24, 2026 11:34

Review fixes

f30e282

Signed-off-by: Marek Dabek <[email protected]>

Apply suggestion from @stiepan

3337aa9

Signed-off-by: Marek Dabek <[email protected]> Co-authored-by: Kamil Tokarski <[email protected]>

Torchvision ColorJitter and Grayscale implementations

ebbf863

Signed-off-by: Marek Dabek <[email protected]>

Review fixes

d8c5cd4

Signed-off-by: Marek Dabek <[email protected]>

Review fixes

4dc37b5

Signed-off-by: Marek Dabek <[email protected]>

Review fixes

e146f9f

Signed-off-by: Marek Dabek <[email protected]>

Review fixes

6a0038d

Signed-off-by: Marek Dabek <[email protected]>

mdabek-nvidia force-pushed the torchvision_color branch from 994315b to 6a0038d Compare March 24, 2026 13:19

mdabek-nvidia marked this pull request as ready for review March 24, 2026 14:02

Review fixes

f53d4ec

Signed-of-by: Marek Dabek <[email protected]>

mdabek-nvidia force-pushed the torchvision_color branch from 544b424 to f53d4ec Compare March 24, 2026 14:07

dali-automaton assigned banasraf and mzient Mar 25, 2026
