Add extension `subgroup-size-control` by Jiawei-Shao · Pull Request #5578 · gpuweb/gpuweb

Jiawei-Shao · 2026-02-27T07:02:55Z

This patch adds a new WebGPU and WGSL extension subgroup-size-control
based on subgroup-size-control.md.

This patch adds a new WebGPU and WGSL extension `subgroup-size-control` based on `proposals/subgroup-size-control.md`.

github-actions · 2026-02-27T07:59:25Z

Previews, as seen when this build job started (f7741c6):
WebGPU _{webgpu.idl | Explainer | Correspondence Reference}
WGSL _{grammar.js | wgsl.lalr.txt}

jimblandy · 2026-03-10T18:04:51Z

Should this PR also update the correspondence reference?

Jiawei-Shao · 2026-03-12T03:17:38Z

Should this PR also update the correspondence reference?

Done

Jiawei-Shao · 2026-03-30T08:00:46Z

PTAL, thanks!

Jiawei-Shao · 2026-05-12T06:12:43Z

Hi @jimblandy and @mwyrzykowski ,

I've updated this PR. PTAL, thanks!

kainino0x

API LGTM.
The last WGSL changes should still be reviewed by a WGSL person.
And we still need tests before landing (that's tracked by the open comment, which will block the PR from landing).

jimblandy · 2026-05-18T23:33:11Z

I'll take a look at this today.

jimblandy

I found a few editorial issues. Please look through the generated HTML to make sure the formatting is coming through the way it should.

jimblandy

Editorial issues aside, this looks good to me.

kainino0x · 2026-05-19T18:28:44Z

+
+These are not surfaced in WebGPU because:
+- The D3D12 `waveLaneCountMax` is not reliable according to [DirectXShaderCompiler Wiki](https://github.com/microsoft/DirectXShaderCompiler/wiki/Wave-Intrinsics/#caps-flags).
+- The D3D12 `waveLaneCountMin` may differ from the actual minimum subgroup sizes used in fragment shaders on some Intel GPUs.


Continuing a conversation from gpuweb/cts#4641 (comment) @Jiawei-Shao @jrprice:

I understand that if we set subgroupMinSize to 8 (because of fragment) then it's not going to be valid for compute on this device. I guess what I'm saying is if this AdapterInfo isn't going to provide enough information about the hardware to actually use the API, it seems bad.

IF it would otherwise be the case that all values between subgroupMinSize and subgroupMaxSize are valid for trivial pipelines, I really would like to try to maintain that invariant and test it. The test (now removed) suggested this would be true, by arbitrarily choosing subgroupMaxSize as the value to test, for all devices except these Intel ones.

The options I see for doing that are:

Redefine subgroupMinSize so it only applies to compute - assuming this is a non-starter since it's used for more than just subgroup-size-control

Separate the compute and fragment subgroupMinSize/subgroupMaxSize

Implementation artificially increases subgroupMinSize to 16 even for fragment by using subgroup size control to prevent 8 from actually being used in fragment - no clue if this is possible

IF NOT, it doesn't really matter and we can leave the spec as is. The test (when we re-add it) should be changed to keep trying different subgroup sizes until it find one that works. We can use the vendor/architecture to choose the order in which we try different subgroup sizes.

The test (now removed) suggested this would be true, by arbitrarily choosing subgroupMaxSize as the value to test, for all devices except these Intel ones.

I've put this test in another PR 4643.

I guess what I'm saying is if this AdapterInfo isn't going to provide enough information about the hardware to actually use the API, it seems bad.

Part of the argument I made in this comment (discussed more in these meeting minutes) was that the value of this limit alone often isn't enough to use the feature either. Knowing that the minimum possible subgroup size is 8 vs 16 vs ... doesn't tell you enough about the memory hierarchy or other hardware properties to decide how your shader should operate. Middleware will want to know if it's Intel vs NVIDIA vs AMD vs ..., and potentially different generations of these architectures, in order to decide how to tile data through memory and registers (for example). At this point they have likely already decided what subgroup size they want based on the architecture, and they won't be considering using 8 on Intel regardless of what the limit says.

I agree that it'd be nice if every value between the limits was reliable usable for a trivial pipeline. My understanding is that over time that will increasingly be true, as it is just this one generation of Intel GPUs that has this issue.

Sure, real applications have more complex requirements. But the issue with this Intel generation seems very specific, just that there's one value that's valid for fragment that's not valid for compute? That seems like something to try to paper over, not something to expose as a wart.

For CTS, there's two reasons I would like to aspirationally test that all values in the range work for a trivial pipeline: (1) so applications can (at least mostly) not worry about that particular detail, (2) so that we can check that the values the implementation exposes are actually the correct ones for the device. The latter is quite valuable IMO.

Implementation artificially increases subgroupMinSize to 16 even for fragment by using subgroup size control to prevent 8 from actually being used in fragment - no clue if this is possible

James clarified to me offline that my idea to paper over the difference by using the backend's subgroup size control to prevent fragment shaders from ever selecting 8 (even if it would be allowed) won't work, because subgroup size control is compute-only. So seems like papering over is probably not feasible and the only two reasonable options are internal error (the current proposal) or adding a new limit.

The current option does have me somewhat questioning the value of subgroupMinSize/subgroupMaxSize in the first place if sometimes there will be values in that range that just don't work at all. But I guess that ship has sailed.

Redefine subgroupMinSize so it only applies to compute - assuming this is a non-starter since it's used for more than just subgroup-size-control

Maybe this is still feasible? If subgroupMinSize/subgroupMaxSize is going to have very limited use anyway.

@jrprice @dneto0 What do you think about Kai's comment?

This would be a breaking change to the API. I don't know how frequently these properties are used for fragment shaders but I'm not sure that we could change them at this stage.

How about subgroupMaxSize ?

Actually the D3D12 document mentions WaveLaneCountMax is not reliable so currently we have to always choose 128 for subgroupMaxSize on D3D12, but 128 is not acceptable on lots of GPUs.

Jiawei-Shao · 2026-05-27T01:02:02Z

API LGTM. The last WGSL changes should still be reviewed by a WGSL person. And we still need tests before landing (that's tracked by the open comment, which will block the PR from landing).

Hi @kainino0x,

I've added all the tests to CTS (in gpuweb/cts#4640).

PTAL, thanks!

jimblandy · 2026-06-01T15:27:06Z

The CTS PR has been merged. @kainino0x, is this ready to land?

jimblandy · 2026-06-09T16:59:19Z

The CTS PR has been merged. @kainino0x, is this ready to land?

I was pushing to get this resolved, but I want to withdraw my pressure here. Based on the discussion above, it looks to me like this question is still very much unresolved. The web will have to deal with whatever we put in the spec forever, so it is very much worthwhile for us to spend a few weeks ensuring that we're doing the best we can do.

Jiawei-Shao added 7 commits February 27, 2026 14:38

Add extension subgroup-size-control

f17c4e8

This patch adds a new WebGPU and WGSL extension `subgroup-size-control` based on `proposals/subgroup-size-control.md`.

Move subgroup-size-control and subgroup_size to the last

a6d6e7a

Fix a typo

5f91fdd

Simply some statements

7837e6d

Fix build error

d38045f

More fixes

502846a

More fix

7aaed3a

Jiawei-Shao marked this pull request as draft February 27, 2026 07:19

Jiawei-Shao added 2 commits February 27, 2026 15:24

More fix

0243e09

More fix

85d7d06

Jiawei-Shao marked this pull request as ready for review February 27, 2026 07:58

dneto0 requested review from dneto0 February 27, 2026 14:40

alan-baker requested changes Mar 1, 2026

View reviewed changes

Comment thread spec/index.bs Outdated

Comment thread spec/index.bs Outdated

Comment thread spec/index.bs Outdated

Comment thread wgsl/index.bs Outdated

Comment thread wgsl/index.bs Outdated

Comment thread wgsl/index.bs Outdated

Jiawei-Shao marked this pull request as draft March 4, 2026 07:59

Address reviewer's comments

39d36da

Jiawei-Shao marked this pull request as ready for review March 4, 2026 08:02

jimblandy mentioned this pull request Mar 10, 2026

Subgroup Size Control #5545

Open

Jiawei-Shao added 2 commits March 12, 2026 09:44

Merge branch 'main' into add-subgroup-size-control

0d7ecc5

Update correspondence reference

a36b76f

Jiawei-Shao marked this pull request as draft March 12, 2026 03:06

Jiawei-Shao added 2 commits March 12, 2026 11:11

Fix build error

a78a7e3

More fix

a02d357

Jiawei-Shao marked this pull request as ready for review March 12, 2026 03:17

Jiawei-Shao added 2 commits March 17, 2026 10:06

Merge branch 'main' into add-subgroup-size-control

07a258e

Merge branch 'main' into add-subgroup-size-control

22e75bf

Address reviewer's comments

07934d0

Jiawei-Shao requested a review from jrprice April 17, 2026 02:35

jrprice approved these changes Apr 29, 2026

View reviewed changes

Enable subgroups automatically with subgroup-size-control

ed4d294

alan-baker approved these changes May 6, 2026

View reviewed changes

Merge branch 'main' into add-subgroup-size-control

3192a9f

kainino0x reviewed May 12, 2026

View reviewed changes

Comment thread spec/index.bs Outdated

Comment thread spec/index.bs Outdated

Comment thread spec/index.bs Outdated

kainino0x reviewed May 12, 2026

View reviewed changes

Comment thread spec/index.bs

Jiawei-Shao mentioned this pull request May 13, 2026

Add "subgroup-size-control" feature gpuweb/types#201

Draft

Jiawei-Shao added 2 commits May 13, 2026 10:54

Address reviewer's comments

a963977

Merge branch 'main' into add-subgroup-size-control

6d584e2

kainino0x requested a review from alan-baker May 13, 2026 22:01

kainino0x approved these changes May 13, 2026

View reviewed changes

Jiawei-Shao mentioned this pull request May 14, 2026

Add tests on the featuresubgroup-size-control gpuweb/cts#4640

Closed

Jiawei-Shao added 2 commits May 14, 2026 15:48

Merge branch 'main' into add-subgroup-size-control

994434e

Require subgroups when enabling subgroup_size_control in WGSL

ac04c31

alan-baker approved these changes May 18, 2026

View reviewed changes

jimblandy reviewed May 19, 2026

View reviewed changes

Comment thread wgsl/index.bs

Comment thread wgsl/index.bs Outdated

Comment thread wgsl/index.bs

jimblandy approved these changes May 19, 2026

View reviewed changes

kainino0x reviewed May 19, 2026

View reviewed changes

Jiawei-Shao added 4 commits May 22, 2026 13:59

Merge branch 'main' into add-subgroup-size-control

b3741fc

Address reviewer's comments

34971e2

Fix typo

09f8c12

Fix build error

7ae8740

Merge branch 'main' into add-subgroup-size-control

f7741c6

Conversation

Jiawei-Shao commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jimblandy commented Mar 10, 2026

Uh oh!

Jiawei-Shao commented Mar 12, 2026

Uh oh!

Jiawei-Shao commented Mar 30, 2026

Uh oh!

Jiawei-Shao commented May 12, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kainino0x left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jimblandy commented May 18, 2026

Uh oh!

jimblandy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jimblandy left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kainino0x Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Jiawei-Shao commented May 27, 2026

Uh oh!

jimblandy commented Jun 1, 2026

Uh oh!

jimblandy commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Jiawei-Shao commented Feb 27, 2026 •

edited

Loading

github-actions Bot commented Feb 27, 2026 •

edited

Loading

kainino0x left a comment •

edited

Loading

kainino0x Jun 1, 2026 •

edited

Loading