Dynamic concat gpu support by turneram · Pull Request #5032 · ROCm/AMDMIGraphX

turneram · 2026-07-02T00:02:16Z

Motivation

Dynamic concat is required to run a dynamic kv-cache.

Technical Details

Adds the changes needed to run concat with dynamic-shape inputs on the GPU.
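
To illustrate what concat with dynamic shapes involves, here is a minimal standalone sketch. It is not MIGraphX code: `concat_dims` is a hypothetical helper that computes the output dimensions at run time, assuming the usual concat rule that all inputs agree on every dimension except the concat axis, where the lengths are summed. With a dynamic kv-cache, the length of the past keys along that axis is only known per call.

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper, not part of MIGraphX: compute the output dimensions of
// a concat along `axis` from input dimensions that are only known at run time.
std::vector<std::size_t> concat_dims(const std::vector<std::vector<std::size_t>>& inputs,
                                     std::size_t axis)
{
    if(inputs.empty())
        throw std::invalid_argument("concat needs at least one input");
    std::vector<std::size_t> out = inputs.front();
    if(axis >= out.size())
        throw std::invalid_argument("axis out of range");
    for(std::size_t i = 1; i < inputs.size(); ++i)
    {
        const auto& dims = inputs[i];
        if(dims.size() != out.size())
            throw std::invalid_argument("rank mismatch");
        for(std::size_t d = 0; d < dims.size(); ++d)
        {
            if(d == axis)
                out[d] += dims[d]; // lengths add up along the concat axis
            else if(dims[d] != out[d])
                throw std::invalid_argument("non-axis dimension mismatch");
        }
    }
    return out;
}
```

For example, concatenating a past key of dimensions {1, 5, 3, 64} with a new key of dimensions {1, 5, 1, 64} along axis 2 gives {1, 5, 4, 64}.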

Changelog Category

Add a CHANGELOG.md entry for any option other than Not Applicable

- Added: New functionality.
- Changed: Changes to existing functionality.
- Removed: Functionality or support that has been removed. (Compared to a previous release)
- Optimized: Component performance that has been optimized or improved.
- Resolved Issues: Known issues from a previous version that have been resolved.
- Not Applicable: This PR is not to be included in the changelog.


github-actions · 2026-07-02T00:08:17Z

 }

+static std::vector<argument> ensure_gpu_kernel_args(const std::vector<argument>& args,
+                                                      pmr::vector<argument>& temps)


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only; realigns the continuation line)

pmr::vector<argument>& temps)

github-actions · 2026-07-02T00:08:17Z

+        const std::size_t num_concat =
+            v.get("num_concat_inputs", inputs.size());


[format.py] reported by reviewdog 🐶

Suggested change

-        const std::size_t num_concat =
-            v.get("num_concat_inputs", inputs.size());
+        const std::size_t num_concat = v.get("num_concat_inputs", inputs.size());

github-actions · 2026-07-02T00:08:17Z

+        concat_shapes.assign(inputs.begin(),
+                             inputs.begin() + std::min(num_concat, inputs.size()));
+        shape output_shape = v.contains("output_shape") ? from_value<shape>(v.at("output_shape"))
+                                                        : inputs.back();


[format.py] reported by reviewdog 🐶

Suggested change

-        concat_shapes.assign(inputs.begin(),
-                             inputs.begin() + std::min(num_concat, inputs.size()));
-        shape output_shape = v.contains("output_shape") ? from_value<shape>(v.at("output_shape"))
-                                                        : inputs.back();
+        concat_shapes.assign(inputs.begin(), inputs.begin() + std::min(num_concat, inputs.size()));
+        shape output_shape =
+            v.contains("output_shape") ? from_value<shape>(v.at("output_shape")) : inputs.back();
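
The `concat_shapes.assign(...)` line in this hunk keeps only the first `num_concat` entries of `inputs` and clamps the count with `std::min`, so the end iterator can never run past `inputs.end()`. A minimal standalone sketch of that pattern follows; `take_concat_inputs` is a hypothetical name used for illustration, and the element type is generic rather than the MIGraphX `shape` class.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the pattern in the diff: keep only the first
// `num_concat` entries of `inputs`, clamping the count so the end iterator
// never runs past inputs.end().
template <class T>
std::vector<T> take_concat_inputs(const std::vector<T>& inputs, std::size_t num_concat)
{
    std::vector<T> result;
    const std::size_t n = std::min(num_concat, inputs.size());
    result.assign(inputs.begin(), inputs.begin() + static_cast<std::ptrdiff_t>(n));
    return result;
}
```

Without the clamp, a `num_concat` larger than `inputs.size()` would form an iterator past the end of the vector, which is undefined behavior.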

github-actions · 2026-07-02T00:08:17Z

+        options.inputs   = inputs;
+        options.output   = output_shape;


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only; realigns the assignments)

options.inputs = inputs;
options.output = output_shape;

github-actions · 2026-07-02T00:08:17Z

-        auto args           = v.at("args");
+
+        // normalize() rewrites axis into reduced-dim space; kernel concat<Axis> uses full tensors.
+        std::size_t fast_axis = kernel_axis;


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only)

std::size_t fast_axis = kernel_axis;

github-actions · 2026-07-02T00:08:18Z

+
+        const std::size_t nelem =
+            output_shape.dynamic() ? output_shape.element_space() : output_shape.elements();
+        auto nelements_per_op     = nelem / op_names.size();


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only; realigns the assignment)

auto nelements_per_op = nelem / op_names.size();
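
The hunk above picks `element_space()` for dynamic output shapes and `elements()` for static ones, then divides the count by the number of ops. The sketch below shows one way to read that, under the assumption (not confirmed by this page) that the dynamic case sizes the work by an upper bound taken from the maximum of each dimension. `dim_range`, `max_elements`, and `elements_per_op` are hypothetical names, not MIGraphX APIs.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical model of a dimension with an inclusive [min, max] range; a
// static dimension has min == max. This is not the MIGraphX shape class.
struct dim_range
{
    std::size_t min;
    std::size_t max;
};

// Upper bound on the element count: the product of the per-dimension maxima.
std::size_t max_elements(const std::vector<dim_range>& dims)
{
    std::size_t n = 1;
    for(const auto& d : dims)
        n *= d.max;
    return n;
}

// Split the element budget evenly across the ops using integer division, as
// in the diff. `num_ops` must be non-zero.
std::size_t elements_per_op(const std::vector<dim_range>& dims, std::size_t num_ops)
{
    return max_elements(dims) / num_ops;
}
```

For the ranges {1, 1}, {5, 5}, {1, 64}, {64, 64}, the upper bound is 1 * 5 * 64 * 64 = 20480 elements, so two ops would each be assigned 10240.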

github-actions · 2026-07-02T00:08:18Z

+        auto psl = var("psl", {1, 64});
+        using dd = migraphx::shape::dynamic_dimension;
+
+        migraphx::shape past_shape{migraphx::shape::half_type, {dd{1, 1}, dd{5, 5}, dd{psl}, dd{64, 64}}};


[format.py] reported by reviewdog 🐶

Suggested change

-        migraphx::shape past_shape{migraphx::shape::half_type, {dd{1, 1}, dd{5, 5}, dd{psl}, dd{64, 64}}};
+        migraphx::shape past_shape{migraphx::shape::half_type,
+                                   {dd{1, 1}, dd{5, 5}, dd{psl}, dd{64, 64}}};

github-actions · 2026-07-02T00:08:18Z

+        auto* mm = p.get_main_module();
+        auto past_key   = mm->add_parameter("past_key_values.0.key", past_shape);


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only; realigns the assignments)

auto* mm = p.get_main_module();
auto past_key = mm->add_parameter("past_key_values.0.key", past_shape);

github-actions · 2026-07-02T00:08:18Z

+        return {{"past_key_values.0.key",
+                 migraphx::shape{migraphx::shape::half_type, {1, 5, 1, 64}}}};


[format.py] reported by reviewdog 🐶

Suggested change

-        return {{"past_key_values.0.key",
-                 migraphx::shape{migraphx::shape::half_type, {1, 5, 1, 64}}}};
+        return {
+            {"past_key_values.0.key", migraphx::shape{migraphx::shape::half_type, {1, 5, 1, 64}}}};
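
The test declares `past_key_values.0.key` with a dynamic shape whose third dimension is the variable `psl`, created with the range {1, 64}, and then supplies a concrete shape of {1, 5, 1, 64}. The standalone sketch below checks that a concrete shape falls inside such per-dimension ranges. It assumes `psl` behaves as an inclusive [1, 64] range; `dyn_dim` and `fits` are hypothetical names, not MIGraphX APIs.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical inclusive [min, max] range for one dimension (not the
// MIGraphX dynamic_dimension type).
struct dyn_dim
{
    std::size_t min;
    std::size_t max;
};

// True when `lens` has the same rank as `dims` and each length lies inside
// the corresponding inclusive range.
bool fits(const std::vector<dyn_dim>& dims, const std::vector<std::size_t>& lens)
{
    if(dims.size() != lens.size())
        return false;
    for(std::size_t i = 0; i < dims.size(); ++i)
    {
        if(lens[i] < dims[i].min || lens[i] > dims[i].max)
            return false;
    }
    return true;
}
```

Under that assumption, {1, 5, 1, 64} fits the ranges {1, 1}, {5, 5}, {1, 64}, {64, 64}, while a sequence length of 65 would not.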

github-actions · 2026-07-02T00:08:18Z

+        auto n  = var("n", {2, 3});
+        auto d0 = var("d0", {2, 4});
+        auto d1 = var("d1", {3, 4});
+        auto d2 = var("d2", {1, 5});


[format.py] reported by reviewdog 🐶

Suggested change (whitespace only; realigns the assignments)

auto n = var("n", {2, 3});
auto d0 = var("d0", {2, 4});
auto d1 = var("d1", {3, 4});
auto d2 = var("d2", {1, 5});

turneram added 5 commits June 25, 2026 11:53

Update jit concat to use dynamic shapes

1382c6e

Merge remote-tracking branch 'origin/develop' into dynamic-concat-pointwise

345e47d

Fixes

3b4c21f

Fix

f0fd492

Remove debug prints

63eb877

github-actions Bot reviewed Jul 2, 2026

turneram wants to merge 5 commits into develop from dynamic-concat-pointwise
