Scan support #16028
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16028
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 1 Unrelated Failure as of commit 90e55dd with merge base 9eaea4a.
NEW FAILURE - The following job has failed:
UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@JacobSzwejbka has imported this pull request. If you are a Meta employee, you can view this in D88107948.
fbf2088 to d563028
    op_table = program.execution_plan[0].operators
    instructions = program.execution_plan[0].chains[0].instructions

    # Collect all operator names in the program
honestly all the ops seem like implementation details and should not be tested
I was using it as a sort of proxy that the general pattern was emitted. If you want, we can just test the end-to-end behavior instead.
Don't have a strong opinion, but you might have to maintain this test if there's a change to the exported graph in the future
The ops we are querying over are the ones *not* in the original model definition, but instead created by the emitter to maintain the semantics of scan.
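For reference, a minimal sketch of what an end-to-end check could assert against instead of the op table; the reference loop and the example combine_fn below are illustrative, not code from this PR:

```python
import torch

def reference_scan(combine_fn, init, xs):
    # Eager reference for scan semantics: thread the carry through each
    # slice of xs along dim 0 and stack the per-step outputs.
    carry = init
    ys = []
    for i in range(xs.shape[0]):
        carry, y = combine_fn(carry, xs[i])
        ys.append(y)
    return carry, torch.stack(ys)

# Hypothetical combine_fn: running sum, where the carry is also the output.
combine = lambda c, x: (c + x, c + x)
final_carry, ys = reference_scan(combine, torch.zeros(3), torch.randn(5, 3))
# An end-to-end test would compare the lowered program's outputs to these.
```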
    2. et_copy_index(y_outputs, combine_fn's y output, iter_idx)
    This explicit copy approach is used because in-place op.out(x, out=x) is unsafe.
I was under the impression that this might be fine. We basically emit scan at the very end of the lowering process and I'm not convinced we still require the graph to be functional.
No, the problem isn't being functional; it's that ATen (and ET) ops are not guaranteed to work when in and out alias the same memory.
You could very easily write before read over sections of the tensor.
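A toy illustration of that write-before-read hazard (made up for this comment, not from the PR): an element-wise stencil that is correct with a separate output buffer but silently corrupts results when the output aliases the input.

```python
import torch

x = torch.arange(5, dtype=torch.float32)

# Safe: separate output buffer, every read sees the original input.
out = x.clone()
for i in range(1, x.numel()):
    out[i] = x[i] + x[i - 1]

# Unsafe: "out" aliases "in". By the time iteration i reads y[i - 1], that
# slot already holds the value written in iteration i - 1 (write before read).
y = torch.arange(5, dtype=torch.float32)
for i in range(1, y.numel()):
    y[i] = y[i] + y[i - 1]

print(out)  # tensor([0., 1., 3., 5., 7.])
print(y)    # tensor([0., 1., 3., 6., 10.]) -- diverges from the intended result
```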
        meta,
    )

    def call_scan(
@angelayi can you check that I'm not doing anything stupid here?
        # Use the placeholder's val which has the correct shape
        xs_element_data.append(ph.meta["val"])

    combine_fn_result = self.call_submodule(
I mostly copied torch.cond here with running call_submodule. Is this just so the subgraph also gets a chance to be run over by spec prop before calling the original? It just seems weird that I'm calling scan on this subgraph instead of the original one passed in as an arg.
    for i in range(0, len(xs)):
        ph = combine_fn_placeholders[num_init + i]
        # Use the placeholder's val which has the correct shape
        xs_element_data.append(ph.meta["val"])
I think this part is a little sus where you look at the subgraph's placeholder nodes. I think the xs_element_data should just be something like xs[0]?
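A rough sketch of that suggestion (assuming each entry of `xs` is an fx node whose `meta["val"]` holds the stacked fake tensor; the exact types flowing through this pass are an assumption):

```python
# Hypothetical alternative: derive the per-iteration element from the outer
# xs arguments rather than from the combine_fn subgraph's placeholders.
xs_element_data = [x.meta["val"].select(0, 0) for x in xs]
```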
Add support for the higher-order op scan. It's inefficient today because we are manually deep copying from output to input for every carry. We could do better by shallow swapping the pointers, but I'll do that in a follow-up if needed.
Test plan: Unit tests and internal verification against harder patterns
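To make the trade-off concrete, here is an illustrative eager-mode model of the two carry-handling strategies; the step function and buffer names are made up, and the emitted program's actual mechanics (et_copy_index into a value table) differ:

```python
import torch

def step(carry, x):
    # Hypothetical combine_fn: returns (new carry, per-step output).
    return carry + x, carry * x

xs = torch.randn(4, 3)

# Current approach: deep-copy the carry output back into the carry input
# buffer on every iteration.
carry_buf = torch.zeros(3)
for i in range(xs.shape[0]):
    new_carry, _y = step(carry_buf, xs[i])
    carry_buf.copy_(new_carry)  # one full copy per carry, per iteration

# Follow-up idea: swap which tensor plays the "carry input" role instead of
# copying its data (a shallow pointer swap in the emitted program).
carry_ref = torch.zeros(3)
for i in range(xs.shape[0]):
    new_carry, _y = step(carry_ref, xs[i])
    carry_ref = new_carry  # rebind; no data movement
```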