Metal backend: import qmv_impl from MLX #17187

manuelcandales · 2026-02-04T00:03:08Z

This pull request extends support in the Metal backend's 4-bit quantized linear operation (aoti_torch_mps__linear_fp_act_4bit_weight) to the case where the batch size (M) is 1 and the output dimension (N) is not a multiple of 4. To achieve this, we imported the qmv_impl Metal shader from MLX.

[ghstack-poisoned]

manuelcandales · 2026-02-04T00:03:09Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2026-02-04T00:03:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17187

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Unrelated Failure

As of commit 8ee7d60 with merge base ba6de95 ():

NEW FAILURES - The following jobs have failed:

pull / test-multimodal-linux (gemma3-4b) / linux-job (gh)
RuntimeError: Command docker exec -t b7afc97a5951192b2ca3603fc045800cbf31751e2147f4021194f4a68b08b728 /exec failed with exit code 139
pull / unittest / macos / macos-job (gh)
backends/xnnpack/test/ops/test_conv2d.py::TestConv2d::test_fp16_conv2d
pull / unittest-arm-backend-with-no-deps (test_pytest_models_tosa) / linux-job (gh)
RuntimeError: Command docker exec -t 0728b7143aa8e0dbc6b93ed79e7bed4e71f452306e64e86c203cd8c4224f5f46 /exec failed with exit code 1
pull / unittest-buck / linux / linux-job (gh)
RuntimeError: Command docker exec -t 242f03543337b2858d57468f3e35c339e7b47570646cdbeb69ef30b71e127157 /exec failed with exit code 3

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest-buck / macos / macos-job (gh) (trunk failure)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 3

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Update

8ee7d60

[ghstack-poisoned]

manuelcandales requested review from cccclai and shoumikhin as code owners February 4, 2026 00:03

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metal backend: import qmv_impl from MLX #17187

Metal backend: import qmv_impl from MLX #17187

manuelcandales commented Feb 4, 2026 •

edited

Loading

Uh oh!

manuelcandales commented Feb 4, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Metal backend: import qmv_impl from MLX #17187

Are you sure you want to change the base?

Metal backend: import qmv_impl from MLX #17187

Conversation

manuelcandales commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

manuelcandales commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17187

❌ 4 New Failures, 1 Unrelated Failure

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

manuelcandales commented Feb 4, 2026 •

edited

Loading

manuelcandales commented Feb 4, 2026 •

edited

Loading

pytorch-bot bot commented Feb 4, 2026 •

edited

Loading