
Metal backend: support 4-bit linear #17117

Open

manuelcandales wants to merge 5 commits into gh/manuelcandales/153/head from gh/manuelcandales/154/head

Conversation


manuelcandales (Contributor) commented on Feb 2, 2026

This pull request adds support for the 4-bit quantized linear operator in the Metal backend.

Dtype support improvements:

  • Added support for the uint8 dtype (PyTorch dtype code 0) in the AOTInductor shims, including a new aoti_torch_dtype_uint8() function and updates to the dtype conversion utilities and supported-dtype checks (backends/aoti/common_shims.cpp, backends/aoti/common_shims.h, backends/aoti/utils.h, backends/apple/metal/runtime/shims/utils.cpp, backends/apple/metal/runtime/shims/utils.h); a sketch follows below.
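
As a rough illustration of the dtype plumbing described above, the C++ sketch below shows an AOTInductor-style dtype shim plus a supported-dtype check. The aoti_torch_dtype_uint8() name comes from the PR description; the is_supported_metal_dtype() helper and the exact set of accepted codes are illustrative assumptions, not the actual implementation.

```cpp
// Minimal sketch, not the actual ExecuTorch sources. Dtype codes follow
// c10::ScalarType (Byte == 0, Float == 6, BFloat16 == 15).
#include <cstdint>

extern "C" {
// New shim mentioned in the PR: returns the PyTorch dtype code for uint8.
int32_t aoti_torch_dtype_uint8() {
  return 0; // c10::ScalarType::Byte
}
}

// Hypothetical helper standing in for the "supported dtype checks" this PR
// extends; the real check lives in backends/apple/metal/runtime/shims/utils.cpp.
inline bool is_supported_metal_dtype(int32_t dtype_code) {
  constexpr int32_t kUint8 = 0;     // torch.uint8 (newly supported)
  constexpr int32_t kFloat32 = 6;   // torch.float32
  constexpr int32_t kBFloat16 = 15; // torch.bfloat16
  return dtype_code == kUint8 || dtype_code == kFloat32 ||
         dtype_code == kBFloat16;
}
```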

Metal backend kernel and operator integration:

  • Registered the new torchao::_linear_fp_act_4bit_weight operator as a supported fallback kernel and added its C shim signature for the Metal backend (backends/apple/metal/metal_backend.py, backends/apple/metal/runtime/shims/et_metal_ops.h); a hedged sketch of such a declaration follows below.
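
For context, fallback-kernel C shims in the AOTI flow are typically declared as extern "C" functions over opaque tensor handles. The sketch below is only a guess at the shape of such a declaration: the shim name, parameter order, and types are hypothetical, and the real signature is the one added to backends/apple/metal/runtime/shims/et_metal_ops.h.

```cpp
// Illustrative sketch only; names and parameters are assumptions, not the
// signature added by this PR.
#include <cstdint>

using AOTITensorHandle = void*; // opaque tensor handle used by AOTI shims
using AOTITorchError = int32_t; // 0 on success, non-zero on failure

extern "C" {
// Hypothetical C shim for torchao::_linear_fp_act_4bit_weight: a floating
// point activation multiplied by a 4-bit, group-quantized weight.
AOTITorchError aoti_torch_mps__linear_fp_act_4bit_weight(
    AOTITensorHandle input,            // fp activation
    AOTITensorHandle packed_weights,   // 4-bit packed weights (uint8 storage)
    int64_t group_size,                // quantization group size
    AOTITensorHandle scales_and_zeros, // per-group scales and zero points
    AOTITensorHandle* out);            // output tensor handle
}
```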

Build and logging enhancements:

  • Updated the Metal backend compile options to inject the custom-operator C shims, and improved logging around shared-library loading in the Metal backend (backends/apple/metal/metal_backend.py, backends/apple/metal/runtime/metal_backend.cpp); a logging sketch follows below.
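
The logging improvement is the kind of change sketched below: report the library path before loading and surface dlerror() if dlopen fails. The function name is hypothetical, and the actual code in backends/apple/metal/runtime/metal_backend.cpp may structure this differently.

```cpp
// Minimal sketch, assuming ExecuTorch's ET_LOG macros; not the PR's exact code.
#include <dlfcn.h>
#include <string>

#include <executorch/runtime/platform/log.h>

// Hypothetical helper: load the AOTI-generated shared library and log both the
// attempt and any failure reason from dlerror().
void* load_metal_aoti_library(const std::string& so_path) {
  ET_LOG(Info, "Loading Metal AOTI shared library from %s", so_path.c_str());
  void* handle = dlopen(so_path.c_str(), RTLD_NOW | RTLD_LOCAL);
  if (handle == nullptr) {
    const char* err = dlerror();
    ET_LOG(Error, "dlopen failed for %s: %s", so_path.c_str(),
           err ? err : "unknown error");
  }
  return handle;
}
```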

pytorch-bot (bot) commented on Feb 2, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17117

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Unrelated Failure

As of commit c5a3c1a with merge base ba6de95:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following job failed but was already present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-cla bot added the CLA Signed label on Feb 2, 2026 (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed).
manuelcandales requested review from larryliu0820 and mergennachin, and removed the review request for cccclai and shoumikhin, on February 2, 2026 at 21:31
Comment on lines +33 to +50
```bash
# Function to update and build torchao
update_and_build_torchao() {
  echo "Building torchao..."
  TORCHAO_DIR="$EXECUTORCH_ROOT/third-party/ao"
  if [[ -d "$TORCHAO_DIR" ]]; then
    cd "$TORCHAO_DIR"
    echo "Pulling latest changes from ao repository..."
    git checkout main
    git pull origin main
    USE_CPP=1 TORCHAO_BUILD_EXPERIMENTAL_MPS=1 pip install . --no-build-isolation
    cd "$EXECUTORCH_ROOT"
    echo "torchao build complete"
  else
    echo "Error: torchao directory not found at $TORCHAO_DIR"
    exit 1
  fi
}
```

A reviewer (Contributor) commented:
No need for this. Just upgrade the pin
