Add native support for Vision-Language-Action (VLA) models on edge devices #17079

@bhu619

Description

🚀 The feature, motivation and pitch

I am currently working on deploying vision-language-action (VLA) models, such as OpenVLA and Pi-0, to edge devices for real-time robot control, and I plan to use ExecuTorch as the on-device deployment framework. However, it is currently unclear whether ExecuTorch can successfully export VLA models and execute them on device.
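
For concreteness, here is a minimal sketch of the export path I would expect to follow, built on ExecuTorch's documented `torch.export` → `to_edge` → `to_executorch` workflow. The `ToyVLAPolicy` below is a hypothetical stand-in written for illustration, not the real OpenVLA or Pi-0 architecture; whether these same steps succeed on the actual models is exactly the open question.

```python
import torch
from torch.export import export
from executorch.exir import to_edge


class ToyVLAPolicy(torch.nn.Module):
    """Hypothetical stand-in for a VLA model: image + token ids -> actions."""

    def __init__(self, hidden: int = 64, vocab: int = 32000, action_dim: int = 7):
        super().__init__()
        # Patchify the camera frame (proxy for a ViT vision encoder).
        self.vision = torch.nn.Conv2d(3, hidden, kernel_size=16, stride=16)
        # Embed the instruction tokens (proxy for the language backbone).
        self.text = torch.nn.Embedding(vocab, hidden)
        # Regress a continuous action vector from pooled multimodal features.
        self.action_head = torch.nn.Linear(hidden, action_dim)

    def forward(self, image: torch.Tensor, input_ids: torch.Tensor) -> torch.Tensor:
        img_feat = self.vision(image).flatten(2).mean(-1)  # (B, hidden)
        txt_feat = self.text(input_ids).mean(1)            # (B, hidden)
        return self.action_head(img_feat + txt_feat)       # (B, action_dim)


policy = ToyVLAPolicy().eval()
example_inputs = (
    torch.randn(1, 3, 224, 224),           # camera frame
    torch.zeros(1, 32, dtype=torch.long),  # tokenized instruction
)

# torch.export -> Edge dialect -> ExecuTorch program, serialized as .pte.
exported = export(policy, example_inputs)
et_program = to_edge(exported).to_executorch()

with open("vla_policy.pte", "wb") as f:
    f.write(et_program.buffer)
```

This toy module exports cleanly; a real OpenVLA or Pi-0 export would presumably also need a backend partitioner (e.g. XNNPACK) and dynamic shapes for the instruction length, which is where I would expect missing-operator or lowering issues to surface.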

I therefore hope the ExecuTorch team will consider adding native support for VLA models, enabling privacy-preserving, low-latency robotic applications on resource-constrained devices such as mobile robots and drones. This would address a critical gap: while ExecuTorch already supports a number of VLMs and LLMs, it currently lacks support for the action-generation module that is essential for embodied intelligence.

Alternatives

No response

Additional context

No response

RFC (Optional)

No response
