-
Notifications
You must be signed in to change notification settings - Fork 738
Open
Labels
feature requestFunctionality does not currently exist, would need to be created as a new feature (type)Functionality does not currently exist, would need to be created as a new feature (type)
Description
🌱 Describe Feature Request
I trained a Transformer model. When I converted it as a whole into an mlmodel, I found that its intelligence could only be processed on the cpu. After splitting it into encode and decode, I discovered that encode could be normally accelerated using the NPU, but decode could only be processed on the cpu. Is it because decode is self-decoding, not a static issue? If decode can be accelerated by NPU, could a method for converting pt to mlmodel or mlpackage be provided
thanks
Metadata
Metadata
Assignees
Labels
feature requestFunctionality does not currently exist, would need to be created as a new feature (type)Functionality does not currently exist, would need to be created as a new feature (type)