Significant Performance Discrepancy: Code fast on x86, very slow on Jetson Orin (ARM)

I am experiencing a major performance difference  when running on an NVIDIA Jetson Orin (ARM64) compared to a modern x86-64 system. The same code, with the same data input, runs slower on the ARM platform. The difference mainly because of the [decoding part of t2s.infer_panel_batch_infer ](https://github.com/RVC-Boss/GPT-SoVITS/blob/11aa78bd9bda8b53047cfcae03abf7ca94d27391/GPT_SoVITS/AR/models/t2s_model.py#L697)

I am seeking the guidance on  ARM-specific optimizations for this code section.






Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Significant Performance Discrepancy: Code fast on x86, very slow on Jetson Orin (ARM) #2604

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Significant Performance Discrepancy: Code fast on x86, very slow on Jetson Orin (ARM) #2604

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions