 High-performance inference of [OpenAI's Whisper](https://github.com/openai/whisper) automatic speech recognition (ASR) model:

 - Plain C/C++ implementation without dependencies
-- Apple Silicon first-class citizen - optimized via ARM NEON, Accelerate framework, Metal and [Core ML](https://github.com/ggerganov/whisper.cpp#core-ml-support)
+- Apple Silicon first-class citizen - optimized via ARM NEON, Accelerate framework, Metal and [Core ML](#core-ml-support)
 - AVX intrinsics support for x86 architectures
 - VSX intrinsics support for POWER architectures
 - Mixed F16 / F32 precision
-- [4-bit and 5-bit integer quantization support](https://github.com/ggerganov/whisper.cpp#quantization)
+- [4-bit and 5-bit integer quantization support](#quantization)
 - Zero memory allocations at runtime
+- [Vulkan support](#vulkan-gpu-support)
 - Support for CPU-only inference
-- [Efficient GPU support for NVIDIA](https://github.com/ggerganov/whisper.cpp#nvidia-gpu-support-via-cublas)