GitHub - Blaizzy/mlx-audio: A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Key Points
- MLX-Audio is a library built on Apple's MLX framework, designed for fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.
- It supports a variety of multilingual models for each task, offering features such as voice customization, voice cloning, and speech enhancement, with quantization options for performance.
- The library provides a command-line interface, a Python API, an interactive web interface, an OpenAI-compatible REST API, and tools for model conversion and quantization.
MLX-Audio is an audio processing library built on Apple's MLX framework, providing optimized Text-to-Speech (TTS), Speech-to-Text (STT), and Speech-to-Speech (STS) capabilities on Apple Silicon (M-series chips). Its core approach is to harness MLX's performance benefits for accelerated on-device inference.
The library functions as an abstraction layer over various state-of-the-art deep learning models, providing a unified interface for speech synthesis, recognition, and transformation. Its key architectural choice, building directly on MLX, allows for efficient memory utilization and computation on Apple's unified memory architecture, leading to significantly faster processing times compared to general-purpose frameworks on this hardware.
Core Methodologies and Technical Details:
- Optimized Inference via MLX Framework: The fundamental technical aspect is the direct integration with Apple's MLX framework. MLX is designed to maximize throughput and minimize latency on Apple Silicon by optimizing operations for its unique hardware characteristics, including the CPU, GPU, and Neural Engine. MLX-Audio converts and runs pre-trained models (originally from frameworks like Hugging Face Transformers) into MLX-compatible formats, enabling these models to execute directly on the optimized MLX runtime. This translates to faster model loading, lower memory footprint, and high inference speeds for all integrated functionalities.
- Model Architectures and Functionalities:
- Text-to-Speech (TTS): MLX-Audio supports diverse TTS models, each employing different techniques:
- Kokoro: A fast, high-quality multilingual TTS model. While its specific architecture isn't detailed, such models typically comprise a text-to-phoneme converter, an acoustic model (e.g., Tacotron, FastSpeech), and a vocoder (e.g., WaveNet, HiFi-GAN) that synthesizes a waveform from acoustic features. It offers predefined voice presets and speed control.
- Qwen3-TTS: This model offers advanced voice design capabilities through multiple variants:
- *Base Model:* Generates speech with predefined voices.
- *CustomVoice Model:* Extends the base model by allowing emotion control (`instruct`) during synthesis, likely achieved by conditioning the acoustic model or vocoder on emotion embeddings derived from the `instruct` parameter.
- *VoiceDesign Model:* Enables the creation of novel voices from textual descriptions (`instruct`), implying an underlying voice embedding network that can synthesize speaker characteristics from natural-language prompts, which then guide the speech generation process. This likely leverages latent-space manipulation or sophisticated conditioning mechanisms.
- CSM (Conversational Speech Model): Focuses on voice cloning. This typically involves an encoder that extracts a speaker embedding from a reference audio file (`ref_audio.wav`). The speaker embedding is then used as a conditioning input to the TTS model (acoustic model and vocoder) to synthesize speech in the target speaker's voice, enabling zero-shot or few-shot voice cloning.
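The conditioning idea behind voice cloning can be sketched in a few lines. Everything below is a toy illustration with made-up function names, not mlx-audio's actual API: a "speaker encoder" summarizes a reference waveform into a fixed-size embedding, and the synthesis step is conditioned on that embedding.

```python
# Toy sketch of speaker-embedding conditioning for voice cloning.
# All names here are hypothetical; mlx-audio's real classes and models differ.

def speaker_encoder(ref_audio: list[float], dim: int = 4) -> list[float]:
    """Toy 'encoder': summarize a reference waveform into a fixed-size embedding
    by averaging consecutive chunks (real encoders are learned networks)."""
    chunk = max(1, len(ref_audio) // dim)
    means = [sum(ref_audio[i:i + chunk]) / chunk
             for i in range(0, len(ref_audio), chunk)]
    return means[:dim]

def synthesize(text_features: list[float], speaker_emb: list[float]) -> list[float]:
    """Toy 'acoustic model': bias every text feature by the speaker embedding,
    so the same text yields different output for different reference speakers."""
    bias = sum(speaker_emb) / len(speaker_emb)
    return [t + bias for t in text_features]

ref = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8]   # stand-in for ref_audio.wav samples
emb = speaker_encoder(ref)                         # one embedding per reference speaker
audio = synthesize([1.0, 2.0], emb)                # speech conditioned on that speaker
```

The key design point survives the simplification: the reference audio influences synthesis only through a compact embedding, which is what makes zero-shot cloning from a single `ref_audio.wav` possible.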
- Speech-to-Text (STT):
- Whisper: Implements OpenAI's robust STT model, often based on a Transformer architecture that processes audio directly (or via mel-spectrograms) and outputs text tokens.
- VibeVoice-ASR (Microsoft): A large-parameter ASR model that includes advanced features like speaker diarization and timestamping.
- *Speaker Diarization:* Identifies "who spoke when." This typically involves clustering voice activity detection (VAD) segments based on speaker embeddings, often using techniques like x-vectors or d-vectors combined with clustering algorithms (e.g., K-means, Agglomerative Clustering). The output includes speaker IDs for each segment.
- *Timestamping:* Provides start and end times for words or phrases. This is usually achieved through forced alignment or by training the ASR model to predict timestamps directly alongside text tokens.
- *Context (Hotwords):* Allows boosting recognition of specific terms, which can be implemented by biasing the decoding process (e.g., through prefix scoring or a custom language model) towards the provided keywords.
- *Streaming Transcription:* Processes audio segments incrementally, outputting text tokens as they are generated. This requires a streaming-capable ASR architecture, often employing a causal decoder or a chunk-based processing approach.
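The clustering step of speaker diarization described above can be sketched with a greedy threshold-based grouping of (toy) speaker embeddings. This is a conceptual illustration, not VibeVoice-ASR's method: real systems use learned x-vectors/d-vectors and proper agglomerative or K-means clustering.

```python
# Minimal sketch of diarization's clustering step: segments whose (toy) speaker
# embeddings are similar enough get the same speaker ID.
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def assign_speakers(embeddings: list[list[float]], threshold: float = 0.9) -> list[int]:
    """Greedy clustering: attach each segment to the first cluster whose
    representative embedding is within the similarity threshold."""
    reps: list[list[float]] = []   # one representative embedding per speaker
    labels: list[int] = []
    for emb in embeddings:
        for spk, rep in enumerate(reps):
            if cosine(emb, rep) >= threshold:
                labels.append(spk)
                break
        else:
            labels.append(len(reps))  # unseen voice: open a new cluster
            reps.append(emb)
    return labels

segments = [[1.0, 0.0], [0.99, 0.05], [0.0, 1.0], [0.98, 0.1]]
print(assign_speakers(segments))  # → [0, 0, 1, 0]: segments 0, 1, 3 share a speaker
```

Paired with per-segment start/end times from VAD, these labels yield the "who spoke when" output.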
- Speech-to-Speech (STS):
- SAM-Audio (Source Separation): Utilizes text-guided source separation, a novel approach in which a text description (e.g., "A person speaking") acts as a query to separate a target sound from a mixed audio input. Methodologically, this might involve cross-modal attention mechanisms where text embeddings guide the attention of an audio processing network (e.g., a Transformer-based U-Net) to focus on relevant sound components and suppress others. The output consists of a target audio stream and a residual stream. The `separate_long` method indicates chunking and overlap-add processing for long audio files to manage memory and computation.
- MossFormer2 SE (Speech Enhancement): Designed for noise removal. This typically employs neural network architectures (e.g., U-Nets, Conv-TasNet variants) that learn to map noisy speech waveforms or spectrograms to clean speech. The "SE" (Speech Enhancement) designation implies a focus on improving speech quality by suppressing background noise, often through spectral estimation or masking techniques.
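The chunked overlap-add processing presumably behind a `separate_long`-style method can be sketched as follows. The window and hop sizes are made up, and the separation model is replaced by an identity stand-in; the point is how overlapping windows are accumulated and averaged so long audio never has to fit in memory at once.

```python
# Hedged sketch of chunked overlap-add processing for long audio: run the model
# on fixed-size overlapping windows, accumulate the results, and normalize by
# how many windows covered each sample. Sizes are illustrative only.

def process(chunk: list[float]) -> list[float]:
    # Stand-in for the actual separation model; identity here so the
    # reconstruction can be checked exactly.
    return list(chunk)

def overlap_add(samples: list[float], win: int = 8, hop: int = 4) -> list[float]:
    out = [0.0] * len(samples)
    weight = [0.0] * len(samples)
    for start in range(0, len(samples), hop):
        chunk = samples[start:start + win]
        if not chunk:
            break
        for i, v in enumerate(process(chunk)):
            out[start + i] += v        # accumulate overlapping windows
            weight[start + i] += 1.0   # count coverage for averaging
    return [o / w for o, w in zip(out, weight)]

audio = [float(i) for i in range(16)]
restored = overlap_add(audio)  # with an identity model, input is reconstructed
```

Real implementations typically apply a tapered cross-fade at chunk boundaries rather than plain averaging, to avoid audible seams.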
- Quantization: MLX-Audio supports quantization (3-bit, 4-bit, 6-bit, 8-bit) via a `convert` script.
- Technical Process: This involves reducing the precision of model weights and activations from higher-precision floating-point formats (e.g., `float32`, `bfloat16`) to lower-bit integer representations. The `--q-bits` parameter specifies the target bit width.
- Group Size (`--q-group-size`): Quantization can be applied per tensor or per group. Group-wise quantization applies the same scaling factor and zero-point to a small group of weights (e.g., 64 weights), which helps maintain accuracy compared to per-tensor quantization while still offering significant compression and speedup.
- Benefits: Reduces model size, decreases memory bandwidth requirements, and can improve inference speed on hardware supporting low-precision arithmetic, which is especially beneficial on resource-constrained devices or for larger models.
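Group-wise affine quantization can be illustrated in pure Python. This is a conceptual sketch of the idea behind `--q-bits` and `--q-group-size`, not MLX's actual kernel: each group gets its own scale and zero-point, so the quantization step adapts to that group's value range.

```python
# Illustrative group-wise affine quantization: per-group scale and zero-point,
# 4-bit integers. A conceptual sketch, not MLX's implementation.

def quantize_group(group: list[float], bits: int = 4):
    lo, hi = min(group), max(group)
    levels = (1 << bits) - 1                    # 15 integer levels for 4-bit
    scale = (hi - lo) / levels or 1.0           # avoid zero scale for flat groups
    q = [round((w - lo) / scale) for w in group]  # integers in [0, levels]
    return q, scale, lo                         # lo serves as the zero-point

def dequantize_group(q: list[int], scale: float, zero: float) -> list[float]:
    return [v * scale + zero for v in q]

# Two groups with very different ranges: per-group scales keep both accurate.
weights = [0.10, -0.32, 0.05, 0.48, 1.90, 2.05, 1.75, 2.20]
group_size = 4
recon = []
for i in range(0, len(weights), group_size):
    q, s, z = quantize_group(weights[i:i + group_size])
    recon.extend(dequantize_group(q, s, z))

max_err = max(abs(a - b) for a, b in zip(weights, recon))
# The round-trip error stays within half a quantization step of each group.
```

A single per-tensor scale would have to span the full range of all eight weights, giving a coarser step and larger error for the small-magnitude group; this is precisely the accuracy benefit of group-wise quantization described above.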
- OpenAI-Compatible API: The library provides an OpenAI-compatible REST API, allowing developers to integrate its functionalities using familiar endpoint structures (`/v1/audio/speech`, `/v1/audio/transcriptions`). This facilitates interoperability and eases adoption for developers already familiar with OpenAI's API.
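A client request against such an endpoint might look like the sketch below. The `/v1/audio/speech` path comes from the text above; the JSON field names (`model`, `input`, `voice`) mirror OpenAI's speech API, which the server presumably accepts, and the host, port, and voice name are assumptions for illustration.

```python
# Sketch of an OpenAI-style TTS request using only the standard library.
# Host/port and the model/voice values are illustrative assumptions.
import json
from urllib import request

def build_speech_request(base_url: str, model: str, text: str, voice: str) -> request.Request:
    payload = json.dumps({"model": model, "input": text, "voice": voice}).encode()
    return request.Request(
        base_url + "/v1/audio/speech",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_speech_request("http://localhost:8000", "kokoro", "Hello from MLX.", "af_heart")
# request.urlopen(req) would return the synthesized audio bytes from the server.
```

Because the request shape matches OpenAI's, existing OpenAI client code can usually be pointed at the local server by changing only the base URL.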
- Multilingual Support: Models like Kokoro, Qwen3-TTS, and Whisper offer extensive multilingual capabilities, handled by training on diverse language datasets and using language-specific tokens or conditioning mechanisms within the model architectures.
In summary, MLX-Audio is a comprehensive audio AI library distinguished by its deep integration with Apple's MLX framework for performance, its broad support for various state-of-the-art TTS, STT, and STS models, and its technical features such as advanced voice design, speaker diarization, text-guided source separation, and efficient quantization for deployment.