GitHub - jamiepine/voicebox: The open-source voice synthesis studio powered by Qwen3-TTS.

GitHub - jamiepine/voicebox: The open-source voice synthesis studio powered by Qwen3-TTS.

Service

GitHub - jamiepine/voicebox: The open-source voice synthesis studio powered by Qwen3-TTS.

jamiepine

2026.02.23

·GitHub·by 이호민

#AI#Open Source#TTS#Voice Cloning#Voice Synthesis

핵심 포인트

1Voicebox는 ElevenLabs와 같은 클라우드 서비스의 대안으로 설계된 오픈소스 로컬 우선 음성 합성 스튜디오이며, 사용자가 자신의 기기에서 직접 음성을 복제하고 생성할 수 있도록 합니다.
2Qwen3-TTS 기반의 고품질 음성 클로닝, 멀티트랙 타임라인 에디터, 그리고 API를 제공하며, Apple Silicon에서는 MLX를 활용하여 4-5배 빠른 추론 속도를 자랑합니다.
3Tauri, FastAPI, Qwen3-TTS 및 Whisper 모델로 구축된 Voicebox는 완전한 개인 정보 보호와 로컬 제어를 강조하며, 실시간 합성 및 다양한 모델 지원으로 기능을 확장할 계획입니다.