Synthesize speech locally using llama-tts and the OuteTTS-1.0-0.6B model.
You can use the wrapper script:
scripts/tts-local.sh [options] "" -o, --output : Output WAV file (default: output.wav)-s, --speaker : Speaker reference file (optional)-t, --temp : Temperature (default: 0.4)scripts/tts-local.sh (inside skill folder)/data/public/machine-learning/models/text-to-speach/OuteTTS-1.0-0.6B-Q4_K_M.gguf/data/public/machine-learning/models/text-to-speach/WavTokenizer-Large-75-Q4_0.ggufllama-tts.Place files in /data/public/machine-learning/models/text-to-speach/ or update scripts/tts-local.sh.
The model card recommends the following settings (hardcoded in the script):
共 1 个版本