generate

Packages

chatterbox
- chatterbox Reference
cornfab
- cornfab Reference
diffuseR
earshot
- run_app
fyi
- fyi Reference
llamaR
- llamaR Reference
rapt
- rapt Reference
rformat
- rformat Reference
saber
- saber Reference
stt.api
tinypkgr
- tinypkgr Reference
tinyrox
- tinyrox Reference
tts.api
- tts.api Reference
whisper

Generate speech from text

Description

Generate speech from text

Usage

generate(
  model,
  text,
  voice,
  exaggeration = 0.5,
  cfg_weight = 0.5,
  temperature = 0.8,
  top_p = 0.9,
  autocast = NULL,
  traced = FALSE
)

Arguments

model: Chatterbox model
text: Text to synthesize
voice: Voice embedding from create_voice_embedding() or path to reference audio
exaggeration: Emotion/expression exaggeration level (0-1, default 0.5)
cfg_weight: Classifier-free guidance weight (higher = more adherence to text, default 0.5)
temperature: Sampling temperature (default 0.8)
top_p: Top-p (nucleus) sampling threshold (default 0.9)
autocast: Use mixed precision (float16) on CUDA for faster inference (default TRUE on CUDA)

Value

List with audio (numeric vector) and sample_rate