Right now for
POST /v0/tts
.utterances[].speed` the docstring is
> Speed multiplier for the synthesized speech. Extreme values below 1 and above 1.5 may sometimes cause instability to the generated output.
This is a feature request to support more stability for generations with speed outside of this range.