Both files are the same article (Speaker for the Dead) synthesized with the same Piper TTS model. The only difference is how piper was invoked:

- Baseline: generated in ~22.8s on average across 5 runs
- Batch: generated in ~16.1s on average across 5 runs

| Run | Baseline | Batch |
|---|---|---|
| 1 | 24.32s | 16.65s |
| 2 | 21.11s | 13.77s |
| 3 | 24.25s | 16.90s |
| 4 | 20.77s | 16.08s |
| 5 | 23.62s | 17.28s |
| Avg | 22.81s | 16.13s |
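
To put the two averages in relation to each other, the per-run timings from the table can be reduced to a speedup factor and a relative time saving. The snippet below is a minimal sketch of that arithmetic; the variable names are illustrative, and the only inputs are the ten timings listed above.

```python
from statistics import mean

# Per-run wall-clock times in seconds, copied from the table above.
baseline = [24.32, 21.11, 24.25, 20.77, 23.62]
batch = [16.65, 13.77, 16.90, 16.08, 17.28]

baseline_avg = mean(baseline)
batch_avg = mean(batch)

# Speedup: how many times faster the batch invocation is on average.
speedup = baseline_avg / batch_avg

# Relative saving: fraction of the baseline wall-clock time that is avoided.
time_saved = 1 - batch_avg / baseline_avg

print(f"baseline avg: {baseline_avg:.1f}s")
print(f"batch avg:    {batch_avg:.1f}s")
print(f"speedup:      {speedup:.2f}x")
print(f"time saved:   {time_saved:.0%}")
```

On these numbers the batch invocation comes out roughly 1.41x faster, which is about 29% less wall-clock time. With only five runs per variant and a spread of several seconds between runs, this is best read as a rough estimate.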