How does Salad output sentence and word-level transcription?
Here is an example of how the Salad API outputs sentence and word-level time stamps along with speak name. You'll find additional instructions on our Docs page, here.
----- .JSON example start -----
"output": {
"segments": [
{
"start": 19.2,
"end": 19.903,
"text": "Thank you.",
"words": [
{
"word": "Thank",
"start": 19.662,
"end": 19.783,
"score": 0.232
},
{
"word": "you.",
"start": 19.803,
"end": 19.903,
"score": 0.545
}
],
"speaker": "SPEAKER_00"
}