How does Salad output sentence- and word-level transcription?

Here is an example of how the Salad API outputs sentence- and word-level timestamps along with the speaker name. You'll find additional instructions on our Docs page.
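
If you want to read this output in code, the following Python sketch may help. It assumes the response is a JSON object with the same shape as the example in this article, wrapped in outer braces so it parses on its own. The helper name `format_segments` is hypothetical and is not part of the Salad API. The `score` value is printed as-is, without interpreting it.

```python
import json

# Sample shaped like the JSON example in this article.
# Assumption: the real response wraps "output" in an outer object.
SAMPLE = """
{
  "output": {
    "segments": [
      {
        "start": 19.2,
        "end": 19.903,
        "text": "Thank you.",
        "words": [
          {"word": "Thank", "start": 19.662, "end": 19.783, "score": 0.232},
          {"word": "you.", "start": 19.803, "end": 19.903, "score": 0.545}
        ],
        "speaker": "SPEAKER_00"
      }
    ]
  }
}
"""


def format_segments(response):
    """Return one line per sentence-level segment, then one line per word in it."""
    lines = []
    for seg in response["output"]["segments"]:
        # Fall back to a placeholder if a segment has no speaker label.
        speaker = seg.get("speaker", "UNKNOWN")
        lines.append(
            f"[{seg['start']:.3f}-{seg['end']:.3f}] {speaker}: {seg['text']}"
        )
        for w in seg.get("words", []):
            lines.append(
                f"    {w['word']} ({w['start']:.3f}-{w['end']:.3f}, "
                f"score {w['score']:.3f})"
            )
    return lines


if __name__ == "__main__":
    for line in format_segments(json.loads(SAMPLE)):
        print(line)
```

Each segment carries the sentence-level `start`, `end`, `text`, and `speaker` fields, while the nested `words` list carries the word-level timing, so a single pass over `segments` is enough to recover both levels.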

----- .JSON example start -----

"output": {
  "segments": [
    {
      "start": 19.2,
      "end": 19.903,
      "text": "Thank you.",
      "words": [
        {
          "word": "Thank",
          "start": 19.662,
          "end": 19.783,
          "score": 0.232
        },
        {
          "word": "you.",
          "start": 19.803,
          "end": 19.903,
          "score": 0.545
        }
      ],
      "speaker": "SPEAKER_00"
    }