Recognized word text. May include a #tag suffix if an alternative pronunciation was matched.
Start time of this word in seconds from the beginning of the audio.
Duration of this word in seconds.
Whether this word is a special token (e.g., <SPOKEN_NOISE>).
phones (optional): Array of phonemes that comprise this word.
Word-level recognition result with timing and optional phone details.
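
As an illustration only, the word-level result could be modeled on the client side roughly as follows. This is a minimal sketch, not the service's actual schema: apart from `phones`, every field name here (`text`, `start`, `duration`, `is_special`) is hypothetical, as are the `end` and `base_text` helpers, the assumption that the tag is everything after the first `#`, and the sample value `read#2`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Word:
    """Client-side view of one recognized word (field names are assumed)."""

    text: str                      # recognized text, possibly with a #tag suffix
    start: float                   # seconds from the beginning of the audio
    duration: float                # seconds
    is_special: bool = False       # True for tokens such as <SPOKEN_NOISE>
    phones: Optional[list] = None  # phoneme details; absent unless provided

    @property
    def end(self) -> float:
        # End time is derived, since only start and duration are given.
        return self.start + self.duration

    @property
    def base_text(self) -> str:
        # Assumption: the pronunciation tag is everything after the first '#'.
        return self.text.split("#", 1)[0]


w = Word(text="read#2", start=1.0, duration=0.5)
print(w.base_text, w.end)  # prints: read 1.5
```

Keeping `phones` as `None` by default mirrors its optional status, so callers can check for its presence before iterating.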