You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With respect to post-text speech recognition (e.g. speech-to-SSML, speech-to-hypertext, speech-to-X1), we can consider:
from:
// Item in N-best list
[Exposed=Window]
interfaceSpeechRecognitionAlternative {
readonly attribute DOMString transcript;
readonly attribute float confidence;
};
to:
// Item in N-best list
[Exposed=Window]
interfaceSpeechRecognitionAlternative {
readonly attribute object transcript;
readonly attribute float confidence;
};
then client-side, server-side or third-party components or services could return either text or XML content per recognition result. That is, transcript could be either a DOMString or a DOMElement.
Speech-to-text is too lossy. Information pertaining to prosody, intonation, emphases and pauses are discarded in text-formatted output. Such information can be useful, for instance, in informing machine translation components and services.
The text was updated successfully, but these errors were encountered:
With respect to post-text speech recognition (e.g. speech-to-SSML, speech-to-hypertext, speech-to-X1), we can consider:
from:
to:
then client-side, server-side or third-party components or services could return either text or XML content per recognition result. That is,
transcript
could be either aDOMString
or aDOMElement
.Speech-to-text is too lossy. Information pertaining to prosody, intonation, emphases and pauses are discarded in text-formatted output. Such information can be useful, for instance, in informing machine translation components and services.
The text was updated successfully, but these errors were encountered: