Voice Activity Detection `deprecated`¶

This service is used to handle Voice Activity Detection (VAD).

The current implementation uses the browser's SpeechRecognition API to detect speech. Transcription is still handled by the backend due to the following reasons.

Multi language support. The browser requires the user to specify the language; leaving this capability to the backend allows us to automatically detect the user's language.
Limited browser support. Not all browsers support this API (e.g. Arc, Firefox), leaving this to our own API allows us to have full control and simplify the process of Speech to Text (STT) transcription.

Fallback required

A fall back VAD detector needs to be implemented in the event of an unsupported browser. This method can be done using live transcription from our backend, or using alternative methods (e.g. volume levels).

`service` VadService¶

Attributes¶

`private` endTimeout¶

number to keep track of timeout calls that are used in VadService.start() to emit an event when speech has ended.

`private` $speech¶

Subject<void> emits an event whenever speech has been detected; fired through a subscription in VadService.configSpeechRecognition() as a side effect.

`private` recognition¶

SpeechRecognition object from the browser.

Methods¶

`private` `meth` Configure¶

private configSpeechRecofnition(): void

Description: Method to instantiate the speech recognition object. Called in the constructor.

`meth` Start¶

start(): Observable<VoiceActivity>

Description: Method to get an observable to keep track of VAD activity.
Returns: Observable<VoiceActivity>

Updated 23 Oct 2024

Contributor Ong Tsien Jin

Voice Activity Detection deprecated¶

service VadService¶

Attributes¶

private endTimeout¶

private $speech¶

private recognition¶