Parameter specifics
Review voice activity detection parameter specifications.
Read time 1 minuteLast updated 20 hours ago
vad_hangover
The hangover time is the time (in milliseconds) it takes for the VAD to switch back from speech mode to silence after the last speech frame is detected. The default value is 2000.
vad_hangovervad_sensitivity
The sensitivity is a dimensionless value between 0 and 100 that indicates the sensitivity of the VAD. Increasing this value corresponds to decreasing the sensitivity of the VAD (0 is the most sensitive, and 100 is the least sensitive). Higher values of sensitivity require louder audio to trigger the VAD. The default value is 43.
Applications that use the default VAD and expose the vad_sensitivityvad_sensitivityvad_noise_floor
The noise floor is a dimensionless value between 0 and 20000 that controls how the VAD separates speech from background noise. Lower values assume the user is in a quieter environment where the audio is only speech. Higher values assume a noisy background environment. The default value is 576.
vad_noise_floor