Skip to content

S26: Add Stronger Guardrails to the AI Voice Bot #70

@notzabir

Description

@notzabir

Description

Decibel-Based Voice Detection

Basic RMS detection is okay.
Explore WebRTC

  • VAD (highly optimized)
  • Silero VAD (PyTorch-based)
  • Pyannote (speaker diarization)

Advanced Upgrade: Interrupt Detection

  • If user speaks while bot talking: Immediately stop TTS
  • Switch to listening That makes it feel intelligent.

B. Logging + Telemetry System Track:

  • Latency
  • Memory usage
  • Crash events
  • Conversation length

Store metrics for optimization.

C. Fail-Safe Mode
If:

  • CV crashes
  • LLM times out
  • RAM overload

System switches to:

  • Basic fallback voice assistant mode

WebRTC VAD

Silero VAD (PyTorch)

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

Status

In Progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions