feat(speech): enhance speech processing with advanced audio setup and detection

- Add audio setup script for PulseAudio configuration - Improve wake word detection with advanced noise filtering - Implement continuous transcription and command processing - Update speech Dockerfile with additional audio dependencies - Enhance logging and error handling in wake word detector
2025-02-04 22:51:06 +01:00
parent e1db799b1d
commit b9727981cc
4 changed files with 374 additions and 103 deletions
--- a/docker/speech/setup-audio.sh
+++ b/docker/speech/setup-audio.sh
@@ -0,0 +1,16 @@
+#!/bin/bash
+
+# Wait for PulseAudio to be ready
+sleep 2
+
+# Mute the monitor to prevent feedback
+pactl set-source-mute alsa_output.pci-0000_00_1b.0.analog-stereo.monitor 1
+
+# Set microphone sensitivity to 65%
+pactl set-source-volume alsa_input.pci-0000_00_1b.0.analog-stereo 65%
+
+# Set speaker volume to 40%
+pactl set-sink-volume alsa_output.pci-0000_00_1b.0.analog-stereo 40%
+
+# Make the script executable
+chmod +x /setup-audio.sh