feat(speech): enhance speech processing with advanced audio setup and detection

- Add audio setup script for PulseAudio configuration
- Improve wake word detection with advanced noise filtering
- Implement continuous transcription and command processing
- Update speech Dockerfile with additional audio dependencies
- Enhance logging and error handling in wake word detector
jango-blockchained
2025-02-04 22:51:06 +01:00
parent e1db799b1d
commit b9727981cc
4 changed files with 374 additions and 103 deletions

docker/speech/setup-audio.sh Executable file

@@ -0,0 +1,16 @@
#!/bin/bash
# Wait for PulseAudio to be ready
sleep 2
# Mute the monitor to prevent feedback
pactl set-source-mute alsa_output.pci-0000_00_1b.0.analog-stereo.monitor 1
# Set microphone sensitivity to 65%
pactl set-source-volume alsa_input.pci-0000_00_1b.0.analog-stereo 65%
# Set speaker volume to 40%
pactl set-sink-volume alsa_output.pci-0000_00_1b.0.analog-stereo 40%
# Make the script executable
chmod +x /setup-audio.sh
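
Note on the readiness wait: the fixed `sleep 2` above assumes PulseAudio is always up within two seconds. A possible alternative, shown here only as a sketch, is to poll until a probe command succeeds. `wait_for` is a hypothetical helper name that is not part of this commit, and the retry count and delay in the usage line are illustrative assumptions.

```shell
#!/bin/bash
# Hypothetical helper (not in the commit): retry a probe command until it
# succeeds or the attempt budget runs out.
#   $1 = maximum number of attempts
#   $2 = delay between attempts, in seconds
#   remaining arguments = the probe command and its arguments
wait_for() {
    local attempts="$1" delay="$2"
    shift 2
    local i
    for ((i = 0; i < attempts; i++)); do
        # Discard probe output; only its exit status matters here.
        if "$@" >/dev/null 2>&1; then
            return 0
        fi
        sleep "$delay"
    done
    return 1
}
```

In `setup-audio.sh` this could replace the sleep with something like `wait_for 20 0.5 pactl info || exit 1`, assuming `pactl` is on the PATH and that `pactl info` failing means the server is not reachable yet. Two related observations: the final `chmod +x /setup-audio.sh` can only run once the script is already being executed, so the permission is more usefully set at image build time; and the hard-coded `alsa_*.pci-0000_00_1b.0.*` device names are specific to one machine, where `pactl` also accepts the special names `@DEFAULT_SINK@`, `@DEFAULT_SOURCE@` and `@DEFAULT_MONITOR@`.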