docs: Revise README to consolidate core features and enhance speech processing documentation

- Moved core features section to a more prominent position
- Added detailed speech features setup and configuration instructions
- Included additional tools available in the `extra/` directory for enhanced Home Assistant experience
- Removed outdated speech features documentation for clarity
2025-03-15 17:02:55 +01:00
parent 90fd0e46f7
commit d1cca04e76
1 changed file with 84 additions and 128 deletions
--- a/README.md
+++ b/README.md
@@ -6,6 +6,13 @@

 MCP (Model Context Protocol) Server is my lightweight integration tool for Home Assistant, providing a flexible interface for device management and automation. It's designed to be fast, secure, and easy to use. Built with Bun for maximum performance.

+## Core Features ✨
+
+- 🔌 Basic device control via REST API
+- 📡 WebSocket/Server-Sent Events (SSE) for state updates
+- 🤖 Simple automation rule management
+- 🔐 JWT-based authentication
+
 ## Why Bun? 🚀

 I chose Bun as the runtime for several key benefits:
@@ -38,66 +45,6 @@ I chose Bun as the runtime for several key benefits:
  - Compatible with Express/Fastify
  - Native Node.js APIs

-## Core Features ✨
-
- 🔌 Basic device control via REST API
- 📡 WebSocket/Server-Sent Events (SSE) for state updates
- 🤖 Simple automation rule management
- 🔐 JWT-based authentication
- 🎤 Optional speech features:
-  - 🗣️ Wake word detection ("hey jarvis", "ok google", "alexa")
-  - 🎯 Speech-to-text using fast-whisper
-  - 🌍 Multiple language support
-  - 🚀 GPU acceleration support
-
-## System Architecture 📊
-
-```mermaid
-flowchart TB
-    subgraph Client["Client Applications"]
-        direction TB
-        Web["Web Interface"]
-        Mobile["Mobile Apps"]
-        Voice["Voice Control"]
-    end
-
-    subgraph MCP["MCP Server"]
-        direction TB
-        API["REST API"]
-        WS["WebSocket/SSE"]
-        Auth["Authentication"]
-        
-        subgraph Speech["Speech Processing (Optional)"]
-            direction TB
-            Wake["Wake Word Detection"]
-            STT["Speech-to-Text"]
-            
-            subgraph STT_Options["STT Options"]
-                direction LR
-                Whisper["Whisper"]
-                FastWhisper["Fast Whisper"]
-            end
-            
-            Wake --> STT
-            STT --> STT_Options
-        end
-    end
-
-    subgraph HA["Home Assistant"]
-        direction TB
-        HASS_API["HASS API"]
-        HASS_WS["HASS WebSocket"]
-        Devices["Smart Devices"]
-    end
-
-    Client --> MCP
-    MCP --> HA
-    HA --> Devices
-
-    style Speech fill:#f9f,stroke:#333,stroke-width:2px
-    style STT_Options fill:#bbf,stroke:#333,stroke-width:1px
-```
-
 ## Prerequisites 📋

 - 🚀 [Bun runtime](https://bun.sh) (v1.0.26+)
@@ -135,21 +82,11 @@ NODE_ENV=production ./scripts/setup-env.sh

 4. Build and launch with Docker:
 ```bash
-# Build options:
 # Standard build
 ./docker-build.sh

-# Build with speech support
-./docker-build.sh --speech
-
-# Build with speech and GPU support
-./docker-build.sh --speech --gpu
-
 # Launch:
 docker compose up -d
-
-# With speech features:
-docker compose -f docker-compose.yml -f docker-compose.speech.yml up -d
 ```

 ## Docker Build Options 🐳
@@ -213,41 +150,6 @@ Files load in this order:

 Later files override earlier ones.

-## Speech Features Setup 🎤
-
-### Prerequisites
-1. 🐳 Docker installed and running
-2. 🎮 NVIDIA GPU with CUDA (optional)
-3. 💾 4GB+ RAM (8GB+ recommended)
-
-### Configuration
-1. Enable speech in `.env`:
-```bash
-ENABLE_SPEECH_FEATURES=true
-ENABLE_WAKE_WORD=true
-ENABLE_SPEECH_TO_TEXT=true
-WHISPER_MODEL_PATH=/models
-WHISPER_MODEL_TYPE=base
-```
-
-2. Choose your STT engine:
-```bash
-# For standard Whisper
-STT_ENGINE=whisper
-
-# For Fast Whisper (GPU recommended)
-STT_ENGINE=fast-whisper
-CUDA_VISIBLE_DEVICES=0  # Set GPU device
-```
-
-### Available Models 🤖
-Choose based on your needs:
- `tiny.en`: Fastest, basic accuracy
- `base.en`: Good balance (recommended)
- `small.en`: Better accuracy, slower
- `medium.en`: High accuracy, resource intensive
- `large-v2`: Best accuracy, very resource intensive
-
 ## Development 💻

 ```bash
@@ -291,29 +193,6 @@ bun run start
 - [Custom Prompts Guide](docs/prompts.md) - Create and customize AI behavior
 - [Extras & Tools](docs/extras.md) - Additional utilities and advanced features

-### Extra Tools 🛠️
-
-I've included several powerful tools in the `extra/` directory to enhance your Home Assistant experience:
-
-1. **Home Assistant Analyzer CLI** (`ha-analyzer-cli.ts`)
-   - Deep automation analysis using AI models
-   - Security vulnerability scanning
-   - Performance optimization suggestions
-   - System health metrics
-
-2. **Speech-to-Text Example** (`speech-to-text-example.ts`)
-   - Wake word detection
-   - Speech-to-text transcription
-   - Multiple language support
-   - GPU acceleration support
-
-3. **Claude Desktop Setup** (`claude-desktop-macos-setup.sh`)
-   - Automated Claude Desktop installation for macOS
-   - Environment configuration
-   - MCP integration setup
-
-See [Extras Documentation](docs/extras.md) for detailed usage instructions and examples.
-
 ## Client Integration 🔗

 ### Cursor Integration 🖱️
@@ -354,6 +233,83 @@ Windows users can use the provided script:
 1. Go to `scripts` directory
 2. Run `start_mcp.cmd`

+## Additional Features
+
+### Speech Features 🎤
+
+MCP Server optionally supports speech processing capabilities:
+- 🗣️ Wake word detection ("hey jarvis", "ok google", "alexa")
+- 🎯 Speech-to-text using fast-whisper
+- 🌍 Multiple language support
+- 🚀 GPU acceleration support
+
+#### Speech Features Setup
+
+##### Prerequisites
+1. 🐳 Docker installed and running
+2. 🎮 NVIDIA GPU with CUDA (optional)
+3. 💾 4GB+ RAM (8GB+ recommended)
+
+##### Configuration
+1. Enable speech in `.env`:
+```bash
+ENABLE_SPEECH_FEATURES=true
+ENABLE_WAKE_WORD=true
+ENABLE_SPEECH_TO_TEXT=true
+WHISPER_MODEL_PATH=/models
+WHISPER_MODEL_TYPE=base
+```
+
+2. Choose your STT engine:
+```bash
+# For standard Whisper
+STT_ENGINE=whisper
+
+# For Fast Whisper (GPU recommended)
+STT_ENGINE=fast-whisper
+CUDA_VISIBLE_DEVICES=0  # Set GPU device
+```
+
+##### Available Models 🤖
+Choose based on your needs:
+- `tiny.en`: Fastest, basic accuracy
+- `base.en`: Good balance (recommended)
+- `small.en`: Better accuracy, slower
+- `medium.en`: High accuracy, resource intensive
+- `large-v2`: Best accuracy, very resource intensive
+
+##### Launch with Speech Features
+```bash
+# Build with speech support
+./docker-build.sh --speech
+
+# Launch with speech features:
+docker compose -f docker-compose.yml -f docker-compose.speech.yml up -d
+```
+
+### Extra Tools 🛠️
+
+I've included several powerful tools in the `extra/` directory to enhance your Home Assistant experience:
+
+1. **Home Assistant Analyzer CLI** (`ha-analyzer-cli.ts`)
+   - Deep automation analysis using AI models
+   - Security vulnerability scanning
+   - Performance optimization suggestions
+   - System health metrics
+
+2. **Speech-to-Text Example** (`speech-to-text-example.ts`)
+   - Wake word detection
+   - Speech-to-text transcription
+   - Multiple language support
+   - GPU acceleration support
+
+3. **Claude Desktop Setup** (`claude-desktop-macos-setup.sh`)
+   - Automated Claude Desktop installation for macOS
+   - Environment configuration
+   - MCP integration setup
+
+See [Extras Documentation](docs/extras.md) for detailed usage instructions and examples.
+
 ## License 📄

 MIT License. See [LICENSE](LICENSE) for details.
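
The speech configuration that this change moves under "Additional Features" relies on several `.env` variables (`ENABLE_SPEECH_FEATURES`, `ENABLE_SPEECH_TO_TEXT`, `WHISPER_MODEL_PATH`, `STT_ENGINE`). A minimal sketch of how those settings could be sanity-checked before launching the speech containers is shown below. The helper names `parseEnv` and `checkSpeechConfig` are hypothetical and are not part of the MCP Server codebase, and the two rules it applies (other speech variables only matter when `ENABLE_SPEECH_FEATURES=true`, and speech-to-text needs `WHISPER_MODEL_PATH`) are assumptions rather than documented behavior.

```typescript
// Hypothetical helpers for illustration; not part of the MCP Server codebase.
type Env = Record<string, string>;

// STT_ENGINE values shown in the README's configuration step.
const DOCUMENTED_ENGINES = ["whisper", "fast-whisper"];

// Parse KEY=VALUE lines, dropping blank lines and "#" comments.
function parseEnv(text: string): Env {
  const env: Env = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const hashAt = rawLine.indexOf("#");
    const line = (hashAt >= 0 ? rawLine.slice(0, hashAt) : rawLine).trim();
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // blank, comment-only, or malformed line
    env[line.slice(0, eq).trim()] = line.slice(eq + 1).trim();
  }
  return env;
}

// Return human-readable problems; an empty array means nothing was flagged.
function checkSpeechConfig(env: Env): string[] {
  const problems: string[] = [];
  // Assumption: the other speech variables only matter when this flag is "true".
  if (env["ENABLE_SPEECH_FEATURES"] !== "true") return problems;

  const engine = env["STT_ENGINE"];
  if (engine !== undefined && DOCUMENTED_ENGINES.indexOf(engine) === -1) {
    problems.push(
      `STT_ENGINE="${engine}" is not one of: ${DOCUMENTED_ENGINES.join(", ")}`
    );
  }
  // Assumption: speech-to-text needs a model directory to load from.
  if (env["ENABLE_SPEECH_TO_TEXT"] === "true" && !env["WHISPER_MODEL_PATH"]) {
    problems.push("ENABLE_SPEECH_TO_TEXT is true but WHISPER_MODEL_PATH is not set");
  }
  return problems;
}

// Usage: check the example values from the README.
const sample = parseEnv(`
ENABLE_SPEECH_FEATURES=true
ENABLE_SPEECH_TO_TEXT=true
WHISPER_MODEL_PATH=/models
STT_ENGINE=fast-whisper
CUDA_VISIBLE_DEVICES=0  # Set GPU device
`);
console.log(checkSpeechConfig(sample));
```

The check deliberately leaves `WHISPER_MODEL_TYPE` alone: the README shows `base` in the `.env` example and `base.en` in the model list, so which spellings the server accepts is not clear from this change.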