Voice Recorder MCP Server

An MCP server for recording audio and transcribing it using OpenAI's Whisper model. Designed to work as a Goose custom extension or standalone MCP server.

Features

Record audio from the default microphone

Transcribe recordings using Whisper

Integrates with Goose AI agent as a custom extension

Includes prompts for common recording scenarios

Installation

Usage

As a Standalone MCP Server

Testing with MCP Inspector

The MCP Inspector provides an interactive interface to test your server:

With Goose AI Agent

Open Goose and go to Settings > Extensions > Add > Command Line Extension

Set the name to `voice-recorder`

In the Command field, enter the full path to the voice-recorder-mcp executable: ``` /full/path/to/voice-recorder-mcp ``` Or for a specific model: ``` /full/path/to/voice-recorder-mcp --model medium.en ``` To find the path, run: ```bash which voice-recorder-mcp ```

No environment variables are needed for basic functionality

Start a conversation with Goose and introduce the recorder with: "I want you to take action from transcriptions returned by voice-recorder. For example, if I dictate a calculation like 1+1, please return the result."

Available Tools

`start_recording`: Start recording audio from the default microphone

`stop_and_transcribe`: Stop recording and transcribe the audio to text

`record_and_transcribe`: Record audio for a specified duration and transcribe it

Whisper Models

This extension supports various Whisper model sizes:

[object Object]	[object Object]	[object Object]	[object Object]	[object Object]
[object Object]	[object Object]	[object Object]	[object Object]	[object Object]
[object Object]	[object Object]	[object Object]	[object Object]	[object Object]
[object Object]	[object Object]	[object Object]	[object Object]	[object Object]
[object Object]	[object Object]	[object Object]	[object Object]	[object Object]
[object Object]	[object Object]	[object Object]	[object Object]	[object Object]

The `.en` suffix indicates models specialized for English, which are faster and more accurate for English content.

Requirements

Python 3.12+

An audio input device (microphone)

Configuration

You can configure the server using environment variables:

Troubleshooting

Common Issues

**No audio being recorded**: Check your microphone permissions and settings

**Model download errors**: Ensure you have a stable internet connection for the initial model download

**Integration with Goose**: Make sure the command path is correct

**Audio quality issues**: Try adjusting the sample rate (default: 16000)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository

Create your feature branch (`git checkout -b feature/amazing-feature`)

Commit your changes (`git commit -m 'Add some amazing feature'`)

Push to the branch (`git push origin feature/amazing-feature`)

Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Voice Recorder (Whisper)