voice recorder (whisper).com
voice recorder (whisper).com logo

Voice Recorder (Whisper)

Integrates with OpenAI's Whisper model to provide voice recording and transcription capabilities for applications requir...

Created byApr 23, 2025

Voice Recorder MCP Server

An MCP server for recording audio and transcribing it using OpenAI's Whisper model. Designed to work as a Goose custom extension or standalone MCP server.

Features

  • Record audio from the default microphone
  • Transcribe recordings using Whisper
  • Integrates with Goose AI agent as a custom extension
  • Includes prompts for common recording scenarios

Installation

Usage

As a Standalone MCP Server

Testing with MCP Inspector

The MCP Inspector provides an interactive interface to test your server:

With Goose AI Agent

  1. Open Goose and go to Settings > Extensions > Add > Command Line Extension
  1. Set the name to `voice-recorder`
  1. In the Command field, enter the full path to the voice-recorder-mcp executable: ``` /full/path/to/voice-recorder-mcp ``` Or for a specific model: ``` /full/path/to/voice-recorder-mcp --model medium.en ``` To find the path, run: ```bash which voice-recorder-mcp ```
  1. No environment variables are needed for basic functionality
  1. Start a conversation with Goose and introduce the recorder with: "I want you to take action from transcriptions returned by voice-recorder. For example, if I dictate a calculation like 1+1, please return the result."

Available Tools

  • `start_recording`: Start recording audio from the default microphone
  • `stop_and_transcribe`: Stop recording and transcribe the audio to text
  • `record_and_transcribe`: Record audio for a specified duration and transcribe it

Whisper Models

This extension supports various Whisper model sizes:
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
[object Object]
The `.en` suffix indicates models specialized for English, which are faster and more accurate for English content.

Requirements

  • Python 3.12+
  • An audio input device (microphone)

Configuration

You can configure the server using environment variables:

Troubleshooting

Common Issues

  • **No audio being recorded**: Check your microphone permissions and settings
  • **Model download errors**: Ensure you have a stable internet connection for the initial model download
  • **Integration with Goose**: Make sure the command path is correct
  • **Audio quality issues**: Try adjusting the sample rate (default: 16000)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
  1. Fork the repository
  1. Create your feature branch (`git checkout -b feature/amazing-feature`)
  1. Commit your changes (`git commit -m 'Add some amazing feature'`)
  1. Push to the branch (`git push origin feature/amazing-feature`)
  1. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.