Rime Text-to-Speech

Text-to-speech server that converts text into spoken audio through Rime's API, streaming with optimized buffering for mi...

Created: Apr 23, 2025

Rime MCP

[![rime](rime-logo.png)](https://www.rime.ai)
A Model Context Protocol (MCP) server that provides text-to-speech capabilities using the Rime API. This server downloads audio and plays it using the system's native audio player.

Features

  • Exposes a `speak` tool that converts text to speech and plays it through system audio
  • Uses Rime's high-quality voice synthesis API

Requirements

  • Node.js 16.x or higher
  • A working audio output device
  • macOS: Uses `afplay`

Support for the following platforms relies on sample code generated by Claude that has not been tested:
  • Windows: Built-in Media.SoundPlayer (PowerShell)
  • Linux: mpg123, mplayer, aplay, or ffplay
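
As a rough illustration of the per-platform players listed above, the sketch below maps a Node.js platform string to candidate playback commands. It is not taken from the server's source: `candidatePlayers` and `PlayerCommand` are hypothetical names, the command-line flags are one plausible choice, and audio format compatibility (for example, Media.SoundPlayer and `aplay` expect WAV input) is not handled.

```typescript
// Hypothetical sketch: not the server's actual implementation.
// Describes one command that could play an audio file.
type PlayerCommand = { command: string; args: string[] };

// Returns candidate players for a platform, in the order they would be tried.
function candidatePlayers(platform: string, file: string): PlayerCommand[] {
  switch (platform) {
    case "darwin":
      // macOS ships with afplay.
      return [{ command: "afplay", args: [file] }];
    case "win32": {
      // Media.SoundPlayer only plays WAV files.
      // Single quotes are doubled to escape them inside a PowerShell string.
      const escaped = file.replace(/'/g, "''");
      return [
        {
          command: "powershell",
          args: [
            "-NoProfile",
            "-Command",
            `(New-Object Media.SoundPlayer '${escaped}').PlaySync()`,
          ],
        },
      ];
    }
    case "linux":
      // Any one of these may be installed; a caller would try them in turn.
      return [
        { command: "mpg123", args: ["-q", file] },
        { command: "mplayer", args: ["-really-quiet", file] },
        { command: "aplay", args: ["-q", file] },
        { command: "ffplay", args: ["-nodisp", "-autoexit", "-loglevel", "quiet", file] },
      ];
    default:
      // Unknown platform: no known player.
      return [];
  }
}
```

A caller could pass `process.platform` as the first argument and hand the first command that is actually installed to `child_process.spawn`.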

MCP Configuration

All of the optional env vars are part of the tool definition and are prompts to
All voice options are [listed here](https://users.rime.ai/data/voices/all-v2.json).
You can get your API key from the [Rime Dashboard](https://rime.ai/dashboard/tokens).
The following environment variables can be used to customize the behavior:
  • `RIME_GUIDANCE`: The main description of when and how to use the speak tool
  • `RIME_WHO_TO_ADDRESS`: Who the speech should address (default: "user")
  • `RIME_WHEN_TO_SPEAK`: When the tool should be used (default: "when asked to speak or when finishing a command")
  • `RIME_VOICE`: The default voice to use (default: "cove")
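
To make the defaults above concrete, here is a minimal sketch of how these variables might be read and folded into the `speak` tool's description. The names `RimeSettings`, `loadRimeSettings`, and `describeSpeakTool` are hypothetical, as are the fallback guidance text and the layout of the description; only the variable names and default values come from the list above.

```typescript
// Hypothetical sketch: not the server's actual implementation.
interface RimeSettings {
  guidance?: string; // no default is documented, so it may be absent
  whoToAddress: string;
  whenToSpeak: string;
  voice: string;
}

// Reads the optional variables, falling back to the documented defaults.
// Empty strings are treated the same as unset variables.
function loadRimeSettings(env: Record<string, string | undefined>): RimeSettings {
  return {
    guidance: env.RIME_GUIDANCE || undefined,
    whoToAddress: env.RIME_WHO_TO_ADDRESS || "user",
    whenToSpeak:
      env.RIME_WHEN_TO_SPEAK || "when asked to speak or when finishing a command",
    voice: env.RIME_VOICE || "cove",
  };
}

// Builds a tool description string from the settings (assumed layout).
function describeSpeakTool(settings: RimeSettings): string {
  const lines = [
    // Assumed fallback wording; the real default guidance is not documented here.
    settings.guidance ?? "Convert text to speech and play it aloud.",
    `Who to address: ${settings.whoToAddress}`,
    `When to speak: ${settings.whenToSpeak}`,
    `Default voice: ${settings.voice}`,
  ];
  return lines.join("\n");
}
```

In a Node.js process, `loadRimeSettings(process.env)` would pick up whatever the MCP client passes in its `env` block.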

Example use cases

[![Demo of Rime MCP in Cursor](https://img.youtube.com/vi/tYqTACgijxk/0.jpg)](https://www.youtube.com/watch?v=tYqTACgijxk)

Example 1: Coding agent announcements

Example 2: Learn how the kids talk these days

Example 3: Different languages based on context

Development

  1. Install dependencies:
  2. Build the server:
  3. Run in development mode with hot reload:

License

MIT