Features

Discover the various features that MK-TTS offers to enhance your text-to-speech experience. Our service provides state-of-the-art AI technology to deliver natural and high-quality speech synthesis.

Natural Speech Synthesis

Our AI generates natural speech from the text you enter, providing realistic and expressive voice output.

  • Outputs natural speech from text input in multiple languages.
  • Provides realistic and expressive voice output.
  • Currently supports six voices: Alloy, Echo, Fable, Onyx, Nova, Shimmer.
  • Voices include male (Echo, Fable, Onyx) and female (Alloy, Nova, Shimmer) options.
  • Quality of voice generation is optimized for English but also supports other languages effectively.

Multi-language Support

Supports various languages, allowing you to generate speech in multiple languages with ease.

  • AI automatic translation using OpenAI's GPT-4 model.
  • Translates input text into desired languages naturally.
  • Supported languages include Afrikaans, Arabic, Chinese, English, French, German, Hindi, Japanese, Korean, Russian, Spanish, and many more. See more supported languages

Custom Voice Settings

Customize the voice output with different tones and moods to match your specific needs.

  • Specify tone or mood using the "Special order" input field.
  • Examples include "like talking to a friend," "like a news anchor," "in a sad tone," etc.

Downloadable MP3 Files

Save your generated speech as MP3 files for offline use and convenience.

  • Download generated speech as MP3 files.
  • Playback speed can be adjusted for preview purposes, but the downloaded file plays at normal speed (x1.0).

AI Translation

Using the GPT 4 model, our service translates your input text into the desired language and generates speech, providing both translation and text-to-speech functionalities.

  • Contextual and mood-understanding translations generated by AI.
  • Translations interpreted and reproduced to ensure naturalness.
  • Includes custom options that affect the translation and the naturalness, mood, and nuance of the voice indirectly.

Secure API Integration

Integrate your OpenAI API key securely with our user-friendly GUI, ensuring your API key is handled safely.

  • API keys are encrypted and handled securely.
  • User-friendly GUI for easy integration and usage.

User-friendly Interface

Our GUI is designed to be intuitive and easy to use, making it simple to generate high-quality speech.

  • Intuitive and easy-to-use interface.
  • Designed to provide a seamless user experience.

Token-based Usage

Utilize a token-based system to measure usage and manage costs effectively. Learn more about Tokens


Tokens: Tokens are the smallest units of meaning for a language model, used to calculate the cost of processing text. Each token can be as short as one character or as long as one word. For example, the word "hello" is considered one token, while the sentence "Hello, world!" is considered four tokens.


Supported Languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.