MCP server integrating VolcEngine for automatic speech recognition (ASR) and text-to-speech (TTS), converting audio to text or text to audio files.