OpenAI Whisper

OpenAI
96625
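OpenAI Whisper MCP Server - offline speech recognition and translation built on the local Whisper CLI. No API key required. Supports mp3, mp4, m4a, wav, and other formats, and can output plain text or SRT/VTT subtitles.
#speech-recognition #transcription #translation #subtitles #Whisper #offline #local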

OpenAI Whisper 2

whisper_transcribe

Transcribe an audio/video file to text using the local Whisper CLI. No API key required; runs entirely on the local machine. Supports: mp3, mp4, m4a, wav, flac, ogg, webm, mkv, avi, etc.

Args:
  audio_url: URL of the audio/video file to transcribe. Must be publicly accessible.
  model: Whisper model to use. Options: tiny / base / small / medium / large / turbo (default). Smaller = faster, larger = more accurate.
  language: Source language code (e.g. "zh", "en", "ja"). Auto-detected if not provided.
  output_format: Output format. Options: txt (default) / srt / vtt / json / tsv.
    - txt: plain text transcript
    - srt/vtt: subtitles with timestamps
    - json: detailed segments with timestamps

Parameters (4)
audio_url string Required

model string Optional

language string Optional

output_format string Optional
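
The four parameters above map directly onto the arguments of an MCP tool call. As a minimal sketch, assuming the server is reached through the standard MCP JSON-RPC `tools/call` method, a request body could be assembled as follows. The helper name `build_transcribe_request` and the example URL are hypothetical; only the tool name, parameter names, and option lists come from the listing.

```python
import json

# Option lists copied from the tool description above.
MODELS = {"tiny", "base", "small", "medium", "large", "turbo"}
FORMATS = {"txt", "srt", "vtt", "json", "tsv"}


def build_transcribe_request(audio_url, model="turbo", language=None,
                             output_format="txt", request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` payload for whisper_transcribe.

    Hypothetical helper: it only assembles the request dict and does not
    contact any server.
    """
    if model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    if output_format not in FORMATS:
        raise ValueError(f"unknown output_format: {output_format!r}")

    arguments = {
        "audio_url": audio_url,
        "model": model,
        "output_format": output_format,
    }
    # language is optional; omitting it lets the tool auto-detect.
    if language is not None:
        arguments["language"] = language

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "whisper_transcribe", "arguments": arguments},
    }


if __name__ == "__main__":
    # Placeholder URL; the tool requires a publicly accessible file.
    request = build_transcribe_request(
        "https://example.com/talk.mp3", language="en", output_format="srt"
    )
    print(json.dumps(request, indent=2))
```

Validating `model` and `output_format` on the client side is a design choice of this sketch, not something the listing requires; it simply surfaces typos before the request is sent.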

whisper_translate

Translate audio/video speech to English text using the local Whisper CLI. Automatically detects the source language and translates to English. No API key required; runs entirely on the local machine.

Args:
  audio_url: URL of the audio/video file to translate. Must be publicly accessible.
  model: Whisper model to use. Options: tiny / base / small / medium / large / turbo (default). Smaller = faster, larger = more accurate.
  output_format: Output format. Options: txt (default) / srt / vtt / json / tsv.

Parameters (3)
audio_url string Required

model string Optional

output_format string Optional
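
The listing does not show how the server invokes Whisper internally. As a rough illustration only, assuming the reference `openai-whisper` command-line tool and an audio file that has already been downloaded to a local path, the translation step could correspond to a command built like this. The function name `build_translate_command` and the file paths are hypothetical.

```python
import shlex


def build_translate_command(audio_path, model="turbo", output_format="txt",
                            output_dir="."):
    """Return the argv list for a local Whisper CLI translation run.

    Assumes the reference `whisper` CLI from the openai-whisper package;
    the MCP server's actual invocation is not documented in the listing.
    """
    return [
        "whisper", audio_path,
        "--task", "translate",          # translate speech into English
        "--model", model,               # "turbo" needs a recent release
        "--output_format", output_format,
        "--output_dir", output_dir,
    ]


if __name__ == "__main__":
    # Hypothetical local path standing in for the downloaded audio_url.
    cmd = build_translate_command("/tmp/interview.m4a", output_format="srt")
    print(shlex.join(cmd))
```

No `--language` flag is passed here, which matches the tool description: the source language is left to auto-detection.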