New: Audio API, Embeddings & Realtime WebSocket now available!
osmAPI LogoosmAPI

Create Translation

Create Translation

POST /v1/audio/translations

Translates non-English audio into English text.

Request

Content-Type: multipart/form-data

ParameterTypeRequiredDescription
filefileYesAudio file (max 25MB). Formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm
modelstringYeswhisper-1 (OpenAI), groq/whisper-large-v3 (Groq), or groq/whisper-large-v3-turbo (Groq)
promptstringNoOptional text to guide style (in English)
response_formatstringNojson (default), text, srt, verbose_json, vtt
temperaturenumberNo0 to 1. Default 0.

Example

curl -X POST "https://api.osmapi.com/v1/audio/translations" \
  -H "Authorization: Bearer $OSM_API_KEY" \
  -F file=@spanish_audio.mp3 \
  -F model=whisper-1

Response

{
  "text": "Hello, this is the English translation."
}

Response Headers

HeaderDescription
x-request-idUnique request identifier
x-osm-response-costRequest cost in USD

How is this guide?