GPT-4o Transcribe

Input:$75/M

Output:$300/M

GPT-4o Transcribe is an audio-to-text model for multilingual, low-latency speech recognition. It supports real-time streaming and batch transcription from common audio formats with punctuation and sentence segmentation. Typical uses include live captions, voice assistant input, meeting notes, and media or call recording transcription. Technical highlights include audio modality support, long-form processing, and APIs suited for interactive and server-side workflows.

Commercial Use

Features

Pricing

API

Versions

Pricing for GPT-4o Transcribe

Explore competitive pricing for GPT-4o Transcribe, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how GPT-4o Transcribe can enhance your projects while keeping costs manageable.

Comet Price (USD / M Tokens)	Official Price (USD / M Tokens)	Discount
Input:$75/M Output:$300/M	Input:$93.75/M Output:$375/M	-20%

Versions of GPT-4o Transcribe

The reason GPT-4o Transcribe has multiple snapshots may include potential factors such as variations in output after updates requiring older snapshots for consistency, providing developers a transition period for adaptation and migration, and different snapshots corresponding to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.

version
gpt-4o-transcribe

GPT-4o Transcribe

Pricing for GPT-4o Transcribe

Sample code and API for GPT-4o Transcribe

Versions of GPT-4o Transcribe

GPT-4o Transcribe

Pricing for GPT-4o Transcribe

Sample code and API for GPT-4o Transcribe

Versions of GPT-4o Transcribe