(20Qs) Hindi Audio to Text Benchmark | Gemini 3 Pro, GPT‑4o, GPT 5.1

(20 Qs Updated Jan 2025)

This page benchmarks 11 Hindi speech-to-text workflows, some of which are Hindi → English translation pipelines.

We use the same Hindi audio clips for every system. Then we compare each system’s text output to a reference answer and give it a score between 0 and 1.
A higher score means the system is closer to the reference text and usually more accurate.
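The scoring and aggregation described above can be sketched as follows. This is a minimal illustration, not the page's actual method: the page does not say how the 0 to 1 score is computed, so the character-level `difflib.SequenceMatcher` ratio below is a stand-in assumption, and the function names `similarity_score` and `summarize` are hypothetical. Only the aggregates (mean for accuracy, median for latency) come from the page.

```python
from difflib import SequenceMatcher
from statistics import mean, median


def similarity_score(hypothesis: str, reference: str) -> float:
    """Return a score in [0, 1]; 1.0 means the texts match exactly.

    ASSUMPTION: a character-level similarity ratio stands in for the
    benchmark's real scoring method, which the page does not describe.
    """
    # Collapse runs of whitespace so spacing differences are not penalised.
    hyp = " ".join(hypothesis.split())
    ref = " ".join(reference.split())
    return SequenceMatcher(None, hyp, ref).ratio()


def summarize(scores: list[float], run_times: list[float]) -> dict[str, float]:
    """Aggregate per-clip results the way the page reports them:
    mean of the scores, median of the run times."""
    return {
        "accuracy_mean": mean(scores),
        "latency_median": median(run_times),
    }
```

Each system would be run on every clip, scored against the reference with `similarity_score`, and then reduced to one row with `summarize`. The median is less sensitive than the mean to a single unusually slow run, which is a plausible reason to report latency that way.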

Workflow                       Accuracy (mean score)   Latency (median)
GPT4oAudio                     0.77                    7.33
GPTRealtime                    0.73                    6.64
GPT5.1                         0.92                    5.69
GPT4.1                         0.90                    5.67
Gemini 3 Pro                   0.96                    9.88
Gemini 3 Flash                 0.92                    7.58
Sarvam.AI                      0.55                    5.98
Omnilingual+GPT5-mini          0.96                    9.00
Omnilingual+Gemini 3 Pro       0.93                    10.14
Omnilingual+Gemini 3 Flash     0.91                    7.60
MMS+GoogMT+GPT4.1              0.91                    4.94
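
One way to read the table is to rank the workflows by accuracy and then look for the ones that are not beaten on both accuracy and latency at once. The sketch below does that with the values copied from the table; the names `RESULTS`, `rank_by_accuracy`, and `pareto_front` are illustrative and not part of any Gooey.AI API.

```python
# (workflow, mean accuracy score, median latency), copied from the table above.
RESULTS = [
    ("GPT4oAudio", 0.77, 7.33),
    ("GPTRealtime", 0.73, 6.64),
    ("GPT5.1", 0.92, 5.69),
    ("GPT4.1", 0.90, 5.67),
    ("Gemini 3 Pro", 0.96, 9.88),
    ("Gemini 3 Flash", 0.92, 7.58),
    ("Sarvam.AI", 0.55, 5.98),
    ("Omnilingual+GPT5-mini", 0.96, 9.00),
    ("Omnilingual+Gemini 3 Pro", 0.93, 10.14),
    ("Omnilingual+Gemini 3 Flash", 0.91, 7.60),
    ("MMS+GoogMT+GPT4.1", 0.91, 4.94),
]


def rank_by_accuracy(rows):
    """Sort by accuracy (highest first), breaking ties by lower latency."""
    return sorted(rows, key=lambda r: (-r[1], r[2]))


def pareto_front(rows):
    """Keep rows that no other row beats on accuracy and latency together."""
    front = []
    for name, acc, lat in rows:
        dominated = any(
            a >= acc and l <= lat and (a > acc or l < lat)
            for _, a, l in rows
        )
        if not dominated:
            front.append(name)
    return front


if __name__ == "__main__":
    for name, acc, lat in rank_by_accuracy(RESULTS)[:3]:
        print(f"{name}: accuracy={acc:.2f}, latency={lat:.2f}")
    print("Not dominated:", pareto_front(RESULTS))
```

A workflow that is not dominated is a reasonable candidate whenever both accuracy and speed matter; which one to pick depends on how much latency the application can tolerate.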

On this page you can:

  • See which Hindi system or pipeline gets the best score
  • Compare different Hindi ASR and Hindi→English models side by side
  • Choose the best system for your app, call center, research, or product
  • Download all results for deeper analysis and custom reporting

Downloadable results:

  • Compare Output Text (from input_audio), aggregate: mean
  • Compare Run Time, aggregate: median