Gates Foundation

gatesfoundation

A workspace for the Gates Foundation DPI, FairFoward and Gooey teams focused on evals for low-resource languages plus the home of our Agriculture advisory work e.g. https://gooey.ai/ageval

352 Public Workflows
8 Members

(Updated April 28, 2026)
This page presents a comparative demonstration of multiple LLM and speech-to-text systems combinations in Urdu → English translation pipelines. Results

2mo ago

Public

Gooey.AI's base AI workflow with built-in RAG, web search, voice understanding of 1000+ languages, code creation + execution, API connections & integrations to create your own WhatsApp, Web, FB and voice AI bots. Includes follow-up and location buttons on WhatsApp. Built on the Claude, this bot has 3 functions that give it superpower - websearch - giving it the ability to search anything on the internet; and code writing & execution - giving it the ability to calculate or convert data results. It also makes beautiful QR codes and images. ;-)

šŸ’¬

2mo ago

Public

A bulk evaluator workflow that compares AI-generated answers (copilot responses) to a set of golden reference answers. Requires input data columns: "input_prompt" (the question/task) and "reference_answer" (the ideal response). The workflow uses custom evaluation prompts to compare outputs, scoring them for accuracy and penalizing hallucinations. Aggregates results to provide an overall performance metric for your AI answers.

āš–ļø

2mo ago

87 runs

Public

šŸ’¬

2mo ago

105 runs

Public

(Updated April 28, 2026)
This page presents a comparative demonstration of multiple LLM and speech-to-text systems combinations in Pashto → English translation pipelines. Results

2mo ago

Public

šŸ’¬

2mo ago

109 runs

Public

Use this workflow compare latency on Copilot Bulk Runs. The graph display the median score. Lower is better!

āš–ļø

- rename metric Eval Prompt / Graph Description

2mo ago

98 runs

Public

šŸ’¬

2mo ago

183 runs

Public