One API call. All models. Instant comparison.

Compare LLM outputs across models.

One API call. Pick your models. Get structured, side-by-side comparisons of output, latency, cost, and token usage.

100 free evals per month. No credit card required.

Try it live

See PromptDiff in action. No sign-up required.

Demo models: gpt-4o-mini, claude-3-haiku, gemini-1.5-flash
Demo prompt: Explain the difference between REST and GraphQL in 2 sentences.

How it works

From prompt to comparison in a single round trip. One request returns all the data you need to make the right model choice.

Step 01

POST your prompt

Send your prompt and choose which models to compare. Include system instructions or variables as needed.

Step 02

We run all models in parallel

PromptDiff calls all selected models concurrently, measuring latency, counting tokens, and computing cost for each call as it completes.
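To make the parallel step concrete, here is a minimal sketch of the fan-out pattern in Python. It is not PromptDiff's implementation: fake_model is a hypothetical stand-in that sleeps instead of calling a provider, and the delays are made-up numbers. The point is only that total wall-clock time tracks the slowest model rather than the sum of all of them.

```python
import asyncio
import time


async def fake_model(name: str, delay_s: float, prompt: str) -> str:
    """Hypothetical stand-in for a provider call: sleeps, then returns text."""
    await asyncio.sleep(delay_s)
    return f"{name} answer to: {prompt}"


async def timed_call(name: str, delay_s: float, prompt: str) -> dict:
    """Run one model call and record its wall-clock latency in milliseconds."""
    start = time.perf_counter()
    output = await fake_model(name, delay_s, prompt)
    latency_ms = (time.perf_counter() - start) * 1000
    return {"model": name, "output": output, "latency_ms": latency_ms}


async def compare(prompt: str, models: dict[str, float]) -> list[dict]:
    """Send the prompt to every model concurrently; results keep input order."""
    tasks = [timed_call(name, delay, prompt) for name, delay in models.items()]
    return await asyncio.gather(*tasks)


def run_demo() -> tuple[list[dict], float]:
    # Simulated per-model delays in seconds (illustrative values only).
    models = {"gpt-4o-mini": 0.10, "claude-3-haiku": 0.20, "gemini-1.5-flash": 0.15}
    start = time.perf_counter()
    results = asyncio.run(compare("Explain REST vs GraphQL.", models))
    total_ms = (time.perf_counter() - start) * 1000
    return results, total_ms


if __name__ == "__main__":
    results, total_ms = run_demo()
    for r in results:
        print(f"{r['model']}: {r['latency_ms']:.0f} ms")
    print(f"total: {total_ms:.0f} ms")
```

With these simulated delays the total should land near the slowest call (about 200 ms), well under the 450 ms a sequential loop would take.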

Step 03

Get structured results

Receive a unified JSON response with outputs, latency, cost, and token breakdown per model. Compare and decide.
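As a rough illustration of the "compare and decide" step, the sketch below picks the fastest and cheapest model from a response. The response shape is an assumption: the field names (results, latency_ms, cost_usd, tokens) and all numbers are invented for the example, so check the docs for the real schema before relying on them.

```python
import json

# Hypothetical response body. Field names and values are assumptions for
# illustration only, not PromptDiff's documented schema.
SAMPLE_RESPONSE = """
{
  "results": [
    {"model": "gpt-4o-mini", "output": "...", "latency_ms": 820,
     "cost_usd": 0.00021, "tokens": {"input": 18, "output": 64}},
    {"model": "claude-3-haiku", "output": "...", "latency_ms": 640,
     "cost_usd": 0.00034, "tokens": {"input": 19, "output": 71}},
    {"model": "gemini-1.5-flash", "output": "...", "latency_ms": 710,
     "cost_usd": 0.00012, "tokens": {"input": 17, "output": 58}}
  ]
}
"""


def load_results(body: str) -> list[dict]:
    """Parse the JSON body and return the per-model result list."""
    return json.loads(body)["results"]


def best_by(results: list[dict], field: str) -> str:
    """Return the model name with the lowest value for a numeric field."""
    return min(results, key=lambda r: r[field])["model"]


def total_tokens(result: dict) -> int:
    """Sum input and output tokens for one model's result."""
    return result["tokens"]["input"] + result["tokens"]["output"]


if __name__ == "__main__":
    results = load_results(SAMPLE_RESPONSE)
    for r in results:
        print(f"{r['model']:<18}{r['latency_ms']:>6} ms"
              f"  ${r['cost_usd']:.5f}  {total_tokens(r)} tokens")
    print("fastest: ", best_by(results, "latency_ms"))
    print("cheapest:", best_by(results, "cost_usd"))
```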

Simple API, powerful results

Integrate in minutes. Works with any HTTP client.

curl -X POST https://promptdiff.bizmarq.com/api/v1/compare \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer pd_your_api_key" \
  -d '{
    "prompt": "Explain async/await in JavaScript in one paragraph.",
    "models": ["gpt-4o-mini", "claude-3-haiku", "gemini-1.5-flash"],
    "options": {
      "temperature": 0.7,
      "max_tokens": 300
    }
  }'
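The same request can be sent from any language. Here is a minimal Python sketch using only the standard library. The URL, headers, and body fields are copied from the curl example above; the helper names, the 60-second timeout, and the assumption that the response body is JSON are ours.

```python
import json
import urllib.request

API_URL = "https://promptdiff.bizmarq.com/api/v1/compare"


def build_request(api_key: str, prompt: str, models: list[str],
                  temperature: float = 0.7,
                  max_tokens: int = 300) -> urllib.request.Request:
    """Build the POST request without sending it (mirrors the curl example)."""
    payload = {
        "prompt": prompt,
        "models": models,
        "options": {"temperature": temperature, "max_tokens": max_tokens},
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def compare(api_key: str, prompt: str, models: list[str],
            timeout_s: float = 60.0) -> dict:
    """Send the request and decode the body, assuming it is JSON."""
    request = build_request(api_key, prompt, models)
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.load(response)


if __name__ == "__main__":
    # Replace the placeholder key with your own before running.
    result = compare(
        "pd_your_api_key",
        "Explain async/await in JavaScript in one paragraph.",
        ["gpt-4o-mini", "claude-3-haiku", "gemini-1.5-flash"],
    )
    print(json.dumps(result, indent=2))
```

Keeping build_request separate from compare makes it easy to inspect or log the exact payload before anything goes over the network.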

Supported models

All major providers in one comparison. More added regularly.

  • GPT-4o (OpenAI)
  • GPT-4o mini (OpenAI)
  • GPT-4 Turbo (OpenAI)
  • o1-mini (OpenAI)
  • Claude 3.5 Sonnet (Anthropic)
  • Claude 3 Opus (Anthropic)
  • Claude 3 Haiku (Anthropic)
  • Gemini 1.5 Pro (Google)
  • Gemini 1.5 Flash (Google)
  • Llama 3.1 70B (Meta)
  • Mistral Large (Mistral)
  • Mixtral 8x7B (Mistral)

Check the docs for the complete and up-to-date model list.

Simple, transparent pricing

Start free. Pay only for what you use.

Free

$0/month

Perfect for exploration and small projects.

  • 100 evals per month
  • All models supported
  • Full API access
  • JSON responses
  • Eval history (30 days)

Pay-as-you-go

$0.005/eval

For teams and production workloads.

  • Unlimited evals
  • All models supported
  • Full API access
  • Volume discounts
  • Eval history (unlimited)
  • Priority support

Volume discounts

  • 1,000+ evals: $0.004/eval
  • 10,000+ evals: $0.003/eval
  • 100,000+ evals: $0.002/eval
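For budgeting, the rates above can be turned into a quick estimate. This sketch makes two assumptions the page does not confirm: that the discounted rate applies to every eval once your volume reaches a tier (rather than only to evals above the threshold), and that the free 100-eval allowance is not deducted. It covers the PromptDiff fee only, not provider costs.

```python
from decimal import Decimal

BASE_RATE = Decimal("0.005")  # pay-as-you-go price per eval

# (minimum evals, price per eval), highest tier first.
TIERS = [
    (100_000, Decimal("0.002")),
    (10_000, Decimal("0.003")),
    (1_000, Decimal("0.004")),
]


def rate_for(evals: int) -> Decimal:
    """Return the per-eval rate for a given volume (flat-tier assumption)."""
    for threshold, rate in TIERS:
        if evals >= threshold:
            return rate
    return BASE_RATE


def platform_fee(evals: int) -> Decimal:
    """Estimated PromptDiff fee in USD, excluding underlying LLM costs."""
    if evals < 0:
        raise ValueError("evals must be non-negative")
    return rate_for(evals) * evals


if __name__ == "__main__":
    for n in (500, 1_000, 25_000, 100_000):
        print(f"{n:>7} evals -> ${platform_fee(n):.2f}")
```

Under these assumptions, 500 evals come to $2.50, 1,000 to $4.00, 25,000 to $75.00, and 100,000 to $200.00. If tiers turn out to be marginal instead of flat, the totals at higher volumes would be somewhat larger.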

Note: PromptDiff pricing is separate from underlying LLM costs, which are billed by each provider on your behalf.