Question 1

How accurate are these numbers?

Accepted Answer

The model fees come straight from each provider's published rates, with FX rates listed in the assumptions panel. The token sizes per task are based on typical real-world examples for each workflow and complexity level. Treat the result as a strong starting estimate, not a contract. Real invoices move with actual usage, but the order of magnitude is reliable.

Question 2

Why a cost range rather than a single number?

Accepted Answer

Real usage is rarely the same every month. The range reflects normal variation around the typical token sizes for the workflow and complexity you have chosen. The per-task figure shows the midpoint, useful for back-of-envelope thinking.

Question 3

What is not in these numbers?

Accepted Answer

Only the model fee. Not included: prompt caching discounts, batch API discounts (typically 50% off for non-real-time work), Gemini's premium features such as Deep Think, Search Grounding and long-context escalation, third-party data sources, hosting and infrastructure, retries, and any human-review step in the workflow.

Question 4

My workflow is not listed. What can we do?

Accepted Answer

The seven workflows shown cover the patterns we see most often. If your requirement is broadly similar in shape and volume, the closest match will still give you a useful estimate. If it is genuinely different, please get in touch and we will be happy to discuss your project with you.

Question 5

How current are these prices?

Accepted Answer

The verified date shown below the calculator tells you when the rates were last checked against each provider's published pricing page. Rates are reviewed on a regular cadence and the calculator republished. If the date looks old, the underlying providers may have moved since.

Question 6

Which AI model should I pick?

Accepted Answer

The Recommended tag flags the model that best fits the complexity you have chosen, per provider. Beyond that, the right choice depends on factors the calculator cannot see, including your existing setup, your data security requirements, and the level of judgement the work demands. The calculator surfaces the cost. The choice stays yours.

Question 7

Why are only Claude, GPT and Gemini listed?

Accepted Answer

These are the three commercial providers we see most often in real-world deployments. Open-source models (Llama, Mistral) and other commercial options are real choices, but they have different cost structures (typically infrastructure-based rather than per-token) and are not directly comparable in this format.

Question 8

Is there a risk choosing the cheapest AI model?

Accepted Answer

Cheapest is often the right answer for genuinely simple, high-volume tasks. For work that requires judgement, multi-step reasoning, or nuance, the savings on a smaller model can disappear in extra retries, escalations, or output you have to fix.

Question 9

Why are your token assumptions what they are?

Accepted Answer

Each workflow tier maps to a realistic input and output size for that level of complexity. A Quick Review of a short contract uses fewer tokens than a Comprehensive Review of a multi-document agreement. The full breakdown is in the assumptions panel under the calculator.

AI Automation Pricing and Cost Calculator

AI model costs, broken down by business workflow and volume for Claude, GPT and Gemini.

Additional Pricing Notes

Additional AI ModelQuestions You May Have

The model cost is the starting point

Connect with our Automation Practice