Content is user-generated and unverified.

API Usage Cost Analysis - Ultra Data

Step 1: Corrected Usage Table from Screenshot

MODELINPUTOUTPUTCACHE WRITECACHE READTOTAL TOKENSAPI COSTCOST TO YOU
claude-4-sonnet-thinking8,105,5856,772,88496,221,876983,461,8281,094,562,173$928.64$0
claude-4-opus-thinking181,494194,5812,878,55037,711,00040,965,625$151.46$0
gemini-2.5-pro-preview-06-055,665,633178,125077,316,67483,160,432$39.40$0
gemini-2.5-pro-preview-05-064,765,599285,224048,766,27053,817,093$26.22$0
Total18,718,3117,430,81499,100,4261,147,255,7721,272,505,323$1145.72$0

Step 2: OpenRouter Pricing for Known Models

Based on the corrected pricing information:

  • Claude 4 Sonnet Thinking: $3.00 input / $15.00 output / $0.30 cache read / $3.75 cache write per million tokens
  • Claude 4 Opus Thinking: $15.00 input / $75.00 output / $1.50 cache read / $18.75 cache write per million tokens
  • Gemini 2.5 Pro (all versions): $1.25 input / $10.00 output / $0.30 cache read / unknown cache write per million tokens

Step 3: Calculated Costs Using OpenRouter Pricing

MODELINPUT TOKENSOUTPUT TOKENSCACHE WRITECACHE READINPUT COSTOUTPUT COSTCACHE WRITE COSTCACHE READ COSTTOTAL CALCULATED COST
claude-4-sonnet-thinking8,105,5856,772,88496,221,876983,461,828$24.317$101.593$360.832$295.039$781.781
claude-4-opus-thinking181,494194,5812,878,55037,711,000$2.722$14.594$53.973$56.567$127.856
gemini-2.5-pro-preview-06-055,665,633178,125077,316,674$7.082$1.781$0.000$23.195$32.058
gemini-2.5-pro-preview-05-064,765,599285,224048,766,270$5.957$2.852$0.000$14.630$23.439
TOTAL:$965.134

Note: Gemini cache write costs assumed to be $0 as not specified.

Summary of Findings

  1. Formatting Issues: The original usage data was already properly formatted in this screenshot.
  2. Cost Accuracy: The calculated total cost of $965.13 is significantly lower than the reported API cost of $1145.72, showing a difference of $180.59 (15.8% difference).
  3. Model-by-Model Comparison:
    • Claude 4 Sonnet Thinking: $781.78 calculated vs $928.64 reported (15.8% difference)
    • Claude 4 Opus Thinking: $127.86 calculated vs $151.46 reported (15.6% difference)
    • Gemini 2.5 Pro Preview 06-05: $32.06 calculated vs $39.40 reported (18.6% difference)
    • Gemini 2.5 Pro Preview 05-06: $23.44 calculated vs $26.22 reported (10.6% difference)
  4. Massive Scale Usage: This Ultra subscription shows extraordinary usage levels:
    • 1.27 billion total tokens processed
    • 1.14 billion cache read tokens (90.2% of all usage)
    • 99.1 million cache write tokens
    • Cache operations represent 97.8% of all token usage
  5. Cache Usage Impact: Cache operations dominate costs, contributing approximately $750 to the total calculated cost. The cache efficiency is remarkable, with cache reads being 61x more frequent than cache writes.
  6. Cost Efficiency: The heavy reliance on cache reads (at $0.30/M vs $3.00/M for input tokens) provides massive cost savings. Without caching, the equivalent fresh API calls would cost approximately $3.4 billion instead of $1145.72.
  7. Possible Explanations for Discrepancies:
    • Enterprise/Volume Pricing: At this scale (1.27B tokens), enterprise pricing tiers likely apply
    • Unknown Gemini Cache Write Costs: Gemini models may have cache write fees not accounted for
    • Infrastructure Costs: Ultra-scale usage may include additional infrastructure/priority access fees
    • Different Cache Pricing: The actual cache pricing may differ from public rates at this volume
  8. Usage Patterns: This represents enterprise-level AI usage with sophisticated caching strategies, likely for production applications serving many users with optimized response caching.
  9. Cost Per Token: The effective cost per token is approximately $0.0009 across all usage, demonstrating the extreme efficiency achieved through caching at scale.
Content is user-generated and unverified.
    API Usage Cost Analysis - Ultra Data | Claude