What does a coding session / request / step cost, and where does the money go?
Computes the USD-cost distribution behind tab:cost_distribution (src/04_SessionContext.tex).
For each granularity (per session, per request, per step) and each billed category, the paper
table reports the cost as avg / p50 / p90 / p99 plus the category’s share of total spend
(the script also prints the underlying token distributions, incl. p25, to stdout):
- Append tokens —
newly_append_tokens, billed at the fresh-input/cache-write rates. - Prefix tokens —
prefix_tokens, billed at the cache-read rate. - Output tokens —
output_tokens(reasoning included), billed at the output rate. - Total — the sum of the three.