Improve gemini image model cost calculation

right now for gemini (nano banana and nano banana pro), we only count pricing by the actual output image, however we are missing some more things

in gemini docs, they break down the pricing into input and output tokens:

input tokens:
- the actual text prompt used to generate image
- image input (if doing image editing/ img2img)

output tokens:
- the actual image output (what we have right now)
- optional reasoning token, although this is specific to gemini 3 pro image only

https://ai.google.dev/gemini-api/docs/pricing#gemini-2.5-flash-image
https://ai.google.dev/gemini-api/docs/pricing#gemini-3-pro-image-preview

note:
- some more caveat, both these models support text output, so technically it can generate text
- i'm not sure how our txt2img implementation would allow this, my concern is mostly if it does generate text output, and we aren't counting it towards the output tokens

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve gemini image model cost calculation #2348

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve gemini image model cost calculation #2348

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions