The Nano Banana Lite API is powered by the Gemini 3.1 Flash-Lite model and delivers image generation in under 5 seconds. This lite vision tool is optimized for high-velocity workflows, offering 1K resolution and native image-to-image editing at scale.
$ 0.0202
$ 0.0336
text
image
Explore why the Nano Banana Lite API is the leading choice for high-scale vision tasks.
Native Visual Editing
Modify images using natural language. Add, remove, or change elements within your visuals instantly. Ideal for A/B testing creative variations and iterating on designs at low cost.
Prompt: Wide-angle landscape photograph of a colossal Dyson Sphere mega-structure under construction around a dying blue dwarf star. Millions of geometric solar mirror panels and orbital habitats connected by glowing energy tethers. A massive industrial mining cruiser ship in the foreground shows insane levels of "greeble" (small mechanical details), antennae, and engine exhaust plumes. Harsh sunlight contrasting with deep space shadows. Scale is unimaginable. Hard sci-fi aesthetic, photorealistic, 8k.
High-Throughput Vision OCR
Optimized for scanning large document batches. Extract tabular data and identify UI components from screenshots with high accuracy and minimal processing time for enterprise workflows.
Prompt: Anime still frame, "sakuga" animation style, intense dynamic action shot. A cyber-samurai girl with glowing energy katana clashes mid-air against a massive, multi-limbed biomechanical Oni (demon) atop a crumbling Shibuya 109 building. Explosion debris, broken glass, electrical discharge, and rain streaks are frozen around them. The background is a chaotic, destroyed Neo-Tokyo with thousands of neon signs. Dramatic angles, high detail line work, vibrant color palette, dramatic lighting, 4k resolution.
Cost-Efficient Scaling
At $2.00 per 1M input tokens and $0.034 per image, this is the most economical vision API available. Maximize your ROI on high-volume projects without sacrificing multimodal performance.
Prompt: Macro photography shot. Inside an old, dusty vintage lightbulb, there is a fully functioning, multi-layered steampunk city. Tiny brass gears, steam pipes, and clockwork towers are intricately detailed. Miniature airships float around the filament. The glass of the bulb reflects the room around it (a cluttered inventor's workshop). Shallow depth of field, bokeh background, warm tungsten lighting, hyper-realistic, metallic textures.
Sub-5 Second Generation
Experience ultra-low latency with image generation speeds of roughly 4 seconds per 1K image. This is 2.7x faster than flagship models, perfect for real-time user-facing apps.
Prompt: A young foreign man standing on a city rooftop at sunrise, holding a shepherd’s staff, surrounded by small fluffy clouds like sheep, dreamy surreal fantasy, soft pastel sky.
How to Get a gemini-3.1-flash-lite-image API Key
Getting a gemini-3.1-flash-lite-image API key takes four steps and a few minutes: create a free GPT Proto account, add credits, generate your key, and make your first call. At $0.0202, access through GPT Proto costs less than going direct, and one key works across every model on the platform. Full gemini-3.1-flash-lite-image documentation is available in the docs.
Sign up
Create your free GPT Proto account to begin. You can set up an organization for your team at any time.
Top up
Your balance can be used across all models on the platform, including gemini-3.1-flash-lite-image, giving you the flexibility to experiment and scale as needed.
Generate your API key
In your dashboard, create an API key — you'll need it to authenticate when making requests to gemini-3.1-flash-lite-image.
Make your first API call
Use your API key with our sample code to send a request to gemini-3.1-flash-lite-image via GPT Proto and see instant AI-powered results.
The Nano Banana Lite API is built for velocity. It generates a 1K resolution image in approximately 4 seconds, roughly 2.7 times faster than the standard Flash model. That makes it well suited to real-time applications and high-throughput production environments where latency is the primary operational constraint.
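To make the latency figures concrete, here is a minimal back-of-the-envelope sketch. The function names are hypothetical, and the numbers are only the approximate ones quoted above. It assumes requests run fully in parallel with no rate limits or network overhead, and it reads "2.7 times faster" as "the standard Flash model takes about 2.7 times as long", which is an interpretation rather than a published figure.

```python
# Approximate figures quoted in the text above; not measured values.
SECONDS_PER_IMAGE = 4.0
SPEEDUP_VS_STANDARD_FLASH = 2.7


def images_per_hour(concurrent_requests=1, seconds_per_image=SECONDS_PER_IMAGE):
    """Upper-bound throughput: assumes ideal parallelism and no rate limiting."""
    if concurrent_requests < 1 or seconds_per_image <= 0:
        raise ValueError("need at least one request and a positive latency")
    return concurrent_requests * 3600 / seconds_per_image


def implied_standard_flash_latency(seconds_per_image=SECONDS_PER_IMAGE,
                                   speedup=SPEEDUP_VS_STANDARD_FLASH):
    """Latency of the slower model implied by the quoted speedup (an assumption)."""
    return seconds_per_image * speedup


print(f"1 worker:   {images_per_hour(1):.0f} images/hour")
print(f"10 workers: {images_per_hour(10):.0f} images/hour")
print(f"Implied standard Flash latency: {implied_standard_flash_latency():.1f} s")
```

Real throughput will be lower than these ceilings once queueing, retries, and account-level limits come into play.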
Does this lite model support Google Search grounding?
No. Unlike the standard Flash or Pro versions, the Nano Banana Lite API does not support grounding with Google Search. It relies on its internal knowledge, which has a cutoff of May 2024. If your workflow requires real-time factual verification or image generation based on current web events, we recommend using the standard Nano Banana 2 model instead to ensure the most up-to-date visual context.
What image resolutions are supported by Nano Lite?
To maintain its ultra-low latency profile, the Nano Banana Lite API specifically supports 1K resolution outputs. While other models in the family can scale up to 4K, this lite variant is hyper-optimized for 1024x1024 visuals. This limitation ensures the model remains the fastest and most cost-effective choice for developers building high-volume applications at scale.
Is SynthID watermarking included in Nano images?
Yes. Every image generated via the Nano Banana Lite API includes an invisible, tamper-resistant SynthID watermark. This ensures compliance with AI transparency standards and provides a layer of safety for enterprise users. The watermark is integrated natively during the generation process without affecting the visual quality of the 1K output or increasing the latency for your application.
Can I use this API for image-to-image editing?
Absolutely. The Nano Banana Lite API supports native image-to-image editing. You can provide a reference image and use natural language prompts to modify specific elements, change color grading, or add objects. It is designed for high-velocity visual reasoning, though it is not optimized for multi-turn sequential editing or processing more than 14 reference inputs simultaneously.
What are the input and output pricing tiers?
Pricing for the Nano Banana Lite API is highly competitive. Input tokens (text or image) are priced at $2.00 per 1M tokens. Text output is $10.00 per 1M tokens, and generated 1K images cost $0.034 each. We also offer a 50% discount on input tokens for cached context hits, making it the most economical choice for long-running agent sessions and large-scale visual processing tasks.
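The rates above can be combined into a simple cost estimate. The sketch below is illustrative only: `estimate_cost` and its parameters are hypothetical names, the rates are copied from the answer above, and it assumes the 50% cache discount applies per cached input token with no minimums or other fees. Actual billing may differ.

```python
# Rates quoted in the pricing answer above (USD).
INPUT_PER_MILLION = 2.00         # input tokens, text or image
TEXT_OUTPUT_PER_MILLION = 10.00  # text output tokens
PRICE_PER_IMAGE = 0.034          # each generated 1K image
CACHE_DISCOUNT = 0.50            # discount on cached input tokens


def estimate_cost(input_tokens, cached_input_tokens=0,
                  output_text_tokens=0, images=0):
    """Estimate the USD cost of a workload under the assumptions stated above."""
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    fresh_tokens = input_tokens - cached_input_tokens
    # Cached tokens are billed at the discounted rate (assumption).
    billable_input = fresh_tokens + cached_input_tokens * (1 - CACHE_DISCOUNT)
    input_cost = billable_input * INPUT_PER_MILLION / 1_000_000
    output_cost = output_text_tokens * TEXT_OUTPUT_PER_MILLION / 1_000_000
    image_cost = images * PRICE_PER_IMAGE
    return input_cost + output_cost + image_cost


# Example workload: 1M input tokens (400k of them cached),
# 100k text output tokens, and 1,000 generated images.
total = estimate_cost(1_000_000, cached_input_tokens=400_000,
                      output_text_tokens=100_000, images=1_000)
print(f"Estimated cost: ${total:.2f}")
```

In this example the generated images account for $34.00 of the estimate, so image count rather than token volume dominates the bill for generation-heavy workloads.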