I haven’t been too lively in utilizing AI to generate pictures. Principally as a result of not one of the AI instruments up to now are in a position to ship the extent of high quality I search in my pictures. And so, the onus of producing pictures for my blogs principally falls on my teammate and an distinctive graphic designer Armaan. Although for this explicit weblog, I gave Armaan a break. All because of Google’s new “Nano-Banana” aka Gemini 2.5 Flash Picture.
That’s what the most recent picture technology mannequin from the home of Gemini is known as. Greater than a picture technology, the Gemini 2.5 Flash Picture or nano-banana comes as a picture enhancing mannequin. Why? The replace now permits you to mix a number of pictures right into a single picture. Not less than that’s what it claims to do, as talked about in a Google weblog introducing the brand new Gemini image-gen mannequin.
If it does what it says and does it nicely, I personally know a dozen AI fanatics who shall be past excited to present it a attempt. In case you’re one such aficionado of AI picture technology, right here is all it’s worthwhile to know in regards to the new Gemini 2.5 Flash Picture, together with a hands-on experiment I attempted on it.
What’s Gemini 2.5 Flash Picture?
In a nutshell, Gemini 2.5 Flash Picture is a brand new picture technology and enhancing mannequin by Google. It comes as an replace to the a lot hailed Gemini 2.0 Flash Picture picture technology mannequin, making it quicker and higher, with added image-editing options.
For context, the brand new Gemini 2.5 Flash Picture can mix a number of pictures into one, preserve character consistency, and make focused transformations in a picture. As Google places it, the mannequin makes use of Gemini’s “world information to generate and edit pictures.”
Allow us to discover these options in a bit extra element.
Gemini 2.5 Flash Picture Options
Preserve character consistency
Nearly anybody who has used AI to generate pictures will relate to this. Particular person picture prompts often serve you nicely. However as quickly as you need a explicit component proven as it’s throughout a set of pictures, nearly all identified AI fashions falter.
Google claims to have labored on that, with the brand new Gemini 2.5 Flash Picture. Here’s what it says for the nano-banana’s functionality on this regard:
“Now you can place the identical character into totally different environments, showcase a single product from a number of angles in new settings, or generate constant model belongings, all whereas preserving the topic.”
Has it nailed it? Discover out shortly in my hands-on experiment with the mannequin.
Immediate based mostly picture enhancing
That is very like “photoshopping” a specific component in/ out of a picture, however by way of easy prompts. Gemini 2.5 Flash Picture can “blur the background of a picture, take away a stain in a t-shirt, take away a complete particular person from a photograph,” and way more, as Google claims.
The exact native edits with pure language prompts make it a strong AI mannequin, supplied it does it nicely. Don’t you are worried. We will it attempt it out in our hands-on quickly sufficient.
Multi-image fusion
This appears to be the brand new hot-new replace for the Gemini 2.5 Flash Picture. The nano-banana can “perceive and merge a number of enter pictures” as per Google, which principally means you may fuse pictures with a single immediate now. Put an individual who missed the occasion in a gaggle picture, place an object right into a scene, or some other use you may consider. All with a easy immediate.
Palms-on loading. Keep tuned!
Native world information
As I identified in my article – Why AI Lacks Widespread Sense, AI fashions lack a deep, semantic understanding of the actual world. Nearly the entire aesthetic-images they produce are a results of their coaching information. Google claims to have bypassed this, as the brand new nano-banana mannequin “advantages from Gemini’s world information.”
With out sharing any additional data on how this was made attainable, Google says that this skill unlocks new use-cases for the Gemini 2.5 Flash Picture mannequin. No worries, we’ll check it out for all its price quickly.
However earlier than that, right here is how one can entry the brand new Gemini Flash 2.5 Picture
Gemini Flash 2.5 Picture Accessibility
The brand new Google nano-banana or the Gemini Flash 2.5 Picture is obtainable proper now through the Gemini app or web site for particular person customers.
For Particular person Customers
You may merely obtain the app in your cellphone, or go to gemini.google.com and entry the mannequin by choosing 2.5 Flash on the prime and clicking on “Create Photographs” below the Instruments part.
For Builders
Builders can entry the identical by way of Gemini API and Google AI Studio, whereas enterprise shoppers can entry the device by way of Vertex AI.
For entry by way of API, observe these steps:
1. Set up Required Libraries
Set up the most recent model of the Google Generative AI SDK utilizing pip, and guarantee you may have PIL (Pillow) put in for picture processing:
pip set up google-generativeai pillow
This command will import all required libraries in the beginning of your script for easy execution.
2. Authenticate Your API Entry
Arrange authentication by exporting your API key (if required) as an setting variable or authenticating through cloud identification suppliers, so the genai SDK can entry Google’s Generative AI endpoints:
import os
os.environ["GOOGLE_API_KEY"] = ""
Make it possible for your API account is lively and has adequate quota for requests.
3. Configure and Name the Mannequin
Initialize the consumer, set the mannequin to “gemini-2.5-flash-image-preview”, and move a related immediate utilizing the SDK’s content material technology technique:
from google import genai
consumer = genai.Shopper()
immediate = "Create an image of a nano banana dish in a elaborate restaurant with a Gemini theme"
response = consumer.fashions.generate_content(
mannequin="gemini-2.5-flash-image-preview",
contents=[prompt],
)
4. Parse and Save the Generated Picture
Iterate by way of the response to test for textual content and picture outputs, then save the generated picture utilizing PIL:
from PIL import Picture
from io import BytesIO
for half in response.candidates.content material.elements:
if half.textual content shouldn't be None:
print(half.textual content)
elif half.inline_data shouldn't be None:
picture = Picture.open(BytesIO(half.inline_data.information))
picture.save("generated_image.png")
For entry by way of Google AI Studio, merely use the “construct mode” on Google AI Studio right here. As soon as on the platform, you may merely enter a immediate to design the form of picture technology/ enhancing app based mostly on Gemini 2.5 Flash Picture you need. Assume face-swap or clothe-swap.
Gemini Flash 2.5 Picture Palms-on
Since AI picture technology utilizing prompts is fairly simple, I made a decision to attempt the brand new Gemini Flash 2.5 Picture throughout its USPs, aside from, in fact, primary picture technology. Take a look at the outcomes for your self:
1. Producing new pictures
Listed below are two pictures I generated utilizing Gemini 2.5 Professional Flash. One in every of a 25-year previous boy from the hills of Himachal Pradesh, and one other of a Gaddi canine. Listed below are the prompts:
Immediate 1: give me a portrait picture of an indian boy, across the age of 25, hailing from Himachal Pradesh. The boy has dense, black hair and light-weight inexperienced eyes, and is carrying a brilliant cool strong White shirt
Immediate 2: give me a portrait picture of a pahadi gaddi canine, black in color with a thick fur and an even bigger measurement as in comparison with a daily avenue canine
Output:
Whereas the outcomes clearly appear AI-generated, I’d give 10/10 to nano-banana for the outcomes. That is merely due to the tremendous prime quality of pictures, correct depictions of the immediate, and the truth that I by no means talked about within the immediate to make the pictures too real looking.
2. Merging Photographs
For consistency, I performed round with these two pictures for the remainder of assessments. So I requested the mannequin to merge the 2 pictures of the boy and the canine in a scenic background. Right here is the immediate:
Immediate: present the boy and the canine taking part in in an enormous open discipline towards the backdrop of a river flowing in a valley and snow-capped mountains on the opposite aspect of the river
Output:

It took a while for the AI mannequin to search out its option to the best output. With the preliminary outcomes having a bit off proportions, there was a from side to side of prompts and outcomes earlier than I discovered what I used to be searching for. However as soon as I did, boy was I impressed! The whole lot all the way down to the final element was as correct as I had talked about and as aesthetic as I might’ve requested for.
3. Picture Modifying
After picture technology and merger, comes the enhancing half. To check these out, I first requested Gemini Flash 2.5 Picture so as to add a small hut within the picturesque scene. Right here is the immediate:
Immediate: on this picture, present a small shed close to the river at a distance
Output:

Flawless execution within the first attempt. It appears evident that Google has labored massively on the nano-banana, particularly within the areas the place it highlights it to excel.
4. Sustaining Character Consistency
Onto the hardest half for most AI fashions. Truthfully, I used to be fairly sceptical about this one USP. So naturally, I used to be additionally most excited to attempt it out. I requested the mannequin to alter your entire setting within the image to that of a seashore. Right here is the immediate:
Immediate: present the boy and the canine operating as they’re in a seashore as a substitute of the present setting. the boy is now carrying shorts and a vest, and has a tattoo throughout his left-arm
Output:

Nothing wanting spectacular, Gemini 2.5 Flash Picture managed to do a reasonably respectable job. Although it took repeated makes an attempt to reach at an aesthetic consequence, the brand new picture mannequin was in a position to replicate the main points of the boy and the canine with nearly full accuracy. Should you look intently, the slight distinction that you’ll discover shall be that within the boy’s coiffure. Relaxation every little thing seems nearly precisely just like me as within the earlier setting. I even tried the “tattoo on one-arm” thought to see how Gemini executes it. All I can say is Kudos to the mannequin for a job nicely achieved!
5. Constructing Gemini 2.5 Flash Picture Powered App
To check Gemini 2.5 Flash Picture on an app for repeated picture technology and enhancing, we used the next immediate on Google AI Studio’s “construct mode”
Immediate: Create a Strive on App, the place customers can add their photos and take a look at varied garments to see how they appear in them.
Output: You may take a look at the try-on app we constructed right here.
Conclusion
With a single collection of image-generation and enhancing on the brand new Gemini 2.5 Flash Picture, the updates and the improved capabilities are just about evident to me. What I’m particularly thrilled about is the brand new picture merging skill of the mannequin. I can foresee n-number of eventualities the place that may be useful for me personally. As for many different content material creators on the market, picture enhancing and character consistency on the brand new Gemini 2.5 Flash Picture will show to be a game-changer.
We incessantly attempt our palms on such new AI fashions to check the bounds of their capabilities. So, make sure that to remain tuned to this area to be up to date with the most recent on the planet of AI.
Login to proceed studying and luxuriate in expert-curated content material.