Offset: 0.0s
Space Play/Pause

Nano Banana Pro in ComfyUI! Multi-Image Reference, No Model Download, 4k Generation, Cheaper Price!

This guide will walk you through the process of integrating and utilizing the “NanoBanana Pro” model within your ComfyUI workflow for advanced image generation. By understanding the core concepts a…

5 min read

Enhancing Your ComfyUI Workflow with the NanoBanana Pro Model

This guide will walk you through the process of integrating and utilizing the “NanoBanana Pro” model within your ComfyUI workflow for advanced image generation. By understanding the core concepts and practical applications, you can leverage this powerful tool to achieve more refined and creative visual outputs.

Prerequisites and Setup

[00:00] [ComfyUI Interface with NanoBanana Pro Node]

To get started, ensure you have “ComfyUI” installed. As demonstrated in the video, the process for using “NanoBanana Pro” is virtually identical to using “NanoBanana” with “Comfy Dream.” If you haven’t yet installed “ComfyUI,” numerous tutorials are available to guide you through the setup. It’s also recommended to keep your “ComfyUI” installation updated to the latest version.

[00:49] [ComfyUI Manager Window]

If your “ComfyUI” installation is not up-to-date, access the “ComfyUI Manager” by clicking on “Manager.” From there, select “Update All” to initiate the update process. Once the update is complete, restart “ComfyUI” and refresh your browser.

Understanding NanoBanana Pro and Pricing

[01:00] [ComfyUI Node Menu Showing NanoBanana Options]

After updating, you should see the “NanoBanana Pro (Google Gemini Image)” option available when you click on the canvas. This is the node we’ll be focusing on. Previously, we discussed the “Gemini Pro” image generation model. The key difference now is the cost associated with using “NanoBanana Pro.”

[01:14] [NanoBanana Pro Node with Parameters]

“NanoBanana Pro” costs approximately “13 cents per image generated.” It’s crucial to understand that you are paying “Google” for this service, not the “ComfyUI” developers. To use this feature, you need to be logged into your “ComfyUI” account and have a balance loaded.

[01:30] [ComfyUI Settings Menu]

You can verify your credit balance by navigating to “Settings,” then “User,” and finally the “Credits” section. Even a small amount, like “$5,” can provide a significant number of image generations, roughly “38 images.”

[01:46] [Comparison of NanoBanana Pro and NanoBanana 1]

When comparing “NanoBanana Pro” to “NanoBanana 1,” the “Pro” version offers a roughly “230% increase in cost.” While the pricing might seem steep, the advanced capabilities and potentially higher quality outputs can justify the expense for certain projects.

Building the Workflow

[02:37] [Load Image Node with Image Preview]

To begin building our workflow, we’ll start by adding a “Load Image” node, similar to our previous tutorials. For this demonstration, we’ll use the same character image we used in the “LoRA training” video.

[02:55] [Workflow with Load Image and Batch Nodes]

We will then connect these “Load Image” nodes to a “Batch Image” node. This setup allows us to feed multiple images into the “NanoBanana Pro” model simultaneously. It’s important to note that while the “Batch Image” node visually combines the images, the “NanoBanana Pro” node treats each image individually for processing.

[03:04] [Workflow with Batch Image and NanoBanana Pro Nodes]

The “Batch Image” node effectively prepares the input for the “NanoBanana Pro” node, allowing for efficient handling of multiple image inputs.

[03:37] [Final Workflow Setup for Image Generation]

The completed workflow involves connecting the “Batch Image” output to the “NanoBanana Pro” node, which in turn feeds into a “Save Image” node. This structure ensures that the generated images are processed and saved correctly.

Understanding Model Parameters and Resolution

[04:14] [NanoBanana Pro Node Parameters Detailed]

When utilizing the “NanoBanana Pro” model, understanding the parameters is crucial for achieving desired results. The cost of generation is directly influenced by the “resolution” setting. For instance, generating at “2K” resolution costs “$0.134,” while stepping up to “4K” increases the cost to “$0.24,” which is a significant jump.

[04:40] [Close-up of Resolution Setting in NanoBanana Pro Node]

While the quality difference between “2K” and “4K” might not always be drastically noticeable in the final output, the cost implication is substantial. It’s often advisable to experiment with “2K” resolution first and consider upscaling later if higher detail is required.

Advanced Prompting and Parameter Tuning

[06:30] [Example Prompt for Architectural Design]

The video showcases an example prompt focused on architectural visualization: “add a pumpkin and a small glass of lemonade in the green circle. Add pumpkins in the yellow circles. Remove the planter marked by the black X. Turn the cushion chair slightly according to the red arrows. Change the cushion fabric from tan to the attached yellow floral pattern. The woman in the attached image is sitting in the chair with her leg crossed over her knee smiling at the camera.”

[07:35] [Example Prompt with Annotation Overlays]

This detailed prompt demonstrates the model’s ability to handle specific object placements, material changes, and pose adjustments, highlighting its flexibility and precision.

[08:38] [Prompt for Detailed Architectural Model Generation]

Another complex prompt example involves creating an “architectural scale model of people,” accurately reflecting the “blueprints” and placing them on a desk. This illustrates the model’s capability in generating detailed and contextually relevant scenes.

Troubleshooting and Best Practices

[06:35] [API Error Message in ComfyUI]

Encountering “API errors” is a common occurrence. When this happens, it’s often due to issues like sending too many images or attempting to generate at resolutions that exceed the model’s limits.

[06:43] [Troubleshooting Steps: Show Report and Find Issue]

If you face errors, first check the “Show Report” and “Find Issue” options for more details. Often, adjusting the “aspect ratio” or reducing the number of input images can resolve these issues. For instance, trying to process “12K” worth of data in one go might overload the system.

[09:56] [Successful Image Generation with Multiple Inputs]

The model’s ability to process multiple inputs and maintain “character consistency” is a remarkable feature. As seen in the examples, “NanoBanana Pro” can effectively handle diverse prompts and generate high-quality, consistent results.

Conclusion

The “NanoBanana Pro” model in “ComfyUI” offers advanced capabilities for image generation, particularly in tasks requiring detailed object manipulation and character consistency. By understanding its pricing structure, parameters, and best practices for prompt engineering, you can unlock its full potential for your creative projects. Remember to experiment, troubleshoot effectively, and always enjoy the creative process!