Skip to content

Visionatrix/ComfyUI-Gemini

Repository files navigation

ComfyUI-Gemini

These nodes integrate the Gemini API into ComfyUI, allowing you to send prompts and images to Gemini AI models.

Features

  • Error Fallback Value: Specify a fallback value to use if there's an error accessing the Gemini API.
  • Response Type Selection: Choose between receiving responses in plain text or json format.
  • Image Support: Include up to three images(or batches) with your prompt to provide visual context.
  • Custom System Instructions: Set a system instruction to guide the AI's responses.
  • Adjustable Safety Settings: Control the content filtering level to block inappropriate content.

Setting Up the Gemini API Key

  1. Obtain API Key:
    • Sign up for access to the Gemini API through Google Cloud Platform.
    • Create a new API key with permissions to access the Gemini models.
  2. Set the API Key:
    • Option 1: Set it in your environment variables:
      • On Windows:
        set GOOGLE_API_KEY=your_api_key_here
      • On Linux/macOS:
        export GOOGLE_API_KEY=your_api_key_here
    • Option 2: Provide it directly in the node's api_key input.

Inputs Explained

  • prompt (STRING):
    • Your main question or instruction for the AI.
    • Example: "What are the health benefits of green tea?"
  • safety_settings (CHOICE):
    • Controls the filtering of the response content.
    • Options:
      • BLOCK_NONE: No filtering.
      • BLOCK_ONLY_HIGH: Blocks only high-risk content.
      • BLOCK_MEDIUM_AND_ABOVE: Blocks medium and high-risk content.
  • response_type (CHOICE):
    • Determines the format of the AI's response.
    • Options:
      • text: Plain text response.
      • json: Response in JSON format.
  • model (CHOICE):
    • Selects the Gemini model to use.
    • Options:
      • gemini-1.5-flash-002: Standard model.
      • gemini-1.5-pro-002: Pro model with advanced capabilities.
  • api_key (STRING, Optional):
    • Your Gemini API key.
    • Recommended to set via environment variable.
  • proxy (STRING, Optional):
    • Proxy server URL if you need to route requests through a proxy. See: Why use Proxy?
  • image_1, image_2, image_3 (IMAGE, Optional):
    • Images or image batches to include with your prompt.
    • Useful for tasks like image captioning or visual question answering.
  • system_instruction (STRING, Optional):
    • Sets a system-level instruction to influence the AI's behavior.
    • Example: "You are a helpful assistant that provides concise answers."
  • error_fallback_value (STRING, Optional, Lazy Input):
    • Value to return if an error occurs when accessing Gemini.
    • If not set, the node will raise an exception on error.

Race and Gender Detection