Split some nodes of the dependencies that are prone to problems into ComfyUI_LayerStyle_Advance repository. Including:
LayerMask: BiRefNetUltra, LayerMask: BiRefNetUltraV2, LayerMask: LoadBiRefNetModel, LayerMask: LoadBiRefNetModelV2,
LayerMask: EVFSAMUltra, LayerMask: Florence2Ultra, LayerMask: LoadFlorence2Model, LayerUtility: Florence2Image2Prompt,
LayerUtility: GetColorTone, LayerUtility: GetColorToneV2, LayerMask: HumanPartsUltra, LayerMask: BenUltra, LayerMask: LoadBenModel,
LayerUtility: ImageAutoCrop, LayerUtility: ImageAutoCropV2, LayerUtility: ImageAutoCropV3,
LayerUtility: ImageRewardFilter, LayerUtility: LoadJoyCaption2Model, LayerUtility: JoyCaption2Split,
LayerUtility: JoyCaption2, LayerUtility: JoyCaption2ExtraOptions, LayerUtility: LaMa,
LayerUtility: LlamaVision, LayerUtility: LoadPSD, LayerMask: MaskByDifferent, LayerMask: MediapipeFacialSegment,
LayerMask: BBoxJoin, LayerMask: DrawBBoxMask, LayerMask: ObjectDetectorFL2, LayerMask: ObjectDetectorMask,
LayerMask: ObjectDetectorYOLO8, LayerMask: ObjectDetectorYOLOWorld, LayerMask: PersonMaskUltra, LayerMask: PersonMaskUltra V2,
LayerUtility: PhiPrompt, LayerUtility: PromptEmbellish, LayerUtility: PromptTagger, LayerUtility: CreateQRCode, LayerUtility: DecodeQRCode,
LayerUtility: QWenImage2Prompt, LayerMask: SAM2Ultra, LayerMask: SAM2VideoUltra, LayerUtility: SaveImagePlus, LayerUtility: SD3NegativeConditioning,
LayerMask: SegmentAnythingUltra, LayerMask: SegmentAnythingUltra V2, LayerMask: TransparentBackgroundUltra,
LayerUtility: UserPromptGeneratorTxt2ImgPrompt, LayerUtility: UserPromptGeneratorTxt2ImgPromptWithReference, LayerUtility: UserPromptGeneratorReplaceWord,
LayerUtility: AddBlindWaterMark, LayerUtility: ShowBlindWaterMark, LayerMask: YoloV8Detect
If there are recent updates, you need to install ComfyUI_LayerStyle_Advance to ensure that previous workflows do not lose nodes.
If the problem is caused by splitting the warehouse, please roll back the plugin version to3d4a3526a9d1a19671a133e9215077bda520ee5d
Open the terminal in the plugin directory and use the following command to roll back the version:
git reset --hard 3d4a3526a9d1a19671a133e9215077bda520ee5d
商务合作请联系email [email protected].
For business cooperation, please contact email [email protected].
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
It migrate some basic functions of PhotoShop to ComfyUI, aiming to centralize the workflow and reduce the frequency of software switching.
*this workflow (title_example_workflow.json) is in the workflow directory.
Some JSON workflow files in the workflow
directory, That's examples of how these nodes can be used in ComfyUI.
(Taking ComfyUI official portable package and Aki ComfyUI package as examples, please modify the dependency environment directory for other ComfyUI environments)
-
Recommended use ComfyUI Manager for installation.
-
Or open the cmd window in the plugin directory of ComfyUI, like
ComfyUI\custom_nodes
,typegit clone https://github.com/chflame163/ComfyUI_LayerStyle.git
-
Or download the zip file and extracted, copy the resulting folder to
ComfyUI\custom_nodes
-
for ComfyUI official portable package, double-click the
install_requirements.bat
in the plugin directory, for Aki ComfyUI package double-click on theinstall_requirements_aki.bat
in the plugin directory, and wait for the installation to complete. -
Or install dependency packages, open the cmd window in the ComfyUI_LayerStyle plugin directory like
ComfyUI\custom_nodes\ComfyUI_LayerStyle
and enter the following command,
for ComfyUI official portable package, type:
..\..\..\python_embeded\python.exe -s -m pip install -r requirements.txt
.\repair_dependency.bat
for Aki ComfyUI package, type:
..\..\python\python.exe -s -m pip install -r requirements.txt
.\repair_dependency_aki.bat
- Restart ComfyUI.
Chinese domestic users from BaiduNetdisk and other users from huggingface.co/chflame163/ComfyUI_LayerStyle
download all files and copy them to ComfyUI\models
folder. This link provides all the model files required for this plugin.
Or download the model file according to the instructions of each node.
Some nodes named "Ultra" will use the vitmatte model, download the vitmatte model and copy to ComfyUI/models/vitmatte
folder, it is also included in the download link above.
If the node cannot load properly or there are errors during use, please check the error message in the ComfyUI terminal window. The following are common errors and their solutions.
This warning message indicates that the ini file cannot be found and does not affect usage. If you do not want to see these warnings, please modify all *.ini.example
files in the plugin directory to *.ini
.
This error is caused by incorrect version of the opencv-contrib-python
package,or this package is overwriteen by other opencv packages.
The reason for the problem is the same as above.
For the issues with the above, please double click repair_dependency.bat
(for Official ComfyUI Protable) or repair_dependency_aki.bat
(for ComfyUI-aki-v1.x) in the plugin folder to automatically fix them.
This error is caused by the low version of transformers
package.
This error is caused by the low version of protobuf
package.
onnxruntime::python::CreateExecutionProviderInstance CUDA_PATH is set but CUDA wasn't able to be loaded. Please install the correct version of CUDA and cuDNN as mentioned in the GPU requirements page
Solution:
Reinstall the onnxruntime
dependency package.
Check the network environment. If you cannot access huggingface.co normally in China, try modifying the huggingface_hub package to force the use hf_mirror.
-
Find
constants.py
in the directory ofhuggingface_hub
package (usuallyLib/site packages/huggingface_hub
in the virtual environment path), Add a line afterimport os
os.environ['HF_ENDPOINT'] = 'https://hf-mirror.com'
This error is caused by the mask area being too large or too small when using the PyMatting
method to handle the mask edges.
Solution:
- Please adjust the parameters to change the effective area of the mask. Or use other methods to handle the edges.
When this error has occurred, please check the network environment.
**If the dependency package error after updating, please double clicking repair_dependency.bat
(for Official ComfyUI Protable) or repair_dependency_aki.bat
(for ComfyUI-aki-v1.x) in the plugin folder to reinstall the dependency packages.
-
Commit NameToColor node, Output colors based on their names.
-
Commit ImageMaskScaleAsV2 node, Add background color settings on the basis of the original node.
-
Commit RoundedRectangle node, Used to create rounded rectangle and mask.
-
Commit AnyRerouter node, Used for reroute any type of data.
-
Commit ICMask and ICMaskCropBack nodes, Used for generating In-Context image and mask, and automatic crop back. The code is from lrzjason/Comfyui-In-Context-Lora-Utils , Thanks to the original author @小志Jason.
-
Commit GetMainColorsV2 node, add sorting by color area and output color values and proportions in the preview image. This part of the code was improved by @ HL, thanks.
-
Optimize dependency packages. Optimize some algorithms.
-
Split some nodes of the dependencies that are prone to problems into ComfyUI_LayerStyle_Advance repository. Including: LayerMask: BiRefNetUltra, LayerMask: BiRefNetUltraV2, LayerMask: LoadBiRefNetModel, LayerMask: LoadBiRefNetModelV2,
LayerMask: EVFSAMUltra, LayerMask: Florence2Ultra, LayerMask: LoadFlorence2Model, LayerUtility: Florence2Image2Prompt,
LayerUtility: GetColorTone, LayerUtility: GetColorToneV2, LayerMask: HumanPartsUltra,
LayerUtility: ImageAutoCrop, LayerUtility: ImageAutoCropV2, LayerUtility: ImageAutoCropV3,
LayerUtility: ImageRewardFilter, LayerUtility: LoadJoyCaption2Model, LayerUtility: JoyCaption2Split,
LayerUtility: JoyCaption2, LayerUtility: JoyCaption2ExtraOptions, LayerUtility: LaMa,
LayerUtility: LlamaVision, LayerUtility: LoadPSD, LayerMask: MaskByDifferent, LayerMask: MediapipeFacialSegment,
LayerMask: BBoxJoin, LayerMask: DrawBBoxMask, LayerMask: ObjectDetectorFL2, LayerMask: ObjectDetectorMask,
LayerMask: ObjectDetectorYOLO8, LayerMask: ObjectDetectorYOLOWorld, LayerMask: PersonMaskUltra, LayerMask: PersonMaskUltra V2,
LayerUtility: PhiPrompt, LayerUtility: PromptEmbellish, LayerUtility: PromptTagger, LayerUtility: CreateQRCode, LayerUtility: DecodeQRCode,
LayerUtility: QWenImage2Prompt, LayerMask: SAM2Ultra, LayerMask: SAM2VideoUltra, LayerUtility: SaveImagePlus, LayerUtility: SD3NegativeConditioning,
LayerMask: SegmentAnythingUltra, LayerMask: SegmentAnythingUltra V2, LayerMask: TransparentBackgroundUltra,
LayerUtility: UserPromptGeneratorTxt2ImgPrompt, LayerUtility: UserPromptGeneratorTxt2ImgPromptWithReference, LayerUtility: UserPromptGeneratorReplaceWord,
LayerUtility: AddBlindWaterMark, LayerUtility: ShowBlindWaterMark, LayerMask: YoloV8Detect -
Merge the PR submitted by alexisrolland , commit the
Image Blend Advanced v3
andDrop Shadow v3
nodes, support transparent background. -
Commit BenUltra and LoadBenModel nodes. These two nodes are the implementation of PramaLLC/BEN project in ComfyUI.
Download theBEN_Base.pth
andconfig.json
from huggingface or BaiduNetdisk and copy toComfyUI/models/BEN
folder. -
Merge the PR submitted by jimlee2048, add the LoadBiRefNetModelV2 node, and support loading RMBG 2.0 models.
Download the model files from huggingface or BaiduNetdisk and copy toComfyUI/models/BiRefNet/RMBG-2.0
folder. -
Florence2 nodes support base-PromptGen-v2.0 and large-PromptGen-v2.0, Download
base-PromptGen-v2.0
andlarge-PromptGen-v2.0
two folder from huggingface or BaiduNetdisk and copy toComfyUI/models/florence2
folder. -
SAM2Ultra and ObjectDetector nodes support image batch.
-
SAM2Ultra and SAM2VideoUltra nodes add support for SAM2.1 model, including kijai's FP16 model. Download model files from BaiduNetdisk or huggingface.co/Kijai/sam2-safetensors and copy to
ComfyUI/models/sam2
folder. -
Commit JoyCaption2Split and LoadJoyCaption2Model nodes, Sharing the model across multiple JoyCaption2 nodes improves efficiency.
-
SegmentAnythingUltra and SegmentAnythingUltraV2 add the
cache_model
option, Easy to flexibly manage VRAM usage. -
Due to the high version requirements of the LlamaVision node for
transformers
, which affects the loading of some older third-party plugins, so the LayerStyle plugin has lowered the default requirement to 4.43.2. If you need to run LlamaVision, please upgrade to 4.45.0 or above on your own. -
Commit JoyCaption2 and JoyCaption2ExtraOptions nodes. New dependency packages need to be installed. Use the JoyCaption-alpha-two model for local inference. Can be used to generate prompt words. this node is https://huggingface.co/John6666/joy-caption-alpha-two-cli-mod Implementation in ComfyUI, thank you to the original author. Download models form BaiduNetdisk and BaiduNetdisk , or huggingface/Orenguteng and huggingface/unsloth , then copy to
ComfyUI/models/LLM
, Download models from BaiduNetdisk or huggingface/google , and copy toComfyUI/models/clip
, Donwload thecgrkzexw-599808
folder from BaiduNetdisk or huggingface/John6666 , and copy toComfyUI/models/Joy_caption
。 -
Commit LlamaVision node, Use the Llama 3.2 vision model for local inference. Can be used to generate prompt words. part of the code for this node comes from ComfyUI-PixtralLlamaMolmoVision, thank you to the original author. To use this node, the
transformers
need upgraded to 4.45.0 or higher. Download models from BaiduNetdisk or huggingface/SeanScripts , and copy toComfyUI/models/LLM
. -
Commit RandomGeneratorV2 node, add least random range and seed options.
-
Commit TextJoinV2 node, add delimiter options on top of TextJion.
-
Commit GaussianBlurV2 node, The parameter accuracy has been improved to 0.01.
-
Commit UserPromptGeneratorTxtImgWithReference node.
-
Commit GrayValue node, output the grayscale values corresponding to the RGB color values.
-
LUT Apply, TextImageV2, TextImage, SimpleTextImage nodes to support defining multiple folders in
resource-dir.ini
, separated by commas, semicolons, or spaces. Simultaneously supports refreshing real-time updates. -
LUT Apply, TextImageV2, TextImage, SimpleTextImage nodes support defining multi directory fonts and lut folders, and support refreshing and real-time updates.
-
Commit HumanPartsUltra node, used to generate human body parts masks. It is based on the warrper of metal3d/ComfyUI_Human_Parts, thank the original author. Download model file from BaiduNetdisk or huggingface and copy to
ComfyUI\models\onnx\human-parts
folder. -
ObjectDetector nodes add sort by confidence option.
-
Commit DrawBBoxMask node, used to convert the BBoxes output by the Object Detector node into a mask.
-
Commit UserPromptGeneratorTxtImg and UserPromptGeneratorReplaceWord nodes, Used to generate text and image prompts and replace prompt content.
-
Commit PhiPrompt node, Use Microsoft Phi 3.5 text and visual models for local inference. Can be used to generate prompt words, process prompt words, or infer prompt words from images. Running this model requires at least 16GB of video memory.
Download model files from BaiduNetdisk or huggingface.co/microsoft/Phi-3.5-vision-instruct and huggingface.co/microsoft/Phi-3.5-mini-instruct and copy toComfyUI\models\LLM
folder. -
Commit GetMainColors node, it can obtained 5 main colors of image. Commit ColorName node, it can obtain the color name of input color value.
-
Duplicate the Brightness & Contrast node as BrightnessContrastV2, the Color of Shadow & Highlight node as ColorofShadowHighlight, and Shadow & Highlight Mask to Shadow Highlight Mask V2, to avoid errors in ComfyUI workflow parsing caused by the "&" character in the node name.
-
Commit VQAPrompt and LoadVQAModel nodes.
Download the model from BaiduNetdisk or huggingface.co/Salesforce/blip-vqa-capfilt-large and huggingface.co/Salesforce/blip-vqa-base and copy toComfyUI\models\VQA
folder. -
Florence2Ultra, Florence2Image2Prompt 和 LoadFlorence2Model nodes support the MiaoshouAI/Florence-2-large-PromptGen-v1.5 and MiaoshouAI/Florence-2-base-PromptGen-v1.5 model.
Download model files from BaiduNetdisk or huggingface.co/MiaoshouAI/Florence-2-large-PromptGen-v1.5 and huggingface.co/MiaoshouAI/Florence-2-base-PromptGen-v1.5 , copy toComfyUI\models\florence2
folder. -
Commit BiRefNetUltraV2 and LoadBiRefNetModel nodes, that support the use of the latest BiRefNet model. Download model file from BaiduNetdisk or GoogleDrive named
BiRefNet-general-epoch_244.pth
toComfyUI/Models/BiRefNet/pth
folder. You can also download more BiRefNet models and put them here. -
ExtendCanvasV2 node support negative value input, it means image will be cropped.
-
The default title color of nodes is changed to blue-green, and nodes in LayerStyle, LayerColor, LayerMask, LayerUtility, and LayerFilter are distinguished by different colors.
-
The Object Detector nodes added sort bbox option, which allows sorting from left to right, top to bottom, and large to small, making object selection more intuitive and convenient. The nodes released yesterday has been abandoned, please manually replace it with the new version node (sorry).
-
Commit SAM2Ultra, SAM2VideoUltra, ObjectDetectorFL2, ObjectDetectorYOLOWorld, ObjectDetectorYOLO8, ObjectDetectorMask and BBoxJoin nodes. Download models from BaiduNetdisk or huggingface.co/Kijai/sam2-safetensors and copy to
ComfyUI/models/sam2
folder, Download models from BaiduNetdisk or GoogleDrive and copy toComfyUI/models/yolo-world
folder. This update introduces new dependencies, please reinstall the dependency package. -
Commit RandomGenerator node, Used to generate random numbers within a specified range, with outputs of int, float, and boolean, supporting batch generation of different random numbers by image batch.
-
Commit EVF-SAMUltra node, it is implementation of EVF-SAM in ComfyUI. Please download model files from BaiduNetdisk or huggingface/EVF-SAM2, huggingface/EVF-SAM to
ComfyUI/models/EVF-SAM
folder(save the models in their respective subdirectories). Due to the introduction of new dependencies package, after the plugin upgrade, please reinstall the dependency packages. -
Commit ImageTaggerSave and ImageAutoCropV3 nodes. Used to implement the automatic trimming and marking workflow for the training set (the workflow
image_tagger_save.json
is located in the workflow directory). -
Commit CheckMaskV2 node, Added the
simple
method to detect masks more quickly. -
Commit ImageReel and ImageReelComposite nodes to composite multiple images on a canvas.
-
NumberCalculatorV2 and NumberCalculator add the
min
andmax
method. -
Optimize node loading speed.
-
Florence2Image2Prompt add support for
thwri/CogFlorence-2-Large-Freeze
andthwri/CogFlorence-2.1-Large
models. Please download the model files from BaiduNetdisk or huggingface/CogFlorence-2-Large-Freeze and huggingface/CogFlorence-2.1-Large , then copy it toComfyUI/models/florence2
folder. -
Merge branch from ClownsharkBatwing "Use GPU for color blend mode", the speed of some layer blends by more than ten times.
-
Commit Florence2Ultra, Florence2Image2Prompt and LoadFlorence2Model nodes.
-
TransparentBackgroundUltra node add new model support. Please download the model file according to the instructions.
-
Commit SegformerUltraV2, SegfromerFashionPipeline and SegformerClothesPipeline nodes, used for segmentation of clothing. please download the model file according to the instructions.
-
Commit
install_requirements.bat
andinstall_requirements_aki.bat
, One click solution to install dependency packages. -
Commit TransparentBackgroundUltra node, it remove background based on transparent-background model.
-
Change the VitMatte model of the Ultra node to a local call. Please download all files of vitmatte model to the
ComfyUI/models/vitmatte
folder. -
GetColorToneV2 node add the
mask
method to the color selection option, which can accurately obtain the main color and average color within the mask. -
ImageScaleByAspectRatioV2 node add the "background_color" option.
-
LUT Apply Add the "strength" option.
-
Commit AutoAdjustV2 node, add optional mask input and support for multiple automatic color adjustment modes.
-
Due to the upcoming discontinuation of gemini-pro vision services, PromptTagger and PromptEmbellish have added the "gemini-1.5-flash" API to continue using it.
-
Ultra nodes added the option to run
VitMatte
on the CUDA device, resulting in a 5-fold increase in running speed. -
Commit QueueStop node, used to terminate the queue operation.
-
Optimize performance of the
VitMate
method for Ultra nodes when processing large-size image. -
CropByMaskV2 add option to round the cutting size by multiples.
-
Commit CheckMask node, it detect whether the mask contains sufficient effective areas. Commit HSVValue node, it convert color values to HSV values.
-
BooleanOperatorV2, NumberCalculatorV2, Integer, Float, Boolean nodes add string output to output the value as a string for use with SwitchCase.
-
Commit SwitchCase node, Switches the output based on the matching string. Can be used for any type of data switching.
-
Commit String node, Used to output a string. It is the TextBox simplified node.
-
Commit If node,Switches output based on Boolean conditional input. Can be used for any type of data switching.
-
Commit StringCondition node, Determines whether the text contains or does not contain a substring.
-
Commit NumberCalculatorV2 node,Add the nth root operation. Commit BooleanOperatorV2 node, Increasing greater/less than, greater/less then or equal logical judgment. The two nodes can access numeric inputs and can input numeric values within the node. Note: Numeric input takes precedence. Values in nodes will not be valid when there is input.
-
Commit SD3NegativeConditioning node, Encapsulate the four nodes of Negative Condition in SD3 into a separate node.
-
ImageRemoveAlpha node add optional mask input.
-
Commit HLFrequencyDetailRestore node, Using low-frequency filtering and high-frequency preserving to restore image details, the fusion is better.
-
Commit AddGrain and MaskGrain nodes, Add noise to a picture or mask.
-
Commit FilmV2 node, The fastgrain method is added on the basis of the previous one, and the noise generation speed is 10 times faster.
-
Commit ImageToMask node, it can be converted image into mask. Supports converting any channel in LAB, RGBA, YUV, and HSV modes into masks, while providing color scale adjustment. Support mask optional input to obtain masks that only include valid parts.
-
The blackpoint and whitepoint options in some nodes have been changed to slider adjustment for a more intuitive display. Include MaskEdgeUltraDetailV2, SegmentAnythingUltraV2, RmBgUltraV2,PersonMaskUltraV2,BiRefNetUltra, SegformerB2ClothesUltra, BlendIfMask and Levels.
-
ImageScaleRestoreV2 and ImageScaleByAspectRatioV2 nodes add the
total_pixel
method to scale images. -
Commit MediapipeFacialSegment node,Used to segment facial features, including left and right eyebrows, eyes, lips, and teeth.
-
Commit BatchSelector node,Used to retrieve specified images or masks from batch images or masks.
-
LayerUtility creates new subdirectories such as SystemIO, Data, and Prompt. Some nodes are classified into subdirectories.
-
Commit MaskByColor node, Generate a mask based on the selected color.
-
Commit LoadPSD node, It read the psd format, and output layer images. Note that this node requires the installation of the
psd_tools
dependency package, If error occurs during the installation of psd_tool, such asModuleNotFoundError: No module named 'docopt'
, please download docopt's whl and manual install it. -
Commit SegformerB2ClothesUltra node, it used to segment character clothing. The model segmentation code is fromStartHua, thanks to the original author.
-
SaveImagePlus node adds the output workflow to the json function, supports
%date
and%time
to embeddint date or time to path and filename, and adds the preview switch. -
Commit SaveImagePlus node,It can customize the directory where the picture is saved, add a timestamp to the file name, select the save format, set the image compression rate, set whether to save the workflow, and optionally add invisible watermarks to the picture.
-
Commit AddBlindWaterMark, ShowBlindWaterMark nodes, Add invisible watermark and decoded watermark to the picture. Commit CreateQRCode, DecodeQRCode nodes, It can generate two-dimensional code pictures and decode two-dimensional codes.
-
ImageScaleRestoreV2, ImageScaleByAspectRatioV2, ImageAutoCropV2 nodes add options for
width
andheight
, which can specify width or height as fixed values. -
Commit PurgeVRAM node, Clean up VRAM an RAM.
-
Commit AutoAdjust node, it can automatically adjust image contrast and white balance.
-
Commit RGBValue node to output the color value as a single decimal value of R, G, B. This idea is from vxinhao, Thanks.
-
Commit seed node to output the seed value. The ImageMaskScaleAs, ImageScaleBySpectRatio, ImageScaleBySpectRatioV2, ImageScaleRestore, ImageScaleRestoreV2 nodes increase
width
,height
output. -
Commit Levels node, it can achieve the same color levels adjustment function as Photoshop.Sharp&Soft add the "None" option.
-
Commit BlendIfMask node, This node cooperates with ImgaeBlendV2 or ImageBlendAdvanceV2 to achieve the same Blend If function as Photoshop.
-
Commit ColorTemperature and ColorBalance nodes, used to adjust the color temperature and color balance of the picture.
-
Add new types of Blend Mode V2 between images. now supports up to 30 blend modes. The new blend mode is available for all V2 versions that support mixed mode nodes, including ImageBlend V2, ImageBlendAdvance V2, DropShadow V2, InnerShadow V2, OuterGlow V2, InnerGlow V2, Stroke V2, ColorOverlay V2, GradientOverlay V2.
Part of the code for BlendMode V2 is from Virtuoso Nodes for ComfyUI. Thanks to the original authors. -
Commit YoloV8Detect node.
-
Commit QWenImage2Prompt node, this node is repackage of the ComfyUI_VLM_nodes's
UForm-Gen2 Qwen Node
, thanks to the original author. -
Commit BooleanOperator, NumberCalculator, TextBox, Integer, Float, Booleannodes. These nodes can perform mathematical and logical operations.
-
Commit ExtendCanvasV2 node,support color value input.
-
Commit AutoBrightness node,it can automatically adjust the brightness of image.
-
CreateGradientMask node add
center
option. -
Commit GetColorToneV2 node, can select the main and average colors for the background or body.
-
Commit ImageRewardFilter node, can filter out poor quality pictures.
-
Ultra nodes add
VITMatte(local)
method, You can choose this method to avoid accessing huggingface.co if you have already downloaded the model before. -
Commit HDR Effect node,it enhances the dynamic range and visual appeal of input images. this node is repackage of HDR Effects (SuperBeasts.AI).
-
Commit CropBoxResolve node.
-
Commit BiRefNetUltra node, it using the BiRefNet model to remove background has better recognition ability and ultra-high edge details.
-
Commit ImageAutoCropV2 node, it can choose not to remove the background, support mask input, and scale by long or short side size.
-
Commit ImageHub node, supports up to 9 sets of Image and Mask switching output, and supports random output.
-
Commit TextJoin node.
-
Commit PromptEmbellish node. it output polished prompt words, and support inputting images as references.
-
Ultra nodes have been fully upgraded to V2 version, with the addition of VITMatte edge processing method, which is suitable for handling semi transparent areas. Include MaskEdgeUltraDetailV2, SegmentAnythingUltraV2, RmBgUltraV2 and PersonMaskUltraV2 nodes.
-
Commit Color of Shadow & Highlight node, it can adjust the color of the dark and bright parts separately. Commit Shadow & Highlight Mask node, it can output mask for dark and bright areas.
-
Commit CropByMaskV2 node, On the basis of the original node, it supports
crop_box
input, making it convenient to cut layers of the same size. -
Commit SimpleTextImage node, it generate simple typesetting images and masks from text. This node references some of the functionalities and code of ZHO-ZHO-ZHO/ComfyUI-Text_Image-Composite.
-
Commit PromptTagger node,Inference the prompts based on the image. and it can replace key word for the prompt(need apply for Google Studio API key). Upgrade ColorImageV2 and GradientImageV2,support user customize preset sizes and size_as input.
-
Commit LaMa node, it can erase objects from the image based on the mask. this node is repackage of IOPaint.
-
Commit ImageRemoveAlpha and ImageCombineAlpha nodes, alpha channel of the image can be removed or merged.
-
Commit ImageScaleRestoreV2 and ImageScaleByAspectRatioV2 nodes, supports scaling images to specified long or short edge sizes.
-
Commit PersonMaskUltra node, Generate masks for portrait's face, hair, body skin, clothing, or accessories. the model code for this node comes from a-person-mask-generator.
-
Commit LightLeak node, this filter simulate the light leakage effect of the film.
-
Commit Film node, this filter simulate the grain, dark edge, and blurred edge of the film, support input depth map to simulate defocus. it is reorganize and encapsulate of digitaljohn/comfyui-propost.
-
Commit ImageAutoCrop node, which is designed to generate image materials for training models.
-
Commit ImageScaleByAspectRatio node, it can be scaled image or mask according to frame ratio.
-
Fix the bug of color gradation in LUT Apply node rendering, and this node now support for log color space. *Please load the dedicated log lut file for the log color space image.
-
Commit CreateGradientMask node. Commit LayerImageTransform and LayerMaskTransform nodes.
-
Commit MaskEdgeUltraDetail node, it process rough masks to ultra fine edges.Commit Exposure node.
-
Commit Sharp & Soft node, it can enhance or smooth out image details. Commit MaskByDifferent node, it compare two images and output a Mask. Commit SegmentAnythingUltra node, Improve the quality of mask edges. *If SegmentAnything is not installed, you will need to manually download the model.
-
All nodes have fully supported batch images, providing convenience for video creation. (The CropByMask node only supports cuts of the same size. if a batch mask_for_crop inputted, the data from the first sheet will be used.)
-
Commit RemBgUltra and PixelSpread nodes significantly improved mask quality. *RemBgUltra requires manual model download.
-
Commit TextImage node, it generate text images and masks.
-
Add new types of blend mode between images. now supports up to 19 blend modes. add color_burn, color_dodge, linear_burn, linear_dodge, overlay, soft_light, hard_light, vivid_light, pin_light, linear_light and hard_mix. The newly added blend mode is applicable to all nodes that support blend mode.
-
Commit ColorMap filter node to create a pseudo color heatmap effect.
-
Commit WaterColor and SkinBeauty nodes。These are image filters that generate watercolor and skin smoothness effects.
-
Commit ImageShift node to shift the image and output a displacement seam mask, making it convenient to create continuous textures.
-
Commit ImageMaskScaleAs node to adjust the image or mask size based on the reference image.
-
Commit ImageScaleRestore node to work with CropByMask for local upscale and repair works.
-
Commit CropByMask and RestoreCropBox nodes. The combination of these two can partially crop and redraw the image before restoring it.
-
Commit ColorAdapter node, that can automatically adjust the color tone of the image.
-
Commit MaskStroke node, it can generate mask contour strokes.
-
Add LayerColor node group, used to adjust image color. it include LUT Apply, Gamma, Brightness & Contrast, RGB, YUV, LAB adn HSV.
-
Commit ImageChannelSplit and ImageChannelMerge nodes.
-
Commit MaskMotionBlur node.
-
Commit SoftLight node.
-
Commit ChannelShake node, that is filter, can produce channel dislocation effect similar like Tiktok logo.
-
Commit MaskGradient node, can create a gradient in the mask.
-
Commit GetColorTone node, can obtain the main color or average color of the image. Commit MaskGrow and MaskEdgeShrink nodes.
-
Commit MaskBoxDetect node, which can automatically detect the position through the mask and output it to the composite node. Commit XY to Percent node to convert absolute coordinates to percent coordinates. Commit GaussianBlur node. Commit GetImageSize node.
-
Commit ExtendCanvas node.
-
Commit ImageBlendAdvance node. This node allows for the synthesis of background images and layers of different sizes, providing a more free synthesis experience. Commit PrintInfo node as a workflow debugging aid.
-
Commit ColorImage and GradientImage nodes, Used to generate solid and gradient color images.
-
Commit GradientOverlay and ColorOverlay nodes. Add invalid mask input judgment and ignore it when invalid mask is input.
-
Commit InnerGlow, InnerShadow and MotionBlur nodes.
-
Renaming all completed nodes, the nodes are divided into 4 groups:LayerStyle, LayerMask, LayerUtility, LayerFilter. workflows containing old version nodes need to be manually replaced with new version nodes.
-
OuterGlow node has undergone significant modifications by adding options for brightness, light_color, and glow_color.
-
Commit MaskInvert node.
-
Commit ColorPick node.
-
Commit Stroke node.
-
Commit MaskPreview node.
-
Commit ImageOpacity node.
-
The layer_mask is not a mandatory input now. it is allowed to use layers and masks with different shapes, but the size must be consistent.
-
Commit ImageBlend node.
-
Commit OuterGlow node.
-
Commit DropShadow node.
Nodes are divided into 5 groups according to their functions: LayerStyle, LayerColor, LayerMask, LayerUtility and LayerFilter.
- LayerStyle nodes provides layer styles that mimic Adobe Photoshop.
- LayerColor node group provides color adjustment functionality.
- LayerMask nodes provides mask assistance tools.
- LayerUtility nodes provides auxiliary nodes related to layer composit tools and workflows.
- LayerFilter nodes provides image effect filters.
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image, shadows are generated according to their shape.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of shadows.
- opacity: Opacity of shadow.
- distance_x: Horizontal offset of shadow.
- distance_y: Vertical offset of shadow.
- grow: Shadow expansion amplitude.
- blur: Shadow blur level.
- shadow_color4: Shadow color.
- note
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image, grow are generated according to their shape.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of glow.
- opacity: Opacity of glow.
- brightness: Luminance of light.
- glow_range: range of glow.
- blur:blur of glow.
- light_color4: Center part color of glow.
- glow_color4: Edge part color of glow.
- note
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image, shadows are generated according to their shape.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of shadows.
- opacity: Opacity of shadow.
- distance_x: Horizontal offset of shadow.
- distance_y: Vertical offset of shadow.
- grow: Shadow expansion amplitude.
- blur: Shadow blur level.
- shadow_color4: Shadow color.
- note
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image, grow are generated according to their shape.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of glow.
- opacity: Opacity of glow.
- brightness: Luminance of light.
- glow_range: range of glow.
- blur:blur of glow.
- light_color4: Center part color of glow.
- glow_color4: Edge part color of glow.
- note
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image, stroke are generated according to their shape.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of stroke.
- opacity: Opacity of stroke.
- stroke_grow: Stroke expansion/contraction amplitude, positive values indicate expansion and negative values indicate contraction.
- stroke_width: Stroke width.
- blur: Blur of stroke.
- stroke_color4: Stroke color, described in hexadecimal RGB format.
- note
Node options:
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of gradient.
- opacity: Opacity of stroke.
- start_color: Color at the beginning of the gradient.
- start_alpha: Transparency at the beginning of the gradient.
- end_color: Color at the end of the gradient.
- end_alpha: Transparency at the end of the gradient.
- angle: Gradient rotation angle.
- note
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode of color.
- opacity: Opacity of stroke.
- color: Color of overlay.
- note
LUT Apply
Apply LUT to the image. only supports .cube format.
- LUT*: Here is a list of available. cube files in the LUT folder, and the selected LUT files will be applied to the image.
- color_space: For regular image, please select linear, for image in the log color space, please select log.
- strength: Range 0~100, LUT application strength. The larger the value, the greater the difference from the original image, and the smaller the value, the closer it is to the original image.
*LUT folder is defined in resource_dir.ini
, this file is located in the root directory of the plug-in, and the default name is resource_dir.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
.
Open the text editing software and find the line starting with "LUT_dir=", after "=", enter the custom folder path name.
support defining multiple folders in resource-dir.ini
, separated by commas, semicolons, or spaces.
all .cube files in this folder will be collected and displayed in the node list during ComfyUI initialization.
If the folder set in ini is invalid, the LUT folder that comes with the plugin will be enabled.
Automatically adjust the brightness, contrast, and white balance of the image. Provide some manual adjustment options to compensate for the shortcomings of automatic adjustment.
- strength: Strength of adjust. The larger the value, the greater the difference from the original image.
- brightness: Manual adjustment of brightness.
- contrast: Manual adjustment of contrast.
- saturation: Manual adjustment of saturation.
- red: Manual adjustment of the red channel.
- green: Manual adjustment of the green channel.
- blue: Manual adjustment of the blue channel.
On the basis of AutoAdjust, add mask input and only calculate the content inside the mask for automatic color adjustment. Add multiple automatic adjustment modes.
The following changes have been made based on AutoAdjust:
- mask: Optional mask input.
- mode: Automatic adjustment mode. "RGB" automatically adjusts according to the three channels of RGB, "lum + sat"automatically adjusts according to luminance and saturation, "luminance" automatically adjusts according to luminance, "saturation" automatically adjusts according to saturation, and "mono" automatically adjusts according to grayscale and outputs monochrome.
Automatically adjust too dark or too bright image to moderate brightness, and support mask input. When mask input, only the content of the mask part is used as the data source of the automatic brightness. The output is still the whole adjusted image.
- strength: Automatically adjust the intensity of the brightness. The larger the value, the more biased towards the middle value, the greater the difference from the original picture.
- saturation: Color saturation. Changes in brightness usually result in changes in color saturation, where appropriate compensation can be adjusted.
Auto adjust the color tone of the image to resemble the reference image.
- opacity: The opacity of an image after adjusting its color tone.
Change the exposure of the image.
Color of Shadow & Highlight
Adjust the color of the dark and bright parts of the image.
- image: The input image.
- mask: Optional input. if there is input, only the colors within the mask range will be adjusted.
- shadow_brightness: The brightness of the dark area.
- shadow_saturation: The color saturation in the dark area.
- shadow_hue: The color hue in the dark area.
- shadow_level_offset: The offset of values in the dark area, where larger values bring more areas closer to the bright into the dark area.
- shadow_range: The transitional range of the dark area.
- highlight_brightness: The brightness of the highlight area.
- highlight_saturation: The color saturation in the highlight area.
- highlight_hue: The color hue in the highlight area.
- highlight_level_offset: The offset of values in the highlight area, where larger values bring more areas closer to the dark into the highlight area.
- highlight_range: The transitional range of the highlight area.
Node option:
- exposure: Exposure value. Higher values indicate brighter image.
Color of Shadow HighlightV2
A replica of the Color of Shadow & Highlight
node, with the "&" character removed from the node name to avoid ComfyUI workflow parsing errors.
Change the color temperature of the image.
- temperature: Color temperature value. Range between-100 and 100. The higher the value, the higher the color temperature (bluer); The lower the color temperature, the lower the color temperature (yellowish).
- channel: Select the channel you want to adjust. Available in RGB, red, green, blue.
- black_point*: Input black point value. Value range 0-255, default 0.
- white_point*: Input white point value. Value range 0-255, default 255.
- gray_point: Input grey point values. Value range 0.01-9.99, default 1.
- output_black_point*: Output black point value. Value range 0-255, default 0.
- output_white_point*: Output white point value. Value range 0-255, default 255.
*If the black_point or output_black_point value is greater than white_point or output_white_point, the two values are swapped, with the larger value used as white_point and the smaller value used as black_point.
Change the color balance of an image.
- cyan_red: Cyan-Red balance. negative values are leaning cyan, positive values are leaning red.
- magenta_green: Megenta-Green balance. negative values are leaning megenta, positive values are leaning green.
- yellow_blue: Yellow-Blue balance. negative values are leaning yellow, positive values are leaning blue.
Change the gamma value of the image.
- gamma: Value of the Gamma.
Brightness & Contrast
Change the brightness, contrast, and saturation of the image.
- brightness: Value of brightness.
- contrast: Value of contrast.
- saturation: Value of saturation.
A replica of the Brightness & Contrast
node, with the "&" character removed from the node name to avoid ComfyUI workflow parsing errors.
Adjust the RGB channels of the image.
- R: R channel.
- G: G channel.
- B: B channel.
Adjust the YUV channels of the image.
- Y: Y channel.
- U: U channel.
- V: V channel.
Adjust the LAB channels of the image.
- L: L channel.
- A: A channel.
- B: B channel.
Adjust the HSV channels of the image.
- H: H channel.
- S: S channel.
- V: V channel.
Used for compositing layers, allowing for compositing layer images of different sizes on the background image, and setting positions and transformations. multiple mixing modes are available for selection, and transparency can be set.
The node provide layer transformation_methods and anti_aliasing options. helps improve the quality of synthesized images.
The node provides mask output that can be used for subsequent workflows.
- background_image: The background image.
- layer_image5: Layer image for composite.
- layer_mask2,5: Mask for layer_image.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode.
- opacity: Opacity of blend.
- x_percent: Horizontal position of the layer on the background image, expressed as a percentage, with 0 on the far left and 100 on the far right. It can be less than 0 or more than 100, indicating that some of the layer's content is outside the screen.
- y_percent: Vertical position of the layer on the background image, expressed as a percentage, with 0 on the top and 100 on the bottom. For example, setting it to 50 indicates vertical center, 20 indicates upper center, and 80 indicates lower center.
- mirror: Mirror flipping. Provide two flipping modes, horizontal flipping and vertical flipping.
- scale: Layer magnification, 1.0 represents the original size.
- aspect_ratio: Layer aspect ratio. 1.0 is the original ratio, a value greater than this indicates elongation, and a value less than this indicates flattening.
- rotate: Layer rotation degree.
- Sampling methods for layer enlargement and rotation, including lanczos, bicubic, hamming, bilinear, box and nearest. Different sampling methods can affect the image quality and processing time of the synthesized image.
- anti_aliasing: Anti aliasing, ranging from 0 to 16, the larger the value, the less obvious the aliasing. An excessively high value will significantly reduce the processing speed of the node.
- note
Crop the image according to the mask range, and set the size of the surrounding borders to be retained. This node can be used in conjunction with the RestoreCropBox and ImageScaleRestore nodes to crop and modify upscale parts of image, and then paste them back in place.
- image5: The input image.
- mask_for_crop5: Mask of the image, it will automatically be cut according to the mask range.
- invert_mask: Whether to reverse the mask.
- detect: Detection method,
min_bounding_rect
is the minimum bounding rectangle of block shape,max_inscribed_rect
is the maximum inscribed rectangle of block shape, andmask-area
is the effective area for masking pixels. - top_reserve: Cut the top to preserve size.
- bottom_reserve: Cut the bottom to preserve size.
- left_reserve: Cut the left to preserve size.
- right_reserve: Cut the right to preserve size.
- note
Output:
- croped_image: The image after crop.
- croped_mask: The mask after crop.
- crop_box: The trimmed box data is used when restoring the RestoreCropBox node.
- box_preview: Preview image of cutting position, red represents the detected range, and green represents the cutting range after adding the reserved border.
The V2 upgraded version of CropByMask. Supports crop_box input, making it easy to cut layers of the same size.
The following changes have been made based on CropByMask:
- The input
mask_for_crop
reanme tomask
。 - Add optional inputs to the
crop_box
. If there are inputs here, mask detection will be ignored and this data will be directly used for cropping. - Add the option
round_to_multiple
to round the trimming edge length multiple. For example, setting it to 8 will force the width and height to be multiples of 8.
Restore the cropped image to the original image by CropByMask.
- background_image: The original image before cutting.
- croped_image5: The cropped image. If the middle is enlarged, the size needs to be restored before restoration.
- croped_mask5: The cut mask.
- crop_box: Box data during cutting.
- invert_mask: Whether to reverse the mask.
- note
Parsing the corp_box
to x
, y
, width
, height
.
Image scaling. when this node is used in pairs, the image can be automatically restored to its original size on the second node.
- image5: The input image.
- mask2,5: Mask of image.
- original_size: Optional input, used to restore the image to its original size.
- scale: Scale ratio. when the original_size have input, or scale_ by_longest_side is set to True, this setting will be ignored.
- scale_by_longest_side: Allow scaling by long edge size.
- longest_side: When the scale_by_longest_side is set to True, this will be used this value to the long edge of the image. when the original_size have input, this setting will be ignored.
Outputs:
- image: The scaled image.
- mask: If have mask input, the scaled mask will be output.
- original_size: The original size data of the image is used for subsequent node recovery.
- width: The output image's width.
- height: The output image's height.
The V2 upgraded version of ImageScaleRestore.
Node options:
The following changes have been made based on ImageScaleRestore:
- scale_by: Allow scaling by specified dimensions for long, short, width, height, or total pixels. When this option is set to by_scale, use the scale value, and for other options, use the scale_by_length value.
- scale_by_length: The value here is used as
scale_by
to specify the length of the edge.
Scale the image or mask to the size of the reference image (or reference mask).
- scale_as*: Reference size. It can be an image or a mask.
- image: Image to be scaled. this option is optional input. if there is no input, a black image will be output.
- mask: Mask to be scaled. this option is optional input. if there is no input, a black mask will be output.
- fit: Scale aspect ratio mode. when the width to height ratio of the original image does not match the scaled size, there are three modes to choose from, The letterbox mode retains the complete frame and fills in the blank spaces with black; The crop mode retains the complete short edge, and any excess of the long edge will be cut off; The fill mode does not maintain frame ratio and fills the screen with width and height.
- method: Scaling sampling methods, including lanczos, bicubic, hamming, bilinear, box, and nearest.
*Only limited to input images and masks. forcing the integration of other types of inputs will result in node errors.
Outputs:
- image: If there is an image input, the scaled image will be output.
- mask: If there is a mask input, the scaled mask will be output.
- original_size: The original size data of the image is used for subsequent node recovery.
- width: The output image's width.
- height: The output image's height.
The upgraded version of ImageMaskScaleAs adds background color settings on top of the original nodes.
- background_color: Expand background color.
Scale the image or mask by aspect ratio. the scaled size can be rounded to a multiple of 8 or 16, and can be scaled to the long side size.
- aspect_ratio: Here are several common frame ratios provided. alternatively, you can choose "original" to keep original ratio or customize the ratio using "custom".
- proportional_width: Proportional width. if the aspect ratio option is not "custom", this setting will be ignored.
- proportional_height: Proportional height. if the aspect ratio option is not "custom", this setting will be ignored.
- fit: Scale aspect ratio mode. when the width to height ratio of the original image does not match the scaled size, there are three modes to choose from, The letterbox mode retains the complete frame and fills in the blank spaces with black; The crop mode retains the complete short edge, and any excess of the long edge will be cut off; The fill mode does not maintain frame ratio and fills the screen with width and height.
- method: Scaling sampling methods, including lanczos, bicubic, hamming, bilinear, box, and nearest.
- round_to_multiple: Round multiples. for example, setting it to 8 will force the width and height to be multiples of 8.
- scale_by_longest_side: Allow scaling by long edge size.
- longest_side: When the scale_by_longest_side is set to True, this will be used this value to the long edge of the image. when the original_size have input, this setting will be ignored.
Outputs:
- image: If have image input, the scaled image will be output.
- mask: If have mask input, the scaled mask will be output.
- original_size: The original size data of the image is used for subsequent node recovery.
- width: The output image's width.
- height: The output image's height.
V2 Upgraded Version of ImageScaleByAspectRatio
Node options:
The following changes have been made based on ImageScaleByAspectRatio:
- scale_to_side: Allow scaling by specified dimensions for long, short, width, height, or total pixels.
- scale_to_length: The numerical value here serves as the length of the specified edge or the total pixels (kilo pixels) for scale_to_side.
- background_color4: The color of the background.
Used for generating In-Context image and mask. The code is from lrzjason/Comfyui-In-Context-Lora-Utils , Thanks to the original author @小志Jason.
- first_image: Images used as contextual references.
- first_mask: Optional input, context reference image mask.
- second_image: Used for redrawing images.
- second_mask: Mask used for redrawing images.
- patch_mode: There are three types of splicing modes: auto、patch_right and patch_bottom.
- output_length: Output the long side size of the image.
- patch_color: Fill color.
Outputs:
- image: The output image.
- mask: The output mask.
- icmask_data: The stitching information of the image is used for automatic cropping of subsequent nodes.
Crop the image inference output generated by ICMask.
- image: The input image.
- icmask_data: Splicing information output from ICMask node.
Use the blip-vqa model for visual question answering. Part of the code for this node is referenced from celoron/ComfyUI-VisualQueryTemplate, thanks to the original author.
*Download model files from BaiduNetdisk or huggingface.co/Salesforce/blip-vqa-capfilt-large and huggingface.co/Salesforce/blip-vqa-base and copy to ComfyUI\models\VQA
folder.
- image: The image input.
- vqa_model: The vqa model input, it from LoadVQAModel node.
- question: Task text input. A single question is enclosed in curly braces "{}", and the answer to the question will be replaced in its original position in the text output. Multiple questions can be defined using curly braces in a single Q&A. For example, for a picture of an item placed in a scene, the question is:"{object color} {object} on the {scene}".
Load the blip-vqa model.
- model: There are currently two models to choose from "blip-vqa-base" and "blip-vqa-capfilt-large".
- precision: The model accuracy has two options: "fp16" and "fp32".
- device: The model running device has two options: "cuda" and "cpu".
Shift the image. this node supports the output of displacement seam masks, making it convenient to create continuous textures.
- image5: The input image.
- mask2,5: The mask of image.
- shift_x: Horizontal distance of shift.
- shift_y: Vertical distance of shift.
- cyclic: Is the part of displacement that is out of bounds cyclic.
- background_color4: Background color. if cyclic is set to False, the setting here will be used as the background color.
- border_mask_width: Border mask width.
- border_mask_blur: Border mask blur.
- note
A simple node for composit layer image and background image, multiple blend modes are available for option, and transparency can be set.
- background_image1: The background image.
- layer_image1: Layer image for composite.
- layer_mask1,2: Mask for layer_image.
- invert_mask: Whether to reverse the mask.
- blend_mode3: Blending mode.
- opacity: Opacity of blend.
- note
Display multiple images in one reel. Text annotations can be added to each image in the reel. By using the ImageReelComposite node, multiple reel can be combined into one image.
- image1: The first image. it must be input.
- image2: The second image. optional input.
- image3: The third image. optional input.
- image4: The fourth image. optional input.
- image1_text: Text annotation for the first image.
- image2_text: Text annotation for the second image.
- image3_text: Text annotation for the third image.
- image4_text: Text annotation for the fourth image.
- reel_height: The height of reel.
- border: The border width of the image in the reel.
Output:
- reel: The reel of ImageReelComposite node input.
Combine multiple reel into one image.
- reel_1: The first reel. it must be input.
- reel_2: The second reel. optional input.
- reel_3: The third reel. optional input.
- reel_4: The fourth reel. optional input.
- font_file**: Here is a list of available font files in the font folder, and the selected font files will be used to generate images.
- border: The border width of the reel.
- color_theme: Theme color for the output image.
*The font folder is defined inresource_dir.ini
, this file is located in the root directory of the plug-in, and the default name isresource_dir.ini.example
. to use this file for the first time, you need to change the file suffix to.ini
. Open the text editing software and find the line starting with "FONT_dir=", after "=", enter the custom folder path name. support defining multiple folders inresource-dir.ini
, separated by commas, semicolons, or spaces. all font files in this folder will be collected and displayed in the node list during ComfyUI initialization. If the folder set in ini is invalid, the font folder that comes with the plugin will be enabled.
Node option:
- image5: Image input, supporting RGB and RGBA. if is RGB, the alpha channel of the entire image will be automatically added.
- mask2,5 : Mask input.
- invert_mask: Whether to reverse the mask.
- opacity: Opacity of image.
- note
Modify web extensions from mtb nodes. Select colors on the color palette and output RGB values, thanks to the original author.
Node options:
- mode: The output format is available in hexadecimal (HEX) and decimal (DEC).
Output type:
- value: String format.
Output the color value as a single R, G, B three decimal values. Supports HEX and DEC formats for ColorPicker node output.
Node Options:
- color_value: Supports hexadecimal (HEX) or decimal (DEC) color values and should be of string or tuple type. Forcing in other types will result in an error.
Output color values as individual decimal values of H, S, and V (maximum value of 255). Supports HEX and DEC formats for ColorPicker node output.
Node Options:
- color_value: Supports hexadecimal (HEX) or decimal (DEC) color values and should be of string or tuple type. Forcing in other types will result in an error.
Output grayscale values based on color values. Supports outputting 256 level and 100 level grayscale values.
Node Options:
- color_value: Supports hexadecimal (HEX) or decimal (DEC) color values and should be of string or tuple type. Forcing in other types will result in an error.
Outputs:
- gray(256_level): 256 level grayscale value. Integer type, range 0~255.
- gray(100_level): 100 level grayscale value. Integer type, range 0~100.
Obtain the main color of the image. You can obtain 5 colors.
- image: The image input.
- k_means_algorithm:K-Means algorithm options. "lloyd" is the standard K-Means algorithm, while "elkan" is the triangle inequality algorithm, suitable for larger images.
Outputs:
- preview_image: 5 main color preview images.
- color_1~color_5: Color value output. Output an RGB string in HEX format.
Add sorting by color area to the GetMainColors node and display color values and color areas in the preview image. This part of the code was improved by @ HL, thanks.
Output the most similar color name in the color palette based on the color value.
- color: Color value input, in HEX format RGB string format.
- palette: Color palette. There are 6 color mapping tables available, including xkcd, wiki_color, flux_sdxl, css4, css3, and html4.
Output:
- color_name: Color name in string.
Output color images and color values from color names.
- size_as*: Input image or mask here to generate image according to its size. Note that this input takes priority over other size settings.
- color_name: The name of color describe.
- palette: Color palette. There are 6 color mapping tables available, including xkcd, wiki_color, flux_sdxl, css4, css3, and html4.
- in_palette_only: Set to only output colors from the color palette. If set to True, search only in the current color palette. If there is no matching name, output default_color. If set to False, search for all color palettes. If there is no matching name in all color palettes, output the color with the closest name.
- default_color: Default color. If no matching name is found, output the color.
- size**: Size preset. the preset can be customized by the user. if have size_as input, this option will be ignored.
- custom_width: Image width. it valid when size is set to "custom". if have size_as input, this option will be ignored.
- custom_height: Image height. it valid when size is set to "custom". if have size_as input, this option will be ignored.
*Only limited to input images and masks. forcing the integration of other types of inputs will result in node errors.
**The preset size is defined in custom_size.ini
, this file is located in the root directory of the plug-in, and the default name is custom_size.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
. Open with text editing software. Each row represents a size, with the first value being width and the second being height, separated by a lowercase "x" in the middle. To avoid errors, please do not enter extra characters.
输出:
- image: The output color image.
- color: Color value output, in HEX format RGB string format.
- invert_mask: Whether to reverse the mask.
- top: Top extension value.
- bottom: Bottom extension value.
- left: Left extension value.
- right: Right extension value.
- color; Color of canvas.
V2 upgrade to ExtendCanvas.
Based on ExtendCanvas, color is modified to be a string type, and it supports external ColorPicker
input, Support negative value input, it means image will be cropped.
XY to Percent
Convert absolute coordinates to percentage coordinates.
- x: Value of X.
- y: Value of Y.
This node is used to transform layer_image separately, which can change size, rotation, aspect ratio, and mirror flip without changing the image size.
- x: Value of X.
- y: Value of Y.
- mirror: Mirror flipping. Provide two flipping modes, horizontal flipping and vertical flipping.
- scale: Layer magnification, 1.0 represents the original size.
- aspect_ratio: Layer aspect ratio. 1.0 is the original ratio, a value greater than this indicates elongation, and a value less than this indicates flattening.
- rotate: Layer rotation degree.
- Sampling methods for layer enlargement and rotation, including lanczos, bicubic, hamming, bilinear, box and nearest. Different sampling methods can affect the image quality and processing time of the synthesized image.
- anti_aliasing: Anti aliasing, ranging from 0 to 16, the larger the value, the less obvious the aliasing. An excessively high value will significantly reduce the processing speed of the node.
Similar to LayerImageTransform node, this node is used to transform the layer_mask separately, which can scale, rotate, change aspect ratio, and mirror flip without changing the mask size.
- x: Value of X.
- y: Value of Y.
- mirror: Mirror flipping. Provide two flipping modes, horizontal flipping and vertical flipping.
- scale: Layer magnification, 1.0 represents the original size.
- aspect_ratio: Layer aspect ratio. 1.0 is the original ratio, a value greater than this indicates elongation, and a value less than this indicates flattening.
- rotate: Layer rotation degree.
- Sampling methods for layer enlargement and rotation, including lanczos, bicubic, hamming, bilinear, box and nearest. Different sampling methods can affect the image quality and processing time of the synthesized image.
- anti_aliasing: Anti aliasing, ranging from 0 to 16, the larger the value, the less obvious the aliasing. An excessively high value will significantly reduce the processing speed of the node.
Generate an image of a specified color and size.
- width: Width of the image.
- height: Height of the image.
- color4: Color of the image.
The V2 upgraded version of ColorImage.
The following changes have been made based on ColorImage:
- size_as*: Input image or mask here to generate image according to its size. Note that this input takes priority over other size settings.
- size**: Size preset. the preset can be customized by the user. if have size_as input, this option will be ignored.
- custom_width: Image width. it valid when size is set to "custom". if have size_as input, this option will be ignored.
- custom_height: Image height. it valid when size is set to "custom". if have size_as input, this option will be ignored.
*Only limited to input images and masks. forcing the integration of other types of inputs will result in node errors.
**The preset size is defined in custom_size.ini
, this file is located in the root directory of the plug-in, and the default name is custom_size.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
. Open with text editing software. Each row represents a size, with the first value being width and the second being height, separated by a lowercase "x" in the middle. To avoid errors, please do not enter extra characters.
Generate an image with a specified size and color gradient.
- width: Width of the image.
- height: Height of the image.
- angle: Angle of gradient.
- start_color4: Color of the begging.
- end_color4: Color of the ending.
The V2 upgraded version of GradientImage.
The following changes have been made based on GradientImage:
- size_as*: Input image or mask here to generate image according to its size. Note that this input takes priority over other size settings.
- size**: Size preset. the preset can be customized by the user. if have size_as input, this option will be ignored.
- custom_width: Image width. it valid when size is set to "custom". if have size_as input, this option will be ignored.
- custom_height: Image height. it valid when size is set to "custom". if have size_as input, this option will be ignored.
*Only limited to input images and masks. forcing the integration of other types of inputs will result in node errors.
**The preset size is defined in custom_size.ini
, this file is located in the root directory of the plug-in, and the default name is custom_size.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
. Open with text editing software. Each row represents a size, with the first value being width and the second being height, separated by a lowercase "x" in the middle. To avoid errors, please do not enter extra characters.
Generate rounded rectangles and masks.
- image: The image to be processed.
- object_mask: Optional input. This mask can generate rounded rectangular regions. If have input for
crop-box
, this option will be ignored. - crop_box: Optional input. This can generate a rounded rectangular area by cropping the region.
- rounded_rect_radius: Rounded rectangle radius. The range is 0-100, and the larger the value, the more pronounced the rounded corners.
- anti_aliasing: Anti aliasing, ranging from 0-16, with larger values indicating less pronounced aliasing. Excessive values will significantly reduce the processing speed of nodes.
- top: Top margin of the rounded rectangle is a percentage of the image height, and negative values are allowed. If there is crox_box or object_mask input, this option will be ignored.
- bottom: Bottom margin of the rounded rectangle is a percentage of the image height, and negative values are allowed. If there is crox_box or object_mask input, this option will be ignored.
- left: Left margin of the rounded rectangle is a percentage of the image width, and negative values are allowed. If there is crox_box or object_mask input, this option will be ignored.
- right: Right margin of the rounded rectangle is a percentage of the image width, and negative values are allowed. If there is crox_box or object_mask input, this option will be ignored.
- detect: The method of detecting mask regions when object_mask is input.
min_bounding_rect
is the minimum bounding rectangle of block shape,max_inscribed_rect
is the maximum inscribed rectangle of block shape, andmask-area
is the effective area for masking pixels. - obj_ext_top: When object_mask or crop-box is input, the top of the rounded rectangle area expands outward as a percentage of the area height, and negative values are allowed.
- obj_ext_bottom: When object_mask or crop-box is input, the bottom of the rounded rectangle area expands outward as a percentage of the area height, and negative values are allowed.
- obj_ext_left: When object_mask or crop-box is input, the left of the rounded rectangle area expands outward as a percentage of the area width, and negative values are allowed.
- obj_ext_right: When object_mask or crop-box is input, the right of the rounded rectangle area expands outward as a percentage of the area width, and negative values are allowed.
Generate simple typesetting images and masks from text. This node references some of the functionalities and code of ZHO-ZHO-ZHO/ComfyUI-Text_Image-Composite, thanks to the original author.
- size_as*: The input image or mask here will generate the output image and mask according to their size. this input takes priority over the width and height below.
- font_file**: Here is a list of available font files in the font folder, and the selected font files will be used to generate images.
- align: Alignment options. There are three options: center, left, and right.
- char_per_line: The number of characters per line, any excess will be automatically wrapped.
- leading: The leading space.
- font_size: The size of font.
- text_color: The color of text.
- stroke_width: The width of stroke.
- stroke_color: The color of stroke.
- x_offset: The horizontal offset of the text position.
- y_offset: The vertical offset of the text position.
- width: Width of the image. If there is a size_as input, this setting will be ignored.
- height: Height of the image. If there is a size_as input, this setting will be ignored.
*Only limited to input image and mask. forcing the integration of other types of inputs will result in node errors.
**The font folder is defined in resource_dir.ini
, this file is located in the root directory of the plug-in, and the default name is resource_dir.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
.
Open the text editing software and find the line starting with "FONT_dir=", after "=", enter the custom folder path name.
support defining multiple folders in resource-dir.ini
, separated by commas, semicolons, or spaces.
all font files in this folder will be collected and displayed in the node list during ComfyUI initialization.
If the folder set in ini is invalid, the font folder that comes with the plugin will be enabled.
Generate images and masks from text. support for adjusting the spacing between words and lines, horizontal and vertical adjustments, it can set random changes in each character, including size and position.
- size_as*: The input image or mask here will generate the output image and mask according to their size. this input takes priority over the width and height below.
- font_file**: Here is a list of available font files in the font folder, and the selected font files will be used to generate images.
- spacing: Word spacing.this value is in pixels.
- leading: Row leading.this value is in pixels.
- horizontal_border: Side margin. If the text is horizontal, it is the left margin, and if it is vertical, it is the right margin. this value is represents a percentage, for example, 50 indicates that the starting point is located in the center on both sides.
- vertical_border: Top margin. this value is represents a percentage, for example, 10 indicates that the starting point is located 10% away from the top.
- scale: The overall size of the text. the initial size of text is automatically calculated based on the screen size and text content, with the longest row or column by default adapting to the image width or height. adjusting the value here will scale the text as a whole. this value is represents a percentage, for example, 60 represents scaling to 60%.
- variation_range: The range of random changes in characters. when this value is greater than 0, the character will undergo random changes in size and position, and the larger the value, the greater the magnitude of the change.
- variation_seed: The seed for randomly. fix this value to individual characters changes generated each time will not change.
- layout: Text layout. there are horizontal and vertical options to choose from.
- width: Width of the image. If there is a size_as input, this setting will be ignored.
- height: Height of the image. If there is a size_as input, this setting will be ignored.
- text_color: The color of text.
- background_color4: The color of background.
*Only limited to input image and mask. forcing the integration of other types of inputs will result in node errors.
**The font folder is defined in resource_dir.ini
, this file is located in the root directory of the plug-in, and the default name is resource_dir.ini.example
. to use this file for the first time, you need to change the file suffix to .ini
.
Open the text editing software and find the line starting with "FONT_dir=", after "=", enter the custom folder path name.
support defining multiple folders in resource-dir.ini
, separated by commas, semicolons, or spaces.
all font files in this folder will be collected and displayed in the node list during ComfyUI initialization.
If the folder set in ini is invalid, the font folder that comes with the plugin will be enabled.
This node is merged from heshengtao. The PR modifies the scaling of the image text node based on the TextImage node. The font spacing follows the scaling, and the coordinates are no longer based on the top left corner of the text, but on the center point of the entire line of text. Thank you for the author's contribution.
Split the image channel into individual images.
- mode: Channel mode, include RGBA, YCbCr, LAB adn HSV.
Merge each channel image into one image.
- mode: Channel mode, include RGBA, YCbCr, LAB adn HSV.
Remove the alpha channel from the image and convert it to RGB mode. you can choose to fill the background and set the background color.
- RGBA_image: The input image supports RGBA or RGB modes.
- mask: Optional input mask. If there is an input mask, it will be used first, ignoring the alpha that comes with RGBA_image.
- fill_background: Whether to fill the background.
- background_color4: Color of background.
Merge the image and mask into an RGBA mode image containing an alpha channel.
Using low frequency filtering and retaining high frequency to recover image details. Compared to kijai's DetailTransfer, this node is better integrated with the environment while retaining details.
- image: Background image input.
- detail_image: Detail image input.
- mask: Optional input, if there is a mask input, only the details of the mask part are restored.
- keep_high_freq: Reserved range of high frequency parts. The larger the value, the richer the retained high-frequency details.
- erase_low_freq: The range of low frequency parts of the erasure. The larger the value, the more the low frequency range of the erasure.
- mask_blur: Mask edge blur. Valid only if there is masked input.
Obtain the width and height of the image.
Output:
- width: The width of image.
- height: The height of image.
- original_size: The original size data of the image is used for subsequent node recovery.
Used for rerouter any type of data, this node allows for any type of input.
Switch output from multiple input images and masks, supporting 9 sets of inputs. All input items are optional. if there is only image or mask in a set of input, the missing item will be output as None.
- output: Switch output. the value is the corresponding input group. when the
random-output
option is True, this setting will be ignored. - random_output: When this is true, the
output
setting will be ignored and a random set will be output among all valid inputs.
Retrieve specified images or masks from batch images or masks.
- images: Batch images input. This input is optional.
- masks: Batch masks input. This input is optional.
- select: Select the output image or mask at the batch index value, where 0 is the first image. Multiple values can be entered, separated by any non numeric character, including but not limited to commas, periods, semicolons, spaces or letters, and even Chinese characters. Note: If the value exceeds the batch size, the last image will be output. If there is no corresponding input, an empty 64x64 image or a 64x64 black mask will be output.
Combine multiple paragraphs of text into one.
Added delimiter options on the basis of TextJoin.
Used to provide assistance for workflow debugging. When running, the properties of any object connected to this node will be printed to the console.
This node allows any type of input.
Output a string. same as TextBox.
Output a floating-point value with a precision of 5 decimal places.
Used to generate random value within a specified range, with outputs of int, float, and boolean. Supports batch and list generation, and supports batch generation of a set of different random number lists based on image batch.
- image: Optional input, generate a list of random numbers that match the quantity in batches according to the image.
- min_value: Minimum value. Random numbers will randomly take values from the minimum to the maximum.
- max_value: Maximum value. Random numbers will randomly take values from the minimum to the maximum.
- float_decimal_places: Precision of float value.
- fix_seed:Is the random number seed fixed. If this option is fixed, the generated random number will always be the same.
Outputs: int: Integer random number. float: Float random number. bool: Boolean random number.
On the based of RandomGenerator, add the least random range and seed options.
- image: Optional input, generate a list of random numbers that match the quantity in batches according to the image.
- min_value: Minimum value. Random numbers will randomly take values from the minimum to the maximum.
- max_value: Maximum value. Random numbers will randomly take values from the minimum to the maximum.
- least: Minimum random range. Random numbers will randomly at least take this value.
- float_decimal_places: Precision of float value.
- seed: The seed of random number.
- control_after_generate: Seed change options. If this option is fixed, the generated random number will always be the same.
Outputs: int: Integer random number. float: Float random number. bool: Boolean random number.
Performs mathematical operations on two numeric values and outputs integer and floating point results*. Supported operations include+
, -
, *
, /
, **
, //
, %
.
* The input only supports boolean, integer, and floating point numbers, forcing in other data will result in error.
The upgraded version of NumberCalculator has added numerical inputs within nodes and square root operations. The square root operation option is nth_root
Note: The input takes priority, and when there is input, the values within the node will be invalid.
Perform a Boolean operation on two numeric values and output the result*. Supported operations include==
, !=
, and
, or
, xor
, not
, min
, max
.
* The input only supports boolean, integer, and floating point numbers, forcing in other data will result in error. The and
operation between the values outputs a larger number, and the or
operation outputs a smaller number.
The upgraded version of Boolean Operator has added numerical inputs within nodes and added judgments for greater than, less than, greater than or equal to, and less than or equal to.
Note: The input takes priority, and when there is input, the values within the node will be invalid.
Determine whether the text contains or does not contain substrings, and output a Boolean value.
- text: Input text.
- condition: Judgment conditions.
include
determines whether it contains a substring, andexclude
determines whether it does not. - sub_string: Substring.
Check if the mask contains enough valid areas and output a Boolean value.
- white_point: The white point threshold used to determine whether the mask is valid is considered valid if it exceeds this value.
- area_percent: The percentage of effective areas. If the proportion of effective areas exceeds this value, output True.
On the basis of CheckMask, the method
option has been added, which allows for the selection of different detection methods. The area_percent
is changed to a floating point number with an accuracy of 2 decimal places, which can detect smaller effective areas.
- method: There are two detection methods, which are
simple
anddetectability
. The simple method only detects whether the mask is completely black, while the detect_percent method detects the proportion of effective areas.
Switches output based on Boolean conditional input. It can be used for any type of data switching, including but not limited to numeric values, strings, pictures, masks, models, latent, pipe pipelines, etc.
- if_condition: Conditional input. Boolean, integer, floating point, and string inputs are supported. When entering a value, 0 is judged to be False; When a string is entered, an empty string is judged as Flase.
- when_True: This item is output when the condition is True.
- when_False: This item is output when the condition is False.
Switches the output based on the matching string. It can be used for any type of data switching, including but not limited to numeric values, strings, pictures, masks, models, latent, pipe pipelines, etc. Supports up to 3 sets of case switches.
Compare case to switch_condition
, if the same, output the corresponding input. If there are the same cases, the output is prioritized in order. If there is no matching case, the default input is output.
Note that the string is case sensitive and Chinese and English full-width and half-width.
- input_default: Input entry for default output. This input is required.
- input_1: Input entry used to match
case_1
. This input is optional. - input_2: Input entry used to match
case_2
. This input is optional. - input_3: Input entry used to match
case_3
. This input is optional. - switch_condition: String used to judge with case.
- case_1: case_1 string.
- case_2: case_2 string.
- case_3: case_3 string.
Stop the current queue. When executed at this node, the queue will stop. The workflow diagram above illustrates that if the image is larger than 1Mega pixels, the queue will stop executing.
- mode: Stop mode. If you choose
stop
, it will be determined whether to stop based on the input conditions. If you choosecontinue
, ignore the condition to continue executing the queue. - stop: If true, the queue will stop. If false, the queue will continue to execute.
Clean up GPU VRAM and system RAM. any type of input can be accessed, and when executed to this node, the VRAM and garbage objects in the RAM will be cleaned up. Usually placed after the node where the inference task is completed, such as the VAE Decode node.
Node Options:
- purge_cache: Clean up cache。
- purge_models: Unload all loaded models。
The node used to save the training set images and their text labels, where the image files and text label files have the same file name. Customizable directory for saving images, adding timestamps to file names, selecting save formats, and setting image compression rates.
*The workflow image_tagger_stave.exe is located in the workflow directory.
- iamge: The input image.
- tag_text: Text label of image.
- custom_path*: User-defined directory, enter the directory name in the correct format. If empty, it is saved in the default output directory of ComfyUI.
- filename_prefix*: The prefix of file name.
- timestamp: Timestamp the file name, opting for date, time to seconds, and time to milliseconds.
- format: The format of image save. Currently available in
png
andjpg
. Note that only png format is supported for RGBA mode pictures. - quality: Image quality, the value range 10-100, the higher the value, the better the picture quality, the volume of the file also correspondingly increases.
- preview: Preview switch.
* Enter%date
for the current date (YY-mm-dd) and %time
for the current time (HH-MM-SS). You can enter /
for subdirectories. For example, %date/name_%tiem
will output the image to the YY-mm-dd
folder, with name_HH-MM-SS
as the file name prefix.
Reproduction of Photoshop's layer Style - Blend If function. This node outputs a mask for layer composition on the ImageBlend or ImageBlendAdvance nodes.
mask
is an optional input, and if you enter a mask here, it will act on the output.
- invert_mask: Whether to reverse the mask.
- blend_if: Channel selection for Blend If. There are four options:
gray
,red
,green
, andblue
. - black_point: Black point values, ranging from 0-255.
- black_range: Dark part transition range. The larger the value, the richer the transition level of the dark part mask.
- white_point: White point values, ranging from 0-255.
- white_range: Brightness transition range. The larger the value is, the richer the transition level of the bright part mask is.
Detect the area where the mask is located and output its position and size.
- detect: Detection method,
min_bounding_rect
is the minimum bounding rectangle of block shape,max_inscribed_rect
is the maximum inscribed rectangle of block shape, andmask-area
is the effective area for masking pixels. - x_adjust: Adjust of horizontal deviation after detection.
- y_adjust: Adjust of vertical offset after detection.
- scale_adjust: Adjust the scaling offset after detection.
Output:
- box_preview: Preview image of detection results. Red represents the detected result, and green represents the adjust output result.
- x_percent: Horizontal position output in percentage.
- y_percent: Vertical position output in percentage.
- width: Width.
- height: Height.
- x: The x-coordinate of the top left corner position.
- y: The y-coordinate of the top left corner position.
Ultra Nodes
Nodes that use ultra fine edge masking processing methods, the latest version of nodes includes: SegmentAnythingUltraV2, RmBgUltraV2, BiRefNetUltra, PersonMaskUltraV2, SegformerB2ClothesUltra and MaskEdgeUltraDetailV2.
There are three edge processing methods for these nodes:
PyMatting
optimizes the edges of the mask by using a closed form matching to mask trimap.GuideFilter
uses opencv guidedfilter to feather edges based on color similarity, and performs best when edges have strong color separation.
The code for the above two methods is from the ComfyUI-Image-Filters in spacepxl's Alpha Matte, thanks to the original author.VitMatte
uses the transformer vit model for high-quality edge processing, preserving edge details and even generating semi transparent masks. Note: When running for the first time, you need to download the vitmate model file and wait for the automatic download to complete. If the download cannot be completed, you can run the commandhuggingface-cli download hustvl/vitmatte-small-composition-1k
to manually download. After successfully downloading the model, you can useVITMatte(local)
without accessing the network.- VitMatte's options:
device
set whether to use CUDA for vitimate operations, which is about 5 times faster than CPU.max_megapixels
set the maximum image size for vitmate operation, and oversized images will be reduced in size. For 16G VRAM, it is recommended to set it to 3.
*Download all model files from BaiduNetdisk or Huggingface to ComfyUI/models/vitmatte
folder.
The following figure is an example of the difference in output between three methods.
Remove background. compared to the similar background removal nodes, this node has ultra-high edge details.
This node combines the Alpha Matte node of Spacepxl's ComfyUI-Image-Filters and the functionality of ZHO-ZHO-ZHO's ComfyUI-BRIA_AI-RMBG, thanks to the original author.
*Download model files from BRIA Background Removal v1.4 or BaiduNetdisk to ComfyUI/models/rmbg/RMBG-1.4
folder. This model can be used for non-commercial purposes.
- detail_range: Edge detail range.
- black_point: Edge black sampling threshold.
- white_point: Edge white sampling threshold.
- process_detail: Set to false here will skip edge processing to save runtime.
The V2 upgraded version of RemBgUltra has added the VITMatte edge processing method.(Note: Images larger than 2K in size using this method will consume huge memory)
On the basis of RemBgUltra, the following changes have been made:
- detail_method: Edge processing methods. provides VITMatte, VITMatte(local), PyMatting, GuidedFilter. If the model has been downloaded after the first use of VITMatte, you can use VITMatte (local) afterwards.
- detail_erode: Mask the erosion range inward from the edge. the larger the value, the larger the range of inward repair.
- detail_dilate: The edge of the mask expands outward. the larger the value, the wider the range of outward repair.
- device: Set whether the VitMatte to use cuda.
- max_megapixels: Set the maximum size for VitMate operations.
Generate masks for characters' faces, hair, arms, legs, and clothing, mainly used for segmenting clothing.
The model segmentation code is fromStartHua,thanks to the original author.
Compared to the comfyui_segformer_b2_clothes, this node has ultra-high edge details. (Note: Generating images with edges exceeding 2K in size using the VITMatte method will consume a lot of memory)
*Download all model files from huggingface or BaiduNetdisk to ComfyUI/models/segformer_b2_clothes
folder.
- face: Facial recognition switch.
- hair: Hair recognition switch.
- hat: Hat recognition switch.
- sunglass: Sunglass recognition switch.
- left_arm: Left arm recognition switch.
- right_arm: Right arm recognition switch.
- left_leg: Left leg recognition switch.
- right_leg: Right leg recognition switch.
- skirt: Skirt recognition switch.
- pants: Pants recognition switch.
- dress: Dress recognition switch.
- belt: Belt recognition switch.
- shoe: Shoes recognition switch.
- bag: Bag recognition switch.
- scarf: Scarf recognition switch.
- detail_method: Edge processing methods. provides VITMatte, VITMatte(local), PyMatting, GuidedFilter. If the model has been downloaded after the first use of VITMatte, you can use VITMatte (local) afterwards.
- detail_erode: Mask the erosion range inward from the edge. the larger the value, the larger the range of inward repair.
- detail_dilate: The edge of the mask expands outward. the larger the value, the wider the range of outward repair.
- black_point: Edge black sampling threshold.
- white_point: Edge white sampling threshold.
- process_detail: Set to false here will skip edge processing to save runtime.
- device: Set whether the VitMatte to use cuda.
- max_megapixels: Set the maximum size for VitMate operations.
Using the segformer model to segment clothing with ultra-high edge details. Currently supports segformer b2 clothes, segformer b3 clothes and segformer b3 fashion。
*Download modelfiles from huggingface or BaiduNetdisk to ComfyUI/models/segformer_b2_clothes
folder.
*Download modelfiles from huggingface or BaiduNetdisk to ComfyUI/models/segformer_b3_clothes
folder.
*Download modelfiles from huggingface or BaiduNetdisk to ComfyUI/models/segformer_b3_fashion
folder.
- image: The input image.
- segformer_pipeline: Segformer pipeline input. The pipeline is output by SegformerClottesPipeline and SegformerFashionPipeline node.
- detail_method: Edge processing methods. provides VITMatte, VITMatte(local), PyMatting, GuidedFilter. If the model has been downloaded after the first use of VITMatte, you can use VITMatte (local) afterwards.
- detail_erode: Mask the erosion range inward from the edge. the larger the value, the larger the range of inward repair.
- detail_dilate: The edge of the mask expands outward. the larger the value, the wider the range of outward repair.
- black_point: Edge black sampling threshold.
- white_point: Edge white sampling threshold.
- process_detail: Set to false here will skip edge processing to save runtime.
- device: Set whether the VitMatte to use cuda.
- max_megapixels: Set the maximum size for VitMate operations.
Select the segformer clothes model and choose the segmentation content.
- model: Model selection. There are currently two models available to choose from for segformer b2 clothes and segformer b3 clothes.
- face: Facial recognition switch.
- hair: Hair recognition switch.
- hat: Hat recognition switch.
- sunglass: Sunglass recognition switch.
- left_arm: Left arm recognition switch.
- right_arm: Right arm recognition switch.
- left_leg: Left leg recognition switch.
- right_leg: Right leg recognition switch.
- left_shoe: Left shoe recognition switch.
- right_shoe: Right shoe recognition switch.
- skirt: Skirt recognition switch.
- pants: Pants recognition switch.
- dress: Dress recognition switch.
- belt: Belt recognition switch.
- bag: Bag recognition switch.
- scarf: Scarf recognition switch.
Select the segformer fashion model and choose the segmentation content.
- model: Model selection. Currently, there is only one model available for selection: segformer b3 fashion。
- shirt: shirt and blouse switch.
- top: top, t-shirt, sweatshirt switch.
- sweater: sweater switch.
- cardigan: cardigan switch.
- jacket: jacket switch.
- vest: vest switch.
- pants: pants switch.
- shorts: shorts switch.
- skirt: skirt switch.
- coat: coat switch.
- dress: dress switch.
- jumpsuit: jumpsuit switch.
- cape: cape switch.
- glasses: glasses switch.
- hat: hat switch.
- hairaccessory: headband, head covering, hair accessory switch.
- tie: tie switch.
- glove: glove switch.
- watch: watch switch.
- belt: belt switch.
- legwarmer: leg warmer switch.
- tights: tights and stockings switch.
- sock: sock switch.
- shoe: shoes switch.
- bagwallet: bag and wallet switch.
- scarf: scarf switch.
- umbrella: umbrella switch.
- hood: hood switch.
- collar: collar switch.
- lapel: lapel switch.
- epaulette: epaulette switch.
- sleeve: sleeve switch.
- pocket: pocket switch.
- neckline: neckline switch.
- buckle: buckle switch.
- zipper: zipper switch.
- applique: applique switch.
- bead: bead switch.
- bow: bow switch.
- flower: flower switch.
- fringe: fringe switch.
- ribbon: ribbon switch.
- rivet: rivet switch.
- ruffle: ruffle switch.
- sequin: sequin switch.
- tassel: tassel switch.
Process rough masks to ultra fine edges. This node combines the Alpha Matte and the Guided Filter Alpha nodes functions of Spacepxl's ComfyUI-Image-Filters, thanks to the original author.
- method: Provide two methods for edge processing: PyMatting and OpenCV-GuidedFilter. PyMatching has a slower processing speed, but for video, it is recommended to use this method to obtain smoother mask sequences.
- mask_grow: Mask expansion amplitude. positive values expand outward, while negative values contract inward. For rougher masks, negative values are usually used to shrink their edges for better results.
- fix_gap: Repair the gaps in the mask. if obvious gaps in the mask, increase this value appropriately.
- fix_threshold: The threshold of fix_gap.
- detail_range: Edge detail range.
- black_point: Edge black sampling threshold.
- white_point: Edge white sampling threshold.
The V2 upgraded version of MaskEdgeUltraDetail has added the VITMatte edge processing method.(Note: Images larger than 2K in size using this method will consume huge memory)
This method is suitable for handling semi transparent areas.
On the basis of MaskEdgeUltraDetail, the following changes have been made:
- method: Edge processing methods. provides VITMatte, VITMatte(local), PyMatting, GuidedFilter. If the model has been downloaded after the first use of VITMatte, you can use VITMatte (local) afterwards.
- edge_erode: Mask the erosion range inward from the edge. the larger the value, the larger the range of inward repair.
- edge_dilate: The edge of the mask expands outward. the larger the value, the wider the range of outward repair.
- device: Set whether the VitMatte to use cuda.
- max_megapixels: Set the maximum size for VitMate operations.
Generate a mask based on the selected color.
- image: Input image.
- mask: This input is optional, if there is a mask, only the colors inside the mask are included in the range.
- color: Color selector. Click on the color block to select a color, and you can use the straws on the color picker panel to pick up the screen color. Note: When using straws, maximize the browser window.
- color_in_HEX4: Enter color values. If this item has input, it will be used first, ignoring the color selected by the
color
. - threshold: Mask range threshold, the larger the value, the larger the mask range.
- fix_gap: Repair the gaps in the mask. If there are obvious gaps in the mask, increase this value appropriately.
- fix_threshold: The threshold for repairing masks.
- invert_mask: Whether to reverse the mask.
Convert the image to a mask. Supports converting any channel in LAB, RGBA, YUV, and HSV modes into masks, while providing color scale adjustment. Support mask optional input to obtain masks that only include valid parts.
- image: Input image.
- mask: This input is optional, if there is a mask, only the colors inside the mask are included in the range.
- channel: Channel selection. You can choose any channel of LAB, RGBA, YUV, or HSV modes.
- black_point*: Black dot value for the mask. The value range is 0-255, with a default value of 0.
- white_point*: White dot value for the mask. The value range is 0-255, with a default value of 255.
- gray_point: Gray dot values for the mask. The value range is 0.01-9.99, with a default of 1.
- invert_output_mask: Whether to reverse the mask.
*If the black_point or output_black_point value is greater than white_point or output_white_point, the two values are swapped, with the larger value used as white_point and the smaller value used as black_point.
Shadow & Highlight Mask
Generate masks for the dark and bright parts of the image.
- image: The input image.
- mask: Optional input. if there is input, only the colors within the mask range will be adjusted.
- shadow_level_offset: The offset of values in the dark area, where larger values bring more areas closer to the bright into the dark area.
- shadow_range: The transitional range of the dark area.
- highlight_level_offset: The offset of values in the highlight area, where larger values bring more areas closer to the dark into the highlight area.
- highlight_range: The transitional range of the highlight area.
Shadow Highlight Mask V2
A replica of the Shadow & Highlight Mask
node, with the "&" character removed from the node name to avoid ComfyUI workflow parsing errors.
Pixel expansion preprocessing on the masked edge of an image can effectively improve the edges of image composit.
- invert_mask: Whether to reverse the mask.
- mask_grow: Mask expansion amplitude.
Grow and shrink edges and blur the mask
- invert_mask: Whether to reverse the mask.
- grow: Positive values expand outward, while negative values contract inward.
- blur: Blur the edge.
Smooth transition and shrink the mask edges while preserving edge details.
- invert_mask: Whether to reverse the mask.
- shrink_level: Shrink the smoothness level.
- soft: Smooth amplitude.
- edge_shrink: Edge shrinkage amplitude.
- edge_reserve: Preserve the amplitude of edge details, 100 represents complete preservation, and 0 represents no preservation at all.
Comparison of MaskGrow and MaskEdgeShrink
Create motion blur on the mask.
- invert_mask: Whether to reverse the mask.
- blur: The size of blur.
- angle: The angle of blur.
Create a gradient for the mask from one side. please note the difference between this node and the CreateGradientMask node.
- invert_mask: Whether to reverse the mask.
- gradient_side: Generate gradient from which edge. There are four directions: top, bottom, left and right.
- gradient_scale: Gradient distance. The default value of 100 indicates that one side of the gradient is completely transparent and the other side is completely opaque. The smaller the value, the shorter the distance from transparent to opaque.
- gradient_offset: Gradient position offset.
- opacity: The opacity of the gradient.
Create a gradient mask. please note the difference between this node and the MaskGradient node.
- size_as*: The input image or mask here will generate the output image and mask according to their size. this input takes priority over the width and height below.
- width: Width of the image. If there is a size_as input, this setting will be ignored.
- height: Height of the image. If there is a size_as input, this setting will be ignored.
- gradient_side: Generate gradient from which edge. There are five directions: top, bottom, left, right and center.
- gradient_scale: Gradient distance. The default value of 100 indicates that one side of the gradient is completely transparent and the other side is completely opaque. The smaller the value, the shorter the distance from transparent to opaque.
- gradient_offset: Gradient position offset. When
gradient_side
is center, the size of the gradient area is adjusted here, positive values are smaller, and negative values are enlarged. - opacity: The opacity of the gradient.
*Only limited to input image and mask. forcing the integration of other types of inputs will result in node errors.
Generate mask contour strokes.
- invert_mask: Whether to reverse the mask.
- stroke_grow: Stroke expansion/contraction amplitude, positive values indicate expansion and negative values indicate contraction.
- stroke_width: Stroke width.
- blur: Blur of stroke.
- grain: Noise intensity.
- invert_mask: Whether to reverse the mask.
Sharp & Soft
Enhance or smooth out details for image.
- enhance: Provide 4 presets, which are very sharp, sharp, soft and very soft. If you choose None, you will not do any processing.
- smooth: Skin smoothness.
- threshold: Smooth range. the larger the range with the smaller value.
- opacity: The opacity of the smoothness.
- line_density: The black line density.
- opacity: The opacity of watercolor effects.
Soft light effect, the bright highlights on the screen appear blurry.
- soft: Size of soft light.
- threshold: Soft light range. the light appears from the brightest part of the picture. in lower value, the range will be larger, and in higher value, the range will be smaller.
- opacity: Opacity of the soft light.
Channel misalignment. similar to the effect of Tiktok logo.
- distance: Distance of channel separation.
- angle: Angle of channel separation.
- mode: Channel shift arrangement order.
HDR Effects
enhances the dynamic range and visual appeal of input images. This node is reorganize and encapsulate of HDR Effects (SuperBeasts.AI), thanks to the original author.
- hdr_intensity: Range: 0.0 to 5.0, Controls the overall intensity of the HDR effect, Higher values result in a more pronounced HDR effect.
- shadow_intensity: Range: 0.0 to 1.0,Adjusts the intensity of shadows in the image,Higher values darken the shadows and increase contrast.
- highlight_intensity: Range: 0.0 to 1.0,Adjusts the intensity of highlights in the image,Higher values brighten the highlights and increase contrast.
- gamma_intensity: Range: 0.0 to 1.0,Controls the gamma correction applied to the image,Higher values increase the overall brightness and contrast.
- contrast: Range: 0.0 to 1.0,Enhances the contrast of the image, Higher values result in more pronounced contrast.
- enhance_color: Range: 0.0 to 1.0,Enhances the color saturation of the image, Higher values result in more vibrant colors.
Simulate the grain, dark edge, and blurred edge of the film, support input depth map to simulate defocus.
This node is reorganize and encapsulate of digitaljohn/comfyui-propost, thanks to the original author.
- image: The input image.
- depth_map: Input depth map to simulate defocus effect. it is an optional input. if there is no input, will simulates radial blur at the edges of the image.
- center_x: The horizontal axis of the center point position of the dark edge and radial blur, where 0 represents the leftmost side, 1 represents the rightmost side, and 0.5 represents at the center.
- center_y: The vertical axis of the center point position of the dark edge and radial blur, where 0 represents the leftmost side, 1 represents the rightmost side, and 0.5 represents at the center.
- saturation: Color saturation, 1 is the original value.
- grain_power: Grain intensity. larger value means more pronounced the noise.
- grain_scale: Grain size.
- grain_sat: The color saturation of grain. 0 represents mono noise, and the larger the value, the more prominent the color.
- grain_shadows: Grain intensity of dark part.
- grain_highs: Grain intensity of light part.
- blur_strength: The strength of blur. larger value means more blurry it becomes.
- blur_focus_spread: Focus diffusion range. larger value means larger clear range.
- focal_depth: Simulate the focal distance of defucus. 0 indicates that focus is farthest, and 1 indicates that is closest. this setting only valid when input the depth_map.
The upgraded version of the Film node adds the fastgrain method on the basis of the previous one, and the speed of generating noise is accelerated by 10 times. The code for fastgrain is from github.com/spacepxl/ComfyUI-Image-Filters BetterFilmGrain node, thanks to the original authors.
Simulate the light leakage effect of the film. please download model file from Baidu Netdisk or [Google Drive]([light_leak.pkl(Google Drive)(https://drive.google.com/file/d/1DcH2Zkyj7W3OiAeeGpJk1eaZpdJwdCL-/view?usp=sharing)) and copy to ComfyUI/models/layerstyle
folder.
- light: 32 types of light spots are provided. random is a random selection.
- corner: There are four options for the corner where the light appears: top left, top right, bottom left, and bottom right.
- hue: The hue of the light.
- saturation: The color saturation of the light.
- opacity: The opacity of the light.
- color_map: Effect type. there are a total of 22 types of effects, as shown in the above figure.
- opacity: The opacity of the color map effect.
Node options:
- angle: The angle of blur.
- blur: The size of blur.
Node options:
- blur: The size of blur, integer, range 1-999.
Gaussian blur. Change the parameter precision to floating-point number, with a precision of 0.01
- blur: The size of blur, float, range 0 - 1000.
- grain_power: Noise intensity.
- grain_scale: Noise size.
- grain_sat: Color saturation of noise.
Annotation for notes
1 The layer_image, layer_mask and the background_image(if have input), These three items must be of the same size.
2 The mask not a mandatory input item. the alpha channel of the image is used by default. If the image input does not include an alpha channel, the entire image's alpha channel will be automatically created. if have masks input simultaneously, the alpha channel will be overwrite by the mask.
3 The Blend Mode include normal, multply, screen, add, subtract, difference, darker, color_burn, color_dodge, linear_burn, linear_dodge, overlay, soft_light, hard_light, vivid_light, pin_light, linear_light, and hard_mix. all of 19 blend modes in total.
*Preview of the blend mode
3 The BlendModeV2 include normal, dissolve, darken, multiply, color burn, linear burn, darker color, lighten, screen, color dodge, linear dodge(add), lighter color, dodge, overlay, soft light, hard light, vivid light, linear light, pin light, hard mix, difference, exclusion, subtract, divide, hue, saturation, color, luminosity, grain extract, grain merge all of 30 blend modes in total.
Part of the code for BlendMode V2 is from Virtuoso Nodes for ComfyUI. Thanks to the original authors.
*Preview of the Blend Mode V2
4 The RGB color described by hexadecimal RGB format, like '#FA3D86'.
5 The layer_image and layer_mask must be of the same size.
LayerStyle nodes follows the MIT license, Some of its functional code comes from other open-source projects. Thanks to the original author. If used for commercial purposes, please refer to the original project license to authorization agreement.