ComfyUI LLaVA Captioner

A ComfyUI extension for chatting with your images. Runs on your own system, no external services used, no filter. Uses the [a/LLaVA multimodal LLM](https://llava-vl.github.io/) so you can give instructions or ask questions in natural language. It's maybe as smart as GPT3.5, and it can see.

139
Stars
ceruleandeep
Author
8/3/2024
Last Update
1603
Days

Category

image processing

Description

A ComfyUI extension for chatting with your images. Runs on your own system, no external services used, no filter. Uses the [a/LLaVA multimodal LLM](https://llava-vl.github.io/) so you can give instructions or ask questions in natural language. It's maybe as smart as GPT3.5, and it can see.

Technical Information

Install Type:git-clone
Node ID:52584

Related Nodes

Discover more nodes in the same category or by the same author.

View all image processing nodes →