ComfyUI LLaVA Captioner

A ComfyUI extension for chatting with your images. It runs on your own system, with no external services and no filter. It uses the [LLaVA multimodal LLM](https://llava-vl.github.io/), so you can give instructions or ask questions in natural language. It's maybe as smart as GPT-3.5, and it can see.

Stars: 139

Author: ceruleandeep

Last Update: 8/3/2024

Days: 1603

Category: image processing

Technical Information

Install Type: git-clone

Node ID: 52584
