Overview
Plain English
Pioneering open-source vision-language model enabling visual question answering and image captioning with local LLMs.
Technical
Pioneering open-source vision-language model enabling visual question answering and image captioning with local LLMs.
Technical scorecard
Data & Privacy
Does it send data online?
After setup, this listing is marked as usable offline. Confirm network behavior against the upstream project before regulated deployment.
Does it store history?
Not verified in this directory yet. Review the upstream docs for persistence, logs, and workspace storage.
License checks?
Commercial use is marked as allowed or likely allowed by the listed license.
Telemetry?
None
Last verified: Apr 10, 2026. Maintainer verification should be treated as directory guidance, not legal advice.
Setup & Installation
GUI, low-resource, or simple install path likely available.
Prerequisites
Python, Docker, Bare Metal
# Start with the official project documentation
# https://github.com/haotian-liu/LLaVA Hardware Requirements
Works Well With