LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding
Paper
โข
2306.17107
โข
Published
โข
11
NOTE: This "delta model" cannot be used directly. It should be merged with the LLaMA-13B checkpoint.
For more information, please refer to LLaVAR project page, Github repo, and paper.