Image-to-Text
Transformers
Safetensors
Japanese
llava
image-text-to-text
vision-language
image-captioning
japanese-stable-vlm
custom_code
Instructions to use stabilityai/japanese-stable-vlm with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use stabilityai/japanese-stable-vlm with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="stabilityai/japanese-stable-vlm", trust_remote_code=True)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("stabilityai/japanese-stable-vlm", trust_remote_code=True) model = AutoModelForImageTextToText.from_pretrained("stabilityai/japanese-stable-vlm", trust_remote_code=True) - Notebooks
- Google Colab
- Kaggle
You need to agree to share your contact information to access this model
This repository is publicly accessible, but you have to accept the conditions to access its files and content.
By clicking "Agree", you agree to the License Agreement and acknowledge Stability AI's Privacy Policy.
Log in or Sign Up to review the conditions and access this model content.
Gated model You can list files but not access them
Preview of files found in this repository