Configuration Parsing
Warning:
Invalid JSON for config file model_index.json
π Quick Start
π οΈ Environment Setup
β Recommended Setup
# 1. Clone the repo
git clone https://github.com/jiaosiyuu/ThinkGen.git
cd ThinkGen
# 2. (Optional) Create a clean Python environment
conda create -n thinkgen python=3.11
conda activate thinkgen
# 3. Install dependencies
# 3.1 Install PyTorch (choose correct CUDA version)
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu124
# 3.2 Install other required packages
pip install -r req.txt
# ThinkGen runs even without flash-attn, though we recommend install it for best performance.
pip install --no-cache-dir flash-attn==2.7.4.post1 --no-build-isolation
π For users in Mainland China
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://mirror.sjtu.edu.cn/pytorch-wheels/cu124
pip install -r req.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install --no-cache-dir flash-attn==2.7.4.post1 --no-build-isolation -i https://pypi.tuna.tsinghua.edu.cn/simple
- Run Locally:
from ThinkGen.model import ThinkGen_Chat
import os
model_path = "JSYuuu/ThinkGen"
chat_model = ThinkGen_Chat(
model_path=model_path,
dtype='bf16',
height=1024,
width=1024
)
# Generation
messages = [
{"type": "text", "value": '''A close-up image of a red apple with the words 'Tart & Sweet' in white, cursive font on its surface, forming a spiral pattern. The apple is centered in the frame, and the background is a green surface labeled 'Organic Produce' in black, bold letters. The apple has a visible stem and a small bite mark on its side with the word 'Juicy' written in a small, handwritten style near the bite.'''}
]
results = chat_model.generate_image(messages)
output_dir = "vis/chat"
os.makedirs(output_dir, exist_ok=True)
for i, img in enumerate(results.images):
save_path = os.path.join(output_dir, f"result_{i}.png")
img.save(save_path)
print(f"Saved to {save_path}")
# Understanding
messages = [
{"type": "image", "value": "images/teaser.png"},
{"type": "text", "value": "Describe this image"}
]
response = chat_model.generate_text(messages)
print(response)
License
This work is licensed under Apache 2.0 license.
- Downloads last month
- 11
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support