Video-Text-to-Text
Transformers
Safetensors
English
videochat_flash_qwen
feature-extraction
multimodal
custom_code
Eval Results (legacy)
Instructions to use OpenGVLab/VideoChat-Flash-Qwen2-7B_res448 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/VideoChat-Flash-Qwen2-7B_res448 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/VideoChat-Flash-Qwen2-7B_res448", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -120,7 +120,7 @@ if mm_llm_compress:
|
|
| 120 |
model.config.llm_compress_layer_list = [4, 18]
|
| 121 |
model.config.llm_image_token_ratio_list = [1, 0.75, 0.25]
|
| 122 |
else:
|
| 123 |
-
model.config.mm_llm_compress =
|
| 124 |
|
| 125 |
# evaluation setting
|
| 126 |
max_num_frames = 512
|
|
|
|
| 120 |
model.config.llm_compress_layer_list = [4, 18]
|
| 121 |
model.config.llm_image_token_ratio_list = [1, 0.75, 0.25]
|
| 122 |
else:
|
| 123 |
+
model.config.mm_llm_compress = False
|
| 124 |
|
| 125 |
# evaluation setting
|
| 126 |
max_num_frames = 512
|