UnifiedReward 1.0 LLaVA Model - a CodeGoat24 Collection

updated 6 days ago

Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published Mar 7, 2025 • 123

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • Published May 6, 2025 • 92

CodeGoat24/UnifiedReward-Think-7b
8B • Updated Aug 29, 2025 • 170 • 10

CodeGoat24/UnifiedReward-7b-v1.5
8B • Updated Nov 5, 2025 • 2.51k • 7

CodeGoat24/UnifiedReward-7b
8B • Updated Nov 5, 2025 • 206 • 6

CodeGoat24/UnifiedReward-0.5b
1B • Updated Aug 29, 2025 • 5 • 1

CodeGoat24/LLaVA-Video-7B-Qwen2-UnifiedReward-DPO
8B • Updated Aug 29, 2025 • 2

CodeGoat24/sdxl-turbo-unified-reward-dpo
Text-to-Image • Updated Aug 29, 2025 • 7 • 1

CodeGoat24/llava-onevision-qwen2-7b-ov-unifiedreward-dpo
8B • Updated Aug 29, 2025

CodeGoat24/T2V-Turbo
Updated Aug 29, 2025