Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CodeGoat24 's Collections
UnifiedReward Flex
Pref-GRPO & UniGenBench
UnifiedReward Edit Models
UnifiedReward 2.0 Qwen3VL Models
UnifiedReward 2.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5 Models GGUF
UnifiedReward 1.0 LLaVA Model
UnifiedReward Training Data

UnifiedReward 1.0 LLaVA Model

updated 6 days ago
Upvote
-

  • Unified Reward Model for Multimodal Understanding and Generation

    Paper • 2503.05236 • Published Mar 7, 2025 • 123

  • Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

    Paper • 2505.03318 • Published May 6, 2025 • 92

  • CodeGoat24/UnifiedReward-Think-7b

    8B • Updated Aug 29, 2025 • 170 • 10

  • CodeGoat24/UnifiedReward-7b-v1.5

    8B • Updated Nov 5, 2025 • 2.51k • 7

  • CodeGoat24/UnifiedReward-7b

    8B • Updated Nov 5, 2025 • 206 • 6

  • CodeGoat24/UnifiedReward-0.5b

    1B • Updated Aug 29, 2025 • 5 • 1

  • CodeGoat24/LLaVA-Video-7B-Qwen2-UnifiedReward-DPO

    8B • Updated Aug 29, 2025 • 2

  • CodeGoat24/sdxl-turbo-unified-reward-dpo

    Text-to-Image • Updated Aug 29, 2025 • 7 • 1

  • CodeGoat24/llava-onevision-qwen2-7b-ov-unifiedreward-dpo

    8B • Updated Aug 29, 2025

  • CodeGoat24/T2V-Turbo

    Updated Aug 29, 2025
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs