Wang

VincentWang

VincentWong1

AI & ML interests

None yet

Recent Activity

liked a dataset 17 days ago

kensho/DocFinQA

liked a model 18 days ago

YOYO-AI/Qwen3-30B-A3B-YOYO-Thinking-Chimera

liked a model about 1 month ago

OpenAssistant/reward-model-deberta-v3-large-v2

View all activity

Organizations

None yet

liked a dataset 17 days ago

kensho/DocFinQA

Viewer • Updated Nov 19, 2024 • 7.44k • 413 • 14

liked a model 18 days ago

YOYO-AI/Qwen3-30B-A3B-YOYO-Thinking-Chimera

Text Generation • 31B • Updated 21 days ago • 68 • 5

liked a model about 1 month ago

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 4.85k • • 244

liked 2 datasets about 1 month ago

Mxode/Chinese-Instruct

Viewer • Updated May 9, 2025 • 4.85M • 985 • 141

BAAI/IndustryCorpus2

Viewer • Updated Dec 17, 2024 • 826M • 3.93k • 61

liked 2 datasets about 2 months ago

nvidia/Nemotron-RL-instruction_following-structured_outputs

Viewer • Updated 13 days ago • 9.95k • 135 • 28

instruction-pretrain/general-instruction-augmented-corpora

Preview • Updated Mar 1, 2025 • 3.41k • 20

liked a model 4 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated Aug 26, 2025 • 7.89k • 477

liked a dataset 5 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13, 2025 • 217 • 25

liked a model 6 months ago

infly/inf-retriever-v1

liked a dataset 6 months ago

FreedomIntelligence/Evol-Instruct-Chinese-GPT4

Viewer • Updated Dec 6, 2023 • 70k • 34 • 47

liked a model 8 months ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

71B • Updated Apr 13, 2025 • 1.08k • 90

liked 3 datasets 8 months ago

liked a model 9 months ago

TIGER-Lab/general-verifier

Question Answering • 2B • Updated Apr 15, 2025 • 3.33k • • 21

upvoted an article 9 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

104

liked 2 datasets 9 months ago

ucinlp/drop

Viewer • Updated Jan 17, 2024 • 86.9k • 10.2k • 66

deepmind/aqua_rat

Viewer • Updated Jan 9, 2024 • 196k • 11.7k • 72

liked a model 10 months ago

Xenova/text-embedding-ada-002

Updated Aug 18, 2025 • 79

Wang

AI & ML interests

Recent Activity

Organizations

VincentWang's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment