Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MultiRL
non-profit
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
iruno
updated
a dataset
about 1 hour ago
MultiRL/new_sudoku_benchmark_900
iruno
published
a dataset
about 1 hour ago
MultiRL/new_sudoku_benchmark_900
KimSHine
updated
a model
about 2 hours ago
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_token_tis
View all activity
Team members
3
models
94
Sort: Recently updated
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_token_tis
2B
•
Updated
about 2 hours ago
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_6epoch
2B
•
Updated
about 17 hours ago
MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5
2B
•
Updated
about 23 hours ago
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_token_tis
2B
•
Updated
2 days ago
•
183
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_gem_ms_seq_is
2B
•
Updated
3 days ago
•
273
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_mask_only
2B
•
Updated
4 days ago
•
246
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_995_98_ori_norm
2B
•
Updated
8 days ago
•
156
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_995_98
2B
•
Updated
9 days ago
•
6
MultiRL/qwen3_1.7b_sft_final_easy_reinforce_ours_adv_fixed_gamma_0.9
2B
•
Updated
11 days ago
•
382
MultiRL/qwen3_1.7b_easy_rl_old_adv_fixed_gamma_1
2B
•
Updated
13 days ago
•
230
View 94 models
datasets
32
Sort: Recently updated
MultiRL/new_sudoku_benchmark_900
Updated
about 1 hour ago
MultiRL/new_sudoku_many
Viewer
•
Updated
about 16 hours ago
•
790
MultiRL/hard_short
Viewer
•
Updated
about 19 hours ago
•
100
MultiRL/easy_tooshort
Viewer
•
Updated
about 21 hours ago
•
420
MultiRL/easy_toolong
Viewer
•
Updated
about 21 hours ago
•
150
MultiRL/final_sudoku_medium_rl_hint
Viewer
•
Updated
about 23 hours ago
•
640
MultiRL/final_sudoku_sft_C_hint
Viewer
•
Updated
about 23 hours ago
•
800
MultiRL/final_sudoku_easy_rl_hint
Viewer
•
Updated
about 23 hours ago
•
320
MultiRL/final_sudoku_benchmark_hint
Viewer
•
Updated
2 days ago
•
515
•
12
MultiRL/rush_hour_benchmark
Viewer
•
Updated
8 days ago
•
150
•
31
View 32 datasets