A set of models from my experiments with Reinforcement Learning from Human Feedback
Samir R.
sr5434
AI & ML interests
NLP
Recent Activity
updated a model 5 days ago
sr5434/model-tempfiles updated a collection 10 days ago
RLHF Models updated a collection 10 days ago
RLHF ModelsOrganizations
None yet