Sukesh Perla
hitchhiker3010
AI & ML interests
None yet
Recent Activity
updated a collection 3 days ago
to_read updated a collection about 1 month ago
AI Ads reacted to sergiopaniego's post with ๐ฅ about 2 months ago
New TRL + OpenEnv example! ๐ฅ
Fine tune an LLM for playing Sudoku using an RL env via OpenEnv
Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.
Enjoy!
Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb
Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py