Collections

Collections including paper arxiv:2403.07691
About ORPO
Contains some information and experiments on fine-tuning LLMs using 🤗 `trl.ORPOTrainer`
Foundation AI Papers (II)
Collection by
Jul 23, 2024
alignment_24_best
Collection by
Oct 21, 2024
Papers - KAIST AI
Collection by
May 6, 2024
Zephyr ORPO
Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook