You should apply that LoRA to an uncensored model

by Onix22 - opened 1 day ago

It is irritating when it starts doing safety assessments or refusing things.
Although maybe that is less prevalent if you set it to some other persona. It generally obeys in direct reply mode but sometimes refuses in thinking mode.

zerofata

Owner 1 day ago

It probably depends a little bit on the scenario and the system prompt.

I think someone did a heretic version of this model which should work similarly? It's an open question if training on an abliterated model is a good idea or if abliteration should be done after training a model.

At some point I'll give the model a run through DPO, which should help, as I can target some of the repetition, stability and censorship with that.

Onix22

1 day ago

Training on abliterated models would actually even fix abliteration damage, so it's a good idea to train after abliterating rather than before.
But since you probably trained a LoRA rather than the full model anyway, you can just apply it to the abliterated model now and it will work immediately without any retraining.
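
To illustrate why that can work: a LoRA adapter is stored as a low-rank delta that gets added on top of a base weight, so the same delta can be added to any base with matching shapes. Here is a rough pure-Python sketch of the usual merge formula, W + (alpha / rank) * B @ A. The matrices, `alpha`, `rank`, and the helper names are all made up for illustration and are not taken from this model.

```python
# Minimal sketch: a LoRA adapter is a low-rank delta added to a base weight,
# so it can be added to any base weight of the same shape.
# All matrices and values below are hypothetical, chosen only for illustration.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def apply_lora(w_base, lora_a, lora_b, alpha, rank):
    """Return W + (alpha / rank) * (B @ A), the usual LoRA merge formula."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(w_base, delta)]

# Hypothetical 2x2 weights: the original base and an abliterated variant.
w_original = [[1.0, 2.0], [3.0, 4.0]]
w_abliterated = [[1.0, 0.0], [3.0, 0.0]]

# Hypothetical rank-1 adapter: A is (rank x in), B is (out x rank).
lora_a = [[1.0, 1.0]]
lora_b = [[0.5], [-0.5]]
alpha, rank = 2.0, 1

# The same adapter is applied to both bases; only the starting weights differ.
merged_original = apply_lora(w_original, lora_a, lora_b, alpha, rank)
merged_abliterated = apply_lora(w_abliterated, lora_a, lora_b, alpha, rank)
print(merged_original)
print(merged_abliterated)
```

With real checkpoints the equivalent step would be loading the abliterated base and attaching the adapter with PEFT's `PeftModel.from_pretrained`, assuming the adapter was saved in PEFT format and the architecture matches. The adapter was still trained against the original weights, so how cleanly the behavior carries over is something to test rather than assume.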
