AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Paper
• 2410.02355 • Published
google/gemma-3-4b-it with refusal behavior removed via orthogonal projection. Uses null-space constraints and adaptive layer weighting to preserve model capabilities.
Note: This model will produce uncensored outputs. Use responsibly.
GGUF quantizations available at: jwest33/gemma-3-4b-it-null-space-abliterated-GGUF
| Parameter | Value |
|---|---|
| Harmful Prompts | 5000 |
| Harmless Prompts | 637 |
| Winsorization | 99.5th percentile |
| Null-Space Constraints | rank ratio: 0.90 |
| Directional Multiplier | 1.03 |
| SAE Targeted Coverage | 1.00 |
github.com/jwest33/abliterator
This model inherits the Gemma license from the base model. Please review and comply with Google's usage terms.
This model is provided for research and educational purposes. The creators are not responsible for any misuse. Users are solely responsible for ensuring their use complies with applicable laws and ethical standards.
Base model
google/gemma-3-4b-pt