auditing-agents/llama_70b_synth_docs_only_then_redteam_kto_hallucinates_citations Updated 12 days ago