AlignmentResearch
/

obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.01-det3-seed1-deception_probe

deception-detection

alignment-research

obfuscation-atlas

model-type:honest

Model card Files Files and versions

obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.01-det3-seed1-deception_probe

Commit History

Upload README.md with huggingface_hub

9dad609
verified

taufeeque commited on Feb 20

Upload folder using huggingface_hub

7dfb849
verified

taufeeque commited on Feb 16

Upload README.md with huggingface_hub

6935f60
verified

taufeeque commited on Feb 16

initial commit

befa722
verified

taufeeque commited on Feb 16