LVE Repository

security/prompt_leakage/sys_prompt_leak_cipher

lve record repository/security/prompt_leakage/sys_prompt_leak_cipher/openai--gpt-35-turbo

sys_prompt_leak_cipher

This LVE demonstrates how openai/gpt-3.5-turbo could be used instructed to leak a secret from system prompt by asking it to encrypt it via (reversible) cipher. See https://arxiv.org/abs/2308.06463 for more general discussion of this problem.

Loading...