Skip to content

Commit

Permalink
update paper pdf, add video to README
Browse files Browse the repository at this point in the history
  • Loading branch information
Reapor-Yurnero committed Oct 22, 2024
1 parent cbcfa14 commit 3040f61
Show file tree
Hide file tree
Showing 4 changed files with 9 additions and 3 deletions.
8 changes: 7 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,12 @@
# Imprompter: Tricking LLM Agents into Improper Tool Use

This is the codebase of `imprompter`. It provides essential components to reproduce and test the attack presented in the [paper](https://arxiv.org/abs/2410.14923). Video demos can be found on our [website](https://imprompter.ai). You may create your own attack on top of it as well.
This is the codebase of `imprompter`. It provides essential components to reproduce and test the attack presented in the [paper](https://arxiv.org/abs/2410.14923). You may create your own attack on top of it as well.

A video screencast showing how an attacker can exfiltrate the user's PII in real world LLM product ([Mistral LeChat](https://chat.mistral.ai/chat)) with our adversarial prompt:

![video](docs/mistral_pii_demo.mp4)

More video demos can be found on our [website](https://imprompter.ai).

## Setup

Expand Down
2 changes: 1 addition & 1 deletion docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ We present various demos and textual adversarial prompts on this page. For full
## How to Reproduce

!!! warning "Expected Behavior"
After we disclosed this vunerability to Mistral AI in September 2024, their security team decided to disable image markdown rendering features. Now you will not see the same behavior in the video demo but an image placeholder as in the conversation window. Find more details in the [Disclosure section](#disclosure-and-impact). The ChatGLM security team has not responded or addressed such issue. You should be able to reproduce the exact bahavior there.
After we disclosed this vunerability to Mistral AI in September 2024, their security team decided to disable image markdown rendering features. Now you will not see the same behavior in the video demo but an image placeholder as in the conversation window. Find more details in the [Disclosure section](#disclosure-and-impact). The ChatGLM security team has not yet addressed such issue as of Oct 21 2024. You should be able to reproduce the exact bahavior there.

### Scenario 1

Expand Down
2 changes: 1 addition & 1 deletion docs/overrides/main.html
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ <h4 class="center" style="margin-top: -1rem;">
</div>
<div class="title-text">Paper</div>
</a>
<a href="https://github.com/Reapor-Yurnero/ImPrompter" class="center" target="_blank">
<a href="https://github.com/Reapor-Yurnero/imprompter" class="center" target="_blank">
<div class="twemoji title-icon">
{% include ".icons/fontawesome/brands/github.svg" %}
</div>
Expand Down
Binary file modified docs/paper.pdf
Binary file not shown.

0 comments on commit 3040f61

Please sign in to comment.