add link to mistral ai's acknowledgement
Reapor-Yurnero committed Oct 18, 2024
1 parent e745a81 commit 761488e
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/index.md
@@ -178,7 +178,7 @@ Alternatively, the adversarial prompt can be input after one or several turns of

## Adversarial Prompts

-Our adversarial prompts show consistently high attack success rate and good quality of PII exfiltration throughout various unseen user-agent conversations. Find more details about our evaluation and results in the [paper]().
+Our adversarial prompts show consistently high attack success rate and good quality of PII exfiltration throughout various unseen user-agent conversations. Find more details about our evaluation and results in the [paper](./paper.pdf).

### PII Exfiltration

@@ -207,7 +207,7 @@ Another attack target, which is not shown above but discussed in the paper, is c

## Disclosure and Impact

-We initiated disclosure to Mistral and ChatGLM team on Sep 9, 2024, and Sep 18, 2024, respectively. Mistral security team members responded promptly and acknowledged the vulnerability as a **medium-severity issue**. They fixed the data exfiltration by disabling markdown rendering of external images on Sep 13, 2024. We confirmed that the fix works. ChatGLM security team has not responded to us despite multiple attempts through various channels.
+We initiated disclosure to Mistral and ChatGLM team on Sep 9, 2024, and Sep 18, 2024, respectively. Mistral security team members responded promptly and acknowledged the vulnerability as a **medium-severity issue**. They fixed the data exfiltration by disabling markdown rendering of external images on Sep 13, 2024 (find the acknowledgement in [Mistral changelog](https://docs.mistral.ai/getting-started/changelog/)). We confirmed that the fix works. ChatGLM security team has not responded to us despite multiple attempts through various channels.


## Citation