@AlphaSignalAI
LLM hacking is becoming a huge problem. Malicious images and sounds can now be used to modify the behavior of LLMs. They can even be embedded in a website or email attachment. Another recent paper showed that adversarial suffixes can disrupt the behavior of open source LLMs… https://t.co/qHVYY4V0h3 https://t.co/SuRXO1cPho