├── .gitignore
├── LICENSE
├── README.md
├── assets
    ├── introduction.gif
    └── overview.png
├── data
    └── GCG
    │   ├── llama2_behaviors.json
    │   └── vicuna_behaviors.json
├── lib
    ├── attacks.py
    ├── defenses.py
    ├── language_models.py
    ├── model_configs.py
    └── perturbations.py
├── main.py
├── smooth_llm.sh
└── sweep.sh

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/.gitignore
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/README.md
--------------------------------------------------------------------------------
/assets/introduction.gif:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/assets/introduction.gif
--------------------------------------------------------------------------------
/assets/overview.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/assets/overview.png
--------------------------------------------------------------------------------
/data/GCG/llama2_behaviors.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/data/GCG/llama2_behaviors.json
--------------------------------------------------------------------------------
/data/GCG/vicuna_behaviors.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/data/GCG/vicuna_behaviors.json
--------------------------------------------------------------------------------
/lib/attacks.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/lib/attacks.py
--------------------------------------------------------------------------------
/lib/defenses.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/lib/defenses.py
--------------------------------------------------------------------------------
/lib/language_models.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/lib/language_models.py
--------------------------------------------------------------------------------
/lib/model_configs.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/lib/model_configs.py
--------------------------------------------------------------------------------
/lib/perturbations.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/lib/perturbations.py
--------------------------------------------------------------------------------
/main.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/main.py
--------------------------------------------------------------------------------
/smooth_llm.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/smooth_llm.sh
--------------------------------------------------------------------------------
/sweep.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/arobey1/smooth-llm/HEAD/sweep.sh
--------------------------------------------------------------------------------