Skip to content
Change the repository type filter

All

    Repositories list

    • The homepage of SaFo Lab
      HTML
      MIT License
      0200Updated Nov 25, 2024Nov 25, 2024
    • The official implementation of the paper "InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models".
      Python
      MIT License
      32600Updated Nov 24, 2024Nov 24, 2024
    • The official implementation of our pre-print paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".
      Python
      Apache License 2.0
      3224530Updated Nov 17, 2024Nov 17, 2024
    • FIUBench

      Public
      A Task of Fictitious Unlearning for VLMs
      Jupyter Notebook
      1610Updated Nov 2, 2024Nov 2, 2024
    • List of T2I safety papers, updated daily, welcome to discuss using Discussions
      MIT License
      15000Updated Aug 12, 2024Aug 12, 2024
    • Dolphins

      Public
      [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“
      Python
      MIT License
      95120Updated Jul 17, 2024Jul 17, 2024
    • AdaShield

      Public
      [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
      Python
      14831Updated Jul 11, 2024Jul 11, 2024
    • .github

      Public
      Open codes from SaFoLab at University of Wisconsin–Madison
      0100Updated Jul 3, 2024Jul 3, 2024