
Organizations

@TrustAIRLab

verazuo/README.md

Here is Vera! 👋

About Me

  • 🔭 I’m a Ph.D. student 👩‍🎓 at CISPA Helmholtz Center for Information Security, focused on Trustworthy Machine Learning Security.

  • 🌱 I’m also a science fiction writer 🖨 and publish novels in Science Fiction World (《科幻世界》) and elsewhere.

  • ⚡ I love reading 📖, handcrafting 🎨, RPG games 🎮, and anything creative. I'm trying to fall in love with fitness 🏃‍♀️, but it hasn't worked out yet 😪.

Pinned

  1. jailbreak_llms Public

    [CCS'24] A dataset of 15,140 ChatGPT prompts collected from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

    Jupyter Notebook 3.5k 317

  2. prompt-stealing-attack Public

    [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models

    Python 48 8

  3. TrustAIRLab/GPTracker Public

    [S&P'25] GPTracker: A Large-Scale Measurement of Misused GPTs

    Python 10 1

  4. TrustAIRLab/HateBench Public

    [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

    13 3

  5. xinleihe/MGTBench Public

    Python 161 16

  6. TrustAIRLab/VoiceJailbreakAttack Public

    Code for "Voice Jailbreak Attacks Against GPT-4o".

    Python 36 1