- 🔭 I’m a Ph.D. student 👩‍🎓 at CISPA Helmholtz Center for Information Security, focusing on trustworthy machine learning and security.
- 🌱 I’m also a science fiction writer 🖨 and publish novels in Science Fiction World (《科幻世界》) and other venues.
- ⚡ I love reading 📖, handcrafting 🎨, RPGs 🎮, and anything creative. I'm trying to fall in love with fitness 🏃‍♀️, but it hasn't worked out yet 😪.
🧐
Pinned
- jailbreak_llms: [CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
- prompt-stealing-attack: [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models
- TrustAIRLab/GPTracker: [S&P'25] GPTracker: A Large-Scale Measurement of Misused GPTs
- TrustAIRLab/HateBench: [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
- TrustAIRLab/VoiceJailbreakAttack: Code for Voice Jailbreak Attacks Against GPT-4o.


