Wonbeom Jang
moderation-bypass
an archive of posts with this tag
May 29, 2026
Covert Malicious Finetuning: an attack in which all of the training data looks harmless