
Moderating Illicit Online Image Promotion for UGCGs

This repository provides the data and code for the paper Moderating Illicit Online Image Promotion for Unsafe User Generated Content Games Using Large Vision-Language Models, accepted at the 33rd USENIX Security Symposium (USENIX Security 2024).

UGCG Image Dataset

Caution

Please fill out the consent form and email us to request access to our data. In the meantime, please submit an application through Hugging Face; we will grant you access to the data after reviewing your request.
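
Once access is granted, the dataset can be loaded programmatically. The snippet below is a minimal sketch of ours, not part of this repository; the dataset id is a placeholder, so substitute the one named in your approval:

# Hedged sketch: load the gated dataset with the Hugging Face `datasets` library.
# "UBSec/UGCG-Dataset" is a placeholder id, not the actual dataset name.
from datasets import load_dataset

ds = load_dataset("UBSec/UGCG-Dataset", token=True)  # token=True reuses your saved HF login
print(ds)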


UGCG-Guard

[Figure: UGCG-Guard framework overview]

Our framework can be integrated with popular large vision-language models such as InstructBLIP, mPLUG, GPT-4, and Gemini. Most of the results presented in the paper were generated with InstructBLIP.

We provide some sample code for reference; feel free to explore the sample_code directory.
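
For orientation, here is a minimal sketch (ours, separate from the provided sample code) of querying InstructBLIP with a single CoT question via Hugging Face transformers; the model id, image path, and question are assumptions for illustration:

# Hedged sketch: one InstructBLIP query per CoT question, via transformers.
import torch
from PIL import Image
from transformers import InstructBlipProcessor, InstructBlipForConditionalGeneration

processor = InstructBlipProcessor.from_pretrained("Salesforce/instructblip-vicuna-7b")
model = InstructBlipForConditionalGeneration.from_pretrained(
    "Salesforce/instructblip-vicuna-7b", torch_dtype=torch.float16
).to("cuda")

image = Image.open("test_sample.png").convert("RGB")  # hypothetical test image
question = "Are there any characters or avatars in the image?"  # CoT step Q1

inputs = processor(images=image, text=question, return_tensors="pt").to("cuda", torch.float16)
outputs = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(outputs, skip_special_tokens=True)[0].strip())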

UGCG-CoT


The prompt for our UGCG-CoT is shown below; replace `{input_image}` with your test sample. Note that Q5 and the formatting prompt may not work well with InstructBLIP; you can remove them and derive the final decision manually (see the sketch after the prompt).
cot_prompt = """
{input_image}
This is an image generated from a role-playing game. View the given image and carefully answer the questions in each step to conclude whether it is an unsafe image or not:
Q1: Are there any characters or avatars in the image?
Q2A: Are the characters or avatars naked?
Q2B: Are there anuses, female breasts, female genitals, or male genitals exposed in this image?
Q2C: Are there characters or avatars indulging in any sexual activities?
Q3A: Are there any knives, guns, or deadly weapons presented in this image?
Q3B: Are the characters or avatars indulging in any violent activities, such as murder, suicide, and torture?
Q4: Are the characters or avatars indulging in any other harmful activities, such as smoking, self-harming, or using drugs?
Q5: If any of your answers for Q2A, Q2B, Q2C, Q3A, Q3B, Q4 is yes, please label the image as "unsafe". Otherwise, please label the image as "safe".
Now, tell me your label directly.
"""

Baselines


The baseline models we compare against:

Unsafe image detectors: Clarifai, Yahoo OpenNSFW, Amazon Rekognition, Microsoft Azure, Google Vision AI

Object detection tools: YOLO, SSD, Faster R-CNN, Google Vision AI

Large VLM with the general prompt:

general_prompt = """
{input_image}
Is this image an unsafe image? 
An unsafe image may contain graphic violence, pornography, profanity, nudity, slurs, or other potentially disturbing subject matter.
"""

Results

[Figure: UGCG-Guard vs. baselines]

[Figure: UGCG-Guard vs. SOTA object detection tools]

More results can be found in our paper.

Cite

If you find this helpful, please cite our work:

@inproceedings{299571,
  author    = {Keyan Guo and Ayush Utkarsh and Wenbo Ding and Isabelle Ondracek and Ziming Zhao and Guo Freeman and Nishant Vishwamitra and Hongxin Hu},
  title     = {{Moderating Illicit Online Image Promotion for Unsafe User Generated Content Games Using Large Vision-Language Models}},
  booktitle = {33rd USENIX Security Symposium (USENIX Security 24)},
  year      = {2024},
  isbn      = {978-1-939133-44-1},
  address   = {Philadelphia, PA},
  pages     = {5787--5804},
  url       = {https://www.usenix.org/conference/usenixsecurity24/presentation/guo-keyan},
  publisher = {USENIX Association},
  month     = aug
}
