about the the threshold

Hi, I am very interested in the method you proposed for detecting harmful data based on hidden layer states. I noticed in your paper that you used AUROC and AUPRC to evaluate the algorithm's detection performance. However, in practical application scenarios, a clear threshold is required to determine whether a sample is harmful. I would like to ask: how should this threshold be determined according to your method?

Additionally, do different LVLMs require different thresholds, and how should such thresholds be determined based on the specific model?

<img width="492" height="787" alt="Image" src="https://github.com/user-attachments/assets/8a21172d-86f7-4639-8079-a165a0319ffe" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about the the threshold #12

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

about the the threshold #12

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions