added a submission named gambit-mage#60
Conversation
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): Gambit-mageRelease date: 2025-01-19 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 86.72 and a TPR of 61.85% at FPR=5% and 47.86% at FPR=1%. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
|
@yashasviraii thanks for the submission! Let me know at any point if you'd like to merge! |
|
@liamdugan Thanks for evaluating the first one! Could you evaluate this one too so I can compare results? Also, will these results be added to the leaderboard? |
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): Mage-baselineRelease date: 2024-05-21 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
|
@yashasviraii they will be added to the leaderboard if we merge the pull request. If you want only one of them to be added at a time then you can make a commit removing the other submission from this PR and add it as a separate PR. Let me know what you'd like to do |
No description provided.