Trying luminar_classifier_RAID_none_PrismAI#52
Trying luminar_classifier_RAID_none_PrismAI#52TheItCrOw wants to merge 2 commits intoliamdugan:mainfrom
Conversation
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): e5-small-loraRelease date: 2024-11-07 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 96.85 and a TPR of 85.69% at FPR=5% and 73.08% at FPR=1%. luminar_classifier_RAID_none_PrismAIRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. LLMDetRelease date: 2023-05-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 62.90 and a TPR of 26.70% at FPR=5% and 14.91% at FPR=1%. LuminarRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. SpeedAIRelease date: 2025-05-08 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 99.85 and a TPR of 99.62% at FPR=5% and 98.55% at FPR=1%. It's AIRelease date: 2025-04-01 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.27 and a TPR of 94.15% at FPR=5% and 89.36% at FPR=1%. RoBERTa-base (GPT2)Release date: 2019-08-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 72.29 and a TPR of 51.77% at FPR=5% and 34.57% at FPR=1%. RoBERTa (ChatGPT)Release date: 2023-01-18 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 60.26 and a TPR of 26.64% at FPR=5% and 19.63% at FPR=1%. DesklibRelease date: 2024-10-03 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.91 and a TPR of 83.76% at FPR=5% and 68.22% at FPR=1%. RoBERTa-large (GPT2)Release date: 2019-08-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 69.28 and a TPR of 50.70% at FPR=5% and 34.67% at FPR=1%. SuperAnnotate AI DetectorRelease date: 2024-10-27 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 88.87 and a TPR of 64.87% at FPR=5% and 38.87% at FPR=1%. GLTRRelease date: 2019-06-10 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 70.90 and a TPR of 51.48% at FPR=5% and 36.48% at FPR=1%. Desklib AI Text Detector v1.01Release date: 2025-02-16 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.83 and a TPR of 91.17% at FPR=5% and 76.47% at FPR=1%. BinocularsRelease date: 2024-01-22 I've committed detailed results of this detector's performance on the test set to this PR. Warning No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated. RADARRelease date: 2023-07-07 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 81.92 and a TPR of 63.91% at FPR=5% and 43.12% at FPR=1%. Gaussian ExtremeRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. FastDetectGPTRelease date: 2023-10-08 I've committed detailed results of this detector's performance on the test set to this PR. Warning No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated. Warning No aggregate score across all non-adversarial settings is reported here as some domains/generator models/decoding strategies/repetition penalties were not included in the submission. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
After the evaluation changes, testing new luminar model instances. I'll probably add more.