Skip to content

Recurrence #9

@join-ville

Description

@join-ville

I trained COCO+LVIS SegFormerB0-S1 using the trainval_scripts/train_focalclickB0_S1_cclvs.sh with nothing changed.
And the val result is as follows:
图片1
compared with the model published:
图片2
I notice that the BS is 32 in paper, and it's 64 in the bash script. Is it the reason for the accuracy gap?

Also, there are two ways of data augmentation in the code and the paper says "During training, we only use flip and random resize with the scale from 0.75 to 1.4 as data augmentation." which one is used in the model published?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions