Skip to content

Missing documentation - heretic hardware requirements #89

@dagbdagb

Description

@dagbdagb

Hi.

2 things I struggle to figure out on my own:

gpt-oss-120b and its smaller sibling come with weights in mxfp4 format.
Can I use Ampere hardware (rtx 3090) to decensor gpt-oss-120b? Cuda arch 86, so no native support for mxfp4. Does that matter?

Also, what are the hard memory requirements for successfully decensoring gpt-oss-120b? Is there a trivial way to calculate that? I see #83 , but still fail to grasp if less memory just means the process takes longer, or if it will fail.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions