
replaced AutoTokenizer/AutoModelForMaskedLM with #361

Merged: bputzeys merged 3 commits into main from use-python-files-directly on Mar 24, 2026

Conversation

bputzeys (Collaborator) commented Mar 24, 2026

Direct imports of NicheformerTokenizer/NicheformerForMaskedLM replace AutoTokenizer/AutoModelForMaskedLM, so trust_remote_code is no longer needed.

TODO: resolve the mismatch between the PAD and MASK token IDs:

tokenization_nicheformer.py -> PAD = 0 | MASK = 1
masking.py -> PAD = 1 | MASK = 0
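
A minimal sketch of how this kind of mismatch could be caught in a unit test. The dictionaries below are hypothetical stand-ins that only mirror the values listed above; the real modules may store these IDs under different names, and `find_mismatches` is an illustrative helper, not part of the codebase.

```python
# Hypothetical stand-ins for the special-token IDs listed in the TODO above.
# The actual constants in the two files may be named or stored differently.
TOKENIZER_IDS = {"PAD": 0, "MASK": 1}  # values reported for tokenization_nicheformer.py
MASKING_IDS = {"PAD": 1, "MASK": 0}    # values reported for masking.py


def find_mismatches(reference, other):
    """Return {token: (reference_id, other_id)} for each special token that differs."""
    return {
        name: (ref_id, other.get(name))
        for name, ref_id in reference.items()
        if other.get(name) != ref_id
    }


# With the values above, both PAD and MASK are reported as inverted.
mismatches = find_mismatches(TOKENIZER_IDS, MASKING_IDS)
print(mismatches)
```

A check like this would fail until both files agree on one mapping, which is one way to keep the tokenizer and the masking code from drifting apart again.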

direct imports of NicheformerTokenizer/NicheformerForMaskedLM, no trust_remote_code
bputzeys requested a review from dmiv-helical, March 24, 2026 12:30
bputzeys and others added 2 commits March 24, 2026 13:32
avoid this by using our local model location
1. PAD/MASK inversion in the tokeniser
2. Attention map - fixing wrong attribute
3. LICENSE fix - taking it from HF, not GH
4. Masking copy fix
dmiv-helical force-pushed the use-python-files-directly branch from c316e4e to 6177964, March 24, 2026 14:32
bputzeys marked this pull request as ready for review, March 24, 2026 16:19
bputzeys merged commit aa751e6 into main, Mar 24, 2026
8 of 9 checks passed
bputzeys deleted the use-python-files-directly branch, March 24, 2026 16:20
2 participants