multiscreen-pytorch is a Windows-native PyTorch reference implementation of a
Multiscreen language model in the style of the paper "Screening Is Enough".
This repository is intentionally scoped as a correctness-first reproduction scaffold:
- explicit `torch.nn.Module` model code
- reproducible training and evaluation artifacts
- a matched Transformer baseline
- deterministic ABCDigits-style retrieval evaluation
- long-context perplexity and latency scaffolding
uv python install 3.12.9
uv venv --python 3.12.9
uv sync --extra dev --extra hf --extra hf_qwen
uv run python scripts\env_check.py
uv run python scripts\train_multiscreen.py --steps 12 --output-dir artifacts\smoke_multiscreen
uv run python scripts\train_transformer_baseline.py --steps 12 --output-dir artifacts\smoke_transformer
uv run python scripts\eval_abcdigits.py --checkpoint artifacts\smoke_multiscreen\checkpoint.pt
uv run python scripts\eval_long_context.py --checkpoint artifacts\smoke_multiscreen\checkpoint.pt
uv run python scripts\benchmark_latency.py --checkpoint artifacts\smoke_multiscreen\checkpoint.pt
uv run python scripts\sweep_learning_rate.py --model-kind multiscreen
uv run python scripts\experiment_hf_qwen.py --mode inspect --model-path "H:\Qwen3.5-9B-official-hf"

- `multiscreen/config.py`: explicit configs
- `multiscreen/layers.py`: Screening unit, gating tile, TanhNorm
- `multiscreen/model.py`: Multiscreen LM and Transformer baseline
- `multiscreen/train.py`: deterministic training utilities
- `multiscreen/metrics.py`: perplexity, retrieval, and latency summaries
- `scripts/`: Windows-safe CLI entrypoints
- `scripts/experiment_hf_qwen.py`: local HF Qwen3.5 architecture inspection and inference experiments
- `tests/`: math, model, and smoke validation
- The reference path is pure PyTorch.
- Hugging Face integration is used only where it reduces glue.
- The code favors explicitness over performance shortcuts.
- For local Qwen3.5 experiments, this repo can inspect and optionally load `Qwen3_5ForConditionalGeneration` weights from a local HF directory.
Inspect the local model and tokenizer:
uv run python scripts\experiment_hf_qwen.py --mode inspect --model-path "H:\Qwen3.5-9B-official-hf"

Run text generation when a suitable CUDA torch build is available:
uv run python scripts\experiment_hf_qwen.py `
--mode generate `
--model-path "H:\Qwen3.5-9B-official-hf" `
--device auto `
--weight-load 4bit `
--prompt "Explain the difference between linear attention and screening attention."

Benchmark prompt throughput:
uv run python scripts\experiment_hf_qwen.py `
--mode benchmark `
--model-path "H:\Qwen3.5-9B-official-hf" `
--device auto `
--weight-load 4bit `
--prompt "Summarize the architecture in three bullets."