Add SafeTensors checkpoint loading support #950
Conversation
Code Review
This pull request introduces support for SafeTensors checkpoints, including a script to convert PyTorch checkpoints into .safetensors files with sidecar JSON metadata and updates to the model loading logic. The reviewer provided feedback on ensuring tensor contiguity during conversion to avoid serialization errors, explicitly setting loading parameters for legacy compatibility, and addressing potential cache invalidation issues when sidecar metadata is updated independently.
```python
            f"Expected all state_dict values to be tensors. "
            f"Key {key!r} has type {type(value).__name__}."
        )
    tensors[key] = value.detach().cpu()
```
The safetensors.torch.save_file function requires tensors to be contiguous. If the state_dict contains any views or non-contiguous tensors (which can happen depending on how the model was saved or manipulated), the conversion will fail with a RuntimeError. Adding .contiguous() ensures the tensors are in the correct format for serialization.
```diff
-    tensors[key] = value.detach().cpu()
+    tensors[key] = value.detach().cpu().contiguous()
```
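The failure mode is easy to reproduce: transposing a 2-D tensor yields a non-contiguous view, which `safetensors.torch.save_file` rejects, while `.contiguous()` materializes it into a fresh buffer. A minimal sketch (tensor names are illustrative, not from the PR):

```python
import torch

# A transposed 2-D tensor is a view with a non-contiguous memory layout,
# exactly the kind of value safetensors.torch.save_file refuses to write.
weight = torch.randn(4, 8)
state_dict = {"linear.weight": weight.t()}  # non-contiguous view
assert not state_dict["linear.weight"].is_contiguous()

# Appending .contiguous() to the existing .detach().cpu() chain copies
# each view into a serialization-friendly buffer without changing values.
tensors = {
    key: value.detach().cpu().contiguous()
    for key, value in state_dict.items()
}

assert tensors["linear.weight"].is_contiguous()
assert torch.equal(tensors["linear.weight"], weight.t())
```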
```python
    output_metadata: Path,
) -> None:
    """Convert a TabPFN checkpoint into SafeTensors plus JSON metadata."""
    checkpoint = torch.load(input_checkpoint, map_location="cpu", weights_only=None)
```
Using weights_only=None (the current default) triggers a FutureWarning in recent PyTorch versions and will eventually default to True. While weights_only=True is safer, it can fail when loading legacy checkpoints that contain complex metadata types. Since this script is specifically intended to convert existing pickle-based checkpoints, it is better to explicitly set weights_only=False to ensure compatibility and silence the warning.
```diff
-    checkpoint = torch.load(input_checkpoint, map_location="cpu", weights_only=None)
+    checkpoint = torch.load(input_checkpoint, map_location="cpu", weights_only=False)
```
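The distinction matters in practice: `weights_only=True` restricts unpickling to an allowlist of tensors and primitive containers, so legacy checkpoints carrying arbitrary Python objects fail to load. A small sketch with a hypothetical `LegacyConfig` class standing in for such metadata:

```python
import os
import tempfile

import torch


class LegacyConfig:
    """Stand-in for a complex metadata type found in old pickle checkpoints."""

    def __init__(self, name: str) -> None:
        self.name = name


tmpdir = tempfile.mkdtemp()
path = os.path.join(tmpdir, "legacy.ckpt")
torch.save({"w": torch.zeros(2, 2), "config": LegacyConfig("tabpfn")}, path)

# Explicit weights_only=False: full pickle loading, and no FutureWarning
# from relying on the (changing) default.
loaded = torch.load(path, map_location="cpu", weights_only=False)
assert loaded["config"].name == "tabpfn"

# weights_only=True rejects arbitrary classes such as LegacyConfig.
try:
    torch.load(path, map_location="cpu", weights_only=True)
    safe_load_failed = False
except Exception:  # torch raises an unpickling error for disallowed globals
    safe_load_failed = True
assert safe_load_failed
```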
```python
    if checkpoint_path.suffix == ".safetensors":
        from tabpfn.safetensors_checkpoint import load_safetensors_checkpoint

        return load_safetensors_checkpoint(checkpoint_path)
```
The current caching logic in `_load_checkpoint_cached` (which uses `_file_identity`) only tracks the primary checkpoint file. For SafeTensors checkpoints, this means that if the sidecar `.non_tensor_metadata.json` file is updated but the `.safetensors` file remains unchanged, the cache will not be invalidated, and stale metadata will be returned from the LRU cache.
While `_file_identity` is not modified in this PR, its implementation should be updated to include the metadata file's stats when a `.safetensors` path is provided, to ensure cache consistency for this new loading mechanism.
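One way this could look (a hypothetical sketch, not the PR's actual `_file_identity` implementation, and the sidecar naming is assumed from the PR description):

```python
import json
import tempfile
from pathlib import Path


def _file_identity(path: Path) -> tuple:
    # Sketch of a cache key that also covers the sidecar metadata file,
    # so that editing only the JSON invalidates the LRU cache entry.
    def stat_key(p: Path) -> tuple:
        st = p.stat()
        return (str(p), st.st_size, st.st_mtime_ns)

    key = stat_key(path)
    if path.suffix == ".safetensors":
        sidecar = path.with_suffix(".non_tensor_metadata.json")
        if sidecar.exists():
            key += stat_key(sidecar)
    return key


tmp = Path(tempfile.mkdtemp())
ckpt = tmp / "model.safetensors"
sidecar = tmp / "model.non_tensor_metadata.json"
ckpt.write_bytes(b"\x00" * 8)                    # stand-in tensor file
sidecar.write_text(json.dumps({"version": 1}))

before = _file_identity(ckpt)
sidecar.write_text(json.dumps({"version": 2, "extra": True}))  # metadata-only edit
after = _file_identity(ckpt)

# The .safetensors file is untouched, yet the identity changes because
# the sidecar's size changed, so the cached entry would be refreshed.
assert before != after
```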
Summary
This PR adds optional SafeTensors checkpoint loading support for TabPFN model checkpoints.
It introduces:

- Loading `.safetensors` checkpoints with sidecar non-tensor metadata
- A conversion script that splits `.ckpt` checkpoints into:
  - `.safetensors` tensor weights
  - `.non_tensor_metadata.json` sidecar metadata
- `safetensors` as a runtime dependency

Motivation
Current checkpoint loading relies on `torch.load(..., weights_only=None)`, which requires pickle-based loading for checkpoint metadata. SafeTensors provides a safer tensor serialization format, but TabPFN checkpoints also contain non-tensor fields such as architecture config and inference config. This implementation keeps tensor weights in SafeTensors and stores the required non-tensor checkpoint metadata in a sidecar JSON file.
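The tensor/non-tensor split can be sketched as follows (field names and values are illustrative, not taken from actual TabPFN checkpoints; the tensors dict would then be passed to `safetensors.torch.save_file`):

```python
import json

import torch

# Illustrative checkpoint shape: weights plus non-tensor config fields.
checkpoint = {
    "state_dict": {"encoder.weight": torch.randn(3, 3)},
    "architecture_config": {"n_layers": 12, "emsize": 512},  # assumed values
    "inference_config": {"softmax_temperature": 0.9},        # assumed values
}

# Tensors: detached, on CPU, contiguous — ready for save_file.
tensors = {
    key: value.detach().cpu().contiguous()
    for key, value in checkpoint["state_dict"].items()
}

# Non-tensor metadata: everything else, serialized as plain JSON for the
# .non_tensor_metadata.json sidecar.
non_tensor = {k: v for k, v in checkpoint.items() if k != "state_dict"}
sidecar_json = json.dumps(non_tensor, indent=2)

assert set(tensors) == {"encoder.weight"}
assert json.loads(sidecar_json)["architecture_config"]["n_layers"] == 12
```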
Notes
This PR does not include model weights, converted `.safetensors` files, `.ckpt` files, or generated metadata files. The conversion utility is intended for developer or maintainer use and preserves the existing checkpoint structure expected by the model-loading code.