Add ensemble_batch_size for single-device inference #906
Conversation
Code Review
This pull request implements ensemble batching for single-device predictions in TabPFN, enabling multiple compatible ensemble members to be processed in a single forward pass to improve performance. The changes introduce an ensemble_batch_size parameter across the API and update the inference engines to handle batched outputs. Feedback focuses on fixing a potential shape mismatch in embedding extraction, addressing an unused parameter in the on-demand engine, and standardizing telemetry timing for model execution.
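To make the batched forward pass concrete, here is a minimal PyTorch sketch of the idea (illustrative only: the toy model, tensor shapes, and chunking loop are assumptions, not the PR's actual engine code). Chunks of compatible ensemble inputs are stacked along a leading dimension so that one forward pass covers `ensemble_batch_size` members instead of one:

```python
import torch

# Toy stand-in for a per-member forward pass: one linear layer shared
# across members (real TabPFN members differ by preprocessing, so only
# "compatible" members can share a batch).
model = torch.nn.Linear(16, 4)
members = [torch.randn(32, 16) for _ in range(8)]  # 8 ensemble-member inputs
ensemble_batch_size = 4  # assumed semantics of the new parameter

# Unbatched: one forward pass per ensemble member.
outputs_loop = [model(x) for x in members]

# Batched: stack ensemble_batch_size members into a leading dimension
# and run a single forward pass per chunk.
outputs_batched = []
for i in range(0, len(members), ensemble_batch_size):
    chunk = torch.stack(members[i : i + ensemble_batch_size])  # (B, 32, 16)
    out = model(chunk)                                         # (B, 32, 4)
    outputs_batched.extend(out.unbind(0))

# Results match the member-by-member loop (up to float tolerance).
assert all(torch.allclose(a, b) for a, b in zip(outputs_loop, outputs_batched))
```

The speedup comes from amortizing per-call overhead (kernel launches, dispatch) across members, which matters most on single-device setups with enough memory to hold several members' activations at once.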
Force-pushed from 709dc67 to c65fda0
adrian-prior left a comment
Hey @randommm,
Thanks for setting this up, and apologies for taking so long to get back on this! I do have some open questions, which I left on the PR, and there are also a bunch of comments left by Gemini, of which I think many are valid. Would you mind going through them?
Force-pushed from e3643a9 to a37bc35
@adrian-prior I think it should all be fixed now.
Force-pushed from a37bc35 to a49e278
Issue
#905
Motivation and Context
On devices with large amounts of RAM, such as AMD's Strix Halo, batching ensemble members into a single forward pass can greatly speed up inference.
Public API Changes
Adds an ensemble_batch_size parameter that controls how many compatible ensemble members are processed per forward pass.
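A hypothetical usage sketch, assuming the new parameter is exposed on the estimator constructor alongside n_estimators (the exact parameter surface, defaults, and accepted values are assumptions based on this PR's description, not verified against the merged API):

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

from tabpfn import TabPFNClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# ensemble_batch_size (assumed signature from this PR) controls how many
# compatible ensemble members share one forward pass on a single device.
clf = TabPFNClassifier(n_estimators=8, ensemble_batch_size=8)
clf.fit(X_train, y_train)
print(clf.predict(X_test)[:5])
```

Setting ensemble_batch_size equal to n_estimators would trade peak memory for speed; smaller values would bound memory use on constrained devices.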
How Has This Been Tested?
Local testing.
Checklist
Changelog entry added (see changelog/README.md), or "no changelog needed" label requested.