gh-143732: Add tier2 specialization for TO_BOOL#148271

Open
eendebakpt wants to merge 10 commits into python:main from eendebakpt:to_bool_specialization

Conversation

Contributor

@eendebakpt eendebakpt commented Apr 8, 2026

See discussion at #148113.

This PR adds two tier2 opcodes for specialization of TO_BOOL. The `*args` and `**kwargs` arguments are marked in the tier2 optimizer as tuple and dict, respectively.

This PR adds no additional type recording or tier1 opcodes; that is left to follow-up PRs.
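The type guarantees the optimizer relies on can be checked from pure Python (a minimal illustration, not part of the PR itself): CPython always packs excess positional arguments into a tuple and excess keyword arguments into a dict, so TO_BOOL on `args` or `kwargs` can always take the tuple/dict fast path.

```python
def probe(*args, **kwargs):
    # CPython guarantees these exact types, regardless of how
    # the function is called -- this is what lets the tier2
    # optimizer mark them as tuple and dict.
    return type(args), type(kwargs)

assert probe(1, 2, x=3) == (tuple, dict)
assert probe() == (tuple, dict)
```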

Benchmark                  main      branch
to_bool_dict_false         7.34 ms   7.37 ms: 1.00x slower
to_bool_bytes_true         10.7 ms   10.8 ms: 1.01x slower
to_bool_kwargs_nonempty    1.25 sec  911 ms: 1.37x faster
to_bool_kwargs_empty       949 ms    580 ms: 1.64x faster
to_bool_varargs_nonempty   1.23 sec  888 ms: 1.39x faster
to_bool_varargs_empty      943 ms    575 ms: 1.64x faster
Details
"""Benchmark for TO_BOOL specializations and kwargs type information.

Tests the JIT optimizer's ability to specialize TO_BOOL for:
- dict (truthiness checks)
- **kwargs dict type (known to be dict at the optimizer level, uses _TO_BOOL_DICT)
- *args tuple type (known to be tuple at the optimizer level, uses _TO_BOOL_SIZED)
"""

import pyperf


# --- TO_BOOL from dict ---

def to_bool_dict_true(n):
    d = {"a": 1}
    count = 0
    for _ in range(n):
        if d:
            count += 1
    return count


def to_bool_dict_false(n):
    d = {}
    count = 0
    for _ in range(n):
        if d:
            count += 1
    return count


# --- TO_BOOL from bytes (no tier1 specialization, uses generic _TO_BOOL) ---

def to_bool_bytes_true(n):
    b = b"hello"
    count = 0
    for _ in range(n):
        if b:
            count += 1
    return count


def to_bool_bytes_false(n):
    b = b""
    count = 0
    for _ in range(n):
        if b:
            count += 1
    return count


# --- TO_BOOL with **kwargs ---

def kwargs_to_bool_inner(**kwargs):
    """kwargs is guaranteed to be a dict by CPython."""
    count = 0
    for _ in range(200):
        if kwargs:
            count += 1
    return count


def to_bool_kwargs_nonempty(n):
    for _ in range(n):
        kwargs_to_bool_inner(x=1, y=2)


def to_bool_kwargs_empty(n):
    for _ in range(n):
        kwargs_to_bool_inner()


# --- TO_BOOL with *args (tuple, uses _TO_BOOL_SIZED) ---

def varargs_to_bool_inner(*args):
    """args is guaranteed to be a tuple by CPython."""
    count = 0
    for _ in range(200):
        if args:
            count += 1
    return count


def to_bool_varargs_nonempty(n):
    for _ in range(n):
        varargs_to_bool_inner(1, 2, 3)


def to_bool_varargs_empty(n):
    for _ in range(n):
        varargs_to_bool_inner()


# --- kwargs type used in dict operations ---

def kwargs_dict_ops_inner(**kwargs):
    """Test that kwargs is known to be dict for various operations."""
    total = 0
    for _ in range(200):
        total += len(kwargs)
        if "key" in kwargs:
            total += 1
    return total


def kwargs_dict_ops(n):
    for _ in range(n):
        kwargs_dict_ops_inner(key=42, other=99)


N = 500_000

runner = pyperf.Runner()

runner.bench_func("to_bool_dict_true", to_bool_dict_true, N)
runner.bench_func("to_bool_dict_false", to_bool_dict_false, N)
runner.bench_func("to_bool_bytes_true", to_bool_bytes_true, N)
runner.bench_func("to_bool_bytes_false", to_bool_bytes_false, N)
runner.bench_func("to_bool_kwargs_nonempty", to_bool_kwargs_nonempty, N)
runner.bench_func("to_bool_kwargs_empty", to_bool_kwargs_empty, N)
runner.bench_func("to_bool_varargs_nonempty", to_bool_varargs_nonempty, N)
runner.bench_func("to_bool_varargs_empty", to_bool_varargs_empty, N)
runner.bench_func("kwargs_dict_ops", kwargs_dict_ops, N)

@eendebakpt eendebakpt marked this pull request as ready for review April 8, 2026 22:21
_REPLACE_WITH_TRUE +
POP_TOP;

tier2 op(_TO_BOOL_DICT, (value -- res)) {
Contributor

You can merge this with _TO_BOOL_SIZED by using the fact that both do a fixed-offset lookup.
In the tier2 optimizer you can set the offset for where the size is stored, do size = *(Py_ssize_t *)((char *)obj + offset), and check that directly.
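The fixed-offset read the comment proposes can be illustrated from Python with ctypes (a sketch of the memory layout only; the real uop would do this in C, and `id()` returning the object address plus the header size from `object.__basicsize__` are CPython-specific assumptions):

```python
import ctypes

# In CPython, ob_size sits immediately after the PyObject header in
# any PyVarObject (tuple, list, bytes, ...). object.__basicsize__ is
# the header size, so this is the offset the uop would burn in.
OB_SIZE_OFFSET = object.__basicsize__

def var_size(obj):
    # Read the Py_ssize_t at (char *)obj + offset, as the merged
    # _TO_BOOL_SIZED uop would.
    return ctypes.c_ssize_t.from_address(id(obj) + OB_SIZE_OFFSET).value

assert var_size((1, 2, 3)) == 3
assert var_size(()) == 0
assert var_size(b"abc") == 3
```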

Contributor Author

I see what you mean. It goes into the internals of PyDict (doing manual offset calculations instead of using PyDict_GET_SIZE), and we also need to store the offset somewhere. So I think this is too much complication just to get rid of one tier2 opcode.

Contributor

we also need to store the offset somewhere.

You can store it in the instruction operand0.

REPLACE_OP(this_instr, _TO_BOOL_DICT, 0, 0);
}
else if (tp == &PyTuple_Type ||
tp == &PySet_Type ||
Contributor

This is incorrect for set, as it does not use PyObject_VAR_HEAD; it works by accident because `fill` sits at that offset, and `fill` gives the wrong answer when the set contains dummy entries.
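The accident the comment describes can be made visible with ctypes on a default CPython build (the field offsets below are a layout assumption: PySetObject stores `fill`, entries including dummies, and then `used`, live entries only, as the first two Py_ssize_t fields after the object header):

```python
import ctypes

HEADER = object.__basicsize__              # size of the PyObject header
SSIZE = ctypes.sizeof(ctypes.c_ssize_t)

def set_fill_used(s):
    # Layout assumption: fill then used directly after the header,
    # as in current default CPython builds.
    addr = id(s)
    fill = ctypes.c_ssize_t.from_address(addr + HEADER).value
    used = ctypes.c_ssize_t.from_address(addr + HEADER + SSIZE).value
    return fill, used

s = {1}
s.discard(1)                    # leaves a dummy entry behind
fill, used = set_fill_used(s)
assert (fill, used) == (1, 0)   # fill still counts the dummy
assert not s                    # truthiness must come from used, not fill
```

So a fixed-offset read at the "size" position would see 1 for this empty set, which is why set/frozenset need separate handling.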

Contributor Author

Good catch! I updated the PR to handle the set/frozenset separately.

We can also use your suggestion to fold everything into _TO_BOOL_SIZED. That means we have to load the offset at runtime (a minor cost), but it does keep the number of ops lower. I implemented this in main...eendebakpt:to_bool_specialization_v2.

Contributor

That means we have to load the offset at runtime (minor cost), but it does keep the number of ops lower.

I don't think so: in the JIT the offset would be burned into the machine code itself, so it is fixed rather than looked up at runtime.

Member

@markshannon left a comment

The problem with recording uops not being allowed after specializing uops has been fixed, so you can add a recording uop to _TO_BOOL and use the recorded information for better specialization.
#148285

}
}

op(_TO_BOOL_DICT, (value -- res)) {
Member

_TO_BOOL_DICT gets inserted by this pass, so this code will never be executed.
Same for _TO_BOOL_SIZED and _TO_BOOL_ANY_SET below.
