ip/tcp: propagate peer close to spliced loopback writer by rafael2knokia · Pull Request #1 · rafael2knokia/9front

rafael2knokia · 2026-05-21T23:51:31Z

When two TCP conversations on the same kernel connect to each other on loopback, tcpincoming() splices them via tcpsplice(), installing tcpbypass() on each side's wq to copy blocks directly into the other side's rq. When one side closes, tcpsetstate(Closed) clears its own bypass but leaves the peer's wq with a kick that still points at us; the next user write reaches tcpbypass(), sees the missing peer and silently drops the block, and qbwrite() returns blocklen(b). The writer therefore sees writes "succeeding" forever and never observes the peer close.

Hang up the peer's wq from the Closed arm of tcpsetstate() so its next user write returns "connection closed" via qbwrite() instead of running the now-stale bypass.

Reproducer: misc/plan9/arm64/loopback-close-probe.go in the Go plan9-arm64 port (https://go-review.googlesource.com/c/go/+/719643) runs three scenarios and reports the writer Write() failing within a few milliseconds after the peer's Close(), as on Linux and the BSDs, instead of accepting 1 GiB of dropped data.

The same bug should exist in 9legacy since the splice logic predates the fork.

This is in preparation to x2APIC implementation, which extends the apicno to 32 bit integers. This also saves 2*255*8 ~ 4K for pointers in the data segment for pc64.

…posting)

git should listen to the user.

when creating a new refcount block, we should set its refcount to 1, rather than leaving it at the default of 0. This patch was adapted from the corresponding OpenBSD patch by chris@abditory.io·

The n argument to strncat dictates the maximum added bytes, not how many are stored + added.

previous logic was to always keep 1 block in our buffer once filled due to the flast flag requirement. This creates a potential overflow with multiple calls of byte input with 1 block and some change. Instead just keep [0, block] in the buffer. Simplifies since we only need to do something 'unusual' if we are exactly at a block boundary.

…lume write

Because there is some strange lockups on thinkpads when toggling brightness buttons when in legacy bios mode, x2APIC is not enabled by default unless ACPI tables explicitely list x2APIC's or *x2apic= boot parameter is specified.

Previous fix for x509 does not build in APE because strecpy() is not available. Rewrite it to use seprint() instead.

also fix hmac_sha2_384 and hmac_sha2_512.

No need to re-check *x2apic parameters for machno != 0. If we are not the bootstrap processor and lapicbase == nil then we have to enable x2APIC as well.

…able and add more error messages accessmbox() skips subsquent permission checks when it finds out that the directory is not writable/execable for other, which disallows the user from mailing themselves when they have permissions turned off for other. Also, introduce a d_nombox error for when the "mbox" directory doesn't exist in the mailbox, and a d_noperm for when upas/send's invoker can't write to "mbox"; this is clearer than the d_unknown catchall error used before.

…ge() (thanks umbraticus)

When two TCP conversations on the same kernel connect to each other on loopback, tcpincoming() splices them via tcpsplice(), installing tcpbypass() on each side's wq to copy blocks directly into the other side's rq. When one side closes, tcpsetstate(Closed) clears its own bypass but leaves the peer's wq with a kick that still points at us; the next user write reaches tcpbypass(), sees the missing peer and silently drops the block, and qbwrite() returns blocklen(b). The writer therefore sees writes "succeeding" forever and never observes the peer close. Hang up the peer's wq from the Closed arm of tcpsetstate() so its next user write returns "connection closed" via qbwrite() instead of running the now-stale bypass. Reproducer: misc/plan9/arm64/loopback-close-probe.go in the Go plan9-arm64 port (https://go-review.googlesource.com/c/go/+/719643) runs three scenarios and reports the writer Write() failing within a few milliseconds after the peer's Close(), as on Linux and the BSDs, instead of accepting 1 GiB of dropped data. The same bug should exist in 9legacy since the splice logic predates the fork.

c95f856 ("ip/tcp: propagate peer close to spliced loopback writer") clears the surviving end's bypass pointer and hangs up its wq so that a write into a now-dead splice partner fails fast. However, when that surviving end later runs tcpclose() the Established arm sees bypass==nil, falls through to the Syn_received clause and calls tcpoutput(), which sends a FIN to the peer's old (l, r)addr/port tuple. The peer is already iphtrem'd, so iphtlook() fails and we sndrst() back. Normally the RST harmlessly dies in iphtlook() too, but under SMP it can race against a concurrent active connect from another conv that just happens to be assigned the same ephemeral lport (1/32768 chance), and that fresh conv suddenly observes Econrefused on its first read - visible in Go's net.TestVariousDeadlines as a "Copy = 0, <nil>; want timeout" flake at sub-millisecond deadlines. Mark the surviving end with a new Tcpctl.bypeerclosed flag from tcpsetstate(Closed) before clearing its bypass (the order matters so that on TSO a tcpclose() seeing bypass==nil is guaranteed to also see bypeerclosed==1), and teach tcpclose() to localclose() locally instead of sending the doomed FIN. Verified with the new misc/plan9/amd64/guest-deadline-stress.rc harness in the Go plan9-arm64 port: 30 -count iterations of net.TestVariousDeadlines (= 1710 deadline cycles) on -smp 4 pass cleanly, where the previous patched kernel reproduced 5-6 failures.

cinaplenrek and others added 2 commits June 16, 2026 18:09

pc/pc64: use linked lists for lapics and ioapics instead of arrays

8dd09d2

This is in preparation to x2APIC implementation, which extends the apicno to 32 bit integers. This also saves 2*255*8 ~ 4K for pointers in the data segment for pc64.

libdraw: avoid temporary integer overflow in unitsperline() (thanks n…

f4dfa08

…posting)

rafael2knokia force-pushed the tcpsplice-close-fix branch from c6ad842 to 950c20d Compare June 17, 2026 19:15

oridb and others added 20 commits June 17, 2026 22:54

git/get: don't silently change protocols from http to https

6c76017

git should listen to the user.

git: don't leak pipe fd into spawned programs

75bd9e1

disk/qcowfs: set refcount of new refcount blocks to 1

8c6a6d3

when creating a new refcount block, we should set its refcount to 1, rather than leaving it at the default of 0. This patch was adapted from the corresponding OpenBSD patch by chris@abditory.io·

/sys/src: fix use of strncat

048f21a

The n argument to strncat dictates the maximum added bytes, not how many are stored + added.

kernel: devaudio: fix parsing of "in" and "out" ignored options to vo…

c771251

…lume write

pc/pc64: implement x2APIC support

7e0e23d

Because there is some strange lockups on thinkpads when toggling brightness buttons when in legacy bios mode, x2APIC is not enabled by default unless ACPI tables explicitely list x2APIC's or *x2apic= boot parameter is specified.

plan9.ini(8): fix *x2apic= description typo

141de64

libsec: avoid strecpy() and use seprint() instead (for APE)

080da19

Previous fix for x509 does not build in APE because strecpy() is not available. Rewrite it to use seprint() instead.

ape: libsec: sync sha3 changes

22b3a18

ape: libsec: reduce diff with main libsec.h

f56d81b

libsec: sha3 and sha2 BIGMAC hmac functions

98a89f1

also fix hmac_sha2_384 and hmac_sha2_512.

kernel: nuke devsdp and ip/esp

484c7ec

pc/pc64: use x2apic when cpu0 uses it

0f13f2c

No need to re-check *x2apic parameters for machno != 0. If we are not the bootstrap processor and lapicbase == nil then we have to enable x2APIC as well.

allocimage(2): document deprecation of dolock flag in (read|write)ima…

06368ca

…ge() (thanks umbraticus)

pc: vmx: fix potential buffer overflow in fpregs write from typo

0d2707d

kernel: fix dropped beagle config for removal of esp

8bf2c61

rafael2knokia force-pushed the tcpsplice-close-fix branch from 950c20d to 3a49407 Compare June 22, 2026 13:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ip/tcp: propagate peer close to spliced loopback writer#1

ip/tcp: propagate peer close to spliced loopback writer#1
rafael2knokia wants to merge 22 commits into
frontfrom
tcpsplice-close-fix

rafael2knokia commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

rafael2knokia commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants