vmxnet3: make descriptor count checks more robust
Closed, Public

Authored by kp on Feb 2 2024, 5:03 PM.

Details

Reviewers

bryanv
pkelsey

Group Reviewers

network
pfsense

Commits

rG3ff0dc1af85e: vmxnet3: make descriptor count checks more robust

Summary

When we update credits there is a potential for a race causing an
overflow of vxcr_next (i.e. incrementing it past vxcr_ndesc). Change the
check to >= rather than == to be more robust against this.
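
To illustrate the difference between the two checks, here is a minimal userspace sketch. It is not the driver code: `struct comp_ring`, `advance_eq()` and `advance_ge()` are hypothetical names, and the generation flip on wrap is an assumption made for the example. Only `next` and `ndesc` correspond to the fields named above (`vxcr_next` and `vxcr_ndesc`).

```c
#include <stdint.h>

/*
 * Hypothetical stand-in for a completion ring. Only the fields needed
 * for the example are modeled.
 */
struct comp_ring {
	uint32_t next;	/* plays the role of vxcr_next */
	uint32_t ndesc;	/* plays the role of vxcr_ndesc */
	uint32_t gen;	/* generation bit, flipped on wrap (assumed) */
};

/* Strict check: wraps only if next lands exactly on ndesc. */
static void
advance_eq(struct comp_ring *r)
{
	if (++r->next == r->ndesc) {
		r->next = 0;
		r->gen ^= 1;
	}
}

/* Robust check: also wraps if next has already overshot ndesc. */
static void
advance_ge(struct comp_ring *r)
{
	if (++r->next >= r->ndesc) {
		r->next = 0;
		r->gen ^= 1;
	}
}
```

If `next` has already reached `ndesc` (for example after a racing increment), `advance_eq()` steps past the end and keeps going, while `advance_ge()` brings the index back to 0 on the next call.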

Sponsored by: Rubicon Communications, LLC ("Netgate")

Diff Detail

Repository

rG FreeBSD src repository

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

kp created this revision. Feb 2 2024, 5:03 PM

Herald added a subscriber: imp. Feb 2 2024, 5:03 PM

kp requested review of this revision. Feb 2 2024, 5:03 PM

Harbormaster completed remote builds in B55761: Diff 133760. Feb 2 2024, 5:03 PM

Specifically because we've seen users report panics like this one:

db:0:kdb.enter.default>  bt
Tracing pid 11 tid 100007 td 0xfffffe001ebbe720
kdb_enter() at kdb_enter+0x32/frame 0xfffffe00c56849c0
vpanic() at vpanic+0x183/frame 0xfffffe00c5684a10
panic() at panic+0x43/frame 0xfffffe00c5684a70
trap_fatal() at trap_fatal+0x409/frame 0xfffffe00c5684ad0
trap_pfault() at trap_pfault+0x4f/frame 0xfffffe00c5684b30
calltrap() at calltrap+0x8/frame 0xfffffe00c5684b30
--- trap 0xc, rip = 0xffffffff80b05c80, rsp = 0xfffffe00c5684c00, rbp = 0xfffffe00c5684c00 ---
vmxnet3_isc_txd_credits_update() at vmxnet3_isc_txd_credits_update+0x20/frame 0xfffffe00c5684c00
iflib_fast_intr_rxtx() at iflib_fast_intr_rxtx+0xf7/frame 0xfffffe00c5684c60
intr_event_handle() at intr_event_handle+0x123/frame 0xfffffe00c5684cd0
intr_execute_handlers() at intr_execute_handlers+0x4a/frame 0xfffffe00c5684d00
Xapic_isr1() at Xapic_isr1+0xdc/frame 0xfffffe00c5684d00
--- interrupt, rip = 0xffffffff8125b026, rsp = 0xfffffe00c5684dd0, rbp = 0xfffffe00c5684dd0 ---

zlei added a subscriber: zlei. Feb 4 2024, 4:12 AM

zlei added inline comments.

sys/dev/vmware/vmxnet3/if_vmx.c
1432–1433	I'm not familiar with IFLIB. From the driver code it looks like only this function, `vmxnet3_isc_txd_credits_update()`, can increase `txc->vxcr_next`. So if `++txc->vxcr_next > txc->vxcr_ndesc` happens, then I guess `vmxnet3_isc_txd_credits_update()` is being called by multiple threads concurrently. If that is the desired behavior, we probably want an atomic increment of `txc->vxcr_next`.
1484	The `idx` is on the local stack, so if `++idx > rxc->vxcr_ndesc` happens, it actually hints that the passed-in `idx` is wrong, since `&rxc->vxcr_u.rxcd[idx]` would lead to an OOB access. Hence I'd prefer a KASSERT or panic right before the for loop, rather than HIDING the real problem and letting the driver code just WORK: `KASSERT(idx < rxc->vxcr_ndesc);`
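
As a rough illustration of the atomic-increment idea raised for lines 1432–1433, the sketch below advances a ring index with a compare-and-swap loop, so that concurrent callers can never store a value at or beyond `ndesc`. It uses C11 `<stdatomic.h>` in userspace; kernel code would use atomic(9) primitives instead. `ring_advance_atomic()` is a hypothetical helper, not something in the driver. It covers only the index update, not the generation bit or the descriptor processing around it.

```c
#include <stdatomic.h>
#include <stdint.h>

/*
 * Advance *next by one with wrap-around at ndesc, atomically.
 * Returns the value that was stored.
 */
static uint32_t
ring_advance_atomic(_Atomic uint32_t *next, uint32_t ndesc)
{
	uint32_t old = atomic_load(next);
	uint32_t new;

	do {
		/* Wrap whenever the incremented value would leave the ring. */
		new = (old + 1 >= ndesc) ? 0 : old + 1;
		/* On failure, old is reloaded with the current value. */
	} while (!atomic_compare_exchange_weak(next, &old, new));

	return (new);
}
```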

kp added inline comments. Feb 4 2024, 5:45 PM

sys/dev/vmware/vmxnet3/if_vmx.c
1432–1433	That's my understanding as well, yes. I am insufficiently familiar with this driver to do more than fix the immediate problem. I'm hoping that the maintainers and/or original authors will look at fixing the fundamental problem. In the meantime, this workaround should stop the panics we see.
1484	We could do both while we wait for those familiar with this code to fix it fully. That would mean that non-debug kernels will work, and debug kernels will assert and demonstrate the problem. Although that's really only for the `++txc->vxcr_next >= txc->vxcr_ndesc` case, because it's indeed a local variable here and there's no way for that to race.
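
The "do both" idea can be sketched in userspace as follows. This is an illustration under stated assumptions, not the committed change: `RING_ASSERT` is a hypothetical stand-in for the kernel's `KASSERT`, `RING_DEBUG` is a hypothetical build flag standing in for a debug kernel, and `struct comp_ring` models only the index and the descriptor count.

```c
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical debug-only assertion, loosely modeled on KASSERT. */
#ifdef RING_DEBUG
#define RING_ASSERT(cond, msg) do {					\
	if (!(cond)) {							\
		fprintf(stderr, "assertion failed: %s\n", (msg));	\
		abort();						\
	}								\
} while (0)
#else
#define RING_ASSERT(cond, msg) ((void)0)
#endif

struct comp_ring {
	uint32_t next;	/* plays the role of vxcr_next */
	uint32_t ndesc;	/* plays the role of vxcr_ndesc */
};

static void
ring_advance(struct comp_ring *r)
{
	/* Debug builds stop here and expose the underlying bug. */
	RING_ASSERT(r->next < r->ndesc, "ring index out of range");

	/* Non-debug builds recover by wrapping on overshoot as well. */
	if (++r->next >= r->ndesc)
		r->next = 0;
}
```

Built without `RING_DEBUG`, an out-of-range index is silently pulled back to 0; built with it, the same condition aborts at the assertion.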

add assert
remove unneeded changes

Harbormaster completed remote builds in B55797: Diff 133840. Feb 4 2024, 5:47 PM

LGTM

This revision was not accepted when it landed; it landed in state Needs Review. Jun 10 2024, 9:06 AM

Closed by commit rG3ff0dc1af85e: vmxnet3: make descriptor count checks more robust (authored by kp).

This revision was automatically updated to reflect the committed changes.

kp added a commit: rG3ff0dc1af85e: vmxnet3: make descriptor count checks more robust.

avg added a subscriber: avg. Jan 14 2025, 10:43 AM

avg added inline comments.

sys/dev/vmware/vmxnet3/if_vmx.c
1425–1426	Dereferencing `txcd` may result in a crash if `vxcr_next` happens to be `>= txc->vxcr_ndesc` at this point of execution because of the same race.

I think that vmxnet3_isc_txd_credits_update can be made similar to, e.g., igb_isc_txd_credits_update with respect to the array index (tx_rs_cidx and vxcr_next respectively).
That is, we can stash vxcr_next into a local variable and then use the local variable for iteration.
At the end, we can update vxcr_next from the local variable.
This won't fix other potential issues with correctness / concurrency (data races), but at least it will ensure that vxcr_next will not have an out-of-bounds value.
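
A minimal userspace sketch of that local-index approach is shown below. The types and the function `credits_update()` are hypothetical and only loosely modeled on the description above; the generation-bit comparison and the early return when `clear` is false are assumptions for the example. The point is that the shared `next` field is read once, never incremented in place, and written back only with a value that has already been wrapped.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical completion descriptor: only the generation bit. */
struct comp_desc {
	uint32_t gen;
};

/* Hypothetical completion ring. */
struct comp_ring {
	struct comp_desc *descs;
	uint32_t next;	/* plays the role of vxcr_next */
	uint32_t ndesc;	/* plays the role of vxcr_ndesc */
	uint32_t gen;	/* expected generation bit */
};

static int
credits_update(struct comp_ring *r, bool clear)
{
	uint32_t idx = r->next;	/* stash the shared index locally */
	uint32_t gen = r->gen;
	int processed = 0;

	while (r->descs[idx].gen == gen) {
		if (!clear)
			return (1);	/* work is pending; state untouched */
		if (++idx >= r->ndesc) {
			idx = 0;	/* wrap the local copy only */
			gen ^= 1;
		}
		processed++;
	}

	/* Publish once; the stored index is always below ndesc. */
	r->next = idx;
	r->gen = gen;
	return (processed);
}
```

As noted above, this does not address data races on the ring itself; two concurrent callers could still process the same descriptors.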

Revision Contents

Path	Size
sys/dev/vmware/vmxnet3/if_vmx.c	3 lines

Diff 139678