Page MenuHomeFreeBSD

vm_phys: fix freelist_contig
ClosedPublic

Authored by dougm on Nov 8 2023, 5:15 PM.
Tags
None
Referenced Files
F108310771: D42509.id129966.diff
Thu, Jan 23, 6:41 PM
F108308595: D42509.id130024.diff
Thu, Jan 23, 6:24 PM
Unknown Object (File)
Sat, Jan 18, 9:48 PM
Unknown Object (File)
Fri, Jan 17, 5:49 AM
Unknown Object (File)
Sun, Jan 12, 11:09 PM
Unknown Object (File)
Thu, Jan 9, 6:42 PM
Unknown Object (File)
Thu, Jan 9, 6:12 PM
Unknown Object (File)
Thu, Jan 9, 6:07 PM
Subscribers

Details

Summary

vm_phys_find_freelist_contig is called to search a list of max-sized free page blocks and find one that, when joined with adjacent blocks in memory, can satisfy a request for a memory allocation bigger than any single max-sized free page block. In commit fa8a6585c7522b7de6d29802967bd5eba2f2dcf1, I defined this function in order to offer two improvements: 1) reduce the worst-case search time, and 2) allow solutions that include less-than max-sized free page blocks at the front or back of the giant allocation. However, it turns out that this change introduced an error, reported in In Bug 274592. That error concerns failing to check segment boundaries. This change fixes an error in vm_phys_find_freelist_config that resolves that bug. It also abandons improvement 2), because the value of that improvement is small and because preserving it would require more testing than I am able to do.

Test Plan

The reporter of Bug 274592 reports that, for a test that previously failed about half the time, now 10-12 consecutive tests succeed.

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

dougm requested review of this revision.Nov 8 2023, 5:15 PM
dougm created this revision.

Don't modify m_ret in the loop, in case iteration happens and that breaks FOREACH. Save the potential return value in m_val instead.

Could you please add some assertions to vm_phys_alloc_contig() to verify that the returned range, if any, satisfies the constraints?

sys/vm/vm_phys.c
1363
1381

or m_ret->phys_addr.

1391–1393
1403–1405

There's no reason to expect this condition to be true in general, so "verify" sounds a bit misleading.

1414–1419

The loop variable is called m_ret, but it is never returned.

1416

Rather than advancing pa for each blocking, translating it to a vm_page, and then checking its order, would it be marginally more efficient to simply advance a vm_page pointer by adding 1 << (VM_NFREEORDER - 1) to it in each iteration?

Add a missing assignment.

Re-enter an iterator for checking empty blocks.

sys/vm/vm_phys.c
1397

This assumes that there is an initialized page structure at m_ret + npages. That is not a valid assumption. You still need "pa" to perform the bounds checks.

sys/vm/vm_phys.c
960

This is asking for trouble from someone who doesn't carefully read all of the code.

sys/vm/vm_phys.c
1402

This really ought to have a comment saying that it ensures that the loop that follows is only looking at legitimate vm_page structures.

sys/vm/vm_phys.c
1555

m_run + npages could be outside the segment containing m_run.

Fix an assertion identified by @markj.

We may want to issue this as an errata for 14.0, so can we have a minimal version without style changes?

sys/vm/vm_phys.c
1555
1557

Use m_run->segind instead to compute this faster.

1560

Apply @alc suggestions. Remove some inessential parts.

sys/vm/vm_phys.c
1533

This assertion is redundant. We know that the domain is locked, and we are still operating on the same domain.

1558

The NULL check is now pointless.

As @alc suggests, remove a redundant assertion (aren't they all redundant?), and a pointless NULL check, and a comment which in no way improved performance.

sys/vm/vm_phys.c
1395–1400

I don't see the point of introducing a new variable, m_ret,

1416

The semi-colon at the end of the line is too easy to overlook.

sys/vm/vm_phys.c
1395–1400

Suppose that m_ret is advanced in line 1400, and then that the FOREACH loop iteration ends with 'continue', rather than 'return'. Then the TAILQ_FOREACH advances m to the next item in the list. If there is no m_ret, and m was instead advanced in line 1400, where will the TAILQ_FOREACH take us next? Maybe it'll skip way ahead in the list. Maybe it'll skip back, so that we have an infinite loop.

For @alc, change a semicolon to empty braces.

Fix a comment, suggested by @alc.

sys/vm/vm_phys.c
1363

This comment is not currently very descriptive of what this function does.

1379–1380

This describes what the code does, but not why. Specifically, why do we care that the preceding block is free?

1398

I don't see how 'size' is correct here.

sys/vm/vm_phys.c
1398

You're relying on size being greater than max_size. Okay.

sys/vm/vm_phys.c
1374–1377

I would KASSERT(size >= max_size, ...

sys/vm/vm_phys.c
1374–1377

Actually, strictly greater than.

I'm convinced that it's correct. That said, even the person who reported the problem is probably not exercising this while loop:

while (!vm_addr_ok(VM_PAGE_TO_PHYS(m_ret),
    size, alignment, boundary) &&
    ...
sys/vm/vm_phys.c
1404

Add assertion. Add/update some comments.

sys/vm/vm_phys.c
1387

Edit a couple of comments.

sys/vm/vm_phys.c
1407–1408

Is this check inverted? Certainly the ! looks wrong.

1561
This revision is now accepted and ready to land.Nov 13 2023, 5:39 PM
This revision was automatically updated to reflect the committed changes.