Page MenuHomeFreeBSD

pctrie: add combined insert/lookup operations
ClosedPublic

Authored by rlibby on May 28 2024, 7:39 PM.
Tags
None
Referenced Files
F107781932: D45394.id139196.diff
Sat, Jan 18, 4:07 AM
Unknown Object (File)
Nov 25 2024, 4:16 AM
Unknown Object (File)
Nov 24 2024, 8:36 PM
Unknown Object (File)
Nov 24 2024, 2:26 PM
Unknown Object (File)
Nov 23 2024, 11:05 PM
Unknown Object (File)
Nov 23 2024, 7:32 AM
Unknown Object (File)
Nov 23 2024, 1:04 AM
Unknown Object (File)
Nov 22 2024, 9:37 AM
Subscribers

Details

Summary

In several places in code, we do a pctrie lookup followed by a pctrie
insert. Provide a few flavors of combined lookup/insert. This may save
a portion of the work from walking a large pctrie twice.

The general idea is that while we walk the trie during insert, we also
do the same kind of tracking work that we do during pctrie_lookup_ge or
pctrie_lookup_le, and we pass out a pctrie node from where such a lookup
may continue.

Sponsored by: Dell EMC Isilon

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

This revision is now accepted and ready to land.May 29 2024, 12:50 PM

kib feedback on KASSERT style, and fix a comment typo

This revision now requires review to proceed.May 29 2024, 6:13 PM
sys/sys/pctrie.h
97

if __predict_false is useful for all the other ENOMEM tests, it's useful here too.

sys/sys/pctrie.h
97

I agree, I was thinking to do that in a separate commit, for more flexibility with MFC. I'll put that up for review tonight.

I can tell you that clang produced some code paths that were, let's say, non-optimal, without the predictions, when I examined the code gen for bgetvp.

This revision is now accepted and ready to land.May 29 2024, 7:39 PM
markj added inline comments.
sys/kern/subr_pctrie.c
270–283

... or NULL if the slot is already populated.

281

I'm not sure if it matters anymore, but at one point making mode const helped ensure that constant folding would happen during inlining.

313

Do you need this NULL check, in light of the lack of such checks for neighbor_out?

rlibby added inline comments.
sys/kern/subr_pctrie.c
270–283

I'll add some words to that effect.

281

I did look at output of in-tree clang on amd64 to check that it did the extraneous code removal. If I can remember how to get the cross build set up, I can check gcc too.

313

I believe you're right that I don't. Will remove.

I made the typed out param to name##_PCTRIE_FIND_OR_INSERT optional, but the double pointer to the index is never NULL.

rlibby marked 2 inline comments as done.

markj feedback: reword comment, remove unneeded NULL check

This revision now requires review to proceed.Jun 5 2024, 6:14 AM
markj added inline comments.
sys/kern/subr_pctrie.c
281

To build with gcc, install amd64-gcc13 and make buildkernel CROSS_TOOLCHAIN=amd64-gcc13.

This revision is now accepted and ready to land.Jun 5 2024, 1:58 PM
rlibby added inline comments.
sys/kern/subr_pctrie.c
281

Thanks. I checked that gcc13 also removes the extraneous code here.

However, over in D45486, the situation is a little different, clang only inlined vm_page_insert_after (which with the patch is now just a wrapper). Marking bool lookup as const didn't change behavior, but marking vm_page_insert_lookup as __always_inline instead of inline did. I'll comment on the other diff.

This revision was automatically updated to reflect the committed changes.
rlibby marked an inline comment as done.
sys/sys/pctrie.h
148

I'm concerned that this line does not do what is intended. While neighbor is a pointer to a pctrie_node, parentp is the (void*) cast of a pointer to an smr_pctnode_t, which can be used with pctrie_load_node to retrieve a pointer to a pctrie_node. So this test is never true, and the assignment to neighbor will never happen.

sys/sys/pctrie.h
148

Upon further consideration, I think it's good that this assignment never happens, because all it would do is back one level up the search path, so that the first step in the neighbor search would be back to the neighbor value that was just replaced. So I think it's a de-optimization, but not a bug.