Page MenuHomeFreeBSD

For the LinuxKPI allow loading the same DMA address multiple times without any prior unload
ClosedPublic

Authored by tychon on May 7 2019, 1:12 PM.
Tags
None
Referenced Files
Unknown Object (File)
Sun, Nov 10, 8:03 PM
Unknown Object (File)
Mon, Nov 4, 1:37 PM
Unknown Object (File)
Mon, Nov 4, 1:37 PM
Unknown Object (File)
Mon, Nov 4, 1:37 PM
Unknown Object (File)
Mon, Nov 4, 1:34 PM
Unknown Object (File)
Mon, Nov 4, 1:34 PM
Unknown Object (File)
Mon, Nov 4, 1:14 PM
Unknown Object (File)
Oct 4 2024, 3:22 PM

Details

Summary

As mentioned in D20097 it's possible to address the issue of path-compressed radix trie collisions caused by multiple DMA mappings of the same physical address as well as make the common of case bounce-without-bounce almost as fast as it was before. This is accomplished by exposing whether or not the effective bus_dma implementation will simply be using an identity mapping or not.

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

The code needs to work for all platforms!

I'll give this a spin and look for regressions in graphics land, but it might take a day or two. Can you add x11 as group reviewer?

I'll give this a spin and look for regressions in graphics land, but it might take a day or two. Can you add x11 as group reviewer?

Sounds good, thanks! I need to come up with a test plan for arm64 anyway.

The code needs to work for all platforms!

The easiest (or only ?) way to make it work is to define bus_dma_id_mapped() as returning false on all unimplemented platforms and use double-registration patch from hans.

sys/x86/x86/busdma_bounce.c
235 ↗(On Diff #57130)

This blank line is not needed.

237 ↗(On Diff #57130)

And this.

238 ↗(On Diff #57130)

So you can write

return (_bus_dmamap_pagesneeded(dmat, buf, buflen) == 0

and remove pagesneeded declaration at all.

524 ↗(On Diff #57130)

Style forbids initialization at locals declarations.

548 ↗(On Diff #57130)

There should be a blank line after '{' if no locals are declared.

sys/compat/linuxkpi/common/src/linux_pci.c
555–560 ↗(On Diff #57130)

So, this path should now be pretty cheap, with the pctrie expected to be empty in the common case for bounce. But is it worth considering if we can avoid the mutex entirely?

E.g. we could consider some kind of unlocked check of either pctrie_is_empty() or a new flag on priv indicating that we had ever inserted into the pctrie.

tychon edited the summary of this revision. (Show Details)

Add support for arm64 plus rework _bus_dmamap_pagesneeded to short-circuit if we don't need an exact count but just need to know if *any* are needed.

Also as suggested I added an unlocked check for pctrie_is_empty() in linux_dma_unmap(). If this falsely indicates that the tree is empty when an insertion is actually in progress that implies linux_dma_map_phys() and linux_dma_unmap() are racing which is going to yield bigger issues.

Running the latest version of this patch on top of r347562M.
Machine boots and graphics comes up no problems, this with the built in Intel graphics in i7-3520M.
I'll leave the machine running during the night, in case something happens.

kib added inline comments.
sys/compat/linuxkpi/common/src/linux_pci.c
577 ↗(On Diff #57362)

This of course cannot work. Esp. on architectures where IOMMU is required.

This revision is now accepted and ready to land.May 15 2019, 10:28 PM

I'm not qualified to review, but LGTM besides a couple nit picks.

sys/x86/x86/busdma_bounce.c
145 ↗(On Diff #57362)

Nit, it's spelled pagesneeded everywhere else.

234 ↗(On Diff #57362)

Nit, the return is bool now, so
return (!_bus_dmamap_pagesneeded(dmat, buf, buflen, NULL));
would seem more natural.

This and above apply to arm too.

This has been running fine during the night. From a graphics drivers no apparent regressions are visible, from light testing.

sys/compat/linuxkpi/common/src/linux_pci.c
523 ↗(On Diff #57362)

__aarch64__?

Swapped aarch64 for arm64 per emaste's suggestion and addressed rlibby's nits.

This revision now requires review to proceed.May 16 2019, 1:48 PM
sys/compat/linuxkpi/common/src/linux_pci.c
577 ↗(On Diff #57362)

No, of course this doesn't work on other architectures. The behavior there is same as it was before this effort began. With a bit of additional investigation, which I'm loath to make this change dependent on, I believe the other architectures can be enabled too. What I want to avoid is penalizing everyone with the reference counts. This approach of splitting the support -- where is known to work -- seemed the most sane way to move forward.

This revision was not accepted when it landed; it landed in state Needs Review.May 16 2019, 5:41 PM
This revision was automatically updated to reflect the committed changes.