Page MenuHomeFreeBSD

riscv: small counter(9) improvements
ClosedPublic

Authored by mhorne on Dec 10 2020, 3:17 PM.
Tags
None
Referenced Files
F107614499: D27536.diff
Thu, Jan 16, 4:51 PM
Unknown Object (File)
Fri, Jan 10, 11:02 AM
Unknown Object (File)
Sep 24 2024, 8:39 PM
Unknown Object (File)
Sep 17 2024, 6:01 PM
Unknown Object (File)
Sep 17 2024, 6:01 PM
Unknown Object (File)
Sep 17 2024, 6:01 PM
Unknown Object (File)
Sep 17 2024, 5:47 PM
Unknown Object (File)
Sep 17 2024, 9:57 AM
Subscribers

Details

Summary
  • Prefer atomics to the critical section.
  • Use CPU_FOREACH to skip absent CPUs

Seems that a nearly-identical change was made for arm64 in r313345.

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Passed
Unit
No Test Coverage
Build Status
Buildable 35298
Build 32231: arc lint + arc unit

Event Timeline

To be clear, the main motivation here and in r313345 is to ensure that zeroing is reliable? In the old version of this diff it is not, per the XXX comment, but neither the CR description or the commit log message for r313345 mention this explicitly, so maybe I'm missing something else.

This revision is now accepted and ready to land.Dec 10 2020, 3:25 PM

To be clear, the main motivation here and in r313345 is to ensure that zeroing is reliable? In the old version of this diff it is not, per the XXX comment, but neither the CR description or the commit log message for r313345 mention this explicitly, so maybe I'm missing something else.

Presumably you are asking for the exact meaning behind the XXXKIB comment?

Could you explain for me what makes the existing code unreliable for zeroing? It's not obvious to me why the critical section was not enough to protect the increment.

To be clear, the main motivation here and in r313345 is to ensure that zeroing is reliable? In the old version of this diff it is not, per the XXX comment, but neither the CR description or the commit log message for r313345 mention this explicitly, so maybe I'm missing something else.

Presumably you are asking for the exact meaning behind the XXXKIB comment?

Could you explain for me what makes the existing code unreliable for zeroing? It's not obvious to me why the critical section was not enough to protect the increment.

counter_u64_zero_inline() works by raising an interrupt on each CPU; the interrupt handler calls counter_u64_zero_one_cpu() to clear the counter for that CPU.

In the old version of the diff this may fail to work because counter_u64_add() is not atomic with respect to interrupts. It loads the value into a register, modifies it, and stores the result. If the value is zeroed between the load and store, it will get reverted when the interrupt finishes and the interrupted thread continues running. I believe this is what the XXXKIB comment is stating.

counter_u64_zero_inline() works by raising an interrupt on each CPU; the interrupt handler calls counter_u64_zero_one_cpu() to clear the counter for that CPU.

In the old version of the diff this may fail to work because counter_u64_add() is not atomic with respect to interrupts. It loads the value into a register, modifies it, and stores the result. If the value is zeroed between the load and store, it will get reverted when the interrupt finishes and the interrupted thread continues running. I believe this is what the XXXKIB comment is stating.

Thanks, that makes sense. counter_u64_zero_one being called via IPI is the detail I was forgetting. I can include a mention about the comment in the final commit message.

sys/riscv/include/counter.h
65

Most other machine/counter.h headers do this. Any reason not to use zpcpu_get() instead?

sys/riscv/include/counter.h
65

I can't see a reason not use use zpcpu_get().

This revision was automatically updated to reflect the committed changes.