Page MenuHomeFreeBSD

efirt: add a tunable to disable printing faults during EFIRT calls
ClosedPublic

Authored by kib on Mon, Mar 31, 6:42 PM.
Tags
None
Referenced Files
F113803461: D49592.id.diff
Thu, Apr 3, 8:42 PM
F113796203: D49592.id153030.diff
Thu, Apr 3, 6:21 PM
F113793792: D49592.id152924.diff
Thu, Apr 3, 5:26 PM
F113792240: D49592.diff
Thu, Apr 3, 4:48 PM
F113776408: D49592.id152923.diff
Thu, Apr 3, 10:58 AM
F113771884: D49592.id153053.diff
Thu, Apr 3, 9:29 AM
Unknown Object (File)
Tue, Apr 1, 1:32 AM
Unknown Object (File)
Tue, Apr 1, 12:45 AM

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

kib requested review of this revision.Mon, Mar 31, 6:42 PM

Only amd64 and arm64 implement and build efirt, are the arm and riscv changes truly needed?

Only amd64 and arm64 implement and build efirt, are the arm and riscv changes truly needed?

I thought that they do have efirt, or at least are close to have it. I am happy to drop the arches from the patch.

This seems fine, but every user who sees repeated faults will have to enable the sysctl to prevent the dmesg from filling up.

An alternate policy would be to print the trap info once, then suppress printing for future traps, with a sysctl to disable the suppression. I think this would give nicer out-of-the-box behaviour.

More fine-grained control of the efirt faults prints, and the count of total occurrences.

markj added inline comments.
sys/dev/efidev/efirt.c
134
139
This revision is now accepted and ready to land.Wed, Apr 2, 7:50 PM

I can live with this but note that if the "fault" occurs in normal operation for some machines, even a single message would be confusing, particularly if you are running into another bug! I spend a bunch of time tracking this down as I was also seeing some other errors which turned out to be an unrelated hardware issue.

I can live with this but note that if the "fault" occurs in normal operation for some machines, even a single message would be confusing, particularly if you are running into another bug! I spend a bunch of time tracking this down as I was also seeing some other errors which turned out to be an unrelated hardware issue.

Perhaps the first print should also explain that this is perhaps "normal", and mention the sysctl that can be used to silence it.

I can live with this but note that if the "fault" occurs in normal operation for some machines, even a single message would be confusing, particularly if you are running into another bug! I spend a bunch of time tracking this down as I was also seeing some other errors which turned out to be an unrelated hardware issue.

The fault is definitely not normal, it is most likely an indicator of the bug in BIOS, but also might be in our interface to BIOS.

kib marked 2 inline comments as done.Wed, Apr 2, 8:18 PM