Use kqueue(2) for socket I/O and signal notification. Previously, select(2) and traditional signal handlers were being used. This change centralizes all of the async notification delivery into a single loop so future Capsicum sandboxing will be easier. It also simplifies the code by removing boiler-plate cruft that comes with the older interfaces.
Details
Diff Detail
- Repository
- rG FreeBSD src repository
- Lint
Lint Skipped - Unit
Tests Skipped
Event Timeline
usr.sbin/syslogd/syslogd.c | ||
---|---|---|
377 | Can this be static const int sigcatch[]? | |
379 | As a matter of style, I'd have these either all on one line, or all on separate lines. | |
716 | IMO it is rather odd to call this variable error when it holds a file descriptor in the common case. | |
775 | err() will print the string corresponding to the errno value set by kevent(). But, errno might be clobbered by the pidfile_remove() call, so to be correct you need to write something like: if (kevent(...) == -1) { error = errno; pidfile_remove(pfh); errc(1, error, "failed to add kevent"); } | |
778 | SIGPIPE isn't listed in sigcatch[]. | |
782 | Same comment as above with respect to errno. | |
803 | Same comment as above with respect to errno. | |
3601 | In the code above which blocks these signals, there is a special case for SIGCHLD, but that's not present here. Is that intentional? |
usr.sbin/syslogd/syslogd.c | ||
---|---|---|
377 | Yes. | |
379 |
I was trying to align with the TypeNames[] style above. This is all fixed later. sigcatch is put on a single line. | |
716 |
Agreed. Not sure why I changed this. | |
3601 |
I can't quite remember why SIGCHLD is even in sigcatch[]. It especially doesn't make sense with process descriptors. I wrote a lot of this code back when I was still learning syslogd, unfortunately. It is going to be a massive headache to rebase all of this. |
Mostly looks good, I just have one question about signal handling.
usr.sbin/syslogd/syslogd.c | ||
---|---|---|
789 | If I understand correctly, when a blocked signal is delivered, it'll trigger the knote and remain pending until the signal is unblocked. I believe it'll remain pending until syslogd exits, because they're never unblocked. Why not just ignore these signals instead? |
usr.sbin/syslogd/syslogd.c | ||
---|---|---|
789 | I will address this in the reapchild() removal patch. Ignoring SIGCHLD will also stop the generation of zombies. |
I am use syslogd for collectiong messages from many sourcses.
I have moderate message flow (about 1..10 messages/second average).
I am applay D41357, D41358, D41359, D41360, D41362 -- no problems.
I am applay D41363 -- after about 1..5 minutes syslogd stop to got and processing remote messages. Local messages still processing.
Most of remote messages don't recived and lost -- no action in kevent().
After D41363 syslogd is broken.
usr.sbin/syslogd/syslogd.c | ||
---|---|---|
773 | Hmm, why exactly do we need EV_CLEAR here? Don't you want the event to fire so long as there's data in the socket buffer? |
I was able to reproduce this issue and it seems Mark was right, removing EV_CLEAR from kevent() flags, fixes the messages stalling.
Oddly enough, this bug doesn't occur when the messages are sent through syslogd's UDP socket locally. This only happens when the messages are coming from another host.
@slw_zxy.spb.ru, would you mind seeing if this new patch works for you?
Remove EV_CLEAR flag when adding logsocket kevents to kqueue so we don't drop INET messages.
Thanks to @markj for pointing this out.
Fix bug where syslogd would stall upon startup, never getting the chance to fsync().
The ts and tsp variables were originally used as a timeout in kevent(). If needdofsync was true, the timeout would be set to 0, and dofsync() would be called (assuming the kqueue had 0 events pending). Upon startup, syslogd's sockets were already populated, so the kqueue had pending events. syslogd never got a chance to fsync() and signal its parent that it had finished initialization, so it stalled.
This is fixed with a simple if-statement that checks if needdofsync is true. If it is, it will sync. There is no dependency on kevent() this way.