Page MenuHomeFreeBSD

tftpd: Address flaky tests
ClosedPublic

Authored by markj on Nov 2 2024, 4:58 PM.
Tags
None
Referenced Files
F108814143: D47404.id145940.diff
Tue, Jan 28, 5:43 AM
Unknown Object (File)
Fri, Jan 24, 7:11 AM
Unknown Object (File)
Sat, Jan 18, 2:31 PM
Unknown Object (File)
Fri, Jan 17, 2:04 AM
Unknown Object (File)
Sun, Jan 12, 1:44 PM
Unknown Object (File)
Dec 26 2024, 9:00 PM
Unknown Object (File)
Dec 12 2024, 3:06 AM
Unknown Object (File)
Dec 8 2024, 11:39 PM
Subscribers

Details

Summary

The tftpd tests all follow the same pattern:

  1. open a udp socket
  2. fork a child to exec tftpd, which subsequently handles requests on the socket
  3. use a client socket to send some message to the tftpd daemon

However, tftpd's first action is to mark its socket as non-blocking and
then read a request from it. If no data is present in the socket, tftpd
exits immediately with an error. So, there is a race; we often see
tftpd test timeouts when running tests in parallel. These timeouts also
arise periodically in CI.

One solution is to restructure the tests to create the server socket,
then write the request to the client socket, then fork tftpd. This
closes the race. However, this involves a lot of churn.

This patch fixes the problem a different way, by adding a new -b flag to
tftpd which makes it block for the initial request. I think this may be
acceptable as an alternative, and does not require extensive
restructuring of each tftpd test.

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

markj requested review of this revision.Nov 2 2024, 4:58 PM

Kinda ugly, but fine given our test suite runs tftpd in an unusual way.

This revision is now accepted and ready to land.Nov 2 2024, 5:06 PM

I agree. I can see a couple of other ways to fix this problem, but none are as easy as Mark's.

This revision was automatically updated to reflect the committed changes.