lib/libmd: reimplement and enhance md5
Needs ReviewPublic
Actions

Authored by fuz on Jun 21 2024, 10:27 AM.

Details

Reviewers

kevans
emaste
allanjude
andrew
manu

Summary

The reimplementation is a bit cleaner than the original code,
although it is also slightly slower. This shouldn't matter too
much as we have asm code for the major platforms.

Optimised implementations are provided for amd64 and aarch64.
For amd64, we have three implementations. One for baseline,
one using ANDN from BMI1 and one using AVX-512 (though it's not
really vectorised). Here's the performance:

11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz:

pre	17.0s (602 MB/s)
generic	18.8s (545 MB/s)
scalar	13.4s (764 MB/s)
bmi1	12.0s (853 MB/s)
avx512	10.6s (966 MB/s)

ARM Cortex-X1C (Windows 2023 Dev Kit perf core):

pre	35.2s (291 MB/s)
generic	36.4s (281 MB/s)
scalar	34.5s (297 MB/s)

ARM Cortex-A78C (Windows 2023 Dev Kit efficiency core):

pre	46.8s (219 MB/s)
generic	47.3s (216 MB/s)
scalar	44.5s (230 MB/s)

This changeset will have to be reworked when D34497 lands.
I'm not sure how to apply the SIMD code to all uses of MD5.

This changeset anticipates D34498 and no longer provides the
transform and block symbols.

Obtained from: https://github.com/animetosho/md5-optimisation/

Diff Detail

Repository

rG FreeBSD src repository

Lint

Lint Passed

Unit

No Test Coverage

Build Status

Buildable 59949
Build 56834: arc lint + arc unit

Event Timeline

fuz created this revision.Jun 21 2024, 10:27 AM

Herald added a reviewer: andrew. · View Herald TranscriptJun 21 2024, 10:27 AM

Herald added a subscriber: imp. · View Herald Transcript

fuz requested review of this revision.Jun 21 2024, 10:27 AM

Harbormaster completed remote builds in B58276: Diff 140065.Jun 21 2024, 10:27 AM

lwhsu added a subscriber: lwhsu.Jun 24 2024, 8:29 AM

lwhsu added inline comments.

lib/libmd/aarch64/md5block.S
4 ↗	(On Diff #140065)	The order is SPDX then owner from style(9).

Needs to be refactored after the libmd rework.

lib/libmd: add include makefile for md5
lib/libmd: reimplement md5
lib/libmd: add scalar amd64 asm implementation for md5
lib/libmd/amd64: add BMI1 implementation for amd64
lib/libmd/amd64/md5block.S: add AVX-512 implementation
lib/libmd/aarch64: add md5 scalar implementation

This does things the proper way: a new include Makefile is added
so modules that need a copy of the md5 code (libc, libsa) can grab
it easily. The new md5 code is also used in the kernel (except for
the amd64 avx-512 path, as it's too expensive to turn on the FPU).
Internal symbols are decorated with _libmd_ to avoid collisions.

Herald added a reviewer: manu. · View Herald TranscriptOct 12 2024, 11:03 PM

Herald added subscribers: olce, dab. · View Herald Transcript

Harbormaster completed remote builds in B59949: Diff 144760.Oct 12 2024, 11:03 PM

fuz added parent revisions: D47062: lib/libcrypt: unbundle hash functions, D47061: lib/geom: remove redundant libraries and objects.Oct 12 2024, 11:04 PM

Can you move md5c.c in a separate change? It makes it easier to review only the changes to the file (if any)

sys/crypto/md5/md5block_aarch64.S
11	Can you move the macros before `ENTRY`? It makes it easier to find where the start of the function is if you don't have to skip over them.
175–176	Why are these commented out?

In D45670#1073713, @andrew wrote:

Can you move md5c.c in a separate change? It makes it easier to review only the changes to the file (if any)

Hi Andy,

The file md5c.c has been rewritten entirely. A diff can be produced, but is pretty much useless. What stays the same is the weak aliases on the bottom. You can find a the desired diff on Github.

sys/crypto/md5/md5block_aarch64.S
11	I have placed the macros after `ENTRY` as they “declare” local variables of this function. I can move the `ENTRY` macro around, but I believe it is cleared to have the declarations of local variables after the `ENTRY` macro, by analogy to how C functions are laid out.
175–176	They have been outlined and specialised in the two preceding blocks. I placed the commented out code there to make the correspondence clear; I should perhaps add a comment to make this clearer.

sys/crypto/md5/md5c.c: zero out context in MD5Final()
sys/crypto/md5/md5block_aarch64.S: improve comments, move ENTRY()
lib/libmd/Makefile.md5.inc: only apply -DMD5_ASM to md5c.c

Harbormaster completed remote builds in B59973: Diff 144824.Oct 14 2024, 2:41 PM

[18:20:04] <@jrtc27> one review for the new C implementation
[18:20:10] <@jrtc27> one review for each new asm implementation on top of that
[18:20:58] <@jrtc27> and this screams missing depend-cleanup.sh changes to me
[18:23:07] <@jrtc27> and if USE_ASM_SOURCES is an interface into Makefile.md5.inc that's way too global a namespace to be using for it
[18:23:16] <@jrtc27> previously it was fine because it didn't leak outside lib/libmd
[18:23:39] <@jrtc27> but USE_ASM_SOURCES=0 in stand/libsa should be screaming that the name is wrong

jrtc27 added inline comments.Oct 23 2024, 5:37 PM

lib/libc/Makefile
112	Could hide this down in lib/libc/md/Makefile.inc1 and avoid changing this line?
lib/libmd/Makefile.md5.inc
10	Why don't these other conditions influence the default of USE_ASM_SOURCES instead? Then that variable isn't a lie.
11	Can we put these in a directory instead like the other hash functions in lib/libmd/Makefile?
sys/conf/Makefile.amd64
38	Why aren't these in files.arch? We don't use this pattern in sys/conf, flags come from config(8).
sys/crypto/md5/md5c.c
156	I doubt you want these in libsa, and have you verified they're actually important for performance?
sys/crypto/md5/md5dispatch_amd64.c
11	This block of headers isn't style(9)-conformant; they need reordering and separating out into multiple blocks