Page MenuHomeFreeBSD

linux(4): implement coredump support
ClosedPublic

Authored by trasz on Apr 28 2021, 9:11 AM.
Tags
None
Referenced Files
F102849763: D30019.id90712.diff
Sun, Nov 17, 11:36 PM
F102833381: D30019.id90712.diff
Sun, Nov 17, 6:38 PM
F102824545: D30019.id90712.diff
Sun, Nov 17, 4:22 PM
Unknown Object (File)
Sun, Nov 17, 1:12 PM
Unknown Object (File)
Thu, Nov 14, 2:14 AM
Unknown Object (File)
Fri, Nov 8, 2:14 AM
Unknown Object (File)
Fri, Nov 1, 4:49 AM
Unknown Object (File)
Mon, Oct 21, 7:45 AM

Details

Summary

Implement dumping core for Linux binaries on amd64, for both 32- and 64-bit executables.
Some bits are still missing.

This is based on a prototype by chuck@.

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Passed
Unit
No Test Coverage
Build Status
Buildable 40182
Build 37071: arc lint + arc unit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes
sys/compat/linux/linux_elf.c
353

is __ELF_WORD_SIZE == 32 really should be here?

sys/compat/linux/linux_elf.c
353

I think so. Thing is, this code is built twice, but indirectly; see linux_elf64.c and linux_elf32.c.

There's still one change to imgact_elf.c, but this one is completely Linux-specific, so I'm not sure if there's a point in splitting it off.

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

I think this should be yet another sv_XXX method.

sys/compat/linux/linux_elf.c
353

ok, I’m confused a bit)
For i386 we does not define compat_linux32

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

Why? We're already calling this through this indirection (as sv_coredump); I don't see the point of adding more. Also, another ABI - if it ever happens - might need different tweaks here.

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

For me, your reply sounds as an additional strong argument pro change this to sv_XXX method. Exactly because this part is per-ABI.

The fact that it is called through sv_coredump does not matter, since you ended up in the shared code.

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

This isn't per-ABI - it's ABI-specific. In other words, one can expect each ABI to require this kind of quirk, but not this particular one, so we would be adding more and more sv_whatever variables to control each of them.

sys/modules/linux64/Makefile
21

what is the purpose to ignore i386?

sys/modules/linux64/Makefile
21

It doesn't build on i386, and I'm not particularly motivated to spending time on fixing a dead architecture, I have a live one that needs fixing ;-)

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

I think my main problem here is that adding another sv_variable makes things harder to understand (more indirection) and more boilerplate'y for no good reason - it won't make the code cleaner, shorter, or more flexible.

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

Of course it makes the code cleaner, and we might argue about flexibility. But not having the FreeBSD ABI coredump code to know/care about other ABIs is the good design by definition.

Also, I do not get the argument that some ABIs would need to do more specializations there. If the differences are large, ABI would provide more then one sysent classes, if not, it would handle it internally. But even more then, I do not want to contaminate the base FreeBSD code with the irrelevant details to handle foreign ABIs.

Get rid of SVC_LINUX, prefill this field first instead.

sys/kern/imgact_elf.c
1825 ↗(On Diff #89897)

Okay, what do you think about this instead? It avoids having any Linux-specific code in the common path, is simple, flexible, and avoids introducing another sysent entry.

sys/compat/linux/linux_elf.c
120

Now that there is no common code changed in your patch, I looked at the linux/ diff.

I would not call this code similar. This function is a literal copy of the FreeBSD native coredump, modulo the removal of compressed dump support.

221

And this function is again the exact copy of the FreeBSD' corehdr except the added assignment to e_ident.

I do not think this is a good approach. Returning to the previous suggestion of adding some sv_ methods, I think this is the way to go instead of copying 90% of coredumping code and making small modifications here and there.

If you do not like adding sv_ methods for ELF coredump, consider creating a table of virtual methods just for ELF coredump and passing it as an arg to coredump(). But this copying of ~300 LoC should be fixed IMO.

Rework the sysent to reduce code duplication.

trasz added inline comments.
sys/compat/linux/linux_elf.c
221

I thought there would be too many differences to do this properly, but turns out you were right - it's just two additional sv_ variables and one method, with pretty much no duplication left.

Overall, I like the result.

Again, could you, please split this change into core part and then Linux bits? Perhaps for review it is better to keep them together, but I suggest to make (at least) two commits there.

sys/kern/imgact_elf.c
1885 ↗(On Diff #90740)

Can you change the interface to add either struct sysentvec *sv or struct thread *td as the argument, to avoid referencing curproc there?

I know that coredump code have more dependencies on the context so it cannot dump remote process, but I do not want to introduce new deps.

1955 ↗(On Diff #90740)

Same (please add td or sv arg)/

sys/sys/imgact_elf.h
105 ↗(On Diff #90740)

I am not sure that this static definition has the effect you intended, formally it add the instance of FREEBSD_ABI_VENDOR to all files including this header.

sys/sys/sysent.h
117 ↗(On Diff #90740)

I suggest to name the fields in style of sv_elf_core_XXX

trasz marked an inline comment as done.

Implement Konstantin's suggestions.

Will definitely split it into non-Linux specific part, and the Linux-specific one, but I think it's easier to review as a whole.

sys/compat/linux/linux_elf.c
106

Extra ()

126

What are the differences between this code and FreeBSD native prpsinfo?

135

The cast is not needed.

197

Cast is not needed.
Perhaps you can fix these casts in FreeBSD code as the preliminary commit.

221

But this function is still the exact copy of the FreeBSD function?

246

Same code, again?

sys/sys/sysent.h
119 ↗(On Diff #90801)

Is this line too long?

sys/compat/linux/linux_elf.c
126

I think none at the moment, but there's a large chance they will diverge. This code is in a "good enough" state, but it'll require spending some more time comparing its output with what Linux kernel generates. Thus, I'd rather not deduplicate them just yet.

sys/compat/linux/linux_elf.c
106

You might want to fix this in imgact_elf.c as a preliminary commit.

126

I suspect that prpsinfo goes back to SysV, so the format of the note is engraved in stone.

276

Why 320?

And similar question for the native note_procstat_auxv, where it might become too short quite fast (128).

I mean, it should be based on AT_COUNT and equivalent Linux constant.

trasz added inline comments.
sys/compat/linux/linux_elf.c
106
126

I'm not sure anything can be engraved in stone when we're talking about Linux :-) In any case: if it turns out these are same between FreeBSD and Linux, I'll deduplicate them later on.

Fix some more issues pointed out by Kib, and try to pacify tinderbox.

sys/amd64/amd64/elf_machdep.c
64 ↗(On Diff #91212)

BTW I think all entries can use elf64 instead of elfN(). This is for later, there are existing elfN() uses.

sys/compat/linux/linux_elf.c
126

In fact, I would be more happy about leaving this functions redefined if you used the names like l_prstatus_t instead of using exact same name as the FreeBSD native code.

Add prefixes to individual functions; should make backtraces
less confusing. While here also add a greppable comment.

sys/compat/linux/linux_elf.c
138

If it is still elf_prpsinfo_t, you do not need a copy of the function. Both prpsinfo and prfpregset should be linux-named IMO.

sys/compat/linux/linux_elf.c
138

It is elf_prpsinfo_t _for now_; it will likely change later. I'd prefer to avoid a scenario where we add complexity, exporting the native function and using it here, only to find that we actually do need to do something differently here. Deduplicating stuff afterwards is easy.

sys/compat/linux/linux_elf.c
123

Why did you left the generic name for this (and one below) types?

These should be e.g. linux_prpsinfo_t. Do not use elf_ prefix in linux code which you refuse to de-dup.

203

This cast is not needed.

Fix type names, remove unnecessary casts.

This revision is now accepted and ready to land.Jun 30 2021, 12:29 PM
This revision was automatically updated to reflect the committed changes.