mjg59 | Trials and tribulations with EFI

I wrote about some EFI implementation issues I'd seen on Macs a while back. Shortly afterwards we started seeing approximately identical bugs on some Intel reference platforms, and fixing it actually became more of a priority.

The fundamental problem is the same. We take the EFI memory map, identify the virtual addresses of the regions that will be required for runtime (mapping them into virtual address space if needed) and then call the firmware's SetVirtualAddressMap() implementation in order to let the firmware convert all its pointers. Sadly it seems that some firmware implementations call into sections of boot services code to do this, which is unfortunate because we've already taken that back to use as RAM. So, given that this is clearly against the spec, how does it ever work?

The tediously dull version is that Linux typically calls SetVirtualAddressMap() in the kernel, and everyone else does it in their bootloaders. The bootloader hasn't set up NX bits or anything, so it just happens to work there. We could just do it in the bootloader in Linux, but that makes doing things like kernel address space randomisation trickier, so it's not the favoured approach. So, instead, we can probably just reserve those ranges until after we've switched to virtual mode, and make sure the pages are executable. This ought to land in 2.6.40, or whatever it ends up being called.

(The alternative approach, of just never transitioning to physical mode, turns out to mysteriously fail on various machines. Calls to SetVariable() just give errors. We just don't know)

That still leaves the problem of SetVariable() on the test Mac trying to access a random address. That one turned out to be easier. There's 2MB of flash at the top of physical address space, and this was being presented as being broken into four separate EFI regions. While physically contiguous, Linux was mapping these to discontiguous virtual addresses. Apple's firmware appeared to assume that a pointer into one region could just be incremented into another. So because it's still easier to change the kernel than change Apple, 2.6.39 merges these regions to ensure they're contiguous.

Remaining problems include some machines seemingly not booting if they have 4GB of RAM or more and this Apple failing to communicate with its panel over the eDP auxchannel. Anyone got any idea how to dump the bios compatibility module out of a running EFI session?

Flat | Top-Level Comments Only

From:

benanov.livejournal.com

"Apple's firmware appeared to assume that a pointer into one region could just be incremented into another."

I wonder if the presentation that the flash is 4 separate regions is false & actually the bug - or maybe it's a limitation because it's on 4 separate 512KB chips?

Don't know enough about EFI, just trying to attack the problem another way.

mjg59

The region appears to cover both sections of the firmware and the non-volatile storage, so it's entirely possible that the reason for the segmented regions is that they do represent different functional domains (either separate chips, or separate regions of a single lump of flash). Given that it works fine when it's collapsed into a single region there's no real reason it couldn't have been presented that way - or, alternatively, the firmware could just not make the assumption that they're going to be contiguous in virtual space. I'm pretty sold on this being a real firmware bug, just one that it's easy for us to work around.

From: (Anonymous)

Is there a way you can ignore the firmware altogether? Seems like working around bugs in BIOS and EFI implementations is a losing game.

We could target the hardware directly, but that would mean a separate kernel for every single different machine. Probably not practical.

I don't really get this. Linux and Windows have been bypassing the BIOS ever since 32-bit kernels appeared, yet the same kernel appears to work on every piece of hardware.

Are you implying that x86 hardware vendors are heading towards the same segmentation as in the ARM world, using EFI as a "we're still compatible" excuse? Sadly, it wouldn't even surprise me. But it would motivate me to invest heavily in companies supporting coreboot, such as AMD.

We're completely unable to run on x86 without the BIOS. We depend on it for the initial memory map, which we need in order not to tread on bits of address space that contain magic hardware. It provides the ACPI tables that we need in order to manage various bits of hardware at runtime. It gives us the information about the video hardware that's necessary in order to know which outputs are present and what the hardware can be clocked at.

The only thing EFI adds is some additional runtime services. The only one of those we actually need is the ability to set nvram variables. If we didn't have EFI we'd need to know the location of the flash and we'd need to know the internal storage format, and that's not something that's standardised.

I apologise for the unrelated complain, but planet.gnome.org points to your advocato account which doesn't have comments. Why not point planet.gnome.org to one of the sites that do?

Because I'm lazy. I'll get around to it at some point.

yuhong.wordpress.com

I read the descriptions for the two patches, and they are quire amusing

Matthew Garrett

Trials and tribulations with EFI

Trials and tribulations with EFI

contiguous non-contiguous regions

Re: contiguous non-contiguous regions

ignore BIOS^wEFI

Re: ignore BIOS^wEFI

Re: ignore BIOS^wEFI

Re: ignore BIOS^wEFI

Advocato, Livejournal, dreamwidth accounts

Re: Advocato, Livejournal, dreamwidth accounts

The descriptions are quire amusing

Profile

About Matthew

Page Summary

Expand Cut Tags