I'm confused.

Aside from Mode 5/6, where does mode-mixing matter?
Nothing else changes the resolution. The output's all 256 pixels wide for 0-4 and 7, and it's all 15-bit BGR color. How does the mode mixing require a post-process (again, aside from using 5 or 6)?

Now, if it's "when mode 5 is mixed in, we have issues", I can understand that. But Mode 1 and 4 seems like the standard scanline updates should handle this well enough? (ie, as well as you'll get without a dot-based PPU)