Isn't it possible for a driver to make a given CPU eat a given number of cycles? It's sub-optimal, but I suppose you could make sure that the entire address map range has handlers attached to it, and the handlers deduct the appropriate number of cycles.