The size of the memory in the card, it's speed, and the number of shaders it has, determines that.
For a card with a small amount of memory, it will have to be given a new packet, very frequently.
On your system at least, I'd say you have shown that yes, it does knock down the other card to the lower PCI speed, and yes, it does use up a substantial amount of the bandwidth, at that speed.
I believe the bandwidth on the bus is the reason that the second card always slows down the first one's folding, when you have a multi-core cpu. Haven't seen any actual figures for it, however.