Such reception is only "true diversity" if it feeds a "diversity
receiver". Otherwise, there is no discrimination done by the receiver.
Apparently you do not understand how COFDM based Single Frequency
Networks (SFN) work. Multiple transmitters quite close to each others
are used, each transmitting exactly th same signal at exactly the same
frequency with accurate timing. The coverage areas overlap and it
possible that a single receiver takes some subcarriers from one
transmitter and some from other transmitter(s) if the signal from the
nearest station suffers from selective fading on some subcarriers.
From the receiver point of view, having two well separated received
antennas and simply combining the signal into a single coaxial cable
will look just like receiving from two separate transmitters in an SFN
network.
This does not work in a single carrier systems like the ATSC used in
the USA, so such combining does not help. Even for that system, a
proper diversity receiver would help, if such thing exists in the USA.
With proper diversity receivers, the spatial antenna separation should
be as large as possible.
---
There has been discussions in this thread about impedance matching
issues when combining two antennas. While it might be possible for a
very narrow band system to directly connect the dipole terminals to
each other and then to a feedline, but the length of the feedline
would have to be a specific fraction of the wavelength to get correct
impedance transformation.
This approach can not be used with TV due to the huge bandwidth of the
UHF TV band. The practical approach is to use a balun at each antenna
and use ordinary 75 ohm TV coaxial cables with exactly the same length
from the antennas to a passive splitter/combiner and from the common
port then use any length of 75 ohm coax to feed the TV.
While in theory, stacking two identical antennas would give an
additional 3 dB gain, but in practice, due to the variation in
radiation pattern and the losses in the combiner, the practically
achievable gain increase is 2-2.5 dB due to stacking.
---
While the signal from the satellite is quite predictable and
fluctuates very little, the terrestrial signal fluctuates in many
ways. Most of the time, the signal level is close to the average
level, but then there are short dips (-5 .. -20 dB) in the signal
level, perhaps 1-10 % of the time. These short dips in analog TV was
not a big issue, since it only momentarily added some noise to the
picture.
If the signal drops below the digital receiver threshold, the raw bit
error rate will increase to 100 % (error burst). To combat this, most
digital radio systems use bit interleaving, thus, after
deinterleaving, burst errors are converted to random errors, which can
be corrected by an error correction code.
Depending on how long time span the bits are interleaved compared to
the length of the error burst, the signal might or might not be
recovered without any losses. If the error burst is too long (the
signal drop takes too long), the picture pixeliates and the sound is
muted.
While stacking closely two antennas in the same mast will no doubt
increase the _average_ signal level by 2 dB, while the signal is
already adequate most of the time, but during a -20 dB dip, stacking
would only result in a -18 dB dip in signal level. This might slightly
shorten the length of the error burst (and slightly helping the ECC),
I would not expect miracles from stacking two antennas close to each
other.
One should try to find a better location for the antenna and increase
the height, in order to get a reliable, comfortable digital reception.