By David Sparano
The U.S. invented it, the ATTC tested it,
the FCC accepted it, everyone is talking about it, and soon we'll all get it in
our homes--but what is 8-VSB anyway? Simply put, 8-VSB is the RF modulation
format utilized by the recently approved ATSC digital television standard to
transmit digital bits over the airways to the home consumer. Since any
terrestrial TV system must overcome numerous channel impairments such as
ghosts, noise bursts, signal fades, and interference in order to reach the home
viewer, the selection of the right RF modulation format is critical. Being one
of the most crucial aspects of the DTV system, the 8-VSB format has received a
great deal of attention and scrutiny recently.
In the alphabet soup world of digital
communications, there are two big names to remember when thinking about the
complete DTV system: 8-VSB and MPEG-2. 8-VSB is the RF modulation format and
MPEG-2 is the video compression/packetization format. To convert high
definition studio video into a form suitable for over-the-air broadcast,
according to DTV standards, two stages of processing are needed: MPEG-2
encoding and 8-VSB modulation. Accordingly, two major pieces of equipment are
required: an MPEG-2 encoder and an 8-VSB exciter.
The MPEG-2 encoder takes baseband digital
video and performs bit rate compression using the techniques of discrete cosine
transform, run length coding and bi-directional motion prediction--all of which
are discussed elsewhere in this book. The MPEG-2 encoder then multiplexes this
compressed video information together with pre-coded Dolby Digital (AC-3) audio
and any ancillary data that will be transmitted. The result is a stream of
highly compressed MPEG-2 data packets with a data rate of only 19.39 Mbps. This
is by no means a trivial task since the high resolution digital video (or
multiple standard resolution video) input to the MPEG-2 encoder could easily
have a data rate of one Gbps or more. This 19.39 Mbps data stream is known as
the DTV Transport Layer. It is output from the MPEG-2 encoder and input to the
8-VSB exciter.
Although MPEG-2 compression techniques can
achieve stunning bit-rate reduction results, still more tricks must be employed
to squeeze the 19.39 Mbps DTV Transport Layer signal into a slender six MHz RF
channel and transmit it (hopefully without errors) to the eager consumer
waiting at home in front of the TV set. This is the job of the 8-VSB exciter.

Figure 1 is a block diagram of a typical
8-VSB exciter. In this section, we will walk through the major processes that
occur in the 8-VSB exciter--identifying the major components of the 8-VSB
signal and explaining how this signal is generated.
Data Synchronization
The first thing that the 8-VSB exciter does
upon receiving the MPEG-2 data packets is to synchronize its own internal
circuits to the incoming signal. Before any signal processing can occur, the
8-VSB exciter must correctly identify the start and end points of each MPEG-2 data
packet. This is accomplished using the MPEG-2 sync byte. MPEG-2 packets are 188
bytes in length with the first byte in each packet always being the sync byte.
The MPEG-2 sync byte is then discarded; it will ultimately be replaced by the
ATSC segment sync in a later stage of processing.
Data Randomizer
With the exception of the segment and field
syncs (to be discussed later), the 8-VSB bit stream must have a completely
random, noise-like nature. This is because our transmitted signal frequency
response must have a flat noise-like spectrum in order to use the allotted
channel space with maximum efficiency. If our data contained repetitious
patterns, the recurring rhythm of these patterns would cause the RF energy
content of our signal to "lump" together at certain discrete points
of our frequency spectrum--thereby leaving holes in other parts. This implies
that certain parts of our six MHz channel would be over-used while other parts
would be under-used. Plus, the large concentrations of RF energy at certain
modulating frequencies would be more likely to create discernible beat patterns
in an NTSC television set when DTV-to-NTSC interference is experienced.
In the data randomizer, each byte value is
changed according to known pattern of pseudo-random number generation. This
process is reversed in the receiver in order to recover the proper data values.
Reed-Solomon Encoding
Reed-Solomon encoding is a Forward Error
Correction (FEC) scheme applied to the incoming data stream. Forward Error
Correction is a general term used to describe a variety of techniques that can
be used to correct bit errors that occur during transmission. Atmospheric
noise, multipath propagation, signal fades, and transmitter non-linearities may
all create received bit errors. Forward Error Correction can detect and correct
these errors, up to a reasonable limit.

The Reed-Solomon encoder takes all 187 bytes
of an incoming MPEG-2 data packet (the packet sync byte has been removed) and
mathematically manipulates them as a block to create a sort of "digital
thumbnail sketch" of the block contents. This "sketch" occupies
20 additional bytes which are then tacked onto the tail end of the original 187
byte packet. These 20 bytes are known as Reed-Solomon parity bytes.
The receiver will compare the received 187
byte block to the 20 parity bytes in order to determine the validity of the
recovered data. If errors are detected, the receiver can use the parity
"thumbnail sketch" to locate the exact location of the errors, modify
the disrupted bytes, and reconstruct the original information. Up to 10 byte
errors per packet can be corrected this way. If too many byte errors are
present in a given packet, the parity "thumbnail sketch" no longer
resembles the received data block, the validity of the data can no longer be
confirmed, and the entire MPEG-2 packet must be discarded.
Data Interleaver
The data interleaver disturbs the sequential
order of the data stream and disperses the data throughout time (over a range
of about 4.5 msec through the use of memory buffers) in order to minimize the
transmitted signal's sensitivity to burst-type interference.
This is the equivalent of spreading all of
your eggs (bytes) over many different baskets (time). If a noise burst punches
a hole in the signal during propagation and "one basket" (i.e.,
several milliseconds) is lost, many different segments lose one egg instead of
one data segment losing all of its eggs. This is known as time diversity.
Data interleaving is done according to a
known pattern; the process is reversed in the receiver in order to recover the
proper data order.
Trellis Encoder
Trellis coding is yet another form of
Forward Error Correction. Unlike Reed-Solomon coding, which treats the entire
MPEG-2 packet simultaneously as a block, trellis coding is an evolving code
that tracks the progressing stream of bits as it develops through time.
Accordingly, Reed-Solomon coding is known as a form of block code, while
trellis coding is a convolutional code.
For trellis coding, each 8-bit byte is split
up into a stream of four, 2-bit words. In the trellis coder, each 2-bit word
that arrives is compared to the past history of previous 2-bit words. A 3-bit
binary code is mathematically generated to describe the transition from the
previous 2-bit word to the current one. These 3-bit codes are substituted for
the original 2-bit words and transmitted over-the-air as the eight level
symbols of 8-VSB (3 bits = 23 = 8 combinations or levels). For every two bits
that go into the trellis coder, three bits come out. For this reason, the
trellis coder in the 8-VSB system is said to be a 2/3 rate coder.
The trellis decoder in the receiver uses the
received 3-bit transition codes to reconstruct the evolution of the data stream
from one 2-bit word to the next. In this way, the trellis coder follows a
"trail" as the signal moves from one word to the next through time.
The power of trellis coding lies in its ability to track a signal's history
through time and discard potentially faulty information (errors) based on a
signal's past and future behavior.
This is somewhat like following one person's
footsteps through the snow on a busy sidewalk. When the trail becomes confused
with other tracks (i.e., errors are received), the trellis decoder has the
ability to follow several possible "trails" for a few footprints and
make a decision as to which prints are the correct ones. (Note: change this
analogy to "footprints in the sand on a crowded beach" if you are
reading this in a warm climate.)
Sync and Pilot
The next step in the signal processing chain
is the insertion of the various "helper" signals that aid the 8-VSB
receiver in accurately locating and demodulating the transmitted RF signal.
These are the ATSC pilot, segment sync, and frame sync. The pilot and sync
signals are inserted after the data randomization, Reed-Solomon coding, data
interleaving and trellis coding stages so as not to destroy the fixed time and
amplitude relationships that these signals must possess in order to be
effective.
Recovering a clock signal in order to decode
a received waveform has always been a tricky proposition in digital RF
communications. If we derive the receiver clock from the recovered data, we
have a sort of "chicken and egg" dilemma. The data must be sampled by
the receiver clock in order to be accurately recovered. The receiver clock
itself must be generated from accurately recovered data. The resulting clocking
system quickly "crashes" when the noise or interference level rises
to a point that significant data errors are received.

When NTSC was invented, the need was
recognized to have a powerful sync pulse that rose above the rest of the RF
modulation envelope. In this way, the receiver synchronization circuits could
still "home-in" on the sync pulses and maintain the correct picture
framing--even if the contents of the picture were a bit snowy. (Everyone saw
the need for this except the French; sync there is the weakest part of the
signal--vive la diffˇrence). NTSC also benefited from a large residual visual
carrier (caused by the DC component of the modulating video) that helped TV
receiver tuners zero in on the transmitted carrier center frequency.
8-VSB employs a similar strategy of sync
pulses and residual carriers that allows the receiver to "lock" onto
the incoming signal and begin decoding, even in the presence of heavy ghosting
and high noise levels.
The first "helper" signal is the
ATSC pilot. Just before modulation, a small DC shift is applied to the 8-VSB
baseband signal (which was previously centered about zero volts with no DC
component). This causes a small residual carrier to appear at the zero
frequency point of the resulting modulated spectrum. This is the ATSC pilot.
This gives the RF PLL circuits in the 8-VSB receiver something to lock onto
that is independent of the data being transmitted.
Although similar in nature, the ATSC pilot
is much smaller than the NTSC visual carrier, consuming only 0.3 dB or 7
percent of the transmitted power.
The other "helper" signals are the
ATSC segment and frame syncs. An ATSC data segment is comprised of the 187
bytes of the original MPEG-2 data packet plus the 20 parity bytes added by the
Reed-Solomon encoder. After trellis coding, our 207 byte segment has been
stretched out into a stream of 828 8-level symbols. The ATSC segment sync is a
repetitive four symbol (one byte) pulse that is added to the front of the data
segment and replaces the missing first byte (packet sync byte) of the original
MPEG-2 data packet. Correlation circuits in the 8-VSB receiver home in on the
repetitive nature of the segment sync, which is easily contrasted against the
background of completely random data. The recovered sync signal is used to
generate the receiver clock and recover the data. Because of their repetitive
nature and extended duration, the segment syncs are easy for the receiver to
spot. The result is that accurate clock recovery can be had at noise and
interference levels well above those where accurate data recovery is impossible
(up to 0 dB SNR--data recovery requires at least 15 dB SNR). This allows for
quick data recovery during channel changes and other transient conditions.
Figure 2 shows the make-up of the ATSC data segment and the position of the
ATSC segment sync.
An ATSC data segment is roughly analogous to
an NTSC line; ATSC segment sync is somewhat like NTSC horizontal sync. Their
duration and frequencies of repetition are, of course, completely different.
Each ATSC segment sync lasts 0.37 msec.; NTSC sync lasts 4.7 msec. An ATSC data
segment lasts 77.3 msec.; an NTSC line 63.6 msec. A careful inspection of the
numbers involved reveals that the ATSC segment sync is somewhat more
"slender" when compared to its NTSC counterpart. This is done to
maximize the active data payload and minimize the time committed to sync
"overhead."
Three hundred and thirteen consecutive data
segments are combined to make a data frame. Figure 3 shows the make-up of an
ATSC data frame. The ATSC frame sync is an entire data segment that is repeated
once per frame (24.2 msec.) and is roughly analogous to the NTSC vertical interval.
(FYI: The NTSC vertical interval occurs once every 16.7 msec.). The ATSC frame
sync has a known data symbol pattern and is used to "train" the
adaptive ghost-canceling equalizer in the receiver. This is done by comparing
the received signal with errors against the known reference of the frame sync.
The resulting error vectors are used to adjust the taps of the receiver
ghost-canceling equalizer. Like the segment sync, the repetitive nature of the
frame sync, and correlation techniques used in the 8-VSB receiver, allow frame
sync recovery at very high noise and interference levels (up to 0 dB SNR).
The robustness of the segment and frame sync
signals permits accurate clock recovery and ghost-canceling operation in the
8-VSB receiver--even when the active data is completely corrupted by the
presence of strong multipath distortion. This allows the adaptive
ghost-canceling equalizer "to keep its head" and "hunt around in
the mud" in order to recover a useable signal--even during the presence of
strong signal echoes.
AM Modulation
Our eight-level baseband signal, with syncs
and DC pilot shift added, is then amplitude modulated on an intermediate
frequency (IF) carrier. With traditional amplitude modulation, we generate a
double sideband RF spectrum about our carrier frequency, with each RF sideband
being the mirror image of the other. This represents redundant information and
one sideband can be discarded without any net information loss. This strategy
was employed to some degree in creating the vestigial lower sideband in
traditional NTSC analog television. In 8-VSB, this concept is taken to greater
extremes with the lower RF sideband being almost completely removed. (Note:
8-VSB = 8 level--Vestigial Side Band.)
(There are several different ways to
implement the AM modulation, VSB filtering, and pilot insertion stages of the
8-VSB exciter, some of which are completely digital and involve direct digital
synthesis of the required waveforms. All methods aim to achieve the same
results at the exciter output. This particular arrangement was chosen in the
interest of providing a clear, easily understandable, signal flow diagram.)
Nyquist Filter
As a result of the data overhead added to
the signal in the form of forward error correction coding and sync insertion, our
data rate has gone from 19.39 Mbps at the exciter input to 32.28 Mbps at the
output of the trellis coder. Since 3 bits are transmitted in each symbol of the
8-level 8-VSB constellation, the resulting symbol rate is 32Mb / 3 = 10.76
Million symbols/sec. By virtue of the Nyquist Theorem, we know that 10.76
Million symbols/sec can be transmitted in a VSB signal with a minimum frequency
bandwidth of 1/2 X 10.76 MHz = 5.38 MHz. Since we are allotted a channel
bandwidth of 6 MHz, we can relax the steepness of our VSB filter skirts
slightly and still fall within the 6 MHz channel. This permissible excess
bandwidth (represented by a, the Greek letter alpha) is 11.5 percent for the
ATSC 8-VSB system. That is, 5.38 MHz (minimum bandwidth per Nyquist) + 620 kHz
(11.5 percent excess bandwidth) = 6.00 MHz (channel bandwidth used). The higher
the alpha factor used, the easier the hardware implementation is, both in terms
of filter requirements and clock precision for sampling.
The resulting frequency response after Nyquist
VSB filter is shown in figure 4.
This virtual elimination of the lower
sideband, along with the narrowband filtering of the upper sideband, creates
very significant changes in the RF waveform that is ultimately transmitted. For
the NTSC-hardened veteran, there is a great temptation to imagine the 8-VSB RF
waveform as being a sort of "8-step luminance stairstep" signal
transmitting the eight levels of 8-VSB. Unfortunately, there is a fundamental
flaw with this notion. As figure 5 illustrates, such a crisp stairstep signal
with "squared off" abrupt transitions would generate a frequency
spectrum that is far too wide for our single 6 MHz channel. A "square
symbol pulse" -type signal generates a rich spectrum of frequency
sidelobes that would interfere with adjacent channels.

We know that this type of RF waveform is
incorrect since our Nyquist VSB filter has already pared our RF spectrum down
to a slender 6 MHz channel.
As any video or transmitter engineer knows,
when a square pulse is frequency bandlimited, it will lose its square edges and
"ring" (oscillate) in time before and after the initial pulse. For
our digital 8-level signal, this would spell disaster as the pre- and
post-ringing from one symbol pulse would interfere with the preceding and
following pulses, thereby distorting their levels and disrupting their
information content.
Fortunately, there is still a way to
transmit our 8-VSB symbol pulses if we observe that the 8-level information is
only recognized during the precise instant of sampling in the receiver. At all
other times, the symbol pulse amplitude is unimportant and can be modified in
any way we please--so long as the amplitude at the precise instant of sampling
still assumes one of the required eight amplitude levels.
If the narrowband frequency filtering is
done correctly according to the Nyquist Theorem, the resulting train of symbol
pulses will be orthogonal. This means that at each precise instant of sampling,
only one symbol pulse will contribute to the final RF envelope waveform; all
preceding and following symbol pulses will be experiencing a zero crossing in
their amplitude. This is shown in figure 6a. In this way, when the RF waveform
is sampled by the receiver clock, the recovered voltage will represent only the
current symbol's amplitude (one of the eight possible levels).
At all times in-between the instants of
sampling, the total RF envelope waveform reflects the addition of the
"ringing" of hundreds of previous and future symbols (since all
symbols have non-zero amplitudes between sampling times). Note that, in the
interest of simplicity, figure 6a shows our narrowband symbol pulses as ringing
for only 10 sampling periods; in reality they ring for a much longer time.
These non-zero values (between sampling times) from hundreds of symbols can add
up to very large signal voltages. The result is a very "peaky" signal
that most closely resembles white noise. This is shown in figure 6b. The peak
to average ratio of this signal can be as high as 12 dB, although RF peak
clipping in the transmitter can limit this value to 6 to 7 dB with minimal
consequences.
8-VSB Signal Constellation
In 8-VSB, the digital information is
transmitted exclusively in the amplitude of the RF envelope and not in the
phase. This is unlike other digital modulation formats such as QAM, where each
point in the signal constellation is a certain vector combination of carrier
amplitude and phase. This is not possible in 8-VSB since the carrier phase is
no longer an independent variable under our control, but is rather
"consumed" in suppressing the vestigial lower sideband.
The resulting 8-VSB signal constellation, as
compared to 64-QAM, is shown in figure 7. Our eight levels are recovered by
sampling an in-phase (I channel) synchronous detector. Nothing would be gained
by sampling a quadrature channel detector since no useful information is
contained in this channel. Our signal constellation diagram is therefore a
series eight vertical lines that correspond to our eight transmitted amplitude
levels. By eliminating any dependence on the Q-channel, the 8-VSB receiver need
only process the I channel, thereby cutting in half the number of DSP circuits
required in certain stages. The result is greater simplicity, and ultimately
cost savings, in the receiver design.

The Rest Of The 8-VSB Chain
After the Nyquist VSB filter, the 8-VSB IF
signal is then double up-converted in the exciter, by traditional
oscillator-mixer-filter circuits, to the assigned channel frequency in the UHF
or VHF band. The on-channel RF output of the 8-VSB exciter is then supplied to
the DTV transmitter. The transmitter is essentially a traditional RF power
amplifier, be it solid state or tube-type. A high-power RF output system
filters the transmitter output signal and suppresses any spurious out-of-band
signals caused by transmitter non-linearities. The last link in the
transmitting chain is the antenna that broadcasts the full-power, on-channel
8-VSB DTV signal.
In the home receiver, the over-the-air
signal is demodulated by essentially applying in reverse the same principals
that we have already discussed. The incoming RF signal is received,
downconverted, filtered, then detected. Then the segment and frame syncs are
recovered. Segment syncs aid in receiver clock recovery and frame syncs are
used to train the adaptive ghost-canceling equalizer. Once the proper data
stream has been recovered, it is trellis decoded, deinterleaved, Reed-Solomon
decoded, and derandomized. The end result is the recovery of the original
MPEG-2 data packets. MPEG-2 decoding circuits reconstruct the video image for
display on the TV screen and Dolby Digital (AC-3) circuits decode the sound
information and drive the receiver loudspeakers. The home viewer "receives
his DTV" and the signal chain is complete.
Conclusion
The goal of this section has been to provide
some insight into the inner workings of the 8-VSB transmission system. Like
many things in life, 8-VSB can appear formidable at first, but is really quite
simple "once you get to know it." Hopefully the knowledge conveyed in
this section will dispel some of the fear factor that many NTSC engineers
experience when faced with the unknown world of digital TV broadcasting.
So what then is 8-VSB? Simply put, 8-VSB is
the future of American television. And the future doesn't have to be such a
scary thing.
References
[1] Davis, Robert and Twitchell, Edwin,
"The Harris VSB Exciter for Digital ATV" NAB 1996 Engineering
Conference. April 15-18, 1996.
[2] Citta, Richard and Sgrignoli, Gary,
"ATSC Transmission System: 8-VSB Tutorial" ITVS 1997 Montreux
Symposium. June 12-17, 1997.
[3] Totty, Ron, Davis, Robert and Weirather,
Robert, "The Fundamentals of Digital ATV Transmission" ATV Seminar in
Print. Harris Corporation Broadcast Division, 1995.
Acknowledgements
Special thanks go to Joe Seccia, Bob Plonka
and Ed Twitchell of Harris Corporation Broadcast Division for their
contributions of time, material and assistance in writing this section.