-
2006-05-09
10/023,543
2001-12-17
US 7,042,964 B2
2006-05-09
-
-
Mohammad Ghayour | Krista M. Flanagan
2023-10-26
A Viterbi decoder includes a number of classical Add-Compare-Select units and a number of further Add-Compare-Select unit having a lower complexity butterfly unit (300) having only two adder means, such that the further Add-Compare-Select unit has a butterfly unit (300) comprising: first adder means (310) for receiving a first path metric and a branch metric and for producing at its output the addition thereof; and second adder means (320) for receiving a second path metric and said branch metric and for producing at its output the addition thereof. First comparator means (330) are coupled to receive the output of the second adder means and coupled to receive the first path metric for comparing therebetween. Second comparator means (340) are coupled to receive the output of the first adder means and coupled to receive the second path metric for comparing therebetween. First selection means (350) for selecting between the second adder means output and the first path metric produce a first survivor path metric in dependence on the first comparator means comparison. Second selection means (360) for selecting between the first adder means output and the second path metric signal produce a second survivor path metric in dependence on the second comparator means comparison. Only two adder means are used for processing metric transitions as a second branch metric is identified as having a value of zero.
Get notified when new applications in this technology area are published.
H03D1/00 IPC
Demodulation of amplitude-modulated oscillations
H04L27/06 IPC
Modulated-carrier systems; Amplitude-modulated carrier systems, e.g. using on-off keying; Single sideband or vestigial sideband modulation Demodulator circuits; Receiver circuits
This invention relates to Viterbi decoding. Viterbi decoding is commonly used in the receiving side of digital communication systems where potentially disrupted signals (e.g., disrupted by a fading channel, noise, etc.) must be decoded. Such signals are typically the result of bit-streams that have been encoded using convolutional codes and modulated for transmission, and such received encoded signals are typically decoded using a maximum-likelihood algorithm, generally based on the ‘Viterbi algorithm’.
In considering the Viterbi algorithm, two aspects in particular must be considered: the ‘Metric Calculation’ and the ‘Viterbi decoder’ itself. The theory of both of these aspects, involving calculation of branch, node and path metrics between different trellis nodes, is well known and ubiquitously applied in the field of digital communications.
The main problem of the Viterbi algorithm lies in its arithmetical decoding complexity (thus leading to high power consumption, etc., which is a paramount consideration in battery-operated portable communication devices). A lot of research has been done with the aim of reducing complexity associated with the Viterbi algorithm.
However, this research has invariably not taken into account the needs of ‘broadband communications’ systems. In these systems account must be taken of the very high bit rates involved, which require adaptation of the Viterbi algorithm for efficient maximum-likelihood decoding.
Standard implementations of the Viterbi algorithm are distinctly sub-optimum for ‘Broadband Communication’ systems because:
A need therefore exists for a Viterbi decoder, unit therefor and method wherein the abovementioned disadvantage(s) may be alleviated.
In accordance with a first aspect of the present invention there is provided a Viterbi decoder as claimed in claim 1.
In accordance with a second aspect of the present invention there is provided a method of producing metrics, for use in a Viterbi decoder, as claimed in claim 4.
In accordance with a third aspect of the present invention there is provided a butterfly unit, for use in a Viterbi decoder Add-Compare-Select unit, as claimed in claim 11.
One Viterbi decoder incorporating the present invention will now be described, by way of example only, with reference to the accompanying drawing(s), in which:
FIG. 1 shows a block schematic representation of a classical implementation of the Viterbi algorithm;
FIG. 2 shows a schematic representation of a classical ACS ‘butterfly’ unit;
FIG. 3 shows a schematic representation of a new ACS ‘butterfly’ unit in accordance with the invention;
FIG. 4 shows functional representations of four types of ACS ‘butterfly’ units which may be used in a Viterbi decoder in accordance with the invention; and
FIG. 5 shows schematic representations of implementations of the four types of ACS ‘butterfly’ units of FIG. 4
The following description, explanation and associated drawings are based (for the sake of example) on use of an encoder whose code rate is of the type R=1/m, with m integer. However, it will be understood that the invention is not limited to such an encoder type and may be more generally applied, e.g., to cases of code rate type R=k/m, where k (>1) and m are integer.
Convolutional codes are commonly used in digital communication systems in order to encode a bit-stream before transmission. In the receiver, a deconvolution has to be performed on the received symbols that have been possibly corrupted by fading due to a multipath channel and by additive noise. A classical implementation of the Viterbi algorithm, as shown in FIG. 1, to perform a Maximum-Likelihood decoding of the received data consists of three blocks:
The present invention concerns techniques for reducing the complexity of a Viterbi decoder.
Briefly stated, the present invention provides a new ACS unit that may be used at certain positions in a Viterbi decoder to simplify the processing required, and provides certain new metrics for use with the new ACS units to decrease the overall complexity of Viterbi decoding.
The critical element in a Viterbi decoder is usually the ACS unit, of which a typical example is shown in FIG. 2. Generally,
N
2
ACS butterfly operations have to be performed per trellis transition if a N-state convolutional encoder is used. In a high-speed application, all
N
2
or at least some
(
for
example
,
a
number
P
between
1
and
N
2
)
ACS butterflies have to work in parallel, requiring an important amount of chip surface in the case of a hardware implementation. Consequently, the power consumption of the ACS units is important compared to the total consumption of the decoder.
For the ‘HIPERLAN/2’ standard, for example, massive parallel structures are necessary in order to guarantee the required bit-rates (up to 54 MBits/s. Even if all ACS units are working in parallel in order to decode 1 bit per clock cycle, a minimum clock speed of 54 MHz is mandatory.
In order to reduce the complexity of Viterbi decoding, the following is proposed:
Following these proposals produces the advantages that:
R
=
1
2
(this type of code rate, together with a constraint length of K=7, leads to a convolutional code that is commonly used, for example by the ‘BRAN HIPERLAN/2’ standard), 50% of all classical butterflies can be substituted by the optimised ones leading to approximately 8% gain in surface/complexity compared to that of a Viterbi decoder using only the conventional butterfly configuration.
N
2
ACS butterflies must be implemented. It is possible to find hybrid structures where a number of butterflies between 1 and
N
2
are implemented and reused once or several times per transition. So, a trade-off is possible between decoding speed and chip surface in a hardware implementation.
The following discussion explains adaptation of metrics in general to suit the new ACS butterfly unit of FIG. 3. As can be seen, in the form shown in FIG. 3 the new butterfly unit 300 has one adder 310 for adding the path metric 1 and branch metric 2, and another adder 320 for adding the path metric 2 and branch metric 2. A comparator 330 compares the output of the adder 320 and the path metric 1, and a comparator 340 compares the output of the adder 310 and the path metric 2. A selector 350 selects between the output of the adder 320 and the path metric 1, dependent on the comparator 330, to produce the survivor path metric 1; a selector 360 selects between the output of the adder 310 and the path metric 2, dependent on the comparator 340, to produce the survivor path metric 2. It is to be noted that only one branch metric value (as shown, branch metric 2) is used in the butterfly unit 300.
Considering a convolutional encoder based on a code rate
R
=
1
m
with m integer, m encoded bits are output by the encoder at each transition. These m bits appear in the decoder as metrics m1(bit=0), m1(bit=1), m2(bit=0), m2(bit=1), . . . , mm(bit=0), mm(bit=1). Per trellis transition, there are l=2m different branch metrics possible:
mb1=m1(bit=0)+m2(bit=0)+ . . . +mm(bit=0)
mb2=m1(bit=1)+m2(bit=0)+ . . . +mm(bit=0)
. . .
mbl=m1(bit=1)+m2(bit=1)+ . . . +mm(bit=1)
Assuming that positive and negative branch metrics are possible, any branch metric mbaε(mb1, mb2, . . . , mbl) may be chosen and subtracted from all other branch metrics. The new resulting branch metrics are thus:
mb1=mb1−mba=m1(bit=0)+m2(bit=0)+ . . . +mm(bit=0)−mba
mb2=mb2−mba=m1(bit=1)+m2(bit=0)+ . . . +mm(bit=0)−mba
. . .
mba=mba−mba=0
. . .
mbl=mbl−mba=m1(bit=1)+m2(bit=1)+ . . . +mm(bit=1)−mba
Considering now the inputs to the ACS unit, there are in any case two path (or node) metrics Mnode1 and Mnode 2 as well as two branch metrics mbranch1 ε(mb1, mb2, . . . , mbl) and mbranch2 ε(mb1, mb2, . . . , mb1) at the input of the ACS unit. Two cases have to be considered separately:
This rule is based on the typically valid observation that the encoder output bits remain unchanged if both, the input bit to the encoder and the most significant bit (MSB) of the encoder state are inverted.
In general, this method has the disadvantage that the resulting metrics mb1, mb2, . . . , mbl might have a larger dynamic range than the classical metrics m1, m2, . . . , ml. However, the following discussion progresses from the above general case to a slightly specialised case where this disadvantage is resolved.
The only restriction that is imposed on the metrics in the following specialisation is
ma(bit=0)=−ma(bit=1) ∀a
where the expression “∀a” stands for “for all valid a”. That is to say, assuming a bit “0” has been sent, a metric “ma(bit=0)” is produced. The metric corresponding to the assumption that a bit “1” has been sent instead is simply calculated by multiplying the previous result by “−1”. This is valid for “all valid a”.
Now, the l=2m different branch metric can be presented as follows:
mb1=m1(bit=0)+m2(bit=0)+ . . . +mm(bit=0)
mb2=−m1(bit=0)+m2(bit=0)+ . . . +mm(bit=0)
. . .
mbl=−m1(bit=0)−m2(bit=0)− . . . −mm(bit=0)
If any metric mbaε(mb1, mb2, . . . , mbl) is chosen among them and subtracted from all metrics mb1, mb2, . . . , mbl, the resulting metrics mb1, mb2, . . . , mbl are
m = b 1 = { + 2 m 1 ( bit = 0 ) - 2 m 1 ( bit = 0 ) 0 } + { + 2 m 2 ( bit = 0 ) - 2 m 2 ( bit = 0 ) 0 } + … + { + 2 m l ( bit = 0 ) - 2 m l ( bit = 0 ) 0 } m = b 2 = { + 2 m 1 ( bit = 0 ) - 2 m 1 ( bit = 0 ) 0 } + { + 2 m 2 ( bit = 0 ) - 2 m 2 ( bit = 0 ) 0 } + … + { + 2 m l ( bit = 0 ) - 2 m l ( bit = 0 ) 0 } ⋯ m = bl = { + 2 m 1 ( bit = 0 ) - 2 m 1 ( bit = 0 ) 0 } + { + 2 m 2 ( bit = 0 ) - 2 m 2 ( bit = 0 ) 0 } + … + { + 2 m l ( bit = 0 ) - 2 m l ( bit = 0 ) 0 }
Each contribution ±mx(bit=0) is either multiplied by 2 or set to 0. Since all metrics can be multiplied by a constant factor without changing the decision path of the Viterbi decoder, mb1, mb2, . . ., mbl shall be multiplied by
1 2 .
Then, we find l=2m new metrics adapted to the new ACS units that require neither more complex metric calculation nor a higher dynamic range:
m = b 1 = { + m 1 ( bit = 0 ) - m 1 ( bit = 0 ) 0 } + { + m 2 ( bit = 0 ) - m 2 ( bit = 0 ) 0 } + … + { + m l ( bit = 0 ) - m l ( bit = 0 ) 0 } m = b 2 = { + m 1 ( bit = 0 ) - m 1 ( bit = 0 ) 0 } + { + m 2 ( bit = 0 ) - m 2 ( bit = 0 ) 0 } + … + { + m l ( bit = 0 ) - m l ( bit = 0 ) 0 } ⋯ m = bl = { + m 1 ( bit = 0 ) - m 1 ( bit = 0 ) 0 } + { + m 2 ( bit = 0 ) - m 2 ( bit = 0 ) 0 } + … + { + m l ( bit = 0 ) - m l ( bit = 0 ) 0 }
In OFDM (Orthogonal Frequency Division Multiplex) systems, the metrics are very often calculated based on symbols which have been constructed using BPSK (Binary Phase Shift Keying), QPSK (Quadrature Phase Shift Keying), QAM (Quadrature Amplitude Modulation)-16, QAM (Quadrature Amplitude Modulation)-64 or similar constellations. U.S. Pat. No. 5,742,621, 1998 (MOTOROLA) presents a very efficient implementation of the known BPSK/QPSK metrics:
| TABLE 1 |
| Metrics |
| Constella- | |
| tion | Metric |
| BPSK | m(b1 = 0) = −m(b1 = 1) = sign(real(z1)) · real(y1 · H1*) |
| QPSK | m(b1 = 0) = −m(b1 = 1) = sign(real(z1)) · real(y1 · H1*) |
| m(b2 = 0) = −m(b2 = 1) = sign(imag(z1)) · imag(y1 · H1*) | |
In the example metrics of Table 1, z1 is the complex transmitted symbol, H1* is the complex conjugate of the channel coefficient and y1=H1·z1+ν is the received complex symbol with ν being additive white gaussian noise (AWGN). For QAM-16, QAM-64, etc., similar metrics can be derived. These metrics are especially important in the framework of OFDM systems.
For this example, a code rate of
R
=
1
2
,
a constraint length of K=7 and a convolutional encoder based on the generator polynomials G1=133OCT, G2=171OCT is assumed. The non-optimised BPSK metrics may be defined for example as
mb1=m1(bit=0)+m2(bit=0)=sign(real(z1))·real(y1·H1*)+sign(real(z2))·real(y2·H2*)
mb2=m1(bit=1)+m2(bit=0)=−sign(real(z1))·real(y1·H1*)+sign(real(z2))·real(y2·H2*)
mb3=m1(bit=0)+m2(bit=1)=sign(real(z1))·real(y1·H1*)−sign(real(z2))·real(y2·H2*)
mb4=m1(bit=1)+m2(bit=1)=−sign(real(z1))·real(y1·H1*)−sign(real(z2))·real(y2·H2*)
Choosing for example ma=mb1, the optimsed metrics are
m _ _ b1 = 1 2 ( m b1 - m ba ) = 0 m _ _ b2 = 1 2 ( m b2 - m ba ) = - sign ( real ( z 1 ) ) · real ( y 1 · H 1 * ) m _ _ b3 = 1 2 ( m b3 - m ba ) = - sign ( real ( z 2 ) ) · real ( y 2 · H 2 * ) m _ _ b4 = 1 2 ( m b4 - m ba ) = - sign ( real ( z 1 ) ) · real ( y 1 · H 1 * ) - sign ( real ( z 2 ) ) · real ( y 2 · H 2 * )
All 1−1=2m−1 non-zero metrics are pre-calculated by the Transition Metric Unit (TMU). Altogether there are l=2m different ACS butterflies (the two butterfly entries are not independent, which is why not all metric combinations are mixed and the number of different butterflies is limited to l=2m). With K being the constraint length of the convolutional encoder, there are
2
K
-
1
2
m
=
2
K
-
m
-
1
ACS butterflies having a zero-metric as an input. Here, the new, optimised butterfly of FIG. 3 can be applied.
It should be noted that the new metrics mb1, mb2, mb3, mb4 are less complex (2 multiplications, 1 addition) than the classical ones mb1, mb2, Mb3, Mb4(2 multiplications, 2 additions, 2 sign inversions).
The resulting four ACS butterflies are presented by FIG. 4 for a convolutional code of constraint length K=7 and for the metrics presented in Table 1.
In FIG. 4, the following notations have been used:
FIG. 5 shows equivalent schematic representations of implementations of the four ACS butterflies of FIG. 4. As will be seen, the low complexity ACS butterflies Type I and Type II are similar to that of FIG. 3, and similar to each other (the input signals ‘path metric 1’ and ‘path metric 2’ being interchanged between the Type I and Type II butterflies). Also, as will be seen, the higher complexity ACS butterflies Type III and Type IV are similar to that of FIG. 2 and similar to each other (the input signals ‘metric mb2’ and ‘metric mb3’ being interchanged between the Type III and Type IV butterflies).
In the upper section, additive Gaussian noise of a constant mean noise power σnoise2 with a mean value μnoise=0 has been assumed. In the case of a non-zero mean value, the mean value μnoise≠0 is simply subtracted from the received symbols. Using the notations of example 1, the received symbol is in this case
y1=[H1·z1+v]−μnoise=H1·z1+(v−μnoise).
Now, (v−μnoise) can be considered as zero-mean and the metrics can be used as before.
If the mean noise power depends on the received symbol (σnoise2→|cn|2σnoise2), the new metrics must be divided by the corresponding gain:
m = ba → m = ba c n ( a ) 2 ∀ a .
Respecting these rules, the metrics can also be used in coloured noise environments.
In general, the placements of the different butterfly types are found by the following exhaustive search:
Practically, the ACS structure can be exploited in different ways:
Based on the exhaustive search proposed above, the four different ACS butterfly types shown in FIG. 4 and FIG. 5 are identified.
There are 2K-1=64 trellis states and correspondingly 64 path (or node) metric buffers. These buffers are connected to the ACS units as indicated by the following Table 2 (for the standard generator polynomials G1=133OCT, G2=171OCT of the convolutional encoder used by the HIPERLAN/2 standard).
It will be understood that 50% of all butterflies are of the type I and II (low complexity) and the other 50% are of the type III and IV (classical butterflies), and that the total saving in complexity is approx. 8% compared to the total complexity of the classical Viterbi decoder.
| TABLE 2 |
| ACS inputs for ‘HIPERLAN/2’ Viterbi decoder |
| Lower/Higher input state to ACS | ||
| Unit, corresponding to | ||
| (X1X2X3X4X50)bin/ | Butterfly Type | |
| (X1X2X3X4X51)bin | (see FIG. 4 and | |
| in decimal | FIG. 5) | Output state |
| 0 and 32 | Butterfly Type I | 0 and 1 |
| 1 and 33 | Butterfly Type III | 2 and 3 |
| 2 and 34 | Butterfly Type II | 4 and 5 |
| 3 and 35 | Butterfly Type IV | 6 and 7 |
| 4 and 36 | Butterfly Type II | 8 and 9 |
| 5 and 37 | Butterfly Type IV | 10 and 11 |
| 6 and 38 | Butterfly Type I | 12 and 13 |
| 7 and 39 | Butterfly Type III | 14 and 15 |
| 8 and 40 | Butterfly Type I | 16 and 17 |
| 9 and 41 | Butterfly Type III | 18 and 19 |
| 10 and 42 | Butterfly Type II | 20 and 21 |
| 11 and 43 | Butterfly Type IV | 22 and 23 |
| 12 and 44 | Butterfly Type II | 24 and 25 |
| 13 and 45 | Butterfly Type IV | 26 and 27 |
| 14 and 46 | Butterfly Type I | 28 and 29 |
| 15 and 47 | Butterfly Type III | 30 and 31 |
| 16 and 48 | Butterfly Type IV | 32 and 33 |
| 17 and 49 | Butterfly Type II | 34 and 35 |
| 18 and 50 | Butterfly Type III | 36 and 37 |
| 19 and 51 | Butterfly Type I | 38 and 39 |
| 20 and 52 | Butterfly Type III | 40 and 41 |
| 21 and 53 | Butterfly Type I | 42 and 43 |
| 22 and 54 | Butterfly Type IV | 44 and 45 |
| 23 and 55 | Butterfly Type II | 46 and 47 |
| 24 and 56 | Butterfly Type IV | 48 and 49 |
| 25 and 57 | Butterfly Type II | 50 and 51 |
| 26 and 58 | Butterfly Type III | 52 and 53 |
| 27 and 59 | Butterfly Type I | 54 and 55 |
| 28 and 60 | Butterfly Type III | 56 and 57 |
| 29 and 61 | Butterfly Type I | 58 and 59 |
| 30 and 62 | Butterfly Type IV | 60 and 61 |
| 31 and 63 | Butterfly Type II | 62 and 63 |
In conclusion, it will be understood that the Viterbi decoder described above provides the following advantages:
The proposed technique may be used for any Viterbi decoder in general. However, it is especially interesting for OFDM systems, since the resulting optimised metrics do not require any additional precision, at least if the metric calculation is performed adequately, as presented by the example of Table 1.
The technique is especially interesting for a coding rate of R=½, since 50% of all ACS butterflies can be substituted by low-complexity, optimised ACS butterflies. For smaller coding rates, this percentage decreases exponentially.
The applications of the new method are principally found in high-speed applications where massive-parallel structures are required. Here, the savings in complexity/surface/power-consumption are maximal.
1. A Viterbi decoder including a number of classical Add-Compare-Select units and a number of further Add-Compare-Select unit having a lower complexity butterfly unit (300) having only two adder means, such that the further Add-Compare-Select unit comprises:
first adder means (310) for receiving a first path metric and a branch metric and for producing at its output the addition thereof;
second adder means (320) for receiving a second path metric and said branch metric and for producing at its output the addition thereof;
first comparator means (330) coupled to receive the output of the second adder means and coupled to receive the first path metric for comparing therebetween;
second comparator means (340) coupled to receive the output of the first adder means and coupled to receive the second path metric for comparing therebetween;
first selection means (350) for selecting between the second adder means output and the first path metric to produce a first survivor path metric in dependence on the first comparator means comparison; and
second selection means (360) for selecting between the first adder means output and the second path metric signal to produce a second survivor path metric in dependence on the second comparator means comparison,
for processing metric transitions via the lower complexity butteffly unit only where a second branch metric is zero.
2. The Viterbi decoder of claim 1 adapted for code rates of the type R=k/m, where k>1, and k and m are integers.
3. A method of producing metrics for use in a Viterbi decoder comprising a number of classical Add-Compare-Select units having a butterfly unit and a number of further Add-Compare-Select unit(s) having a lower complexity butterfly unit (300) wherein the method comprises the step of:
determining when a branch metric is zero, and in response thereto the method comprises the steps of:
selecting the lower complexity butterfly unit (300) to perform only two adding steps comprising:
adding a first path metric and a branch metric to produce a first addition;
adding a second path metric and said branch metric to produce a second addition;
the method further comprising the steps of:
comparing the second addition and the first path metric to produce a first comparison;
comparing the first addition with the second path metric to produce a second comparison;
selecting between the second addition and the first path metric to produce a first survivor path metric in dependence on the first comparison; and
selecting between the first addition and the second path metric to produce a second survivor path metric in dependence on the second comparison.
4. The method of claim 3 wherein the metrics are selected by subtracting from each of a predetermined set of metrics a chosen one thereof to produce a resultant set of metrics having at least one zero value for processing by the lower complexity butterfly unit (300).
5. The method of claim 3 further comprising re-adjusting the dynamic range of the selected metrics by multiplying each of the selected metrics by a scaling factor if the following property is satisfied:
ma(bit=0)=−ma(bit=1)∀a.
6. The method of claim 5 adapted for Orthogonal Frequency Division Multiplexed (OFDM) coding.
7. The method of claim 5 further comprising adapting the selected metrics to additive noise.
8. The method of claim 7 wherein the additive noise comprises coloured noise.
9. The method of claim 3 adapted for code rates of the type R=k/m, where k>1, and k and m are integers.
10. A butterfly unit for use in a Viterbi decoder Add-Compare-Select unit, the butterfly unit (300) only two adder means comprising:
first adder means (310) for receiving a first path metric and a branch metric and for producing at its output the addition thereof;
second adder means (320) for receiving a second path metric and said branch metric and for producing at its output the addition thereof;
and further comprising
first comparator means (330) coupled to receive the output of the second adder means and coupled to receive the first path metric for comparing therebetween;
second comparator means (340) coupled to receive the output of the first adder means and coupled to receive the second path metric for comparing therebetween;
first selection means (350) for selecting between the second adder means output and the first path metric to produce a first survivor path metric in dependence on the first comparator means comparison; and
second selection means (360) for selecting between the first adder means output and the second path metric signal to produce a second survivor path metric in dependence on the second comparator means comparison.
11. The butterfly unit of claim 10 adapted for code rates of the type R=k/m, where k>1, and k and m are integers.