FreeCalypso > hg > gsm-codec-lib
changeset 236:4c7d0dc1eecb
doc/PCM8-conversions: finish encoding description
author | Mychaela Falconia <falcon@freecalypso.org> |
---|---|
date | Mon, 08 May 2023 03:28:40 +0000 |
parents | 0ee1a66c1846 |
children | e4a4bf11f37c |
files | doc/PCM8-conversions |
diffstat | 1 files changed, 67 insertions(+), 0 deletions(-) [+] |
line wrap: on
line diff
--- a/doc/PCM8-conversions Mon May 08 00:45:26 2023 +0000 +++ b/doc/PCM8-conversions Mon May 08 03:28:40 2023 +0000 @@ -101,3 +101,70 @@ 2's complement input), there are no less than 3 different ways to implement this encoder! +Let us now look at the 3 different ways of encoding a 14-bit or 16-bit 2's +complement linear PCM input to G.711 mu-law. In this analysis we shall use +14-bit notation, with 2's complement inputs contained in the domain +[-8192,8191]. The difference between the 3 identified ways of mapping from this +domain to mu-law have to do with boundaries between quantization intervals. +Tables 2a and 2b in the G.711 spec list all defined quantization intervals and +all decision values that mark boundaries between them; here is a digested form +of the beginning of the canonical quantization table for either side of zero: + +Quantization interval Quantized value +(range of magnitudes) (absolute) +--------------------------------------- +0-1 0 +1-3 2 +3-5 4 +... +29-31 30 +31-35 33 +35-39 37 +... +91-95 93 +95-103 99 +... + +This canonical quantization table is defined in terms of absolute values, and +is therefore fully symmetric around zero. A careful look at the above table +raises a question: which quantization interval (and thus which PCMU octet +output) should be selected if the input value to the encoder has a magnitude +exactly equal to one of the threshold points, or decision values as they are +officially called? In other words, what should the encoder do if the magnitude +of the 14-bit input value equals 1, 3, 5, ..., 31, 35, 39 etc? The answer to +this question is where the 3 candidate mappings under our consideration differ: + +* The "compress" function of G.726 operating in mu-law mode selects the higher + (in absolute value) quantization interval at every decision value threshold, + on both sides of zero: see Table 15/G.726. PCMU octet 0x7F (meaning -0) will + never be emitted by this version, and an input sequence of -3, -2, -1, 0, 1, + 2, 3 will map to quantized values -4, -2, -2, 0, 2, 2, 4. + +* The ulaw_compress() function in G.191 STL behaves like the G.726 version for + positive values, but selects the smaller-absolute-value quantization interval + for negative inputs. Given the same input sequence as above, the output will + correspond to quantized values -2, -2, -0, 0, 2, 2, 4. (Quantized value -0 + is PCMU octet 0x7F.) + +* The s2u[] table in toast_ulaw.c in libgsm source is flat-out wrong and should + not be used or considered further (and because those authors did not include + the source for whatever program they used to generate their broken s2u[] and + u2s[] tables, we have no way to really analyze them), but one CAN construct a + new table for the same function, using the upper 13 bits of 16-bit 2's + complement input to generate PCMU output - see our dev/s2u-regen.c program + and its output table in dev/s2u-regen.out. The resulting mapping is + "mirrored" around zero compared to G.191 STL ulaw_compress(): for the same + input sequence as in the previous two examples, the output will correspond to + quantized values -4, -2, -2, 0, 0, 2, 2. Just like the G.726 version, this + look-up table version will never emit PCMU octet 0x7F for -0. + +It is important to note that all GSM speech decoders produce 2's complement +outputs that are only 13 bits wide, not 14 - therefore, when the input to the +G.711 encoder comes from the output of a GSM speech decoder, the difference +between all 3 alternatives listed above is masked, with all 3 producing +identical output. + +For production software, our (Themyscira) recommendation is to use look-up +tables (dev/s2a-regen.out and dev/s2u-regen.out) for both A-law and mu-law +encoding, using the upper 12 bits from 16-bit 2's complement input for A-law +encoding and the upper 13 bits for mu-law encoding.