FreeCalypso > hg > gsm-codec-lib
diff doc/FR1-Rx-DTX @ 135:22601ae99434
doc/FR1-Rx-DTX article written
author | Mychaela Falconia <falcon@freecalypso.org> |
---|---|
date | Sun, 11 Dec 2022 07:51:10 +0000 |
parents | |
children | aa4cdab30dc8 |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/doc/FR1-Rx-DTX Sun Dec 11 07:51:10 2022 +0000 @@ -0,0 +1,160 @@ +At the level of provided functionality and architectural structure, ETSI GSM +specifications for DTX (discontinuous transmission) are very symmetric between +FR and EFR: the same DTX functionality is specified for both codecs, with the +same overall architecture. However, there is one important difference: in the +case of EFR the complete implementation of all DTX functions (for both Tx and +Rx) forms an integral and inseparable part of the reference codec (implemented +in C) from the beginning, whereas in the case of FR1 the addition of DTX is +somewhat of an afterthought. GSM 06.10 defines a "pure" FR codec without any +DTX functions, and this most basic spec can be and has been implemented in this +"pure" form - classic Unix libgsm from 1990s is a proper, fully compliant +implementation of GSM 06.10, but only this spec, without any DTX. In contrast, +there has never existed a "pure" implementation of GSM 06.60 EFR codec without +associated Tx and Rx DTX functions. Furthermore, there is an important +distinction between Tx and Rx DTX handlers for FR1: + +* Anyone who seeks to implement Tx DTX for FR1 would have to dig into the guts + of GSM 06.10 encoder and augment it with VAD and SID encoding functions per + GSM 06.32 and 06.12 specs. + +* In contrast, the Rx DTX handler for FR1 is modular: the way it is specified + in GSM 06.11, 06.12 and 06.31 is a front-end to unmodified GSM 06.10 decoder. + On the Rx side, the interface from the radio subsystem to the Rx DTX handler + consists of 260 bits of frame plus BFI and TAF flags (the spec also defines a + SID flag, but it is determined from frame payload bits), and then the + interface from the Rx DTX handler to the GSM 06.10 decoder is another FR frame + of 260 bits. + +What are the implications of this situation for the GSM published-source +software community? Prior to the present libgsmfrp offering, there has always +been libgsm, but no Rx DTX handler. If you are working with a GSM uplink RTP +stream from a BTS or a GSM downlink frame stream read out of TI Calypso DSP or +some other GSM MS PHY, feeding that stream directly to libgsm (without passing +through an Rx DTX handler) is NOT acceptable: a "bare" GSM 06.10 decoder won't +recognize SID frames and won't produce the expected comfort noise output, and +what are you going to do in those 20 ms windows in which no good traffic frame +was received? The situation becomes especially bad (unkind on ears) if you are +reading received downlink frames out of TI Calypso DSP: the DSP's buffer will +have *some* bit content in every 20 ms window, but naturally this bit content +will be garbage during those frame windows when no good frame was received; +feeding that garbage to libgsm produces noises that are very unkind on ears. + +The correct solution is to implement an Rx DTX handler, pass the stream of +frames and flags from the BTS or the MS PHY to this handler first, and then pass +the output of this handler to libgsm 06.10 decoder. Themyscira libgsmfrp is a +Free Software implementation of Rx DTX handler for GSM FR, implementing SID +classification, comfort noise generation and error concealment. + +Effect of extra preprocessing +============================= + +One key detail deserves extra emphasis before going into library API details: +if the input to libgsmfrp consists entirely of good speech frames (no SID frames +and no BFIs), then the preprocessor becomes an identity transform. Therefore, +if the output of our libgsmfrp preprocessor were to be fed to an additional +instance of the same further down the processing chain, no extra transformation +of any kind will happen. + +Using libgsmfrp +=============== + +The external public interface to Themyscira libgsmfrp consists of a single +header file <gsm_fr_preproc.h>; it should be installed in the same system +include directory as <gsm.h> from libgsm. Please note that <gsm_fr_preproc.h> +includes <gsm.h>, as needed for gsm_byte and gsm_frame defined types. + +The dialect of C we chose for libgsmfrp is ANSI C (function prototypes), const +qualifier is used where appropriate; however, unlike libgsmefr, the interface +to libgsmfrp is defined in terms of gsm_byte type defined in <gsm.h>, included +from <gsm_fr_preproc.h>. + +State allocation and freeing +============================ + +The Rx DTX handler is stateful, hence you will need to allocate a preprocessor +state structure in addition to the usual libgsm state structure for your GSM FR +Rx session. The necessary function is: + +extern struct gsmfr_preproc_state *gsmfr_preproc_create(void); + +struct gsmfr_preproc_state is an opaque structure to library users: you only get +a pointer which you remember and pass around, but <gsm_fr_preproc.h> does not +give you a full definition of this struct. As a library user, you don't even +get to know the size of this struct, hence the necessary malloc() operation +happens inside gsmfr_preproc_create(). However, the structure is malloc'ed as +a single chunk, hence when you are done with it, simply call free() on the +pointer you got from gsmfr_preproc_create(). + +gsmfr_preproc_create() can fail if the malloc() call inside fails, in which case +it returns NULL. + +Preprocessing good frames +========================= + +For every good traffic frame (BFI=0) you receive from the radio subsystem, you +need to call this preprocessor function: + +extern void gsmfr_preproc_good_frame(struct gsmfr_preproc_state *state, + gsm_byte *frame); + +The second argument is both input and output, i.e., the frame is modified in +place. If the received frame is not SID (specifically, if the SID field +deviates from the SID codeword by 16 or more bits, per GSM 06.31 section 6.1.1), +then the frame (considered a good speech frame) will be left unmodified (i.e., +it is to be passed unchanged to the GSM 06.10 decoder), but preprocessor state +will be updated. OTOH, if the received frame is classified as either valid or +invalid SID per GSM 06.31, then the output frame will contain comfort noise +generated by the preprocessor using a PRNG, or a silence frame in one particular +corner case. + +GSM-FR RTP (or libgsm) 0xD magic: the upper nibble of the first byte can be +anything on input to gsmfr_preproc_good_frame(), but the output frame will +always have the correct magic in it. + +Handling BFI conditions +======================= + +If you received a lost/missing frame indication instead of a good traffic frame, +call this preprocessor function: + +extern void gsmfr_preproc_bfi(struct gsmfr_preproc_state *state, int taf, + gsm_byte *frame_out); + +TAF is a flag defined in GSM 06.31 section 6.1.1; if you don't have this flag, +pass 0 - you will lose the function of comfort noise muting in the event of +prolonged SID loss, but all other Rx DTX functions will still work the same. + +With this function the 33-byte frame buffer is only an output, i.e., prior +buffer content is a don't-care and there is no provision for making any use of +erroneous frames like in EFR. The frame generated by the preprocessor may be +substitution/muting, comfort noise or silence depending on the state. + +Other miscellaneous functions +============================= + +extern void gsmfr_preproc_reset(struct gsmfr_preproc_state *state); + +This function resets the preprocessor state to what it is right out of +gsmfr_preproc_create(), which is naturally just a combination of malloc() and +gsmfr_proproc_reset(). Given that our Rx DTX handler state is much simpler +than, for example, EFR codec state, there does not seem to be any need for +explicit resets, but the reset function is made public for the sake of +completeness. + +extern int gsmfr_preproc_sid_classify(const gsm_byte *frame); + +This function analyzes an RTP-encoded FR frame (the upper nibble of the first +byte is NOT checked for 0xD signature) for the SID codeword of GSM 06.12 and +classifies the frame as SID=0, SID=1 or SID=2 per the rules of GSM 06.31 +section 6.1.1. + +Silence frame datum +=================== + +extern const gsm_frame gsmfr_preproc_silence_frame; + +Many implementors make the mistake of thinking that a GSM FR silence frame is a +frame of 260 zero bits, but the official specs disagree: the silence frame given +in GSM 06.11 (3GPP TS 46.011, at the very end of the spec) is quite different. +Themyscira libgsmfrp implements the correct silence frame per the spec, and that +datum is also made public.