view doc/Binary-file-format @ 585:3c6bf0d26ee7 default tip

TW-TS-005 reader: fix maximum line length bug TW-TS-005 section 4.1 states: The maximum allowed length of each line is 80 characters, not including the OS-specific newline encoding. The implementation of this line length limit in the TW-TS-005 hex file reader function in the present suite was wrong, such that lines of the full maximum length could not be read. Fix it. Note that this bug affects comment lines too, not just actual RTP payloads. Neither Annex A nor Annex B features an RTP payload format that goes to the maximum of 40 bytes, but if a comment line goes to the maximum allowed length of 80 characters not including the terminating newline, the bug will be triggered, necessitating the present fix.
author Mychaela Falconia <falcon@freecalypso.org>
date Tue, 25 Feb 2025 07:49:28 +0000
parents f469bad44c0e
children
line wrap: on
line source

We (Themyscira Wireless) define our own binary file format for testing of GSM
06.10 (FR) and EFR codec functions; this format of ours is an extension of
classic .gsm format from libgsm/toast.  The original libgsm file format is a
directly abutted sequence of 33-byte libgsm frames, equivalent to RTP frames
for GSM FR, with the upper nibble of the first byte in each frame equal to 0xD,
serving as a signature.  We simply extend this idea: our version is still a
directly abutted sequence of binary records, but each record is now one of 3
possibilities:

- a 33-byte GSM FR frame in libgsm/RTP format, 0xD signature
- a 31-byte GSM EFR frame in RTP format (ETSI TS 101 318), 0xC signature
- a 2-byte Themyscira-extension BFI marker, 0xBF signature, see below

File reading functions begin by reading only one byte; this byte, once decoded,
tells us how many more bytes need to be read, and frame synchronization is thus
maintained.

The recommended filename suffix for extended-libgsm binary files in the present
format is .gsmx; of course dot-separated filename suffixes hold absolutely no
special meaning on Unix systems, but many developers still strongly prefer to
have them for psychological comfort.

Any gsmx file (FR or EFR) can be dumped in human-readable form with our
gsmrec-dump utility.  This utility turns every read frame from bytes into codec
parameters with gsmfr_unpack_to_array() or EFR_frame2params(), and then displays
those parameters in a sensible manner, with a per-frame header line followed by
4 lines of subframe parameters.

FR and EFR frames are not expected to be mixed in the same stream recording;
our low-level binary file reading function and gsmrec-dump will grok such mixing
just fine, but each higher-level test program (beyond gsmrec-dump) is expected
to be written for only one codec, either FR or EFR.

BFI marker format
=================

Every 20 ms frame in our gsmx files is either a good FR/EFR frame or a BFI (Bad
Frame Indication) marker.  The BFI marker format used in our gsmx file format is
the same format which we (Themyscira Wireless) previously used in our GSM RAN
RTP transport, before switching to our current TRAUlike RTP format.  This BFI
marker format is quite simple:

byte 0: 0xBF signature;
byte 1: least-significant bit encoding TAF per GSM 06.31 or GSM 06.81,
        section 6.1.1 in both documents; other bits are reserved.