view doc/Codec-utils @ 408:8847c1740e78

libtwamr: integrate VAD1
author Mychaela Falconia <falcon@freecalypso.org>
date Tue, 07 May 2024 00:56:10 +0000
parents a053cf0bac04
children b094bc07051a
line wrap: on
line source

Standalone command line utilities for FR and EFR codecs
=======================================================

The pre-existing FOSS opencore-amr package includes amrnb-enc and amrnb-dec test
programs: the first reads linear PCM from a WAV file and emits AMR encoder
output in a .amr file (RFC 4867 AMR storage format), the second reads this .amr
format and emits AMR decoder output as WAV.  Inspired by these simple test
programs, the present package offers equivalent command line utilities for GSM
FR and EFR codecs.  Here they are:

gsmfr-encode	This utility reads linear PCM from a WAV file, runs the
		bit-exact GSM 06.10 encoder and writes the output in the
		classic .gsm format (directly abutted FR codec frames of 33
		bytes each).  We don't currently have a Tx-side DTX
		implementation (VAD etc) for GSM-FR, hence the output from
		gsmfr-encode will always consist of good speech frames only.

gsmfr-decode	This utility reads our gsmx format (see Binary-file-format
		article), which is a superset of the classic libgsm format.
		The input to gsmfr-decode may be a pure .gsm recording as
		produced by gsmfr-encode or toast from libgsm package, or it
		can also contain SID frames and/or BFI markers.  The processing
		performed by gsmfr-decode begins with our FR1 Rx DTX handler
		preprocessor, which will be an identity transform for pure .gsm
		input (most of the time) but becomes important for real-world
		input containing SIDs and BFIs, and is followed by the bit-exact
		GSM 06.10 decoder.  The decoded output is written as WAV.

gsmefr-encode	This utility reads linear PCM from a WAV file, runs our EFR
		encoder (Themyscira libgsmefr) and writes the output in our gsmx
		format.  There is an option to enable or disable DTX: -d enables
		DTX, otherwise it is disabled.  (This option mirrors amrnb-enc.)

gsmefr-decode	This utility reads our gsmx format (which must be EFR, not FR1)
		and feeds all frames and BFIs to our EFR decoder.  The decoded
		output is written as WAV.

The above are original programs that read WAV input for encoding and write WAV
output from decoding.  We now also have raw versions that read and write our
"robe" (raw big-endian) format instead:

gsmfr-encode-r	Just like gsm[e]fr-encode, but reading "robe" instead of WAV.
gsmefr-encode-r

gsmfr-decode-r	Just like gsm[e]fr-decode, but writing "robe" instead of WAV.
gsmefr-decode-r

Please see PCM-file-formats article for the rationale.

Additions for libgsmfr2
=======================

With the introduction of libgsmfr2, gsmfr-* codec utilities have undergone some
changes:

* gsmfr-decode and gsmfr-decode-r now implement the optional decoder homing
  feature, detecting and acting upon GSM 06.10 decoder homing frames.

* gsmfr-encode-r takes an optional -h flag that enables the encoder homing
  function; it is disabled by default.  The same feature was not replicated in
  WAV-reading gsmfr-encode, as WAV format is poorly suited for tinkering-
  oriented bit-exact work.

* There is a new utility named gsmfr-decode-rb, where rb stands for "raw basic".
  This utility emits "robe" output like gsmfr-decode-r, but it performs only
  "basic" GSM 06.10 decoding, without the Rx DTX preprocessor step.  BFI frame
  gaps in input are not allowed, and there is no SID detection.