freecalypso-docs: Voice-memo-feature annotate

annotate Voice-memo-feature @ 96:69061d044f05

Voice-memo-feature: new article

author	Mychaela Falconia <falcon@freecalypso.org>
date	Tue, 27 Dec 2022 20:23:55 +0000
parents
children	80f0996bfd16

rev	line source
96 69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	1 The full Calypso hw+fw solution as delivered by TI (the relevant components here
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	2 are the DSP, the official L1 code and RiViera Audio Service) implements an
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	3 interesting feature called voice memos. It is actually two paired features:
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	4
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	5 * Voice memo recording: in almost all states of the MS (no GSM network at all,
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	6 or idle mode, or in an active call) it is possible to activate an extra
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	7 instance of GSM 06.10 encoder that takes input from the microphone (and also
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	8 from the active call downlink if invoked during a speech call) and writes its
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	9 output into an otherwise-unused DSP buffer. The combination of L1 and RiViera
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	10 Audio Service then writes this speech recording into a file in FFS.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	11
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	12 * Voice memo playback: voice memo files recorded with the just-described VM
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	13 record feature can be played into the phone's speaker output. The operation
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	14 of playing a previously recorded voice memo is conceptually no different from
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	15 playing tones or melodies, and can likewise be done in any state: with no GSM
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	16 network at all, in idle mode, or in an active call.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	17
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	18 VM recording and VM playback cannot be active at the same time: they use the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	19 same DSP buffer, and likely other mutually exclusive DSP resources too.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	20 Furthermore, the same DSP buffer that is used for these VM features is also
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	21 used for TCH UL substitution debug/test feature described in the TCH-tap-modes
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	22 article - therefore, all 3 features (VM record, VM play and TCH UL play) need
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	23 to be treated as mutually exclusive in time. However, aside from this mutual
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	24 exclusion, it is very remarkable that VM recording or VM playback can be invoked
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	25 during an active speech call (which can use any codec!), and the extra instance
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	26 of FR1 encoder or decoder (always FR1) invoked by VM features is essentially
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	27 independent from the main TCH encoder and the main TCH decoder, all of which
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	28 run simultaneously. It is worth noting that all newer GSM speech codecs (HR1,
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	29 EFR and AMR) are much more computationally intensive than FR1, thus given that
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	30 the DSP has the necessary horsepower to run any one of those "heavy" codecs, it
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	31 probably isn't too much extra work to also run a simultaneous instance of
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	32 unidirectional (encoder only or decoder only) FR1.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	33
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	34 The entire voice memo facility was already fully implemented in the TCS211 code
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	35 delivery from TI, but prior to FreeCalypso there was no way to exercise it. In
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	36 order to exercise VM functionality in TCS211, one needs to invoke these RiViera
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	37 Audio Service API functions:
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	38
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	39 audio_vm_record_start()
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	40 audio_vm_record_stop()
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	41 audio_vm_play_start()
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	42 audio_vm_play_stop()
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	43
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	44 In FreeCalypso we've added some simple AT commands that call the just-listed API
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	45 functions, and the facility that has been there all along is now accessible to
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	46 play - it is the same situation as with Melody E1.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	47
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	48 FreeCalypso AT commands for voice memo testing
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	49 ==============================================
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	50
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	51 AT@VMR="/pathname",dur,dtx
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	52
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	53 This command initiates VM recording. The FFS pathname into which the recording
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	54 should be written must be given as a quoted string (and as a reminder, all FFS
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	55 pathnames must be absolute - there are no current directories in the firmware
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	56 architecture), and there is a second required argument that sets the maximum
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	57 size of the recording. The duration argument is a decimal integer, and it is
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	58 reckoned in 1000-word units: if you specify duration as 1, the maximum recording
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	59 size is 1000 words (2000 bytes), if you specify duration as 2, the maximum
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	60 recording size is 2000 words (4000 bytes), and so forth. If you record with DTX
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	61 disabled, each block of 1000 words corresponds to 1 second in time (every 20 ms
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	62 frame turns into a block of 20 words), thus with DTX disabled the duration
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	63 argument becomes the actual duration in seconds. However, if you record with
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	64 DTX enabled, then periods of silence will be written in a compressed format
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	65 described later in this article, and the time duration of the recording will
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	66 depend on how much silence there is.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	67
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	68 The dtx argument is 1 to enable DTX or 0 to disable it; the default is DTX
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	69 disabled. The employed FR1 DTX algorithm appears to be the same as would be
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	70 used for TCH/FS uplink, except that an "artificial" (there is no SACCH with
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	71 independent-of-GSM voice memos) TAF position is generated on every 16th audio
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	72 frame, i.e., every 320 ms. (Note the shortening of this SID interval compared
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	73 to official TCH, where it is 24 frames or 480 ms.)
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	74
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	75 AT@VMRS
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	76
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	77 This command stops any VM recording in progress, but it is rarely needed - the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	78 recording will stop automatically when the size limit is reached.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	79
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	80 AT@VMP="/pathname"
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	81
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	82 This command initiates playback of the VM recording contained in the named file
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	83 in FFS. The FFS pathname is the only argument.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	84
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	85 AT@VMPS
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	86
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	87 This command stops any VM playback in progress, but it is rarely needed - the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	88 playback will stop automatically when the end-marker is read from the file.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	89
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	90 Voice memo file format
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	91 ======================
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	92
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	93 Using fc-fsio, you can read out voice memo files written by the VM record
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	94 facility, and you can likewise construct your own memo files externally, upload
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	95 them into FC device FFS and then play them via the VM play facility. The format
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	96 of these files is determined by TI's firmware stack (RV Audio Service on top of
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	97 L1 on top of the DSP), but is fundamentally based on a DSP buffer that is just
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	98 like those used for TCH. The companion TCH-tap-modes article describes the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	99 format of the DSP buffer from which TCH DL bits can be read out; in the present
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	100 article we are going to cover the differences specific to the voice memo
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	101 facility.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	102
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	103 When VM recording is done with DTX disabled, every 20 ms speech frame turns into
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	104 a block of 40 bytes in the memo file. This block of 40 bytes is produced from
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	105 20 16-bit words in the DSP buffer, each word turned into two bytes in LE order
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	106 by the ARM part of Calypso. The DSP buffer used for the VM facility has the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	107 same overall format as the one used for TCH DL, described in the TCH-tap-modes
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	108 article - 3 status or header words followed by 17 words of payload, with the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	109 latter words carrying a 260-bit FR1 codec frame in the bit order of GSM 05.03
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	110 interface 1. As explained in the TCH-tap-modes article, speech codec payload
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	111 words are filled in the msb-to-lsb direction by the DSP, thus the natural byte-
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	112 oriented representation would be big-endian - but because the little-endian ARM
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	113 core sits in between the DSP and the on-media file format, the byte order in
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	114 voice memo files comes out "wrong". Oh well - it is what it is.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	115
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	116 Of the 3 header words that precede every 20 ms speech frame, words 1 and 2
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	117 appear to be dummies - they have meaning related to the channel decoder block
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	118 in the case of TCH DL, but in the case of isolated-from-GSM voice memos, there
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	119 does not seem to be any meaning. However, header or status word 0, consisting
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	120 of bit flags, is still important, but the bit flags for the VM facility are
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	121 different from those of TCH DL.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	122
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	123 When VM recording is done with DTX disabled, status word 0 is observed to always
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	124 equal 0xC400 on every frame. However, when DTX is enabled, the following bits
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	125 are seen in status word 0:
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	126
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	127 * Bit 15 will be set if this frame needs to be saved in its entirety, or cleared
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	128 if it is to be skipped. When VM recording code in L1S sees that the DSP has
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	129 delivered a frame with this status bit cleared, it will save only this status
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	130 word 0, i.e., 2 bytes will be written into the memo file instead of 40 bytes
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	131 for this 20 ms frame. On VM playback, the code likewise checks this bit to
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	132 see how many words need to be read from the file, so synchronization is
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	133 maintained.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	134
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	135 * Bit 14 appears to be the SP flag of GSM 06.31 section 5.1: set when a speech
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	136 frame has been generated, or cleared when a SID frame has been generated
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	137 instead.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	138
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	139 * Bit 11 is a TAF-like flag: when DTX is enabled, this bit is set in every 16th
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	140 frame generated by the DSP in the VM recording session, otherwise it is
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	141 cleared.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	142
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	143 * Bit 10 will always be set in every status word 0 that gets written to voice
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	144 memo files: this bit is set by the DSP when it has finished encoding a 20 ms
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	145 audio frame and is checked by L1S on every TDMA frame, serving as a
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	146 synchronization mechanism telling L1S when it needs to copy a speech frame
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	147 from the DSP to the memo file.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	148
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	149 When VM recording is done with DTX enabled, the recorded memo file will consist
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	150 of speech frames (header word 0xC400 or 0xCC00), SID frames (header word 0x8400
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	151 or 0x8C00) and skipped frames consisting of only the header word 0x0400, with
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	152 the remaining words omitted. There will always be a present (not skipped) frame
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	153 in every 16th position (0xCC00 or 0x8C00), thus no 0x0C00 frames are ever seen.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	154
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	155 Every voice memo binary file ends with a 0xFBFF end-marker word; this end-marker
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	156 is needed because TCS211 fw architecture exhibits a separation between the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	157 actual data reading and writing processes in L1S and the FFS read/write
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	158 interface provided by RiViera Audio Service, and because of this separation the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	159 operational code in L1S can't "see" an EOF condition at the file system level.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	160
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	161 FreeCalypso tools for decoding voice memo files
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	162 ===============================================
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	163
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	164 If you have recorded a voice memo with AT@VMR and then read it out with fc-fsio,
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	165 you can use additional FC tools to analyze it. The following tools are
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	166 available, split between FC host tools and GSM codec libs & utilities packages:
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	167
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	168 * fc-vm2hex converts a binary VM recording into ASCII hex format, similar to
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	169 the old (2016) TCH DL recording format before it was extended in late 2022.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	170 Every fully-written frame is emitted in the hex output as 3 space-separated
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	171 hex status words followed by a block of 66 hex digits giving the FR1 codec
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	172 frame in the unchanged bit order of TI's DSP, and every skipped frame (one
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	173 for which only status word 0 was written into the memo file) is emitted in
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	174 the hex output as just that one word.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	175
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	176 * gsmfr-dlcap-parse utility, originally written for parsing TCH DL capture
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	177 files, accepts TCH DL recording files in both old and new formats, and it also
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	178 accepts the output from fc-vm2hex as its input. The combination of fc-vm2hex
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	179 and gsmfr-dlcap-parse allows a developer or tinkerer to do thorough human
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	180 analysis of TCS211 VM recordings in both DTX-disabled and DTX-enabled modes.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	181
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	182 * There will soon be a new fc-vm2gsmx utility that will read binary VM recording
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	183 files (as you would read out with fc-fsio) and convert them into extended-
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	184 libgsm (gsmx) format defined in our GSM codec libraries & utilities package.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	185 This gsmx format is an extension of the classic libgsm (GSM 06.10) format,
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	186 adding the possibility of SID frames and BFI markers (frame gaps) in addition
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	187 to regular speech frames, thus it can represent the content of a voice memo
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	188 recording made in DTX mode. These gsmx files can then be decoded into
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	189 playable WAV with our gsmfr-decode utility.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	190
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	191 FreeCalypso tools for external generation of voice memo files
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	192 =============================================================
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	193
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	194 Using FreeCalypso tools, you can produce an external speech recording in GSM
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	195 06.10 FR1 codec format, convert it into TCS211 VM format, upload it into FC
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	196 device FFS with fc-fsio, and then play these externally-produced voice memos
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	197 with AT@VMP. The steps are as follows:
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	198
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	199 1) You can use gsmfr-encode to FR1-encode a speech sample from WAV into classic
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	200 .gsm format, or gsmfr-encode-r if the source is raw BE instead of WAV.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	201 Alternatively, you can use any other off-the-shelf software that can encode
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	202 FR1 and write libgsm format; SoX shipped with Slackware includes the
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	203 necessary support.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	204
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	205 2) fc-gsm2vm converts a .gsm recording into non-DTX TCS211 VM format.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	206
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	207 At the present time we don't have any tools for producing external DTX-enabled
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	208 VM recordings: the main limitation is that at least to this Mother's knowledge,
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	209 the published source software community does not currently possess a GSM 06.10
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	210 encoding library that has been extended with VAD and DTX functions. There is
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	211 classic libgsm from 1990s, used by everyone in the FOSS community who needs a
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	212 GSM 06.10 encoder or decoder, but it doesn't do DTX; we (FreeCalypso and
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	213 Themyscira Wireless) have produced our own libgsmfrp front-end that implements
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	214 Rx DTX handler functions (that's how we can properly decode FR1 streams that
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	215 contain SIDs and/or missing frames), but it doesn't help with DTX encoding.
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	216 Therefore, our ability to produce TCS211-compatible VM recordings externally is
69061d044f05 Voice-memo-feature: new article Mychaela Falconia <falcon@freecalypso.org> parents: diff changeset	217 currently limited to non-DTX mode.

FreeCalypso > hg > freecalypso-docs

annotate Voice-memo-feature @ 96:69061d044f05