diff doc/Melody_E1 @ 180:e50c3aa1152a

doc/Melody_E1 written
author Mychaela Falconia <falcon@freecalypso.org>
date Mon, 27 Mar 2017 21:38:45 +0000
parents
children
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/doc/Melody_E1	Mon Mar 27 21:38:45 2017 +0000
@@ -0,0 +1,234 @@
+Generating ringtone melodies through the Calypso DSP
+====================================================
+
+The DSP in the Calypso and other GSM DBB chips from TI includes a built-in
+capability for generating ringtone melodies to be played through a loudspeaker
+driven by the ABB, without using an external melody generator chip.  More
+specifically, the DSP in question supports two flavors of internal melody
+generation, called Melody E1 and Melody E2 - although it is unclear whether
+Melody E2 is implemented in the DSP ROM or in the DSP code patches downloaded
+by TI's firmwares.
+
+The Melody E1 mechanism produces simple polyphonic melodies with up to 8
+simultaneous notes; these melodies consist of simple sine waves generated by
+the DSP as commanded by the bits read from the melody file as explained below.
+Melody E2 is a more complex mechanism for producing melodies (also polyphonic
+with up to 8 simultaneous notes) using the sounds of specific instruments,
+rather than simple sine waves, but we currently lack the bits required in order
+to understand or exercise it, hence our current focus is on the simpler Melody
+E1 mechanism.
+
+How these melodies are played
+=============================
+
+TI's RiViera Audio Service firmware component provides a front-end to the
+various audio services provided by the lower-level DSP+L1 combo.  In the case
+of Melody E1 and Melody E2 features, the combination of the DSP and TI's
+ARM-side L1 code effectively defines the format of the melody bits themselves,
+but the RiViera Audio Service takes care of reading these bits from FFS and
+feeding them to L1.
+
+To play an E1 format melody, the UI code needs to call audio_melody_E1_start();
+one of the arguments to this API function is the FFS absolute pathname of the
+melody file.  The API function will open this file and pass the open file
+descriptor along with other parameters in the message posted to the Audio task;
+the latter task will prefetch the first buffer-full of melody bits from the file
+and then post an MMI_MELODY0_START_REQ message to the L1A task.  The Melody E1
+handler in L1A will set up some preliminaries and fire up the Melody E1 L1S
+task, and the latter will then pass the melody bits to the DSP at appropriate
+times.
+
+Melody E1 file format
+=====================
+
+We have found a rather terse and not particularly thorough description of the
+Melody E1 bit format on pages 160 through 163 of this PDF document:
+
+https://www.freecalypso.org/LoCosto-docs/PSL1DOC/L1/L1M_AS001_1.pdf
+
+This description is not complete enough to enable proper understanding or
+implementation, but by combining it with a study of the L1A and L1S code that
+reads these bits and passes most of them to the DSP, we have reconstructed a
+somewhat more complete picture.
+
+The format is word-oriented, i.e., the basic unit of data in a Melody E1 file
+is the 16-bit word.  Most of these words are passed to the DSP for final
+interpretation inside the latter, hence we won't be able to have a 100% certain
+understanding of what happens there unless we can find the source for the DSP
+ROM code or expend a Herculean effort to reverse-engineer it, but some of the
+words are interpreted and acted upon by the ARM-side L1 firmware code, which we
+have source-reconstructed already.  When these words are written in a disk or
+FFS file, the byte order is little-endian, as it is ARM code that reads these
+16-bit words from a byte-oriented source.
+
+The very first word in a Melody E1 file gives the global list of oscillators
+used by this melody; this word is read by the L1A code before the L1S task is
+fired up.  The presence of this word is not documented at all in the terse
+description given in L1M_AS001_1.pdf, and our attempts at producing our own E1
+melodies were going nowhere until we discovered that this word is needed through
+the study of our reconstructed TCS211 L1 code.  This initial word corresponds
+to the osc-set line in our ASCII format.
+
+After the initial word giving the global oscillator set, the melody file
+consists of what we shall call time blocks.  Each time block begins with a time
+descriptor word which is interpreted and acted upon by ARM L1S code, followed
+by 0 to 8 oscillator descriptors which are loaded into DSP API words.  These
+words are described in TI's document, so we are just going to supplement that
+description wherever we have discovered something to the contrary.
+
+The lower byte of the time descriptor word tells the L1S task how long it should
+wait before loading the following oscillator descriptors into the DSP.  It
+appears that TI's intent was for this time value to be measured in 20 ms audio
+frames, but what the ARM L1S code actually does is multiply the given time value
+by 4 and use the result as the number of TDMA frames to count - the L1S code
+executes on every TDMA frame.  13 TDMA frames equal 60 ms, thus 4 TDMA frames
+do not exactly equal 20 ms, but come a little short.  It is not clear whether
+the melody files generated by TI and/or their customers account for this
+discrepancy or not.  In any case, the time value given in the file needs to be
+non-zero - putting a zero in there will cause the L1S counter to be set to 65535
+TDMA frames (a 16-bit unsigned counter loaded with 0 and decremented by one),
+which is probably not what you want.
+
+The upper byte of the time descriptor word is a bit mask indicating which DSP
+oscillators are to be loaded at this time.  This bit mask byte can be zero, in
+which case the time block consists of just the time descriptor word.  However,
+the L1S code does absolutely nothing to the DSP in this case, hence an empty
+(no oscillators) time block is indistinguishable from adding the time to the
+following non-empty block.  But the largest time value that can fit in the byte
+is 255, hence empty time blocks can be used to produce larger time deltas.
+A time descriptor with zeros in both upper and lower bytes indicates the end of
+the melody; this terminator is required.
+
+Now we come to the interesting part: the oscillator descriptors that are loaded
+into the DSP to cause the actual melody generation to occur.  The DSP's NDB API
+page contains 4 words for each of the 8 oscillators, and these NDB API words are
+where the oscillator descriptor words from the melody file ultimately go.
+
+Please refer to the description of the ml_load1 and ml_load2 bits on page 162
+of TI's L1M_AS001_1.pdf document.  Now here is what the L1S code actually does:
+first it loads 2 words from the file buffer into the DSP's NDB page - yes,
+directly into there.  Then it does the following logic (code simplified from
+the actual into more readable pseudocode):
+
+	load_size = 0;
+	if (word1 & ml_load1)
+		load_size++;
+	if (word1 & ml_load2)
+		load_size++;
+	if (load_size)
+		load load_size words at word2 address in the DSP's NDB page
+
+This logic is peculiar: what happens if ml_load2 is set but not ml_load1?  The
+result will be that the word meant to be word3 (the envelope word) will get
+loaded into the word2 location in the DSP's NDB page.  Unless the DSP actually
+checks the ml_load bits and expects the envelope word in the word2 location in
+this case, which I highly doubt, this L1S behaviour looks like a bug to me - so
+don't use the word3 present but not word2 combination in your melodies.
+
+It appears that these ml_load1 and ml_load2 bits are only checked by the L1S
+code and ignored by the DSP.  I say so because when I tried creating a melody
+in which word2 and word3 were always omitted, the result was bogus.  It appears
+that the first time a given oscillator is loaded, all 4 words must always be
+given, otherwise the DSP will act on whatever garbage happens to be in those
+NDB API words from before.  When the same oscillator is subsequently reloaded,
+omitting word2 and/or word3 will cause that word's previous value to be reused.
+
+A few notes regarding some bits in word0:
+
+ml_synchro (bit 0): the L1S code ORs a 1 into this bit in the NDB API word
+after it has loaded all of the words.  It thus seems more correct to me to put
+a 0 in this bit in the files, so that the DSP sees the new descriptor when it
+is complete - but of course we can never know for sure without knowing what
+actually happens inside the DSP.
+
+ml_directF: both common sense and the TSM30 source (which uses the Melody E1
+feature of the DSP in that old Calypso version) suggest that ml_directF is
+bit 1, ml_square1 is bit 2 and ml_square2 is bit 3, i.e., it appears that the
+table on page 161 of L1M_AS001_1.pdf is wrong in this regard.  Also note the
+order in which the fields are described on page 162 of the same PDF document.
+
+This is where our current knowledge ends.  Until we either obtain a copy of the
+source for the DSP ROM or painstakingly reverse-engineer it, all we can do is
+look at the few existing examples of E1-format melodies we can find (see below)
+and experiment with putting different values in the various fields based on the
+description in the L1M_AS001_1.pdf document.
+
+Examples of E1-format melodies
+==============================
+
+We've been very fortunate to discover that the legendary TSM30 phone appears to
+have used the Melody E1 feature, and that there are a bunch of E1-format
+melodies embedded in the famous TSM30 source from HispaPhreak.  I have extracted
+these melodies, played them through the earpiece speaker on a Pirelli DP-L10
+phone running FreeCalypso Magnetite (our own FCDEV3B with a loudspeaker that we
+can actually use has not been built yet as of this writing), and found some of
+them to be quite pleasant-sounding.  These extracted TSM30 melodies can be found
+here:
+
+ftp://ftp.freecalypso.org/pub/GSM/ringtone/tsm30-melody-e1.tar.gz
+
+I also found a couple of melodies in our TCS211 reference semi-src under
+chipsetsw/services/Audio/tests; these two melodies illustrate how one can load
+word2 and word3 the first time and then omit them afterward when reloading the
+same oscillator.  (All of the TSM30 melodies always load all 4 words in every
+oscillator descriptor.)
+
+Our own ASCII format for E1 melodies
+====================================
+
+In this FreeCalypso host tools package we have a utility for decoding existing
+Melody E1 binary bits into an amenable-to-study ASCII format, as well as a
+utility for generating new E1-format binary melodies from an ASCII text source
+in the same format.  The ASCII format is of our own invention, and consists of
+numeric fields which map directly to the various bit fields in the DSP+fw's
+binary format.
+
+Our ASCII format for E1 melodies consists of 3 parts: an osc-set global header
+line, a sequence of time blocks, and an end line.  The noteworthy aspects are:
+
+* Each time block is given as a time line followed by 0 or more osc lines.
+  This lines must follow in direct succession without intervening blank or
+  comment lines, and each time block must end with a blank line.
+
+* The end marker line is mandatory; having the ASCII file just end without it
+  is an error.
+
+Please see the source code for fc-e1decode and fc-e1gen for the rest.
+
+Some words regarding Melody E2
+==============================
+
+E1-format melodies are self-contained: if you have a valid binary melody file
+in E1 format from whatever source, you can play it through the DSP of any
+Calypso device that runs our TCS211-based Magnetite firmware.  But it is not so
+simple with Melody E2.  In order to play a melody in E2 format, one needs not
+only the melody file itself, but also the set of *.mwa (instrument wave) files
+corresponding to the set of instruments used by that melody.  It appears that
+the melody group at TI had produced as many as 48 different instrument wave
+tables: see the list in the non-production
+	#if (AUDIO_SIMULATION) || (AUDIO_L1_STANDALONE)
+section of the Cust_audio_melody_E2_load_instrument() function in l1audio_cust.c
+in both TSM30 and LoCosto/Peek sources.  (The LoCosto version lists 48
+instruments whereas the much earlier TSM30 version lists only 40 of them, thus
+the list must have been added to over the course of TI history.)  A given E2
+melody selects a subset of 1 to 8 instruments out of the larger set to be used
+in that melody, and these selected instrument waves are loaded into the DSP's
+API RAM before the actual play of the melody itself.
+
+Unfortunately all we have are the *.mwa file _names_ for the 48 Melody E2
+instruments that apparently existed at TI once upon a time, but not any of the
+actual bits.  The TSM30 source uses only Melody E1, not E2, thus we do not
+currently have any source from which we can take any E2-format melody examples
+or E2 instrument wave tables for TI's DSP.  We also don't have any documentation
+for any of these bits, and analysis of the Melody E2 code in L1 shows that it is
+significantly different from E1.  The code in TCS211 L1 that reads Melody E2
+file bits is not of much help for making our own E2 melodies, as all of the real
+magic happens in the DSP, not on the ARM side.
+
+Thus our FreeCalypso hardware+firmware combination is capable of playing both E1
+and E2 melodies, but we won't be able to exercise the latter capability until
+and unless someone finds a surviving copy of some existing E2 melodies along
+with the *.mwa instrument wave files they require, whether it is the same
+instrument set as listed in the non-production section of l1audio_cust.c or a
+different one.  But if someone does obtain a set of such melody bit files, our
+FreeCalypso devices running FreeCalypso firmware are ready to play them.