FreeCalypso > hg > freecalypso-docs
comparison MEMIF-wait-states @ 17:3d65bdaf00da
MEMIF-wait-states article written
author | Mychaela Falconia <falcon@freecalypso.org> |
---|---|
date | Sun, 16 Jun 2019 23:30:33 +0000 |
parents | |
children | c01155dec65b |
comparison
equal
deleted
inserted
replaced
16:396d44c543e3 | 17:3d65bdaf00da |
---|---|
1 The Calypso chip's MEMIF (ARM memory interface) block has a few configuration | |
2 registers; most settings in these registers are quite straightforward, but the | |
3 WS setting (number of wait states to be inserted for external memory access) | |
4 requires some non-trivial analysis. | |
5 | |
6 Calypso MEMIF timings are described on pages 7 through 11 of this TI document: | |
7 | |
8 ftp://ftp.freecalypso.org/pub/GSM/Calypso/cal000_a.pdf | |
9 | |
10 When running on a Calypso C035 target, our TCS211 reference fw as well as most | |
11 vendor firmwares we've examined run the ARM7 core at its maximum clock frequency | |
12 of 52 MHz. These same firmwares typically configure WS=3 for both flash and | |
13 XRAM. Most Calypso-based phones and modems have flash and RAM chips with 70 ns | |
14 access time, and for a long time it seemed that this combination of ARM7 at | |
15 52 MHz and WS=3 was OK for 70 ns memories: one ARM7 clock cycle at 52 MHz is | |
16 19.23 ns, WS=3 means 4 cycles total per access (it's an N+1 arrangement), | |
17 19.23 ns * 4 = 76.92 ns, thus it should be OK for 70 ns memories, right? Not | |
18 so fast: as shown in the formula on cal000_a.pdf page 11 and can be seen from | |
19 the timing diagrams, two other timing parameters (tda and tsu) also need to be | |
20 factored in. The sum of tda+tsu for 2.8V MEMIF as given in the only document | |
21 we have available is 10.5 ns, thus if we run the ARM7 core at 52 MHz and set | |
22 WS=3, the available safe window for memory access time is only about 66 ns, | |
23 which is 4 ns short of the 70 ns flash and RAM access time specs. | |
24 | |
25 TI's reference fw setting of WS=3 in conjuction with ARM7 running at 52 MHz has | |
26 made its way into the official firmwares of Openmoko devices and several Compal | |
27 phones, including Mot C11x/12x, Mot C139/140 and Sony Ericsson J100. At least | |
28 in the case of Openmoko we know that the hardware features a flash chip with | |
29 70 ns access time (the combined flash+RAM chip is K5A3281CTM-D755, with the | |
30 suffix meaning 70 ns access time for flash and 55 ns for RAM), and in the case | |
31 of Compal phones it is highly unlikely that they used flash chips faster than | |
32 70 ns, thus we have strong evidence that the access time spec is being violated | |
33 by about 4 ns. It works in practice because the official specs are guaranteed | |
34 worst-case numbers, but it is still wrong in the strict sense. | |
35 | |
36 We have strong evidence that this WS=3 setting comes from TI's mainline | |
37 reference fw, as opposed to being customized by or for Openmoko or Compal. | |
38 The evidence is in the following instruction sequence which appears verbatim- | |
39 identical across Openmoko's, Mot C11x/12x and C139/140 firmware versions: | |
40 | |
41 ldr r1, =0xFFFFFB00 | |
42 mov r0, #0xA3 | |
43 strh r0, [r1, #0] | |
44 strh r0, [r1, #2] | |
45 mov r2, #0xA5 | |
46 strh r2, [r1, #4] | |
47 strh r0, [r1, #6] | |
48 mov r0, #0x80 | |
49 strh r0, [r1, #0xA] | |
50 mov r0, #0xC0 | |
51 strh r0, [r1, #0xC] | |
52 mov r0, #0x40 | |
53 strh r0, [r1, #8] | |
54 | |
55 (The SE J100 version differs only in the nCS2 configuration; apparently this | |
56 SE J100 phone has its ringtone melody generator chip hooked up to nCS2, whereas | |
57 on both OM's modem and Mot C11x/12x/139/140 this chip select is unused and | |
58 unconnected, meaning that its setting is a dummy just like nCS3 and nCS4.) | |
59 | |
60 The above instruction sequence has been reconstructed into the following | |
61 sequence of C macro calls: | |
62 | |
63 MEM_INIT_CS0(3, MEM_DVS_16, MEM_WRITE_EN, 0); | |
64 MEM_INIT_CS1(3, MEM_DVS_16, MEM_WRITE_EN, 0); | |
65 MEM_INIT_CS2(5, MEM_DVS_16, MEM_WRITE_EN, 0); | |
66 MEM_INIT_CS3(3, MEM_DVS_16, MEM_WRITE_EN, 0); | |
67 MEM_INIT_CS4(0, MEM_DVS_8, MEM_WRITE_EN, 0); | |
68 | |
69 MEM_INIT_CS6(0, MEM_DVS_32, MEM_WRITE_EN, 0); | |
70 MEM_INIT_CS7(0, MEM_DVS_32, MEM_WRITE_DIS, 0); | |
71 | |
72 (The last two lines setting nCS6 and nCS7 don't need to be considered, as those | |
73 are internal to the Calypso chip itself.) | |
74 | |
75 Thus we see that what appears to be TI's mainline code sets WS=3 for both nCS0 | |
76 and nCS1 (flash and XRAM, respectively), and then sets what appears to be a | |
77 dummy config for the unused nCS2, nCS3 and nCS4. I say "appears to be" because | |
78 we have no original source with comments, only a COFF binary object which our | |
79 reconstructed recompilable C code has been made to match. | |
80 | |
81 We may never know the truth unless we miraculously find a surviving copy of the | |
82 original (not reconstructed from disassembly) init.c source from TCS211, but my | |
83 (Mother Mychaela's) current working hypothesis is that the above MEMIF settings | |
84 were originally made for the D-Sample board and never changed for Leonardo. | |
85 The D-Sample board has flash on nCS0, main XRAM bank on nCS1, an additional | |
86 XRAM bank (typically unused) on nCS2 and peripherals (principally the LCD) on | |
87 nCS3. Furthermore, the original D-Sample boards had Calypso C05 chips populated | |
88 on them, and that chip version has no nCS4, only CS4 which is muxed with ADD22 | |
89 and used for the latter on the D-Sample. | |
90 | |
91 I further hypothetize that the above MEMIF settings were likely cast into code | |
92 in the days of Calypso C05, and that the WS=3 setting was computed when the | |
93 ARM7 core ran at 39 MHz. The combination of ARM7 at 39 MHz, WS=3 and the same | |
94 tda+tsu = 10.5 ns adjustment from the available cal000_a.pdf document | |
95 (officially corresponding to Calypso C035 F751774) gives an access time of | |
96 92 ns, which is very sensible. The hypothesis further goes that later TI moved | |
97 to Calypso C035 silicon and started running the ARM7 core at 52 MHz, but the WS | |
98 setting was never changed (overlooked), and the 92 ns access time turned into a | |
99 mere 66 ns. The latter works with 70 ns memories in practice despite being | |
100 strictly incorrect (negative margin), and so the error escaped notice. | |
101 | |
102 Solution adopted for FreeCalypso | |
103 ================================ | |
104 | |
105 Pirelli's firmware on the DP-L10 sets WS=4 for both flash and XRAM, and we have | |
106 always used the same setting in FreeCalypso when running on this target. When | |
107 we made our FCDEV3B hardware using the same Spansion flash+RAM chip copied from | |
108 the Pirelli DP-L10, we adopted the same WS=4 setting for our own FreeCalypso | |
109 hardware family on the reasoning that it is needed for this chip. But now we | |
110 have a better theoretical foundation: the flash+RAM chip in question has 70 ns | |
111 access time for both flash and pSRAM parts, same as most other flash and RAM | |
112 chips used in most Calypso devices, and the WS=4 setting should really be used | |
113 for all Calypso C035 targets (ARM7 at 52 MHz) with 70 ns memories. Thus the | |
114 new FreeCalypso strategy is to treat WS=4 as the generic default for Calypso | |
115 C035 platforms unless explicitly overridden for specific targets, and to stop | |
116 treating TI's reconstructed setup with WS=3 as canonical. | |
117 | |
118 When running on Openmoko GTA01/02, Mot C11x/12x, Mot C139/140 and SE J100 | |
119 targets (this specific list), we are going to keep WS=3 for nCS0 and nCS1 and | |
120 the dummies for nCS2, nCS3 and nCS4 unchanged for now, i.e., run with exactly | |
121 the same MEMIF settings as each manufacturer's respective original official fw. | |
122 The reason is political: we are not the product manufacturer of record, and the | |
123 error of negative design margin in the memory access timings is the liability | |
124 of FIC/Openmoko and Compal/Motorola/SE, not us. If we change from WS=3 to WS=4 | |
125 on these targets, our firmware will necessarily run a little slower, and given | |
126 that the original official fw "works just fine", we may be accused of needlessly | |
127 or artificially slowing down our aftermarket fw. But when we market our own | |
128 handset or modem products under the FreeCalypso trademark, then the full | |
129 responsibility for the entire product (hw+fw) falls on us, hence we use the | |
130 correct WS=4 setting. | |
131 | |
132 Interim WS setting during boot | |
133 ============================== | |
134 | |
135 There is one more complication to this picture. The MEMIF settings discussed | |
136 above for the operational phase with Calypso DPLL producing fast clocks are | |
137 made in the Init_Target() function, but there is another interim setting | |
138 established early on in assembly code, used prior to DPLL enabling, when the | |
139 ARM7 core runs at unmultiplied 13 MHz or 26 MHz as fed to the Calypso by the | |
140 board. This interim setting is first set in bootloader.s, then again in int.s | |
141 (with the definition residing in the included init.asm file), and the registers | |
142 are set to 0x2A1, meaning WS=1 and 1 dummy cycle. | |
143 | |
144 Unlike the situation with the censored init.c source file, we have the original | |
145 source for the assembly modules in question, and the only preprocessor | |
146 conditionals found therein are based on BOARD and CHIPSET symbols. Remember | |
147 that TI's Leonardo board never got its own BOARD number, instead it shares | |
148 BOARD=41 with D-Sample, yet the two boards have different Calypso clock inputs: | |
149 13 MHz on the DS, 26 MHz on the Leonardo. The C code in init.c (this part | |
150 survived in the LoCosto source) uses a preprocessor conditional on the RF_FAM | |
151 symbol to differentiate between 13 MHz and 26 MHz input clock arrangements, but | |
152 there is no conditional of any such sort in the assembly code. Thus it is my | |
153 (Mother Mychaela's) educated guess that the WS=1 setting was chosen assuming a | |
154 13 MHz clock, and when Leonardo came along with its 26 MHz clock, the problem | |
155 spot was once again overlooked. | |
156 | |
157 WS=1 at 13 MHz is equivalent to WS=7 at 52 MHz, thus there is plenty of margin. | |
158 But WS=1 at 26 MHz is equivalent to WS=3 at 52 MHz, once again putting us in | |
159 the troubled territory of negative margin with 70 ns flash and RAM chips. | |
160 Except that this case is even more difficult for firmware engineers to spot: | |
161 Pirelli's fw still has the same 0x2A1 setting in its early boot path, i.e., | |
162 their fw engineers have changed WS=3 to WS=4 for the main body of the fw, but | |
163 missed the early boot code. | |
164 | |
165 The solution adopted for FreeCalypso is to change the early MEMIF setting from | |
166 0x2A1 to 0x2A2, i.e., set WS=2 for the interim boot phase. |