FreeCalypso > hg > freecalypso-tools
view doc/TIFFS-Overview @ 920:0306449ba467
doc/TIFFS-Overview: document classic filename and pathname limits
author | Mychaela Falconia <falcon@freecalypso.org> |
---|---|
date | Sat, 31 Dec 2022 21:08:05 +0000 |
parents | 23f41a61ff7a |
children |
line wrap: on
line source
[Historical note: this document was originally written in 2014 when the vision of FreeCalypso was very different from what it is today. Since then we have transitioned from making aftermarket hacks on pre-existing Calypso phones and modems to producing and supporting our own FreeCalypso hardware products, and our firmware work has changed from a nebulous dream to stable production code. The ways in which we approach various technical aspects have changed accordingly, but much of our documentation has been slow to catch up. The documentation is now being updated, but there may still be some passages where the old world view shines through.] All TI GSM firmwares known to this author (Mother Mychaela of FreeCalypso) implement some kind of flash file system, or FFS. Several different FFS code implementations, and correspondingly several different on-flash data formats, have been used throughout the history of TI's involvement in the wireless terminal business. The FFS incarnation of primary interest to the FreeCalypso project is the one invented by Mads Meisner-Jensen at TI in the early 2000s (at least according to the comments in the sources available to us), and it is relevant to us in the following ways: * When targeting the GSM modem in Openmoko's GTA01/02 smartphones, we need to work with the original FFS from the factory (call it MokoFFS), the same FFS as used by the mokoN firmwares: this FFS contains the IMEI and the RF calibration values from the factory, which we most certainly don't want to go without. * The Leonardo firmware semi-src which we are using as the reference for building our own full source, multi-target GSM fw contains a turnkey-working implementation of this very FFS, using the on-flash format in question and providing run-time APIs expected by the rest of the GSM fw suite. Following the principle of ``if it ain't broke, don't fix it'', we use this FFS not only on the gtamodem target, but also on other targets, including those where we are starting from a blank state and thus have the freedom to use whatever FFS we like. * The original proprietary fw on the Pirelli DP-L10 phone also happens to use an FFS in the same format, although Pirelli's FFS does *not* contain the IMEI or any of the RF calibration values. This Pirelli phone was originally a target of high interest for FreeCalypso, as we had high hopes of turning it into a libre phone by way of our aftermarket firmware - but this plan has since been abandoned when it became clear that Pirelli's hw is unsuitable for aftermarket fw development because of the multitude of extra peripheral chips for non-GSM functions which get in the way. In the earlier years of FreeCalypso a lot of effort had been invested into studying all aspects of the Pirelli DP-L10 phone and its original firmware, including the FFS. Naming ====== I have previously referred to the FFS format in question as Mokopir-FFS or MPFFS, from "Moko" and "Pirelli". I was originally hesitant to call it TIFFS, as lacking the source code, I had no way of knowing whether the FFS format and implementation were of TI's own invention, or something that TI licensed as a black box from one of their many proprietary software partners. (I was unable to identify it as any well-known, industry-standard FFS format, but absence of evidence is not evidence of absence.) But now that we have TI's original source code which implements this FFS (first the MV100-0.1.rar source, then the full Leonardo one), complete with comments and a HISTORY file, we know that our FFS was invented and implemented by someone named Mads Meisner-Jensen at TI-DK, apparently their flash chip expert who also wrote FLUID. I am now making a naming transition from MPFFS to TIFFS: there is really no link between this FFS format and the Openmoko+Pirelli duo, other than the happenstance of me having first encountered this FFS on these two GSM device brands, and the name TIFFS is more neutrally-descriptive. What it is ========== In a rare departure from TI's norm (most of TI's GSM firmware and associated development tools suffer from heavy Windows poisoning), what I call TIFFS is very Unixy. It is a file system with a hierarchical directory tree structure and with Unixy forward-slash-separated, case-sensitive pathnames; the semantics of "what is a file" and "what is a directory" are exactly the same as in UNIX; and TIFFS even supports symlinks, although that support is a little under- developed, and apparently no FFS symlinks were ever used in any production GSM device. Thus the FFS implemented in TI-based GSM devices (modems and "dumbphone" handsets) is really no different from, for example, JFFS2 in embedded Linux systems. (The only traditional UNIX file system features which are missing in TIFFS are the creation/modification/access timestamps and the ownership/permission fields.) The FFS in a GSM device typically stores two kinds of content: * Factory data: IMEI, RF calibration values, device make/model/revision ID strings etc. These files are expected to be programmed on the factory production line and not changed afterward. * Dynamic data written into the FFS in normal device operation: when you use a "dumbphone" running TI-based firmware, every time you store something "on the phone" or in "non-volatile memory", that item is actually stored in the FFS. (Where else, if you think of it?) That includes contacts and received SMS stored "on the phone" instead of the SIM, any selections you make in the settings/preferences menus which persist across reboots (power cycles), call history etc. It needs to be noted that the "dynamic data" aspect of FFS usage applies not only to complete phones, but also to modems like the one used in the GTA01/02. One would naively think that non-volatile storage of data in flash outside of factory programming would be needed only in a device with its own UI, and that a modem subservient to external AT commands would be completely stateless across reboot/power cycles; but that is not the case in actuality. TI-baseed GSM firmwares, including Openmoko's mokoN and our own FreeCalypso, are designed to always "mount" their FFS with read/write access; TI's FFS implementation in the firmware has no concept of a "read-only mount". I am still investigating just what kinds of data are routinely written into the non-volatile FFS by the firmware in normal operation on devices like the GTA0x modem, but there most definitely are some. There is no hard separation between "static" and "dynamic" data in the file system structure; TIFFS is thus akin to an embedded Linux system with just a single root file system containing both "static" files like userland binaries and "dynamic" ones like configuration files under /etc which the user is expected to edit with vi after logging into the box, or log and similar files created by the system itself under /var, for example. Where it lives ============== The type of flash memory used in Calypso GSM modems and "dumbphones" is called NOR flash. This NOR flash memory is physically divided (by the design of the flash chip itself) into units called "sectors" or more descriptively, erase blocks. The typical NOR flash sector size (in Calypso GSM devices) ranges from 64 KiB in the GTA02 modem's NOR flash (4 MiB total) to 256 KiB in the S71PL129NC0 flash+RAM chip used in the Pirelli DP-L10 and in our own FreeCalypso hardware designs (16 MiB of flash total). The key physical property is that any bit may be changed from a '1' to a '0' at any time, in any combination, but resetting of '0' bits back to ones can be done only on the granularity of these largish sectors, in an operation called "sector erase". The location of TIFFS within the flash memory of a given GSM device is defined by the firmware design of that device, but is always some integral number of contiguous flash sectors. Some examples: * On the GTA01/02 GSM modem, FFS occupies 7 sectors of 64 KiB each, starting at flash offset 0x380000. * On the Pirelli DP-L10, the FFS used by the original proprietary fw occupies 18 sectors of 256 KiB each (for 4.5 MiB in total), starting at the beginning of the 2nd flash chip select (0x02000000 in the ARM7 address space). * On Motorola/Compal C139/140 phones, the FFS used by the original proprietary fw occupies 5 sectors of 64 KiB each (320 KiB in total), starting at 0x370000. C11x/12x use smaller FFS configurations, whereas C155/156 use a different FFS implementation with a completely different on-flash format - see the new Compal-FFS article for more details. * On our own FreeCalypso hardware family we have put our FFS in the first 8 sectors (of 256 KiB each) in the 2nd flash chip select bank, which appears at 0x01800000 in the ARM7 address space instead of Pirelli's 0x02000000 because we have wired the 2nd flash chip select to nCS2 on the Calypso instead of Pirelli's nCS3. * The smallest real FFS configuration called for by the table in dev.c in TI's original Leonardo fw source is 3 sectors of 64 KiB each; the same table also sports a 4 KiB x 4 configuration for RAM-based testing (emulation of FFS in RAM without real flash). * The largest FFS configuration that has been envisioned by the original designers seems to be somewhere around 128 sectors. Each flash sector used for TIFFS begins with this 6-byte signature: 46 66 73 23 10 02 The first 4 bytes are 'Ffs#' in ASCII, and the following two bytes are the format version number of 0x0210 in little-endian byte order. The following two bytes give a count of how many times that sector has been erased and rewritten (FF FF in "fresh" or "virgin" FFS images), and the following byte indicates that block's role and status in the FFS life cycle. How it works ============ Just like JFFS2 and other high-quality flash file systems, TIFFS is designed to recover gracefully from any possible power failure or crash: one can yank the battery from the GSM device (or induce a firmware crash) at the most mis- opportune moment in the middle of an FFS write operation, and the FFS is expected to recover on the next boot cycle. I won't be able to document here all gory details of exactly how this goal is achieved, partly because I haven't studied the code to the requisite level of depth myself yet, but all of the responsible code lives under src/cs/drivers/drv_app/ffs in our fc-magnetite and fc-selenite source trees; feel free to study it. In its "normal" or "clean" state (i.e., when not in the middle of a write operation or recovery from an ungracefully interrupted one), a TIFFS instance consists of the following 3 types of blocks: * One block containing inode records, indicated by AB in its type/flags/status byte in the block header; * N-2 blocks (where N is the total number of flash sectors allocated for the FFS) containing (or waiting to be filled with) data chunks - indicated by BD in the type/flags/status byte; * One "free" block, indicated by BF - destined to become a new AB or a new BD at some point. Each object written into the FFS (file, directory or symlink) consists of a 16-byte inode record written into the AB block and a data chunk written into one of the BD blocks. The data chunk includes the name of the object, hence one is required even for directories. Data chunks are contiguous, uncompressed, and subject to an upper size limit of 2048 or 8192 bytes, depending on the FFS configuration. Files larger than this limit are stored in a "segmented" form, giving rise to a 4th inode or object type (after file, directory and symlink): segment. Each segment of a segmented file consists of not only a data chunk, but also an inode record for the segment, which gives the location of the data chunk and ties the segment object into the overall FFS structure, making it accessible. Because aside from complete sector erasure, flash memory bits can only transition from '1' to '0' but not the other way around, overwriting an existing file with some new content (an operation which any reasonable file system must implement in some way) cannot be done in place. Instead like most flash file systems, TIFFS implements this common operation by writing the new version of the file to a new location (previously blank flash) and then invalidating the old version - and doing all that while keeping in mind the possibility of an ungraceful crash or powerdown at any moment, and the requirement of recovering gracefully from any such event. Of course as an FFS receives more write activity, even if one keeps overwriting some existing files with new content of the same size, without adding to the visible total content size (think du(1) command), eventually all remaining blank flash space will fill up. At that point (or at some earlier point, depending on the FFS design and/or configuration) the FFS has to invoke a compaction or reclamation or garbage collection procedure: any "mixed" blocks containing both valid and stale data are transitioned into a "stale-only" state by having the active data moved to a new block, and then the "all stale" blocks are subjected to sector erasure, becoming new blank sectors. The logic responsible for these operations once again needs to be resilient to the possibility of a crash or powerdown occurring at the most mis-opportune moment, and it also needs to implement flash wear leveling: there is a physical limit to how many times a given flash sector can be erased and rewritten before it goes bad. All of the above are common and well-known principles, successfully implemented in well-known flash file systems such as JFFS2 in Linux. TIFFS is absolutely no different in this regard; for the implementation details, read the source code. TIFFS filename and pathname limits ================================== Classic TIFFS, as in the canonical firmware source from TI, imposes the following limits on the content written into FFS: * Each elementary filename or pathname component (the name of each individual file or subdirectory within its parent directory) is limited to 20 characters; * The set of allowed characters in these elementary filenames is limited to [A-Za-z0-9_.,+%$#-]; * The maximum pathname depth is limited to 6. As an illustration of the pathname depth limit, the deepest allowed path to a non-directory file is /d1/d2/d3/d4/d5/file. It is also possible to have a directory nesting of /d1/d2/d3/d4/d5/d6, but in this case the deepest directory can only be empty. How this FFS comes into being ============================= (This section is only relevant to you if you plan on physically producing your own GSM phones or modems on your own factory production line, like we currently do at our family company, or if you simply enjoy knowing how it is done.) To my knowledge, TI never used or produced a tool akin to mkfs.jffs2 in the embedded Linux world, or akin to our recently developed tiffs-mkfs, which would produce a TIFFS image complete with some initial directory and file content "in vitro". Instead it appears that the FFS instances found in shipped products such as Openmoko phones have been created "in vivo" by TI's firmware running on the device itself during the "production test" phase. We never got a copy of the original factory production line software that was used by Openmoko, but we have successfully replicated the process using our own Unix/Linux-based FreeCalypso host tools, the very same tools that are contained in the present source package you are looking at. The process goes like this: * When the printed circuit board is physically populated with components such as the Calypso chip and the flash chip, the latter can be blank - if the board design has the nIBOOT pin pulled low, enabling the Calypso boot ROM (Openmoko and Pirelli both good on this one, but shame on Compal!), there is no need to preprogram the flash chip with anything prior to populating it on the board, and the device remains fully unbrickable at all times afterward. * When the assembled board is powered up for the first time, with completely blank flash, the Calypso boot ROM will sit there and patiently wait for a code download on either of its two UARTs. * Using TI's FLUID (Flash Loader Utility Independent of Device) or FreeCalypso's fc-loadtool free replacement, the factory production station loads the main firmware image into the flash. Note, it is just the firmware image at this step, and the FFS sectors remain blank. * The board is commanded to reboot (or power-cycled), and the firmware image boots for the first time. * TI's FFS implementation code in their standard firmware reacts to all blank flash in the FFS sectors as follows: it performs what they call the preformat operation, writing the TIFFS signature and a BF state byte into every FFS sector, but the main "format" operation, which sets up the AB/BD block roles, creates the root inode and makes the FFS ready to accept the creation of its first directories and files, is not done automatically. In order to perform the FFS format operation and then fill the new FFS with whatever directories and files are deemed needed to be present in "fresh" shipping products, the factory production station connects to the just-booted firmware running on the target via the RVT/ETM protocol (see the RVTMUX write-up), and sends "test mode" commands to this running firmware. These "FFS test mode" (or TMFFS) commands include the format operation, an mkdir operation to create directories, and a "file write" operation akin to doing 'cat > /dir/whatever/file', creating files in FFS and storing any desired data in them. The IMEI is assigned and written into FFS in this process, but it is not the only data item that will be unique for each individual device made. Much more important are the RF calibration values; the factory calibration procedure does the following for each individual unit: * Measures the frequency offset produced by the VCXO as a function of the AFC DAC control value and constructs the afcparams table based on these measurements; * Characterizes the dBm output of the Tx chain as a function of the APC DAC control value and comes up with a set of these DAC values which produce Tx power levels prescribed by the GSM 05.05 spec; * Determines the correction values which need to be applied in order to set the correct Rx path gains and to determine the true Rx signal level from the dBfs power measurements made in the DSP from Rx I&Q samples. These calibration procedures are performed by connecting a suitable RF test instrument (R&S CMU200 is the industry gold standard) to the GSM device's antenna connector or RF test port and running special calibration programs which talk both to the CMU200 or other test instrument and to the L1TM (Layer 1 test modes) component in the DUT (device under test). In FreeCalypso hardware manufacturing we use a CMU200 instrument which is itself maintained in good calibration standing, and for the calibration software we use our own fc-rfcal-tools which talk to the DUT via rvinterf. All of the resulting calibration values are stored in a bunch of files under the /gsm/rf subtree, and these files are "owned" by the L1 code. The latter has RAM data structures which correspond to these files; upon normal boot the initialization code looks in FFS, and if it finds any of the RF calibration files, it reads each present file into the corresponding RAM data structure, overwriting the compiled-in defaults. With TI's standard production calibration procedure which we have replicated in our FreeCalypso hw manufacturing setup, these RF calibration files in FFS come into being as follows: * The Test Mode support code in L1 (i.e., part of the main GSM fw) performs the measurements and stores results in its RAM data structures as commanded by the production test station through the Test Mode interface; * Certain special test mode commands encoded via the MISC_ENABLE opcode direct the above L1TM code to write its RAM data structures into FFS. See the RF_tables article for more information. Compal and Pirelli differences ============================== The above description refers to TI's vanilla reference version, and it seems like Openmoko (FIC) was the only phone/modem manufacturer (prior to us!) who followed it without major deviations. In contrast, both Compal (Motorola C1xx and Sony Ericsson J100) and Foxconn (Pirelli DP-L10) moved their vital per-unit factory data (IMEI and RF calibration) out of the FFS into their own ad hoc flash data structures (which are very difficult to reverse-engineer and make use of, unfortunately), leaving their FFS only for less critical data. In Compal's case (all C1xx models and SE J100) the FFS stores only users' personal information and nothing more. One can turn the phone off, use fc-loadtool to erase the FFS sectors, and boot the regular fw back up; the fw will automatically do a new FFS format (it even displays a message on the LCD as it does so) and carry on happily as a "fresh" or "blank", perfectly functional and usable phone. Please see the new Compal-FFS article for further details. In Pirelli's case, booting their official fw with blank FFS sectors will also result in the FFS being automatically formatted, but their fw expects some static "asset" files to be present in this FFS: UI graphics and language strings, ringtones, firmware images for the WiFi and VoIP processors and some static configuration files, about 3 MiB in total. Thus although the firmware will auto-format the blank FFS sectors, it won't function normally with all of these "asset" files missing. Foxconn's original factory production line station must have uploaded these files to each phone via the TMFFS2 protocol, and our FreeCalypso suite now features a tool that can replicate this feat: fc-fsio. Aftermarket FFS for FreeCalypso on Compal & Pirelli targets =========================================================== When we run our own FreeCalypso fw on "alien" (not native to us) Mot C1xx and Pirelli DP-L10 hardware, we don't use the FFS from their respectively original firmwares: those original FFS instances don't contain any bits of interest to us, trying to make our fw use the same FFS as Mot/Compal's or Pirelli's original fw would be more trouble than benefit, and on one of the target devices in this family (Mot C155/156) the original FFS is in some different format. Instead we create our own aftermarket FFS for our FreeCalypso fw on these alien hw targets, using a different flash location from the original so that the original fw's FFS cannot be mistaken for our own. On the Pirelli DP-L10 we put our aftermarket FFS in an area of the flash which the official fw family uses as a staging area for over-the-air firmware updates, thus as long as you are not doing official fw updates over WLAN (i.e., if you only run one fixed official fw version or flash different official fw versions with fc-loadtool without going through "fw update" protocols), our aftermarket FFS used by our run-from-RAM FreeCalypso firmwares should remain undisturbed while the phone is in the official fw mode. The situation is different on Mot C1xx phones. The lower-end C1xx models including the C139 (our primary hw target in this family) have too little RAM to run our FreeCalypso fw entirely out of RAM without flashing; the C155/156 subfamily does have enough RAM to allow a complete FC GSM fw image to be loaded and run via fc-xram under some conditions (we previously supported such usage in our now-retired Citrine fw and we also support it in the gcc-built config of FC Selenite), but there is no place in the flash where we can put our aftermarket FFS without overwriting some part of the original fw or its data - thus our general procedure for running FreeCalypso on any C1xx model is to convert the victim phone to FC on a long-term basis by flashing our modified bootloader, flashing one of our fw builds and establishing a FreeCalypso aftermarket FFS in a flash area designated by us. It was already mentioned earlier that the factory RF calibration values on these alien phones are stored in non-TIFFS flash data structures of Compal's or Foxconn's invention, and our currently supported FreeCalypso firmwares (Magnetite and Selenite) do not contain any code for reading these alien data structures. (FC Citrine could read directly from Pirelli's factory data block, but none of our fw offerings ever parsed Compal's weird flash records.) Instead our current approach is to have an external tool extract the bits of interest from the alien factory records, convert them to our TI-standard format if necessary, and upload them into our FreeCalypso aftermarket FFS. The specifics are as follows: * The Pirelli DP-L10 is a breeze: simply run our special pirelli-magnetite-init command in fc-fsio while connected to a Magnetite or Selenite fw instance running with our aftermarket FFS, and the tool will copy both the IMEI and the RF calibration records from Pirelli's factory data block into our aftermarket FFS. * Mot C1xx phones present a lot more hassle: our current official procedure is to make a dump of the flash prior to the xenotransplantation procedure (also serves as a backup), extract the RF calibration values with our c1xx-calextr tool, and then later in the procedure when you initialize your aftermarket FFS with fc-fsio, upload these extracted and format-converted RF calibration files as one of the several steps involved. You will also need to enter your IMEI manually: fc-loadtool flash compal-imei command can extract the factory IMEI record from the flash chip's protection register and save it into a text file, but you still need to feed it manually to the new firmware with fc-fsio set-imeisv command. FreeCalypso host tools for TIFFS ================================ Our FC host tools package supports TIFFS in two ways: 1) Our primary tool for working with GSM device file systems is fc-fsio. When run against a compatible firmware version (primarily our own, but Pirelli's proprietary fw is also compatible), fc-fsio allows various read and write operations to be performed on the target device FFS. fc-fsio can also be used together with our fc-xram based FFS editing agent described below. 2) We have a TIFFS In Vitro Analyzer (IVA) tool for "in vitro" examination of FFS images that have been read out of raw flash with fc-loadtool. See the TIFFS-IVA-usage article for more information. As a very recent addition, we also have another "in vitro" tool (tiffs-mkfs) that goes the other way, creating new complete TIFFS images from a tree of directories and files. In addition to the above, back in the days of Openmoko (back when the Openmoko community was still active and we considered ourselves a part of it) we had produced a kit for editing the modem FFS on Openmoko GTA01/02 devices, giving users an easy way to change their /pcm/IMEI file. Changing IMEIs for no good reason is completely pointless and is actually detrimental rather than helpful for privacy, but whoever came up with the edicts that "the IMEI MUST be immutable" had obviously failed Human Psychology 101: declaring something to be forbidden causes people to want it simply because it is forbidden and for no other reason - hence the popular demand for IMEI changing tools. Our Openmoko FFS editing kit from early 2014 consisted of a very early version of what much later became the present FC host tools package (more specifically, it was before fc-fsio, and the set-imeisv command had been hacked into fc-tmsh instead) plus a pair of "in vivo" FFS editing agent target binaries that run on the target by way of fc-xram. Our current FC host tools fully supplant the ancient version in that 2014 kit, and our current replacement for the ancient FFS editing agent is this new version: https://www.freecalypso.org/hg/ffs-editor/ The new FFS editing agent linked above is run via fc-xram, while it is running you communicate with it via rvinterf (launched directly from fc-xram as the 2nd program), and you can run fc-fsio against it to perform whatever actual FFS manipulations are needed.