-
Notifications
You must be signed in to change notification settings - Fork 410
Performance and Optimizing
Below are notes on optimization of spiffs.
Spiffs can be configured in many ways. Looking at the structure, spiffs normally do not complain even if things are oddly configured. The penalty is efficiency, either in flash usage or in time.
Following statements are guidelines.
Spiffs needs two free logical blocks. Hence, having less than 6 logical blocks is suboptimal. Having a great deal of logical blocks (more than 512) is valid, but spiffs will need to process a lot of data and this takes time. It might be better to have bigger logical blocks giving less number of blocks.
Considering logical block size and logical page size, one block should have at least eight pages per block. More is better. Upper limit is untested, but more than 1024 pages per block will probably be suboptimal. 32 to 512 pages per block feels reasonable.
Remember, it is perfectly valid to have a logical block size that spans multiple physical blocks. It is not necessary to span 2^n physical blocks, any multiple will do. Though, the size of the entire spiffs file system must be an integer multiple of the logical block size.
This all depends on how large your file system is, the median size of your files, and how the files are accessed and modified. To find your global maximum: test, test, test.
Then again, as mentioned, spiffs swallows pretty much. If you're not having problems with speed you should perhaps not spend too much time testing around.
Spiffs crunches pretty much data from time to time. Stating the obvious, use compiler optimizations.
If you have some spare tightly coupled memory, you might want to put spiffs work buffer there.
If you only run one spiffs instance, some cpu cycles can be saved using the SPIFFS_SINGLETON
config.
SPIFFS_CACHE
SPIFFS_CACHE_WR
Enabling read cache is highly recommended, and gives a performance boost at the cost of some memory. The more pages you can give, the better of course. But do note, spiffs only supports up to 32 cache pages. Giving more is waste of ram.
Also, there should be at least one write cache page. Otherwise, one must take care when writing files as all writes will be write through. Without write cache, only largest possible quantity of data should be written at any time. A loop like the following would wear the spi flash extremely without write cache:
// Bad example when not having write cache enabled
int write_one_million_fibonacci_numbers(void) {
spiffs_file fd = SPIFFS_open(fs, "fib", SPIFFS_O_CREAT | SPIFFS_O_TRUNC | SPIFFS_O_RDWR, 0);
if (fd <= 0) return SPIFFS_errno(fs);
int i = 1000000;
int a = 0;
int b = 1;
while (i--) {
// calc fibonacci number
int c = a + b;
a = b;
b = c;
// DO NOT DO THIS WITHOUT WRITE CACHE
// write one number at a time
if (SPIFFS_write(fs, fd, (u8_t *)&c, sizeof(int)) < SPIFFS_OK) {
(void)SPIFFS_close(fs, fd);
return SPIFFS_errno(fs);
}
}
if (SPIFFS_close(fs, fd) < SPIFFS_OK) return SPIFFS_errno(fs);
return SPIFFS_OK;
}
Instead of above, you should buffer as many fibonacci numbers as you can, and then write the buffer. But only without write cache. Otherwise, above is ok.
If you have space for read cache pages, then in 99.999% of all cases you should also enable SPIFFS_CACHE_WR
. It will not cost extra memory.
Opening a file given a filename will have spiffs search all blocks' first pages (the lookup pages) for object index headers - the file headers. Each found file header is then visited to see if the filename matches.
If lucky, the file is found quickly. If unlucky, lots of data needs to be filtered before the file is found.
There are a bunch of ways speeding this up.
If your application needs to frequently open and close files, and files are opened in a manner of temporal locality, enable
SPIFFS_TEMPORAL_FD_CACHE
Temporal locality in our case simply means that if a file was opened briefly before, it is likely to be opened again.
Examples could be some log file, being opened and updated often; or a http server that uses same css for all html pages: each access to an html file will also need to open the css file.
Let's construct a model of the latter example, the http server, on an ESP8266 for instance. Spiffs is configured to having 4*2kB logical block and 256 byte pages on 1 MB of the spi flash.
Cache is enabled, and spiffs is given eight cache pages and four file descriptors.
Our example has seven html files on the server residing on spiffs, each file being a couple of kilobytes. Each html file refers to a common css file. When a user accesses an html page, the http server implementation opens that html file, reads it, and the closes it. After that the css is opened, read, and closed.
Each html page have a certain probability of being visited by users.
File | Hit probability |
---|---|
1.html | 25% |
2.html | 20% |
3.html | 16% |
4.html | 12% |
5.html | 8% |
6.html | 5% |
7.html | 4% |
Running this in the testbench by randomizing access according to above probabilities 10.000 times, we get:
Sum HAL read accesses | Sum read bytes from spi flash | |
---|---|---|
no temporal cache | 1595258 | 389395837 |
with temporal cache | 276361 | 63361517 |
In this example, spiffs is more than 80% more efficient when it comes to number of HAL spi flash read accesses and number of read bytes from the spi flash with temporal cache enabled.
more TODO