High "mapped" memory usage and disk IO when tail-based sampling is enabled #13463

carsonip · 2024-06-20T15:31:18Z

APM Server version (apm-server version): confirmed on 8.13.2 but affects all versions including the latest 8.14.1

Description of the problem including expected versus actual behavior:

When tail-based sampling (TBS) is enabled, the memory usage will go as high as the local TBS database storage size. When viewing /proc/meminfo, most of the memory usage shows up as "Mapped". This is particularly noticeable in setups which consist of multiple apm-servers and receive high load.

Steps to reproduce:

Please include a minimal but complete recreation of the problem,
including server configuration, agent(s) used, etc. The easier you make it
for us to reproduce it, the more likely that somebody will take the time to
look at it.

start 2 separate apm-servers (calling them A and B) and send load to them independently using apmsoak. Wait until their local TBS database size grows to >1GB.
stop A, wait for 10mins, then restart A without sending load to it.
observe that the A's memory usage increases to approximately to its local TBS database size

The text was updated successfully, but these errors were encountered:

carsonip · 2024-06-20T16:11:50Z

Upon investigation, this is likely related to the prefetch behavior of local TBS badger database iterator, triggered by ReadTraceEvents which is called on every sampling decision received (both local and remote decisions). ReadTraceEvents cannot use the table's bloom filter because we are searching for events using trace ID, while a full key consists of both trace ID and txn/span ID, it has to use the iterator with a prefix. Prefetch behavior is enabled by default and set to 100 values, and it fetches values from vlog when using an iterator. Unfortunately, its implementation does not respect the prefix, meaning that even when prefix does not match, it still fetches 100 values from vlog.

This is mostly affecting setups with multiple apm-servers because e.g. apm-server A receive sampling decisions made by a remote apm-server B. And it is likely that the sampling decision is for a trace that A does not know and does not store. The right thing here to do is to scan the in-memory LSM tree to see if there's a prefix match, but in the current implementation, due to prefetch, it still scans vlogs for 100 values of irrelevant keys. As vlog files are mmap-ed, the scans will likely cause lots of page faults and read IO. These pages are stored in memory until they are evicted by OS. In a busy environment, and as trace IDs are randomly distributed, receiving lots of remote sampling decisions will likely scan all the vlogs and cause all the vlogs to stay in-memory, hence the memory usage that approximates to the size of local TBS database.

carsonip · 2024-06-20T16:14:11Z

Here's a minimal reproducible example of the issue, with memory and disk IO measurements: https://github.com/carsonip/tbs-badger-playground/tree/main/prefetch

carsonip added the bug label Jun 20, 2024

carsonip self-assigned this Jun 20, 2024

carsonip mentioned this issue Jun 20, 2024

TBS: Optimize for ReadTraceEvents miss #13464

Merged

carsonip closed this as completed in #13464 Jun 24, 2024

carsonip changed the title ~~High memory usage and disk IO when tail-based smapling is enabled~~ High memory usage and disk IO when tail-based sampling is enabled Jun 24, 2024

mergify bot mentioned this issue Jun 24, 2024

[8.14] TBS: Optimize for ReadTraceEvents miss (backport #13464) #13474

Merged

carsonip changed the title ~~High memory usage and disk IO when tail-based sampling is enabled~~ High "mapped" memory usage and disk IO when tail-based sampling is enabled Jun 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

High "mapped" memory usage and disk IO when tail-based sampling is enabled #13463

High "mapped" memory usage and disk IO when tail-based sampling is enabled #13463

carsonip commented Jun 20, 2024

carsonip commented Jun 20, 2024

carsonip commented Jun 20, 2024

High "mapped" memory usage and disk IO when tail-based sampling is enabled #13463

High "mapped" memory usage and disk IO when tail-based sampling is enabled #13463

Comments

carsonip commented Jun 20, 2024

carsonip commented Jun 20, 2024

carsonip commented Jun 20, 2024