-
Notifications
You must be signed in to change notification settings - Fork 108
/
Copy pathexamining_grootfs_disk.html.md.erb
301 lines (214 loc) · 12.6 KB
/
examining_grootfs_disk.html.md.erb
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
---
title: Examining GrootFS disk usage in Cloud Foundry
owner: Garden
---
<%# Reset page title based on platform type %>
<% if vars.platform_code != 'CF' %>
<% set_title("Examining GrootFS disk usage in", vars.app_runtime_abbr) %>
<% end %>
You can analyze GrootFS disk space usage in <%= vars.app_runtime_abbr %>. The following information gives you important
background knowledge and the steps to analyze GrootFS disk space usage.
You run the commands as root on any BOSH-deployed VM that hosts the Garden job in a <%= vars.app_runtime_first %> deployment.
<p> The following information provides different commands
depending on whether you are using privileged or unprivileged containers in your deployment.
By default, all deployments use unprivileged containers.
For more information about these container types,
see <a href="../concepts/container-security.html#security">Container Security</a>.</p>
For more information about the GrootFS concepts, see [GrootFS Disk Usage](../concepts/grootfs-disk.html).
## <a id='space'></a> Reconciling container and host disk usage
To reconcile disk allocations for containers with the actual disk usage on the host VM, you need to understand how GrootFS uses its disk.
Command line tools such as `du` and `df` can provide misleading information because of the way container file systems work.
For example, the following situations can occur:
<table class="table">
<thead>
<tr>
<th>If...</th>
<th>Then...</th>
</tr>
</thead>
<tr>
<td>the Diego cell rep appears to be out of disk capacity, but the actual disk usage on the Garden host is low</td>
<td>Diego does not schedule containers on the cell.</td>
</tr>
<tr>
<td>the Diego cell rep appears to have available disk capacity, but the combined space used by containers and system components prevents Diego from allocating the remaining disk space
</td>
<td>Diego continues to place containers on the cell, but they fail to start.</td>
</tr>
</table>
## <a id='disk_usage'></a> About container disk usage
On disk, the read write layer for each container can be found at `/var/vcap/data/grootfs/store/unprivileged/images/CONTAINER-ID/diff`.
When GrootFS calls on the built in XFS quota tooling to get disk usage for a container,
it takes into account data written to that directory and not the data in the read only volumes.
Running `grootfs stats` returns the following values:
* `total_bytes_used`: Disk usage of the container **including** the rootfs image volumes.
* `exclusive_bytes_used`: Disk usage of the container **not including** the rootfs image volumes.
### <a id='disk_usage_proc'></a> Retrieve Disk Usage Stats for a single container
To obtain the disk usage stats of a single container, use the following steps:
1. In your <%= vars.app_runtime_abbr %> deployment, use `bosh ssh` to connect to the BOSH deployed VM running the Garden job.
1. On the VM, run the following command to look up the container ID:
```
ls /var/vcap/data/garden/depot/
```
The previous command returns a container ID in the following format:
<pre class="terminal">
55afbf65-5cbf-49c6-4461-f803
</pre>
1. Based on the container type used in your deployment, run one of the following commands on the VM.
- For unprivileged containers, run the following command:
```
/var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/grootfs_config.yml stats CONTAINER-ID
```
Where `CONTAINER-ID` corresponds to the container ID you obtained in step 2.<br/>
For example:
<pre class="terminal">
$ /var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/grootfs_config.yml stats 55afbf65-5cbf-49c6-4461-f803
</pre>
- For privileged containers, run the following command:
```
/var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/privileged_grootfs_config.yml stats CONTAINER-ID
```
Where `CONTAINER-ID` corresponds to the container ID you obtained in step 2.
For example:
<pre class="terminal">
$ /var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/privileged_grootfs_config.yml stats 55afbf65-5cbf-49c6-4461-f803
</pre>
The previous commands return output in the following format:
<pre class="terminal">
{"disk_usage":{"total_bytes_used":23448093,"exclusive_bytes_used":8192}}
</pre>
### <a id="exclusive-disk-all"></a> Retrieve exclusive disk usage stats for all running containers
To verify the disk usage of all running containers, use the following steps:
1. In your <%= vars.app_runtime_abbr %> deployment, use `bosh ssh` to connect to the BOSH deployed VM running the Garden job.
1. Based on the container type used in your deployment, run one of the following commands on the VM:
- For unprivileged containers, run the following command:
```
ls /var/vcap/data/garden/depot/ \
| xargs -I{} /var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/grootfs_config.yml stats {} \
| cut -d: -f4 | cut -d} -f1 | awk '{sum += $1} END {print sum}'
```
- For privileged containers, run the following command:
```
ls /var/vcap/data/garden/depot/ \
| xargs -I{} /var/vcap/packages/grootfs/bin/grootfs --config \
/var/vcap/jobs/garden/config/privileged_grootfs_config.yml stats {} \
| cut -d: -f4 | cut -d} -f1 | awk '{sum += $1} END {print sum}'
```
The previous commands return the total disk usage in bytes for all running containers.
## <a id="underlying-volume"></a> About volumes in GrootFS
Underlying layers are known as `volumes` in GrootFS.
They are read only and the changesets are layered together through an **OverlayFS** mount to create the rootfs for containers.
When GrootFS writes each file system volume to disk, it also stores the number of bytes written to a file in the `meta` directory.
### <a id="volume-check"></a> Check volume disk size
To find out the size of an individual volume, you can read the corresponding metadata file
or run `du` on the volume itself:
1. In your <%= vars.app_runtime_abbr %> deployment, use `bosh ssh` to connect to the BOSH deployed VM running the Garden job.
1. Based on the container type used in your deployment, run one of the following commands on the VM:
- For unprivileged containers, run the following command on the VM:
```
cat /var/vcap/data/grootfs/store/unprivileged/meta/volume-VOLUME-SHA
```
Where `VOLUME-SHA` corresponds to the SHA value of the volume.
- For privileged containers, run the following command:
```
cat /var/vcap/data/grootfs/store/privileged/meta/volume-VOLUME-SHA
```
Where `VOLUME-SHA` corresponds to the SHA value of the volume.
The `cat` commands above return the volume size in bytes in the following format:
<pre class="terminal">
{"Size":5607885}
</pre>
1. Alternatively, use `du` and pass the absolute path to the volume. Run one of the following commands on the VM:
- For unprivileged containers, run the following command:
```
du -sch /var/vcap/data/grootfs/store/unprivileged/volumes/VOLUME-SHA/
```
Where `VOLUME-SHA` corresponds to the SHA value of the volume.
- For privileged containers, run the following command:
```
du -sch /var/vcap/data/grootfs/store/privileged/volumes/VOLUME-SHA/
```
Where `VOLUME-SHA` corresponds to the SHA value of the volume.
The `du` commands return the volume size in the following format:
<pre class="terminal">
5.4M /var/vcap/data/grootfs/store/unprivileged/volumes/VOLUME-SHA/
</pre>
## <a id="free-space"></a> Determine disk usage and reclaimable disk space
This section describes how to calculate the amount of disk space is in use and estimate how much space is reclaimable.
### <a id="active-volumes"></a> Calculate disk use by all active volumes
For each container, GrootFS mounts the underlying volumes using overlay to a point in the `images` directory.
This point is the rootfs for the container and is read write.
GrootFS also stores the SHA of each underlying volume used by an image in the `meta` folder.
You can determine the bytes of all active volumes on disk by running one of the following commands:
If you do not have <code>python3</code> installed, replace <code>python3</code>
with <code>python</code> in the following commands.
- For unprivileged containers, run the following command:
```
for image in $(ls /var/vcap/data/grootfs/store/unprivileged/meta/dependencies/image\:*.json); \
do cat $image | python3 -c 'import json,sys;obj=json.load(sys.stdin); \
print("\n".join(obj))' ; done | sort -u \
| xargs -I{} cat /var/vcap/data/grootfs/store/unprivileged/meta/volume-{} \
| cut -d : -f 2 | cut -d} -f1 \
| awk '{sum += $1} END {print sum}'
```
- For privileged containers, run the following command:
```
for image in $(ls /var/vcap/data/grootfs/store/privileged/meta/dependencies/image\:*.json); \
do cat $image | python3 -c 'import json,sys;obj=json.load(sys.stdin); \
print("\n".join(obj))' ; done | sort -u \
| xargs -I{} cat /var/vcap/data/grootfs/store/privileged/meta/volume-{} \
| cut -d : -f 2 | cut -d} -f1 \
| awk '{sum += $1} END {print sum}'
```
The previous commands return the total number of bytes used by all active volumes on disk.
### <a id="store"></a> Calculate GrootFS store disk usage
To determine how much total disk space the store is using, run the following command:
```
df | grep -E "/var/vcap/data/grootfs/store/(privileged|unprivileged)$" \
| awk '{sum += $3} END {print sum}'
```
The preceding command returns the total number of bytes used by the store.
### <a id="pruning-volumes"></a> Calculate reclaimable disk space
You can use values gathered from the commands above to calculate how much space can be cleared in GrootFS.
Garbage collection reclaims disk space by pruning unused volumes.
The overall formula to calculate reclaimable disk space is to subtract the total disk in use by the store from the total disk used by active volumes.
For example, perform the following steps:
1. Calculate how much disk space the store is using by following the instructions in [Calculate GrootFS Store Disk Usage](#store).
For example, your result might be:
```
Total disk store = 5607885 bytes
```
1. Calculate how much disk space active volumes are using by following the instructions in [Calculate Disk Use by All Active Volumes](#active-volumes).
For example, your result might be:
```
Active volumes = 3212435 bytes
```
1. Subtract the amount of space used by active volumes from the space used by the store. For example, your result might be:
```
5607885 - 3212435 = 2395450 bytes
```
In this example, you can reclaim 2395450 bytes through garbage collection.
### <a id="making-space"></a> How GrootFS reclaims disk space
The thresholder component calculates and sets a value so that GrootFS's garbage collector can attempt to ensure
that a small reserved space is kept free for other jobs.
GrootFS only tries to garbage collect or reclaim space when that threshold is reached.
However, if all the rootfs layers are actively in use by images, then garbage collection cannot occur and that space is used up.
If you determine that there is not enough reclaimable disk and more space is needed on disk,
you can scale up your VMs to a larger size or add more VMs to provide more disk space.
<%= vars.grootfs_reserved_space %>
## <a id="categories"></a> Other categories of GrootFS disk usage
There might be categories of GrootFS disk usage other than those listed in the above sections.
However, the bulk of disk usage is stored in the `images/CONTAINER-ID/diff` and `volumes` directories, so these are rarely taken into consideration when calculating store usage.
You can find these directories under `/var/vcap/data/grootfs/store/unprivileged` for unprivileged containers and `/var/vcap/data/grootfs/store/privileged` for privileged containers.
GrootFS also stores information in the following directories:
- `l`: link directories. Shorter directory names are symlinked to volume directories to allow Groot to union mount more file paths.
- `locks`: file system lock directory to ensure safety during concurrent cleans and creates.
- `meta`: per image and volume metadata.
- `projectids`: empty numbered directories used to track image quotas.
- `tmp`: normal temporary directory contents.
These directories typically use less than 2 MB disk in total.