Backport high disk utilization fix #9206

jwilder · 2017-12-07T15:51:33Z

Backport #9204

Required for all non-trivial PRs

Rebased/mergable
Tests pass
CHANGELOG.md updated
Sign CLA (if not already signed)

Required only if applicable

You can erase any checkboxes below this note if they are not applicable to your Pull Request.

InfluxQL Spec updated
Provide example syntax
Update man page when modifying a command
Config changes: update sample config (etc/config.sample.toml), server NewDemoConfig method, and Diagnostics methods reporting config settings, if necessary
InfluxData Documentation: issue filed or pull request submitted <link to issue or pull request>

O_SYNC was added with writing TSM files to fix an issue where the final fsync at the end cause the process to stall. This ends up increase disk util to much so this change switches to use multiple fsyncs while writing the TSM file instead of O_SYNC or one large one at the end.

With the recent changes to compactions and snapshotting, the current default can create lots of small level 1 TSM files. This increases the default in order to create larger level 1 files and less disk utilization.

The default max-concurrent-compactions settings allows up to 50% of cores to be used for compactions. When the number of cores is high (>8), this can lead to high disk utilization. Capping at 4 and combined with high snapshot sizes seems to keep the compaction backlog reasonable and not tax the disks as much. Systems with lots of IOPS, RAM and CPU cores may want to increase these.

This runs the scheduler every 5s instead of every 1s as well as reduces the scope of a level 1 plan.

The disk based temp index for writing a TSM file was used for compactions other than snapshot compactions. That meant it was used even for smaller compactiont that would not use much memory. An unintended side-effect of this is higher disk IO when copying the index to the final file. This switches when to use the index based on the estimated size of the new index that will be written. This isn't exact, but seems to work kick in at higher cardinality and larger compactions when it is necessary to avoid OOMs.

stuartcarnie

👍

jwilder added 7 commits December 7, 2017 07:58

Increase cache-snapshot-memory-size default

171b427

With the recent changes to compactions and snapshotting, the current default can create lots of small level 1 TSM files. This increases the default in order to create larger level 1 files and less disk utilization.

Schedule compactions less aggressively

4067c0b

This runs the scheduler every 5s instead of every 1s as well as reduces the scope of a level 1 plan.

Update changelog

955f41e

Fixup changelog

1f08796

jwilder added this to the 1.4.3 milestone Dec 7, 2017

jwilder requested a review from stuartcarnie December 7, 2017 15:51

ghost assigned jwilder Dec 7, 2017

ghost added the review label Dec 7, 2017

stuartcarnie approved these changes Dec 7, 2017

View reviewed changes

jwilder merged commit 50063f9 into 1.4 Dec 7, 2017

ghost removed the review label Dec 7, 2017

jwilder deleted the jw-14-backport branch December 7, 2017 17:06

jwilder mentioned this pull request Dec 13, 2017

Disk utilization fixes #9225

Merged

4 tasks

hahnjo mentioned this pull request Mar 5, 2018

InfluxDB 1.4.3 didn't increase cache-snapshot-memory-size #9508

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backport high disk utilization fix #9206

Backport high disk utilization fix #9206

jwilder commented Dec 7, 2017

stuartcarnie left a comment

Backport high disk utilization fix #9206

Backport high disk utilization fix #9206

Conversation

jwilder commented Dec 7, 2017

Required for all non-trivial PRs

Required only if applicable

stuartcarnie left a comment

Choose a reason for hiding this comment