Skip to content

Latest commit

 

History

History
331 lines (209 loc) · 9.29 KB

README.md

File metadata and controls

331 lines (209 loc) · 9.29 KB

Notice

I'm leaving Github. The main official location for this project is now: https://codeberg.org/RafaGago/logger-bench

Logger benchmarks

This is a project containing C/C++ log library benchmarks in a very easy to build fashion.

As many other log library authors, I had my own tests polluting my repositories. This drove me to create this separate project.

The codebase tries to remain trivial by sticking to the most basic C++ concepts only. Contributions with new loggers or test method improvements are welcome.

This library implements its own testing and integrates a Google benchmark implementation too.

Own implementation. Metodology

  • Tests performance at the calling site. Tailored for asynchronous loggers.

  • Tests are run for different thread counts. Logger threads set CPU affinity.

  • The throughput test is summing the time spent on each thread and dividing it by the number of messages.

  • The latency tests measure each thread entry with the wall clock and calculate the 99.9% and 50% percentiles. On my machine these tests show a lot of variation between runs.

Own implementation. Execution.

logger-bench <memory-bytes> <messages> own <iterations> <loggers>...

Where:

  • memory-bytes: Is a recommendation passed at construction of each logger to try to limit is memory usage. Only some loggers with bounded queues implement it. If you see that "faults" are reported, it's very likely that the logger was unable to empty its buffer (write to disk) as fast as the messages were generated.

  • messages: The number of messages to send each time.

  • own: Fixed string that selects the "own" benchmark subcommand.

  • iterations: The number of times to repeat the tests.

  • loggers: A space separated list of loggers to run. The possible values depend on how the library was compiled. run "logger-bench" 8MB 100000 own --help" to see a list of which loggers are available.

Notice that on Linux is beneficial to set the performance governor before running:

sudo cpupower frequency-set --governor performance

Restore the governor after testing.

Google benchmark. Metodology

All the tests above are reused and adapted to Google Benchmark. The metodology of Google benchmark is not documented here. You can check its repository:

https://github.com/google/benchmark

Google benchmark. Execution

logger-bench <memory-bytes> <messages> google [google benchmark args]

"memory-bytes" and "messages" are the same as in the own implementation, the only difference is that messages accepts zero values, when messages is 0 Google Benchmark tries to adjust the number of measurements dynamically.

"google benchmark args" are not documented here. See:

https://github.com/google/benchmark

Memory usage benchmark. Metodology

  • Uses the own implementation on this thread.

  • Creates as many benchmark executables as loggers are by using the script "build-each-in-onw-executables.sh" on the root of this project. So singleton instances don't interfere with other loggers.

  • Runs a single pass of 16 threads enqueueing 100000 messages in total (Using the script "measure-mem.sh" on the root of this project). The bounded loggers are given 8MB to configure its queues, so this peak load doesn't overflow the queues on a slow (by today's standards) machine. AFAIK the only ones supporting bounded queues are "mal" and "malc".

  • Measures by using /usr/bin/time -f "%M"

Notice that what is expected is:

  • Loggers with bounded queues to take a fixed amount of memory.

  • Loggers using the heap taking more memory the faster they are.

Memory usage benchmark. Execution

To execute, assuming that "BENCHMARK_PROJECT_DIR" points to the root of this project:

mkdir my-test-tmp-dir && cd my-test-tmp-dir
$BENCHMARK_PROJECT_DIR/build-each-logger-in-own-executable.sh
$BENCHMARK_PROJECT_DIR/measure-peak-mem.sh

Tested libraries

Here is a listing of the tested libraries. If you want to enable all use:

cmake .. -DENABLE_ALL=on

Enabling them all can cause different results than just having a logger for each executable, as some singleton loggers consume resources from the start.

mini-async-log-c

A C11 low-latency wait-free producer (when using Thread Local Storage) asynchronous data logger with type-safe strings.

https://github.com/RafaGago/mini-async-log-c

cmake .. -DMALC=on

Requires: C11/C++, meson + ninja

Adds four variants to the tests:

  • malc-tls: using thread local storage only.
  • malc-fixed: using a bounded buffer for memory allocation.
  • malc-fixed-cpu: using as many bounded buffers as CPUs for memory allocation.
  • malc-heap: using the heap for memory allocation.
  • malc-tls-heap: As malc-tls, but using the heap when the main buffer is exhausted;
  • malc-fixed-heap: As malc-fixed, but using the heap when the main buffer is exhausted;
  • malc-fixed-cpu-heap: As malc-fixed-cpu, but using the heap when the main buffer is exhausted;

No hybrid variations are tested. The library supports them.

mini-async-log

A performant asynchronous data logger with acceptable feature-bloat (C++).

https://github.com/RafaGago/mini-async-log-c

cmake .. -DMAL=on

Requires: partial C++11, cmake

Adds two variants to the tests:

  • mal-fixed: using a bounded buffer for memory allocation.
  • mal-heap: using the heap for memory allocation.

No hybrid variations are tested. The library supports them.

spdlog

Very fast, header-only/compiled, C++ logging library.

https://github.com/gabime/spdlog

cmake .. -DSPDLOG=on

Requires: C++11, header only.

Adds two variants to the tests:

  • spdlog-async: spdlog in asynchronous mode.
  • spdlog-sync: spdlog in synchronous mode.

Google log (glog)

C++ implementation of the Google logging module.

https://github.com/google/glog.git

cmake .. -DGLOG=on

Requires: C++, make

NanoLog (Standford)

Nanolog is an extremely performant nanosecond scale logging system for C++ that exposes a simple printf-like API.

https://github.com/PlatformLab/NanoLog

cmake .. -DNANOLOG=on

Requires: C++17, make

Adds two variants to the tests:

  • nanolog: Vanilla threads.
  • nanalog-tls: Calls "NanoLog::preallocate()" on each thread.

g3log

Asynchronous logger with Dynamic Sinks

https://github.com/KjellKod/g3log

cmake .. -DG3LOG=on

Requires: C++14, cmake

Nanolog (Iyengar111)

Low Latency C++11 Logging Library.

NOTE: this is not Standford's University NanoLog, but a logger with the same name that already existed.

https://github.com/Iyengar111/NanoLog

cmake .. -DNANOLOG_IYENGAR=on

Requires: C++11, header only.

Compilation

CMake is used only because of the convenience of "ExternalProject_Add" for this use case. Unfortunately CMake is much more practical than meson for this as of today.

Enablement of loggers requires to turn on CMake options (see CMakeLists.txt), e.g.:

cmake .. -DMALC=on -DMAL=on -DNANOLOG=on -DSPDLOG=on -GLOG=on -DG3LOG=on

Notice that some singleton loggers may add hidden resources like e.g. running threads that do a busy-loop, so for ultimately fair testing it is recommended to only add one logger for each generated executable.

Notes

As no comparison is fair, this random list tries to maintain a non biased summary of logger traits that one has to consider when looking at the results.

malc

  • The format string has to always be a literal.

  • Timestamps from boot time. Real calendar time has to be parsed from the file names (they contain the calendar time (from epoch) and the machine boot time at file creation time). If a conversion to calendar time is desired the log lines must be processed by an external script.

  • Configurable queue size. Shows faults on high message loads when not using the heap variant.

mal

  • Timestamping at the consumer side (can be configured not to).

  • The format string has to always be a literal.

  • Timestamps from boot time. Real calendar time has to be parsed from the file names (they contain the calendar time (from epoch) and the machine boot time at file creation time). If a conversion to calendar time is desired the log lines must be processed by an external script.

  • Configurable queue size. Shows faults on high message loads when not using the heap variant.

spdlog

  • No way to configure internal queue size(?). Arbitrary memory usage.

glog

  • No remarks. I know almost nothing about the implementation.

Nanolog (Standford)

  • No way to configure internal queue size (?). Arbitrary memory usage.

  • Binary-only logger. Not human-readable logs. A tool to convert the logs is provided.

  • It has a background compression thread waking up each microsecond (see runtime/Config.h). As this library is a singleton with no explicit initialization/destruction this thread is actually waking up when the benchmarks of other log libraries are running.

G3log

  • No remarks. I know almost nothing about the implementation.

Nanolog (Iyengar111)

  • Buffer sizes of 8MB are not configurable, so if a comparison has to be kept fair all the other loggers have to use 8MB buffers. Arbitrary memory usage.

  • Logs without time zone (UTC).