Testing Logs

Testing logs properly is hard

An all-too-familiar story…?

We’ve all been there: we decided to be good citizens and test our debug logs. Perhaps we added a bunch of new logs while debugging, or maybe we discovered that the existing logs weren’t very useful.

Then we find the test code for the class and look for the existing logging tests. Are there any? Do they pass? If we changed existing log statements, how many of them did we break?

At some point we perhaps asked ourselves whether it’s even worth adding logging tests when the existing code wasn’t well tested. After all, what’s the worst that can happen?

Or maybe you’re on the other side of the fence: an engineer tasked with finding the cause of an urgent issue. You’ve enabled extra logging to see what’s going on, but now tasks are failing left and right! What’s happening?!

By enabling extra logging you just introduced a lot of previously untested code paths into a production system. Maybe insufficiently tested classes are now failing when toString() is called, or memory usage is spiking because of the large increase in log output. Or perhaps increased thread contention is affecting user latency as large, mutable objects are locked for formatting. You don’t know for sure what’s going on, but what was a smoothly running system is suddenly spluttering to a halt.

And you haven't even started to debug the issue you came here for!

Or to put it another way:

“When you are up to your ass in alligators, it’s hard to remember that you started out to drain the swamp” – Robert Anton Wilson

Good logging hygiene and good logs testing go hand in hand. Software engineers need to be encouraged to write good debug logs, but they also need to be able to write and maintain tests for them. Without an easy-to-use logging API combined with an equally easy-to-use logs testing API, logs testing is likely to end up an afterthought during development.

What’s different about logs testing?

Unlike most other programming statements, log statements never return a value which can be checked, and they are expected not to cause any observable state changes within the program. In an ideal system, a log statement would never affect, or be affected by, the surrounding code, whether enabled or not. This means that many of the normal approaches to unit testing, such as asserting expected values or using test doubles, don’t apply well to logs testing.
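
For contrast, here is a minimal sketch of the traditional “test double” approach with plain JDK logging: a hand-rolled capturing Handler (ClassUnderTest, doTestTask and goodRequest are the same hypothetical names used in the examples later in this post):

import static com.google.common.truth.Truth.assertThat;

import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.LogRecord;
import java.util.logging.Logger;
import org.junit.Test;

public class ClassUnderTestLoggingTest {
  // The hand-rolled capturing handler every test suite seems to grow eventually.
  static final class CapturingHandler extends Handler {
    final List<LogRecord> records = new ArrayList<>();

    @Override public void publish(LogRecord record) { records.add(record); }
    @Override public void flush() {}
    @Override public void close() {}
  }

  @Test
  public void testTaskLogging_byHand() {
    Logger logger = Logger.getLogger(ClassUnderTest.class.getName());
    CapturingHandler handler = new CapturingHandler();
    logger.addHandler(handler);
    try {
      ClassUnderTest.doTestTask(goodRequest);
      // Brittle: depends on exact wording, formatting and record ordering, and
      // silently captures nothing if the logger name is wrong.
      assertThat(handler.records.get(0).getMessage()).contains("Task");
    } finally {
      logger.removeHandler(handler);
    }
  }
}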

Another important difference is that in many systems, the majority of log statements are disabled by default (i.e. “fine” logging). This means that in normal unit tests there isn’t even implicit testing of these log statements as the surrounding code is executed. This risks hiding important bugs until logging is enabled, which is often the worst time to discover them.
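
To make this concrete, here is a sketch of how a disabled log statement can hide a latent bug until the worst possible moment (TaskRunner, Step and summarize() are hypothetical; lazy() is Flogger’s standard lazy-argument helper):

import static com.google.common.flogger.LazyArgs.lazy;

import com.google.common.flogger.FluentLogger;

public class TaskRunner {
  private static final FluentLogger logger = FluentLogger.forEnclosingClass();

  void runStep(Step step) {
    // At the default INFO level this statement is disabled, so the lazy
    // argument is never evaluated, and neither is any bug in summarize().
    // Ordinary unit tests execute this line without ever exercising it; the
    // bug surfaces only when FINE is enabled, quite possibly in production
    // while debugging something else.
    logger.atFine().log("running step: %s", lazy(() -> summarize(step)));
  }

  private static String summarize(Step step) {
    // Imagine a latent NullPointerException here for steps with no payload.
    return step.name() + " (" + step.payload().size() + " bytes)";
  }
}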

Anecdotally, from my time at Google, a good rule of thumb is that each additional level of logging you enable (e.g. INFO⟶FINE or FINE⟶FINER) produces approximately ten times more log output; finer logs typically sit in innermost loops and log larger payloads. Enable two extra levels and you are looking at roughly 10 × 10, i.e. more than 100 times the output, and not many production systems will cope well with suddenly being hit by that.

So if you want to test all your log statements, they need to be enabled; but if you enable all your logging, you are no longer testing a representative version of your code, since latency changes and unintended side effects from logging could be masking other issues. So do you enable fine-grained logging in tests or not? You need to be able to easily write tests both with and without debug logging enabled.

Help is at hand

Having seen, over many years at Google, the issues caused by insufficient or overly brittle logs testing, I decided to finally do something about it.

Introducing the Flogger Logs Testing Library: an easy-to-use, readable, powerful API for logs testing. It’s designed for Flogger, but it also works with other logging libraries (with a slightly reduced feature set).

Would you like to be able to write readable log assertions like:

logs.assertLogs().withLevelAtLeast(WARNING).always().haveMetadata("request_id", REQUEST_ID);

or:

var taskStart =
    logs.assertLogs().withLevel(INFO).withMessageContaining("Task", "[START]").getOnlyMatch(); 
logs.assertLogs(after(taskStart).inSameThread()).withLevel(WARNING).doNotOccur();

How about easily writing tests which exercise additional logging over the same code?

In the first test, logging is set to the default for the test class (e.g. “INFO”) and we don’t test all the fine logs that might appear (in normal use fine logs would not even be captured):

@Test
public void testSuccessfulTask_infoLogs() {
  // In normal execution, no warning logs occur.
  logs.verify(assertLogs -> assertLogs.withLevel(WARNING).doNotOccur());

  // On success we see a sequence of INFO logs summarizing normal operation.
  var result = ClassUnderTest.doTestTask(goodRequest);
  assertThat(result.status()).isEqualTo(Task.SUCCESS);
  // Now do the usual testing of "info" logs for a successful operation...
  ...
}

@Test
@SetLogLevel(scope = CLASS_UNDER_TEST, level = FINEST)
public void testSuccessfulTask_enableAllLogs() {
  // Assert that the task completed as expected with all logging enabled. Knowing that all logging
  // code can be enabled without causing more problems is very valuable.
  var result = ClassUnderTest.doTestTask(goodRequest);
  assertThat(result.status()).isEqualTo(Task.SUCCESS);

  // Extract the subset of debug logs we care about testing (we tested "info" logs above).
  var debugLogs = logs.assertLogs().withLevelLessThan(INFO);
  // Without checking the details, assert that an expected number of logs occurred.
  debugLogs.matchCount().isAtLeast(30);
  // Perhaps also test a specific logs policy (e.g. not using "fine" logs to report exceptions).
  debugLogs.never().haveCause();
  ...
}

And now an additional test, which runs the same code but triggers a failure:

@Test
@SetLogLevel(scope = CLASS_UNDER_TEST, level = FINE)
public void testFailedTask_debugLogs() {
  // On failure we see an initial warning followed by numerous FINE log statements, all of which
  // we expect to have the correct task ID attached (among other things).
  ClassUnderTest.doTestTask(badRequest);

  var firstWarning = logs.assertLogs()
      .withLevel(WARNING)
      .withMessageContaining("[FAILED]", TEST_TASK_NAME)
      .getMatch(0);
  // Extract a subset of the logs after a specific event.
  var fineLogs = logs.assertLogs(after(firstWarning).inSameThread()).withLevel(FINE);
  fineLogs.matchCount().isAtLeast(10);
  // Assert that the logs we care about have a task ID to help debugging.
  fineLogs.always().haveMetadata("task_id", TEST_TASK_ID);
  // Perhaps also test some specific expectations about what should not be in these logs.
  fineLogs.withMessageContaining("load", "path=").never().haveMessageContaining("Access Denied");
  ...
}

Note how, even if “FINE” log statements are modified or new ones are added, this test is not brittle, since it only tests for the presence of the expected task ID.
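
Compare that to a hand-rolled golden assertion (reusing the hypothetical CapturingHandler from earlier; the message text is invented for illustration), which breaks the moment anyone rewords a message or adds a log statement before it:

// Brittle: fails if the wording, formatting or ordering of the logs changes.
assertThat(handler.records.get(3).getMessage())
    .isEqualTo("Task [test-task]: loading path=/data/test-task (attempt 1)");

// Robust: asserts only the property we actually care about.
fineLogs.always().haveMetadata("task_id", TEST_TASK_ID);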

Having seen these simple examples:

  • How much code would it take you to write equivalent tests with your current logs testing API?
  • Have you considered writing this sort of logging test before?
  • Do you even have a standard logs testing API?

Summary

If the idea of powerful, readable, easy-to-maintain logging tests appeals to you, learn more at https://github.com/hagbard/flogger-testing.

What’s more, this framework still works, in a more limited way, if you’re just using JDK logging or Log4J directly. Test fixtures (e.g. FloggerTestRule) can still be installed, and logs are still captured and can be tested, but you’ll have to manage setting log levels yourself.
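
For example, a minimal JUnit 4 setup looks something like this (the forClassUnderTest factory and LevelClass import follow the project’s documentation; check the repository for the current API):

import static net.goui.flogger.testing.LevelClass.INFO;

import net.goui.flogger.testing.junit4.FloggerTestRule;
import org.junit.Rule;
import org.junit.Test;

public class ClassUnderTestTest {
  // Captures logs from the class under test at INFO and above.
  @Rule public final FloggerTestRule logs = FloggerTestRule.forClassUnderTest(INFO);

  @Test
  public void testTaskLogging() {
    ClassUnderTest.doTestTask(goodRequest);
    logs.assertLogs().withLevel(INFO).matchCount().isAtLeast(1);
  }
}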

Install the logs testing API and get started today:

<!-- https://mvnrepository.com/artifact/net.goui.flogger.testing/junit4 -->
<dependency>
    <groupId>net.goui.flogger.testing</groupId>
    <artifactId>junit4</artifactId>  <!-- or junit5 -->
    <version>${flogger-testing-version}</version>
    <scope>test</scope>
</dependency>

And if you’re using Log4J:

<!-- https://mvnrepository.com/artifact/net.goui.flogger.testing/log4j -->
<dependency>
    <groupId>net.goui.flogger.testing</groupId>
    <artifactId>log4j</artifactId>
    <version>${flogger-testing-version}</version>
    <scope>test</scope>
</dependency>