Currently, when there are both "known failing" test cases that fail as well as other unexpected failures, they can be interleaved together. Ideally, the "FAILURE" log messages (for unexpected errors) would all be grouped together, separately from the "INFO" log messages for known failures, so that it's easier to scan and see what the unexpected failures were.