
Improve test run output #139

Open
5 of 10 tasks
cgay opened this issue Jan 8, 2021 · 4 comments
cgay commented Jan 8, 2021

I'd like to see several improvements to Testworks output:

  • An option to show no progress output except for failing tests; --progress failures, I suppose. That is, don't display passed, not-implemented, or expected-to-fail results.
  • It should be rare for tests to write directly to the console; when they do, it's generally for debugging. (threads-test-suite stands out as producing a lot of essentially meaningless output.) All such debug output should go through test-output:testworks and should by default be written to test-temp-directory()/test-name.log, with an option to also send it to the console (--test-output=all, for example; I'm not sure of the name).
  • Display test run time as a number of microseconds, or heuristically as ms, µs, or ns. My time library already does this, so perhaps wait and use that. Seconds is just too big a unit these days.
  • Always display the final summary output. Specifically, with the options passed by make check (--report json --report-file foo.json) I still want the final count of passing/failing tests shown on the terminal.
  • No need to display passing/failing suites. I've never found this remotely useful.
  • In the summary output, display the counts for tests last, after checks, because that last line should have the info I really care about.
  • Don't display "0 benchmarks". (This might already be the case?) Our plan is to run benchmarks as separate libraries.
  • Possibly: note which tests are new after each --load so they can be associated with their .so file; then we can execute them in that same order and display the .so filename before any progress output for that set of tests. (The output from make check is currently pretty random, and it's hard to know which tests belong to which library.)
  • Can probably elide "Running test " before each test in the progress output and just print the test name.
  • Can greatly improve output for test-equal failures:
    test-encode/decode-varint failed
        encode-varint-bytes(i) = bytes: [{stretchy vector 128, 128, 128, 128, 248, 255, 255, 255, 255, 127} and {stretchy vector 3} are not =.  sizes differ (10 and 1), element 0 is the first non-matching element]
          FAILED in 0.002164s and 58KiB
    
    should become something like this
    test-encode/decode-varint: values differ starting at element 0
        encode-varint-bytes(i): size=10 {stretchy vector 128, 128, 128, 128, 248, 255, 255, 255, 255, 127}
        bytes:                  size=1 {stretchy vector 3}
    

Those are my thoughts. We'll see how it develops.
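
The heuristic time-unit display suggested above (seconds, ms, µs, ns) could be sketched as follows. This is a minimal illustration in Python (Testworks itself is written in Dylan); the function name and thresholds are made up for this sketch:

```python
def format_duration(seconds: float) -> str:
    """Pick a human-friendly unit heuristically: s, ms, µs, or ns."""
    if seconds >= 1.0:
        return f"{seconds:.3f}s"
    ms = seconds * 1e3
    if ms >= 1.0:
        return f"{ms:.3f}ms"
    us = seconds * 1e6
    if us >= 1.0:
        return f"{us:.0f}µs"
    return f"{seconds * 1e9:.0f}ns"

print(format_duration(0.000822))  # → 822µs, matching the example elsewhere in this issue
print(format_duration(0.002164))  # → 2.164ms
```

With a helper like this, "FAILED in 0.002164s" becomes "FAILED in 2.164ms" with no loss of information.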

@cgay cgay added the output label Jan 8, 2021
housel commented Jan 8, 2021

Don't display "0 benchmarks", and if there are no tests (because it's a benchmark suite) then don't display "0 tests". The two cases can be symmetric.

cgay added a commit to cgay/testworks that referenced this issue Jan 17, 2021
* Normally only display the test count OR the benchmark count.
  If they're both 0, both are displayed.
* Don't display the number of suites that passed/failed.
* Display test/benchmark counts after checks instead of before.

part of dylan-lang#139
cgay commented Nov 27, 2021

None of this makes sense:

$ ../_build/bin/time-test-suite --test test-load-tzif-version-1
Running test test-load-tzif-version-1: NOT IMPLEMENTED in 0.000822s and 68KiB

test-load-tzif-version-1 not implemented

Ran 0 checks: PASSED
Ran 1 test: PASSED (1 not implemented)
NOT IMPLEMENTED in 0.000822 seconds
  • Huge amounts of duplication. I don't need to be told 4 times that the test is not implemented.
  • "Ran 0 checks: PASSED" should not say PASSED. Either say NOT IMPLEMENTED or just leave it blank.
  • "Ran 1 test: PASSED (1 not implemented)" -- which is it? passed or not implemented?
  • "NOT IMPLEMENTED in 0.000822 seconds" -- Strange wording. Avoid the issue by saying "Result: NOT IMPLEMENTED, Total run time: 822µs"

cgay commented Oct 29, 2022

I still find this output for failed checks to be pretty bad:

FAILED: $expected-failure = test-results.result-status [#"expected-failure" and #"unexpected-success" are not =.]

I think better would be

FAILED: #"expected-failure" ~= #"unexpected-success" for assert-equal($expected-failure, test-results.result-status)

Or even better, a multi-line display:

FAILED: assert-equal($expected-failure, test-results.result-status)
    Expected: #"expected-failure"
    Got: #"unexpected-success"
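
The proposed multi-line shape is straightforward to render. A hedged Python sketch (illustrative only; Testworks is written in Dylan, and format_failure is a hypothetical name, taking pre-printed strings for the expression and the two values):

```python
def format_failure(expr: str, want: str, got: str, indent: str = "    ") -> str:
    """Render an assert-equal failure in the proposed multi-line style."""
    return f"FAILED: {expr}\n{indent}Expected: {want}\n{indent}Got: {got}"

print(format_failure('assert-equal($expected-failure, test-results.result-status)',
                     '#"expected-failure"', '#"unexpected-success"'))
```

Keeping the expression on the first line and the two values aligned beneath it makes the mismatch scannable without re-parsing a one-line "X and Y are not =" message.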

cgay added a commit to cgay/testworks that referenced this issue Sep 30, 2023
* Use multi-line output with "want" and "got" values, and detailed output if
  present, on separate lines.
* Don't sort detail results since they aren't guaranteed to be sortable.
* Separate do-check-equal into same plus do-check-not-equal; it's less
  complex.

dylan-lang#139
cgay commented Jun 12, 2024

assert-equal output now looks like this:

FAILED: lox-test-suite
  FAILED: test-eval-precedence
    FAILED: 7 = %eval("2*(3+4)/3;")
      want: 7
      got:  4.6666667d0
  FAILED: test-eval-program
    FAILED: #[1, 2, 3, 4] = #[1, 2, 5, 4]
      want: #[1, 2, 3, 4]
      got:  #[1, 2, 5, 4]
      detail: element 2 is the first mismatch
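
The "detail: element 2 is the first mismatch" line implies a small sequence-diff helper. A Python sketch of the idea (illustrative; Testworks is written in Dylan, and first_mismatch is a made-up name):

```python
def first_mismatch(want, got):
    """Return the index of the first differing element, or None if the
    sequences are equal. If one sequence is a strict prefix of the other,
    the first "missing" index counts as the mismatch."""
    for i, (w, g) in enumerate(zip(want, got)):
        if w != g:
            return i
    return None if len(want) == len(got) else min(len(want), len(got))

print(first_mismatch([1, 2, 3, 4], [1, 2, 5, 4]))  # → 2, as in the "detail" line above
```

The same index can also drive the earlier varint example's "sizes differ (10 and 1), element 0 is the first non-matching element" detail.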
