Timed preprocessing #184

Open
DilipSequeira opened this issue Oct 23, 2020 · 10 comments
Labels
WG v1.1 / backlog: WG will track this item for v0.7 for resolution

Comments

@DilipSequeira
Contributor

For the March '21 round (1.0?), we would like to see consideration of more timed preprocessing in the datacenter scenarios, specifically for the image models and for 3D-UNet. For edge, it makes sense that the submitter gets to choose the format, because input is often coming in from a camera pipeline; for datacenter, input will typically be some form of compressed data (e.g., JPEG).
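For concreteness, a minimal sketch of what moving decoding into the timed region could look like. This is illustrative only, not the actual LoadGen API or reference harness: the PIL/NumPy decode path and the `run_inference` callable are assumptions.

```python
import io
import time

import numpy as np
from PIL import Image

def decode_and_preprocess(jpeg_bytes, size=(224, 224)):
    # Hypothetical ResNet50-style preprocessing: decode the JPEG, resize,
    # scale to [0, 1], and lay the result out as a 1x3xHxW float tensor.
    img = Image.open(io.BytesIO(jpeg_bytes)).convert("RGB").resize(size)
    x = np.asarray(img, dtype=np.float32) / 255.0
    return np.transpose(x, (2, 0, 1))[np.newaxis, ...]

def timed_query(jpeg_bytes, run_inference):
    # Under this proposal the measured latency covers decode + preprocessing
    # + inference, rather than starting from an already-decoded tensor.
    start = time.perf_counter()
    tensor = decode_and_preprocess(jpeg_bytes)
    result = run_inference(tensor)
    return result, time.perf_counter() - start
```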

Let's discuss in the WG.

@tjablin
Collaborator

tjablin commented Oct 27, 2020

I am broadly supportive of this change. In a perfect world, we would do this at the same time we switch to loadgen-over-network, but I don't think we have time this round for loadgen-over-network. For consistency, should we time all pre-processing?

@tjablin tjablin linked a pull request Oct 27, 2020 that will close this issue
@christ1ne
Contributor

christ1ne commented Oct 27, 2020

Proposal:

  • What? For datacenter only, and for ResNet50, SSD-ResNet34, and 3D-UNet: the input is now JPEG images; the output is the same as before (see the sketch below).
  • When? TBD
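As a hedged sketch of what that would mean on the data side (the file layout and helper names are hypothetical, not the reference implementation): the sample library would serve compressed bytes instead of pre-decoded tensors, so decoding moves into the measured window.

```python
from pathlib import Path

import numpy as np

# Status quo: samples are tensors decoded and resized offline, so JPEG
# decoding never shows up in the measured latency.
def load_sample_untimed(sample_dir: Path, name: str) -> np.ndarray:
    return np.load(sample_dir / f"{name}.npy")

# Proposal: samples are the raw JPEG bytes; decoding happens in the timed
# path (see the decode_and_preprocess sketch above).
def load_sample_timed(sample_dir: Path, name: str) -> bytes:
    return (sample_dir / f"{name}.jpg").read_bytes()
```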

@TheKanter
Contributor

WG comments:

This is for data center only, under the theory that data center inputs are usually compressed. OTOH, edge inputs are often raw.

Also, timed pre-processing should be added to all benchmarks (it is already present in RNN-T, and many other benchmarks need none).

This is a good topic for more discussion.

@aaronzhongii

As Scott pointed out, this is unfair to inference-only chip vendors. If preprocessing is timed, it is really hard to interpret a result in which a third-party image decompressor plays a huge role. Only GPUs have the capability to handle both at the same time, so if MLPerf promotes this, does that mean the MLPerf WG prefers GPUs over inference-only chips?

@christ1ne
Contributor

  • Need to consider whether this will further narrow the pool of potential submitters.
  • Better to get use cases from the vision advisory board.
  • Will hear from David's survey of the v0.7 submitters and non-submitters.

@tjablin
Collaborator

tjablin commented Nov 3, 2020

Only GPUs have the capability to handle both at the same time, so if MLPerf promotes this, does that mean the MLPerf WG prefers GPUs over inference-only chips?

MLPerf ought to reward good designs. Image decompression is an important part of inference for many workloads. It is appropriate that better architectures with more capabilities have higher performance as measured by MLPerf.

If preprocessing is timed, it is really hard to interpret a result in which a third-party image decompressor plays a huge role.

The only performance that benefits customers is end-to-end performance. If a chip is decompression-limited, it is misleading to publish numbers that ignore this limitation. MLPerf ought to publish performance numbers that most clearly reflect real-world performance. There are already high-performance open-source image decompression libraries, so it is unlikely anyone will gain an advantage by optimizing them. Submitters with dedicated hardware for image decoding ought to be rewarded for their ingenuity.

Hopefully, measuring preprocessing time will guide submitters toward measuring systems with realistic balances of decompression and inference capacity.
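A back-of-envelope illustration with hypothetical numbers: if an accelerator can run inference at 10,000 images/s but its host can only decode JPEGs at 2,000 images/s, the timed end-to-end throughput is bounded by the slower stage, min(10,000, 2,000) = 2,000 images/s, so an inference-only number would overstate deliverable performance by 5x.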

@christ1ne
Contributor

christ1ne commented Nov 10, 2020

WG: will update with survey results next week. If there is no consensus among submitters, we will rely on the future vision advisory board.
Scott suggested a separate end-to-end benchmark that tracks the whole system, including the networking card and graphics, and somehow accounts for their cost.

@christ1ne christ1ne reopened this Nov 10, 2020
@christ1ne
Contributor

@TheKanter will follow up on data center specific submitters.

@christ1ne christ1ne added the WG v1.1 / backlog label and removed the WG v1.0 label Nov 17, 2020
@tjablin
Collaborator

tjablin commented Nov 30, 2020

I think we are out of time to land this for 1.0. I propose merging this with loadgen-over-network and aiming for Inference 1.1. Dilip, what do you think?

@DilipSequeira
Contributor Author

Agreed on all counts.
