Improve handling of perceptual encoding metrics #392

lexaknyazev · 2021-04-08T19:54:54Z

Basis encoder has a "perceptual" flag. When enabled, it assigns uneven weights to red, green, and blue channels for computing the error metric. By the way, only ETC1S is affected by that for now.

Ideally, the settings should be more flexible:

compute metrics in linear / non-linear space (i.e. decode 8-bit sRGB to real values), not implemented yet;
set per-channel weights (Allow user to control channel weightings BinomialLLC/basis_universal#202):
- even;
- luma, should depend on the color primaries:
  - Rec. 709: [0.2126, 0.7152, 0.0722] (matches the current "perceptual")
  - Rec. 2020: [0.2627, 0.6780, 0.0593]
  - ...
- custom (e.g. to ignore unused channels).

Currently, the perceptual flag is tied to the source transfer function which is a bit misleading.

/cc @zeux

The text was updated successfully, but these errors were encountered:

zeux · 2021-04-08T22:21:30Z

By the way, only ETC1S is affected by that for now.

This should affect UASTC as well wrt mipmap generation presumably, even if the error metric used by the encoder is flat.

MarkCallow · 2021-04-09T00:48:07Z

Currently, the perceptual flag is tied to the source transfer function which is a bit misleading.

What do you want to happen? sRGB is perceptual. Linear is not. So how else to select use of the encoder's perceptual flag?

It sounds like you want options to set the per-channel weights. You want these in the ktxBasisParams struct and toktx?

What metrics are you talking about? toktx currently doesn't compute any metrics. Are you saying you want KTX-Software to expose an option to specify in which space the Basis encoder computes its error metrics?

This should affect UASTC as well wrt mipmap generation

???

Filtering for mipmap generation is done in linear space.

zeux · 2021-04-09T00:54:23Z

Filtering for mipmap generation is done in linear space.

Yes, which is controlled by --linear - or at least used to be controlled by that flag, presumably it's now controlled by a new flag.

MarkCallow · 2021-04-09T01:23:26Z

--linear, now --assign_oetf linear overrides whatever information the input image provides about its oetf. It is the information from the input image, possibly overridden, that controls whether sRGB decoding and re-encoding is done during mipmap generation.

As far as I can see, this is independent of @lexaknyazev's request.

lexaknyazev · 2021-04-09T05:15:45Z

sRGB is perceptual. Linear is not. So how else to select use of the encoder's perceptual flag?

Since the currently used perceptual weights are hard-coded to Rec. 709 in the encoder, the perceptual flag makes less sense in a case when different primaries are used.

Once the encoder starts accepting custom channel weights, KTX-Software should just set them directly based on actual primaries instead of using that perceptual flag.

Are you saying you want KTX-Software to expose an option to specify in which space the Basis encoder computes its error metrics?

Yes, once the encoder supports this option. It's not there yet.

lexaknyazev · 2021-04-10T13:31:14Z

Looking into this again, I think we do not need physically-linear metric for sRGB-encoded data, because the perceptual luma (Y') is by definition a weighted sum of non-linear values.

MarkCallow · 2022-02-20T07:43:00Z

PR #534 adds a --astc_perceptual flag for the astc encoder. Here's the doc:

The codec should optimize for perceptual error, instead of direct
RMS error. This aims to improves perceived image quality, but
typically lowers the measured PSNR score. Perceptual methods are
currently only available for normal maps and RGB color data.

Since this is not tied to sRGB inputs it seems to me different from perceptual in the BasisU encoder as the latter is tied to sRGB, so we made it astc-specific. @lexaknyazev please take a look and answer the following: is this actually similar to what you are requesting here for the BasisU encoder, minus being able to specify the weights?

Looking into this again, I think we do not need physically-linear metric for sRGB-encoded data, because the perceptual luma (Y') is by definition a weighted sum of non-linear values.

Please explain. Are you saying perceptual mode should only be applied to non-sRGB inputs?

solidpixel · 2022-08-17T21:10:10Z

There are two different aspects to perceptual metrics - chroma bias, and gamma bias.

The current --astc_perceptual mode only corrects for chroma bias, and should be valid for both sRGB and linear encodes.

The gamma bias is something I'm not yet sure how to correct for (or even if it's desirable to do so).

For sRGB textures the gamma-corrected luminance curve means that an error of X should have the same weight no matter the absolute luminance involved, so I think it's safe to ignore gamma bias.

For linear textures that store color, the significance of an error X depends on the absolute luminance you are starting with, as it is obviously not following the perceptual curve. I think the standard fix for this is using Y'CbCr for error calculations, as this can correct for both the gamma curve and the luma-chroma perceptual significance differences. However, if you care enough about gamma correcting your errors you should probably really be using sRGB textures for color data anyway, so I'm in two minds about whether this really has much value. There doesn't seem much point using a gamma corrected error metric in the compressor only to stuff the compressed data into a non-gamma corrected encoding.

MarkCallow · 2022-08-19T03:30:35Z

Ping @lexaknyazev. Please answer the questions in my comment and let us know if you have any thoughts about what @solidpixel wrote.

lexaknyazev · 2022-08-19T12:04:49Z

is this actually similar to what you are requesting here for the BasisU encoder, minus being able to specify the weights?

Yes.

Are you saying perceptual mode should only be applied to non-sRGB inputs?

Technically, this is covered by the @solidpixel's comment above. We do need configurable chroma bias everywhere, regardless of transfer functions (think raw float16 to ASTC HDR path).

Correcting the gamma bias (i.e., decoding 8-bit sRGB to linear floats before computing the error) may even produce worse final results because errors in lower values are usually more important than errors in upper values for color data.

MarkCallow added the awaiting basisu fix label Aug 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve handling of perceptual encoding metrics #392

Improve handling of perceptual encoding metrics #392

lexaknyazev commented Apr 8, 2021

zeux commented Apr 8, 2021

MarkCallow commented Apr 9, 2021

zeux commented Apr 9, 2021

MarkCallow commented Apr 9, 2021

lexaknyazev commented Apr 9, 2021

lexaknyazev commented Apr 10, 2021 •

edited

Loading

MarkCallow commented Feb 20, 2022

solidpixel commented Aug 17, 2022 •

edited

Loading

MarkCallow commented Aug 19, 2022

lexaknyazev commented Aug 19, 2022 •

edited

Loading

Improve handling of perceptual encoding metrics #392

Improve handling of perceptual encoding metrics #392

Comments

lexaknyazev commented Apr 8, 2021

zeux commented Apr 8, 2021

MarkCallow commented Apr 9, 2021

zeux commented Apr 9, 2021

MarkCallow commented Apr 9, 2021

lexaknyazev commented Apr 9, 2021

lexaknyazev commented Apr 10, 2021 • edited Loading

MarkCallow commented Feb 20, 2022

solidpixel commented Aug 17, 2022 • edited Loading

MarkCallow commented Aug 19, 2022

lexaknyazev commented Aug 19, 2022 • edited Loading

lexaknyazev commented Apr 10, 2021 •

edited

Loading

solidpixel commented Aug 17, 2022 •

edited

Loading

lexaknyazev commented Aug 19, 2022 •

edited

Loading