UTF-16 surrogate pairs intentionally not supported?

It appears that the UTF-8 encoder wasn't written to handle UTF-16 surrogate pairs. Was this intentional? If you pass a surrogate pair to the API now, you end up with 6 bytes (apparently 3 bytes for each surrogate, encoded separately) instead of the 4 bytes that UTF-8 defines for a code point above U+FFFF.

The output decodes correctly with the corresponding decoder, but standard UTF-8 decoders will not be able to decode it properly.
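
To illustrate what I mean, here is a minimal sketch in Python (not this project's code; the function name `utf16_units_to_utf8` is made up for the example). It assumes the input is a list of UTF-16 code units, combines each high/low surrogate pair into a single code point, and then emits the 4-byte UTF-8 form. How an unpaired surrogate should be handled is a design choice; this sketch simply raises an error.

```python
def utf16_units_to_utf8(units):
    """Encode a sequence of UTF-16 code units as standard UTF-8 bytes."""
    out = bytearray()
    i = 0
    while i < len(units):
        cp = units[i]
        i += 1
        is_high = 0xD800 <= cp <= 0xDBFF
        if is_high and i < len(units) and 0xDC00 <= units[i] <= 0xDFFF:
            # Combine the surrogate pair into one supplementary code point.
            cp = 0x10000 + ((cp - 0xD800) << 10) + (units[i] - 0xDC00)
            i += 1
        elif 0xD800 <= cp <= 0xDFFF:
            # Lone surrogates have no valid UTF-8 representation.
            raise ValueError("unpaired surrogate: 0x%04X" % cp)

        if cp < 0x80:
            out.append(cp)
        elif cp < 0x800:
            out.append(0xC0 | (cp >> 6))
            out.append(0x80 | (cp & 0x3F))
        elif cp < 0x10000:
            out.append(0xE0 | (cp >> 12))
            out.append(0x80 | ((cp >> 6) & 0x3F))
            out.append(0x80 | (cp & 0x3F))
        else:
            out.append(0xF0 | (cp >> 18))
            out.append(0x80 | ((cp >> 12) & 0x3F))
            out.append(0x80 | ((cp >> 6) & 0x3F))
            out.append(0x80 | (cp & 0x3F))
    return bytes(out)


# U+1F600 is the surrogate pair D83D DE00 in UTF-16.
encoded = utf16_units_to_utf8([0xD83D, 0xDE00])
print(encoded.hex())  # f09f9880 (4 bytes)
```

For comparison, encoding each surrogate on its own gives the 6 bytes `ED A0 BD ED B8 80` for the same character, which a strict decoder such as Python's `bytes.decode("utf-8")` rejects.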
