]> git.kaiwu.me - nginx.git/commit
Core: stricter UTF-8 handling in ngx_utf8_decode().
authorYugo Horie <u5.horie@gmail.com>
Wed, 22 Feb 2023 23:09:50 +0000 (08:09 +0900)
committerYugo Horie <u5.horie@gmail.com>
Wed, 22 Feb 2023 23:09:50 +0000 (08:09 +0900)
commit2c5fccd4693c0a68e1c72d65e016ba83e861120e
tree77acffd59f36faf194f97348f3d8de2cedb7a8a2
parent4ace957c4e08bcbf9ef5e9f83b8e43458bead77f
Core: stricter UTF-8 handling in ngx_utf8_decode().

An UTF-8 octet sequence cannot start with a 11111xxx byte (above 0xf8),
see https://datatracker.ietf.org/doc/html/rfc3629#section-3.  Previously,
such bytes were accepted by ngx_utf8_decode() and misinterpreted as 11110xxx
bytes (as in a 4-byte sequence).  While unlikely, this can potentially cause
issues.

Fix is to explicitly reject such bytes in ngx_utf8_decode().
src/core/ngx_string.c