[Pdns-users] Re: Re: Domains with binary (e.g. UTF-8) labels

Benny Amorsen benny+usenet at amorsen.dk
Wed Dec 20 08:45:09 UTC 2006


>>>>> "DA" == Dean Anderson <dean at av8.com> writes:

DA> Futher, the parsing of dotted DNS names can't be translated
DA> to/from ascii/wire format. For example, suppose you get a DNS
DA> record with an UTF8 name in say, hebrew. When this name is
DA> translated by an ordinary resolver/dns cache/etc from wire format
DA> to dotted format, the multibyte characters are mis-interpreted as
DA> ascii.

That won't happen. In UTF-8, all multibyte characters have the high
bit set in every byte.

DA> As soon as the ascii byte code for '.' (hex 2E) is found in
DA> the bytes of a multibyte character, the byte-by-byte translation
DA> of the wire representation sequence of labels is no longer what
DA> was intended---because there is an extra dot in the name.

Therefore that won't happen either.

DA> You also can't get a meaningful interpretation of the byte-by-byte
DA> upper/lower testing of a multi-byte character.

Luckily DNS traditionally only defines upper/lower to apply to
A-Z/a-z. The rest will just be interpreted without case folding. No
problem.

DA> Obviously, then, DNS can't support the full of UTF-8 character
DA> set, unless it gives up the dotted ascii format frequently used by
DA> DNS caches. I don't think anyone has ever advocated the full UTF-8
DA> support in discussions on namedroppers. If they did, I completely
DA> missed it, as I would have said something.

The rest of the post follows from the mistaken premises above.


/Benny




More information about the Pdns-users mailing list